Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-4059 Pig on Spark
  3. PIG-4842

Collected group doesn't work in some cases

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • spark-branch
    • spark
    • None

    Description

      Scenario:
      1. input data:
      cat collectedgroup1

      1
      1
      2
      

      2. pig script:

      A = LOAD 'collectedgroup1' USING myudfs.DummyCollectableLoader() AS (id);
      B = GROUP A by $0 USING 'collected';
      C = GROUP B by $0 USING 'collected';
      DUMP C;
      

      The expected output:

      (1,{(1,{(1),(1)})})
      (2,{(2,{(2)})})
      

      The actual output:

      (1,{(1,{(1),(1)})})
      (1,)
      (2,{(2,{(2)})})
      

      Attachments

        1. PIG-4842-2.patch
          1 kB
          Xianda Ke
        2. PIG-4842.patch
          2 kB
          Xianda Ke

        Issue Links

          Activity

            People

              kexianda Xianda Ke
              kexianda Xianda Ke
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: