Pig / PIG-4059 Pig on Spark / PIG-4842

Collected group doesn't work in some cases


    Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: spark-branch
    • Component/s: spark
    • Labels: None

      Description

      Scenario:
      1. Input data:
      cat collectedgroup1

      1
      1
      2
      

      2. Pig script:

      A = LOAD 'collectedgroup1' USING myudfs.DummyCollectableLoader() AS (id);
      B = GROUP A by $0 USING 'collected';
      C = GROUP B by $0 USING 'collected';
      DUMP C;
      

      The expected output:

      (1,{(1,{(1),(1)})})
      (2,{(2,{(2)})})
      

      The actual output (note the extra (1,) tuple with an empty bag):

      (1,{(1,{(1),(1)})})
      (1,)
      (2,{(2,{(2)})})
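
The expected output above can be reproduced with a small stand-alone sketch. This is not Pig's or Spark's implementation; it only models the assumed semantics of a collected group (records with the same key arrive contiguously, so one pass that groups consecutive equal keys is enough). The helper names `collected_group` and `render` are hypothetical and exist only for this illustration.

```python
from itertools import groupby


def collected_group(records):
    """Group consecutive records by their first field.

    Assumption: input is already 'collected', i.e. all records for a
    given key are adjacent, so no shuffle or sort is required.
    Returns a list of (key, bag) tuples, where bag is a list of records.
    """
    return [(k, list(g)) for k, g in groupby(records, key=lambda t: t[0])]


def render(value):
    """Render tuples as (...) and bags (lists) as {...}, Pig DUMP style."""
    if isinstance(value, tuple):
        return "(" + ",".join(render(v) for v in value) + ")"
    if isinstance(value, list):
        return "{" + ",".join(render(v) for v in value) + "}"
    return str(value)


# Relation A: the three rows of 'collectedgroup1'.
a = [(1,), (1,), (2,)]
b = collected_group(a)   # B = GROUP A by $0 USING 'collected';
c = collected_group(b)   # C = GROUP B by $0 USING 'collected';

for row in c:
    print(render(row))
```

Under these assumptions the second grouping yields exactly one tuple per key, so a result such as (1,) with no bag would not be produced by a correct collected group.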
      

        Attachments

        1. PIG-4842-2.patch
          1 kB
          Xianda Ke
        2. PIG-4842.patch
          2 kB
          Xianda Ke


              People

              • Assignee: Xianda Ke (kexianda)
              • Reporter: Xianda Ke (kexianda)