Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-3641

Split "otherwise" producing incorrect output when combined with ColumnPruning

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.10.0, 0.12.0, 0.11.1, 0.13.0
    • 0.12.1
    • None
    • None

    Description

      Our user was observing incorrect outputs depending on if the query had intermediate output or not. Below is a simplified testcase I came up with.

      knoguchi pig> cat test.txt
      9,1,ignored
      9,1,ignored
      9,1,ignored
      knoguchi pig> cat bz-6590644/test.pig
      A = load 'test.txt' using PigStorage(',') as (a1:int, a2:int, a3:chararray);
      B = foreach A generate a1,a2;
      SPLIT B into C1 if a2 == 1, D1 otherwise;
      C2 = foreach C1 generate a2;
      store C2 into '/tmp/testC';
      store D1 into '/tmp/testD';
      knoguchi@nameother-lm pig>
      

      Incorrect output shown below. /tmp/testD should be empty but somehow has data in it.

      knoguchi@nameother-lm pig> cat /tmp/testC/part-m-00000
      1
      1
      1
      knoguchi pig> cat /tmp/testD/part-m-00000
      9       1
      9       1
      9       1
      knoguchi pig>
      

      Attachments

        1. pig-3641_v01.patch
          5 kB
          Koji Noguchi
        2. pig-3641_v02_withe2etest.patch
          7 kB
          Koji Noguchi

        Activity

          People

            knoguchi Koji Noguchi
            knoguchi Koji Noguchi
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: