Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-4410

Fix testRankWithEmptyReduce in tez mode

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.15.0
    • tez
    • None
    • Reviewed

    Description

      testRankWithEmptyReduce added in PIG-4392 failed in tez mode. The reason is POReservoirSample produce more sample than necessary. In particular, if the input of the vertex is empty, it produces a fake tuple which does not have the original data, but a marked field plus 0 rowNum. That cause the WeightedRangePartitioner fail:

      Caused by: java.lang.ClassCastException: java.lang.String cannot be cast to java.lang.Integer
      	at org.apache.pig.backend.hadoop.HDataType.getWritableComparableTypes(HDataType.java:115)
      	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.partitioners.WeightedRangePartitioner.getPigNullableWritable(WeightedRangePartitioner.java:192)
      

      Another issue I found is GetMemNumRows, I erroneously add the size of mark tuple, which make the size estimation inaccurate. I put the fix in the same patch.

      Attachments

        1. PIG-4410-1.patch
          1 kB
          Daniel Dai

        Issue Links

          Activity

            People

              daijy Daniel Dai
              daijy Daniel Dai
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: