Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-4057

Group All followed by CROSS with default parallelism produces wrong results

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.14.0
    • None
    • None
    • Reviewed

    Description

      SET default_parallel 199;
      ......
      by_size = ...
      uniq_vals = .....
      grpd = group uniq_vals all;
      all_vals = FOREACH grpd GENERATE uniq_vals;
      cross_result = CROSS by_size, all_vals;
      store cross_result into '/tmp/roh/cross/out/recipient_asns';

      Job1: grpd, all_vals, cross_result (The plan does GFCross function here for
      all_vals assuming cross parallelism to be 1 taking that of the current job even
      though it should consider default parallelism 199 of Job 2. Parallelism of Job1
      is 1 because of group all)
      Job2: cross_result (Actual CROSS of by_size and all_vals)

      Attachments

        1. PIG-4057-8.patch
          1 kB
          Daniel Dai
        2. PIG-4057-7.patch
          0.8 kB
          Daniel Dai
        3. PIG-4057-6.patch
          48 kB
          Daniel Dai
        4. PIG-4057-5.patch
          46 kB
          Daniel Dai
        5. PIG-4057-4.patch
          45 kB
          Daniel Dai
        6. PIG-4057-3.patch
          45 kB
          Daniel Dai
        7. PIG-4057-2.patch
          45 kB
          Daniel Dai
        8. PIG-4057-1.patch
          42 kB
          Daniel Dai

        Activity

          People

            daijy Daniel Dai
            rohini Rohini Palaniswamy
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: