Pig / PIG-1709

Skewed join uses fewer reducers for an extremely large key


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8.0
    • Fix Version/s: 0.8.0
    • Component/s: impl
    • Labels: None
    • Hadoop Flags: Reviewed
    • Release Note: In skewed join, if one large key requires more reducers than available, we give it all the available reducers.

    Description

      In skewed join, we use PartitionSkewedKeys to calculate the number of reducers needed for a single key. If the result is larger than the total number of reducers, it wraps around the reducer count. For example, if Pig calculates that we need 12 reducers to hold a key in memory and the job has 10 reducers in total, we allocate only 2 reducers to this key. We should use all 10 reducers in this case.
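      The arithmetic in the example can be sketched as follows. This is a minimal illustration, not the code in PartitionSkewedKeys or in the attached patch: the class and method names are hypothetical, and the assumption that the old behaviour amounts to a modulo wraparound is inferred only from the 12-to-2 example above.

```java
// Hypothetical sketch of the reducer-count arithmetic described in this issue.
// None of these names come from the Pig code base.
final class SkewedReducerSketch {

    // Assumed old behaviour: the required count wraps around the total,
    // so a key that needs more reducers than exist ends up with very few.
    static int wrappedReducers(int required, int total) {
        return required % total;
    }

    // Requested behaviour: a key that needs more reducers than are
    // available simply gets all of them.
    static int cappedReducers(int required, int total) {
        return Math.min(required, total);
    }

    public static void main(String[] args) {
        int required = 12; // reducers needed to hold the key in memory
        int total = 10;    // reducers configured for the job

        System.out.println("wrapped: " + wrappedReducers(required, total)); // prints "wrapped: 2"
        System.out.println("capped:  " + cappedReducers(required, total));  // prints "capped:  10"
    }
}
```

      When the required count fits within the total (say 4 of 10), both functions return the same value, so capping only changes the outcome for keys that exceed the total.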

      Attachments

        1. PIG-1709-1.patch
          3 kB
          Daniel Dai


          People

            Assignee: Daniel Dai (daijy)
            Reporter: Daniel Dai (daijy)
            Votes: 0
            Watchers: 0

            Dates

              Created:
              Updated:
              Resolved: