Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-4001

POPartialAgg aggregates too aggressively when multiple values aggregated

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 0.13.0
    • None
    • None
    • None

    Description

      POPartialAgg.aggregateRawRow() is counting values in numEntriesInTarget, but all the values that numEntriesInTarget is compared to count tuples.

      This is similar to PIG-3649 but has less of a performance impact, as it generally does not cause map-side aggregation to be disabled entirely (since aggregateRawRow() is unlikely to be called before checkSizeReduction(); however, it can cause second-level aggregation to be run too often.

      Attachments

        1. PIG-4001.patch
          0.7 kB
          Travis Woodruff

        Activity

          People

            tmwoodruff Travis Woodruff
            tmwoodruff Travis Woodruff
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: