Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-1544

Filtering out NULL-keyed rows in ReduceSinkOperator when no outer join involved

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      As discussed in HIVE-741, if a plan indicates that a non-outer join is the first operator in the reducer, the ReduceSinkOperator should filter out (not sending) rows with NULL as keys since they will not generate any results anyways. This should save both bandwidth and processing power.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              nzhang Ning Zhang
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: