  1. Hive
  2. HIVE-3972

Support using multiple reducers for fetching "order by" results

Details

    • Type: Improvement
    • Status: Patch Available
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Query Processor
    • Labels: None

    Description

      Queries that fetch results and end with an "order by" clause make the final MR job run with a single reducer, which can be too much work for one reducer. For example,

      select value, sum(key) as sum from src group by value order by sum;
      

      If the number of reducers is reasonable, the multiple result files could be merged into a single sorted stream at the fetcher level.
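
      The merge step described above can be illustrated with a k-way merge. This is only a minimal sketch in Python, not the code from the attached patches: it assumes each reducer has already written its own result file sorted by sum, and the row layout, sample data, and the function name merge_sorted_outputs are hypothetical.

```python
import heapq
from typing import Iterable, Iterator, List, Tuple

# (value, sum), mirroring the columns of the example query (assumed layout).
Row = Tuple[str, int]


def merge_sorted_outputs(reducer_outputs: List[Iterable[Row]]) -> Iterator[Row]:
    """Lazily merge per-reducer row streams into one stream ordered by sum.

    Each input stream must already be sorted by sum; the merge holds only
    one pending row per stream at a time.
    """
    return heapq.merge(*reducer_outputs, key=lambda row: row[1])


# Hypothetical contents of three reducer result files, each sorted by sum.
reducer_outputs = [
    [("val_a", 1), ("val_d", 7), ("val_g", 12)],
    [("val_b", 3), ("val_e", 8)],
    [("val_c", 5), ("val_f", 9), ("val_h", 20)],
]

merged = list(merge_sorted_outputs(reducer_outputs))
```

      Because the merge reads one row at a time from each file, its memory use grows with the number of reducers rather than with the size of the result, which is why a "reasonable" reducer count matters.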

      Attachments

        1. HIVE-3972.D8349.4.patch
          66 kB
          Phabricator
        2. HIVE-3972.D8349.3.patch
          66 kB
          Phabricator
        3. HIVE-3972.D8349.2.patch
          54 kB
          Phabricator
        4. HIVE-3972.D8349.1.patch
          53 kB
          Phabricator
        5. HIVE-3972.9.patch.txt
          37 kB
          Navis Ryu
        6. HIVE-3972.8.patch.txt
          37 kB
          Navis Ryu
        7. HIVE-3972.10.patch.txt
          73 kB
          Navis Ryu
        8. D8349.7.patch
          69 kB
          Phabricator
        9. D8349.6.patch
          69 kB
          Phabricator
        10. D8349.5.patch
          68 kB
          Phabricator

          People

            Assignee: Navis Ryu (navis)
            Reporter: Navis Ryu (navis)
            Votes: 0
            Watchers: 6

            Dates

              Created:
              Updated: