Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-2004

Configuring a Map-Reduce job with dynamic input in case of a LIMIT query that does not contain an ORDER BY

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 0.8.0
    • None
    • Query Processor

    Description

      Refer to JIRA 1928 - https://issues.apache.org/jira/browse/MAPREDUCE-1928
      The JIRA proposed the option of adding input on the fly to a job that has been submitted and may as well be in a running stage. The JIRA was implemented on Hadoop-20.2 version.
      With the support for such a feature in Hadoop ( after application of the patch ), Hive can use the feature to optimize LIMIT queries that do not have an ORDER BY. For each query that qualifies to be of this kind, Hive needs to set appropriate parameters in the corresponding JobConf instance that gets created. The JobConf must have the attribute "dynamic.job" set to true and should have an appropriate InputProvider set. The input provider for optimizing the LIMIT query has been provided as part of the patch on Hadoop.

      Attachments

        Activity

          People

            jsensarma Joydeep Sen Sarma
            ramang Raman Grover
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:

              Time Tracking

                Estimated:
                Original Estimate - 168h
                168h
                Remaining:
                Remaining Estimate - 168h
                168h
                Logged:
                Time Spent - Not Specified
                Not Specified