Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-8313

Optimize evaluation for ExprNodeConstantEvaluator and ExprNodeNullEvaluator

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 0.12.0, 0.13.0, 0.14.0
    • 0.14.0
    • Query Processor

    Description

      Consider the following query:

      SELECT foo, bar, goo, id
      FROM myTable
      WHERE id IN ( 'A', 'B', 'C', 'D', ... , 'ZZZZZZ' );
      

      One finds that when the IN clause has several thousand elements (and the table has several million rows), the query above takes orders-of-magnitude longer to run on Hive 0.12 than say Hive 0.10.

      I have a possibly incomplete fix.

      Attachments

        1. HIVE-8313.2.patch
          3 kB
          Mithun Radhakrishnan
        2. HIVE-8313.1.patch
          2 kB
          Mithun Radhakrishnan

        Issue Links

          Activity

            People

              mithun Mithun Radhakrishnan
              mithun Mithun Radhakrishnan
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: