Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-26072

Enable vectorization for stats gathering (tablescan op)

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Hive

    Description

      https://issues.apache.org/jira/browse/HIVE-24510 enabled vectorization for compute_bit_vector.

      But tablescan operator for stats gathering is disabled by default.

      https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/Vectorizer.java#L2577

      Need to enable vectorization for this. This can significantly reduce runtimes for analyze statements for large tables.

      Attachments

        Activity

          People

            ayushtkn Ayush Saxena
            rajesh.balamohan Rajesh Balamohan
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 1h 40m
                1h 40m