Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-33 [Hive]: Add optimizer statistics in Hive
  3. HIVE-1648

Automatically gathering statistics when reading a table/partition

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Statistics
    • None

    Description

      HIVE-1361 introduces a new command 'ANALYZE TABLE T COMPUTE STATISTICS' to gathering stats. This requires additional scan of the data. Stats gathering can be piggy-backed on TableScanOperator whenever a table/partition is scanned (given not LIMIT operator).

      Attachments

        1. hive-1648.svn.patch
          13 kB
          He Yongqiang
        2. HIVE-1648.patch
          23 kB
          Paul Butler
        3. HIVE-1648.5.patch
          95 kB
          Paul Butler
        4. HIVE-1648.4.patch
          99 kB
          Paul Butler
        5. HIVE-1648.3.patch
          69 kB
          Paul Butler
        6. HIVE-1648.2.patch
          24 kB
          Paul Butler

        Issue Links

          Activity

            People

              paulbutler Paul Butler
              nzhang Ning Zhang
              Votes:
              1 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated: