Hive / HIVE-1940

Query Optimization Using Column Statistics and Histograms

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None
    • Tags:
      MetaStore

      Description

      Cost-based query optimization in Hive currently relies on statistics gathered for tables and partitions. To enable further improvements, the next step is to design and implement a way to gather statistics on columns, as discussed in HIVE-33. Building on that, histograms are one possible option for collecting and using run-time statistics. Alongside the implementation of these features, a consistent storage model for the MetaStore also needs to be developed.
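
      To illustrate why per-column histograms help a cost-based optimizer, here is a minimal sketch of an equi-width histogram used to estimate the selectivity of a predicate such as `col < x`. It is not Hive code and not the design proposed in the attachments: the class `EquiWidthHistogram` and its methods are hypothetical names chosen for illustration, and the sketch assumes a non-empty numeric column with values spread uniformly within each bucket.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class EquiWidthHistogram:
    """Hypothetical per-column summary (illustration only, not a Hive structure)."""
    low: float          # smallest observed value
    high: float         # largest observed value
    counts: List[int]   # number of rows falling into each equal-width bucket

    @property
    def width(self) -> float:
        # Fall back to 1.0 when all values are identical, to avoid dividing by zero.
        return (self.high - self.low) / len(self.counts) or 1.0

    @classmethod
    def build(cls, values: Sequence[float], num_buckets: int) -> "EquiWidthHistogram":
        if not values or num_buckets < 1:
            raise ValueError("need at least one value and one bucket")
        hist = cls(min(values), max(values), [0] * num_buckets)
        for v in values:
            # Clamp so that the maximum value lands in the last bucket.
            idx = min(int((v - hist.low) / hist.width), num_buckets - 1)
            hist.counts[idx] += 1
        return hist

    def selectivity_less_than(self, x: float) -> float:
        """Estimate the fraction of rows satisfying `col < x`."""
        total = sum(self.counts)
        if x <= self.low:
            return 0.0
        if x > self.high:
            return 1.0
        position = (x - self.low) / self.width
        full = min(int(position), len(self.counts))
        rows = float(sum(self.counts[:full]))   # buckets entirely below x
        if full < len(self.counts):
            # Assume uniform spread inside the bucket that contains x.
            rows += (position - full) * self.counts[full]
        return rows / total


hist = EquiWidthHistogram.build(list(range(100)), num_buckets=10)
estimate = hist.selectivity_less_than(49.5)   # close to 0.5 for this uniform column
```

      An optimizer could multiply such an estimate by the table's row count to predict how many rows a filter produces, which table-level and partition-level statistics alone cannot provide.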

      1. Agruenheid_ideas11.pdf
        253 kB
        Carl Steinbach
      2. HiveMetaStore.pdf
        221 kB
        Anja Gruenheid

            People

            • Assignee:
              Unassigned
              Reporter:
              Anja Gruenheid
            • Votes:
              0
              Watchers:
              9
