Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-16274

Support tuning of NDV of columns using lower/upper bounds

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.1.0
    • 2.3.0
    • None

    Description

      For partitioned tables, the distinct value (nDV) estimate for a column is by default set to the largest nDV value in any of the partitions being considered, which is a lower bound on the nDV estimate.
      This provides a config setting to allow the estimate to a specified fraction (0.0 - 1.0) of the higher bound on the nDV estimate (the sum of all the nDVs in all partitions).

      Attachments

        1. HIVE-16274.01.patch
          29 kB
          Pengcheng Xiong
        2. HIVE-16274.02.patch
          31 kB
          Pengcheng Xiong

        Issue Links

          Activity

            People

              pxiong Pengcheng Xiong
              pxiong Pengcheng Xiong
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: