
    Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.3.0
    • Fix Version/s: 2.3.0
    • Component/s: Optimizer
    • Labels: None

      Description

      An equi-height histogram is effective for handling skewed data distributions.

      In an equi-height histogram, all bins (intervals) have the same height, i.e. each bin holds roughly the same number of rows. The default number of bins is 254.

      We currently use a two-step method to generate an equi-height histogram:
      1. Use percentile_approx to get the percentiles (the end points of the equi-height bin intervals).
      2. Use a new aggregate function to get the distinct count in each of these bins.

      Note that this method requires two table scans. In the future we may provide other algorithms that need only one table scan.
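
The two-step method can be illustrated with a small pure-Python sketch. This is not the Spark implementation: the function names `equi_height_endpoints` and `distinct_counts_per_bin` are hypothetical, exact percentiles over a sorted copy stand in for percentile_approx, and the bin convention (the first bin includes its lower end point, later bins are open on the left) is an assumption made for the example.

```python
from bisect import bisect_left


def equi_height_endpoints(values, num_bins):
    """Step 1 (first pass over the data): end points of the bin intervals.

    Exact percentiles are used here as a stand-in for percentile_approx.
    """
    ordered = sorted(values)
    n = len(ordered)
    # End point i is the value at the (i / num_bins) percentile;
    # the index is clamped so the last end point is the maximum value.
    return [ordered[min(n - 1, (i * n) // num_bins)] for i in range(num_bins + 1)]


def distinct_counts_per_bin(values, endpoints):
    """Step 2 (second pass over the data): distinct values in each bin.

    Assumed convention: bin i covers (endpoints[i], endpoints[i + 1]],
    except that bin 0 also includes its lower end point.
    """
    num_bins = len(endpoints) - 1
    seen = [set() for _ in range(num_bins)]
    for v in values:
        # First end point (from index 1 on) that is >= v closes v's bin.
        j = min(bisect_left(endpoints, v, 1), num_bins)
        seen[j - 1].add(v)
    return [len(s) for s in seen]


# Skewed sample: the value 1 makes up a third of the rows.
values = [1, 1, 1, 1, 2, 2, 3, 4, 5, 6, 8, 10]
endpoints = equi_height_endpoints(values, 4)
ndv = distinct_counts_per_bin(values, endpoints)
print(endpoints)  # [1, 1, 3, 6, 10]
print(ndv)        # [1, 2, 3, 2]
```

In this sample the heavily repeated value 1 gets a bin of its own (both end points are 1), which is how the equi-height layout absorbs skew. The two functions each traverse the data once, mirroring the two table scans mentioned above.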

              People

              • Assignee: Zhenhua Wang (ZenWzh)
              • Reporter: Ron Hu (ron8hu)
              • Votes: 0
              • Watchers: 10