Uploaded image for project: 'Kylin'
  1. Kylin
  2. KYLIN-3490

For single column queries, only dictionaries are enough

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • v2.5.0
    • None
    • None

    Description

      A common use case for BI tools is as follows:

      1. Firstly, extract all of the values of a dimension column
      2. Then, select part of the values as filter condition.

      Previously query for the first step requires to hit all of the segments' cuboid data, which may not be efficient, especially when the segments occupy many regions.

      To use dictionary rather than cuboid data to answer this kind of queries, will reduce the cost of many rpcs to hbase.

      Sample queries are as follows:

      select A
      from T
      group by A
      
      select distinct A
      from T
      
      select max(A)
      from T
      

      Attachments

        1. APACHE-KYLIN-3490.patch
          20 kB
          Zhong Yanghong

        Issue Links

          Activity

            People

              yaho Zhong Yanghong
              yaho Zhong Yanghong
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: