  1. Kylin
  2. KYLIN-3490

For single-column queries, dictionaries alone are enough

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: v2.5.0
    • Component/s: None
    • Labels:
      None

      Description

      A common use case for BI tools is as follows:

      1. First, extract all of the values of a dimension column.
      2. Then, select some of those values as a filter condition.

      Previously, the query for the first step had to hit the cuboid data of every segment, which can be inefficient, especially when the segments occupy many regions.

      Answering this kind of query from the dictionaries rather than from cuboid data avoids the cost of many RPCs to HBase.

      Sample queries are as follows:

      select A
      from T
      group by A
      
      select distinct A
      from T
      
      select max(A)
      from T
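
The idea behind these sample queries can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions, not the code in the attached patch: each segment is modeled as a hypothetical mapping from column name to the list of values in that column's dictionary, the query has no filter, and every dictionary value is assumed to actually occur in its segment. The names `distinct_values` and `max_value` are made up for this sketch.

```python
from typing import Dict, List, Optional

# Hypothetical model: one entry per segment, mapping a dimension column
# name to the values stored in that column's dictionary.
SegmentDictionaries = Dict[str, List[str]]


def distinct_values(segments: List[SegmentDictionaries], column: str) -> List[str]:
    """Answer `select distinct A from T` (or `group by A`) by merging the
    per-segment dictionaries instead of scanning cuboid rows."""
    merged = set()
    for segment in segments:
        merged.update(segment.get(column, []))
    return sorted(merged)


def max_value(segments: List[SegmentDictionaries], column: str) -> Optional[str]:
    """Answer `select max(A) from T` from the merged dictionary values.
    Returns None when no segment has a dictionary for the column."""
    values = distinct_values(segments, column)
    return max(values) if values else None


if __name__ == "__main__":
    segments = [{"A": ["apple", "banana"]}, {"A": ["banana", "cherry"]}]
    print(distinct_values(segments, "A"))
    print(max_value(segments, "A"))
```

In this sketch the work is proportional to the number of distinct values per segment rather than to the number of cuboid rows, which is why the approach suits the "list all values of a dimension" step.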
      

        Attachments

        1. APACHE-KYLIN-3490.patch
          20 kB
          Zhong Yanghong

              People

              • Assignee:
                yaho Zhong Yanghong
                Reporter:
                yaho Zhong Yanghong
              • Votes:
                0
                Watchers:
                4