Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-7547

For distinct queries use dictionary encoded page instead of reading all data

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Backend
    • ghx-label-7

    Description

      When dictionary encoding is in use the lookup table should contain a distinct list of all values in the data, can skip reading the values and just read the header to get distinct values.

       

      Realize this would be a big change to the read/scanner threads but could greatly speed up distinct queries.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              peter.ebert Peter Ebert
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated: