PHOENIX-3560 (sub-task of PHOENIX-3718: Provide user-level documentation for column encoding feature)

Aggregate query performance is worse with encoded columns for schema with large number of columns


Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Information Provided
    • Affects Version/s: None
    • Fix Version/s: 4.11.0
    • Component/s: None
    • Labels: None

    Description

      Schema with 5K columns

      create table (k1 integer, k2 integer, c1 varchar ... c5000 varchar CONSTRAINT PK PRIMARY KEY (K1, K2)) 
      VERSIONS=1, MULTI_TENANT=true, IMMUTABLE_ROWS=true
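
The DDL above elides the table name and columns c2 through c4999. A minimal sketch of how the full statement could be generated for both variants of the test is shown here. The class name, method name, and table names are hypothetical, and the use of `COLUMN_ENCODED_BYTES=0` to create the non-encoded variant is an assumption: the issue does not say how the two tables were created. The code only builds the SQL string; it does not connect to Phoenix.

```java
public class WideTableDdl {

    /**
     * Builds a CREATE TABLE statement with key columns k1, k2 and
     * VARCHAR columns c1..cN, mirroring the schema in the description.
     * When encoded is false, COLUMN_ENCODED_BYTES=0 is appended
     * (assumption: this is how the non-encoded table was created).
     */
    static String buildDdl(String tableName, int columnCount, boolean encoded) {
        StringBuilder sql = new StringBuilder();
        sql.append("CREATE TABLE ").append(tableName)
           .append(" (k1 integer, k2 integer");
        for (int i = 1; i <= columnCount; i++) {
            sql.append(", c").append(i).append(" varchar");
        }
        sql.append(" CONSTRAINT PK PRIMARY KEY (k1, k2))");
        sql.append(" VERSIONS=1, MULTI_TENANT=true, IMMUTABLE_ROWS=true");
        if (!encoded) {
            // Assumed way to switch column encoding off for the baseline table.
            sql.append(", COLUMN_ENCODED_BYTES=0");
        }
        return sql.toString();
    }

    public static void main(String[] args) {
        // Hypothetical table names for the two variants.
        String encodedDdl = buildDdl("WIDE_ENCODED", 5000, true);
        String plainDdl = buildDdl("WIDE_NON_ENCODED", 5000, false);
        System.out.println(encodedDdl.length() + " characters of DDL (encoded)");
        System.out.println(plainDdl.length() + " characters of DDL (non-encoded)");
    }
}
```

Generating the statement programmatically keeps the two tables identical apart from the encoding option, so any difference in query time can be attributed to that option alone.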
      

      In this test no column is null and each column holds 200 characters, i.e. about 1 MB of data per row (5,000 columns × 200 characters).

      A COUNT(*) aggregation is about 5x slower on the table with encoded columns than on a table with non-encoded columns that uses the same schema.
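
One way such a comparison could be measured over JDBC is sketched below. This is not the harness used for the reported numbers: the class and method names are hypothetical, the table name is supplied by the caller, and obtaining a `Connection` (for example through the Phoenix JDBC driver) is left outside the sketch. Only standard `java.sql` calls are used.

```java
import java.sql.Connection;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

public class CountStarTiming {

    /**
     * Runs SELECT COUNT(*) against the given table and returns the
     * elapsed wall-clock time in nanoseconds.
     */
    static long timeCountStar(Connection conn, String tableName) throws SQLException {
        long start = System.nanoTime();
        try (Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery("SELECT COUNT(*) FROM " + tableName)) {
            rs.next(); // fetch the single result row so the scan completes
        }
        return System.nanoTime() - start;
    }

    /** Ratio of encoded to non-encoded time; values above 1.0 mean encoded is slower. */
    static double slowdown(long encodedNanos, long nonEncodedNanos) {
        return (double) encodedNanos / nonEncodedNanos;
    }
}
```

With both tables loaded with the same data, `slowdown(timeCountStar(conn, encodedTable), timeCountStar(conn, nonEncodedTable))` gives the factor described above. A single run is noisy, so repeating each query several times and comparing medians would be a more careful measurement.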

      Attachments

        1. DataGenerator.java
          3 kB
          Mujtaba Chohan
        2. PHOENIX-3565.patch
          1 kB
          Samarth Jain


          People

            Assignee: Samarth Jain (samarthjain)
            Reporter: Mujtaba Chohan (mujtabachohan)
