Uploaded image for project: 'Kylin'
  1. Kylin
  2. KYLIN-5794

Add Spark parquet footer read cache

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Duplicate
    • None
    • None
    • None
    • None

    Description

      Optimization Proposal

      In Kylin, the index storage is in Parquet format. When querying, if Spark reads Parquet data, it needs to read the Footer information first. In cases where there are many columns, reading the Footer can consume a considerable amount of time. Therefore, caching the Footer information can improve query performance.

      Attachments

        Issue Links

          Activity

            People

              pfzhan Pengfei Zhan
              Jueyi Zhimin Wu
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: