Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-7395

Partial Partition By to CTAS Parquet files

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.16.0
    • None
    • Storage - Parquet
    • None

    Description

      In the case of a data set with few value are prevailing while most have weak occurrences, it will be useful to have the abilities to create Parquet with a partial PARTITION BY.

      It would then be possible to group all the small occurrences together without being "impacted" by the "too" common values.

      It's not exactly the same, but it exists partial index on some database (https://www.postgresql.org/docs/current/indexes-partial.html)

      Attachments

        Activity

          People

            Unassigned Unassigned
            benj641 benj
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: