Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-1414

Limit page size based on maximum row count

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.11.0
    • None

    Description

      For column index based filtering it is important to have enough pages for a column. In case of a perfectly matching encoding for the suitable data it can happen that all of the values can be encoded in one page (e.g. a column of an ascending counter).

      With this improvement we would be able to limit the pages by the maximum number of rows to be written in it so we would have enough pages for every column.

      Based on the benchmarks listed here 20k seems to be a good choice for the default value.

      Attachments

        Issue Links

          Activity

            People

              gszadovszky Gabor Szadovszky
              gszadovszky Gabor Szadovszky
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: