Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-2480

Clarify what "page index" means in Parquet.thrift

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • format-2.11.0
    • None
    • None

    Description

      I have always found it very confusing that people refer to the term "page index" when referring to parquet, for example https://lists.apache.org/thread/o9nxbmv1z4hph3v5s2z63jsklywpkyyj

      However, the term "page index" is not used in the the parquet thirft file https://github.com/apache/parquet-format/blob/master/src/main/thrift/parquet.thrift

      The term does appears as the name of the file that describes the `ColumnIndex` spec.

      https://github.com/apache/parquet-format/blob/master/PageIndex.md

       

      I would like to clarify that ColumnIndex is the implementation of the Page index concept

       

      Attachments

        Issue Links

          Activity

            People

              alamb Andrew Lamb
              alamb Andrew Lamb
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: