Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
I have always found it very confusing that people refer to the term "page index" when referring to parquet, for example https://lists.apache.org/thread/o9nxbmv1z4hph3v5s2z63jsklywpkyyj
However, the term "page index" is not used in the the parquet thirft file https://github.com/apache/parquet-format/blob/master/src/main/thrift/parquet.thrift
The term does appears as the name of the file that describes the `ColumnIndex` spec.
https://github.com/apache/parquet-format/blob/master/PageIndex.md
I would like to clarify that ColumnIndex is the implementation of the Page index concept
Attachments
Issue Links
- links to