[Rust][Parquet] move column chunk range logic to metadata.rs

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.0.0
    • Component/s: Rust

    Description

      Getting the byte range of a column chunk inside a Parquet file can be useful for external crates (for instance, if they want to pre-fetch the columns), but computing it is not completely obvious: a look at [1] and [2] is enough to see that things can quickly get messy.

      I think it would be nice to move this logic into the metadata definition rather than leave it buried in the middle of the reader implementation.

      [1] https://stackoverflow.com/questions/55225108/why-is-dictionary-page-offset-0-for-plain-dictionary-encoding/
      [2] https://issues.apache.org/jira/browse/PARQUET-816
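
      As an illustration of the logic in question, here is a minimal sketch, not the crate's actual code. `ChunkOffsets` is a hypothetical stand-in for the offsets stored in a column chunk's metadata. The range is assumed to start at the dictionary page when one is present and otherwise at the first data page, and to span the total compressed size. Treating a dictionary offset of 0 (or one that does not precede the data page) as absent is a defensive assumption suggested by the title of [1].

```rust
/// Hypothetical, simplified stand-in for the offsets stored in a column
/// chunk's metadata. This is NOT the parquet crate's actual type.
#[derive(Debug, Clone, Copy)]
pub struct ChunkOffsets {
    /// File offset of the first data page.
    pub data_page_offset: i64,
    /// File offset of the dictionary page, if the writer recorded one.
    pub dictionary_page_offset: Option<i64>,
    /// Total compressed size of all pages in the chunk, in bytes.
    pub total_compressed_size: i64,
}

impl ChunkOffsets {
    /// Returns `(start, length)` in bytes of the column chunk within the file.
    pub fn byte_range(&self) -> (u64, u64) {
        let start = match self.dictionary_page_offset {
            // Assumption: only trust a dictionary offset that is positive
            // and located before the first data page.
            Some(dict) if dict > 0 && dict < self.data_page_offset => dict,
            _ => self.data_page_offset,
        };
        assert!(
            start >= 0 && self.total_compressed_size >= 0,
            "column start and length should not be negative"
        );
        (start as u64, self.total_compressed_size as u64)
    }
}
```

      An external crate could then pre-fetch exactly the bytes `start..start + length` for each column chunk it needs, without duplicating this decision in its own reader code.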

            People

              Assignee: rdettai Rémi Dettai
              Reporter: rdettai Rémi Dettai
              Votes: 0
              Watchers: 3

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 1h