Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-1820

[C++] Use a column filter hint to inform read prefetching in Arrow reads

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

      Description

      As a follow up to PARQUET-1698 and ARROW-7995, we should use the I/O coalescing facility (where available and enabled), in combination with a column filter hint, to compute and prefetch the exact byte ranges we will be reading (using the metadata). This should further improve performance on remote object stores like Amazon S3.

        Attachments

        Issue Links

          Activity

            People

            • Assignee:
              lidavidm David Li
              Reporter:
              lidavidm David Li

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 10h 40m
                10h 40m

                  Issue deployment