Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-18028 High performance S3A input stream with prefetching & caching
  3. HADOOP-18792

s3a prefetching to use split start/end options to limit prefetch range

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: In Progress
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      the bundled hadoop record readers pass the split start/end down to openFile() -these can be used by the prefetching stream as hints as to where to start and stop prefetching.

      Hints only as

      • records can extend past split end; that last record will still be read when split end < EOF
      • for formats with footers which get read, caching that footer is important

      Attachments

        Activity

          People

            stevel@apache.org Steve Loughran
            stevel@apache.org Steve Loughran
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: