Details

    • Type: Sub-task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.1.0
    • Fix Version/s: None
    • Component/s: fs/s3

    Description

      The default skip() reads and discards every byte, no matter how far ahead the skip is. This is very inefficient when the skip() is done on an S3A stream in random IO mode, though it is less clear exactly what to do in sequential mode.
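
      To illustrate the read-and-discard cost, here is a minimal sketch using only java.io.InputStream. The classes DefaultSkipDemo and CountingStream are hypothetical stand-ins invented for this example, not S3A code. The stream counts every byte it is asked to produce, so the count after an inherited skip() shows how much data was actually pulled through.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;

public class DefaultSkipDemo {

    /** Hypothetical stand-in for a remote object stream; counts every byte actually read. */
    static class CountingStream extends InputStream {
        private final long length;
        private long pos = 0;
        long bytesRead = 0;

        CountingStream(long length) {
            this.length = length;
        }

        @Override
        public int read() {
            if (pos >= length) {
                return -1;          // end of the simulated object
            }
            pos++;
            bytesRead++;            // each byte "downloaded" is counted here
            return 0;
        }
    }

    /** Calls the inherited InputStream.skip(n) and returns how many bytes were read to satisfy it. */
    static long bytesReadBySkip(long length, long n) {
        try (CountingStream in = new CountingStream(length)) {
            in.skip(n);             // not overridden: falls back to InputStream.skip()
            return in.bytesRead;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Skip 512 KiB into a 1 MiB simulated object and report the bytes read.
        System.out.println(bytesReadBySkip(1L << 20, 512L * 1024));
    }
}
```

      In this sketch the number of bytes read equals the number of bytes skipped, which is the cost the description refers to when the underlying stream is a remote GET.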

      Proposed:

      • add an optimized version of S3AInputStream.skip() which does a lazy seek, which itself will decide when to skip() vs. issue a new GET.
      • add some more instrumentation to measure how often this gets used
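
      The proposal above could be sketched roughly as follows. This is an illustration under stated assumptions, not the actual S3AInputStream implementation: the class LazySkipSketch, the forwardLimit threshold, and the counter names are all hypothetical, and an in-memory byte array stands in for the remote object, with each new ByteArrayInputStream playing the role of a GET. skip() only moves the target position; the decision between draining bytes from the open stream and reopening at the new offset is deferred to the next read().

```java
import java.io.ByteArrayInputStream;

public class LazySkipSketch {
    private final byte[] object;          // in-memory stand-in for the remote object
    private final long forwardLimit;      // hypothetical threshold: max bytes worth draining
    private ByteArrayInputStream wrapped; // currently open "GET" stream, or null if none
    private long streamPos;               // offset the open stream is positioned at
    private long nextReadPos;             // offset the next read() must come from

    // Hypothetical instrumentation counters.
    long skipCalls;
    long bytesDrained;
    long getsIssued;

    LazySkipSketch(byte[] object, long forwardLimit) {
        this.object = object;
        this.forwardLimit = forwardLimit;
    }

    /** Lazy skip: only the target position moves; no bytes are read here. */
    public long skip(long n) {
        if (n <= 0) {
            return 0;
        }
        long skipped = Math.min(n, object.length - nextReadPos);
        nextReadPos += skipped;
        skipCalls++;
        return skipped;
    }

    public int read() {
        if (nextReadPos >= object.length) {
            return -1;
        }
        lazySeek();
        int b = wrapped.read();
        streamPos++;
        nextReadPos++;
        return b;
    }

    /** Decides, at read time, whether to drain the open stream or reopen at the target. */
    private void lazySeek() {
        long diff = nextReadPos - streamPos;
        if (wrapped != null && diff == 0) {
            return;                                   // already in position
        }
        if (wrapped != null && diff > 0 && diff <= forwardLimit) {
            wrapped.skip(diff);                       // short forward hop: drain in-stream
            bytesDrained += diff;
            streamPos = nextReadPos;
            return;
        }
        // Long hop, backward move, or no stream yet: open a new "GET" at the target offset.
        int offset = (int) nextReadPos;
        wrapped = new ByteArrayInputStream(object, offset, object.length - offset);
        streamPos = nextReadPos;
        getsIssued++;
    }

    public static void main(String[] args) {
        byte[] data = new byte[100];
        for (int i = 0; i < data.length; i++) {
            data[i] = (byte) i;
        }
        LazySkipSketch in = new LazySkipSketch(data, 16);
        in.read();       // first read opens the stream
        in.skip(5);      // short skip: drained from the open stream on next read
        in.read();
        in.skip(50);     // long skip: reopened at the new offset on next read
        in.read();
        System.out.println("skips=" + in.skipCalls
            + " drained=" + in.bytesDrained + " gets=" + in.getsIssued);
    }
}
```

      Deferring the decision to read time means several consecutive skip() or seek calls collapse into a single positioning step, and the counters give a rough measure of how often each path is taken.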

            People

              Assignee: Ahmar Suhail (ahmar)
              Reporter: Steve Loughran (stevel@apache.org)
              Votes: 0
              Watchers: 7

              Dates

                Created:
                Updated:

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 4h 20m