Uploaded image for project: 'Apache Avro'
  1. Apache Avro
  2. AVRO-3594

FsInput to use openFile() API for cloud storage read performance

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • 1.11.2
    • None
    • java

    Description

      avro can now use the FileSystem.openFile() API to open a file on a hadoop filesystem connector (HADOOP-15229).

      by setting the file length and fadvise policy through opt() calls, the clients can

      • skip a HEAD request when opening a file
      • optimise the ranges of GET request for sequential access, even in clusters where s3a has been configured to use random iO (which some hive clusters do)

      filesystems/releases which don't recognise the options added in HADOOP-16202 will ignore them; the api will fall back to classic open(path) API call if the connector doesn't have a custom implementation.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              stevel@apache.org Steve Loughran
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1h 10m
                  1h 10m