Hadoop YARN / YARN-10444

Node Manager to use openFile() with whole-file read policy for localizing files.

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 3.3.0
    • Fix Version/s: 3.3.5
    • Component/s: nodemanager
    • Labels: None

    Description

      HADOOP-16202 adds standard options for declaring the read/seek
      policy when reading a file. These should be set to sequential IO
      when localising resources, so that if the default/cluster settings
      for a file system are optimized for random IO, artifact downloads
      are still read at the maximum speed possible (one big GET to the EOF).

      Most of this happens in hadoop-common, but some tuning of FSDownload
      can assist:

      • tar/jar download must also be sequential
      • if the FileStatus is passed around, that can be used
        in the open request to skip checks when loading the file.

      Together this can save 3 HEAD requests per resource, with the sequential
      IO avoiding any splitting of the big read into separate block GETs.
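
      As a rough illustration of the two points above, the following sketch
      opens a file through the openFile() builder, asks for the whole-file
      read policy, and hands over a FileStatus the caller already holds.
      It is not the FSDownload change itself: the class name
      WholeFileOpenSketch and the download() helper are hypothetical, and
      the option key and value are written as string literals on the
      assumption that they match the names standardized by HADOOP-16202.
      Options set with opt() are hints, so a filesystem that does not
      recognize them is expected to ignore them.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IOUtils;
import org.apache.hadoop.util.functional.FutureIO;

/** Hypothetical sketch; not the actual FSDownload code. */
public class WholeFileOpenSketch {

  /**
   * Copy the file described by {@code status} to a local destination,
   * reading it from start to end in one pass.
   */
  static long download(FileSystem fs, FileStatus status,
      java.nio.file.Path dest) throws IOException {
    try (FSDataInputStream in = FutureIO.awaitFuture(
             fs.openFile(status.getPath())
                 // Hint (assumed key/value): the file is read in full.
                 .opt("fs.option.openfile.read.policy", "whole-file")
                 // Reuse the status so the store need not probe the file again.
                 .withFileStatus(status)
                 .build());
         OutputStream out = Files.newOutputStream(dest)) {
      // Plain sequential copy; both streams are closed by try-with-resources.
      IOUtils.copyBytes(in, out, 8192, false);
    }
    return Files.size(dest);
  }

  public static void main(String[] args) throws IOException {
    Path source = new Path(args[0]);
    FileSystem fs = source.getFileSystem(new Configuration());
    // One status lookup up front; the result is passed into the open call.
    FileStatus status = fs.getFileStatus(source);
    long copied = download(fs, status, Paths.get(args[1]));
    System.out.println("copied " + copied + " bytes");
  }
}
```

      The sketch needs hadoop-common on the classpath. Whether the hint and
      the supplied status actually reduce the number of requests depends on
      the filesystem connector in use.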

            People

              Assignee: Steve Loughran (stevel@apache.org)
              Reporter: Steve Loughran (stevel@apache.org)