  1. Hadoop Common
  2. HADOOP-6870

Add FileSystem#listLocatedStatus to list a directory's content together with each file's block locations


Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.22.0
    • Fix Version/s: 0.22.0
    • Component/s: fs
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      This JIRA implements the new FileSystem API proposed in HDFS-202. The new API aims to eliminate the individual "getFileBlockLocations" calls to the NameNode (NN) for each file in a job's input directory. Instead, each file's block locations are returned together with its FileStatus when the directory is listed, which improves getSplits performance.
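
The saving described above can be illustrated with a small, self-contained sketch. It does not use Hadoop: `ToyNameNode`, `ListingSketch`, and all of their methods are hypothetical stand-ins that only count round trips, and the method names merely echo the calls named in this issue. It assumes that a plain listing and a located listing each cost one round trip in the toy model, and that every per-file location lookup costs one more.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical stand-in for a NameNode that counts round trips. Not Hadoop code. */
final class ToyNameNode {
    private final Map<String, List<String>> blockHostsByFile = new LinkedHashMap<>();
    private int rpcCount = 0;

    void addFile(String name, List<String> hosts) {
        blockHostsByFile.put(name, hosts);
    }

    /** One round trip: file names only, without block locations. */
    List<String> listStatus() {
        rpcCount++;
        return new ArrayList<>(blockHostsByFile.keySet());
    }

    /** One round trip per file: the hosts holding that file's blocks. */
    List<String> getFileBlockLocations(String name) {
        rpcCount++;
        return blockHostsByFile.get(name);
    }

    /** One round trip: every file name together with its block hosts. */
    Map<String, List<String>> listLocatedStatus() {
        rpcCount++;
        return new LinkedHashMap<>(blockHostsByFile);
    }

    int rpcCount() {
        return rpcCount;
    }
}

final class ListingSketch {
    /** Old pattern: list the directory, then ask for locations file by file. */
    static int callsWithPerFileLookup(ToyNameNode nn) {
        int before = nn.rpcCount();
        for (String file : nn.listStatus()) {
            nn.getFileBlockLocations(file);
        }
        return nn.rpcCount() - before;
    }

    /** New pattern: one listing that already carries the locations. */
    static int callsWithLocatedListing(ToyNameNode nn) {
        int before = nn.rpcCount();
        nn.listLocatedStatus();
        return nn.rpcCount() - before;
    }

    /** Builds a toy directory with the given number of files (made-up names and hosts). */
    static ToyNameNode sample(int files) {
        ToyNameNode nn = new ToyNameNode();
        for (int i = 0; i < files; i++) {
            nn.addFile("part-" + i, List.of("host-" + i, "host-" + (i + 1)));
        }
        return nn;
    }

    public static void main(String[] args) {
        ToyNameNode nn = sample(3);
        System.out.println("per-file lookups: " + callsWithPerFileLookup(nn) + " calls");
        System.out.println("located listing:  " + callsWithLocatedListing(nn) + " calls");
    }
}
```

For a directory of N files, the toy model makes N + 1 calls with per-file lookups and one call with the located listing. A real file system may need more than one round trip to list a large directory, so the numbers here are only indicative.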

      Attachments

        1. listFiles.patch
          17 kB
          Hairong Kuang
        2. listFiles1.patch
          19 kB
          Hairong Kuang
        3. listFiles2.patch
          19 kB
          Hairong Kuang
        4. listFiles3.patch
          21 kB
          Hairong Kuang
        5. listFiles4.patch
          22 kB
          Hairong Kuang


          People

            Assignee: Hairong Kuang (hairong)
            Reporter: Hairong Kuang (hairong)
            Votes: 0
            Watchers: 6
