HDFS-3672 (Hadoop HDFS)

Expose disk-location information for blocks to enable better scheduling


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.0-alpha
    • Fix Version/s: 2.0.2-alpha
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      Currently, HDFS exposes which datanodes hold each block, which allows clients to make scheduling decisions for locality and load balancing. Also exposing which disk on a datanode holds a block would enable finer-grained scheduling, on a per-disk rather than a coarse per-datanode basis.

      This API would likely look similar to FileSystem#getFileBlockLocations, but would also involve a series of RPCs to the responsible datanodes to determine disk ids.
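
      To illustrate why per-disk information helps a scheduler, here is a minimal, self-contained Java sketch. It does not call any HDFS API. `DiskAwareScheduler`, `Replica`, and `diskId` are hypothetical names chosen for illustration, and the sketch assumes the disk ids have already been fetched from the datanodes.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/**
 * Hypothetical sketch of per-disk task placement. None of these names
 * come from HDFS; disk ids are assumed to be known already.
 */
final class DiskAwareScheduler {

    /** One replica of a block: the datanode holding it and the disk on that node. */
    static final class Replica {
        final String host;
        final int diskId;

        Replica(String host, int diskId) {
            this.host = host;
            this.diskId = diskId;
        }

        /** Key identifying a single disk across the cluster. */
        String diskKey() {
            return host + "/disk" + diskId;
        }
    }

    /** Number of tasks already assigned to each (host, disk) pair. */
    private final Map<String, Integer> load = new HashMap<>();

    /**
     * Picks the replica whose disk currently has the fewest assigned tasks.
     * Ties go to the replica that appears first in the list.
     */
    Replica assign(List<Replica> replicas) {
        if (replicas.isEmpty()) {
            throw new IllegalArgumentException("block has no replicas");
        }
        Replica best = null;
        int bestLoad = Integer.MAX_VALUE;
        for (Replica r : replicas) {
            int current = load.getOrDefault(r.diskKey(), 0);
            if (current < bestLoad) {
                best = r;
                bestLoad = current;
            }
        }
        load.merge(best.diskKey(), 1, Integer::sum);
        return best;
    }

    public static void main(String[] args) {
        DiskAwareScheduler scheduler = new DiskAwareScheduler();

        // Block A has replicas on dn1 (disk 0) and dn2 (disk 0).
        Replica a = scheduler.assign(Arrays.asList(
                new Replica("dn1", 0), new Replica("dn2", 0)));

        // Block B has two candidate disks on the same datanode.
        Replica b = scheduler.assign(Arrays.asList(
                new Replica("dn1", 0), new Replica("dn1", 1)));

        System.out.println(a.diskKey()); // prints "dn1/disk0"
        System.out.println(b.diskKey()); // prints "dn1/disk1"
    }
}
```

      A per-datanode scheduler sees only that both tasks land on dn1. With disk ids available, the second task can be steered to an idle disk on the same node.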

      Attachments

        1. design-doc-v1.pdf
          73 kB
          Andrew Wang
        2. design-doc-v2.pdf
          73 kB
          Andrew Wang
        3. hdfs-3672-1.patch
          37 kB
          Andrew Wang
        4. hdfs-3672-10.patch
          68 kB
          Andrew Wang
        5. hdfs-3672-11.patch
          68 kB
          Andrew Wang
        6. hdfs-3672-12.patch
          69 kB
          Andrew Wang
        7. hdfs-3672-2.patch
          48 kB
          Andrew Wang
        8. hdfs-3672-3.patch
          49 kB
          Andrew Wang
        9. hdfs-3672-4.patch
          52 kB
          Andrew Wang
        10. hdfs-3672-5.patch
          52 kB
          Andrew Wang
        11. hdfs-3672-6.patch
          60 kB
          Andrew Wang
        12. hdfs-3672-7.patch
          61 kB
          Andrew Wang
        13. hdfs-3672-8.patch
          61 kB
          Andrew Wang
        14. hdfs-3672-9.patch
          68 kB
          Andrew Wang

          People

            Assignee: Andrew Wang
            Reporter: Andrew Wang
            Votes: 0
            Watchers: 30
