Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-8031 Follow-on work for erasure coding phase I (striping layout)
  3. HDFS-9866

BlockManager#chooseExcessReplicasStriped may weaken rack fault tolerance

VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.0.0-alpha1
    • 3.0.0-alpha1
    • namenode
    • None
    • Reviewed

    Description

      In Takuya Fukudome's system tests, we find the following issue:

      A striped block group B has redundant internal block replicas. 9 internal blocks are stored in 10 datanodes across 6 racks. Datanode d1 and d2 both store a replica for internal block b1. d1's rack contains multiple internal blocks while d2's rack only has b1. Then when choosing a duplicated replica to delete, the current implementation may wrongly choose d2 thus causes the total number of racks to be decreased to 5.

      Attachments

        1. HDFS-9866.000.patch
          31 kB
          Jing Zhao

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            jingzhao Jing Zhao
            tfukudom Takuya Fukudome
            Votes:
            0 Vote for this issue
            Watchers:
            9 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment