Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-1094

Intelligent block placement policy to decrease probability of block loss

Add voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • namenode
    • None

    Description

      The current HDFS implementation specifies that the first replica is local and the other two replicas are on any two random nodes on a random remote rack. This means that if any three datanodes die together, then there is a non-trivial probability of losing at least one block in the cluster. This JIRA is to discuss if there is a better algorithm that can lower probability of losing a block.

      Attachments

        1. calculate_probs.py
          2 kB
          Rodrigo Schmidt
        2. failure_rate.py
          11 kB
          Rodrigo Schmidt
        3. prob.pdf
          67 kB
          Aravind Menon
        4. prob.pdf
          67 kB
          Aravind Menon

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            rschmidt Rodrigo Schmidt
            dhruba Dhruba Borthakur

            Dates

              Created:
              Updated:

              Slack

                Issue deployment