Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-125

Consistency of different replicas of the same block is not checked.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      HDFS currently detects corrupted replicas by verifying that its contents matches the checksum stored in the block meta-file. This is done independently for each replica of the block on the data-node it belongs to. But we do not check that the replicas are identical across data-nodes as long as they have the same size.
      This is not common but can happen as a result of a software bug or an operator mismanagement. And in this case different clients will read different data from the same file.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              shv Konstantin Shvachko
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated: