Hadoop Common / HADOOP-1124

ChecksumFileSystem does not handle ChecksumError correctly


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.12.0
    • Fix Version/s: 0.14.0
    • Component/s: fs
    • Labels: None

    Description

      When handling a ChecksumError, the checksummed file system tries to recover by rereading from a different replica.

      I have three comments:
      1. One bug in the code is that, when retrying, the object that computes the checksum is not restored to its previous state.
      2. The code also assumes that the first byte read and the byte being read when the ChecksumError occurs are in the same block.
      3. It would be more efficient to roll back to the first byte of the chunk that is being checksummed instead of rolling back to the first byte that was read.
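
The three points above can be illustrated with a small, self-contained sketch. This is not the ChecksumFileSystem code or the patch for this issue; the class `ChunkRetrySketch`, its methods, and the in-memory "replicas" are hypothetical, and all replicas are assumed to have the same length. It shows one way to reset the checksum object before every attempt (point 1) and to retry from the start of the failing chunk rather than from the first byte read (points 2 and 3).

```java
import java.util.zip.CRC32;

/** Hypothetical sketch only; not Hadoop's ChecksumFileSystem. */
final class ChunkRetrySketch {

    /** Computes one CRC32 value per bytesPerChunk-sized chunk of data. */
    static long[] chunkSums(byte[] data, int bytesPerChunk) {
        int chunks = (data.length + bytesPerChunk - 1) / bytesPerChunk;
        long[] sums = new long[chunks];
        CRC32 crc = new CRC32();
        for (int i = 0; i < chunks; i++) {
            int off = i * bytesPerChunk;
            int len = Math.min(bytesPerChunk, data.length - off);
            crc.reset();
            crc.update(data, off, len);
            sums[i] = crc.getValue();
        }
        return sums;
    }

    /**
     * Reads every chunk, verifying it against sums. On a mismatch, only the
     * failing chunk is reread from the next replica. Assumes all replicas
     * have the same length.
     */
    static byte[] readVerified(byte[][] replicas, long[] sums, int bytesPerChunk) {
        int length = replicas[0].length;
        byte[] out = new byte[length];
        CRC32 crc = new CRC32();
        for (int chunkStart = 0; chunkStart < length; chunkStart += bytesPerChunk) {
            int len = Math.min(bytesPerChunk, length - chunkStart);
            long expected = sums[chunkStart / bytesPerChunk];
            boolean verified = false;
            for (byte[] replica : replicas) {
                // Restore the checksum state before each attempt, including retries.
                crc.reset();
                // Roll back only to the first byte of the current chunk.
                crc.update(replica, chunkStart, len);
                if (crc.getValue() == expected) {
                    System.arraycopy(replica, chunkStart, out, chunkStart, len);
                    verified = true;
                    break;
                }
            }
            if (!verified) {
                throw new IllegalStateException(
                    "Checksum error in every replica at offset " + chunkStart);
            }
        }
        return out;
    }
}
```

Because each chunk is verified independently, the sketch never needs to know where the original read started, so it does not matter whether the first byte read and the failing byte fall in the same block.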


          People

            Assignee: Hairong Kuang (hairong)
            Reporter: Hairong Kuang (hairong)