Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-5522

Datanode disk error check may be incorrectly skipped

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.23.9, 2.2.0
    • 2.5.0
    • None
    • None
    • Reviewed

    Description

      After HDFS-4581 and HDFS-4699, checkDiskError() is not called when network errors occur during processing data node requests. This appears to create problems when a disk is having problems, but not failing I/O soon.

      If I/O hangs for a long time, network read/write may timeout first and the peer may close the connection. Although the error was caused by a faulty local disk, disk check is not being carried out in this case.

      Attachments

        1. HDFS-5522-v3.patch
          11 kB
          Rushabh Shah
        2. HDFS-5522-v2.patch
          11 kB
          Rushabh Shah
        3. HDFS-5522.patch
          10 kB
          Rushabh Shah

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            shahrs87 Rushabh Shah
            kihwal Kihwal Lee
            Votes:
            0 Vote for this issue
            Watchers:
            10 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment