Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-5522

Datanode disk error check may be incorrectly skipped

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.23.9, 2.2.0
    • 2.5.0
    • None
    • None
    • Reviewed

    Description

      After HDFS-4581 and HDFS-4699, checkDiskError() is not called when network errors occur during processing data node requests. This appears to create problems when a disk is having problems, but not failing I/O soon.

      If I/O hangs for a long time, network read/write may timeout first and the peer may close the connection. Although the error was caused by a faulty local disk, disk check is not being carried out in this case.

      Attachments

        1. HDFS-5522.patch
          10 kB
          Rushabh Shah
        2. HDFS-5522-v2.patch
          11 kB
          Rushabh Shah
        3. HDFS-5522-v3.patch
          11 kB
          Rushabh Shah

        Issue Links

          Activity

            People

              shahrs87 Rushabh Shah
              kihwal Kihwal Lee
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: