Hadoop HDFS / HDFS-5522

Datanode disk error check may be incorrectly skipped

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.23.9, 2.2.0
    • Fix Version/s: 2.5.0
    • Component/s: None
    • Labels:
      None
    • Target Version/s:
    • Hadoop Flags:
      Reviewed

      Description

      After HDFS-4581 and HDFS-4699, checkDiskError() is no longer called when a network error occurs while the DataNode is processing a request. This causes trouble when a disk is degraded but does not fail I/O promptly.

      If disk I/O hangs for a long time, the network read or write may time out first, and the peer may close the connection. Although the error was caused by a faulty local disk, no disk check is carried out in this case.
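
      The gap described above can be illustrated with a minimal sketch. This is not the DataNode code or the attached patch: the class DiskCheckSketch, the DiskChecker interface, and both handler methods are hypothetical names made up for illustration, and the set of exceptions treated as "network errors" is an assumption. It only contrasts a policy that skips the disk check on network-looking errors with one that checks on every I/O failure.

```java
import java.io.IOException;
import java.net.SocketException;
import java.net.SocketTimeoutException;
import java.nio.channels.ClosedChannelException;
import java.util.concurrent.atomic.AtomicInteger;

public class DiskCheckSketch {

    /** Hypothetical stand-in for a volume health check such as checkDiskError(). */
    interface DiskChecker {
        void checkDiskError();
    }

    /** Assumption: these exception types are what a "network error" looks like here. */
    static boolean isNetworkError(IOException e) {
        return e instanceof SocketTimeoutException
                || e instanceof SocketException
                || e instanceof ClosedChannelException;
    }

    /** Policy described in the report: network errors bypass the disk check. */
    static void handleSkippingNetworkErrors(IOException e, DiskChecker checker) {
        if (isNetworkError(e)) {
            return; // blamed on the peer, so the local disk is never examined
        }
        checker.checkDiskError();
    }

    /** Alternative policy: any I/O failure triggers a disk check. */
    static void handleAlwaysChecking(IOException e, DiskChecker checker) {
        checker.checkDiskError();
    }

    public static void main(String[] args) {
        // A hung disk makes the peer give up first, so the handler only
        // ever sees a socket timeout rather than a disk error.
        IOException symptom = new SocketTimeoutException("peer timed out");

        AtomicInteger skippingChecks = new AtomicInteger();
        handleSkippingNetworkErrors(symptom, skippingChecks::incrementAndGet);

        AtomicInteger alwaysChecks = new AtomicInteger();
        handleAlwaysChecking(symptom, alwaysChecks::incrementAndGet);

        System.out.println("checks with skipping policy: " + skippingChecks.get());
        System.out.println("checks with always-check policy: " + alwaysChecks.get());
    }
}
```

      In this sketch the skipping policy never inspects the disk when the only visible symptom is a socket timeout, which is the situation the report describes. Checking on every failure closes that gap, at the cost of running the check more often.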

        Attachments

        1. HDFS-5522.patch
          10 kB
          Rushabh S Shah
        2. HDFS-5522-v2.patch
          11 kB
          Rushabh S Shah
        3. HDFS-5522-v3.patch
          11 kB
          Rushabh S Shah

              People

              • Assignee:
                Rushabh S Shah (shahrs87)
              • Reporter:
                Kihwal Lee (kihwal)
              • Votes:
                0
              • Watchers:
                11
