Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-9445

Datanode may deadlock while handling a bad volume

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 2.7.2
    • Fix Version/s: 2.8.0, 2.7.2, 2.6.4, 3.0.0-alpha1
    • Component/s: None
    • Labels:
      None
    • Target Version/s:
    • Hadoop Flags:
      Reviewed

      Description

      Found one Java-level deadlock:
      =============================
      "DataXceiver for client DFSClient_attempt_xxx at /1.2.3.4:100 [Sending block BP-xxxxx:blk_123_456]":
        waiting to lock monitor 0x00007f77d0731768 (object 0x00000000d60d9930, a org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl),
        which is held by "Thread-565"
      "Thread-565":
        waiting for ownable synchronizer 0x00000000d55613c8, (a java.util.concurrent.locks.ReentrantReadWriteLock$NonfairSync),
        which is held by "DataNode: heartbeating to my-nn:8020"
      "DataNode: heartbeating to my-nn:8020":
        waiting to lock monitor 0x00007f77d0731768 (object 0x00000000d60d9930, a org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl),
        which is held by "Thread-565"
      

        Attachments

        1. HDFS-9445.00.patch
          5 kB
          Walter Su
        2. HDFS-9445.01.patch
          8 kB
          Walter Su
        3. HDFS-9445.02.patch
          7 kB
          Walter Su
        4. HDFS-9445-branch-2.6_02.patch
          7 kB
          Walter Su
        5. HDFS-9445-branch-2.6.02.patch
          7 kB
          Akira Ajisaka

          Activity

            People

            • Assignee:
              walter.k.su Walter Su
              Reporter:
              kihwal Kihwal Lee
            • Votes:
              0 Vote for this issue
              Watchers:
              25 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: