Hadoop HDFS / HDFS-4222

NN is unresponsive and loses heartbeats of DNs when Hadoop is configured to use LDAP and LDAP has issues

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.0.0, 0.23.3, 2.0.0-alpha
    • Fix Version/s: 1.2.0, 0.23.7, 2.1.0-beta
    • Component/s: namenode
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      For Hadoop clusters configured to access directory information through LDAP, FSNamesystem calls made on behalf of DFS clients can hang because of LDAP issues (including LDAP access problems caused by network issues) while holding the single FSNamesystem lock. As a result, the NN becomes unresponsive and loses heartbeats from DNs.

      FSNamesystem calls access LDAP when they instantiate FSPermissionChecker. The instantiation does not need the FSNamesystem lock, so it can be moved out of the lock scope. After the move, a hung DFS client call no longer affects other threads by hogging the single lock. This is especially helpful when separate RPC servers are used for ClientProtocol and DatanodeProtocol, because DatanodeProtocol calls do not need to access LDAP. Even if DFS client calls hang due to LDAP issues, the NN can still process requests (including heartbeats) from DNs.
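
      The idea of moving the permission-checker instantiation out of the lock scope can be illustrated with a minimal, self-contained sketch. This is not the code from the attached patches: the class LockScopeSketch, the stand-in PermissionChecker, and the methods clientCallBefore, clientCallAfter, and handleHeartbeat are hypothetical names chosen for illustration, and the "LDAP lookup" is simulated by a Supplier that blocks until released.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.ReentrantReadWriteLock;
import java.util.function.Supplier;

// Hypothetical illustration only; not the actual FSNamesystem code.
class LockScopeSketch {
    /** Stand-in for FSPermissionChecker: constructing it performs a possibly slow lookup. */
    static final class PermissionChecker {
        final String user;
        PermissionChecker(Supplier<String> userLookup) {
            this.user = userLookup.get(); // may block, e.g. on a hanging directory service
        }
    }

    private final ReentrantReadWriteLock fsLock = new ReentrantReadWriteLock();
    private long clientOps = 0;
    private long heartbeats = 0;

    /** Problematic pattern: the lookup runs while the namesystem lock is held. */
    void clientCallBefore(Supplier<String> userLookup) {
        fsLock.writeLock().lock();
        try {
            PermissionChecker pc = new PermissionChecker(userLookup);
            apply(pc);
        } finally {
            fsLock.writeLock().unlock();
        }
    }

    /** Fixed pattern: build the checker first, then take the lock. */
    void clientCallAfter(Supplier<String> userLookup) {
        PermissionChecker pc = new PermissionChecker(userLookup);
        fsLock.writeLock().lock();
        try {
            apply(pc);
        } finally {
            fsLock.writeLock().unlock();
        }
    }

    private void apply(PermissionChecker pc) {
        if (pc.user != null) {
            clientOps++;
        }
    }

    /** Datanode-side work needs the lock but no lookup; gives up after timeoutMs. */
    boolean handleHeartbeat(long timeoutMs) throws InterruptedException {
        if (!fsLock.writeLock().tryLock(timeoutMs, TimeUnit.MILLISECONDS)) {
            return false; // lock is hogged by a hung client call
        }
        try {
            heartbeats++;
            return true;
        } finally {
            fsLock.writeLock().unlock();
        }
    }

    /** Runs one client call with a hanging lookup and reports whether a heartbeat got through. */
    static boolean heartbeatSucceedsWhileLookupHangs(boolean lookupInsideLock) {
        LockScopeSketch ns = new LockScopeSketch();
        CountDownLatch lookupStarted = new CountDownLatch(1);
        CountDownLatch releaseLookup = new CountDownLatch(1);
        Supplier<String> hangingLookup = () -> {
            lookupStarted.countDown();
            try {
                releaseLookup.await(); // simulate a hung directory lookup
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            return "alice";
        };
        Thread client = new Thread(() -> {
            if (lookupInsideLock) {
                ns.clientCallBefore(hangingLookup);
            } else {
                ns.clientCallAfter(hangingLookup);
            }
        });
        client.start();
        try {
            lookupStarted.await(); // the client is now stuck in the lookup
            return ns.handleHeartbeat(200);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        } finally {
            releaseLookup.countDown(); // let the client call finish
            try {
                client.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
    }
}
```

      In this sketch, a heartbeat issued while the simulated lookup hangs times out when the lookup runs inside the lock and succeeds when the checker is built before the lock is taken. The client call itself still hangs in both variants; only its effect on other threads changes.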

      1. hdfs-4222-branch-0.23.3.patch
        29 kB
        Xiaobo Peng
      2. hdfs-4222-release-1.0.3.patch
        45 kB
        Xiaobo Peng
      3. HDFS-4222.patch
        27 kB
        Suresh Srinivas
      4. HDFS-4222.patch
        30 kB
        Suresh Srinivas
      5. HDFS-4222.23.patch
        28 kB
        Suresh Srinivas
      6. HDFS-4222-branch-1.patch
        49 kB
        Xiaobo Peng

        Issue Links

          Activity

          Xiaobo Peng created issue -
          Xiaobo Peng made changes -
          Field Original Value New Value
          Assignee Xiaobo Peng [ teledriver ]
          Ted Yu made changes -
          Summary
            Original: NN is unresponsive and lose hearbeats of DNs when Hadoop is configured to use LADP and LDAP has issues
            New: NN is unresponsive and lose heartbeats of DNs when Hadoop is configured to use LDAP and LDAP has issues
          Xiaobo Peng made changes -
          Attachment hdfs-4222-release-1.0.3.patch [ 12568929 ]
          Attachment hdfs-4222-branch-0.23.3.patch [ 12568928 ]
          Suresh Srinivas made changes -
          Attachment HDFS-4222.patch [ 12570045 ]
          Suresh Srinivas made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Affects Version/s 2.0.0-alpha [ 12320353 ]
          Affects Version/s 1.0.0 [ 12318243 ]
          Suresh Srinivas made changes -
          Attachment HDFS-4222.patch [ 12570230 ]
          Suresh Srinivas made changes -
          Fix Version/s 2.0.4-beta [ 12324031 ]
          Suresh Srinivas made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Suresh Srinivas made changes -
          Attachment HDFS-4222.23.patch [ 12570416 ]
          Suresh Srinivas made changes -
          Fix Version/s 0.23.7 [ 12323955 ]
          Xiaobo Peng made changes -
          Attachment HDFS-4222-branch-1.patch [ 12570612 ]
          Suresh Srinivas made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Hadoop Flags Reviewed [ 10343 ]
          Fix Version/s 1.2.0 [ 12321657 ]
          Resolution Fixed [ 1 ]
          Jing Zhao made changes -
          Link This issue relates to HDFS-4622 [ HDFS-4622 ]
          Xiaobo Peng made changes -
          Summary
            Original: NN is unresponsive and lose heartbeats of DNs when Hadoop is configured to use LDAP and LDAP has issues
            New: NN is unresponsive and loses heartbeats of DNs when Hadoop is configured to use LDAP and LDAP has issues
          Matt Foley made changes -
          Status Resolved [ 5 ] Closed [ 6 ]

            People

            • Assignee:
              Xiaobo Peng
              Reporter:
              Xiaobo Peng
            • Votes:
              0
              Watchers:
              17
