  1. Hadoop HDFS
  2. HDFS-862

Potential NN deadlock in processDistributedUpgradeCommand

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: 0.22.0, 0.23.1
    • Fix Version/s: None
    • Component/s: namenode
    • Labels: None

    Description

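      Haven't seen this in practice, but the lock order is inconsistent. processReport locks FSNamesystem, then calls UpgradeManager.startUpgrade, getUpgradeState, and getUpgradeStatus (each of which locks the UpgradeManager). FSNamesystem.processDistributedUpgradeCommand calls upgradeManager.processUpgradeCommand, which is synchronized on UpgradeManager and can call FSNamesystem.leaveSafeMode, which synchronizes on FSNamesystem. So one path takes FSNamesystem and then UpgradeManager, while the other takes UpgradeManager and then FSNamesystem; two threads running these paths concurrently could each hold one lock while waiting for the other.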
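
      To illustrate the inconsistent ordering described above, here is a minimal sketch that records which lock is held while another is acquired and then checks the resulting graph for a cycle. It does not use any Hadoop classes: LockOrderSketch, acquire, release, hasCycle, blockReportPath and upgradeCommandPath are hypothetical names made up for this example, and the lock names are plain strings standing in for the two monitors named in the description.

```java
import java.util.ArrayDeque;
import java.util.Collections;
import java.util.Deque;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

/** Hypothetical helper: records "held -> acquired" lock edges and looks for a cycle. */
class LockOrderSketch {
    // Lock-order graph: the key lock was held while each lock in the value set was acquired.
    private final Map<String, Set<String>> edges = new HashMap<>();
    // Locks held by the simulated call path, most recently acquired first.
    private final Deque<String> held = new ArrayDeque<>();

    void acquire(String lock) {
        for (String h : held) {
            if (!h.equals(lock)) { // a reentrant acquisition adds no ordering edge
                edges.computeIfAbsent(h, k -> new HashSet<>()).add(lock);
            }
        }
        held.push(lock);
    }

    void release() {
        held.pop();
    }

    /** Returns true if the recorded acquisition orders contain a cycle. */
    boolean hasCycle() {
        Set<String> done = new HashSet<>();
        for (String start : edges.keySet()) {
            if (dfs(start, new HashSet<>(), done)) {
                return true;
            }
        }
        return false;
    }

    // Depth-first search; revisiting a node that is still on the current path means a cycle.
    private boolean dfs(String node, Set<String> onPath, Set<String> done) {
        if (onPath.contains(node)) {
            return true;
        }
        if (done.contains(node)) {
            return false;
        }
        onPath.add(node);
        for (String next : edges.getOrDefault(node, Collections.<String>emptySet())) {
            if (dfs(next, onPath, done)) {
                return true;
            }
        }
        onPath.remove(node);
        done.add(node);
        return false;
    }

    // Models the first path in the description: FSNamesystem, then UpgradeManager.
    static void blockReportPath(LockOrderSketch g) {
        g.acquire("FSNamesystem");   // processReport
        g.acquire("UpgradeManager"); // e.g. startUpgrade / getUpgradeState
        g.release();
        g.release();
    }

    // Models the second path in the description: UpgradeManager, then FSNamesystem.
    static void upgradeCommandPath(LockOrderSketch g) {
        g.acquire("UpgradeManager"); // processUpgradeCommand
        g.acquire("FSNamesystem");   // leaveSafeMode
        g.release();
        g.release();
    }

    public static void main(String[] args) {
        LockOrderSketch g = new LockOrderSketch();
        blockReportPath(g);
        System.out.println("cycle after first path: " + g.hasCycle());
        upgradeCommandPath(g);
        System.out.println("cycle after both paths: " + g.hasCycle());
    }
}
```

      Either path alone is acyclic; the cycle only appears once both orderings have been recorded. That matches the report: the problem is the combination of the two call paths, not either one by itself.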

      Attachments

        1. org.apache.hadoop.hdfs.server.common.TestDistributedUpgrade-output.txt
          164 kB
          Andrey Klochkov
        2. cycle.png
          45 kB
          Todd Lipcon

          People

            Assignee: Unassigned
            Reporter: Todd Lipcon (tlipcon)
            Votes: 0
            Watchers: 8
