Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-4904

Deadlock while leaving safe mode.

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • 0.18.3
    • 0.18.3
    • None
    • None
    • Reviewed

    Description

      SafeModeInfo.leave() acquires locks in an incorrect order, which causes the deadlock.
      It first acquires the SafeModeInfo lock, then calls FSNamesystem.processMisReplicatedBlocks(), which requires the global FSNamesystem lock.
      It should be the other way around: first FSNamesystem lock, then SafeModeInfo.

      Attachments

        1. safeModeDeadlock-0-18.patch
          1 kB
          Konstantin Shvachko
        2. safeModeDeadlock-0-18.patch
          1 kB
          Konstantin Shvachko
        3. safeModeDeadlock.patch
          1 kB
          Konstantin Shvachko

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            shv Konstantin Shvachko
            shv Konstantin Shvachko
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment