Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-4176

EditLogTailer should call rollEdits with a timeout

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.0.2-alpha, 3.0.0-alpha1
    • 2.9.0, 3.0.0-alpha1
    • ha, namenode
    • None

    Description

      When the EditLogTailer thread calls rollEdits() on the active NN via RPC, it currently does so without a timeout. So, if the active NN has frozen (but not actually crashed), this call can hang forever. This can then potentially prevent the standby from becoming active.

      This may actually considered a side effect of HADOOP-6762 – if the RPC were interruptible, that would also fix the issue.

      Attachments

        1. namenode.jstack4
          70 kB
          Marc Heide
        2. HDFS-4176-branch-2.2.patch
          9 kB
          Lei (Eddy) Xu
        3. HDFS-4176-branch-2.1.patch
          9 kB
          Lei (Eddy) Xu
        4. HDFS-4176-branch-2.003.patch
          9 kB
          Lei (Eddy) Xu
        5. HDFS-4176-branch-2.0.patch
          9 kB
          Lei (Eddy) Xu
        6. HDFS-4176.04.patch
          10 kB
          Lei (Eddy) Xu
        7. HDFS-4176.03.patch
          10 kB
          Lei (Eddy) Xu
        8. HDFS-4176.02.patch
          10 kB
          Lei (Eddy) Xu
        9. HDFS-4176.01.patch
          9 kB
          Lei (Eddy) Xu
        10. HDFS-4176.00.patch
          9 kB
          Lei (Eddy) Xu

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            eddyxu Lei (Eddy) Xu
            tlipcon Todd Lipcon
            Votes:
            0 Vote for this issue
            Watchers:
            16 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment