Hadoop HDFS / HDFS-4176

EditLogTailer should call rollEdits with a timeout


    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.2-alpha, 3.0.0-alpha1
    • Fix Version/s: 2.9.0, 3.0.0-alpha1
    • Component/s: ha, namenode
    • Labels:
      None

      Description

      When the EditLogTailer thread calls rollEdits() on the active NN via RPC, it currently does so without a timeout. If the active NN has frozen (but not actually crashed), the call can hang forever, which can in turn prevent the standby from becoming active.

      This may actually be considered a side effect of HADOOP-6762: if the RPC were interruptible, that would also fix the issue.
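
      The idea in the description can be sketched as follows: run the blocking call on a separate thread and wait for it with a bounded Future.get(). This is a minimal illustration only, not the code from the attached patches. The ActiveNameNode interface, the rollEditsWithTimeout helper, and the timeout values are hypothetical names introduced for the example. Note that cancelling the future only interrupts the worker thread; if the underlying RPC is not interruptible (the HADOOP-6762 concern), that thread may linger, which is why the sketch uses a daemon thread.

```java
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class TimedRollEdits {
    /** Hypothetical stand-in for the RPC proxy to the active NameNode. */
    interface ActiveNameNode {
        void rollEdits() throws Exception;
    }

    /**
     * Calls rollEdits() on a helper thread and waits at most timeoutMs.
     * Returns true if the call finished in time, false if it timed out.
     */
    static boolean rollEditsWithTimeout(ActiveNameNode active, long timeoutMs)
            throws InterruptedException, ExecutionException {
        // Daemon thread: a call that never returns must not block JVM exit.
        ExecutorService executor = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "rollEdits-caller");
            t.setDaemon(true);
            return t;
        });
        Future<Void> future = executor.submit(() -> {
            active.rollEdits();
            return null;
        });
        try {
            // Bounded wait: the caller regains control even if the remote side hangs.
            future.get(timeoutMs, TimeUnit.MILLISECONDS);
            return true;
        } catch (TimeoutException e) {
            // Best effort: interrupts the worker, which only helps if the call is interruptible.
            future.cancel(true);
            return false;
        } finally {
            executor.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        // A responsive stub returns immediately; a "frozen" stub sleeps past the timeout.
        ActiveNameNode healthy = () -> { };
        ActiveNameNode frozen = () -> Thread.sleep(60_000);
        System.out.println("healthy completed: " + rollEditsWithTimeout(healthy, 1_000));
        System.out.println("frozen completed: " + rollEditsWithTimeout(frozen, 200));
    }
}
```

      With a bounded wait like this, the tailer thread gets control back after the timeout and can move on (for example, to respond to a failover request) instead of blocking indefinitely on a frozen active NN.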

        Attachments

        1. HDFS-4176-branch-2.003.patch
          9 kB
          Lei (Eddy) Xu
        2. HDFS-4176-branch-2.2.patch
          9 kB
          Lei (Eddy) Xu
        3. HDFS-4176-branch-2.1.patch
          9 kB
          Lei (Eddy) Xu
        4. HDFS-4176-branch-2.0.patch
          9 kB
          Lei (Eddy) Xu
        5. HDFS-4176.04.patch
          10 kB
          Lei (Eddy) Xu
        6. HDFS-4176.03.patch
          10 kB
          Lei (Eddy) Xu
        7. HDFS-4176.02.patch
          10 kB
          Lei (Eddy) Xu
        8. HDFS-4176.01.patch
          9 kB
          Lei (Eddy) Xu
        9. HDFS-4176.00.patch
          9 kB
          Lei (Eddy) Xu
        10. namenode.jstack4
          70 kB
          Marc Heide

            People

            • Assignee: Lei (Eddy) Xu (eddyxu)
            • Reporter: Todd Lipcon (tlipcon)
