Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-4176

EditLogTailer should call rollEdits with a timeout

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.0.2-alpha, 3.0.0-alpha1
    • 2.9.0, 3.0.0-alpha1
    • ha, namenode
    • None

    Description

      When the EditLogTailer thread calls rollEdits() on the active NN via RPC, it currently does so without a timeout. So, if the active NN has frozen (but not actually crashed), this call can hang forever. This can then potentially prevent the standby from becoming active.

      This may actually considered a side effect of HADOOP-6762 – if the RPC were interruptible, that would also fix the issue.

      Attachments

        1. namenode.jstack4
          70 kB
          Marc Heide
        2. HDFS-4176.00.patch
          9 kB
          Lei (Eddy) Xu
        3. HDFS-4176.01.patch
          9 kB
          Lei (Eddy) Xu
        4. HDFS-4176.02.patch
          10 kB
          Lei (Eddy) Xu
        5. HDFS-4176.03.patch
          10 kB
          Lei (Eddy) Xu
        6. HDFS-4176.04.patch
          10 kB
          Lei (Eddy) Xu
        7. HDFS-4176-branch-2.0.patch
          9 kB
          Lei (Eddy) Xu
        8. HDFS-4176-branch-2.1.patch
          9 kB
          Lei (Eddy) Xu
        9. HDFS-4176-branch-2.2.patch
          9 kB
          Lei (Eddy) Xu
        10. HDFS-4176-branch-2.003.patch
          9 kB
          Lei (Eddy) Xu

        Issue Links

          Activity

            People

              eddyxu Lei (Eddy) Xu
              tlipcon Todd Lipcon
              Votes:
              0 Vote for this issue
              Watchers:
              16 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: