Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-4176

EditLogTailer should call rollEdits with a timeout

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.2-alpha, 3.0.0-alpha1
    • Fix Version/s: 2.9.0, 3.0.0-alpha1
    • Component/s: ha, namenode
    • Labels:
      None

      Description

      When the EditLogTailer thread calls rollEdits() on the active NN via RPC, it currently does so without a timeout. So, if the active NN has frozen (but not actually crashed), this call can hang forever. This can then potentially prevent the standby from becoming active.

      This may actually considered a side effect of HADOOP-6762 – if the RPC were interruptible, that would also fix the issue.

        Attachments

        1. namenode.jstack4
          70 kB
          Marc Heide
        2. HDFS-4176-branch-2.2.patch
          9 kB
          Lei (Eddy) Xu
        3. HDFS-4176-branch-2.1.patch
          9 kB
          Lei (Eddy) Xu
        4. HDFS-4176-branch-2.003.patch
          9 kB
          Lei (Eddy) Xu
        5. HDFS-4176-branch-2.0.patch
          9 kB
          Lei (Eddy) Xu
        6. HDFS-4176.04.patch
          10 kB
          Lei (Eddy) Xu
        7. HDFS-4176.03.patch
          10 kB
          Lei (Eddy) Xu
        8. HDFS-4176.02.patch
          10 kB
          Lei (Eddy) Xu
        9. HDFS-4176.01.patch
          9 kB
          Lei (Eddy) Xu
        10. HDFS-4176.00.patch
          9 kB
          Lei (Eddy) Xu

          Issue Links

            Activity

              People

              • Assignee:
                eddyxu Lei (Eddy) Xu
                Reporter:
                tlipcon Todd Lipcon
              • Votes:
                0 Vote for this issue
                Watchers:
                16 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: