[HDFS-4176] EditLogTailer should call rollEdits with a timeout - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.0.2-alpha, 3.0.0-alpha1
Fix Version/s: 2.9.0, 3.0.0-alpha1
Component/s: ha, namenode
Labels:
None

Target Version/s:

2.9.0, 3.0.0-alpha2
Hadoop Flags:

Reviewed

Description

When the EditLogTailer thread calls rollEdits() on the active NN via RPC, it currently does so without a timeout. So, if the active NN has frozen (but not actually crashed), this call can hang forever. This can then potentially prevent the standby from becoming active.

This may actually considered a side effect of ~~HADOOP-6762~~ – if the RPC were interruptible, that would also fix the issue.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HDFS-4176-branch-2.003.patch
02/Aug/16 22:26
9 kB
Lei (Eddy) Xu
HDFS-4176-branch-2.2.patch
02/Aug/16 18:23
9 kB
Lei (Eddy) Xu
HDFS-4176-branch-2.1.patch
29/Jul/16 17:31
9 kB
Lei (Eddy) Xu
HDFS-4176-branch-2.0.patch
28/Jul/16 21:41
9 kB
Lei (Eddy) Xu
HDFS-4176.04.patch
28/Jul/16 17:39
10 kB
Lei (Eddy) Xu
HDFS-4176.03.patch
27/Jul/16 22:46
10 kB
Lei (Eddy) Xu
HDFS-4176.02.patch
27/Jul/16 22:32
10 kB
Lei (Eddy) Xu
HDFS-4176.01.patch
27/Jul/16 20:52
9 kB
Lei (Eddy) Xu
HDFS-4176.00.patch
27/Jul/16 17:22
9 kB
Lei (Eddy) Xu
namenode.jstack4
30/Oct/14 07:58
70 kB
Marc Heide

Issue Links

is depended upon by

HDFS-10734 Rename "dfs.ha.tail-edits.rolledits.timeout" to "dfs.ha.log-roll.execution.timeout"

Open

Activity

People

Assignee:: Lei (Eddy) Xu

Reporter:: Todd Lipcon

Votes:: 0 Vote for this issue

Watchers:: 16 Start watching this issue

Dates

Created:: 12/Nov/12 19:11

Updated:: 30/Aug/16 01:42

Resolved:: 09/Aug/16 00:19