Affects Version/s: None
Fix Version/s: v1.6.0
This is running on a snapshot of Flume-1.5 with the git hash 99db32ccd163daf9d7685f0e8485941701e1133d
When a datanode becomes unresponsive for a significant amount of time (for example, during a long GC pause), an append failure occurs, followed by repeated timeouts in the log and a failure to close the stream. The relevant section of the logs is attached (starting from where the problem first appears).
The same log repeats periodically, consistently running into a TimeoutException.
Restarting Flume (or presumably just the HDFSSink) resolves the issue.
Probable cause is in the comments.
|Transition||Time In Source Status||Execution Times||Last Executer||Last Execution Date|
|173d 18h 23m||1||Hari Shreedharan||15/May/14 00:44|
|Fix Version/s||v1.6.0 [ 12327047 ]|
|Status||Open [ 1 ]||Resolved [ 5 ]|
|Assignee||Brock Noland [ brocknoland ]|
|Resolution||Fixed [ 1 ]|
|Remote Link||This issue links to "Review (Web Link)" [ 15067 ]|