There are currently a number of ways that a collector can die, typically due to errors on a DN or a NN that's being restarted. A collector should have some combination of retry logic followed by failing back to the agent, but the collector process should not die.
|Assignee||Bill Graham [ billgraham ]|
|Status||Open [ 1 ]||Patch Available [ 10002 ]|
|Release Note||Chukwa collector is more fault-tolerant of partial HDFS outages.|
|Status||Patch Available [ 10002 ]||Resolved [ 5 ]|
|Fix Version/s||0.5.0 [ 12315030 ]|
|Resolution||Fixed [ 1 ]|