Details
- Type: Improvement
- Status: Resolved
- Priority: Major
- Resolution: Fixed
Description
To reduce the number of files on HDFS, we need a rolling mechanism for the demux output:
- Avoid merging immediately if a file already exists for the same time range; create a spill file instead.
- Merge all raw files every hour.
- Merge all hourly files every day.
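The three steps above can be sketched as a small in-memory planner. This is only an illustration under stated assumptions, not the code committed for this issue: the file naming scheme (`<bucket>.raw`, `<bucket>.spill<N>.raw`) and the helpers `hour_key`, `day_key`, `output_name`, and `plan_merges` are all hypothetical, and no HDFS calls are made.

```python
from collections import defaultdict
from datetime import datetime, timezone


def hour_key(epoch_seconds):
    """Return the UTC hour bucket, e.g. '2008-12-01T13', for a timestamp."""
    moment = datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)
    return moment.strftime("%Y-%m-%dT%H")


def day_key(hour):
    """Return the day part of an hour bucket: '2008-12-01T13' -> '2008-12-01'."""
    return hour.split("T")[0]


def output_name(existing, bucket):
    """Pick a name for new demux output without touching existing files.

    Hypothetical naming: the first file for a bucket is '<bucket>.raw';
    later ones become spill files '<bucket>.spill<N>.raw' instead of
    being merged right away.
    """
    count = sum(1 for name in existing if name.startswith(bucket + "."))
    return f"{bucket}.raw" if count == 0 else f"{bucket}.spill{count}.raw"


def plan_merges(names, key_of):
    """Group names by a roll-up key; each group is one merge job."""
    groups = defaultdict(list)
    for name in sorted(names):
        groups[key_of(name)].append(name)
    return dict(groups)


# Three demux runs: two land in the same hour, one in the next hour.
files = []
for ts in (0, 60, 3600):
    files.append(output_name(files, hour_key(ts)))

# Hourly roll-up: raw and spill files of the same hour form one merge job.
hourly = plan_merges(files, lambda name: name.split(".")[0])

# Daily roll-up: the hourly results of the same day form one merge job.
daily = plan_merges(hourly.keys(), day_key)
```

Writing a spill file keeps each demux run cheap, since no existing file is rewritten; the cost of merging is deferred to the hourly and daily roll-ups, which each reduce many small files to one.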
Issue Links
- is part of HADOOP-4709 Update chukwa source code and develop chukwa from public repository (Resolved)