Details
Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Description
1. Should we start accounting for these overheads in the memory calculations?
2. Put a hard limit on how many in-memory segments are maintained before they are aggregated to disk.
3. Eventually, get the in-memory merge functional so that we don't maintain 500K tiny buffers.
4. Evaluate a lower-overhead collection for storing MapOutput, InMemoryReader, and Segment instances.
5. Remove DataInputBuffer from TezMerger.Segment.
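
As a rough illustration of items 1 and 2, the sketch below charges a fixed per-segment overhead on top of each payload and hands the buffered segments to a spill callback once either a segment-count cap or a byte cap is reached. It is not Tez code: the class `BoundedSegmentBuffer`, its parameters, and the `spillToDisk` callback are hypothetical names, and the per-segment overhead is an assumed constant that would have to be measured for the real objects.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

/** Hypothetical sketch, not part of Tez: caps in-memory segments by count and by accounted bytes. */
public class BoundedSegmentBuffer {
    private final int maxSegments;            // hard cap on buffered segments (item 2)
    private final long maxBytes;              // cap on payload plus overhead
    private final long perSegmentOverhead;    // assumed bookkeeping cost per segment (item 1)
    private final Consumer<List<byte[]>> spillToDisk; // stand-in for the real merge-to-disk step
    private final List<byte[]> segments = new ArrayList<>();
    private long accountedBytes = 0;

    public BoundedSegmentBuffer(int maxSegments, long maxBytes, long perSegmentOverhead,
                                Consumer<List<byte[]>> spillToDisk) {
        this.maxSegments = maxSegments;
        this.maxBytes = maxBytes;
        this.perSegmentOverhead = perSegmentOverhead;
        this.spillToDisk = spillToDisk;
    }

    /** Buffers a segment and spills everything once either limit is reached. */
    public void add(byte[] segment) {
        segments.add(segment);
        // Charge the payload and the fixed overhead, so many tiny segments still count.
        accountedBytes += segment.length + perSegmentOverhead;
        if (segments.size() >= maxSegments || accountedBytes >= maxBytes) {
            spill();
        }
    }

    private void spill() {
        // Pass a copy so the callback may keep the batch after the buffer is cleared.
        spillToDisk.accept(new ArrayList<>(segments));
        segments.clear();
        accountedBytes = 0;
    }

    public int size() {
        return segments.size();
    }

    public long accountedBytes() {
        return accountedBytes;
    }
}
```

With a count cap in place, a stream of tiny segments triggers a spill long before the payload bytes alone would, which is the situation the 500K-buffer case describes. Whether a fixed overhead constant is accurate enough, or whether the overhead should be estimated per object type, is left open here.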