Issue Details (XML | Word | Printable)

Key: HADOOP-2774
Type: Improvement Improvement
Status: Closed Closed
Resolution: Fixed
Priority: Major Major
Assignee: Ravi Gummadi
Reporter: Owen O'Malley
Votes: 0
Watchers: 4
Operations

If you were logged in you would be able to see more operations.
Hadoop Common

Add counters to show number of key/values that have been sorted and merged in the maps and reduces

Created: 02/Feb/08 08:12 AM   Updated: 08/Jul/09 04:53 PM
Return to search
Component/s: None
Affects Version/s: None
Fix Version/s: 0.20.0

Time Tracking:
Not Specified

File Attachments:
  Size
Text File Licensed for inclusion in ASF works HADOOP-2774.patch 2008-11-25 11:40 AM Ravi Gummadi 30 kB
Text File Licensed for inclusion in ASF works HADOOP-2774.patch 2008-11-25 07:02 AM Ravi Gummadi 30 kB
Text File Licensed for inclusion in ASF works HADOOP-2774.patch 2008-11-25 05:19 AM Ravi Gummadi 30 kB
Text File Licensed for inclusion in ASF works HADOOP-2774.patch 2008-11-24 04:43 PM Ravi Gummadi 30 kB
Text File Licensed for inclusion in ASF works HADOOP-2774.patch 2008-11-21 06:18 PM Ravi Gummadi 30 kB
Text File Licensed for inclusion in ASF works HADOOP-2774.patch 2008-11-14 09:25 AM Ravi Gummadi 17 kB
Text File Licensed for inclusion in ASF works HADOOP-2774.patch 2008-11-11 07:35 PM Ravi Gummadi 18 kB

Hadoop Flags: Reviewed
Resolution Date: 25/Nov/08 10:38 PM


 Description  « Hide
For each pass of the sort and merge, I would like a count of the number of records. So for example, if the map output 100 records and they were sorted once, the counter would be 100. If it spilled twice and was merged together, it would be 200. Clearly in a multi-level merge, it may not be a multiple of the number of map output records. This would let the users easily see if they have values like io.sort.mb or io.sort.factor set too low.

 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
No work has yet been logged on this issue.