[MAPREDUCE-1073] Progress reported for pipes tasks is incorrect. - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 0.20.1
Fix Version/s: None
Component/s: pipes
Labels:
None

Description

Currently in pipes, org.apache.hadoop.mapred.pipes.PipesMapRunner.run(RecordReader<K1, V1>, OutputCollector<K2, V2>, Reporter) we do the following:

        while (input.next(key, value)) {
          downlink.mapItem(key, value);
          if(skipping) {
            downlink.flush();
          }
        }

This would result in consumption of all the records for current task and taking task progress to 100% whereas the actual pipes application would be trailing behind.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

MAPREDUCE-1073_yhadoop20.patch
24/Feb/10 19:23
1 kB
Arun Murthy
mapreduce-1073--2010-03-31.patch
31/Mar/10 21:24
2 kB
Dick King
mapreduce-1073--2010-04-06.patch
06/Apr/10 21:47
282 kB
Dick King
MAPREDUCE-1073--yhadoop20--2010-07-22.patch
22/Jul/10 20:56
27 kB
Dick King
MAPREDUCE-1073--yhadoop20--2010-07-22--1530.patch
22/Jul/10 22:34
26 kB
Dick King

Activity

People

Assignee:: Dick King

Reporter:: Sreekanth Ramakrishnan

Votes:: 0 Vote for this issue

Watchers:: 7 Start watching this issue

Dates

Created:: 07/Oct/09 08:27

Updated:: 29/Jan/12 03:14