Hadoop Map/Reduce
  MAPREDUCE-4933

MR1 final merge asks for length of file it just wrote before flushing it

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.1.1
    • Fix Version/s: 1.2.0
    • Component/s: mrv1, task
    • Labels: None

      Description

      createKVIterator in ReduceTask contains the following code:

      
                try {
                  Merger.writeFile(rIter, writer, reporter, job);
                  addToMapOutputFilesOnDisk(fs.getFileStatus(outputPath));
                } catch (Exception e) {
                  if (null != outputPath) {
                    fs.delete(outputPath, true);
                  }
                  throw new IOException("Final merge failed", e);
                } finally {
                  if (null != writer) {
                    writer.close();
                  }
                }
      

      Merger#writeFile() does not close the file after writing it, so when fs.getFileStatus() is called on it, it may not return the correct length. This causes bad accounting further down the line, which can lead to map output data being lost.
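
      A minimal sketch of the reordering the fix calls for (the committed patch may differ in its details): close the writer, and thereby flush its data, before asking the filesystem for the file's status.

                try {
                  Merger.writeFile(rIter, writer, reporter, job);
                  // Close (and flush) the output before querying its length, so
                  // getFileStatus() reports the size of the fully written file.
                  writer.close();
                  writer = null;
                  addToMapOutputFilesOnDisk(fs.getFileStatus(outputPath));
                } catch (Exception e) {
                  if (null != outputPath) {
                    fs.delete(outputPath, true);
                  }
                  throw new IOException("Final merge failed", e);
                } finally {
                  if (null != writer) {
                    writer.close();
                  }
                }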

        Activity

        Robert Joseph Evans added a comment -

        Can we really lose map data? If so this is a Blocker not a Major.

        Sandy Ryza added a comment -

        If I understand how things work, yes. The length is used to calculate onDiskBytes. If onDiskBytes is 0 (which can happen falsely without the patch), the following code won't get called:

                final int numInMemSegments = memDiskSegments.size();
                diskSegments.addAll(0, memDiskSegments);
                memDiskSegments.clear();
                RawKeyValueIterator diskMerge = Merger.merge(
                    job, fs, keyClass, valueClass, codec, diskSegments,
                    ioSortFactor, numInMemSegments, tmpDir, comparator,
                    reporter, false, spilledRecordsCounter, null);
                diskSegments.clear();
                if (0 == finalSegments.size()) {
                  return diskMerge;
                }
                finalSegments.add(new Segment<K,V>(
                      new RawKVIteratorReader(diskMerge, onDiskBytes), true));
        

        which if I understand correctly means that some on-disk data won't be incorporated in the final merge.
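
        To make the accounting concrete, here is a hypothetical sketch (not the literal ReduceTask code) of how a prematurely read length can zero out onDiskBytes and skip the disk-merge branch above:

        // Hypothetical sketch: mapOutputFilesOnDisk holds the FileStatus recorded
        // for each on-disk map output. If the status for the just-written merge
        // file was taken before the writer was closed, its length can read as 0,
        // so the sum below comes out as 0 and the disk merge is never performed.
        long onDiskBytes = 0;
        for (FileStatus status : mapOutputFilesOnDisk) {
          onDiskBytes += status.getLen();
        }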

        Sandy Ryza added a comment -

        The background to coming across this was trying to apply MAPREDUCE-2264, which caused a bunch of test failures for me due to missing map data. MAPREDUCE-2264 annotates the FileStatus with extra data, but doesn't change the timing of writer.close()/getFileStatus(). The code has been around since 2008 and must be working, so I would not be surprised if the map data loss is a false alarm, but at least for MAPREDUCE-2264's sake it should be fixed nonetheless.

        Alejandro Abdelnur added a comment -

        The patch looks right to me even if it is not losing data in practice. Bobby, would you agree?

        Robert Joseph Evans added a comment -

        I agree that the patch looks correct, and should go in either way. I just wanted to know how critical this is, and it looks critical.

        Robert Joseph Evans added a comment -

        Oh by the way +1.

        I am happy to check this in, but I just added Matt Foley so he can say exactly where he wants us to check this in. In the meantime I am just going to put it into branch-1 (1.2.0-SNAPSHOT).

        Alejandro Abdelnur added a comment -

        great, thx. +1, I'll commit it in a few.

        Alejandro Abdelnur added a comment -

        Thx Bobby, please go ahead and commit it then.

        Robert Joseph Evans added a comment -

        OK I have dug into this a bit more and I am not as scared as I was previously. I could definitely be wrong so if someone could confirm my analysis on this I would appreciate it.

        So there are two places where we could get issues. The first is because we get the file size after writing and without closing the file. This is not a huge issue because that size is only used to approximate how much space will be needed for the final merge file, which we just wrote out so who cares.

        The second case is a race between writing the merge data out to a file and reading it back in. This will only happen on the final merge when the reducer has too much buffered in memory and we need to spill to disk (we exceeded mapred.job.reduce.input.buffer.percent). But even then it becomes a race between Java's buffered IO getting pushed out to disk and another part of the code reading it back in. I have a hard time believing that we will ever lose that race. I think we can move this back to Major, sorry about scaring everyone.
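
        For reference, a hypothetical configuration enabling that spill path (illustrative value only, assuming the MR1 JobConf API):

        // Illustrative only: retain map outputs in up to 70% of the reducer heap
        // once the shuffle finishes; anything beyond that threshold is merged to
        // disk during the final merge, which is the path discussed above.
        JobConf conf = new JobConf();
        conf.setFloat("mapred.job.reduce.input.buffer.percent", 0.70f);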

        Arun C Murthy added a comment -

        Sandy - have you seen this in branch-2?

        Sandy Ryza added a comment -

        It's not a problem in branch-2. The corresponding code, in MergeManager#finalMerge, does the correct thing, i.e. doesn't ask for the file's length before closing it.

        Steve Loughran added a comment -

        When using a blobstore as the destination, such as the s3:// filesystem, none of the data is written until the stream is closed; the result of the getFileStatus() call could even be a file not found.
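
        A hypothetical illustration of that hazard (fs, outputPath, and mergedBytes are stand-ins, not code from the patch):

        // On an object store such as s3://, nothing becomes visible until the
        // stream is closed, so querying the path between write() and close()
        // can throw FileNotFoundException rather than merely report a short length.
        FSDataOutputStream out = fs.create(outputPath);
        out.write(mergedBytes);                            // buffered client-side
        FileStatus status = fs.getFileStatus(outputPath);  // may fail on a blobstore
        out.close();                                       // object materializes here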

        Matt Foley added a comment -

        Added 1.2.0 to the fix version, per CHANGES.txt and the above comments.

        Matt Foley added a comment -

        Closed upon release of Hadoop 1.2.0.


          People

          • Assignee: Sandy Ryza
          • Reporter: Sandy Ryza
          • Votes: 0
          • Watchers: 10
