HADOOP-4927 (Hadoop Common)

Part files on the output filesystem are created irrespective of whether the corresponding task has anything to write there

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.21.0
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed
    • Release Note:
      All output part files are created regardless of whether the corresponding task has output.

      Description

      When OutputFormat.getRecordWriter is invoked, a part file is created on the output filesystem. But the created RecordWriter is not used until the OutputCollector.collect call is made by the task (user's code). This results in empty part files even if the OutputCollector.collect is never invoked by the corresponding tasks.
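The fix that eventually landed wraps the real RecordWriter so that the part file is only created when the first record is written. A minimal, self-contained sketch of the idea (the class names and the file counter are illustrative stand-ins, not the actual Hadoop API):

```java
// Minimal stand-in for Hadoop's RecordWriter; illustrative only.
interface RecordWriter {
    void write(String key, String value);
    void close();
}

// Eager writer: "creates the part file" in its constructor,
// which is what produces empty files for tasks with no output.
class EagerWriter implements RecordWriter {
    static int filesCreated = 0;
    EagerWriter() { filesCreated++; }
    public void write(String key, String value) {}
    public void close() {}
}

// Lazy wrapper: the real writer (and hence the file) is created
// only on the first write(); closing an untouched writer is a no-op.
class LazyWriter implements RecordWriter {
    private RecordWriter real; // null until the first record
    public void write(String key, String value) {
        if (real == null) real = new EagerWriter();
        real.write(key, value);
    }
    public void close() {
        if (real != null) real.close();
    }
}

public class LazyCreationSketch {
    public static void main(String[] args) {
        RecordWriter idle = new LazyWriter();
        idle.close(); // task wrote nothing: no file is created
        System.out.println("after idle task: " + EagerWriter.filesCreated);

        RecordWriter busy = new LazyWriter();
        busy.write("k", "v"); // first record triggers file creation
        busy.close();
        System.out.println("after busy task: " + EagerWriter.filesCreated);
    }
}
```

Run alone, this prints a count of 0 after the idle task and 1 after the busy one: only the task that actually emitted a record created a file.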

      Attachments

      1. hadoop-4927-y20.patch
        34 kB
        Jothi Padmanabhan
      2. hadoop-4927-v6.patch
        34 kB
        Jothi Padmanabhan
      3. hadoop-4927-v5.patch
        34 kB
        Jothi Padmanabhan
      4. hadoop-4927-v4.patch
        33 kB
        Jothi Padmanabhan
      5. hadoop-4927-v3.patch
        24 kB
        Jothi Padmanabhan
      6. hadoop-4927-v2.patch
        18 kB
        Jothi Padmanabhan
      7. hadoop-4927-v1.patch
        17 kB
        Jothi Padmanabhan
      8. hadoop-4927.patch
        15 kB
        Jothi Padmanabhan

        Activity

        Robert Chansler added a comment -

        Editorial pass over all release notes prior to publication of 0.21.

        Jothi Padmanabhan added a comment -

        Patch for the Y! 20 distribution.

        Hudson added a comment -

        Integrated in Hadoop-trunk #763 (See http://hudson.zones.apache.org/hudson/job/Hadoop-trunk/763/)
        HADOOP-4927. Adds a generic wrapper around OutputFormat to allow creation of output on demand. Contributed by Jothi Padmanabhan.

        Devaraj Das added a comment -

        I just committed this. Thanks, Jothi!

        Hadoop QA added a comment -

        +1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12400482/hadoop-4927-v6.patch
        against trunk revision 745934.

        +1 @author. The patch does not contain any @author tags.

        +1 tests included. The patch appears to include 14 new or modified tests.

        +1 javadoc. The javadoc tool did not generate any warning messages.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        +1 findbugs. The patch does not introduce any new Findbugs warnings.

        +1 Eclipse classpath. The patch retains Eclipse classpath integrity.

        +1 release audit. The applied patch does not increase the total number of release audit warnings.

        +1 core tests. The patch passed core unit tests.

        +1 contrib tests. The patch passed contrib unit tests.

        Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3884/testReport/
        Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3884/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
        Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3884/artifact/trunk/build/test/checkstyle-errors.html
        Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3884/console

        This message is automatically generated.

        Jothi Padmanabhan added a comment -

        Incorporated the review comment to check for nulls in a private method

        Doug Cutting added a comment -

        A minor nit: in FilterOutputFormat, the checks for a null baseOut and null rawWriter might be better factored into private getBaseOut() and getRawWriter methods, so that the body of most filter methods is just one line, e.g., 'return getBaseOut().getRecordWriter(...);'. Other than that, +1.
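Doug's nit, sketched: centralize the null check in one private accessor so each delegating method stays a one-liner. The types below are simplified stand-ins for the real Hadoop classes, so treat this as shape only, not the committed patch:

```java
// Simplified stand-ins for the Hadoop types; illustrative only.
interface OutputFormat {
    String getRecordWriter(String name);
}

class BaseOutputFormat implements OutputFormat {
    public String getRecordWriter(String name) { return "writer:" + name; }
}

class FilterOutputFormat implements OutputFormat {
    private OutputFormat baseOut;

    void setBaseOut(OutputFormat out) { this.baseOut = out; }

    // The null check lives here once ...
    private OutputFormat getBaseOut() {
        if (baseOut == null) {
            throw new IllegalStateException("base OutputFormat not set");
        }
        return baseOut;
    }

    // ... so the body of each filter method is a single line.
    public String getRecordWriter(String name) {
        return getBaseOut().getRecordWriter(name);
    }
}

public class FilterSketch {
    public static void main(String[] args) {
        FilterOutputFormat f = new FilterOutputFormat();
        f.setBaseOut(new BaseOutputFormat());
        System.out.println(f.getRecordWriter("part-00000"));
    }
}
```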

        Jothi Padmanabhan added a comment -

        The test that timed out org.apache.hadoop.mapred.TestTaskLimits.testTaskLimits passed on my local machine and this test is unrelated to the patch.

        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12400180/hadoop-4927-v5.patch
        against trunk revision 744406.

        +1 @author. The patch does not contain any @author tags.

        +1 tests included. The patch appears to include 14 new or modified tests.

        +1 javadoc. The javadoc tool did not generate any warning messages.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        +1 findbugs. The patch does not introduce any new Findbugs warnings.

        +1 Eclipse classpath. The patch retains Eclipse classpath integrity.

        +1 release audit. The applied patch does not increase the total number of release audit warnings.

        -1 core tests. The patch failed core unit tests.

        +1 contrib tests. The patch passed contrib unit tests.

        Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3859/testReport/
        Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3859/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
        Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3859/artifact/trunk/build/test/checkstyle-errors.html
        Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3859/console

        This message is automatically generated.

        Jothi Padmanabhan added a comment -

        Testpatch and ant test passed on my local box with this patch

        Jothi Padmanabhan added a comment -

        Synched the patch with trunk

        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12400110/hadoop-4927-v4.patch
        against trunk revision 744000.

        +1 @author. The patch does not contain any @author tags.

        +1 tests included. The patch appears to include 14 new or modified tests.

        -1 patch. The patch command could not apply the patch.

        Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3844/console

        This message is automatically generated.

        Jothi Padmanabhan added a comment -
        • Patch that implements FilterOutputFormat and FilterRecordWriter that LazyOutputFormat and LazyRecordWriter extend.
        • Pipes and Streaming support the -lazyOutput flag
        • Added test cases for both mapred and mapreduce packages
        Doug Cutting added a comment -
        • LazyOutputFormat should keep a field for the nested output format, not create it again for each call, no?
        • We might implement generic FilterOutputFormat and FilterRecordWriter that LazyOutputFormat and LazyRecordWriter extend. This is probably not the last time someone will need to wrap an OutputFormat or a RecordWriter.
        • JobConf#setOutputFormatClass(Class, boolean) should instead be static LazyOutputFormat#setClass(Job, Class, boolean). This localizes the change, and it's still a one-line change for applications.
        • Similarly, JobContext#getLazyOutputFormatClass() should instead be static LazyOutputFormat#getClass(JobContext). This feature can be entirely contained in LazyOutputFormat and should not require changes to the kernel.
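The pattern Doug describes in the last two points, static helpers on the wrapper so that only the library class knows the configuration key, can be sketched with a map-backed stand-in for the job configuration. The property name below is hypothetical, chosen for illustration, and is not necessarily the key the committed patch used:

```java
import java.util.HashMap;
import java.util.Map;

// Map-backed stand-in for a Hadoop job configuration; illustrative only.
class Conf {
    private final Map<String, String> props = new HashMap<>();
    void set(String key, String value) { props.put(key, value); }
    String get(String key) { return props.get(key); }
}

// All knowledge of the config key stays inside the wrapper class,
// so the framework ("kernel") needs no changes.
class LazyOutputFormat {
    // Hypothetical property name, for illustration only.
    private static final String BASE_KEY = "lazyoutputformat.baseoutputformat.class";

    static void setOutputFormatClass(Conf conf, Class<?> theClass) {
        conf.set(BASE_KEY, theClass.getName());
    }

    static String getOutputFormatClassName(Conf conf) {
        return conf.get(BASE_KEY);
    }
}

public class ConfigSketch {
    public static void main(String[] args) {
        Conf conf = new Conf();
        // Still a one-line change for applications, as Doug notes:
        LazyOutputFormat.setOutputFormatClass(conf, String.class);
        System.out.println(LazyOutputFormat.getOutputFormatClassName(conf));
    }
}
```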
        Jothi Padmanabhan added a comment -

        Attaching patch to elicit comments. This patch implements a wrapper for OutputFormat. It also handles the streaming case by supporting a flag "-lazyOutput". Pipes still needs to be handled.

        Devaraj Das added a comment -

        It might make sense to add a new API, job.setOutputFormat(OutputFormat, boolean lazy). That would be shorthand for writing the two lines explicitly. Also, we need to handle the streaming/pipes case (where "lazyOutputCreation" could probably be taken as a command-line argument).

        Doug Cutting added a comment -

        > LazyOutputFormat.set(actualoutputformat.class) and
        > job.setOutputFormat(LazyOutputFormat.class)

        Right. That's the two-line penalty of a wrapper. If we built it into FileOutputFormat then it would only take one line:

        FileOutputFormat.setLazyOutput(true);

        but it would then also only work for subclasses of FileOutputFormat, rather than any OutputFormat implementation. This is a tough call, since most, but not all, OutputFormats do subclass FileOutputFormat. I'm leaning towards the wrapper, since, while a bit more complex for users, it is a cleaner layering, making FileOutputFormat less of a kitchen-sink of features.

        Jothi Padmanabhan added a comment -

        Sorry, we set the class objects in setOutputFormat, so it would be

        LazyOutputFormat.set(actualoutputformat.class) and
        job.setOutputFormat(LazyOutputFormat.class)

        Jothi Padmanabhan added a comment -

        So, if I understand this correctly, we have a new wrapper OutputFormat class

        
        class LazyOutputFormat {
          OutputFormat rawOutputFormat;

          LazyOutputFormat(OutputFormat rawOF) {
            this.rawOutputFormat = rawOF;
          }

          RecordWriter getRecordWriter(...) {
            return new LazyRecordWriter(...);
          }
        }
        
        

        Users will then do

        job.setOutputFormat(new LazyOutputFormat(actualOutputFormat));
        

        The other option would be

        FileOutputFormat.setLazy(true);
        
        JobConf::getOutputFormat() {
          OutputFormat o = ...; // the configured output format
          if (o instanceof FileOutputFormat && lazyFlag) {
            return new LazyOutputFormat(o);
          } else {
            return o;
          }
        }
        
        
        Doug Cutting added a comment -

        > Unless there's a non-FileOutputFormat use case [ ... ]

        I see Chris's point and agree. Unless there's a strong reason to put features in the kernel we should prefer to put them in library code, keeping the kernel minimal. Are there non-FileOutputFormats that need this feature?

        A wrapper implementation is a bit harder to use, since folks would need to both set the job's outputformat to the wrapper, and set the wrapper's parameter to the real output format: two changes instead of just setting a single parameter, although it is more generic. We could perhaps implement both: a flag for FileOutputFormat and a wrapper OutputFormat for folks who've not subclassed FileOutputFormat?

        Chris Douglas added a comment -

        > this feature is a sort of generic across the different output formats and having the framework support this would be useful.

        True. Still, while it is generic functionality, it's neither difficult nor inefficient in user-space. Absent either of the latter criteria, putting it into the framework seems premature, at least. If this should be abstracted, wouldn't it make sense as an OutputFormat in lib? That seems no less brittle than using a framework configuration variable and I've difficulty seeing this setting scoped to the whole cluster...

        > I am okay with the current patch in terms of the behavior and the framework support it adds for lazy creation of the destination output "something" (where "something" is easy to explain when interpreted as a file).

        Unless there's a non-FileOutputFormat use case that's also easy to explain, it remains unclear that this is the correct abstraction. Creating a setting on FileOutputFormat seems like a good idea. Whether that's implemented in FileOutputFormat only or via an OutputFormat wrapper class depends on how generally applicable the abstraction is. Since its motivation is expensive clutter in HDFS, it's not obvious to me that the latter is justified, let alone tight integration with the framework.

        Devaraj Das added a comment -

        I am okay with the current patch in terms of the behavior and the framework support it adds for lazy creation of the destination output "something" (where "something" is easy to explain when interpreted as a file).

        Jothi Padmanabhan added a comment -

        True, this feature can be achieved by modifying RecordWriter implementations. But that would mean people who have their own implementations would need to add it themselves; see Devaraj's comment. Also, this feature is generic across the different output formats, and having the framework support it would be useful. No?

        Chris Douglas added a comment -

        This feature doesn't require framework changes, does it? Configuring FileOutputFormat or writing an OutputFormat that lazily creates its files should be sufficient.

        Jothi Padmanabhan added a comment -

        Patch incorporating review comments

        Doug Cutting added a comment -

        A few nits:

        • we need a public setter method for lazy file creation, setLazyOutput(boolean). This should probably go on mapred.JobConf and mapreduce.Job.
        • this does not need to be in hadoop-default.xml. That file is for options that are configured site-wide in hadoop-site.xml, not per-job parameters.
        Jothi Padmanabhan added a comment -

        Doug, could you please review this patch. Thank you.

        Jothi Padmanabhan added a comment -

        The test failure, org.apache.hadoop.chukwa.datacollection.adaptor.filetailer.TestStartAtOffset.testStartAfterOffset, is not related to this patch

        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12398803/hadoop-4927-v1.patch
        against trunk revision 737944.

        +1 @author. The patch does not contain any @author tags.

        +1 tests included. The patch appears to include 3 new or modified tests.

        +1 javadoc. The javadoc tool did not generate any warning messages.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        +1 findbugs. The patch does not introduce any new Findbugs warnings.

        +1 Eclipse classpath. The patch retains Eclipse classpath integrity.

        +1 core tests. The patch passed core unit tests.

        -1 contrib tests. The patch failed contrib unit tests.

        Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3761/testReport/
        Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3761/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
        Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3761/artifact/trunk/build/test/checkstyle-errors.html
        Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3761/console

        This message is automatically generated.

        Jothi Padmanabhan added a comment -

        Patch incorporating review comments

        Jothi Padmanabhan added a comment -

        Canceling patch to incorporate review comments

        Doug Cutting added a comment -

        The class might better be called LazyRecordWriter; I'd prefer that we used no wrapper when lazy output creation is disabled; and we should add a static method to access the config property. Putting these together, we might have a method that looks like:

        static RecordWriter ReduceTask#createRecordWriter(JobConf job, ...) {
          if (ReduceTask.useLazyOutputCreation(job)) {
            return new LazyRecordWriter(...);
          } else {
            return ...;
          }
        }
        
        Jothi Padmanabhan added a comment -

        This patch implements lazy file creation by wrapping the RecordWriter functionality in a wrapper class and instantiating it only on a call to output.collect.

        Devaraj Das added a comment -

        Resetting the Fix Version to 0.21

        Tsz Wo Nicholas Sze added a comment -

        I was setting the "component/s" to mapred but did not intend to change the "issue type". Changing this back to "New Feature".

        Tsz Wo Nicholas Sze added a comment -

        > I believe some users did mention that the feature of having exactly N output files is useful.

        I also believe it is useful in some cases, especially when all output files are empty (ah, you may argue that the entire job is not useful in this case). However, it is costly to maintain empty files in HDFS, and I believe it is USELESS in many cases. Could we have an option for not creating them, or for cleaning them up?

        Doug Cutting added a comment -

        Changing this from a bug to a feature request. It seems reasonable for FileOutputFormat to support a mode where files are created lazily when the first record is written.

        > Out of 30 million files/dirs, 4.5 million part- files were empty. 40 users having more than 10,000 empty files.

        It sounds like there's also perhaps another problem here. Are these folks perhaps specifying way too many reduces? For jobs with lots of empty files, how many non-empty files are there, and how big are they?

        Koji Noguchi added a comment -

        On one of our clusters, I counted the number of empty "part-" files.

        Out of 30 million files/dirs, 4.5 million part- files were empty. 40 users having more than 10,000 empty files.

        > If you specify N output partitions then you should generate N output files,

        I believe some users did mention that the feature of having exactly N output files is useful.

        If we could somehow make the no-empty-part-files feature configurable, it would ease our support work a lot.
        (Instead of asking our users to implement a custom OutputFormat, I could just ask them to set a jobconf property.)
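        The jobconf switch asked for here might look something like the sketch below. Everything in it is hypothetical: the property name "mapreduce.output.lazy" is invented for illustration, and java.util.Properties stands in for Hadoop's JobConf; the real knob, if one is added, may differ.

```java
import java.util.Properties;

// Hypothetical per-job switch for lazy part-file creation.
class LazyOutputSwitch {
    static final String KEY = "mapreduce.output.lazy"; // invented name

    // Framework side: decide whether to defer part-file creation.
    // Defaults to false, i.e. the current eager behavior.
    static boolean isLazy(Properties jobConf) {
        return Boolean.parseBoolean(jobConf.getProperty(KEY, "false"));
    }

    // User side: opt in from the job configuration instead of
    // writing a custom OutputFormat.
    static void enable(Properties jobConf) {
        jobConf.setProperty(KEY, "true");
    }
}
```

        Defaulting the flag to off preserves the exactly-N-output-files behavior that some users rely on, while letting others opt out of the empty files.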

        Doug Cutting added a comment -

        I'm not convinced this is a bug. If you specify N output partitions then you should generate N output files, even if some of them are empty, no? One could write an OutputFormat that lazily creates its output files, but that's not the contract of FileOutputFormat.

        Devaraj Das added a comment -

        Okay, so I figured out that I was referring to the old MapReduce API.
        There seem to be two approaches anyway. For the old API:
        Today, the getRecordWriter calls relevant to the tasks are made in two places - in DirectMapOutputCollector (in the constructor) and in ReduceTask.java (just before starting to call the user's reduce method). We can probably move the calls into the respective OutputCollector.collect implementations:

        if (out == null) {
          out = job.getOutputFormat().getRecordWriter(fs, job, finalName, reporter);
        }
        

        For the new API, I am not yet sure what the good approach is. Maybe we could delay creating the recordwriter until TaskInputOutputContext.write is invoked.

        The other approach is to delay the creation of the files on the output filesystem, until it is necessary, in the respective RecordWriter implementations. But this requires users (who have implemented record writers or will implement them in the future) to be aware of such a change, and is thus vulnerable to problems.

        Thoughts?
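        The first (old-API) approach above can be fleshed out into a self-contained sketch. The types below are simplified stand-ins for DirectMapOutputCollector, RecordWriter, and OutputFormat.getRecordWriter, so the names are illustrative only:

```java
import java.io.IOException;

// Simplified stand-ins for the real Hadoop types (illustrative only).
interface PartFileWriter {
    void write(String key, String value) throws IOException;
    void close() throws IOException;
}
interface PartFileFactory {
    PartFileWriter getRecordWriter() throws IOException; // opens the part file
}

// Old-API approach: the collector holds no writer until the user's code
// actually emits a record, mirroring the "if (out == null)" snippet above.
class DirectCollectorSketch {
    private final PartFileFactory outputFormat;
    private PartFileWriter out; // stays null if collect() is never called

    DirectCollectorSketch(PartFileFactory outputFormat) {
        this.outputFormat = outputFormat;
    }

    void collect(String key, String value) throws IOException {
        if (out == null) {
            out = outputFormat.getRecordWriter(); // part file opened only here
        }
        out.write(key, value);
    }

    void close() throws IOException {
        if (out != null) {
            out.close(); // never opened => no empty part file to close
        }
    }
}
```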


          People

          • Assignee: Jothi Padmanabhan
          • Reporter: Devaraj Das
          • Votes: 0
          • Watchers: 5
