[MAPREDUCE-7158] Inefficient Flush Logic in JobHistory EventWriter - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 3.2.0
Fix Version/s: 3.1.2, 3.3.0, 3.2.1
Component/s: None
Labels:
None

Hadoop Flags:

Reviewed

Description

In HDFS, if the flush is implemented to send server request to actually commit the pending writes on the storage service side, we could observe in the benchmark runs that the MR jobs are taking much longer. From investigation we see the current implementation for writing events doesn't look right:
EventWriter# write()
This flush is redundant and this statement should be removed. It defeats the purpose of having a separate flush function itself.
Encoder.flush calls flush of the underlying output stream
After patching with the fix the MR jobs could complete normally, please kindly find the patch in attached.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

MAPREDUCE-7158-001.patch
12/Nov/18 19:30
1 kB
Zichen Sun

Issue Links

breaks

MAPREDUCE-7162 TestEvents#testEvents fails

Resolved

Activity

People

Assignee:: Zichen Sun

Reporter:: Zichen Sun

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 12/Nov/18 19:25

Updated:: 25/Nov/18 11:04

Resolved:: 13/Nov/18 19:37