Hadoop Map/Reduce
MAPREDUCE-6923

Optimize MapReduce Shuffle I/O for small partitions

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.9.0, 3.0.0-beta1
    • Component/s: None
    • Labels:
      None
    • Environment:

      Observed in Hadoop 2.7.3 and above (judging from the source code of later versions), on Ubuntu 16.04.

      Description

      When a job configuration results in small partitions being read by each reducer from each mapper (e.g. about 65 kilobytes in my setup: a TeraSort of 256 gigabytes using 2048 mappers and 2048 reducers), and setting

      <property>
        <name>mapreduce.shuffle.transferTo.allowed</name>
        <value>false</value>
      </property>
      

      then the default setting of

      <property>
        <name>mapreduce.shuffle.transfer.buffer.size</name>
        <value>131072</value>
      </property>
      

      results in almost 100% overhead in reads during shuffle in YARN, because for each 65K needed, 128K are read.
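
      A quick back-of-the-envelope check of these numbers (this sketch only restates the arithmetic behind the figures quoted above; it does not model how the extra reads actually happen):

      public class ShuffleOverheadEstimate {
          public static void main(String[] args) {
              long totalMapOutput = 256L << 30;  // ~256 GiB of map output in this setup
              long mappers = 2048, reducers = 2048;
              long bufferSize = 131072;          // default mapreduce.shuffle.transfer.buffer.size

              // Average partition each reducer fetches from each mapper.
              long partitionBytes = totalMapOutput / (mappers * reducers);

              // If every ~65K partition causes a full 128K buffer to be read from disk,
              // roughly bufferSize bytes are read per partition instead of partitionBytes.
              double overhead =
                  (double) Math.max(partitionBytes, bufferSize) / partitionBytes - 1.0;

              System.out.printf("partition = %d bytes, read overhead = %.0f%%%n",
                  partitionBytes, overhead * 100);
              // prints: partition = 65536 bytes, read overhead = 100%
          }
      }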

      I propose a fix in FadvisedFileRegion.java as follows:

      ByteBuffer byteBuffer = ByteBuffer.allocate(Math.min(this.shuffleBufferSize,
          trans > Integer.MAX_VALUE ? Integer.MAX_VALUE : (int) trans));
      

      e.g. here. This sets the shuffle buffer size to the minimum of the shuffle buffer size specified in the configuration (128K by default) and the actual partition size (65K on average in my setup). In my benchmarks this reduced the read overhead in YARN from about 100% (255 additional gigabytes as described above) down to about 18% (an additional 45 gigabytes). The runtime of the job remained the same in my setup.
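
      For context, below is a simplified, self-contained sketch of the buffer-based copy loop that FadvisedFileRegion uses when transferTo is disallowed. It is paraphrased from memory rather than taken from the actual Hadoop source (names and structure are approximate); only the ByteBuffer.allocate(...) call reflects the change proposed above:

      import java.io.IOException;
      import java.nio.ByteBuffer;
      import java.nio.channels.FileChannel;
      import java.nio.channels.WritableByteChannel;

      class ShuffleCopySketch {
          static long copyPartition(FileChannel fileChannel, WritableByteChannel target,
                                    long offset, long count, int shuffleBufferSize)
                  throws IOException {
              long trans = count;      // bytes of this partition still to transfer
              long position = offset;  // current read position in the map output file
              // Proposed change: never allocate more than the partition actually needs.
              ByteBuffer byteBuffer = ByteBuffer.allocate(Math.min(shuffleBufferSize,
                  trans > Integer.MAX_VALUE ? Integer.MAX_VALUE : (int) trans));

              int readSize;
              while (trans > 0 && (readSize = fileChannel.read(byteBuffer, position)) > 0) {
                  if (readSize < trans) {
                      // Partial read: account for it and keep looping.
                      trans -= readSize;
                      position += readSize;
                      byteBuffer.flip();
                  } else {
                      // Everything still needed has been read (the buffer may hold a few
                      // extra bytes when it is larger than trans): cap at trans and stop.
                      byteBuffer.limit((int) trans);
                      byteBuffer.position(0);
                      position += trans;
                      trans = 0;
                  }
                  while (byteBuffer.hasRemaining()) {
                      target.write(byteBuffer);
                  }
                  byteBuffer.clear();
              }
              return count - trans;    // bytes actually transferred
          }
      }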

      1. MAPREDUCE-6923.00.patch
        2 kB
        Robert Schmidtke
      2. MAPREDUCE-6923.01.patch
        3 kB
        Robert Schmidtke

        Issue Links

          Activity

          raviprak Ravi Prakash added a comment -

          I'd say that for readSize == trans, we're in the else block,

          Thanks for pointing that out Robert! Yupp. I agree....

          I'll be linking to the results once they're properly published.

          Looking forward to it

          rosch Robert Schmidtke added a comment -

          Thanks for coming back to my comments.

           When I said Yarn I indeed meant the NodeManager, sorry for the confusion. You're right about the shuffle service; however, it was something that I only discovered recently, having built my configuration a long time ago, not exactly knowing what I was doing. I set these keys as you described.
          I'm seeing jar files being loaded in the MapTask and ReduceTask JVMs alright, but there does not seem to be disk I/O overhead.

          In any case, I greatly appreciate all of your effort, and now that things are working as expected for me, I can focus on analyzing the numbers and making some sense of them. I'll be linking to the results once they're properly published.

          Cheers
          Robert

          rosch Robert Schmidtke added a comment - edited

          Hi Ravi,

          When shuffleBufferSize <= trans, then behavior is exactly the same as old code.

          Yes.

           if readSize == trans (i.e. the fileChannel.read() returned as many bytes as I wanted to transfer), trans is decremented correctly, position is increased correctly and the byteBuffer is flipped as usual. byteBuffer's contents are written to target as usual, byteBuffer is cleared and then hopefully GCed, never to be seen again.

           I'd say that for readSize == trans, we're in the else block, and thus byteBuffer's limit() is set to trans (which is the size it already has, because we're in the case where trans < shuffleBufferSize). It's correctly positioned to 0 as we're done reading, and trans is correctly set to 0. Afterwards, the loop breaks (it can only be one iteration here because otherwise trans would have been larger than shuffleBufferSize), byteBuffer is written to target and then cleared.

           if readSize < trans (almost the same thing as above happens, but in a while loop). The only change this patch makes is that the byteBuffer may be smaller than before this patch, but it doesn't matter because it's big enough for the number of bytes we need to transfer.

          Now we have the situation you described for the previous case, and I agree with your reasoning here.

          raviprak Ravi Prakash added a comment -

          Oh and sorry about neglecting your questions earlier. Apologies also if this is too deep in the details. Maybe a better understanding could help.

           The Hadoop project has tried to make clear distinctions between YARN (the resource-management layer) and frameworks that can run on top of YARN (e.g. MapReduce, Tez, Slider etc.). Even so, some dependencies have stuck around.

          I see that some 1.5 GiB is spent on reading the mapreduce jar files (in Yarn), and another 1.2 GiB is spent reading jar files in /usr/lib/jvm.

           I'm not entirely sure what you mean when you say Yarn here. I'm guessing you mean the NodeManager. Technically the NodeManager shouldn't really even be loading the MapReduce jars (because separate projects blah blah). However, there's a MapReduce Auxiliary Shuffle Service (if you look at your yarn-site.xml, yarn.nodemanager.aux-services probably has org.apache.hadoop.mapred.ShuffleHandler), which I'm sure pulls in all sorts of MapReduce code into the NodeManager JVM. This happens only when you start the cluster (the auxiliary ShuffleService is a long-running service in the NodeManager).
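
           For reference, the auxiliary shuffle service mentioned above is wired up in yarn-site.xml with the standard configuration keys, roughly as follows (typical values; adapt to your cluster):

           <property>
             <name>yarn.nodemanager.aux-services</name>
             <value>mapreduce_shuffle</value>
           </property>
           <property>
             <name>yarn.nodemanager.aux-services.mapreduce_shuffle.class</name>
             <value>org.apache.hadoop.mapred.ShuffleHandler</value>
           </property>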

          I have instrumented reading zip and jar files separately, and over the course of all map tasks (TeraGen + TeraSort), my instrumentation gives a total of 638 GiB / (2048 + 2048) = 159.5 MiB per mapper, and 337 GiB / 2048 = 168.5 MiB per reducer. However I wouldn't rely too much on these numbers, because if I added them to the regular I/O induced by reading/writing the input/output, shuffle and spill, then my numbers wouldn't agree any longer with the XFS counters.

          Hmm.. without knowing exactly what your instrumentation does, I will choose to share your skepticism of these numbers

          Do you mean that Yarn should exhibit this I/O, or would I see this in the map and reduce JVMs (as explained above)?

           Again, I'm guessing by "Yarn" over here you mean the NodeManager. To launch any YARN container (MapTask or ReduceTask or TezChild etc.) the NodeManager does a lot of things. One of those things is to localize the resources. For this, usually a separate process called a Localizer is run. This process may download things from HDFS to the local machine under certain circumstances (usually, though, if the job jars are already in the DistributedCache, it may be skipped).
          However I was referring to the MapTask and ReduceTask JVMs loading the jar files.

          hudson Hudson added a comment -

          SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #12157 (See https://builds.apache.org/job/Hadoop-trunk-Commit/12157/)
          MAPREDUCE-6923. Optimize MapReduce Shuffle I/O for small partitions. (raviprak: rev ac7d0604bc73c0925eff240ad9837e14719d57b7)

          • (edit) hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle/src/main/java/org/apache/hadoop/mapred/FadvisedFileRegion.java
          raviprak Ravi Prakash added a comment -

          Committed to branch-2 and trunk. Thanks a lot for your contribution Robert!

          Good luck with your research. I hope to hear back from you when you publish. And look forward to more valuable contributions from you.

          raviprak Ravi Prakash added a comment -

          Hi Robert!

           Here's my reasoning about this patch. Sorry about being this verbose. I just have umm.... let's say history with the shuffle code:

           1. When shuffleBufferSize <= trans, then behavior is exactly the same as old code.
           2. When trans < shuffleBufferSize then
             • if readSize == trans (i.e. the fileChannel.read() returned as many bytes as I wanted to transfer), trans is decremented correctly, position is increased correctly and the byteBuffer is flipped as usual. byteBuffer's contents are written to target as usual, byteBuffer is cleared and then hopefully GCed, never to be seen again.
             • if readSize < trans (almost the same thing as above happens, but in a while loop). The only change this patch makes is that the byteBuffer may be smaller than before this patch, but it doesn't matter because it's big enough for the number of bytes we need to transfer.
             • if readSize > trans: this shouldn't happen any more since byteBuffer's size is trans. However this is still not dead code, because we need it for the first case (when shuffleBufferSize <= trans).

          As much as I would have liked another review to calm myself, I am fairly confident this is fine. Please let me know if the reasoning above is incorrect in any manner.

          Committing shortly

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 18s Docker mode activated.
                Prechecks
          +1 @author 0m 0s The patch does not contain any @author tags.
          -1 test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
                trunk Compile Tests
          +1 mvninstall 15m 33s trunk passed
          +1 compile 0m 21s trunk passed
          +1 checkstyle 0m 12s trunk passed
          +1 mvnsite 0m 20s trunk passed
          +1 findbugs 0m 28s trunk passed
          +1 javadoc 0m 15s trunk passed
                Patch Compile Tests
          +1 mvninstall 0m 17s the patch passed
          +1 compile 0m 18s the patch passed
          +1 javac 0m 18s the patch passed
          +1 checkstyle 0m 11s the patch passed
          +1 mvnsite 0m 18s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 0m 31s the patch passed
          +1 javadoc 0m 12s the patch passed
                Other Tests
          +1 unit 0m 20s hadoop-mapreduce-client-shuffle in the patch passed.
          +1 asflicense 0m 14s The patch does not generate ASF License warnings.
          20m 33s



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:14b5c93
          JIRA Issue MAPREDUCE-6923
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12880367/MAPREDUCE-6923.01.patch
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux de665417d286 3.13.0-117-generic #164-Ubuntu SMP Fri Apr 7 11:05:26 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 7fc324a
          Default Java 1.8.0_131
          findbugs v3.1.0-RC1
          Test Results https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/7043/testReport/
          modules C: hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle U: hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle
          Console output https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/7043/console
          Powered by Apache Yetus 0.5.0 http://yetus.apache.org

          This message was automatically generated.

          rosch Robert Schmidtke added a comment - edited

          Fixed indentation for checkstyle in new patch MAPREDUCE-6923.01.patch, which supersedes the old one MAPREDUCE-6923.00.patch.

          rosch Robert Schmidtke added a comment -

          I am using bytecode instrumentation to log every read and write request going through the core Java I/O classes. I do this for every JVM started (Yarn, Map, Reduce, Hdfs, ...), and log statistics over the entire TeraSort run. The aggregated statistics from there agree to 97-99% (for reads and writes, respectively) with what the underlying XFS file system counters report. Hence I assume that my instrumentation is pretty accurate, giving 1169 GiB for all Yarn I/O. I see that some 1.5 GiB is spent on reading the mapreduce jar files (in Yarn), and another 1.2 GiB is spent reading jar files in /usr/lib/jvm. However, there most likely is caching involved, and I wouldn't be sure about how much I/O actually happened at this level.

          I have instrumented reading zip and jar files separately, and over the course of all map tasks (TeraGen + TeraSort), my instrumentation gives a total of 638 GiB / (2048 + 2048) = 159.5 MiB per mapper, and 337 GiB / 2048 = 168.5 MiB per reducer. However I wouldn't rely too much on these numbers, because if I added them to the regular I/O induced by reading/writing the input/output, shuffle and spill, then my numbers wouldn't agree any longer with the XFS counters.

           On some installations I've seen the JVM load close to 400Mb of jar files for Hadoop and its dependencies. Even on trunk my MapTask reads about 180Mb of jars at least.

          Do you mean that Yarn should exhibit this I/O, or would I see this in the map and reduce JVMs (as explained above)?

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 18s Docker mode activated.
                Prechecks
          +1 @author 0m 0s The patch does not contain any @author tags.
          -1 test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
                trunk Compile Tests
          +1 mvninstall 14m 32s trunk passed
          +1 compile 0m 18s trunk passed
          +1 checkstyle 0m 13s trunk passed
          +1 mvnsite 0m 19s trunk passed
          +1 findbugs 0m 24s trunk passed
          +1 javadoc 0m 14s trunk passed
                Patch Compile Tests
          +1 mvninstall 0m 17s the patch passed
          +1 compile 0m 17s the patch passed
          +1 javac 0m 17s the patch passed
          -1 checkstyle 0m 9s hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle: The patch generated 1 new + 5 unchanged - 0 fixed = 6 total (was 5)
          +1 mvnsite 0m 16s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 0m 30s the patch passed
          +1 javadoc 0m 11s the patch passed
                Other Tests
          +1 unit 0m 21s hadoop-mapreduce-client-shuffle in the patch passed.
          +1 asflicense 0m 13s The patch does not generate ASF License warnings.
          19m 13s



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:14b5c93
          JIRA Issue MAPREDUCE-6923
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12880185/MAPREDUCE-6923.00.patch
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux a8be7b9cefca 3.13.0-117-generic #164-Ubuntu SMP Fri Apr 7 11:05:26 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / f4c6b00
          Default Java 1.8.0_131
          findbugs v3.1.0-RC1
          checkstyle https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/7042/artifact/patchprocess/diff-checkstyle-hadoop-mapreduce-project_hadoop-mapreduce-client_hadoop-mapreduce-client-shuffle.txt
          Test Results https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/7042/testReport/
          modules C: hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle U: hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle
          Console output https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/7042/console
          Powered by Apache Yetus 0.5.0 http://yetus.apache.org

          This message was automatically generated.

          raviprak Ravi Prakash added a comment -

          The patch looks good to me. Barring objections, I'll commit it on Monday.

          raviprak Ravi Prakash added a comment -

           Thank you for the contribution Robert. You are right that the typecast inside Math.min() would be wrong. I was thinking outside the Math.min() would be fine, like in your testCast. In either case, I agree it's alright to keep the ternary operator there for clarity, and in case shuffleBufferSize is ever incorrectly configured to be negative.

          Thank you for that wonderful analysis. It is very valuable.

          YARN reads a total of 1169 GiB in my setup

           How are you measuring that? Does this take into account the jar files that the MapTask and ReduceTask must read to start the JVM? On some installations I've seen the JVM load close to 400Mb of jar files for Hadoop and its dependencies. Even on trunk my MapTask reads about 180Mb of jars at least.
           I'm sure "Failed Shuffles" would be executed again. I don't know where the failure would be picked up from, but it's probably from the beginning. This is possibly due to your network.

          Thanks for the extensive research.

          rosch Robert Schmidtke added a comment - edited

           FYI, I have benchmarked another version, which uses casts instead of the ternary operator, using JMH on my Mac:

          package de.schmidtke.java.benchmark;
          
          import java.util.Random;
          
          import org.openjdk.jmh.annotations.Benchmark;
          import org.openjdk.jmh.annotations.Level;
          import org.openjdk.jmh.annotations.Scope;
          import org.openjdk.jmh.annotations.Setup;
          import org.openjdk.jmh.annotations.State;
          
          public class TernaryBenchmark {
          
              @State(Scope.Thread)
              public static class TBState {
                  private final Random random = new Random(0);
                  public long trans;
          
                  @Setup(Level.Invocation)
                  public void setup() {
                      trans = random.nextLong();
                  }
              }
          
              @Benchmark
              public int testTernary(TBState tbState) {
                  long trans = tbState.trans;
                  return Math.min(131072,
                          trans > Integer.MAX_VALUE ? Integer.MAX_VALUE : (int) trans);
              }
          
              @Benchmark
              public int testCast(TBState tbState) {
                  long trans = tbState.trans;
                  return (int) Math.min((long) 131072, trans);
              }
          
          }
          

           The results show roughly 1% higher throughput for the cast version; the rest seems about the same. I'd go with the ternary operator version for better clarity:

          Benchmark                                           Mode      Cnt         Score        Error  Units
          TernaryBenchmark.testCast                          thrpt      200  25142779.388 ± 114863.918  ops/s
          TernaryBenchmark.testTernary                       thrpt      200  24829083.072 ±  64009.480  ops/s
          TernaryBenchmark.testCast                           avgt      200        ≈ 10⁻⁷                s/op
          TernaryBenchmark.testTernary                        avgt      200        ≈ 10⁻⁷                s/op
          TernaryBenchmark.testCast                         sample  7596374        ≈ 10⁻⁷                s/op
          TernaryBenchmark.testCast:testCast·p0.00          sample                 ≈ 10⁻⁹                s/op
          TernaryBenchmark.testCast:testCast·p0.50          sample                 ≈ 10⁻⁷                s/op
          TernaryBenchmark.testCast:testCast·p0.90          sample                 ≈ 10⁻⁷                s/op
          TernaryBenchmark.testCast:testCast·p0.95          sample                 ≈ 10⁻⁷                s/op
          TernaryBenchmark.testCast:testCast·p0.99          sample                 ≈ 10⁻⁷                s/op
          TernaryBenchmark.testCast:testCast·p0.999         sample                 ≈ 10⁻⁷                s/op
          TernaryBenchmark.testCast:testCast·p0.9999        sample                 ≈ 10⁻⁵                s/op
          TernaryBenchmark.testCast:testCast·p1.00          sample                  0.002                s/op
          TernaryBenchmark.testTernary                      sample  7469568        ≈ 10⁻⁷                s/op
          TernaryBenchmark.testTernary:testTernary·p0.00    sample                 ≈ 10⁻⁹                s/op
          TernaryBenchmark.testTernary:testTernary·p0.50    sample                 ≈ 10⁻⁷                s/op
          TernaryBenchmark.testTernary:testTernary·p0.90    sample                 ≈ 10⁻⁷                s/op
          TernaryBenchmark.testTernary:testTernary·p0.95    sample                 ≈ 10⁻⁷                s/op
          TernaryBenchmark.testTernary:testTernary·p0.99    sample                 ≈ 10⁻⁷                s/op
          TernaryBenchmark.testTernary:testTernary·p0.999   sample                 ≈ 10⁻⁷                s/op
          TernaryBenchmark.testTernary:testTernary·p0.9999  sample                 ≈ 10⁻⁵                s/op
          TernaryBenchmark.testTernary:testTernary·p1.00    sample                  0.002                s/op
          TernaryBenchmark.testCast                             ss       10        ≈ 10⁻⁵                s/op
          TernaryBenchmark.testTernary                          ss       10        ≈ 10⁻⁵                s/op
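
           For completeness, one common way to run such a JMH benchmark programmatically is sketched below, assuming jmh-core and its annotation processor are on the classpath; TernaryBenchmarkRunner is a hypothetical helper class, and TernaryBenchmark is the class above:

           import org.openjdk.jmh.runner.Runner;
           import org.openjdk.jmh.runner.RunnerException;
           import org.openjdk.jmh.runner.options.Options;
           import org.openjdk.jmh.runner.options.OptionsBuilder;

           public class TernaryBenchmarkRunner {
               public static void main(String[] args) throws RunnerException {
                   // Run only the TernaryBenchmark methods, with a single fork to keep it quick.
                   Options opts = new OptionsBuilder()
                           .include(TernaryBenchmark.class.getSimpleName())
                           .forks(1)
                           .build();
                   new Runner(opts).run();
               }
           }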
          
          rosch Robert Schmidtke added a comment -

          Initial patch from trunk that uses the minimum of shuffleBufferSize and trans in FadvisedFileRegion.

          rosch Robert Schmidtke added a comment - edited

          Hi Ravi,

           my guess is that since trans is a long, and ByteBuffer.allocate(...) only takes an int, a "blind" cast in the Math.min(...) operation might yield a negative value for trans > Integer.MAX_VALUE:

          package test;
          import java.io.IOException;
          import java.nio.ByteBuffer;
          public class Test {
              public static void main(String[] args) throws IOException {
                  long trans = Integer.MAX_VALUE + 1L;
                  int shuffleBufferSize = 131072;
                  ByteBuffer byteBuffer = ByteBuffer
                          .allocate(Math.min(shuffleBufferSize, (int) trans));
                  System.out.println(byteBuffer.capacity());
              }
          }
          

          gives

          Exception in thread "main" java.lang.IllegalArgumentException
          	at java.nio.ByteBuffer.allocate(ByteBuffer.java:334)
          	at test.Test.main(Test.java:9)
          

          whereas

          package test;
          import java.io.IOException;
          import java.nio.ByteBuffer;
          public class Test {
              public static void main(String[] args) throws IOException {
                  long trans = Integer.MAX_VALUE + 1L;
                  int shuffleBufferSize = 131072;
                  ByteBuffer byteBuffer = ByteBuffer.allocate(Math.min(shuffleBufferSize,
                          trans > Integer.MAX_VALUE ? Integer.MAX_VALUE : (int) trans));
                  System.out.println(byteBuffer.capacity());
              }
          }
          

          correctly outputs 131072.

           As for the other 18% issue, I am not yet quite sure. I'm currently investigating the I/O of each of Hadoop's components, using TeraSort as my workhorse. For a TeraSort of 1024 GiB, YARN reads a total of 1169 GiB in my setup, with transferTo.allowed=true. Taking into account that the MapReduce framework counters report 1065 GiB of serialized map output (and thus, 1065 GiB of shuffled bytes), the overhead is "only" 104 GiB for 1024 GiB input, or roughly 10%. So there are additional reads, even when using transferTo. Maybe it has something to do with resource distribution? Note that I have disabled speculative execution, so there are no extra executions of additional reducers, which might read the same map output multiple times. However, there are 140 "Failed Shuffles" – does that mean that they have been executed again? If so, and assuming that for 1024 GiB of input, each reducer needs to fetch 1065 / 2048 = 0.52 GiB, there is an additional overhead of 140 * 0.52 ≈ 73 GiB. What remains is an unexplained 31 GiB.

          When running TeraSort with transferTo.allowed=false and my patch as described above, sorting 256 GiB, the MapReduce framework counters report 266 GiB of serialized map output (and thus, 266 GiB of shuffled bytes). In this run, there were no "Failed Shuffles". Since my analysis reports that YARN reads 300 GiB, the overhead is actually probably more correctly measured as 34 GiB (= 13% of 256 GiB) instead of 45 GiB (= 18% of 256 GiB). These 34 GiB are close enough to the 31 GiB for 1024 GiB input (see above), so maybe this is constant overhead for 2048 mappers and 2048 reducers?

          Anyway, since I'll be investigating this behavior in the future, digging into per-file statistics, I'll be able to report exactly which file is read how often / how much of it is read. I can then tell exactly what is happening on disk. Since this is part of unpublished research, however, I'm afraid I can only report the results later.

          raviprak Ravi Prakash added a comment -

           Hi Robert! Thanks for filing the JIRA and your contribution! I'm adding you as a contributor and assigning the JIRA to you. Could you please post a patch file to this JIRA? You can name the patch file MAPREDUCE-6923.00.patch. One minor nit is that we limit lines to 80 characters. Could you please fix that in the patch file?
          Also since trans is guaranteed to be positive and shuffleBufferSize is an integer, maybe we don't really need the ternary operator condition? Up to you to keep it though.

          I'm not surprised that there isn't an improvement in job performance but the read overhead improvement is great. Do you know where the 18% overhead is going?

           The patch sounds reasonable to me. Jason Lowe, Nikola Vujic, Chris Nauroth, do you have any comments? The diff he's proposing is in the link to the word "e.g. here".


            People

            • Assignee:
              rosch Robert Schmidtke
              Reporter:
              rosch Robert Schmidtke
            • Votes:
              0
            • Watchers:
              7

              Dates

              • Created:
                Updated:
                Resolved:
