Hadoop Map/Reduce › MAPREDUCE-5785

Derive heap size or mapreduce.*.memory.mb automatically

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.0.0-alpha1
    • Component/s: mr-am, task
    • Labels:
      None
    • Target Version/s:
    • Hadoop Flags:
      Incompatible change
    • Release Note:
      The memory values for the mapreduce.map/reduce.memory.mb keys, if left at their default value of -1, will now be automatically inferred from the heap size (-Xmx) specified in the corresponding mapreduce.map/reduce.java.opts keys.

      The converse is also done: if mapreduce.map/reduce.memory.mb values are specified but no -Xmx is supplied in the mapreduce.map/reduce.java.opts keys, then the -Xmx value is derived from the memory.mb value.

      If neither is specified, a default value of 1024 MB is used.

      For both conversions, a scaling factor given by the property mapreduce.job.heap.memory-mb.ratio is applied, to account for the overhead between heap usage and actual physical memory usage.

      Existing configs or job code that already specify both properties explicitly are not affected by this inference.
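The two-way inference described in the release note can be sketched roughly as follows. This is a hypothetical illustration, not the actual MapReduce code: the class and method names are invented, and the exact rounding in the real patch may differ.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch of the memory.mb <-> -Xmx inference; names are invented.
public class HeapSizing {
    static final double DEFAULT_HEAP_RATIO = 0.8; // mapreduce.job.heap.memory-mb.ratio
    static final long DEFAULT_MEMORY_MB = 1024;   // used when neither value is set

    /** Extract the -Xmx value in MB from a java.opts string, or -1 if absent. */
    static long parseXmxMb(String javaOpts) {
        if (javaOpts == null) return -1;
        Matcher m = Pattern.compile("-Xmx(\\d+)([gGmMkK]?)").matcher(javaOpts);
        long xmx = -1;
        while (m.find()) { // last -Xmx wins, as the JVM itself behaves
            long v = Long.parseLong(m.group(1));
            char unit = m.group(2).isEmpty() ? 'b'
                : Character.toLowerCase(m.group(2).charAt(0));
            switch (unit) {
                case 'g': xmx = v * 1024; break;
                case 'm': xmx = v; break;
                case 'k': xmx = v / 1024; break;
                default:  xmx = v / (1024 * 1024); break; // bare bytes
            }
        }
        return xmx;
    }

    /** Container size: taken verbatim if set, else scaled up from -Xmx. */
    static long containerMb(long memoryMb, String javaOpts, double ratio) {
        if (memoryMb != -1) return memoryMb;       // explicit user setting wins
        long xmx = parseXmxMb(javaOpts);
        if (xmx == -1) return DEFAULT_MEMORY_MB;   // neither value specified
        return (long) Math.ceil(xmx / ratio);      // heap -> physical memory
    }

    /** Heap size: taken from -Xmx if present, else scaled down from memory.mb. */
    static long heapMb(long memoryMb, String javaOpts, double ratio) {
        long xmx = parseXmxMb(javaOpts);
        if (xmx != -1) return xmx;                 // explicit user setting wins
        long mb = (memoryMb != -1) ? memoryMb : DEFAULT_MEMORY_MB;
        return (long) (mb * ratio);                // physical memory -> heap
    }
}
```

With a ratio of 0.8, for example, a job that only sets `-Xmx800m` would be placed in a 1000 MB container, and a job that only sets `memory.mb=2048` would get a 1638 MB heap.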

      Description

      Currently users have to set two memory-related configs per job, per task type. One first chooses a container size mapreduce.*.memory.mb and then a corresponding maximum Java heap size -Xmx smaller than mapreduce.*.memory.mb. This ensures that the JVM's total memory (native memory + Java heap) does not exceed mapreduce.*.memory.mb. If one forgets to tune -Xmx, the MR AM might be

      • allocating big containers while the JVM only uses the default -Xmx200m, wasting memory, or
      • allocating small containers that OOM because -Xmx is too high.

      With this JIRA, we propose to set -Xmx automatically based on an empirical ratio that can be adjusted. -Xmx is not changed automatically if the user provides it.
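As an illustration of the proposal, a job config that sets only the container size would have its task heap derived via the ratio. The values below are hypothetical, and the derived -Xmx shown in the comment assumes straightforward multiplication by the ratio:

```xml
<!-- Hypothetical job config: only the container size is set explicitly. -->
<property>
  <name>mapreduce.map.memory.mb</name>
  <value>2048</value>
</property>
<property>
  <name>mapreduce.job.heap.memory-mb.ratio</name>
  <value>0.8</value> <!-- scaling factor between heap and container size -->
</property>
<!-- No -Xmx in mapreduce.map.java.opts, so the map task JVM would get
     roughly -Xmx1638m (2048 MB * 0.8). -->
```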

        Attachments

        1. MAPREDUCE-5785.v01.patch
          10 kB
          Gera Shegalov
        2. MAPREDUCE-5785.v02.patch
          26 kB
          Gera Shegalov
        3. MAPREDUCE-5785.v03.patch
          25 kB
          Gera Shegalov
        4. mr-5785-4.patch
          25 kB
          Karthik Kambatla
        5. mr-5785-5.patch
          24 kB
          Karthik Kambatla
        6. mr-5785-6.patch
          24 kB
          Karthik Kambatla
        7. mr-5785-7.patch
          23 kB
          Karthik Kambatla
        8. mr-5785-8.patch
          23 kB
          Karthik Kambatla
        9. mr-5785-9.patch
          23 kB
          Karthik Kambatla

          Issue Links

            Activity

              People

              • Assignee:
                Gera Shegalov
              • Reporter:
                Gera Shegalov
              • Votes:
                0
              • Watchers:
                24

                Dates

                • Created:
                  Updated:
                  Resolved: