[SPARK-20994] Alleviate memory pressure in StreamManager - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.1.1
Fix Version/s: 2.3.0
Component/s: Spark Core
Labels:
None

Description

In my cluster, we are suffering from OOM of shuffle-service.
We found that a lot of executors are fetching blocks from a single shuffle-service. Analyzing the memory, we found that the blockIds(shuffle_shuffleId_mapId_reduceId) takes about 1.5GBytes.

In current code, chunks are fetched from shuffle service in two steps:
Step-1. Send OpenBlocks, which contains the blocks list to to fetch;
Step-2. Fetch the consecutive chunks from shuffle-service by streamId and chunkIndex

Thus memory cost can be improved for step-1.

Attachments

Issue Links

links to

[Github] Pull Request #18211 (jinxing64)

[Github] Pull Request #18231 (jinxing64)

Activity

People

Assignee:: Jin Xing

Reporter:: Jin Xing

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 06/Jun/17 06:46

Updated:: 16/Jun/17 12:10

Resolved:: 16/Jun/17 12:10