Details
Type: Improvement
Status: Closed
Priority: Major
Resolution: Done
1.16.0, 1.14.6, 1.15.3
Description
When running TPC-DS jobs in a session cluster, we observed that tasks are sometimes not evenly distributed across TMs. The root cause turned out to be that downstream tasks tend to be placed on the same TM or host as their input tasks. This helps reduce network shuffle.
However, in certain cases, such as the topology shown in the attached image, crowding downstream tasks onto the input task's TM and machine hurts performance. In such cases, respecting input location preferences causes more trouble than benefit.
So I'm wondering whether we should introduce a config option that lets users disable input location preferences.
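To make the skew concrete, here is a toy sketch of the placement behavior described above. It is not Flink's scheduler code: the function `place_consumers`, the TM names, the slot counts, and the "most free slots" fallback are all assumptions made up for illustration. It only shows how preferring the input task's TM can pile downstream tasks onto one TM, and how ignoring that preference spreads them out.

```python
def place_consumers(free_slots, input_tm_of, respect_input_locality):
    """Toy model (hypothetical, not Flink's actual algorithm).

    free_slots: dict mapping TM name -> number of free slots (not mutated)
    input_tm_of: list where entry i is the TM hosting consumer i's input task
    respect_input_locality: if True, prefer the input task's TM when it has a free slot
    Returns the list of TMs chosen for the consumers, in order.
    """
    free = dict(free_slots)  # work on a copy
    placement = []
    for preferred in input_tm_of:
        if respect_input_locality and free.get(preferred, 0) > 0:
            # Input location preference: co-locate with the producer.
            chosen = preferred
        else:
            # Assumed fallback: TM with the most free slots, ties broken by name.
            chosen = max(sorted(free), key=lambda tm: free[tm])
        if free[chosen] == 0:
            raise RuntimeError("no free slots left")
        free[chosen] -= 1
        placement.append(chosen)
    return placement


if __name__ == "__main__":
    slots = {"tm-0": 4, "tm-1": 4, "tm-2": 4}
    inputs = ["tm-0"] * 4  # four downstream tasks, all reading from a task on tm-0

    print(place_consumers(slots, inputs, True))   # ['tm-0', 'tm-0', 'tm-0', 'tm-0']
    print(place_consumers(slots, inputs, False))  # ['tm-0', 'tm-1', 'tm-2', 'tm-0']
```

With the preference respected, all four downstream tasks land on tm-0 while tm-1 and tm-2 stay idle. With it disabled, the same tasks are spread across the three TMs, at the cost of more network shuffle.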
Attachments