Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
Within a session, if the same set of HDFS blocks are accessed by different tasks - these should ideally be launched on the same node for better buffer cache, etc utilization.
This will likely end up being another level of requests higher up than NODE_LOCAL for the scheduler.