Description
Currently, it will fork an Python worker for each task, it will better if we can reuse the worker for later tasks.
This will be very useful for large dataset with big broadcast, so it does not need to sending broadcast to worker again and again. Also it can reduce the overhead of launch a task.
Attachments
Issue Links
- is related to
-
SPARK-5363 Spark 1.2 freeze without error notification
- Resolved
- links to