Details
Type: Bug
Status: Resolved
Priority: Major
Resolution: Won't Fix
Affects Version/s: 2.4.0
None
None
Description
Setting spark.broadcast.compress to false disables compression when sending broadcast variables to executors (https://spark.apache.org/docs/latest/configuration.html#compression-and-serialization).
However, this does not disable compression for child relations sent by the executors to the driver.
Setting spark.broadcast.compress to false should disable compression in both directions, so that users can turn it off for an entire broadcast join, for example.
Here, `executeCollectIterator` calls `getByteArrayRdd`, which by default always writes a compressed stream.
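To make the reported asymmetry concrete, here is a toy model in plain Python. It is not Spark code: `send_broadcast` and `collect_child_relation` are hypothetical names, and `zlib` stands in for whichever codec Spark is configured to use. It only illustrates that one direction honours the flag while the other ignores it.

```python
import zlib


def send_broadcast(payload: bytes, broadcast_compress: bool) -> bytes:
    """Driver -> executors: honours the flag (models spark.broadcast.compress)."""
    return zlib.compress(payload) if broadcast_compress else payload


def collect_child_relation(rows: bytes, broadcast_compress: bool) -> bytes:
    """Executors -> driver: the flag is ignored and the stream is always
    compressed, modelling the behaviour reported for getByteArrayRdd."""
    return zlib.compress(rows)


payload = b"row" * 1000  # hypothetical serialized rows

sent = send_broadcast(payload, broadcast_compress=False)
collected = collect_child_relation(payload, broadcast_compress=False)

print(sent == payload)       # True: the broadcast side skipped compression
print(collected == payload)  # False: the collect side compressed anyway
```

Under the behaviour requested in this issue, `collect_child_relation` would also return the rows unchanged when the flag is false.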