Spark / SPARK-30732

BroadcastExchangeExec does not fully honor "spark.broadcast.compress"


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: 2.4.0
    • Fix Version/s: None
    • Component/s: Spark Core
    • Labels: None

    Description

      Setting spark.broadcast.compress to false disables compression when broadcast variables are sent to executors (see https://spark.apache.org/docs/latest/configuration.html#compression-and-serialization).
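For reference, a minimal sketch of how the setting is typically applied. The application name and the local master are placeholders chosen for illustration; only the `spark.broadcast.compress` key comes from this report.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.sql.SparkSession

object BroadcastCompressOff {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("broadcast-compress-off")     // placeholder name
      .setMaster("local[*]")                    // placeholder master for a local run
      .set("spark.broadcast.compress", "false") // the setting discussed in this issue

    val spark = SparkSession.builder().config(conf).getOrCreate()

    // Confirm the value the session actually picked up.
    println(spark.sparkContext.getConf.get("spark.broadcast.compress"))

    spark.stop()
  }
}
```

The same key can also be passed on the command line with `--conf spark.broadcast.compress=false`.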

      However, this does not disable compression for the child relations that executors send back to the driver.

      Setting spark.broadcast.compress to false should disable compression in both directions, so that users can, for example, turn off compression for an entire broadcast join.

      https://github.com/puneetguptanitj/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/BroadcastExchangeExec.scala#L89

      At the line linked above, `executeCollectIterator` calls `getByteArrayRdd`, which by default always writes a compressed stream.
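The requested behavior can be illustrated with a small, Spark-independent sketch. Everything in it is an assumption made for illustration: `encodeRows` is a hypothetical helper, GZIP stands in for whatever codec Spark is configured with, and the length-prefixed framing is only a plausible layout, not a copy of Spark's code. It shows the collected rows being wrapped in a compressing stream only when a flag asks for it.

```scala
import java.io.{ByteArrayOutputStream, DataOutputStream, OutputStream}
import java.util.zip.GZIPOutputStream

object BroadcastCollectSketch {
  // Hypothetical helper, not part of Spark: serializes rows into one byte
  // array, compressing only when `compress` is true. In the scenario from
  // this issue, `compress` would be driven by spark.broadcast.compress.
  def encodeRows(rows: Seq[Array[Byte]], compress: Boolean): Array[Byte] = {
    val bos = new ByteArrayOutputStream()
    // GZIP is a stand-in codec chosen so the sketch needs only the JDK.
    val sink: OutputStream = if (compress) new GZIPOutputStream(bos) else bos
    val out = new DataOutputStream(sink)
    rows.foreach { row =>
      out.writeInt(row.length) // length prefix for each row
      out.write(row)
    }
    out.writeInt(-1) // end-of-stream marker (assumed framing)
    out.close()      // also finishes the GZIP stream when one is in use
    bos.toByteArray
  }
}
```

With `compress = false` the output is just the framed rows, so its size is the sum of the row lengths plus four bytes per length prefix and four bytes for the end marker.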

       

          People

            Assignee: Unassigned
            Reporter: Puneet (puneetguptanitj)