[SPARK-28939] SQL configuration are not always propagated - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.3.4, 2.4.4
Fix Version/s: 2.4.5, 3.0.0
Component/s: SQL
Labels:
None

Description

The SQL configurations are propagated to executors in order to be effective.
Unfortunately, in some cases, we are missing to propagate them, making them un-effective.

The problem happens every time rdd or queryExecution.toRdd are used. And this is pretty frequent in the codebase.

Please notice that there are 2 parts of this issue:

when a user directly uses those APIs
when Spark invokes them (eg. throughout the ML lib and other usages or the describe method on the Dataset class)