Description
File "/.../spark/python/pyspark/sql/connect/column.py", line 106, in pyspark.sql.connect.column.Column.eqNullSafe Failed example: df1.join(df2, df1["value"] == df2["value"]).count() Exception raised: Traceback (most recent call last): File "/.../miniconda3/envs/python3.9/lib/python3.9/doctest.py", line 1336, in __run exec(compile(example.source, filename, "single", File "<doctest pyspark.sql.connect.column.Column.eqNullSafe[4]>", line 1, in <module> df1.join(df2, df1["value"] == df2["value"]).count() File "/.../spark/python/pyspark/sql/connect/dataframe.py", line 151, in count pdd = self.agg(_invoke_function("count", lit(1))).toPandas() File "/.../spark/python/pyspark/sql/connect/dataframe.py", line 1031, in toPandas return self._session.client.to_pandas(query) File "/.../spark/python/pyspark/sql/connect/client.py", line 413, in to_pandas return self._execute_and_fetch(req) File "/.../spark/python/pyspark/sql/connect/client.py", line 573, in _execute_and_fetch self._handle_error(rpc_error) File "/.../spark/python/pyspark/sql/connect/client.py", line 619, in _handle_error raise SparkConnectAnalysisException( pyspark.sql.connect.client.SparkConnectAnalysisException: [AMBIGUOUS_REFERENCE] Reference `value` is ambiguous, could be: [`value`, `value`].
Attachments
Issue Links
- is duplicated by
-
SPARK-41823 DataFrame.join creating ambiguous column names
- Resolved
- links to