Details
Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
4.0.0
Description
When using Spark Connect with PySpark, the string collation is silently dropped:
Client connected to the Spark Connect server at localhost
SparkSession available as 'spark'.
>>> spark.sql("select 'abc' collate 'UNICODE'")
DataFrame[collate(abc): string]
>>> from pyspark.sql.types import StructType, StringType, StructField
>>> spark.createDataFrame([], StructType([StructField('id', StringType(2))]))
DataFrame[id: string]
Instead of the "string" type, the DataFrame should report "string COLLATE 'UNICODE'".
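The expected behavior could be asserted in a regression check along the following lines. This is only a minimal sketch that works on the printed type strings from the session above; the helper name `collation_of` is hypothetical and not part of PySpark, and the sketch does not connect to a Spark Connect server.

```python
import re
from typing import Optional

# Hypothetical helper (not a PySpark API): matches a printed type string
# such as "string COLLATE 'UNICODE'" and captures the collation name.
_COLLATE_RE = re.compile(
    r"^string\s+collate\s+'?([A-Za-z0-9_]+)'?$", re.IGNORECASE
)


def collation_of(type_string: str) -> Optional[str]:
    """Return the collation name in a printed string type, or None if absent."""
    match = _COLLATE_RE.match(type_string.strip())
    return match.group(1).upper() if match else None


# Type strings taken from the report: what Connect prints vs. what is expected.
observed = "string"
expected = "string COLLATE 'UNICODE'"

# The bug is visible as a missing collation on the observed type.
collation_was_dropped = collation_of(observed) != collation_of(expected)
```

In a real test the type string would come from the DataFrame's schema rather than a literal, which is an assumption about how such a check would be wired up.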