Description
after SPARK-2446, the compatibility with parquet file created by old spark release (spark 1.0.x) and by impala (all of versions until now: 1.4.x-cdh5) is broken.
strings in those parquet files are not annotated with UTF8 or are just only ASCII char set (impala doesn't support UTF8 yet)
this ticket aims to add a configuration option or some version check to support those parquet files.
Attachments
Issue Links
- duplicates
-
SPARK-2927 Add a conf to configure if we always read Binary columns stored in Parquet as String columns
- Resolved
- relates to
-
SPARK-2927 Add a conf to configure if we always read Binary columns stored in Parquet as String columns
- Resolved
- links to