Details
- Type: Bug
- Status: Resolved
- Priority: Major
- Resolution: Incomplete
- Affects Version/s: 2.0.0, 2.0.1
- Fix Version/s: None
Description
Currently, when you write a sorted DataFrame to Parquet and then read the data back, the rows do not come back in sorted order by default. This is due to a bug in Parquet that was fixed in Parquet 1.9.
As a workaround, read the data back in using a file glob (filepath/*) instead of the bare directory path.
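A minimal sketch of the glob workaround, assuming PySpark on Spark 2.0.x and an existing SparkSession passed in as `spark`. The helper names `as_glob` and `read_parquet_via_glob` are hypothetical and only illustrate the idea; whether the original ordering is actually preserved should still be checked on your own data.

```python
def as_glob(path):
    """Turn a Parquet output directory path into a glob over its contents."""
    # Drop any trailing slash so we never produce "dir//*".
    return path.rstrip("/") + "/*"


def read_parquet_via_glob(spark, path):
    """Read a Parquet directory through a file glob (filepath/*).

    `spark` is assumed to be an existing SparkSession; it is passed in
    rather than created here so the sketch stays self-contained.
    """
    return spark.read.parquet(as_glob(path))
```

For example, `read_parquet_via_glob(spark, "/tmp/sorted_out")` (a made-up path) reads from `/tmp/sorted_out/*` rather than `/tmp/sorted_out`.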
Issue Links
- is blocked by
  - PARQUET-1027 release Parquet-mr 1.9.1 (Open)
- is duplicated by
  - SPARK-18860 Update Parquet to 1.9.0 (Resolved)
  - SPARK-18140 Parquet NPE / Update to 1.9 (Closed)
  - SPARK-20406 Upgrade parquet to 1.9 (Resolved)
- is related to
  - SPARK-10109 NPE when saving Parquet To HDFS (Resolved)
  - SPARK-10294 When Parquet writer's close method throws an exception, we will call close again and trigger a NPE (Resolved)
- links to