Description
There are many outstanding issues with the parquet format that have been resolved between the version depended on by spark (1.6.0rc3 as of spark 1.3.1) and the most recent parquet release (1.6.0).
Some of these are things include not supporting schema migration when using parquet with avro, not supporting summary metadata in the parquet footers causing null pointer exceptions reading, and many others.
See https://github.com/apache/parquet-mr/blob/master/CHANGES.md#version-160 for the full list of fixes.
Attachments
Issue Links
- is related to
-
SPARK-4258 NPE with new Parquet Filters
- Resolved
-
SPARK-5451 And predicates are not properly pushed down
- Resolved
-
SPARK-2551 Cleanup FilteringParquetRowInputFormat
- Resolved
- links to