[SPARK-20937] Describe spark.sql.parquet.writeLegacyFormat property in Spark SQL, DataFrames and Datasets Guide - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Trivial
Resolution: Fixed
Affects Version/s: 2.3.0
Fix Version/s: 2.4.0
Component/s: Documentation, SQL
Labels: None

Description

As a follow-up to SPARK-20297 (and SPARK-10400), in which the spark.sql.parquet.writeLegacyFormat property was recommended for compatibility with Impala and Hive, the Spark SQL docs for Parquet Files should document the property.
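For context, a minimal PySpark sketch of how the property might be set when writing Parquet for Impala or Hive. The property name comes from this issue; the helper names (`legacy_parquet_conf`, `to_submit_args`, `write_legacy_parquet`), the app name, and the one-column decimal schema are hypothetical and only for illustration. It assumes the property's effect is as Spark's own configuration description states: when true, data is written the way Spark 1.4 and earlier wrote it, for example decimals as fixed-length byte arrays.

```python
from decimal import Decimal

# Property name taken from this issue.
LEGACY_FORMAT_KEY = "spark.sql.parquet.writeLegacyFormat"


def legacy_parquet_conf(enabled=True):
    """Return the conf entry as strings, the form Spark configuration uses."""
    return {LEGACY_FORMAT_KEY: "true" if enabled else "false"}


def to_submit_args(conf):
    """Render conf entries as `--conf key=value` arguments for spark-submit."""
    args = []
    for key, value in sorted(conf.items()):
        args += ["--conf", f"{key}={value}"]
    return args


def write_legacy_parquet(output_path):
    """Write a tiny decimal column with the legacy format enabled (hypothetical helper)."""
    # Imported lazily so the helpers above work without a Spark installation.
    from pyspark.sql import SparkSession
    from pyspark.sql.types import DecimalType, StructField, StructType

    builder = SparkSession.builder.appName("write-legacy-parquet-sketch")
    for key, value in legacy_parquet_conf().items():
        builder = builder.config(key, value)
    spark = builder.getOrCreate()

    # Decimals are one of the types whose Parquet encoding the property changes.
    schema = StructType([StructField("amount", DecimalType(10, 2))])
    df = spark.createDataFrame([(Decimal("12.34"),)], schema)
    df.write.mode("overwrite").parquet(output_path)
    spark.stop()
```

The same setting could also be passed on the command line, for instance with the arguments produced by `to_submit_args(legacy_parquet_conf())`, or changed on a running session with `spark.conf.set(...)` before the write.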

p.s. It was asked about in "Why can't Impala read parquet files after Spark SQL's write?" on StackOverflow today.

p.s. It's also covered in Holden Karau's book "High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark" (in Table 3-10. Parquet data source options), which gives the option some wider publicity.