[SPARK-20937] Describe spark.sql.parquet.writeLegacyFormat property in Spark SQL, DataFrames and Datasets Guide - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Trivial
Resolution: Fixed
Affects Version/s: 2.3.0
Fix Version/s: 2.4.0
Component/s: Documentation, SQL
Labels:
None

Description

As a follow-up to ~~SPARK-20297~~ (and ~~SPARK-10400~~) in which spark.sql.parquet.writeLegacyFormat property was recommended for Impala and Hive, Spark SQL docs for Parquet Files should have it documented.

p.s. It was asked about in Why can't Impala read parquet files after Spark SQL's write? on StackOverflow today.

p.s. It's also covered in holden.karau@gmail.com's "High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark" book (in Table 3-10. Parquet data source options) that gives the option some wider publicity.

Attachments

Issue Links

links to

[Github] Pull Request #22453 (seancxmao)

Activity

People

Assignee:: Chenxiao Mao

Reporter:: Jacek Laskowski

Votes:: 1 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 31/May/17 09:28

Updated:: 12/Dec/22 18:11

Resolved:: 26/Sep/18 14:15