Apache Hudi / HUDI-2526

Make spark.sql.parquet.writeLegacyFormat configurable


Details

    • Type: Task
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.10.0
    • Component/s: None

Description

From the community:

"I am observing that Hudi bulk insert in 0.9.0 does not honor the
spark.sql.parquet.writeLegacyFormat=true config. Can you suggest a way to
set it? Reason to use this config: the current bulk insert uses the Spark
DataFrame writer and performs no Avro conversion, so the decimal columns in
my DataFrame are written with the INT32 physical type in Parquet. The upsert
path, which does use Avro conversion, generates fixed-length byte arrays for
decimal types, and this fails with a datatype mismatch."
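
For context, the layout difference described above is plain Spark behavior
and can be reproduced without Hudi. A minimal spark-shell sketch (paths are
illustrative): a small-precision decimal is written as INT32 by default and
as FIXED_LEN_BYTE_ARRAY once the legacy format is enabled.

// How spark.sql.parquet.writeLegacyFormat changes the physical
// Parquet type of a small-precision decimal column.
import org.apache.spark.sql.types.DecimalType
import spark.implicits._

val df = Seq(BigDecimal("12.34"), BigDecimal("56.78"))
  .toDF("amount")
  .select($"amount".cast(DecimalType(8, 2)))

// Standard layout: decimal(8,2) fits in 32 bits, so the column gets
// physical type INT32 in the Parquet footer.
df.write.mode("overwrite").parquet("/tmp/decimal_standard")

// Legacy (Spark 1.x / parquet-avro compatible) layout: the same column
// is written as FIXED_LEN_BYTE_ARRAY instead.
spark.conf.set("spark.sql.parquet.writeLegacyFormat", "true")
df.write.mode("overwrite").parquet("/tmp/decimal_legacy")

Inspecting both outputs with parquet-tools meta shows INT32 versus
FIXED_LEN_BYTE_ARRAY for the amount column, which is exactly the mismatch
the reporter sees between bulk-insert and upsert files.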

The main reason is that spark.sql.parquet.writeLegacyFormat is hardcoded in
the bulk insert write path. We can expose it as a Hudi write config so users
can opt into the legacy Parquet decimal layout.
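
A sketch of what a bulk insert could look like once the setting is exposed.
The hoodie.parquet.writelegacyformat.enabled key is my reading of the option
introduced by this fix in 0.10.0, and the table name, record key, precombine
field, and paths are all illustrative assumptions:

// spark-shell sketch of a bulk insert with the legacy layout enabled.
import org.apache.spark.sql.SaveMode

val df = spark.read.parquet("/tmp/source_with_decimals")  // illustrative input

df.write.format("hudi")
  .option("hoodie.table.name", "demo_table")
  .option("hoodie.datasource.write.recordkey.field", "id")
  .option("hoodie.datasource.write.precombine.field", "ts")
  .option("hoodie.datasource.write.operation", "bulk_insert")
  // Assumed key for the newly exposed setting: write decimals as
  // fixed-length byte arrays so bulk-insert files match the
  // Avro-based upsert path.
  .option("hoodie.parquet.writelegacyformat.enabled", "true")
  .mode(SaveMode.Append)
  .save("/tmp/hudi/demo_table")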

People

    Assignee: Sagar Sumit (codope)
    Reporter: Sagar Sumit (codope)
    Votes: 0
    Watchers: 1
