[SPARK-33230] FileOutputWriter jobs have duplicate JobIDs if launched in same second


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.4.7, 3.0.1
    • Fix Version/s: 2.4.8, 3.0.2, 3.1.0
    • Component/s: SQL
    • Labels: None

    Description

      The Hadoop S3A staging committer has problems when more than one Spark SQL query is launched at the same time, because it uses the JobID for the path on the cluster filesystem through which commit information is passed from the tasks to the job committer.

      If two queries are launched in the same second, their JobIDs collide: the output of job 1 ends up including all of job 2's files written so far, and job 2 then fails with a FileNotFoundException.
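
      To illustrate the collision, here is a minimal Scala sketch of how the Hadoop JobID for a write job is derived from a second-resolution timestamp plus a stage-local counter; the helper names are approximations of Spark's internal utilities, not the exact implementation.

      import java.text.SimpleDateFormat
      import java.util.{Date, Locale}
      import org.apache.hadoop.mapreduce.JobID

      // Second-resolution job tracker ID: two jobs started within the same
      // wall-clock second produce the same string.
      def createJobTrackerID(time: Date): String =
        new SimpleDateFormat("yyyyMMddHHmmss", Locale.US).format(time)

      def createJobID(time: Date, id: Int): JobID =
        new JobID(createJobTrackerID(time), id)

      // Two queries launched 900 ms apart, each starting from stage id 0,
      // get identical JobIDs, so the staging committer stores both jobs'
      // pending-commit data under the same cluster-filesystem path.
      val t1 = new Date(1603400000000L)
      val t2 = new Date(1603400000000L + 900)
      assert(createJobID(t1, 0) == createJobID(t2, 0))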

      Proposed:
      have the job configuration set "spark.sql.sources.writeJobUUID" to the value of WriteJobDescription.uuid (see the sketch below)

      That was the property name which used to serve this purpose; any committer already written against this property will pick up the value without needing any changes.
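
      As a hypothetical sketch of that proposal (the method and variable names below are illustrative, not Spark's actual code), the write path would copy the per-query UUID into the Hadoop configuration under the old property name, and a committer keyed on that property would then build a unique staging path even when two jobs share a JobID timestamp:

      import org.apache.hadoop.conf.Configuration

      // Spark side: propagate the unique write-job UUID into the job conf.
      // `jobUuid` stands in for FileFormatWriter's WriteJobDescription.uuid.
      def propagateWriteJobUUID(hadoopConf: Configuration, jobUuid: String): Unit =
        hadoopConf.set("spark.sql.sources.writeJobUUID", jobUuid)

      // Committer side: prefer the UUID, fall back to the JobID string,
      // when choosing the cluster-filesystem directory for pending commits.
      def stagingSubdir(hadoopConf: Configuration, jobIdString: String): String = {
        val unique = Option(hadoopConf.get("spark.sql.sources.writeJobUUID"))
          .getOrElse(jobIdString)
        s"staging-uploads/$unique"
      }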

      Attachments

        Issue Links

        Activity


          People

            Assignee:
            Steve Loughran (stevel@apache.org)
            Reporter:
            Steve Loughran (stevel@apache.org)
            Votes:
            0
            Watchers:
            3

            Dates

              Created:
              Updated:
              Resolved:
