[HADOOP-18004] abfs and s3a disk buffer factories to use UUIDs for file prefixes - ASF JIRA

Details

Type: Improvement
Status: Open
Priority: Minor
Resolution: Unresolved
Affects Version/s: 3.3.2
Fix Version/s: None
Component/s: fs/azure, fs/s3
Labels: None

Description

The disk buffers created in the s3a and abfs output streams use a simple String.format("datablock-%04d-", index) pattern as the prefix passed to File.createTempFile.

This means there will be contention for filenames across streams, and especially across processes, because every stream starts from the same prefixes.

If each stream had a UUID prefix there would be no contention, but that would change the API. Alternatively, each disk block factory holds the UUID, and the index is simply the total number of blocks that factory has created.
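
A minimal sketch of the second option, for illustration only. The class name UuidDiskBlockFactory, its fields, and the methods nextPrefix() and createBlockFile() are hypothetical and are not the existing s3a or abfs factory code. The sketch also calls File.createTempFile directly, which is an assumption about how the buffer file ends up being created.

```java
import java.io.File;
import java.io.IOException;
import java.util.UUID;
import java.util.concurrent.atomic.AtomicLong;

/** Hypothetical sketch; not the actual s3a/abfs disk block factory. */
public class UuidDiskBlockFactory {
    // One UUID per factory instance, fixed for the factory's lifetime.
    private final String factoryId = UUID.randomUUID().toString();
    // Total number of blocks this factory has created so far.
    private final AtomicLong blocksCreated = new AtomicLong();
    // Directory in which buffer files are created (assumed to exist).
    private final File bufferDir;

    public UuidDiskBlockFactory(File bufferDir) {
        this.bufferDir = bufferDir;
    }

    /** Builds the prefix for the next block: factory UUID plus a running count. */
    public String nextPrefix() {
        long index = blocksCreated.incrementAndGet();
        return String.format("datablock-%s-%04d-", factoryId, index);
    }

    /** Creates the temporary buffer file for the next block. */
    public File createBlockFile() throws IOException {
        return File.createTempFile(nextPrefix(), ".tmp", bufferDir);
    }
}
```

Because the UUID lives in the factory, callers do not need to pass anything new per stream, and two factories (in the same process or in different ones) would not share a prefix.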

People

Assignee: Mehakmeet Singh

Reporter: Steve Loughran

Votes: 0

Watchers: 1

Dates

Created: 11/Nov/21 12:55

Updated: 11/Nov/21 12:55