[HUDI-3758] Optimize flink partition table with BucketIndex - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.11.1
Component/s: flink
Labels:
- pull-request-available

Description

When using flink bucket index , I meet two problems

without use all streamWriter tasks when partition table with small Bucket number
crashed with the following step

start job
killed before first commit success ( left some log files)
restart job run nomal after one successful commit
kill job and restart throws `Duplicate fileID 00000001-6f57-4c71-bf6f-ee7616ec7b14 from bucket 1 of partition found during the BucketStreamWriteFunction index bootstrap`

Attachments

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

image-2022-03-31-15-44-30-480.png
31/Mar/22 07:44
13 kB
konwu
image-2022-03-31-15-44-34-450.png
31/Mar/22 07:44
13 kB
konwu

Issue Links

links to

GitHub Pull Request #5185

Activity

People

Assignee:: Unassigned

Reporter:: konwu

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 31/Mar/22 07:35

Updated:: 18/Jun/22 19:56

Resolved:: 18/Jun/22 19:56