Details
- Type: Task
- Status: Resolved
- Priority: Major
- Resolution: Fixed
Description
For very large tables (200+ partitions and 100K+ files), the current initialization code in HoodieBackedTableMetadataWriter is slow because it lists all partitions and files sequentially.
The code is also inefficient because it lists each directory twice: first to get the list of partitions and later to get the list of files. Both can be gathered in a single listing pass.
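
The idea can be sketched as follows. This is only a minimal illustration on the local filesystem using java.nio, not the actual HoodieBackedTableMetadataWriter code or any Hudi API. The class name SinglePassListing and its methods are hypothetical, and it assumes that any directory directly containing at least one file counts as a partition (a real implementation would check for the partition metadata file and skip internal directories). Each directory is listed exactly once, and all directories at the same depth are listed in parallel.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

// Hypothetical sketch: not part of Hudi.
public class SinglePassListing {

    /** Files and sub-directories found by listing one directory once. */
    static final class DirListing {
        final Path dir;
        final List<Path> files = new ArrayList<>();
        final List<Path> subDirs = new ArrayList<>();

        DirListing(Path dir) {
            this.dir = dir;
        }
    }

    /** Lists a directory a single time, splitting entries into files and sub-directories. */
    static DirListing listOnce(Path dir) {
        DirListing result = new DirListing(dir);
        try (DirectoryStream<Path> entries = Files.newDirectoryStream(dir)) {
            for (Path entry : entries) {
                if (Files.isDirectory(entry)) {
                    result.subDirs.add(entry);
                } else {
                    result.files.add(entry);
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return result;
    }

    /**
     * Walks the table level by level. Directories on the same level are
     * listed in parallel, and the same listing yields both the partitions
     * (assumption: directories that directly contain files) and their files.
     */
    static Map<Path, List<Path>> listPartitionsAndFiles(Path root) {
        Map<Path, List<Path>> filesByPartition = new HashMap<>();
        List<Path> frontier = Collections.singletonList(root);
        while (!frontier.isEmpty()) {
            // One listing call per directory, executed in parallel.
            List<DirListing> listings = frontier.parallelStream()
                    .map(SinglePassListing::listOnce)
                    .collect(Collectors.toList());
            List<Path> next = new ArrayList<>();
            for (DirListing listing : listings) {
                if (!listing.files.isEmpty()) {
                    filesByPartition.put(listing.dir, listing.files);
                }
                next.addAll(listing.subDirs);
            }
            frontier = next;
        }
        return filesByPartition;
    }
}
```

The parallel stream here runs on the JVM's common fork-join pool. On a distributed filesystem, the same per-level fan-out would more likely be handed to the engine's own parallelism, which is outside the scope of this sketch.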