Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
1.3.0
-
None
Description
as of HIVE-11983, when a TransactionBatch is opened in StreamingAPI, a full compliment of bucket files (AbstractRecordWriter.createRecordUpdaters()) is created on disk even though some may end up receiving no data.
It would be better to create them on demand and not clog the FS.
Tez can handle missing (empty) buckets and on MR bucket join algorithms will check if all buckets are there and bail out if not.
Attachments
Attachments
Issue Links
- is related to
-
HIVE-11719 acid insert with dynamic partitioning doesn't create empty buckets
- Open
-
HIVE-17138 FileSinkOperator/Compactor doesn't create empty files for acid path
- Open
- relates to
-
HIVE-11983 Hive streaming API uses incorrect logic to assign buckets to incoming records
- Resolved