Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
ghx-label-4
Description
When Hive inserts data into partitioned tables, it generates a lot of ALTER_PARTITION (and possibly INSERT_EVENT) in quick succession. Currently, such events are processed one by one by EventsProcessor which is can be slow and can cause EventsProcessor to lag behind. This JIRA proposes to use batching for such ALTER_PARTITION events such that all the successive ALTER_PARTITION events for the same table are batched together into one ALTER_PARTITIONS event and then are processed together to refresh all the partitions from the events. This can significantly speed up the event processing in such cases.
Attachments
Issue Links
- is part of
-
IMPALA-7954 Support automatic invalidates using metastore notification events
- Resolved
- is related to
-
IMPALA-10949 Improve batching logic of events
- Resolved