Description
Optimize the time taken for HDFS-SYNC full snapshot.
- Decrease total time taken for HDFS ACL synchronization.
- Reduce/Avoid the duration HMS threads are blocked by synchronizing the notifications with sentry.
Here are the time intensive operations in the hdfs-acl synchronization ordered by time taken.
- Fetching full snapshot from HMS.
- Persisting the snapshot in the database.
- Retrieving full snapshot from database.
Attachments
1.
|
Optimize the performence by tuning sentry and HMS properties | Open | Unassigned | |
2.
|
Optimize the time taken for HMS snapshot creation. | Open | Unassigned | |
3.
|
Optimize time taken for persistence HMS snapshot by persisting in parallel | Patch Available | Krishna Kalyan | |
4.
|
Reduce the HMS snapshot size by ignoring partitions at default locations | Patch Available | Krishna Kalyan | |
5.
|
Avoid HMS event synchronization while sentry is fetching full snapshot. | Resolved | Krishna Kalyan | |
6.
|
Filter the path information in the fullupdate send to sentry namenode plug-in | Open | Unassigned | |
7.
|
Create a cache of Path and Perm Full Updates to send to NN | Patch Available | Arjun Mishra | |
8.
|
Minimize the time sentry blocks HMS threads | Patch Available | Krishna Kalyan | |
9.
|
Increase the allocation size for auto-increment of id's for Snapshot tables. | Resolved | Krishna Kalyan | |
10.
|
Avoid getting the complete partition objects and fetch the location information only. | Open | Unassigned | |
11.
|
Persist HMS paths in batches. | Resolved | Krishna Kalyan |