Description
Consider a table partitioned by two keys (dt, region), where a single dt partition could have 10000 associated regions. Suppose the user runs:
ALTER TABLE my_table DROP PARTITION (dt='20150101');
As things stand, StorageBasedAuthProvider makes a separate DistributedFileSystem.listStatus() call for each partition directory and authorizes each one individually, so the statement above can trigger up to 10000 calls. It would be faster to batch the calls and examine multiple FileStatus objects at once.
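The difference between the two approaches can be sketched roughly as follows. This is only an illustration, not the Hive implementation: it uses java.nio on the local filesystem as a stand-in for HDFS, and the class BatchedAuthSketch, its methods, and the requests counter are hypothetical names introduced here. Each increment of requests models one round trip to the NameNode.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: local directories stand in for HDFS partition directories.
class BatchedAuthSketch {
    // Number of simulated filesystem requests (one per modeled round trip).
    static int requests = 0;

    // Lists the immediate children of dir; modeled as a single request.
    static List<Path> listChildren(Path dir) throws IOException {
        requests++;
        List<Path> children = new ArrayList<>();
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(dir)) {
            for (Path child : stream) {
                children.add(child);
            }
        }
        return children;
    }

    // Per-partition approach: one modeled request for every region directory.
    static boolean authorizeOneByOne(List<Path> regionDirs) {
        for (Path regionDir : regionDirs) {
            requests++; // models fetching this directory's status on its own
            if (!Files.isWritable(regionDir)) {
                return false;
            }
        }
        return true;
    }

    // Batched approach: one listing of the dt directory covers all regions.
    // Files.isWritable stands in for inspecting an already-fetched FileStatus.
    static boolean authorizeBatched(Path dtDir) throws IOException {
        for (Path regionDir : listChildren(dtDir)) {
            if (!Files.isWritable(regionDir)) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) throws IOException {
        Path dtDir = Files.createTempDirectory("dt=20150101");
        List<Path> regionDirs = new ArrayList<>();
        for (int i = 0; i < 5; i++) {
            regionDirs.add(Files.createDirectory(dtDir.resolve("region=r" + i)));
        }

        requests = 0;
        boolean okOneByOne = authorizeOneByOne(regionDirs);
        System.out.println("one-by-one: ok=" + okOneByOne + ", requests=" + requests);

        requests = 0;
        boolean okBatched = authorizeBatched(dtDir);
        System.out.println("batched: ok=" + okBatched + ", requests=" + requests);
    }
}
```

In this sketch the per-partition variant issues one request per region directory, while the batched variant issues a single listing of the dt directory no matter how many regions it contains. Whether a real HDFS listing carries everything the authorization check needs is an assumption of the sketch.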
Attachments
Issue Links
- depends upon: HIVE-9681 Extend HiveAuthorizationProvider to support partition-sets. (Closed)
- is depended upon by: HIVE-9629 HCatClient.dropPartitions() needs speeding up. (Open)
- relates to: HIVE-10223 Consolidate several redundant FileSystem API calls. (Closed)
- links to