Details
- Type: Bug
- Status: Open
- Priority: Major
- Resolution: Unresolved
Description
Depending on the query, the Hive client does a fair amount of work once the query finishes. This has several side effects that would be nice to avoid. Some of these work items are:
- Moving temp files / dirs around to their final destination.
- Creating empty files if required.
- Updating metastore with metadata.
- Possibly more?
If we start using OutputCommitter, clients can fire and forget: they won't need to stay around for the entire time the query is executing. In addition, since this processing would happen in the cluster, the client becomes more lightweight.
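As a rough illustration of the idea, here is a minimal, self-contained sketch of what a job-level commit step might do if the work items listed above ran in the cluster instead of in the client. It uses only `java.nio.file` and deliberately does not extend Hadoop's real `OutputCommitter`, so it compiles without Hadoop on the classpath. The class `JobCommitSketch`, the in-memory `metastore` map, the `part-00000` file name, and the rule "create an empty file when the query produced no output" are all assumptions made for illustration, not Hive's actual behavior.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical stand-in for a job-commit hook; not Hive or Hadoop code. */
class JobCommitSketch {
    /** Hypothetical in-memory stand-in for the metastore: table name -> files committed. */
    final Map<String, Long> metastore = new HashMap<>();

    /** Runs the post-query steps in one place, so the client need not wait for them. */
    void commitJob(String table, Path tempDir, Path finalDir) throws IOException {
        Files.createDirectories(finalDir);

        // Snapshot the temp directory first so we do not move files while iterating it.
        List<Path> outputs;
        try (Stream<Path> listing = Files.list(tempDir)) {
            outputs = listing.collect(Collectors.toList());
        }

        // 1. Move temp files / dirs to their final destination.
        for (Path output : outputs) {
            Files.move(output, finalDir.resolve(output.getFileName()),
                    StandardCopyOption.REPLACE_EXISTING);
        }

        // 2. Create an empty file if required (assumed here: when nothing was produced).
        Path placeholder = finalDir.resolve("part-00000");
        if (outputs.isEmpty() && Files.notExists(placeholder)) {
            Files.createFile(placeholder);
        }

        // 3. Update the (stand-in) metastore with metadata about the committed output.
        metastore.put(table, (long) outputs.size());
    }

    public static void main(String[] args) throws IOException {
        Path base = Files.createTempDirectory("commit-sketch");
        Path tempDir = Files.createDirectories(base.resolve("_tmp"));
        Files.write(tempDir.resolve("part-00000"), "row1\n".getBytes());

        JobCommitSketch committer = new JobCommitSketch();
        committer.commitJob("example_table", tempDir, base.resolve("warehouse"));
        System.out.println("files committed: " + committer.metastore.get("example_table"));
    }
}
```

In a real implementation these steps would presumably live in a subclass of `org.apache.hadoop.mapreduce.OutputCommitter`, with the job-level work in `commitJob(JobContext)`, so that the framework rather than the client triggers them when the job succeeds.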
Issue Links
- relates to: HIVE-4331 Integrated StorageHandler for Hive and HCat using the HiveStorageHandler (Closed)