Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
Description
HIVE-23891 adds the ability to deduplicate the task result that under the directory,
<table-dir>/<staging-dir>/_tmp.-ext-10000/<dynamic-partition-dir>/HIVE_UNION_SUBDIR_1,
but turns out to ignore taking the same action to the output directory for the same query:
<table-dir>/<staging-dir>/_tmp.-ext-10000/<dynamic-partition-dir>/HIVE_UNION_SUBDIR_2.
So user may still have the same data duplication problem upon multiple tez task attempts.
Attachments
Attachments
Issue Links
- relates to
-
HIVE-23891 UNION ALL and multiple task attempts can cause file duplication
- Resolved
- links to