Details
- Bug
- Status: Closed
- Major
- Resolution: Fixed
- 0.23.3, 2.0.0-alpha
- None
- Reviewed
Description
Hive returns unexpected results when running on MR2 (with MR1, the results are always as expected).
In MR2, when "Total input paths to process" is 1, CombineFileInputFormat.getSplits() returns 0 splits.
The calling code in Hive, in Hadoop23Shims.java:
InputSplit[] splits = super.getSplits(job, numSplits);
This yields splits.length == 0.
In MR1 everything works as expected. The calling code in Hive, in Hadoop20Shims.java:
CombineFileSplit[] splits = (CombineFileSplit[]) super.getSplits(job, numSplits);
This yields splits.length == 1.
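The symptom reported above (one input path, zero splits) can be expressed as a small check. This is only a sketch for illustration: the class SplitSanityCheck and the method isSuspiciousEmptyResult are hypothetical names, not part of Hive or Hadoop, and plain Object arrays stand in for the real InputSplit[] so that the snippet runs without Hadoop on the classpath.

```java
public class SplitSanityCheck {

    /**
     * Hypothetical helper (not a Hive or Hadoop API): returns true when there
     * is at least one input path to process but no splits were returned.
     */
    public static boolean isSuspiciousEmptyResult(int inputPathCount, Object[] splits) {
        return inputPathCount > 0 && (splits == null || splits.length == 0);
    }

    public static void main(String[] args) {
        // Stand-ins for the arrays returned by super.getSplits(job, numSplits).
        Object[] mr2Splits = new Object[0]; // length reported on MR2
        Object[] mr1Splits = new Object[1]; // length reported on MR1

        System.out.println("MR2 result suspicious: " + isSuspiciousEmptyResult(1, mr2Splits));
        System.out.println("MR1 result suspicious: " + isSuspiciousEmptyResult(1, mr1Splits));
    }
}
```

A check of this kind in the calling code would only make the problem visible earlier. It does not explain why getSplits() returns an empty array in MR2.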