Details
-
Bug
-
Status: Closed
-
Critical
-
Resolution: Fixed
-
None
Description
The changes added in HADOOP-17901 and HADOOP-17905 introduced some issues in Hive side. UDFSpace is generating false output because the result length is not determined correctly. It is causing unexpected characters in the result.
Repro steps:
create table t(i int);
insert into t values (5),(6),(7),(8),(9),(10),(11),(12),(13),(14);
select i, SPACE(i) from t;
Faulty output:
+-----+-----------------+ | i | _c1 | +-----+-----------------+ | 5 | | | 6 | | | 7 | | | 8 | | | 9 | | | 10 | | | 11 | | | 12 | | | 13 | | | 14 | | +-----+-----------------+
Also by using --outputformat=csv the generated file contains 0 ascii characters(“EOF”) instead of spaces.
Attachments
Issue Links
- is caused by
-
HADOOP-17901 Performance degradation in Text.append() after HADOOP-16951
- Resolved
-
HADOOP-17905 Modify Text.ensureCapacity() to efficiently max out the backing array size
- Resolved
- links to