Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
0.14.0
-
ORC Table ~ 12 string columns
-
Use Text writables directly in ORC dictionaries to avoid String allocations.
Description
When ORC string dictionary writes data out, it suffers from bad GC performance due to a few allocations in-loop.
The conversions are as follows
StringTreeWriter::getStringValue() causes 2 conversions
LazyString -> Text (LazyString::getWritableObject)
Text -> String (LazyStringObjectInspector::getPrimitiveJavaObject)
Then StringRedBlackTree::add() does one conversion
String -> Text
This causes some GC pressure with un-necessary String and byte[] array allocations.
Attachments
Attachments
Issue Links
- is related to
-
HIVE-7364 Trunk cannot be built on -Phadoop1 after HIVE-7144
- Closed