Details
Type: Sub-task
Status: Resolved
Priority: Minor
Resolution: Fixed
Description
Currently, ORC's StringTreeReader::nextVector rebuilds the dictionary of strings on every call. This hurts performance because each call allocates and then frees a large amount of memory. Since the dictionary does not change within a stripe, StringTreeReader::nextVector should be optimized to build the dictionary only once, when the stripe is read.
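A minimal sketch of the proposed change, under stated assumptions: the class names (StripeDictionarySketch, Stripe, CachedStringReader), the startStripe method, and the dictionaryBuilds counter are all hypothetical stand-ins for illustration and are not the actual Hive/ORC classes or the committed patch. The idea is only that the dictionary is decoded once when a stripe starts, and each nextVector call then performs index lookups against it.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical sketch; these are NOT the real Hive/ORC reader classes.
class StripeDictionarySketch {

    /** Stand-in for one stripe: encoded dictionary entries plus one dictionary index per row. */
    static final class Stripe {
        final byte[][] encodedDictionary;
        final int[] rowIds;

        Stripe(String[] words, int[] rowIds) {
            this.encodedDictionary = new byte[words.length][];
            for (int i = 0; i < words.length; i++) {
                this.encodedDictionary[i] = words[i].getBytes(StandardCharsets.UTF_8);
            }
            this.rowIds = rowIds;
        }
    }

    /** Reader that builds the dictionary once per stripe rather than once per batch. */
    static final class CachedStringReader {
        private Stripe stripe;
        private String[] dictionary; // decoded once in startStripe, reused by every batch
        private int position;        // next row to read within the current stripe
        int dictionaryBuilds;        // illustration only: counts how often the dictionary is decoded

        /** Assumed hook that runs when the reader moves to a new stripe. */
        void startStripe(Stripe newStripe) {
            this.stripe = newStripe;
            this.position = 0;
            this.dictionary = new String[newStripe.encodedDictionary.length];
            for (int i = 0; i < dictionary.length; i++) {
                dictionary[i] = new String(newStripe.encodedDictionary[i], StandardCharsets.UTF_8);
            }
            dictionaryBuilds++;
        }

        /** Returns up to batchSize values using only lookups into the cached dictionary. */
        String[] nextVector(int batchSize) {
            int n = Math.min(batchSize, stripe.rowIds.length - position);
            String[] batch = new String[n];
            for (int i = 0; i < n; i++) {
                batch[i] = dictionary[stripe.rowIds[position + i]];
            }
            position += n;
            return batch;
        }
    }

    public static void main(String[] args) {
        Stripe stripe = new Stripe(new String[] {"a", "b", "c"}, new int[] {0, 1, 1, 2, 0});
        CachedStringReader reader = new CachedStringReader();
        reader.startStripe(stripe);
        for (int i = 0; i < 3; i++) {
            System.out.println(Arrays.toString(reader.nextVector(2)));
        }
        System.out.println("dictionary builds: " + reader.dictionaryBuilds);
    }
}
```

In this sketch, three batches are served from a single dictionary build. The per-batch cost is reduced to the output array and index lookups, which is the saving the description asks for.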