Description
1) Don't use ORC dictionary.
2) Use VectorDeserialize.
3) Don't parse the columns that are not included (cannot avoid reading them).
4) Send VRB to the pipeline and write ORC in parallel (in background). HIVE-15672
Also add an option to disable the encoding pipeline server-side.
Attachments
Attachments
Issue Links
- is cloned by
-
HIVE-15672 LLAP text cache: improve first query perf II
- Closed
- is related to
-
HIVE-21123 LLAP text cache : same sql and TPCDS data occurs different results using textfile fileformat
- Open
- links to