Description
Currently dictionary check happens after creation of first row group entry. Even when row indexes are disabled, rows end up in red-black tree first before getting flushed during write stripe (into direct stream).
If dictionary threshold is set to <= 0.0 disable dictionary, we should write directly to stream instead of RBTree. This is useful for hive streaming ingest where delta files explicitly disables dictionaries.
Attachments
Issue Links
- is related to
-
ORC-299 Improve heuristics for bailing on dictionary encoding
- Open
- links to