Uploaded image for project: 'ORC'
  1. ORC
  2. ORC-373

If "orc.dictionary.key.threshold" is set to 0, don't try dictionary encoding.

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.5.2
    • Fix Version/s: 1.5.2, 1.6.0
    • Component/s: None
    • Labels:
      None

      Description

      Currently dictionary check happens after creation of first row group entry. Even when row indexes are disabled, rows end up in red-black tree first before getting flushed during write stripe (into direct stream).
      If dictionary threshold is set to <= 0.0 disable dictionary, we should write directly to stream instead of RBTree. This is useful for hive streaming ingest where delta files explicitly disables dictionaries.

        Attachments

          Activity

            People

            • Assignee:
              prasanth_j Prasanth Jayachandran
              Reporter:
              prasanth_j Prasanth Jayachandran
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: