Hive / HIVE-4244

Make string dictionaries adaptive in ORC

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: File Formats
    • Labels:
      None

      Description

      The ORC writer should adaptively switch between dictionary and direct encoding for string columns. I'd propose looking at the first 100,000 values in each column and deciding whether there is sufficient loading in the dictionary (that is, enough repeated values) to use dictionary encoding.
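The proposed check could be sketched roughly as follows. This is a minimal illustration, not the ORC writer's actual code: the class name, the `choose` method, and the 0.8 distinct-ratio cutoff are all assumptions made for the example. Only the 100,000-value sample size comes from the description above.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical sketch: none of these names exist in the ORC writer.
final class AdaptiveEncodingSketch {
    enum Encoding { DICTIONARY, DIRECT }

    // Sample size taken from the proposal in the description.
    static final int SAMPLE_SIZE = 100_000;

    // Assumed cutoff: if more than 80% of the sampled values are distinct,
    // the dictionary is treated as too lightly loaded to be worthwhile.
    static final double MAX_DISTINCT_RATIO = 0.8;

    // Inspects at most SAMPLE_SIZE leading values of a column and picks an encoding.
    static Encoding choose(Iterable<String> columnValues) {
        Set<String> distinct = new HashSet<>();
        int seen = 0;
        for (String value : columnValues) {
            if (seen == SAMPLE_SIZE) {
                break;
            }
            distinct.add(value);
            seen++;
        }
        if (seen == 0) {
            // Arbitrary choice for an empty column; nothing to encode either way.
            return Encoding.DIRECT;
        }
        double distinctRatio = (double) distinct.size() / seen;
        return distinctRatio <= MAX_DISTINCT_RATIO ? Encoding.DICTIONARY : Encoding.DIRECT;
    }
}
```

Under these assumptions, a column whose sample is mostly repeated values would get dictionary encoding, while a column of nearly unique strings would fall back to direct encoding. A real writer would more likely reuse the dictionary it is already building rather than a separate `HashSet`, and might weigh byte sizes rather than counts.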


        Activity

        Owen O'Malley made changes -
        Component/s File Formats [ 12320633 ]
        Component/s Serializers/Deserializers [ 12312585 ]
        Kevin Wilfong made changes -
        Field | Original Value | New Value
        Assignee | Owen O'Malley [ owen.omalley ] | Kevin Wilfong [ kevinwilfong ]
        Owen O'Malley created issue -

          People

          • Assignee:
            Kevin Wilfong
            Reporter:
            Owen O'Malley
          • Votes:
            0
            Watchers:
            2
