Hive / HIVE-24831

Support writing bloom filters in Parquet

Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Parquet
    • Labels: None

    Description

      Parquet-mr 1.12.0 will add support for reading and writing Bloom filters.

      Reading requires no action on the Hive side: the filter is applied automatically when there is an equality (Eq) predicate on a column and the file contains a Bloom filter for that column.

      Writing needs configuration, because Parquet-mr does not write Bloom filters by default.
      Table properties similar to those in ORC (e.g. 'orc.bloom.filter.columns') could be used to list the columns for which Parquet-mr should write Bloom filters. Hive and Impala could share the same table property for this purpose.
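
      As a rough illustration of the idea, the sketch below turns a comma-separated column list, as it might appear in a table property, into per-column writer settings. The property name 'parquet.bloom.filter.columns' is hypothetical (modeled on the ORC one), and the 'parquet.bloom.filter.enabled#<column>' key format is an assumption about how Parquet-mr accepts per-column options; neither is confirmed by this issue.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class BloomFilterProps {
    // Hypothetical table property, modeled on ORC's 'orc.bloom.filter.columns'.
    static final String TABLE_PROP = "parquet.bloom.filter.columns";
    // Assumed per-column key format: "<key>#<column path>".
    static final String ENABLED_PREFIX = "parquet.bloom.filter.enabled#";

    // Maps a value such as "id, name" to one "enabled" entry per listed column.
    static Map<String, String> toParquetConf(String columnList) {
        Map<String, String> conf = new LinkedHashMap<>();
        if (columnList == null) {
            return conf; // property not set: write no Bloom filters
        }
        for (String col : columnList.split(",")) {
            String name = col.trim();
            if (!name.isEmpty()) {
                conf.put(ENABLED_PREFIX + name, "true");
            }
        }
        return conf;
    }

    public static void main(String[] args) {
        // Each resulting entry would be copied into the writer's configuration.
        toParquetConf("id, name").forEach((k, v) -> System.out.println(k + " = " + v));
    }
}
```

      Keeping the translation in one small function would let Hive and Impala agree on the table property while each engine maps it to its own writer options.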

            People

              Assignee: Unassigned
              Reporter: Csaba Ringhofer (csringhofer)
              Votes: 0
              Watchers: 7