Apache Arrow / ARROW-11384

[C++][Dataset] Support bloom filters in predicate pushdown

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: C++

    Description

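      The Parquet specification includes bloom filters, which can be useful during filtering: they can rule out a row group for an equality predicate even when the value falls inside the row group's min/max range. In the context of dataset::, this would be expressed as additional Parquet statistics expressions on each row group, allowing row groups that cannot contain any matching rows to be skipped more aggressively.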

      Prerequisite: https://issues.apache.org/jira/browse/PARQUET-1327 (reader/writer support for bloom filters)
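
The row-group skipping described above can be sketched as follows. This is a minimal illustration and not the Arrow or Parquet API: `ToyBloomFilter`, `RowGroup`, `MakeRowGroup`, and `RowGroupsToScan` are hypothetical names, and the filter is a plain bit-array bloom filter with an ad hoc hash, not the split-block format defined by the Parquet specification. It assumes an equality predicate on a single int64 column.

```cpp
#include <algorithm>
#include <bitset>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical toy bloom filter (NOT Parquet's split-block bloom filter).
class ToyBloomFilter {
 public:
  void Insert(std::int64_t value) {
    for (std::size_t i = 0; i < kNumHashes; ++i) bits_.set(Index(value, i));
  }

  // false => the value is definitely absent; true => it may be present.
  bool MightContain(std::int64_t value) const {
    for (std::size_t i = 0; i < kNumHashes; ++i) {
      if (!bits_.test(Index(value, i))) return false;
    }
    return true;
  }

 private:
  static constexpr std::size_t kNumBits = 1024;
  static constexpr std::size_t kNumHashes = 3;

  // Ad hoc 64-bit mixer, seeded per hash function (illustrative only).
  static std::size_t Index(std::int64_t value, std::size_t seed) {
    std::uint64_t h =
        static_cast<std::uint64_t>(value) + 0x9e3779b97f4a7c15ULL * (seed + 1);
    h ^= h >> 30;
    h *= 0xbf58476d1ce4e5b9ULL;
    h ^= h >> 27;
    h *= 0x94d049bb133111ebULL;
    h ^= h >> 31;
    return static_cast<std::size_t>(h % kNumBits);
  }

  std::bitset<kNumBits> bits_;
};

// Hypothetical per-row-group metadata: min/max statistics plus a bloom filter.
struct RowGroup {
  std::int64_t min_value;
  std::int64_t max_value;
  ToyBloomFilter bloom;
};

// Builds the metadata for one row group; `values` must be non-empty.
RowGroup MakeRowGroup(const std::vector<std::int64_t>& values) {
  RowGroup group{values.front(), values.front(), {}};
  for (std::int64_t v : values) {
    group.min_value = std::min(group.min_value, v);
    group.max_value = std::max(group.max_value, v);
    group.bloom.Insert(v);
  }
  return group;
}

// Returns the indices of row groups that may contain `column == target`.
std::vector<std::size_t> RowGroupsToScan(const std::vector<RowGroup>& groups,
                                         std::int64_t target) {
  std::vector<std::size_t> to_scan;
  for (std::size_t i = 0; i < groups.size(); ++i) {
    const RowGroup& g = groups[i];
    // Existing pruning: min/max statistics.
    if (target < g.min_value || target > g.max_value) continue;
    // Additional pruning: the bloom filter proves the value is absent.
    if (!g.bloom.MightContain(target)) continue;
    to_scan.push_back(i);
  }
  return to_scan;
}
```

A bloom filter never reports a false negative, so a skipped row group is guaranteed not to contain the value. It can report false positives, so a row group that passes both checks still has to be scanned and filtered.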

          People

            Assignee: Unassigned
            Reporter: Ben Kietzman (bkietz)
            Votes: 2
            Watchers: 6
