IMPALA-2735

Push down conjunct evaluation into Parquet column readers

Details

    • Type: Improvement
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: Impala 2.3.0
    • Fix Version/s: None
    • Component/s: Backend

    Description

      Currently EvalConjuncts() operates on an entire row, so values must be materialised into the row format before any conjunct can be evaluated. In some cases we partially materialise each tuple before evaluating conjuncts, but this is not cache-efficient if we want to materialise column-by-column.

      In some cases, e.g. the Parquet scanner, it would help to evaluate conjuncts that reference only a single slot directly on a plain value outside the row format. We could then materialise a dense array of that column's values and filter out the rows that don't pass the conjuncts.
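
A minimal sketch of the idea described above, not Impala's actual scanner code. It assumes a column has already been decoded into a dense array, evaluates a single-slot predicate directly on the raw values, and only then materialises the surviving rows. The names FilterColumn, MaterialiseSelected and Row are hypothetical and exist only for illustration.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a single-slot conjunct: any callable that takes
// one raw column value and returns true if the value passes.
// Returns the indices of the values that pass (a selection vector).
template <typename T, typename Pred>
std::vector<uint32_t> FilterColumn(const std::vector<T>& values, Pred pred) {
  std::vector<uint32_t> selected;
  selected.reserve(values.size());
  for (uint32_t i = 0; i < values.size(); ++i) {
    // The predicate sees a plain value; no row/tuple has been built yet.
    if (pred(values[i])) selected.push_back(i);
  }
  return selected;
}

// Hypothetical row format with two slots.
struct Row {
  int32_t id;
  int64_t amount;
};

// Materialise rows only for the indices that survived filtering.
std::vector<Row> MaterialiseSelected(const std::vector<int32_t>& ids,
                                     const std::vector<int64_t>& amounts,
                                     const std::vector<uint32_t>& selected) {
  std::vector<Row> rows;
  rows.reserve(selected.size());
  for (uint32_t idx : selected) {
    rows.push_back(Row{ids[idx], amounts[idx]});
  }
  return rows;
}
```

With this shape, a conjunct such as `amount > 100` could be applied as `FilterColumn(amounts, [](int64_t v) { return v > 100; })`, and the other columns would be copied into the row format only for the selected indices. A real implementation would also have to handle NULLs, nested types and conjuncts that reference more than one slot, none of which this sketch covers.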

            People

              kwho Michael Ho
              tarmstrong Tim Armstrong
              Votes: 0
              Watchers: 5
