Uploaded image for project: 'Apache HAWQ'
  1. Apache HAWQ
  2. HAWQ-1404

PXF to leverage file-level stats of ORC file and emit records for COUNT(*)

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 2.3.0.0-incubating
    • PXF
    • None

    Description

      For cases when user issues COUNT queries without WHERE clause PXF should be able to leverage file-level stats for a ORC file and emit given number of records back to HAWQ, avoiding reading actual tuples from disk. This should be a first step in enabling PXF to use ORC stats(file, stripe and row group levels) so we can improve a wider range of aggregate queries.

      So whenever PXF receives "count" as AGG-TYPE parameters value - it should optimize it by emitting tuples using ORC file-level stats.

      Attachments

        Issue Links

          Activity

            People

              odiachenko Oleksandr Diachenko
              odiachenko Oleksandr Diachenko
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: