Details
- Type: Improvement
- Status: Resolved
- Priority: Major
- Resolution: Fixed
Description
When a user issues a COUNT query without a WHERE clause, PXF should be able to use the file-level statistics of an ORC file and emit the recorded number of records back to HAWQ, without reading the actual tuples from disk. This is a first step toward letting PXF use ORC statistics (at the file, stripe, and row-group levels) so that a wider range of aggregate queries can be improved.
So whenever PXF receives "count" as the value of the AGG-TYPE parameter, it should optimize the request by emitting tuples based on ORC file-level statistics.
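The intended behavior can be illustrated with a minimal sketch. It is not PXF code: `OrcFileStats`, `can_use_file_stats`, `emit_count_placeholders`, and `read_fragment` are hypothetical names, the case-insensitive match on "count" is an assumption, and the row count is passed in directly rather than read from a real ORC footer.

```python
from dataclasses import dataclass
from typing import Callable, Iterator, Optional


@dataclass
class OrcFileStats:
    """Hypothetical stand-in for ORC file-level metadata (not a real ORC API)."""
    number_of_rows: int


def can_use_file_stats(agg_type: Optional[str], has_filter: bool) -> bool:
    # The shortcut applies only to COUNT requests with no WHERE clause.
    # Matching "count" case-insensitively is an assumption of this sketch.
    return agg_type is not None and agg_type.lower() == "count" and not has_filter


def emit_count_placeholders(stats: OrcFileStats) -> Iterator[tuple]:
    # Yield one empty tuple per row recorded in the file-level stats.
    # No column data is read; the consumer only counts the tuples it receives.
    for _ in range(stats.number_of_rows):
        yield ()


def read_fragment(
    agg_type: Optional[str],
    has_filter: bool,
    stats: OrcFileStats,
    read_rows: Callable[[], Iterator[tuple]],
) -> Iterator[tuple]:
    # Use the stats-based path when it is safe, otherwise read real tuples.
    if can_use_file_stats(agg_type, has_filter):
        return emit_count_placeholders(stats)
    return read_rows()
```

In this sketch a request with AGG-TYPE set to "count" and no filter yields as many empty tuples as the stats report, while any request carrying a filter falls back to the normal read path, because file-level statistics alone cannot say how many rows satisfy a predicate.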