Details
-
Bug
-
Status: Closed
-
Blocker
-
Resolution: Fixed
-
None
-
None
Description
Right now all the records from ColStats for all columns, for all files are being read to compose the index used in Data Skipping.
In reality, individual queries touch up only a handful of columns at any given moment, so we can very effectively prune the # of records we fetch simply fetching records for the columns referenced in the query (by the key prefix, since CS record key is concatenation of column, partition-path, filename)
Attachments
Issue Links
- links to