Description
When a query uses only metadata (for example, partition keys), it can return results from that metadata without scanning any files. Hive did this in HIVE-1003.
Design document:
https://docs.google.com/document/d/1Bmi4-PkTaBQ0HVaGjIqa3eA12toKX52QaiUyhb6WQiM/edit?usp=sharing
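To make the idea concrete, here is a minimal sketch in plain Python. It is not Spark's implementation; the `Partition` class and both helper functions are hypothetical names invented for illustration. It models a catalog that records partition key values, and answers a `SELECT DISTINCT` on a partition column either from the catalog alone or by looking at the rows in each file. It also shows why the shortcut needs care: a partition whose files hold zero records is still listed in the metadata, which is the kind of mismatch reported in the linked issue SPARK-26709.

```python
from dataclasses import dataclass


@dataclass
class Partition:
    """Hypothetical catalog entry for one partition of a table."""
    keys: dict              # partition column -> value, e.g. {"dt": "2016-01-01"}
    file_row_counts: tuple  # rows per data file; only known by reading the files


def distinct_from_metadata(partitions, column):
    """Answer SELECT DISTINCT <column> from the catalog alone (no file reads)."""
    return sorted({p.keys[column] for p in partitions})


def distinct_from_scan(partitions, column):
    """Answer the same query by scanning: a partition contributes a value
    only if at least one of its files actually contains a row."""
    return sorted(
        {p.keys[column] for p in partitions if sum(p.file_row_counts) > 0}
    )


# Assumed example data: the second partition exists but holds no records.
catalog = [
    Partition(keys={"dt": "2016-01-01"}, file_row_counts=(3, 5)),
    Partition(keys={"dt": "2016-01-02"}, file_row_counts=(0,)),
]

metadata_answer = distinct_from_metadata(catalog, "dt")
scan_answer = distinct_from_scan(catalog, "dt")
print(metadata_answer)
print(scan_answer)
```

In this toy model the two answers differ only because of the empty partition; when every partition holds at least one row, the metadata-only answer matches the scan and avoids reading any data files.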
Issue Links
- causes
  - SPARK-26709 OptimizeMetadataOnlyQuery does not correctly handle the files with zero record (Resolved)
  - SPARK-23523 Incorrect result caused by the rule OptimizeMetadataOnlyQuery (Resolved)
- relates to
  - SPARK-12890 Spark SQL query related to only partition fields should not scan the whole data. (Resolved)
- links to