Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
3.1.0
-
None
-
None
Description
The rule PruneFileSourcePartitions deletes the advances statistics (cardinality, column stats) of the underlying LogicalRelation and keeps only sizeInBytes as that is the only value we can be sure that is accurate.
I think we should keep all statistics as they are a good upper limit estimates and by keeping them other rules that depend on the presence of these statistics (like CostBasedJoinReorder) can still ran.
Attachments
Attachments
Issue Links
- duplicates
-
SPARK-34119 Keep necessary stats after partition pruning
- Resolved
- links to