Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-30516

statistic estimation of FileScan should take partitionFilters and partition number into account

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: In Progress
    • Major
    • Resolution: Unresolved
    • 3.1.0
    • None
    • SQL
    • None

    Description

      Currently, FileScan.estimateStatistics does not take partitionFilters and partition number into account, which may lead to bigger sizeInBytes. It should be reasonable to change it to involve partitionFilters and partition number when estimating the statistics.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              wwg28103 Hu Fuwang
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: