  1. Spark
  2. SPARK-46041

Increase Test Coverage

Details

    • Type: Umbrella
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 4.0.0
    • Fix Version/s: None
    • Component/s: PySpark, Tests
    • Labels: None

    Description

      PySpark test coverage currently stands at approximately 80% as reported on Codecov, which reflects a recent decrease. We need to work on raising it going forward.

      Our objective is to significantly increase our PySpark test coverage to reach >90%.

        Issue Links

          1. Improve test coverage of udf.py | Sub-task | Resolved | Xinrong Meng
          2. Test invalid error class (pyspark.errors.utils) | Sub-task | Resolved | Hyukjin Kwon
          3. Test SparkPandasNotImplementedError (pyspark.pandas.exceptions) | Sub-task | Resolved | Hyukjin Kwon
          4. Test string representation of TestResult (pyspark.mllib.stat.test) | Sub-task | Resolved | Hyukjin Kwon
          5. Improve test coverage of memory_profiler.py | Sub-task | Resolved | Xinrong Meng
          6. Improve test coverage of pyspark utils | Sub-task | Resolved | Xinrong Meng
          7. Test captured errors (pyspark.errors.exceptions.captured) | Sub-task | Resolved | Hyukjin Kwon
          8. Exclude generated files from the code coverage report | Sub-task | Resolved | Hyukjin Kwon
          9. Test catalog error classes (pyspark.sql.catalog) | Sub-task | Resolved | Hyukjin Kwon
          10. Test missing test coverage for Column (pyspark.sql.column) | Sub-task | Resolved | Hyukjin Kwon
          11. Test invalid key for spark.conf.get (pyspark.sql.conf) | Sub-task | Resolved | Hyukjin Kwon
          12. Test missing cases for SparkSession (pyspark.sql.session) | Sub-task | Resolved | Hyukjin Kwon
          13. Exclude ported loose_version from the code coverage report | Sub-task | Resolved | Hyukjin Kwon
          14. Test missing cases for functions (pyspark.sql.functions) | Sub-task | Resolved | Hyukjin Kwon
          15. Test aggregate functions for groups (pyspark.sql.group) | Sub-task | Resolved | Xinrong Meng
          16. Test rangeBetween window function (pyspark.sql.window) | Sub-task | Resolved | Xinrong Meng
          17. Improve and test exceptions of TimedeltaIndex | Sub-task | Resolved | Xinrong Meng
          18. Test custom data source and input partition (pyspark.sql.datasource) | Sub-task | Resolved | Xinrong Meng
          19. Remove unnecessary dependency from test_mixed_udf_and_sql.py | Sub-task | Resolved | Xinrong Meng
          20. Improve and test warning for Arrow-optimized Python UDF | Sub-task | Resolved | Xinrong Meng

            People

              Unassigned Unassigned
              XinrongM Xinrong Meng
              Xinrong Meng Xinrong Meng
              Votes: 0
              Watchers: 2
