Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-23292

python tests related to pandas are skipped with python 2

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Incomplete
    • 2.3.0
    • None
    • Tests

    Description

      I was running python tests and found that pyspark.sql.tests.GroupbyAggPandasUDFTests.test_unsupported_types does not run with Python 2 because the test uses "assertRaisesRegex" (supported by Python 3) instead of "assertRaisesRegexp" (supported by Python 2). However, spark jenkins does not fail because of this issue (see run history at here). After looking into this issue, seems test script will skip tests related to pandas if pandas is not installed, which means that jenkins does not have pandas installed. 
       
      Since pyarrow related tests have the same skipping logic, we will need to check if jenkins has pyarrow installed correctly as well. 
       
      Since features using pandas and pyarrow are in 2.3, we should fix the test issue and make sure all tests pass before we make the release.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              yhuai Yin Huai
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: