Spark / SPARK-48411

Add E2E test for DropDuplicateWithinWatermark

Details

    Description

      Currently we do not have an E2E test for dropDuplicatesWithinWatermark, so we should add one. We can take one of the tests written in Scala (with the testStream API) in the commit below and port it to Python:

      https://github.com/apache/spark/commit/0e9e34c1bd9bd16ad5efca77ce2763eb950f3103

      The change should go in https://github.com/apache/spark/blob/eee179135ed21dbdd8b342d053c9eda849e2de77/python/pyspark/sql/tests/streaming/test_streaming.py#L29 so that the test runs in both Spark Connect and non-Connect modes.

      Test with:

      ```
      python/run-tests --testnames pyspark.sql.tests.streaming.test_streaming
      python/run-tests --testnames pyspark.sql.tests.connect.streaming.test_parity_streaming
      ```
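
As a rough illustration of what the Python port described above might look like, here is a minimal sketch. It is not the test from the linked Scala commit. The mixin name, the query name, the `write_events` helper, and the sample rows are all hypothetical. It assumes the mixin is combined with a test case that provides `self.spark`, which is what lets one test body run in both Connect and non-Connect modes. It also assumes the single input file is processed in one micro-batch, so each distinct id is emitted exactly once.

```python
import os
import tempfile

# Hypothetical sample data: (event time, id) pairs; ids 1 and 2 each appear twice.
SAMPLE_ROWS = [
    ("2024-01-01 00:00:01", 1),
    ("2024-01-01 00:00:02", 1),
    ("2024-01-01 00:00:02", 2),
    ("2024-01-01 00:00:03", 2),
    ("2024-01-01 00:00:03", 3),
]


def write_events(directory, rows, filename="events.csv"):
    """Write (time, id) rows as a ';'-separated CSV file and return its path."""
    path = os.path.join(directory, filename)
    with open(path, "w") as f:
        for time, id_ in rows:
            f.write(f"{time};{id_}\n")
    return path


class DropDuplicatesWithinWatermarkTestsMixin:
    """Hypothetical mixin; the host TestCase is assumed to provide `self.spark`."""

    def test_drop_duplicates_within_watermark(self):
        query_name = "drop_duplicates_within_wm_sketch"  # hypothetical name
        with tempfile.TemporaryDirectory() as input_dir:
            write_events(input_dir, SAMPLE_ROWS)
            # Stream the CSV directory with an explicit schema (DDL string).
            df = (
                self.spark.readStream.option("sep", ";")
                .schema("time TIMESTAMP, id INT")
                .csv(input_dir)
            )
            # Deduplicate on `id` within the watermark and collect into a memory sink.
            query = (
                df.withWatermark("time", "2 seconds")
                .dropDuplicatesWithinWatermark(["id"])
                .writeStream.outputMode("append")
                .format("memory")
                .queryName(query_name)
                .start()
            )
            try:
                self.assertTrue(query.isActive)
                query.processAllAvailable()
            finally:
                query.stop()
            rows = self.spark.sql(f"SELECT id FROM {query_name}").collect()
            # Assumes one micro-batch: each distinct id should appear once.
            self.assertEqual(sorted(r["id"] for r in rows), [1, 2, 3])
```

In the existing test file the method would presumably be added to the shared streaming test mixin rather than to a new class, so that both of the test modules listed above pick it up.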

            People

              liuyuchen777 Yuchen Liu
              WweiL Wei Liu