Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-27032

Flaky test: org.apache.spark.sql.execution.streaming.HDFSMetadataLogSuite.HDFSMetadataLog: metadata directory collision

    XMLWordPrintableJSON

Details

    • Test
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • 3.0.0
    • 3.0.0
    • Spark Core, Tests
    • None

    Description

      Locally and on Jenkins, I've frequently seen this test fail:

      Error Message
      The await method on Waiter timed out.
      Stacktrace
            org.scalatest.exceptions.TestFailedException: The await method on Waiter timed out.
            at org.scalatest.concurrent.Waiters$Waiter.awaitImpl(Waiters.scala:406)
            at org.scalatest.concurrent.Waiters$Waiter.await(Waiters.scala:540)
            at org.apache.spark.sql.execution.streaming.HDFSMetadataLogSuite.$anonfun$new$19(HDFSMetadataLogSuite.scala:158)
            at org.apache.spark.sql.execution.streaming.HDFSMetadataLogSuite.$anonfun$new$19$adapted(HDFSMetadataLogSuite.scala:133)
      ...
      

      See for example https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-maven-hadoop-2.7/6057/testReport/

      There aren't obvious errors or problems with the test. Because it passes sometimes, my guess is that the timeout is simply too short or the test too long. I'd like to try reducing the number of threads/batches in the test.

      Attachments

        Issue Links

          Activity

            People

              srowen Sean R. Owen
              srowen Sean R. Owen
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: