Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-15646 Track failing tests in HDFS
  3. HDFS-15308

TestReconstructStripedFile#testNNSendsErasureCodingTasks fails intermittently

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.3.0
    • 3.4.0
    • erasure-coding
    • Reviewed

    Description

      In HDFS-14353, TestReconstructStripedFile.testNNSendsErasureCodingTasks failed once due to pending reconstruction timeout as follows.

      java.lang.AssertionError: Found 4 timeout pending reconstruction tasks
      	at org.junit.Assert.fail(Assert.java:88)
      	at org.junit.Assert.assertTrue(Assert.java:41)
      	at org.apache.hadoop.hdfs.TestReconstructStripedFile.testNNSendsErasureCodingTasks(TestReconstructStripedFile.java:502)
      	at org.apache.hadoop.hdfs.TestReconstructStripedFile.testNNSendsErasureCodingTasks(TestReconstructStripedFile.java:458)
      	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
      	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
      	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
      	at java.lang.reflect.Method.invoke(Method.java:498)
      	at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
      	at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
      	at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
      	at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
      	at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:298)
      	at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:292)
      	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
      	at java.lang.Thread.run(Thread.java:748)
      

      The error occurred on the following assertion.

      // Make sure that all pending reconstruction tasks can be processed.
      while (ns.getPendingReconstructionBlocks() > 0) {
        long timeoutPending = ns.getNumTimedOutPendingReconstructions();
        assertTrue(String.format("Found %d timeout pending reconstruction tasks",
            timeoutPending), timeoutPending == 0);
        Thread.sleep(1000);
      }
      

      The failure could not be reproduced in the reporter's docker environment (start-build-environment.sh).

      Attachments

        1. HDFS-15308.002.patch
          2 kB
          Hemanth Boyina
        2. HDFS-15308.001.patch
          1 kB
          Hemanth Boyina

        Issue Links

          Activity

            People

              hemanthboyina Hemanth Boyina
              touchida Toshihiko Uchida
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: