Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-15646 Track failing tests in HDFS
  3. HDFS-15308

TestReconstructStripedFile#testNNSendsErasureCodingTasks fails intermittently

    XMLWordPrintableJSON

    Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.3.0
    • Fix Version/s: 3.4.0
    • Component/s: erasure-coding
    • Labels:
    • Hadoop Flags:
      Reviewed

      Description

      In HDFS-14353, TestReconstructStripedFile.testNNSendsErasureCodingTasks failed once due to pending reconstruction timeout as follows.

      java.lang.AssertionError: Found 4 timeout pending reconstruction tasks
      	at org.junit.Assert.fail(Assert.java:88)
      	at org.junit.Assert.assertTrue(Assert.java:41)
      	at org.apache.hadoop.hdfs.TestReconstructStripedFile.testNNSendsErasureCodingTasks(TestReconstructStripedFile.java:502)
      	at org.apache.hadoop.hdfs.TestReconstructStripedFile.testNNSendsErasureCodingTasks(TestReconstructStripedFile.java:458)
      	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
      	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
      	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
      	at java.lang.reflect.Method.invoke(Method.java:498)
      	at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
      	at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
      	at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
      	at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
      	at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:298)
      	at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:292)
      	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
      	at java.lang.Thread.run(Thread.java:748)
      

      The error occurred on the following assertion.

      // Make sure that all pending reconstruction tasks can be processed.
      while (ns.getPendingReconstructionBlocks() > 0) {
        long timeoutPending = ns.getNumTimedOutPendingReconstructions();
        assertTrue(String.format("Found %d timeout pending reconstruction tasks",
            timeoutPending), timeoutPending == 0);
        Thread.sleep(1000);
      }
      

      The failure could not be reproduced in the reporter's docker environment (start-build-environment.sh).

        Attachments

        1. HDFS-15308.002.patch
          2 kB
          Hemanth Boyina
        2. HDFS-15308.001.patch
          1 kB
          Hemanth Boyina

          Issue Links

            Activity

              People

              • Assignee:
                hemanthboyina Hemanth Boyina
                Reporter:
                touchida Toshihiko Uchida
              • Votes:
                0 Vote for this issue
                Watchers:
                9 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: