Hadoop HDFS / HDFS-4941

TestNNThroughputBenchmark fails because of interrupted ReplicationMonitor


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 3.0.0-alpha1
    • Fix Version/s: None
    • Component/s: namenode, test
    • Labels: None

    Description

      This has been breaking trunk test runs pretty consistently.

      HDFS-4840 introduced some new code to BlockManager that terminates the NN if the ReplicationMonitor thread is interrupted unexpectedly:

                if (!namesystem.isRunning()) {
                  LOG.info("Stopping ReplicationMonitor.");
                  if (!(t instanceof InterruptedException)) {
                    LOG.info("ReplicationMonitor received an exception"
                        + " while shutting down.", t);
                  }
                  break;
                }
                LOG.fatal("ReplicationMonitor thread received Runtime exception. ", t);
                terminate(1, t);
      

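      Unfortunately, NNThroughputBenchmark interrupts the ReplicationMonitor thread in ReplicationStats#generateInputs while the namesystem is still running, so the interrupt falls through to terminate(1, t):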

            // stop replication monitor
            BlockManagerTestUtil.getReplicationThread(namesystem.getBlockManager())
                .interrupt();
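
The interaction between the two excerpts above can be reproduced outside Hadoop. The following is only a minimal sketch, not the BlockManager code: the class MonitorInterruptDemo, the method runScenario, and the outcome strings are hypothetical names made up for illustration, and an AtomicBoolean stands in for namesystem.isRunning(). It shows that an interrupt delivered while the "namesystem" is still running takes the fatal branch, whereas the same interrupt during shutdown exits quietly.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical stand-in for the ReplicationMonitor loop; this is not Hadoop code.
class MonitorInterruptDemo {

  /** Starts a monitor-like thread, interrupts it, and reports which branch it took. */
  static String runScenario(boolean namesystemRunning) {
    AtomicBoolean running = new AtomicBoolean(namesystemRunning); // stands in for namesystem.isRunning()
    AtomicReference<String> outcome = new AtomicReference<>("NONE");
    CountDownLatch started = new CountDownLatch(1);

    Thread monitor = new Thread(() -> {
      started.countDown();
      while (true) {
        try {
          Thread.sleep(60_000); // stands in for the pause between replication checks
        } catch (Throwable t) {
          if (!running.get()) {
            outcome.set("STOPPED_QUIETLY"); // shutdown path: log and leave the loop
            break;
          }
          outcome.set("TERMINATE"); // the real code calls terminate(1, t) here
          break;
        }
      }
    });

    monitor.start();
    try {
      started.await();     // make sure the thread body is running
      monitor.interrupt(); // what the benchmark does to stop the replication monitor
      monitor.join();
    } catch (InterruptedException e) {
      throw new IllegalStateException("demo thread was interrupted", e);
    }
    return outcome.get();
  }

  public static void main(String[] args) {
    System.out.println(runScenario(true));  // prints TERMINATE
    System.out.println(runScenario(false)); // prints STOPPED_QUIETLY
  }
}
```

In the benchmark the namesystem is still running when the interrupt arrives, which corresponds to the first scenario.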
      

            People

              Assignee: Unassigned
              Reporter: Andrew Wang (andrew.wang)
              Votes: 0
              Watchers: 2
