Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-4426

Secondary namenode shuts down immediately after startup

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • 2.0.3-alpha, 0.23.6
    • 2.0.3-alpha, 0.23.6, 0.23.7
    • namenode
    • None

    Description

      After HADOOP-9181 went in, the secondary namenode immediately shuts down after it is started. From the startup logs:

      2013-01-22 19:54:28,826 INFO  namenode.SecondaryNameNode (SecondaryNameNode.java:initialize(299)) - Checkpoint Period   :3600 secs (60 min)
      2013-01-22 19:54:28,826 INFO  namenode.SecondaryNameNode (SecondaryNameNode.java:initialize(301)) - Log Size Trigger    :40000 txns
      2013-01-22 19:54:28,845 INFO  namenode.SecondaryNameNode (StringUtils.java:run(616)) - SHUTDOWN_MSG: 
      /************************************************************
      SHUTDOWN_MSG: Shutting down SecondaryNameNode at xx
      ************************************************************/
      

      I looked into the issue, and it's shutting down because SecondaryNameNode.main starts a bunch of daemon threads then returns. With nothing but daemon threads remaining, the JVM sees no reason to keep going and proceeds to shutdown. Apparently we were implicitly relying on the fact that the HttpServer QueuedThreadPool threads were not daemon threads to keep the secondary namenode process up.

      Attachments

        1. HDFS-4426.1.patch
          1 kB
          Arpit Agarwal
        2. HDFS-4426.branch-23.patch
          18 kB
          Suresh Srinivas
        3. HDFS-4426.patch
          0.9 kB
          Suresh Srinivas
        4. HDFS-4426.patch
          0.9 kB
          Suresh Srinivas
        5. HDFS-4426.patch
          1 kB
          Arpit Agarwal

        Issue Links

          Activity

            People

              arp Arpit Agarwal
              jlowe Jason Darrell Lowe
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: