  1. Hadoop Common
  2. HADOOP-1695

Secondary Namenode halt when SocketTimeoutException at startup

Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 0.12.3
    • Fix Version/s: 0.15.0
    • Component/s: None
    • Labels: None

    Description

      When the namenode and the secondary namenode are started at the same time, the primary namenode is usually busy handling block reports.
      If the secondary namenode fails to connect at startup, it crashes, leaving the following exception in the .out file.
      I would like it to catch the exception and retry later instead.

      Exception in thread "main" java.net.SocketTimeoutException: timed out waiting for rpc response
      at org.apache.hadoop.ipc.Client.call(Client.java:471)
      at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:163)
      at org.apache.hadoop.dfs.$Proxy0.getProtocolVersion(Unknown Source)
      at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:247)
      at org.apache.hadoop.dfs.SecondaryNameNode.<init>(SecondaryNameNode.java:96)
      at org.apache.hadoop.dfs.SecondaryNameNode.main(SecondaryNameNode.java:474)
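
The requested behavior (catch the timeout and retry later instead of exiting) can be sketched as below. This is only an illustration of the idea, not the contents of the attached patch: the class `RetryOnTimeout`, the method `connectWithRetry`, and its parameters are hypothetical names, and the `Callable` stands in for whatever call the secondary namenode makes to reach the primary at startup (`RPC.getProxy` in the trace above).

```java
import java.net.SocketTimeoutException;
import java.util.concurrent.Callable;

public class RetryOnTimeout {
    /**
     * Hypothetical helper: invokes connect until it succeeds, sleeping between
     * attempts when it times out. Exceptions other than SocketTimeoutException
     * propagate immediately; the last timeout is rethrown once attempts run out.
     */
    public static <T> T connectWithRetry(Callable<T> connect, int maxAttempts, long sleepMillis)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        SocketTimeoutException last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return connect.call();
            } catch (SocketTimeoutException e) {
                last = e;
                System.err.println("Attempt " + attempt + " timed out: " + e.getMessage());
                if (attempt < maxAttempts) {
                    Thread.sleep(sleepMillis); // give the busy server time to catch up
                }
            }
        }
        throw last;
    }

    public static void main(String[] args) throws Exception {
        // Simulated server that times out twice before answering.
        final int[] calls = {0};
        String result = connectWithRetry(() -> {
            if (++calls[0] < 3) {
                throw new SocketTimeoutException("timed out waiting for rpc response");
            }
            return "connected";
        }, 5, 10L);
        System.out.println(result + " after " + calls[0] + " attempts");
    }
}
```

Bounding the number of attempts is a design choice made for the sketch; a daemon that should wait indefinitely for the primary could loop without a limit instead.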

      Attachments

        1. socketTimeout2.patch
          0.8 kB
          Dhruba Borthakur

          People

            Assignee: Dhruba Borthakur (dhruba)
            Reporter: Koji Noguchi (knoguchi)