Description
Noticed this in a run of the 1.7.0 ITs against an Ambari-installed cluster.
Cluster was stop and restarted, but the master got stuck stopping and didnt' come back up correctly.
2015-05-13 09:28:14,093 [standalone.StandaloneClusterControl] INFO : Running: '/usr/hdp/current/accumulo-client/bin/accumulo org.apache.accumulo.master.state.SetGoalState 'NORMAL'' on $hostname 2015-05-13 09:28:14,093 [cluster.RemoteShell] DEBUG: Executing full command [/usr/bin/ssh -o StrictHostKeyChecking=no $hostname "/usr/hdp/current/accumulo-client/bin/accumulo org.apache.accumulo.master.state.SetGoalState 'NORMAL'"] 2015-05-13 09:28:19,164 [standalone.StandaloneClusterControl] ERROR: Failed to run command ExitCodeException exitCode=1: stderr: , stdout: 2015-05-13 09:28:18,268 [fs.VolumeManagerImpl] WARN : dfs.datanode.synconclose set to false in hdfs-site.xml: data loss is possible on hard system reset or power loss 2015-05-13 09:28:18,278 [server.Accumulo] INFO : Attempting to talk to zookeeper 2015-05-13 09:28:18,444 [server.Accumulo] INFO : ZooKeeper connected and initialized, attempting to talk to HDFS 2015-05-13 09:28:18,639 [server.Accumulo] INFO : Connected to HDFS 2015-05-13 09:28:18,736 [start.Main] ERROR: Thread 'org.apache.accumulo.master.state.SetGoalState' died. org.apache.zookeeper.KeeperException$NoAuthException: KeeperErrorCode = NoAuth for /accumulo/ca8f1eff-042c-46b6-9365-261e98fc6f0e/masters/goal_state at org.apache.zookeeper.KeeperException.create(KeeperException.java:113) at org.apache.zookeeper.KeeperException.create(KeeperException.java:51) at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:783) at org.apache.accumulo.fate.zookeeper.ZooUtil.putData(ZooUtil.java:288) at org.apache.accumulo.fate.zookeeper.ZooUtil.putPersistentData(ZooUtil.java:267) at org.apache.accumulo.fate.zookeeper.ZooReaderWriter.putPersistentData(ZooReaderWriter.java:68) at org.apache.accumulo.master.state.SetGoalState.main(SetGoalState.java:47) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:606) at org.apache.accumulo.start.Main$2.run(Main.java:130) at java.lang.Thread.run(Thread.java:745) at org.apache.accumulo.cluster.standalone.StandaloneClusterControl.exec(StandaloneClusterControl.java:88) at org.apache.accumulo.cluster.standalone.StandaloneClusterControl.execWithStdout(StandaloneClusterControl.java:113) at org.apache.accumulo.cluster.standalone.StandaloneClusterControl.exec(StandaloneClusterControl.java:98) at org.apache.accumulo.cluster.standalone.StandaloneAccumuloCluster.start(StandaloneAccumuloCluster.java:131) at org.apache.accumulo.test.functional.CompactionIT.test(CompactionIT.java:172) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:606) at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47) at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12) at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44) at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17) at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26) at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27) at org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
Need to make sure we set the correct configuration directory so the program can write to ZooKeeper w/ the necessary ACL.