Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
1.3.0
Description
When trying to add a new OM to the already working cluster, the ozone om --bootstrap command fails with the following:
2022-12-22 17:23:09,306 [Listener at ...] ERROR protocolPB.OMAdminProtocolClientSideImpl: Failed to retrieve configuration of OM om3 com.google.protobuf.ServiceException: org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.security.authorize.AuthorizationException): Unknown protocol: org.apache.hadoop.ozone.om.protocol.OzoneManagerMetadataProtocol at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:250) at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:123) at com.sun.proxy.$Proxy53.getOMConfiguration(Unknown Source) at sun.reflect.GeneratedMethodAccessor13.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:498) at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:422) at org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invokeMethod(RetryInvocationHandler.java:165) at org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invoke(RetryInvocationHandler.java:157) at org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invokeOnce(RetryInvocationHandler.java:95) at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:359) at com.sun.proxy.$Proxy53.getOMConfiguration(Unknown Source) at org.apache.hadoop.ozone.om.protocolPB.OMAdminProtocolClientSideImpl.getOMConfiguration(OMAdminProtocolClientSideImpl.java:156) at org.apache.hadoop.ozone.om.OzoneManager.checkConfigBeforeBootstrap(OzoneManager.java:1643) at org.apache.hadoop.ozone.om.OzoneManager.start(OzoneManager.java:1470) at org.apache.hadoop.ozone.om.OzoneManagerStarter$OMStarterHelper.bootstrap(OzoneManagerStarter.java:223) at org.apache.hadoop.ozone.om.OzoneManagerStarter.bootstrapOM(OzoneManagerStarter.java:162) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:498) at picocli.CommandLine.executeUserObject(CommandLine.java:1972) at picocli.CommandLine.access$1300(CommandLine.java:145) at picocli.CommandLine$RunLast.executeUserObjectOfLastSubcommandWithSameParent(CommandLine.java:2352) at picocli.CommandLine$RunLast.handle(CommandLine.java:2346) at picocli.CommandLine$RunLast.handle(CommandLine.java:2311) at picocli.CommandLine$AbstractParseResultHandler.execute(CommandLine.java:2179) at picocli.CommandLine.execute(CommandLine.java:2078) at org.apache.hadoop.hdds.cli.GenericCli.execute(GenericCli.java:100) at org.apache.hadoop.hdds.cli.GenericCli.run(GenericCli.java:91) at org.apache.hadoop.ozone.om.OzoneManagerStarter.main(OzoneManagerStarter.java:58) Caused by: org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.security.authorize.AuthorizationException): Unknown protocol: org.apache.hadoop.ozone.om.protocol.OzoneManagerMetadataProtocol at org.apache.hadoop.ipc.Client.getRpcResponse(Client.java:1573) at org.apache.hadoop.ipc.Client.call(Client.java:1519) at org.apache.hadoop.ipc.Client.call(Client.java:1416) at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:236) ... 30 more 2022-12-22 17:23:09,310 [Listener at ...] ERROR om.OzoneManager: Remote OM config check failed on OM om3 java.io.IOException: Remote OM om3 configuration returned null at org.apache.hadoop.ozone.om.OzoneManager.checkRemoteOMConfig(OzoneManager.java:1665) at org.apache.hadoop.ozone.om.OzoneManager.checkConfigBeforeBootstrap(OzoneManager.java:1644) at org.apache.hadoop.ozone.om.OzoneManager.start(OzoneManager.java:1470) at org.apache.hadoop.ozone.om.OzoneManagerStarter$OMStarterHelper.bootstrap(OzoneManagerStarter.java:223) at org.apache.hadoop.ozone.om.OzoneManagerStarter.bootstrapOM(OzoneManagerStarter.java:162) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:498) at picocli.CommandLine.executeUserObject(CommandLine.java:1972) at picocli.CommandLine.access$1300(CommandLine.java:145) at picocli.CommandLine$RunLast.executeUserObjectOfLastSubcommandWithSameParent(CommandLine.java:2352) at picocli.CommandLine$RunLast.handle(CommandLine.java:2346) at picocli.CommandLine$RunLast.handle(CommandLine.java:2311) at picocli.CommandLine$AbstractParseResultHandler.execute(CommandLine.java:2179) at picocli.CommandLine.execute(CommandLine.java:2078) at org.apache.hadoop.hdds.cli.GenericCli.execute(GenericCli.java:100) at org.apache.hadoop.hdds.cli.GenericCli.run(GenericCli.java:91) at org.apache.hadoop.ozone.om.OzoneManagerStarter.main(OzoneManagerStarter.java:58) 2022-12-22 17:23:19,457 [Listener at .../9862] ERROR om.OzoneManager: Terminating with exit status 1: OM(s) [om1[:9862],om3[:9862],om2[:9862]] do not have or have incorrect information of the bootstrapping OM. Update their ozone-site.xml before proceeding.
The failure happens during the config verification phase due to incorrect protocol description in org.apache.hadoop.ozone.om.protocolPB.OMAdminProtocolPB.
Instead of org.apache.hadoop.ozone.om.protocol.OMAdminProtocol it uses org.apache.hadoop.ozone.om.protocol.OzoneManagerMetadataProtocol which leads to a ClassNotFoundException on every checked node
Attachments
Attachments
Issue Links
- relates to
-
HDDS-4330 Bootstrap new OM node
- Resolved
- links to