Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-3156

Tez client keeps trying to talk to RM even if RM does not know about the application

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.7.1, 0.8.3
    • Component/s: None
    • Labels:
      None

      Description

      Scenario :

      • Set RM/NM recovery to false.
         <property>
              <name>yarn.resourcemanager.recovery.enabled</name>
              <value>false</value>
            </property>
         <property>
              <name>yarn.nodemanager.recovery.enabled</name>
              <value>false</value>
            </property>
        
      • Start Mrrsleep application (application_1456883132071_0001)
        hadoop jar tez-tests-*.jar mrrsleep -m 1 -r 1 -mt 1000000 -rt 1000
        
      • When application is running, restart RM

      Since recovery is disabled and RM is restarted, it forgets mrrsleep application. At this point, mrrsleep application's tez-client keep trying to communicate with RM and loads RM with below exception.

      2016-03-02 02:01:24,708 INFO  ipc.Server (Server.java:run(2172)) - IPC Server handler 18 on 8050, call org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplicationReport from xx.xx.xx.xxx:36191 Call#500250 Retry#0
      org.apache.hadoop.yarn.exceptions.ApplicationNotFoundException: Application with id 'application_1456883132071_0001' doesn't exist in RM.
      	at org.apache.hadoop.yarn.server.resourcemanager.ClientRMService.getApplicationReport(ClientRMService.java:328)
      	at org.apache.hadoop.yarn.api.impl.pb.service.ApplicationClientProtocolPBServiceImpl.getApplicationReport(ApplicationClientProtocolPBServiceImpl.java:175)
      	at org.apache.hadoop.yarn.proto.ApplicationClientProtocol$ApplicationClientProtocolService$2.callBlockingMethod(ApplicationClientProtocol.java:417)
      	at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616)
      	at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969)
      	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2151)
      	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2147)
      	at java.security.AccessController.doPrivileged(Native Method)
      	at javax.security.auth.Subject.doAs(Subject.java:415)
      	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
      	at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2145)
      2016-03-02 02:01:24,709 INFO  ipc.Server (Server.java:run(2172)) - IPC Server handler 27 on 8050, call org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplicationReport from xx.xx.xx.xxx:36191 Call#500251 Retry#0
      org.apache.hadoop.yarn.exceptions.ApplicationNotFoundException: Application with id 'application_1456883132071_0001' doesn't exist in RM.
      	at org.apache.hadoop.yarn.server.resourcemanager.ClientRMService.getApplicationReport(ClientRMService.java:328)
      	at org.apache.hadoop.yarn.api.impl.pb.service.ApplicationClientProtocolPBServiceImpl.getApplicationReport(ApplicationClientProtocolPBServiceImpl.java:175)
      	at org.apache.hadoop.yarn.proto.ApplicationClientProtocol$ApplicationClientProtocolService$2.callBlockingMethod(ApplicationClientProtocol.java:417)
      	at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616)
      	at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969)
      	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2151)
      	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2147)
      	at java.security.AccessController.doPrivileged(Native Method)
      	at javax.security.auth.Subject.doAs(Subject.java:415)
      	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
      	at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2145)
      

        Attachments

        1. TEZ-3156.2.patch
          14 kB
          Hitesh Shah
        2. TEZ-3156.1.patch
          9 kB
          Hitesh Shah

          Issue Links

            Activity

              People

              • Assignee:
                hitesh Hitesh Shah
                Reporter:
                yeshavora Yesha Vora
              • Votes:
                0 Vote for this issue
                Watchers:
                4 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: