Description
In some cases (for example, when an RM container allocation fails), com.datatorrent.stram.StreamingAppMasterService.execute() calculates numRequestedContainers incorrectly, which prevents an application from shutting down when shutdown is requested externally. One example: when we ask the RM to remove a previous container allocation request and add a new one, the count is incremented for the new request (correct) but is NOT decremented for the removed request (incorrect). Another example is the "alreadyAllocated" case, where we release the container but still increment numRequestedContainers, which seems wrong.
This bug is showing up in multiple Apex deployments.
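The expected bookkeeping can be sketched roughly as follows. This is a minimal, hypothetical model, not the actual StreamingAppMasterService code: the class ContainerRequestCounter, its method names, and the string request ids are all assumptions made for illustration. It only shows the invariant described above, namely that every removed request decrements the count and that releasing an already allocated container does not increment it.

```java
import java.util.HashSet;
import java.util.Set;

/**
 * Hypothetical model of the request bookkeeping described in this issue.
 * Names and structure are assumptions; this is not the Apex implementation.
 */
class ContainerRequestCounter {
    // Ids of requests that were sent to the RM and are still outstanding.
    private final Set<String> pending = new HashSet<>();
    private int numRequestedContainers = 0;

    /** Adding a new request increments the count once. */
    void addRequest(String requestId) {
        if (pending.add(requestId)) {
            numRequestedContainers++;
        }
    }

    /** Removing a previous request must decrement the count. */
    void removeRequest(String requestId) {
        if (pending.remove(requestId)) {
            numRequestedContainers--;
        }
    }

    /** Remove-then-add: the net change in the count is zero. */
    void replaceRequest(String oldRequestId, String newRequestId) {
        removeRequest(oldRequestId);
        addRequest(newRequestId);
    }

    /**
     * "alreadyAllocated" case (assumed behavior): the container is released
     * and no new request is issued, so the count stays unchanged.
     */
    void releaseAlreadyAllocated(String containerId) {
        // Intentionally no change to numRequestedContainers.
    }

    int getNumRequestedContainers() {
        return numRequestedContainers;
    }
}
```

With this model, replacing request "r1" with "r2" leaves the count at 1, and removing "r2" brings it back to 0, so a shutdown check that depends on the count is not blocked by a stale value.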
Issue Links
- breaks APEXCORE-737 AppMaster does not shut down because numRequestedContainers becomes negative (Closed)
- links to