Hadoop YARN / YARN-2821

Distributed shell app master becomes unresponsive sometimes

    Details

    • Target Version/s:
    • Hadoop Flags:
      Reviewed

      Description

      We've noticed that once in a while the distributed shell app master becomes unresponsive and is eventually killed by the RM. A snippet of the logs:

      14/11/04 18:21:37 INFO distributedshell.ApplicationMaster: appattempt_1415123350094_0017_000001 received 0 previous attempts' running containers on AM registration.
      14/11/04 18:21:37 INFO distributedshell.ApplicationMaster: Requested container ask: Capability[<memory:10, vCores:1>]Priority[0]
      14/11/04 18:21:37 INFO distributedshell.ApplicationMaster: Requested container ask: Capability[<memory:10, vCores:1>]Priority[0]
      14/11/04 18:21:37 INFO distributedshell.ApplicationMaster: Requested container ask: Capability[<memory:10, vCores:1>]Priority[0]
      14/11/04 18:21:37 INFO distributedshell.ApplicationMaster: Requested container ask: Capability[<memory:10, vCores:1>]Priority[0]
      14/11/04 18:21:37 INFO distributedshell.ApplicationMaster: Requested container ask: Capability[<memory:10, vCores:1>]Priority[0]
      14/11/04 18:21:38 INFO impl.AMRMClientImpl: Received new token for : onprem-tez2:45454
      14/11/04 18:21:38 INFO distributedshell.ApplicationMaster: Got response from RM for container ask, allocatedCnt=1
      14/11/04 18:21:38 INFO distributedshell.ApplicationMaster: Launching shell command on a new container., containerId=container_1415123350094_0017_01_000002, containerNode=onprem-tez2:45454, containerNodeURI=onprem-tez2:50060, containerResourceMemory1024, containerResourceVirtualCores1
      14/11/04 18:21:38 INFO distributedshell.ApplicationMaster: Setting up container launch container for containerid=container_1415123350094_0017_01_000002
      14/11/04 18:21:39 INFO impl.NMClientAsyncImpl: Processing Event EventType: START_CONTAINER for Container container_1415123350094_0017_01_000002
      14/11/04 18:21:39 INFO impl.ContainerManagementProtocolProxy: Opening proxy : onprem-tez2:45454
      14/11/04 18:21:39 INFO impl.NMClientAsyncImpl: Processing Event EventType: QUERY_CONTAINER for Container container_1415123350094_0017_01_000002
      14/11/04 18:21:39 INFO impl.ContainerManagementProtocolProxy: Opening proxy : onprem-tez2:45454
      14/11/04 18:21:39 INFO impl.AMRMClientImpl: Received new token for : onprem-tez3:45454
      14/11/04 18:21:39 INFO impl.AMRMClientImpl: Received new token for : onprem-tez4:45454
      14/11/04 18:21:39 INFO distributedshell.ApplicationMaster: Got response from RM for container ask, allocatedCnt=3
      14/11/04 18:21:39 INFO distributedshell.ApplicationMaster: Launching shell command on a new container., containerId=container_1415123350094_0017_01_000003, containerNode=onprem-tez2:45454, containerNodeURI=onprem-tez2:50060, containerResourceMemory1024, containerResourceVirtualCores1
      14/11/04 18:21:39 INFO distributedshell.ApplicationMaster: Launching shell command on a new container., containerId=container_1415123350094_0017_01_000004, containerNode=onprem-tez3:45454, containerNodeURI=onprem-tez3:50060, containerResourceMemory1024, containerResourceVirtualCores1
      14/11/04 18:21:39 INFO distributedshell.ApplicationMaster: Launching shell command on a new container., containerId=container_1415123350094_0017_01_000005, containerNode=onprem-tez4:45454, containerNodeURI=onprem-tez4:50060, containerResourceMemory1024, containerResourceVirtualCores1
      14/11/04 18:21:39 INFO distributedshell.ApplicationMaster: Setting up container launch container for containerid=container_1415123350094_0017_01_000003
      14/11/04 18:21:39 INFO distributedshell.ApplicationMaster: Setting up container launch container for containerid=container_1415123350094_0017_01_000005
      14/11/04 18:21:39 INFO distributedshell.ApplicationMaster: Setting up container launch container for containerid=container_1415123350094_0017_01_000004
      14/11/04 18:21:39 INFO impl.NMClientAsyncImpl: Processing Event EventType: START_CONTAINER for Container container_1415123350094_0017_01_000005
      14/11/04 18:21:39 INFO impl.NMClientAsyncImpl: Processing Event EventType: START_CONTAINER for Container container_1415123350094_0017_01_000003
      14/11/04 18:21:39 INFO impl.ContainerManagementProtocolProxy: Opening proxy : onprem-tez4:45454
      14/11/04 18:21:39 INFO impl.ContainerManagementProtocolProxy: Opening proxy : onprem-tez2:45454
      14/11/04 18:21:39 INFO impl.NMClientAsyncImpl: Processing Event EventType: START_CONTAINER for Container container_1415123350094_0017_01_000004
      14/11/04 18:21:39 INFO impl.ContainerManagementProtocolProxy: Opening proxy : onprem-tez3:45454
      14/11/04 18:21:39 INFO impl.NMClientAsyncImpl: Processing Event EventType: QUERY_CONTAINER for Container container_1415123350094_0017_01_000005
      14/11/04 18:21:39 INFO impl.ContainerManagementProtocolProxy: Opening proxy : onprem-tez4:45454
      14/11/04 18:21:39 INFO impl.NMClientAsyncImpl: Processing Event EventType: QUERY_CONTAINER for Container container_1415123350094_0017_01_000003
      14/11/04 18:21:39 INFO impl.ContainerManagementProtocolProxy: Opening proxy : onprem-tez2:45454
      14/11/04 18:21:39 INFO impl.NMClientAsyncImpl: Processing Event EventType: QUERY_CONTAINER for Container container_1415123350094_0017_01_000004
      14/11/04 18:21:39 INFO impl.ContainerManagementProtocolProxy: Opening proxy : onprem-tez3:45454
      14/11/04 18:21:40 INFO distributedshell.ApplicationMaster: Got response from RM for container ask, completedCnt=1
      14/11/04 18:21:40 INFO distributedshell.ApplicationMaster: appattempt_1415123350094_0017_000001 got container status for containerID=container_1415123350094_0017_01_000002, state=COMPLETE, exitStatus=0, diagnostics=
      14/11/04 18:21:40 INFO distributedshell.ApplicationMaster: Container completed successfully., containerId=container_1415123350094_0017_01_000002
      14/11/04 18:21:40 INFO distributedshell.ApplicationMaster: Got response from RM for container ask, allocatedCnt=2
      14/11/04 18:21:40 INFO distributedshell.ApplicationMaster: Launching shell command on a new container., containerId=container_1415123350094_0017_01_000006, containerNode=onprem-tez2:45454, containerNodeURI=onprem-tez2:50060, containerResourceMemory1024, containerResourceVirtualCores1
      14/11/04 18:21:40 INFO distributedshell.ApplicationMaster: Launching shell command on a new container., containerId=container_1415123350094_0017_01_000007, containerNode=onprem-tez3:45454, containerNodeURI=onprem-tez3:50060, containerResourceMemory1024, containerResourceVirtualCores1
      14/11/04 18:21:40 INFO distributedshell.ApplicationMaster: Setting up container launch container for containerid=container_1415123350094_0017_01_000007
      14/11/04 18:21:40 INFO distributedshell.ApplicationMaster: Setting up container launch container for containerid=container_1415123350094_0017_01_000006
      
      1. apache-yarn-2821.0.patch
        9 kB
        Varun Vasudev
      2. apache-yarn-2821.1.patch
        15 kB
        Varun Vasudev
      3. YARN-2821.002.patch
        23 kB
        Varun Vasudev
      4. YARN-2821.003.patch
        22 kB
        Varun Vasudev
      5. YARN-2821.004.patch
        11 kB
        Varun Vasudev
      6. YARN-2821.005.patch
        12 kB
        Varun Vasudev

        Activity

        vvasudev Varun Vasudev added a comment -

        The root cause appears to be an unexpected over-allocation. In this case the app master was allocated one more container than it expected and went into an infinite loop in the finish function. With regard to the extra container, it's possible we're seeing a variant of YARN-110. Unfortunately, the RM doesn't log asks, so we can't tell the sequence of asks that led to the extra allocation.
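
One way to guard against the over-allocation described above is to cap the number of containers the app master accepts and hand any surplus back to the RM. The sketch below is not taken from the attached patches: `AllocationGuard`, its method names, and the string container IDs are hypothetical stand-ins for the AM's real bookkeeping.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical, simplified stand-in for the AM's allocation bookkeeping. */
class AllocationGuard {
  private final int numTotalContainers;
  private final AtomicInteger numAllocatedContainers = new AtomicInteger();

  AllocationGuard(int numTotalContainers) {
    this.numTotalContainers = numTotalContainers;
  }

  /**
   * Splits a batch of newly allocated container IDs into those to launch
   * (appended to toLaunch) and those to give back. Returns the IDs to release.
   */
  List<String> accept(List<String> allocated, List<String> toLaunch) {
    List<String> toRelease = new ArrayList<>();
    for (String id : allocated) {
      // Claim a slot atomically; undo the claim if we are over the target.
      if (numAllocatedContainers.incrementAndGet() > numTotalContainers) {
        numAllocatedContainers.decrementAndGet();
        toRelease.add(id);
      } else {
        toLaunch.add(id);
      }
    }
    return toRelease;
  }

  int allocated() {
    return numAllocatedContainers.get();
  }
}
```

In the real app master the returned IDs would presumably be passed to the AM-RM client's release call rather than collected in a list; that wiring is left out here.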

        vvasudev Varun Vasudev added a comment -

        Uploaded patch with fix.

        hadoopqa Hadoop QA added a comment -

        +1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12679948/apache-yarn-2821.0.patch
        against trunk revision 1670578.

        +1 @author. The patch does not contain any @author tags.

        +1 tests included. The patch appears to include 1 new or modified test files.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        +1 javadoc. There were no new javadoc warning messages.

        +1 eclipse:eclipse. The patch built with eclipse:eclipse.

        +1 findbugs. The patch does not introduce any new Findbugs (version 2.0.3) warnings.

        +1 release audit. The applied patch does not increase the total number of release audit warnings.

        +1 core tests. The patch passed unit tests in hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell.

        +1 contrib tests. The patch passed contrib unit tests.

        Test results: https://builds.apache.org/job/PreCommit-YARN-Build/5757//testReport/
        Console output: https://builds.apache.org/job/PreCommit-YARN-Build/5757//console

        This message is automatically generated.

        jianhe Jian He added a comment -

        Thanks, Varun! The patch should solve the problem by releasing the extra allocated containers, but it seems that in some cases we may still run into the infinite loop.
        In finish(), it checks numCompletedContainers.get() != numTotalContainers, but numCompletedContainers can be incremented elsewhere; for example, onStartContainerError also increments it. Could numCompletedContainers go beyond numTotalContainers in such a scenario as well?
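
To illustrate the concern, here is a minimal model of the two possible wait conditions. It only demonstrates why an inequality test never terminates once the counter overshoots; it is not the actual finish() code, and `CompletionCheck` and its method names are hypothetical.

```java
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical model of the completion check discussed above. */
class CompletionCheck {
  /** Mirrors the quoted comparison: true means "keep waiting". */
  static boolean waitsWithInequality(AtomicInteger completed, int total) {
    return completed.get() != total;
  }

  /** Defensive variant: stops waiting once the counter reaches or passes total. */
  static boolean waitsWithLessThan(AtomicInteger completed, int total) {
    return completed.get() < total;
  }
}
```

With a counter of 3 and a target of 2, the first check keeps waiting indefinitely while the second stops. Whether to change the comparison or to prevent the overshoot in the first place is a separate design question.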

        vvasudev Varun Vasudev added a comment -

        Thanks for the review, Jian! I thought about changing the comparison, but it feels like treating the symptom. I'd like to get it to work right without changing that if possible.

        Thanks for pointing out the increment in onStartContainerError. I've addressed that and made some more fixes in the latest patch.

        hadoopqa Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12680098/apache-yarn-2821.1.patch
        against trunk revision 61effcb.

        +1 @author. The patch does not contain any @author tags.

        +1 tests included. The patch appears to include 1 new or modified test files.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        +1 javadoc. There were no new javadoc warning messages.

        +1 eclipse:eclipse. The patch built with eclipse:eclipse.

        +1 findbugs. The patch does not introduce any new Findbugs (version 2.0.3) warnings.

        +1 release audit. The applied patch does not increase the total number of release audit warnings.

        -1 core tests. The patch failed these unit tests in hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell:

        org.apache.hadoop.yarn.applications.distributedshell.TestDistributedShell

        +1 contrib tests. The patch passed contrib unit tests.

        Test results: https://builds.apache.org/job/PreCommit-YARN-Build/5777//testReport/
        Console output: https://builds.apache.org/job/PreCommit-YARN-Build/5777//console

        This message is automatically generated.

        jianhe Jian He added a comment -
        • We can use Collections.synchronizedSet(new HashSet()); instead of a ConcurrentHashMap.
        • An entry in releasedContainers, once added, is never removed.
        • I found that in this case the container will actually expire and the exit status is ABORTED, and numCompletedContainers will not be incremented in onContainersCompleted:
                // no need to update numCompletedContainers because we'll get
                // a notification from the RM anyway and we'll handle it there
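
The first two points can be sketched together: a synchronized set that tracks released containers and drops each entry once its completion report has been handled. This is an illustration only; `ReleasedContainerTracker`, its methods, and the use of string IDs are assumptions, not code from the patch.

```java
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

/** Hypothetical tracker for containers the AM has asked the RM to release. */
class ReleasedContainerTracker {
  // Thread-safe wrapper around a plain HashSet, as suggested in the review.
  private final Set<String> releasedContainers =
      Collections.synchronizedSet(new HashSet<String>());

  void markReleased(String containerId) {
    releasedContainers.add(containerId);
  }

  /**
   * Called when a completion report arrives. Returns true if the container
   * was one we released ourselves; the entry is removed so the set cannot
   * grow without bound.
   */
  boolean consumeIfReleased(String containerId) {
    return releasedContainers.remove(containerId);
  }

  int size() {
    return releasedContainers.size();
  }
}
```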
          
        hadoopqa Hadoop QA added a comment -



        -1 overall



        Vote | Subsystem | Runtime | Comment
        0 | pre-patch | 14m 44s | Pre-patch trunk compilation is healthy.
        +1 | @author | 0m 0s | The patch does not contain any @author tags.
        +1 | tests included | 0m 0s | The patch appears to include 1 new or modified test files.
        +1 | javac | 7m 33s | There were no new javac warning messages.
        +1 | javadoc | 9m 33s | There were no new javadoc warning messages.
        +1 | release audit | 0m 22s | The applied patch does not increase the total number of release audit warnings.
        -1 | checkstyle | 0m 22s | The applied patch generated 3 new checkstyle issues (total was 46, now 49).
        +1 | whitespace | 0m 0s | The patch has no lines that end in whitespace.
        +1 | install | 1m 34s | mvn install still works.
        +1 | eclipse:eclipse | 0m 33s | The patch built with eclipse:eclipse.
        +1 | findbugs | 0m 36s | The patch does not introduce any new Findbugs (version 2.0.3) warnings.
        +1 | yarn tests | 6m 58s | Tests passed in hadoop-yarn-applications-distributedshell.
        | | 42m 18s |



        Subsystem | Report/Notes
        Patch URL | http://issues.apache.org/jira/secure/attachment/12680098/apache-yarn-2821.1.patch
        Optional Tests | javadoc javac unit findbugs checkstyle
        git revision | trunk / e8d0ee5
        checkstyle | https://builds.apache.org/job/PreCommit-YARN-Build/7669/artifact/patchprocess/diffcheckstylehadoop-yarn-applications-distributedshell.txt
        hadoop-yarn-applications-distributedshell test log | https://builds.apache.org/job/PreCommit-YARN-Build/7669/artifact/patchprocess/testrun_hadoop-yarn-applications-distributedshell.txt
        Test Results | https://builds.apache.org/job/PreCommit-YARN-Build/7669/testReport/
        Java | 1.7.0_55
        uname | Linux asf902.gq1.ygridcore.net 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
        Console output | https://builds.apache.org/job/PreCommit-YARN-Build/7669/console

        This message was automatically generated.

        hadoopqa Hadoop QA added a comment -



        -1 overall



        Vote | Subsystem | Runtime | Comment
        0 | pre-patch | 14m 37s | Pre-patch trunk compilation is healthy.
        +1 | @author | 0m 0s | The patch does not contain any @author tags.
        +1 | tests included | 0m 1s | The patch appears to include 1 new or modified test files.
        +1 | javac | 7m 35s | There were no new javac warning messages.
        +1 | javadoc | 9m 32s | There were no new javadoc warning messages.
        +1 | release audit | 0m 23s | The applied patch does not increase the total number of release audit warnings.
        -1 | checkstyle | 0m 20s | The applied patch generated 3 new checkstyle issues (total was 47, now 50).
        +1 | whitespace | 0m 1s | The patch has no lines that end in whitespace.
        +1 | install | 1m 32s | mvn install still works.
        +1 | eclipse:eclipse | 0m 33s | The patch built with eclipse:eclipse.
        +1 | findbugs | 0m 36s | The patch does not introduce any new Findbugs (version 2.0.3) warnings.
        +1 | yarn tests | 7m 1s | Tests passed in hadoop-yarn-applications-distributedshell.
        | | 42m 14s |



        Subsystem | Report/Notes
        Patch URL | http://issues.apache.org/jira/secure/attachment/12680098/apache-yarn-2821.1.patch
        Optional Tests | javadoc javac unit findbugs checkstyle
        git revision | trunk / a319771
        checkstyle | https://builds.apache.org/job/PreCommit-YARN-Build/7675/artifact/patchprocess/diffcheckstylehadoop-yarn-applications-distributedshell.txt
        hadoop-yarn-applications-distributedshell test log | https://builds.apache.org/job/PreCommit-YARN-Build/7675/artifact/patchprocess/testrun_hadoop-yarn-applications-distributedshell.txt
        Test Results | https://builds.apache.org/job/PreCommit-YARN-Build/7675/testReport/
        Java | 1.7.0_55
        uname | Linux asf905.gq1.ygridcore.net 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
        Console output | https://builds.apache.org/job/PreCommit-YARN-Build/7675/console

        This message was automatically generated.

        vinodkv Vinod Kumar Vavilapalli added a comment -

        Varun Vasudev, Jian had some unaddressed comments from before, please look at them.

        Canceling the patch and setting 2.8.0 as the target-version.

        vvasudev Varun Vasudev added a comment -

        Uploaded a new patch that correctly handles AM restarts and doesn't launch unnecessary containers.
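
As a rough illustration of what avoiding unnecessary containers after an AM restart can involve, the sketch below subtracts the containers still running from a previous attempt (the count reported at AM registration, as in the log above) from the total before asking for more. The class and method names are hypothetical, and the arithmetic is an assumption about the approach, not an excerpt from the patch.

```java
/** Hypothetical helper: how many containers a restarted AM still needs to request. */
class RestartAccounting {
  /**
   * @param numTotalContainers containers the application wants overall
   * @param previousRunning    containers still running from earlier attempts
   * @return number of new containers to ask the RM for, never negative
   */
  static int containersToRequest(int numTotalContainers, int previousRunning) {
    return Math.max(0, numTotalContainers - previousRunning);
  }
}
```

For example, with 5 containers wanted and 2 carried over from a previous attempt, only 3 new containers would be requested.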

        hadoopqa Hadoop QA added a comment -



        -1 overall



        Vote | Subsystem | Runtime | Comment
        0 | pre-patch | 14m 49s | Pre-patch trunk compilation is healthy.
        +1 | @author | 0m 0s | The patch does not contain any @author tags.
        +1 | tests included | 0m 0s | The patch appears to include 1 new or modified test files.
        +1 | javac | 7m 35s | There were no new javac warning messages.
        +1 | javadoc | 9m 37s | There were no new javadoc warning messages.
        +1 | release audit | 0m 22s | The applied patch does not increase the total number of release audit warnings.
        -1 | checkstyle | 0m 22s | The applied patch generated 6 new checkstyle issues (total was 151, now 155).
        +1 | whitespace | 0m 2s | The patch has no lines that end in whitespace.
        +1 | install | 1m 33s | mvn install still works.
        +1 | eclipse:eclipse | 0m 33s | The patch built with eclipse:eclipse.
        +1 | findbugs | 0m 35s | The patch does not introduce any new Findbugs (version 2.0.3) warnings.
        -1 | yarn tests | 15m 25s | Tests failed in hadoop-yarn-applications-distributedshell.
        | | 50m 56s |



        Reason | Tests
        Failed unit tests | hadoop.yarn.applications.distributedshell.TestDistributedShellWithNodeLabels
        Timed out tests | org.apache.hadoop.yarn.applications.distributedshell.TestDistributedShell



        Subsystem | Report/Notes
        Patch URL | http://issues.apache.org/jira/secure/attachment/12731235/YARN-2821.002.patch
        Optional Tests | javadoc javac unit findbugs checkstyle
        git revision | trunk / daf3e4e
        checkstyle | https://builds.apache.org/job/PreCommit-YARN-Build/7770/artifact/patchprocess/diffcheckstylehadoop-yarn-applications-distributedshell.txt
        hadoop-yarn-applications-distributedshell test log | https://builds.apache.org/job/PreCommit-YARN-Build/7770/artifact/patchprocess/testrun_hadoop-yarn-applications-distributedshell.txt
        Test Results | https://builds.apache.org/job/PreCommit-YARN-Build/7770/testReport/
        Java | 1.7.0_55
        uname | Linux asf907.gq1.ygridcore.net 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
        Console output | https://builds.apache.org/job/PreCommit-YARN-Build/7770/console

        This message was automatically generated.

        vvasudev Varun Vasudev added a comment -

        Uploaded YARN-2821.003.patch which fixes the failing tests and addresses some checkstyle complaints.

        hadoopqa Hadoop QA added a comment -



        -1 overall



        Vote | Subsystem | Runtime | Comment
        0 | pre-patch | 15m 4s | Pre-patch trunk compilation is healthy.
        +1 | @author | 0m 0s | The patch does not contain any @author tags.
        +1 | tests included | 0m 0s | The patch appears to include 1 new or modified test files.
        +1 | javac | 7m 46s | There were no new javac warning messages.
        +1 | javadoc | 9m 54s | There were no new javadoc warning messages.
        +1 | release audit | 0m 23s | The applied patch does not increase the total number of release audit warnings.
        -1 | checkstyle | 0m 23s | The applied patch generated 2 new checkstyle issues (total was 150, now 150).
        +1 | whitespace | 0m 3s | The patch has no lines that end in whitespace.
        +1 | install | 1m 37s | mvn install still works.
        +1 | eclipse:eclipse | 0m 33s | The patch built with eclipse:eclipse.
        +1 | findbugs | 0m 37s | The patch does not introduce any new Findbugs (version 2.0.3) warnings.
        +1 | yarn tests | 7m 2s | Tests passed in hadoop-yarn-applications-distributedshell.
        | | 43m 25s |



        Subsystem | Report/Notes
        Patch URL | http://issues.apache.org/jira/secure/attachment/12731438/YARN-2821.003.patch
        Optional Tests | javadoc javac unit findbugs checkstyle
        git revision | trunk / 241a72a
        checkstyle | https://builds.apache.org/job/PreCommit-YARN-Build/7806/artifact/patchprocess/diffcheckstylehadoop-yarn-applications-distributedshell.txt
        hadoop-yarn-applications-distributedshell test log | https://builds.apache.org/job/PreCommit-YARN-Build/7806/artifact/patchprocess/testrun_hadoop-yarn-applications-distributedshell.txt
        Test Results | https://builds.apache.org/job/PreCommit-YARN-Build/7806/testReport/
        Java | 1.7.0_55
        uname | Linux asf904.gq1.ygridcore.net 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
        Console output | https://builds.apache.org/job/PreCommit-YARN-Build/7806/console

        This message was automatically generated.

        vvasudev Varun Vasudev added a comment -

        Jian He - can you please review? Thanks!

        jianhe Jian He added a comment -

        The current patch makes sense because there is no way to figure out previously finished apps other than persisting them. But I wonder whether this is overkill for an example app. One alternative: inside onContainersCompleted, we can filter out the previous attempts' finished containers and count only the current attempt's finished containers. Then, in the finish() method, we compare numFinishedContainersOfcurrentAttempt == numContainersAskedByCurrentAttempt. Will this work? I know that with this approach the total number of finished containers may be larger than the user specified, but given that this is just an example app, maybe that is tolerable?

        On the other hand, the current patch may not work on a secure cluster because it communicates with HDFS. The renameScriptFile method is an example of how to talk to HDFS.
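
        A minimal sketch of the attempt-scoped counting described above, not the committed patch. The class AttemptScopedCounter and the helper attemptOf are hypothetical, and container IDs are modeled as plain strings of the form seen in the log excerpt (container_<clusterTs>_<appId>_<attempt>_<seq>) so the example runs without Hadoop on the classpath. In the real AM one would presumably read the attempt from ContainerId#getApplicationAttemptId instead of parsing a string.

```java
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical helper; not part of the DistributedShell ApplicationMaster. */
class AttemptScopedCounter {
  private final int currentAttempt;
  private final int numContainersAskedByCurrentAttempt;
  private final AtomicInteger numFinishedContainersOfCurrentAttempt =
      new AtomicInteger();

  AttemptScopedCounter(int currentAttempt, int numContainersAsked) {
    this.currentAttempt = currentAttempt;
    this.numContainersAskedByCurrentAttempt = numContainersAsked;
  }

  // Assumption: IDs look like container_1415123350094_0017_01_000002,
  // where the fourth underscore-separated field is the attempt number.
  static int attemptOf(String containerId) {
    return Integer.parseInt(containerId.split("_")[3]);
  }

  // Counts only completions that belong to the current attempt.
  void onContainersCompleted(List<String> completedContainerIds) {
    for (String id : completedContainerIds) {
      if (attemptOf(id) != currentAttempt) {
        continue; // finished container from a previous attempt
      }
      numFinishedContainersOfCurrentAttempt.incrementAndGet();
    }
  }

  // The comparison suggested for finish().
  boolean isDone() {
    return numFinishedContainersOfCurrentAttempt.get()
        == numContainersAskedByCurrentAttempt;
  }
}
```

        With this shape, a restarted AM that asks for N containers finishes once N of its own containers complete, regardless of how many completions from earlier attempts it is told about. The trade-off is the one noted above: the total across attempts can exceed what the user asked for.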

        vvasudev Varun Vasudev added a comment -

        Uploaded a new patch that uses a simpler approach without the checkpointing.

        hadoopqa Hadoop QA added a comment -



        +1 overall



        || Vote || Subsystem || Runtime || Comment ||
        | 0 | pre-patch | 14m 46s | Pre-patch trunk compilation is healthy. |
        | +1 | @author | 0m 0s | The patch does not contain any @author tags. |
        | +1 | tests included | 0m 0s | The patch appears to include 1 new or modified test files. |
        | +1 | javac | 7m 33s | There were no new javac warning messages. |
        | +1 | javadoc | 9m 37s | There were no new javadoc warning messages. |
        | +1 | release audit | 0m 22s | The applied patch does not increase the total number of release audit warnings. |
        | +1 | checkstyle | 0m 24s | There were no new checkstyle issues. |
        | +1 | whitespace | 0m 0s | The patch has no lines that end in whitespace. |
        | +1 | install | 1m 32s | mvn install still works. |
        | +1 | eclipse:eclipse | 0m 33s | The patch built with eclipse:eclipse. |
        | +1 | findbugs | 0m 36s | The patch does not introduce any new Findbugs (version 2.0.3) warnings. |
        | +1 | yarn tests | 6m 58s | Tests passed in hadoop-yarn-applications-distributedshell. |
        | | | 42m 25s | |



        || Subsystem || Report/Notes ||
        | Patch URL | http://issues.apache.org/jira/secure/attachment/12733521/YARN-2821.004.patch |
        | Optional Tests | javadoc javac unit findbugs checkstyle |
        | git revision | trunk / 363c355 |
        | hadoop-yarn-applications-distributedshell test log | https://builds.apache.org/job/PreCommit-YARN-Build/7967/artifact/patchprocess/testrun_hadoop-yarn-applications-distributedshell.txt |
        | Test Results | https://builds.apache.org/job/PreCommit-YARN-Build/7967/testReport/ |
        | Java | 1.7.0_55 |
        | uname | Linux asf909.gq1.ygridcore.net 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux |
        | Console output | https://builds.apache.org/job/PreCommit-YARN-Build/7967/console |

        This message was automatically generated.

        jianhe Jian He added a comment -

        Thanks, Varun! Looks good overall.
        Test case: for the code below, we need to test that if the AM receives any unknown completed container, numCompletedContainers still ends up equal to numTotalContainers.

                // ignore containers we know nothing about - probably from a previous
                // attempt
                if (!launchedContainers.contains(containerStatus.getContainerId())) {
                  LOG.info("Ignoring completed status of "
                      + containerStatus.getContainerId()
                      + "; unknown container(probably launched by previous attempt)");
                  continue;
                }
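
        A hedged sketch of the behaviour such a test would pin down. This is not TestDSAppMaster; LaunchedContainerTracker and its methods are hypothetical stand-ins, and container IDs are plain strings so the example runs without Hadoop. Only the names launchedContainers, numCompletedContainers and numTotalContainers are taken from the discussion above.

```java
import java.util.Collections;
import java.util.List;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical stand-in for the AM's completion bookkeeping. */
class LaunchedContainerTracker {
  private final int numTotalContainers;
  // Thread-safe set of container IDs launched by this attempt.
  private final Set<String> launchedContainers =
      Collections.newSetFromMap(new ConcurrentHashMap<String, Boolean>());
  private final AtomicInteger numCompletedContainers = new AtomicInteger();

  LaunchedContainerTracker(int numTotalContainers) {
    this.numTotalContainers = numTotalContainers;
  }

  void containerLaunched(String containerId) {
    launchedContainers.add(containerId);
  }

  void onContainersCompleted(List<String> completedContainerIds) {
    for (String id : completedContainerIds) {
      // Ignore containers we know nothing about, probably from a
      // previous attempt, so they do not inflate the completed count.
      if (!launchedContainers.contains(id)) {
        continue;
      }
      numCompletedContainers.incrementAndGet();
    }
  }

  int getNumCompletedContainers() {
    return numCompletedContainers.get();
  }

  boolean isDone() {
    return numCompletedContainers.get() == numTotalContainers;
  }
}
```

        A test along these lines would launch numTotalContainers containers, report them as completed together with one ID that was never launched, and then check that the completed count equals numTotalContainers rather than numTotalContainers + 1.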
        
        vvasudev Varun Vasudev added a comment -

        Uploaded 005.patch which adds the tests requested by Jian He.

        hadoopqa Hadoop QA added a comment -



        +1 overall



        || Vote || Subsystem || Runtime || Comment ||
        | 0 | pre-patch | 14m 40s | Pre-patch trunk compilation is healthy. |
        | +1 | @author | 0m 0s | The patch does not contain any @author tags. |
        | +1 | tests included | 0m 0s | The patch appears to include 1 new or modified test files. |
        | +1 | javac | 7m 36s | There were no new javac warning messages. |
        | +1 | javadoc | 9m 37s | There were no new javadoc warning messages. |
        | +1 | release audit | 0m 23s | The applied patch does not increase the total number of release audit warnings. |
        | +1 | checkstyle | 0m 18s | There were no new checkstyle issues. |
        | +1 | whitespace | 0m 1s | The patch has no lines that end in whitespace. |
        | +1 | install | 1m 33s | mvn install still works. |
        | +1 | eclipse:eclipse | 0m 33s | The patch built with eclipse:eclipse. |
        | +1 | findbugs | 0m 35s | The patch does not introduce any new Findbugs (version 3.0.0) warnings. |
        | +1 | yarn tests | 6m 56s | Tests passed in hadoop-yarn-applications-distributedshell. |
        | | | 42m 15s | |



        || Subsystem || Report/Notes ||
        | Patch URL | http://issues.apache.org/jira/secure/attachment/12733765/YARN-2821.005.patch |
        | Optional Tests | javadoc javac unit findbugs checkstyle |
        | git revision | trunk / eb4c9dd |
        | hadoop-yarn-applications-distributedshell test log | https://builds.apache.org/job/PreCommit-YARN-Build/7995/artifact/patchprocess/testrun_hadoop-yarn-applications-distributedshell.txt |
        | Test Results | https://builds.apache.org/job/PreCommit-YARN-Build/7995/testReport/ |
        | Java | 1.7.0_55 |
        | uname | Linux asf905.gq1.ygridcore.net 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux |
        | Console output | https://builds.apache.org/job/PreCommit-YARN-Build/7995/console |

        This message was automatically generated.

        jianhe Jian He added a comment -

        Committed to trunk and branch-2. Thanks, Varun!

        hudson Hudson added a comment -

        FAILURE: Integrated in Hadoop-trunk-Commit #7868 (See https://builds.apache.org/job/Hadoop-trunk-Commit/7868/)
        YARN-2821. Fixed a problem that DistributedShell AM may hang if restarted. Contributed by Varun Vasudev (jianhe: rev 7438966586f1896ab3e8b067d47a4af28a894106)

        • hadoop-yarn-project/CHANGES.txt
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/pom.xml
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/ApplicationMaster.java
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/test/java/org/apache/hadoop/yarn/applications/distributedshell/TestDSAppMaster.java
        hudson Hudson added a comment -

        FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #202 (See https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/202/)
        YARN-2821. Fixed a problem that DistributedShell AM may hang if restarted. Contributed by Varun Vasudev (jianhe: rev 7438966586f1896ab3e8b067d47a4af28a894106)

        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/ApplicationMaster.java
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/test/java/org/apache/hadoop/yarn/applications/distributedshell/TestDSAppMaster.java
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/pom.xml
        • hadoop-yarn-project/CHANGES.txt
        hudson Hudson added a comment -

        SUCCESS: Integrated in Hadoop-Yarn-trunk #933 (See https://builds.apache.org/job/Hadoop-Yarn-trunk/933/)
        YARN-2821. Fixed a problem that DistributedShell AM may hang if restarted. Contributed by Varun Vasudev (jianhe: rev 7438966586f1896ab3e8b067d47a4af28a894106)

        • hadoop-yarn-project/CHANGES.txt
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/ApplicationMaster.java
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/pom.xml
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/test/java/org/apache/hadoop/yarn/applications/distributedshell/TestDSAppMaster.java
        hudson Hudson added a comment -

        FAILURE: Integrated in Hadoop-Hdfs-trunk #2131 (See https://builds.apache.org/job/Hadoop-Hdfs-trunk/2131/)
        YARN-2821. Fixed a problem that DistributedShell AM may hang if restarted. Contributed by Varun Vasudev (jianhe: rev 7438966586f1896ab3e8b067d47a4af28a894106)

        • hadoop-yarn-project/CHANGES.txt
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/ApplicationMaster.java
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/pom.xml
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/test/java/org/apache/hadoop/yarn/applications/distributedshell/TestDSAppMaster.java
        hudson Hudson added a comment -

        FAILURE: Integrated in Hadoop-Hdfs-trunk-Java8 #191 (See https://builds.apache.org/job/Hadoop-Hdfs-trunk-Java8/191/)
        YARN-2821. Fixed a problem that DistributedShell AM may hang if restarted. Contributed by Varun Vasudev (jianhe: rev 7438966586f1896ab3e8b067d47a4af28a894106)

        • hadoop-yarn-project/CHANGES.txt
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/pom.xml
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/test/java/org/apache/hadoop/yarn/applications/distributedshell/TestDSAppMaster.java
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/ApplicationMaster.java
        hudson Hudson added a comment -

        SUCCESS: Integrated in Hadoop-Mapreduce-trunk-Java8 #201 (See https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Java8/201/)
        YARN-2821. Fixed a problem that DistributedShell AM may hang if restarted. Contributed by Varun Vasudev (jianhe: rev 7438966586f1896ab3e8b067d47a4af28a894106)

        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/ApplicationMaster.java
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/pom.xml
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/test/java/org/apache/hadoop/yarn/applications/distributedshell/TestDSAppMaster.java
        • hadoop-yarn-project/CHANGES.txt
        hudson Hudson added a comment -

        SUCCESS: Integrated in Hadoop-Mapreduce-trunk #2149 (See https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2149/)
        YARN-2821. Fixed a problem that DistributedShell AM may hang if restarted. Contributed by Varun Vasudev (jianhe: rev 7438966586f1896ab3e8b067d47a4af28a894106)

        • hadoop-yarn-project/CHANGES.txt
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/pom.xml
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/ApplicationMaster.java
        • hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/test/java/org/apache/hadoop/yarn/applications/distributedshell/TestDSAppMaster.java

          People

          • Assignee:
            vvasudev Varun Vasudev
            Reporter:
            vvasudev Varun Vasudev
          • Votes:
            0
            Watchers:
            5
