Hadoop Map/Reduce
  1. Hadoop Map/Reduce
  2. MAPREDUCE-5690

TestLocalMRNotification.testMR occasionally fails

    Details

    • Type: Bug Bug
    • Status: Patch Available
    • Priority: Major Major
    • Resolution: Unresolved
    • Affects Version/s: 2.2.0
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      TestLocalMRNotificationis occasionally failing with the error:

      -------------------------------------------------------------------------------
      Test set: org.apache.hadoop.mapred.TestLocalMRNotification
      -------------------------------------------------------------------------------
      Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 24.992 sec <<< FAILURE! - in org.apache.hadoop.mapred.TestLocalMRNotification
      testMR(org.apache.hadoop.mapred.TestLocalMRNotification)  Time elapsed: 24.881 sec  <<< ERROR!
      java.io.IOException: Job cleanup didn't start in 20 seconds
              at org.apache.hadoop.mapred.UtilsForTests.runJobKill(UtilsForTests.java:685)
              at org.apache.hadoop.mapred.NotificationTestCase.testMR(NotificationTestCase.java:178)
              at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
              at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
              at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
              at java.lang.reflect.Method.invoke(Method.java:597)
              at junit.framework.TestCase.runTest(TestCase.java:168)
              at junit.framework.TestCase.runBare(TestCase.java:134)
              at junit.framework.TestResult$1.protect(TestResult.java:110)
              at junit.framework.TestResult.runProtected(TestResult.java:128)
              at junit.framework.TestResult.run(TestResult.java:113)
              at junit.framework.TestCase.run(TestCase.java:124)
              at junit.framework.TestSuite.runTest(TestSuite.java:243)
              at junit.framework.TestSuite.run(TestSuite.java:238)
              at org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:83)
              at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:254)
              at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:149)
              at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:124)
              at org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:200)
              at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:153)
              at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:103)
      

        Activity

        Hide
        Liyin Liang added a comment -

        The failure of TestLocalMRNotification.testMR is caused by UtilsForTests.runJobKill(). During UtilsForTests.runJobKill(), a job with KillMapper is submitted to LocalJobRunner. When the job is in RUNNING status, kill it by job.killJob(). Then wait the job to complete with 20 seconds timeout.
        The problem is job.killJob() intends to interrupt the KillMapper, which will sleep for a long time. While if job.killJob() is invoked before KillMapper is launched, the job will continue run the mapper with a long time.

        Show
        Liyin Liang added a comment - The failure of TestLocalMRNotification.testMR is caused by UtilsForTests.runJobKill(). During UtilsForTests.runJobKill(), a job with KillMapper is submitted to LocalJobRunner. When the job is in RUNNING status, kill it by job.killJob(). Then wait the job to complete with 20 seconds timeout. The problem is job.killJob() intends to interrupt the KillMapper, which will sleep for a long time. While if job.killJob() is invoked before KillMapper is launched, the job will continue run the mapper with a long time.
        Hide
        Liyin Liang added a comment -

        This patch adds waiting job's map progress before job.killJob().

        Show
        Liyin Liang added a comment - This patch adds waiting job's map progress before job.killJob().
        Hide
        Hadoop QA added a comment -

        -1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12619471/MAPREDUCE-5690.1.diff
        against trunk revision .

        +1 @author. The patch does not contain any @author tags.

        +1 tests included. The patch appears to include 1 new or modified test files.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        +1 javadoc. The javadoc tool did not generate any warning messages.

        +1 eclipse:eclipse. The patch built with eclipse:eclipse.

        +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

        +1 release audit. The applied patch does not increase the total number of release audit warnings.

        -1 core tests. The patch failed these unit tests in hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient:

        org.apache.hadoop.mapreduce.security.TestJHSSecurity

        +1 contrib tests. The patch passed contrib unit tests.

        Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4268//testReport/
        Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4268//console

        This message is automatically generated.

        Show
        Hadoop QA added a comment - -1 overall . Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12619471/MAPREDUCE-5690.1.diff against trunk revision . +1 @author . The patch does not contain any @author tags. +1 tests included . The patch appears to include 1 new or modified test files. +1 javac . The applied patch does not increase the total number of javac compiler warnings. +1 javadoc . The javadoc tool did not generate any warning messages. +1 eclipse:eclipse . The patch built with eclipse:eclipse. +1 findbugs . The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit . The applied patch does not increase the total number of release audit warnings. -1 core tests . The patch failed these unit tests in hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient: org.apache.hadoop.mapreduce.security.TestJHSSecurity +1 contrib tests . The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4268//testReport/ Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4268//console This message is automatically generated.

          People

          • Assignee:
            Liyin Liang
            Reporter:
            Liyin Liang
          • Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

            • Created:
              Updated:

              Development