Pig / PIG-5245

TestGrunt.testStopOnFailure is flaky

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.17.0
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Reviewed

      Description

      The test is supposed to run two jobs in parallel; when one fails, the other should be killed if stop on failure is configured. In practice the test runs only one job at a time, and whether it passes depends on the order in which the jobs run. The cause is the capacity scheduler configuration of the MiniCluster, which runs only one AM at a time due to resource restrictions. On a 16G node, the first job takes up 1536 MB (AM) + 1024 MB (task) of memory. By default only 10% of cluster resources (about 1.6G on this node) may be used for running AMs, and a single AM already takes up close to that limit, so the second job's AM is not launched.
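
      The arithmetic behind this can be sketched roughly as follows. This is a simplified model, not the scheduler's actual code: the class and method names are hypothetical, it assumes the scheduler always admits at least one AM, and it treats the 10% default as the value of the capacity scheduler's `yarn.scheduler.capacity.maximum-am-resource-percent` setting. The 0.5 value in the second call is only an illustration and is not taken from the attached patches.

```java
// Hypothetical sketch of the AM admission arithmetic described above.
class AmLimitSketch {

    // Estimates how many AMs can run at once under the AM resource cap.
    // Simplified assumption: at least one AM is always admitted.
    static int maxConcurrentAms(int clusterMemoryMb, double maxAmResourcePercent, int amMemoryMb) {
        double amBudgetMb = clusterMemoryMb * maxAmResourcePercent;
        int fitting = (int) Math.floor(amBudgetMb / amMemoryMb);
        return Math.max(1, fitting);
    }

    public static void main(String[] args) {
        int clusterMemoryMb = 16 * 1024; // the 16G node from the description
        int amMemoryMb = 1536;           // AM size from the description

        // Default 10% cap: budget is about 1638 MB, so only one 1536 MB AM fits.
        System.out.println(maxConcurrentAms(clusterMemoryMb, 0.1, amMemoryMb)); // prints 1

        // Illustrative higher cap: an 8192 MB budget leaves room for several AMs.
        System.out.println(maxConcurrentAms(clusterMemoryMb, 0.5, amMemoryMb)); // prints 5
    }
}
```

      Under this model, two jobs can only overlap if the AM budget covers at least 2 x 1536 = 3072 MB, which the default cap on a 16G node does not.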

        Attachments

        1. PIG-5245-1.patch
          2 kB
          Rohini Palaniswamy
        2. PIG-5245-2.patch
          5 kB
          Rohini Palaniswamy
        3. PIG-5245-2.addingSparkLibsToMiniCluster.patch
          2 kB
          Ádám Szita
        4. PIG-5245-3.patch
          6 kB
          Rohini Palaniswamy

          Activity

            People

            • Assignee: Rohini Palaniswamy
            • Reporter: Rohini Palaniswamy
            • Votes: 0
            • Watchers: 5

              Dates

              • Created:
                Updated:
                Resolved: