Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-5245

TestGrunt.testStopOnFailure is flaky

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.17.0
    • None
    • None
    • Reviewed

    Description

      The test is supposed to run two tests in parallel, and one when fails other should be killed when stop on failure is configured. But the test is actually running only job at a time and based on the order in which jobs are run it is passing. This is because of the capacity scheduler configuration of the MiniCluster. It runs only one AM at a time due to resource restrictions. In a 16G node, when the first job runs it takes up 1536 (AM) + 1024 (task) memory.mb. Since only 10% of cluster resource is the default for running AMs and a single AM already takes up memory close to 1.6G, second job AM is not launched.

      Attachments

        1. PIG-5245-3.patch
          6 kB
          Rohini Palaniswamy
        2. PIG-5245-2.patch
          5 kB
          Rohini Palaniswamy
        3. PIG-5245-2.addingSparkLibsToMiniCluster.patch
          2 kB
          Ádám Szita
        4. PIG-5245-1.patch
          2 kB
          Rohini Palaniswamy

        Activity

          People

            rohini Rohini Palaniswamy
            rohini Rohini Palaniswamy
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: