Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-9198

Corrupted state from a previous version can still cause RM to fail with NPE on FairScheduler

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Patch Available
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.1.0, 2.8.5
    • Fix Version/s: None
    • Labels:
      None

      Description

      Previously, RM may fail with NPE due to YARN-4347,YARN-4000. After these fixes, FairScheduler still has the same potential issue.

       
      201x-xx-xx xx:xx:xx,xxx ERROR resourcemanager.ResourceManager (ResourceManager.java:serviceStart) - Failed to load/recover state
      java.lang.NullPointerException
      at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.addApplicationAttempt(FairScheduler.java)

        Attachments

        1. YARN-9198.001.patch
          1 kB
          Dapeng Sun

          Activity

            People

            • Assignee:
              dapengsun Dapeng Sun
              Reporter:
              dapengsun Dapeng Sun
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated: