  Hadoop YARN / YARN-9198

Corrupted state from a previous version can still cause RM to fail with NPE on FairScheduler


Details

    • Type: Bug
    • Status: Patch Available
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.1.0, 2.8.5
    • None
    • None

    Description

      Previously, the RM could fail with an NPE while recovering state, as reported in YARN-4347 and YARN-4000. Even with those fixes in place, FairScheduler is still exposed to the same issue.

       
      201x-xx-xx xx:xx:xx,xxx ERROR resourcemanager.ResourceManager (ResourceManager.java:serviceStart) - Failed to load/recover state
      java.lang.NullPointerException
      at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.addApplicationAttempt(FairScheduler.java)
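
      The trace above points at addApplicationAttempt, so one plausible reading is that a recovered attempt refers to an application the scheduler never registered, and the lookup result is dereferenced without a null check. The following is a minimal sketch of that pattern and of a defensive guard, under that assumption. Every class and method name in it (RecoverySketch, SchedulerApp, addAttemptUnguarded, addAttemptGuarded) is hypothetical; it is not FairScheduler code and not the contents of the attached patch.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a scheduler's application registry (not Hadoop code).
class RecoverySketch {
    // Per-application bookkeeping; only the attempt counter matters here.
    static final class SchedulerApp {
        final String queue;
        int attempts;

        SchedulerApp(String queue) {
            this.queue = queue;
        }
    }

    private final Map<String, SchedulerApp> applications = new HashMap<>();

    void addApplication(String appId, String queue) {
        applications.put(appId, new SchedulerApp(queue));
    }

    // Unguarded: dereferences the lookup result directly, so an attempt whose
    // application is missing from the map throws NullPointerException.
    void addAttemptUnguarded(String appId) {
        SchedulerApp app = applications.get(appId);
        app.attempts++;
    }

    // Guarded: reports and skips attempts that reference an unknown application.
    boolean addAttemptGuarded(String appId) {
        SchedulerApp app = applications.get(appId);
        if (app == null) {
            System.err.println("Application " + appId + " not found in scheduler; skipping attempt");
            return false;
        }
        app.attempts++;
        return true;
    }

    public static void main(String[] args) {
        RecoverySketch scheduler = new RecoverySketch();
        scheduler.addApplication("app_1", "root.default");

        // Simulated recovered state: the second attempt points at an app that was never added.
        for (String appId : new String[] {"app_1", "app_missing"}) {
            System.out.println(appId + " accepted: " + scheduler.addAttemptGuarded(appId));
        }
    }
}
```

      With the guard, a stale attempt is logged and skipped instead of aborting startup. Whether skipping is the right recovery policy for the RM is a separate design question that this sketch does not settle.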

      Attachments

        1. YARN-9198.001.patch
          1 kB
          Dapeng Sun

          People

            Assignee: Dapeng Sun (dapengsun)
            Reporter: Dapeng Sun (dapengsun)
