YARN-8760 (sub-task of YARN-5597, YARN Federation improvements; project: Hadoop YARN)

[AMRMProxy] Fix concurrent re-register due to YarnRM failover in AMRMClientRelayer

Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.2.0
    • Component/s: None
    • Labels: None

    Description

      When the home YarnRM is failing over, a FinishApplicationMaster call from the AM can have multiple retry threads outstanding in FederationInterceptor. When the new YarnRM comes back up, every retry thread re-registers with it. The first re-register succeeds, but the rest fail with an "Application Master is already registered" exception. We should catch and swallow this exception and move on.
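
The race and the proposed handling can be sketched in isolation. This is a minimal simulation, not the attached patch: FakeRm, AlreadyRegisteredException, reRegisterQuietly, and runConcurrentRetries are hypothetical names made up for illustration, and the fake RM simply accepts the first register call and rejects every later one.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicInteger;

public class ReRegisterSketch {

  /** Hypothetical stand-in for the exception a RM raises on a duplicate register. */
  static class AlreadyRegisteredException extends Exception {
    AlreadyRegisteredException(String msg) {
      super(msg);
    }
  }

  /** Hypothetical stand-in for a freshly restarted RM: accepts one register call only. */
  static class FakeRm {
    private final AtomicBoolean registered = new AtomicBoolean(false);

    void register() throws AlreadyRegisteredException {
      // Only the first caller flips the flag; every later caller is rejected.
      if (!registered.compareAndSet(false, true)) {
        throw new AlreadyRegisteredException("Application Master is already registered");
      }
    }

    boolean isRegistered() {
      return registered.get();
    }
  }

  /** Re-registers and treats "already registered" as success. Returns true if this call registered. */
  static boolean reRegisterQuietly(FakeRm rm) {
    try {
      rm.register();
      return true;
    } catch (AlreadyRegisteredException e) {
      // Another retry thread won the race. The AM is registered, so swallow and move on.
      return false;
    }
  }

  /** Starts the given number of concurrent retry threads; returns how many actually registered. */
  static int runConcurrentRetries(FakeRm rm, int threads) {
    ExecutorService pool = Executors.newFixedThreadPool(threads);
    CountDownLatch start = new CountDownLatch(1);
    AtomicInteger winners = new AtomicInteger();
    for (int i = 0; i < threads; i++) {
      pool.execute(() -> {
        try {
          start.await(); // hold all threads until released together
          if (reRegisterQuietly(rm)) {
            winners.incrementAndGet();
          }
        } catch (InterruptedException e) {
          Thread.currentThread().interrupt();
        }
      });
    }
    start.countDown();
    pool.shutdown();
    try {
      if (!pool.awaitTermination(10, TimeUnit.SECONDS)) {
        throw new IllegalStateException("retry threads did not finish");
      }
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IllegalStateException("interrupted while waiting for retry threads", e);
    }
    return winners.get();
  }

  public static void main(String[] args) {
    FakeRm rm = new FakeRm();
    int winners = runConcurrentRetries(rm, 4);
    System.out.println("registered=" + rm.isRegistered() + ", winners=" + winners);
  }
}
```

In this sketch exactly one retry thread performs the registration and the others return quietly, so every thread can go on to retry its original call instead of failing on the duplicate register.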

      Attachments

        1. YARN-8760.v1.patch
          7 kB
          Botong Huang

          People

            Assignee: Botong Huang (botong)
            Reporter: Botong Huang (botong)