Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-4087

Followup fixes after YARN-2019 regarding RM behavior when state-store error occurs

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.8.0, 2.7.2, 2.6.2, 3.0.0-alpha1
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Incompatible change
    • Release Note:
      Set YARN_FAIL_FAST to be false by default. If HA is enabled and if there's any state-store error, after the retry operation failed, we always transition RM to standby state.

      Description

      Several fixes:
      1. Set YARN_FAIL_FAST to be false by default, since this makes more sense in production environment.
      2. If HA is enabled and if there's any state-store error, after the retry operation failed, we always transition RM to standby state. Otherwise, we may see two active RMs running. YARN-4107 is one example.

        Attachments

        1. YARN-4087-branch-2.6.patch
          4 kB
          Xuan Gong
        2. YARN-4087.7.patch
          4 kB
          Xuan Gong
        3. YARN-4087.6.patch
          4 kB
          Jian He
        4. YARN-4087.5.patch
          14 kB
          Jian He
        5. YARN-4087.3.patch
          8 kB
          Jian He
        6. YARN-4087.2.patch
          2 kB
          Jian He
        7. YARN-4087.1.patch
          1 kB
          Jian He

          Issue Links

            Activity

              People

              • Assignee:
                jianhe Jian He
                Reporter:
                jianhe Jian He
              • Votes:
                1 Vote for this issue
                Watchers:
                12 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: