Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-7952

RM should be able to recover log aggregation status after restart/fail-over

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 3.1.0
    • None
    • None
    • Reviewed

    Description

      Right now, the NM would send its own log aggregation status to RM periodically to RM. And RM would aggregate the status for each application, but it will not generate the final status until a client call(from web ui or cli) trigger it. But RM never persists the log aggregation status. So, when RM restarts/fails over, the log aggregation status will become “NOT_STARTED”. This is confusing, maybe we should change it to “NOT_AVAILABLE” (will create a separate ticket for this). Anyway, we need to persist the log aggregation status for the future use.

      Attachments

        1. YARN-7952-poc.patch
          31 kB
          Xuan Gong
        2. YARN-7952.1.patch
          36 kB
          Xuan Gong
        3. YARN-7952.2.patch
          42 kB
          Xuan Gong
        4. YARN-7952.3.patch
          44 kB
          Xuan Gong
        5. YARN-7952.3.patch
          44 kB
          Xuan Gong
        6. YARN-7952.5.patch
          44 kB
          Xuan Gong
        7. YARN-7952.6.patch
          45 kB
          Xuan Gong
        8. YARN-7952.7.patch
          46 kB
          Xuan Gong
        9. YARN-7952.8.patch
          46 kB
          Xuan Gong
        10. YARN-7952.9.patch
          47 kB
          Xuan Gong

        Issue Links

          Activity

            People

              xgong Xuan Gong
              xgong Xuan Gong
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: