Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-1336 [Umbrella] Work-preserving nodemanager restart
  3. YARN-3449

Recover appTokenKeepAliveMap upon nodemanager restart

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Invalid
    • 2.6.0, 2.7.0
    • None
    • nodemanager
    • None

    Description

      appTokenKeepAliveMap in NodeStatusUpdaterImpl is used to keep application alive after application is finished but NM still need app token to do log aggregation (when enable security and log aggregation).
      The applications are only inserted into this map when receiving getApplicationsToCleanup() from RM heartbeat response. And RM only send this info one time in RMNodeImpl.updateNodeHeartbeatResponseForCleanup(). NM restart work preserving should put appTokenKeepAliveMap into NMStateStore and get recovered after restart. Without doing this, RM could terminate application earlier, so log aggregation could be failed if security is enabled.

      Attachments

        Activity

          People

            Unassigned Unassigned
            junping_du Junping Du
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: