  1. Apache Twill (Retired)
  2. TWILL-132

Twill AppMaster does not restart on failure as many times as specified in yarn.resourcemanager.am.max-attempts

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.5.0-incubating
    • Fix Version/s: None
    • Component/s: yarn
    • Labels: None

    Description

      Apache Hadoop YARN has a configuration property that allows the ApplicationMaster (AM) to be restarted after a failure, up to a certain number of times. It is specified by yarn.resourcemanager.am.max-attempts (default is 2).

      This property gives the AM HA-like behavior: it stays resilient to failures up to the configured number of attempts.

      The Twill AppMaster seems to have a problem restarting when it fails due to an unexpected condition (e.g., a kill signal).

      <code>
      <property>
      <name>yarn.resourcemanager.am.max-attempts</name>
      <value>2</value>
      </property>
      </code>
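
To make the expected behavior concrete, here is a minimal sketch, not Twill or YARN code, that reads the property from a yarn-site.xml style fragment using only the JDK's XML parser and checks whether another AM attempt would be expected after a failure. The class name AmMaxAttempts and the helpers readMaxAttempts and restartExpected are hypothetical names introduced for illustration. The sketch assumes attempts are numbered from 1 and that every failure counts against the limit, which simplifies what the ResourceManager actually does.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper for illustration only; not part of Twill or Hadoop.
public class AmMaxAttempts {
    static final String KEY = "yarn.resourcemanager.am.max-attempts";
    // Default stated in the issue description.
    static final int DEFAULT_MAX_ATTEMPTS = 2;

    // Returns the configured value, or the default if the property is absent.
    static int readMaxAttempts(String xml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        NodeList props = doc.getElementsByTagName("property");
        for (int i = 0; i < props.getLength(); i++) {
            Element prop = (Element) props.item(i);
            if (KEY.equals(childText(prop, "name"))) {
                String value = childText(prop, "value");
                if (value != null) {
                    return Integer.parseInt(value);
                }
            }
        }
        return DEFAULT_MAX_ATTEMPTS;
    }

    // Text of the first descendant element with the given tag, or null.
    static String childText(Element parent, String tag) {
        NodeList nodes = parent.getElementsByTagName(tag);
        return nodes.getLength() == 0 ? null : nodes.item(0).getTextContent().trim();
    }

    // Assumption: attempts are numbered from 1, so a failed attempt is
    // followed by a restart only while it is below the maximum.
    static boolean restartExpected(int failedAttempt, int maxAttempts) {
        return failedAttempt < maxAttempts;
    }

    public static void main(String[] args) throws Exception {
        String xml = "<configuration><property>"
                + "<name>" + KEY + "</name><value>2</value>"
                + "</property></configuration>";
        int max = readMaxAttempts(xml);
        System.out.println("max attempts: " + max);
        System.out.println("restart after attempt 1: " + restartExpected(1, max));
        System.out.println("restart after attempt 2: " + restartExpected(2, max));
    }
}
```

Under these assumptions, with the value set to 2, killing the AM during its first attempt should lead to a second attempt, and only a failure of that second attempt should fail the application. The issue reports that the Twill AppMaster does not come back as expected.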

          People

            jwang47 Alvin Wang
            hsaputra Henry Saputra