Uploaded image for project: 'Apache Twill (Retired)'
  1. Apache Twill (Retired)
  2. TWILL-63

Speed up application launch time

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.2.0-incubating
    • 0.10.0
    • yarn
    • None

    Description

      Currently when launching an application, two new jars are always created locally, one for AM (appMaster.jar) and one for Container (container.jar) and copied to HDFS before submitting the application. The jar files could potentially be big and if it doesn't changed, it should require copying to HDFS again.

      The general approach is better jar files management and to cache and reuse jar files created through
      class dependency tracing. The changes are further broken down as follows:

      1. Refactor jars generation
        • One jar containing the TwillLauncher (launcher.jar), created through dependency tracing.
          • This jar is the same for all applications.
        • One jar containing all twill classes (twill.jar), created through dependency tracing.
          • This jar is the same for all applications.
        • One jar containing the application class, created through dependency tracing.
          • This jar is generated based on the application being launched. It is reusable when launching the same app multiple times.
        • One jar containing user resources setup through TwillPreparer.
          • This jar is not reused between apps.
        • One jar containing runtime config needed by Twill
          • logback.xml, jvm opts, environment, classpaths, ... etc
      2. Let YARN to expand jars when localizing to containers instead of expanding it programatically
        • This save time in jar expansion when multiple containers are running on the same host
      3. Introduce a new configuration "twill.location.cache.dir" to enable jar caching and reuse
        • Currently only the launcher.jar, twill.jar and application jar will be cached and reuse when possible
        • Cache cleanup logic is also in place to remove files in cache directory that is no longer used by application
      4. The ApplicationBundler is improved to allow more flexible usage

      Attachments

        Issue Links

          Activity

            People

              chtyim Terence Yim
              chtyim Terence Yim
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: