Uploaded image for project: 'Oozie'
  1. Oozie
  2. OOZIE-1178

Workflow Application Master in YARN

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      It is useful to have a workflow application master, which will be capable of running a DAG of jobs. The workflow client submits a DAG request to the AM and then the AM will manage the life cycle of this application in terms of requesting the needed resources from the RM, and starting, monitoring and retrying the application's individual tasks.

      Compared to running Oozie with the current MapReduce Application Master, these are some of the advantages:

      • Less number of consumed resources, since only one application master will be spawned for the whole workflow.
      • Reuse of resources, since the same resources can be used by multiple consecutive jobs in the workflow (no need to request/wait for resources for every individual job from the central RM).
      • More optimization opportunities in terms of collective resource requests.
      • Optimization opportunities in terms of rewriting and composing jobs in the workflow (e.g. pushing down Mappers).
      • This Application Master can be reused/extended by higher systems like Pig and hive to provide an optimized way of running their workflows.

      Attachments

        1. MAPREDUCE-4495-v1.1.patch
          450 kB
          Bo Wang
        2. MAPREDUCE-4495-v1.patch
          450 kB
          Bo Wang
        3. MapReduceWorkflowAM.pdf
          865 kB
          Bo Wang
        4. yapp_proposal.txt
          8 kB
          Arun Murthy

        Issue Links

          Activity

            People

              Unassigned Unassigned
              bowang Bo Wang
              Votes:
              5 Vote for this issue
              Watchers:
              67 Start watching this issue

              Dates

                Created:
                Updated: