Uploaded image for project: 'Apache Gobblin'
  1. Apache Gobblin
  2. GOBBLIN-1528

Reduce overshoot of Yarn containers in Gobblin-on-Yarn mode

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 0.15.0
    • 0.17.0
    • gobblin-yarn
    • None

    Description

      Gobblin can overallocate containers when a Yarn application is started up. The main reason for the overallocation is because the AMRM client does not track which container requests have already been submitted. This can result in container requests being re-sent and the consequent over-allocation. (See: YARN-1902). For Gobblin-on-Yarn, the overallocation results in creation of excess znodes by Helix, since each container maps to a Helix instance.

      This task attempts to limit the container over-shoot during application start up, by explicitly tracking the number of inflight container requests.

      Attachments

        Activity

          People

            suvasude Sudarshan Vasudevan
            suvasude Sudarshan Vasudevan
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 2.5h
                2.5h