  1. Hadoop Map/Reduce
  2. MAPREDUCE-7294

Only application master should upload resource to Yarn Shared Cache

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.10.0, 3.3.0, 3.2.1, 3.1.4
    • Fix Version/s: 3.2.2, 3.4.0, 3.3.1, 2.10.2
    • Component/s: mrv2
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      The YARN shared cache manager is designed so that only the application master uploads jars, files, and other resources. However, a bug present since 2.9.0 causes every node manager that runs a task of the job to try to upload the jars/resources as well. For example, if a job has 5000 tasks, up to 5000 NMs may try to upload the same jar. This acts like a DDoS and creates a snowball effect: the YARN shared cache manager becomes unavailable, localization times out, and the job fails.
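
The load difference described above can be illustrated with a small, self-contained sketch. It is not the Hadoop code or the actual patch: the class `SharedCacheUploadSketch`, the enum `ContainerRole`, and both methods are hypothetical names introduced only for illustration, and the count assumes the worst case in which every task runs on a different node manager.

```java
// Hypothetical sketch: none of these types are Hadoop classes.
final class SharedCacheUploadSketch {

    /** Role of the container that localizes the job resources. */
    enum ContainerRole { APPLICATION_MASTER, TASK }

    /**
     * Decides whether a container requests an upload to the shared cache.
     * onlyAmUploads = true models the intended design;
     * onlyAmUploads = false models the behaviour described in this issue.
     */
    static boolean shouldUpload(ContainerRole role, boolean onlyAmUploads) {
        if (role == ContainerRole.APPLICATION_MASTER) {
            return true;
        }
        // Task containers request an upload only when gating is off.
        return !onlyAmUploads;
    }

    /**
     * Counts upload requests for one AM container plus taskCount task
     * containers, assuming (worst case) each task runs on its own NM.
     */
    static int countUploadRequests(int taskCount, boolean onlyAmUploads) {
        int requests =
            shouldUpload(ContainerRole.APPLICATION_MASTER, onlyAmUploads) ? 1 : 0;
        for (int i = 0; i < taskCount; i++) {
            if (shouldUpload(ContainerRole.TASK, onlyAmUploads)) {
                requests++;
            }
        }
        return requests;
    }

    public static void main(String[] args) {
        int tasks = 5000;
        System.out.println("without gating: " + countUploadRequests(tasks, false));
        System.out.println("AM only:        " + countUploadRequests(tasks, true));
    }
}
```

With gating on the container role, the number of upload requests no longer grows with the number of tasks, which is the property the issue title asks for.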

            People

              Assignee: wzzdreamer zhenzhao wang
              Reporter: wzzdreamer zhenzhao wang
              Votes: 0
              Watchers: 9