Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-426

Failure to download a public resource on a node prevents further downloads of the resource from that node

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 2.0.3-alpha, 0.23.6
    • 0.23.7, 2.1.0-beta
    • nodemanager
    • None

    Description

      If the NM encounters an error while downloading a public resource, it fails to empty the list of request events corresponding to the resource request in attempts. If the same public resource is subsequently requested on that node, PublicLocalizer.addResource will skip the download since it will mistakenly believe a download of that resource is already in progress. At that point any container that requests the public resource will just hang in the LOCALIZING state.

      Attachments

        1. YARN-426.patch
          10 kB
          Jason Darrell Lowe

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            jlowe Jason Darrell Lowe
            jlowe Jason Darrell Lowe
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Issue deployment