  1. Hadoop YARN
  2. YARN-4721

RM to try to auth with HDFS on startup, retry with max diagnostics on failure


    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: 2.8.0
    • Fix Version/s: None
    • Component/s: resourcemanager, security
    • Labels:
    • Target Version/s:

      Description

      If the RM can't authenticate with HDFS, the failure may first surface during job submission, which makes it hard to tell what is wrong and whose credentials are at fault.

      Instead, the RM could try to talk to HDFS on launch; an ls / should suffice. If it can't authenticate, it can then tell UGI to log more and retry.

      I don't know what the policy should be if the RM can't authenticate to HDFS at this point. Certainly it can't accept work in that state. But should it fail fast, or keep going in the hope that the problem is in the KDC or NN and will fix itself without an RM restart?
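The probe-then-retry idea described above could look roughly like the following. This is only a sketch under stated assumptions, not the contents of the attached patches: the class StartupAuthProbe, its Result type, and the raiseDiagnostics hook are hypothetical names invented for illustration. In a real RM the listRoot callable would presumably wrap something like FileSystem.listStatus(new Path("/")), and the hook would turn up UGI/Kerberos logging. The sketch deliberately returns a result rather than exiting, because the fail-fast versus keep-going policy is left open here.

```java
import java.util.concurrent.Callable;

/**
 * Hypothetical startup check: run a cheap authenticated call against the
 * filesystem, and if it fails, raise diagnostics once and retry.
 */
final class StartupAuthProbe {

    /** Outcome of the check; the caller decides whether to fail fast. */
    static final class Result {
        final boolean authenticated;
        final int attempts;
        final Exception lastFailure; // null when the probe succeeded

        Result(boolean authenticated, int attempts, Exception lastFailure) {
            this.authenticated = authenticated;
            this.attempts = attempts;
            this.lastFailure = lastFailure;
        }
    }

    private StartupAuthProbe() {
    }

    /**
     * @param listRoot         the probe, e.g. an "ls /" against HDFS (assumption)
     * @param raiseDiagnostics hook that makes the security layer log more (assumption)
     * @param maxAttempts      total number of probe attempts, at least 1
     */
    static Result probe(Callable<?> listRoot, Runnable raiseDiagnostics, int maxAttempts) {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        Exception last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                listRoot.call();
                return new Result(true, attempt, null);
            } catch (Exception e) {
                last = e;
                // After the first failure, turn logging up so that the
                // retries produce as much diagnostic output as possible.
                if (attempt == 1 && maxAttempts > 1) {
                    raiseDiagnostics.run();
                }
            }
        }
        return new Result(false, maxAttempts, last);
    }
}
```

A caller would invoke StartupAuthProbe.probe(...) during service start and then either abort on an unauthenticated result or keep running while refusing work, depending on whichever policy is chosen.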

        Attachments

        1. HADOOP-12289-002.patch
          15 kB
          Steve Loughran
        2. HADOOP-12289-003.patch
          19 kB
          Steve Loughran
        3. HADOOP-12889-001.patch
          15 kB
          Steve Loughran

            People

            • Assignee:
              stevel@apache.org Steve Loughran
              Reporter:
              stevel@apache.org Steve Loughran
