Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-15763 Über-JIRA: abfs phase II: Hadoop 3.3 features & fixes
  3. HADOOP-17092

ABFS: Long waits and unintended retries when multiple threads try to fetch token using ClientCreds

    XMLWordPrintableJSON

    Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.3.0
    • Fix Version/s: 3.3.1
    • Component/s: fs/azure
    • Labels:

      Description

      Issue reported by DB:

      we recently experienced some problems with ABFS driver that highlighted a possible issue with long hangs following synchronized retries when using the ClientCredsTokenProvider and calling AbfsClient.getAccessToken. We have seen https://github.com/apache/hadoop/pull/1923, but it does not directly apply since we are not using a custom token provider, but instead ClientCredsTokenProvider that ultimately relies on AzureADAuthenticator

       

      The problem was that the critical section of getAccessToken, combined with a possibly redundant retry policy, made jobs hanging for a very long time, since only one thread at a time could make progress, and this progress amounted to basically retrying on a failing connection for 30-60 minutes.

       

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                bilahari.th Bilahari T H
                Reporter:
                snehav Sneha Vijayarajan
              • Votes:
                0 Vote for this issue
                Watchers:
                4 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: