VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 2.9.0, 3.0.0-beta1
    • fs/azure
    • None
    • Up to 34% throughput improvement for the wasb:// (Azure) file system when fs.azure.selfthrottling.enable is false fs.azure.autothrottling.enable is true.
    • Patch

    Description

      Big data workloads frequently exceed the Azure Storage max ingress and egress limits (https://docs.microsoft.com/en-us/azure/azure-subscription-service-limits). For example, the max ingress limit for a GRS account in the United States is currently 10 Gbps. When the limit is exceeded, the Azure Storage service fails a percentage of incoming requests, and this causes the client to initiate the retry policy. The retry policy delays requests by sleeping, but the sleep duration is independent of the client throughput and account limit. This results in low throughput, due to the high number of failed requests and thrashing causes by the retry policy.

      To fix this, we introduce a client-side throttle which minimizes failed requests and maximizes throughput. Tests have shown that this improves throughtput by ~34% when the storage account max ingress and/or egress limits are exceeded.

      Attachments

        1. HADOOP-14660-001.patch
          46 kB
          Thomas Marqardt
        2. HADOOP-14660-002.patch
          46 kB
          Thomas Marqardt
        3. HADOOP-14660-003.patch
          46 kB
          Thomas Marqardt
        4. HADOOP-14660-004.patch
          46 kB
          Thomas Marqardt
        5. HADOOP-14660-005.patch
          49 kB
          Thomas Marqardt
        6. HADOOP-14660-006.patch
          65 kB
          Thomas Marqardt
        7. HADOOP-14660-007.patch
          65 kB
          Thomas Marqardt
        8. HADOOP-14660-008.patch
          65 kB
          Thomas Marqardt
        9. HADOOP-14660-010.patch
          59 kB
          Steve Loughran
        10. HADOOP-14660-branch-2-001.patch
          59 kB
          Thomas Marqardt

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            tmarquardt Thomas Marqardt
            tmarquardt Thomas Marqardt
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Issue deployment