Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-3046

Fix Retry handling in ozone RPC Client

    XMLWordPrintableJSON

Details

    Description

      Right now for all other exceptions other than serviceException we use FailOverOnNetworkException.

      This Exception policy is created with 15 max fail overs and 15 retries. 

       

      retryPolicyOnNetworkException.shouldRetry(
       exception, retries, failovers, isIdempotentOrAtMostOnce);

      2 issues with this:

      1. When shouldRetry returns action FAILOVER_AND_RETRY, it will stuck with same OM, and does not perform failover to next OM.  As OMFailoverProxyProvider#performFailover() is a dummy call does not perform any failover.
      2. When ozone.client.failover.max.attempts is set to 15, now with 2 policies with each set to 15, we will retry 15*2 times in worst scenario. 

       

       

      Attachments

        Issue Links

          Activity

            People

              bharat Bharat Viswanadham
              bharat Bharat Viswanadham
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 20m
                  20m