Oozie / OOZIE-2758

Improve documentation for retries

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 4.2.0, 4.3.0
    • Fix Version/s: None
    • Component/s: docs
    • Labels: None

    Description

      The oozie-site.xml file contains the property oozie.action.retry.interval.

      It is described as "The interval between retries of an action in case of failure", without specifying a time unit.

      The property oozie.service.LiteWorkflowStoreService.user.retry.inteval is described as "Automatic retry interval for workflow action is in minutes and the default value is 10 minutes.", so the user could assume that oozie.action.retry.interval is also in minutes. However, in

      https://github.com/apache/oozie/blob/master/core/src/main/java/org/apache/oozie/action/ActionExecutor.java

      a comment states "defaultRetryInterval retry interval, in seconds.".

      Our environment uses the standard settings, and the application does exactly that: it retries every 10 SECONDS (and then suspends after the maximum number of retries, e.g. in the case of a distcp action). The user, however, might expect a retry every 10 minutes.

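      Here is an excerpt from our log (the value reads as 10,000 milliseconds, i.e. 10 seconds, with a period as the thousands separator):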

      Next Retry, Attempt Number [2] in [10.000] milliseconds
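
      The logged value can be checked against both readings of the unit. The following is only a minimal sketch: the names RETRY_INTERVAL, LOGGED_MS and to_millis are made up for illustration and are not part of Oozie, and the value 10 is assumed to be the default of oozie.action.retry.interval because the report says standard settings are in use.

      ```python
      from datetime import timedelta

      # Assumed default of oozie.action.retry.interval (the report uses standard settings).
      RETRY_INTERVAL = 10

      # Value from the log excerpt, "[10.000] milliseconds", read as 10,000 ms.
      LOGGED_MS = 10_000


      def to_millis(delta: timedelta) -> int:
          """Convert a timedelta to whole milliseconds."""
          return int(delta.total_seconds() * 1000)


      # The same configured number under the two possible units.
      as_seconds_ms = to_millis(timedelta(seconds=RETRY_INTERVAL))
      as_minutes_ms = to_millis(timedelta(minutes=RETRY_INTERVAL))

      # Only the "seconds" reading matches what was logged.
      print("seconds reading matches log:", as_seconds_ms == LOGGED_MS)
      print("minutes reading matches log:", as_minutes_ms == LOGGED_MS)
      ```

      Under these assumptions only the seconds reading agrees with the log, which is consistent with the comment in ActionExecutor.java.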

      This should at least be correctly documented if not aligned to the same unit.

            People

              Assignee: Unassigned
              Reporter: Julian Endres (julianendres)
              Votes: 0
              Watchers: 3
