Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-7759 Improve Ozone Replication Manager
  3. HDDS-8658

ReplicationManager: Change default command timeout to 10 minutes

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Done
    • None
    • 1.4.0
    • SCM

    Description

      In Replication Manager, a deadline is set on commands sent to a datanode. If the command has not completed within the timeout, RM assumes it is lost and will schedule a new command to another random node.

      Right now the default is set to 30 minutes as the legacy RM scheduled a lot of work onto the DNs and it could take a long time to complete. The new RM throttles the work sent, so a large queue on the DNs should not be possible.

      We should change the default event timeout to 10 minutes instead of 30.

      Attachments

        Issue Links

          Activity

            People

              sodonnell Stephen O'Donnell
              sodonnell Stephen O'Donnell
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: