Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-7759 Improve Ozone Replication Manager
  3. HDDS-8746

Add metrics to ReplicationSupervisor for task count and max stream

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.4.0
    • SCM

    Description

      To give added visibility into the Replication Queue on a datanode, this Jira adds some extra metrics to give visibility into the count of "normal priority" Replication and Reconstruction tasks, along with the current maxStreams setting, which is automatically adjusted when the node moves into decommission.

      This was tested manually in Docker compose by sampling the JMX output. Sample output:

          "name" : "Hadoop:service=HddsDatanode,name=ReplicationSupervisorMetrics",
          "modelerType" : "ReplicationSupervisorMetrics",
          "tag.Hostname" : "b297306fcf0f",
          "numInFlightReplications" : 5,
          "numQueuedReplications" : 4,
          "numRequestedReplications" : 1,
          "numTimeoutReplications" : 0,
          "numSkippedReplications" : 0,
          "maxReplicationStreams" : 1, ** newly added
          "numInflightECReconstructionCoordinatorTask" : 5 ** newly added
      
      
          "name" : "Hadoop:service=HddsDatanode,name=ReplicationSupervisorMetrics",
          "modelerType" : "ReplicationSupervisorMetrics",
          "tag.Hostname" : "b297306fcf0f",
          "numInFlightReplications" : 1,
          "numQueuedReplications" : 0,
          "numRequestedReplications" : 6,
          "numTimeoutReplications" : 0,
          "numSkippedReplications" : 0,
          "maxReplicationStreams" : 1,      ** newly added
          "numInflightReplicationTask" : 1,  ** newly added
          "numInflightECReconstructionCoordinatorTask" : 0 ** newly added
      

      Attachments

        Issue Links

          Activity

            People

              sodonnell Stephen O'Donnell
              sodonnell Stephen O'Donnell
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: