  1. Apache Helix
  2. HELIX-444

add per-participant partition count gauges to helix

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.7.1, 0.6.4
    • Component/s: None
    • Labels: None

    Description

      We need a way to separate the partitions that are known to be down from the DifferenceWithIdealState count when an instance is offline, so that the alert volume is reduced to the down-instance notification alone. Without metrics from Helix indicating the number of partitions hosted on a given participant, we can't reason about which "DifferenceWithIdealState" counts are expected to be down and which reflect an actual difference caused by something other than a node outage.
      These should be produced on a per-participant, per-resource basis (i.e., helix.i001.participantstatus.cluster.host.db.partitiongauge = 64 or whatever).
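
The arithmetic behind this request can be sketched as follows. This is only an illustration in plain Python, not Helix code: the `placement` mapping, the function names, and the assumption that DifferenceWithIdealState counts one unit per missing partition replica are all hypothetical, and the metric name simply follows the pattern given in the description.

```python
from collections import Counter


def partition_counts(placement):
    """Count hosted partitions per (instance, resource).

    `placement` is a hypothetical mapping of
    resource -> partition -> list of instances hosting a replica.
    """
    counts = Counter()
    for resource, partitions in placement.items():
        for instances in partitions.values():
            for instance in instances:
                counts[(instance, resource)] += 1
    return counts


def gauge_name(prefix, cluster, instance, resource):
    # Follows the naming pattern from the description; `prefix` stands in
    # for the leading "helix.i001" part.
    return f"{prefix}.participantstatus.{cluster}.{instance}.{resource}.partitiongauge"


def unexplained_difference(difference, counts, offline_instances, resource):
    """Subtract the partitions expected to be down from the raw difference.

    Assumes (not confirmed by the issue) that the difference counts one
    unit per partition replica that is missing from an offline instance.
    """
    expected_down = sum(counts[(inst, resource)] for inst in offline_instances)
    return max(difference - expected_down, 0)


# Made-up example data: three partitions of "db", two replicas each.
placement = {
    "db": {
        "db_0": ["host1", "host2"],
        "db_1": ["host2", "host3"],
        "db_2": ["host1", "host3"],
    }
}
counts = partition_counts(placement)
gauges = {
    gauge_name("helix.i001", "mycluster", inst, res): n
    for (inst, res), n in counts.items()
}
```

With gauges like these, an alerting rule could compare the reported difference for a resource against the gauge of any offline participant and only fire on the remainder, for example `unexplained_difference(3, counts, {"host2"}, "db")`.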

          People

            dafu Zhen Zhang
            dafu Zhen Zhang
            Votes: 0
            Watchers: 2
