Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-16302

age of last shipped op and age of last applied op should be histograms

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.4.0, 2.0.0
    • Replication
    • None
    • Reviewed

    Description

      Replication exports metric ageOfLastShippedOp as an indication of how much replication is lagging. But, with multiwal enabled, it's not representative because replication could be lagging for a long time for one wal group (something wrong with a particular region) while being fine for others. The ageOfLastShippedOp becomes a useless metric for alerting in such a case.

      Also, since there is no mapping between individual replication sources and replication sinks, the age of last applied op can be a highly spiky metric if only certain replication sources are lagging.

      We should use histograms for these metrics and use maximum value of this histogram to report replication lag when building stats.

      Attachments

        1. HBASE-16302.patch.v0.patch
          9 kB
          Ashu Pachauri

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            ashu210890 Ashu Pachauri
            ashu210890 Ashu Pachauri
            Votes:
            1 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment