Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-26913

Replication Observability Framework

    XMLWordPrintableJSON

Details

    Description

      In our production clusters, we have seen cases where data is present in source cluster but not in the sink cluster and 1 case where data is present in sink cluster but not in source cluster. 

      We have internal tools where we take incremental backup every day on both source and sink clusters and we compare the hash of the data in both the backups. We have seen many cases where hash doesn't match which means data is not consistent between source and sink for that given day. The Mean Time To Detect (MTTD) these inconsistencies is atleast 2 days and requires lot of manual debugging.

      We need some tool where we can reduce MTTD and requires less manual debugging.

      I have attached design doc. Huge thanks to bharathv  to come up with this design at my work place.

      Attachments

        Issue Links

          Activity

            People

              shahrs87 Rushabh Shah
              shahrs87 Rushabh Shah
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: