Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-1295 Multi data center replication
  3. HBASE-2223

Handle 10min+ network partitions between clusters

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.90.0
    • Replication
    • Reviewed

    Description

      We need a nice way of handling long network partitions without impacting a master cluster (which pushes the data). Currently it will just retry over and over again.

      I think we could:

      • Stop replication to a slave cluster if it didn't respond for more than 10 minutes
      • Keep track of the duration of the partition
      • When the slave cluster comes back, initiate a MR job like HBASE-2221

      Maybe we want less than 10 minutes, maybe we want this to be all automatic or just the first 2 parts. Discuss.

      Attachments

        1. HBASE-2223.patch
          124 kB
          Jean-Daniel Cryans

        Issue Links

          Activity

            People

              jdcryans Jean-Daniel Cryans
              jdcryans Jean-Daniel Cryans
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: