Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-1295 Multi data center replication
  3. HBASE-2223

Handle 10min+ network partitions between clusters

VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.90.0
    • Component/s: Replication
    • Labels:
    • Hadoop Flags:
      Reviewed

      Description

      We need a nice way of handling long network partitions without impacting a master cluster (which pushes the data). Currently it will just retry over and over again.

      I think we could:

      • Stop replication to a slave cluster if it didn't respond for more than 10 minutes
      • Keep track of the duration of the partition
      • When the slave cluster comes back, initiate a MR job like HBASE-2221

      Maybe we want less than 10 minutes, maybe we want this to be all automatic or just the first 2 parts. Discuss.

        Attachments

        1. HBASE-2223.patch
          124 kB
          Jean-Daniel Cryans

        Issue Links

          Activity

            People

            • Assignee:
              jdcryans Jean-Daniel Cryans
              Reporter:
              jdcryans Jean-Daniel Cryans

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment