Hadoop HDFS / HDFS-5442

Zero loss HDFS data replication for multiple datacenters


Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved

    Description

      Hadoop is architected to operate efficiently at scale and to tolerate the normal hardware failures that occur within a datacenter, but it is not designed today to handle the failure of a datacenter itself. Although HDFS is neither designed for nor deployed in configurations spanning multiple datacenters, replicating data from one location to another is common practice for disaster recovery and global service availability. Current solutions handle batch replication with data copy/export tools (e.g., DistCp). While these provide some backup capability for HDFS data, they cannot recover all of your HDFS data after a datacenter failure and have a fully operational Hadoop cluster up and running again in another datacenter in a matter of minutes. For disaster recovery from a datacenter failure, we should provide a fully distributed, zero-data-loss, low-latency, high-throughput, and secure HDFS data replication solution for multi-datacenter setups.
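      To make the gap concrete: the batch approach above typically amounts to scheduling periodic DistCp runs between the two clusters. The sketch below illustrates this, assuming the Hadoop 2.x DistCp Java API (org.apache.hadoop.tools.DistCp); the namenode addresses and paths are hypothetical placeholders.

```java
import java.util.Arrays;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.tools.DistCp;
import org.apache.hadoop.tools.DistCpOptions;

// Sketch of cross-datacenter batch replication with DistCp. Cluster
// addresses and paths are placeholders; a real deployment would run this
// on a schedule (e.g., from cron or Oozie) chosen to match the acceptable
// recovery-point objective.
public class CrossDcBatchCopy {
    public static void main(String[] args) throws Exception {
        Path source = new Path("hdfs://nn-dc1:8020/data/important");
        Path target = new Path("hdfs://nn-dc2:8020/data/important");

        // Hadoop 2.x constructor; Hadoop 3 replaced this with a builder.
        DistCpOptions options = new DistCpOptions(Arrays.asList(source), target);
        options.setSyncFolder(true); // copy only what changed since the last run

        // Submits a MapReduce copy job and blocks until it completes.
        new DistCp(new Configuration(), options).execute();
    }
}
```

      Each run copies a point-in-time view of the namespace, so anything written between runs is lost if the source datacenter fails; the achievable recovery point is bounded by the copy interval, which is precisely the gap a continuous, zero-data-loss replication pipeline would close.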

      Design and code for Phase-1 to follow soon.

      Attachments

        1. Disaster Recovery Solution for Hadoop.pdf (1.22 MB, Dian Fu)
        2. Disaster Recovery Solution for Hadoop.pdf (1.10 MB, Haifeng Chen)
        3. Disaster Recovery Solution for Hadoop.pdf (1.11 MB, Haifeng Chen)

      People

        Assignee: Dian Fu (dian.fu)
        Reporter: Avik Dey (avik_dey@yahoo.com)
        Votes: 9
        Watchers: 107

      Dates

        Created:
        Updated: