Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
The current implementation for restore uses WALReplay M/R job. This has performance and stability problems, since it uses HBase client API to insert data. We have to migrate to bulk load approach: generate hfiles directly from snapshot and incremental images. We run separate M/R job for every backup image between last FULL backup and current incremental backup we restore to and for every table in the list (image). If we have 10 tables and 30 days of incremental backup images - this results in 30x10 = 300 M/R jobs. MUST be optimized.
Attachments
Issue Links
- is part of
-
HBASE-14123 HBase Backup/Restore Phase 2
- Closed
- relates to
-
HBASE-15448 HBase Backup Phase 3: Restore optimization 2
- Closed
- requires
-
HBASE-15372 HBase Backup/Restore Phase 2: Keep WAL files in a centralized place to avoid duplication
- Closed