Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Won't Fix
-
None
-
None
-
None
Description
This copier is a modification of the distcp tool in HDFS. It does the following:
1. List out all the regions in the HBase cluster for the required table
2. Write the above out to a file
3. Each mapper
3.1 lists all the HFiles for a given region by querying the regionserver
3.2 copies all the HFiles
3.3 outputs success if the copy succeeded, failure otherwise. Failed regions are retried in another loop
4. Mappers are placed on nodes which have maximum locality for a given region to speed up copying
Attachments
Attachments
Issue Links
- depends upon
-
HBASE-4661 Ability to export the list of files for a some or all column families for a given region
- Closed
- is a clone of
-
HBASE-4663 MR based copier for copying HFiles
- Closed