HBase / HBASE-4618 HBase backups / HBASE-4663

MR based copier for copying HFiles

Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: documentation, regionserver
    • Labels: None

    Description

      This copier is a modification of Hadoop's distcp tool. It does the following:

      1. Lists all the regions in the HBase cluster for the required table.
      2. Writes that region list out to a file.
      3. Each mapper:
      3.1 lists all the HFiles for a given region by querying the regionserver;
      3.2 copies all the HFiles;
      3.3 outputs success if the copy succeeded, failure otherwise. Failed regions are retried in another loop.
      4. Mappers are placed on the nodes that have maximum locality for a given region, to speed up copying.
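
The control flow described above can be sketched as follows. This is a minimal illustration, not the actual copier: it runs in a single process instead of as MapReduce tasks, keeps the region list in memory instead of writing it to a file, and every name in it (`pick_host`, `copy_region`, `copy_table`, `max_rounds`, the `list_hfiles` and `copy_file` callbacks) is hypothetical. It also assumes a failed copy surfaces as an `OSError`.

```python
from collections import Counter
from typing import Callable, Iterable, List


def pick_host(block_hosts: List[str]) -> str:
    """Step 4: choose the host that stores the most blocks of a region's HFiles.

    block_hosts holds one entry per (block, replica host) pair, so the most
    frequent host is the one with the best locality for that region.
    """
    return Counter(block_hosts).most_common(1)[0][0]


def copy_region(
    region: str,
    list_hfiles: Callable[[str], Iterable[str]],
    copy_file: Callable[[str, str], None],
) -> bool:
    """Step 3: the work of one mapper for one region.

    Lists the region's HFiles (3.1), copies each one (3.2), and reports
    success or failure (3.3). Any OSError is treated as a failed region.
    """
    try:
        for hfile in list_hfiles(region):
            copy_file(region, hfile)
        return True
    except OSError:
        return False


def copy_table(
    regions: Iterable[str],
    list_hfiles: Callable[[str], Iterable[str]],
    copy_file: Callable[[str, str], None],
    max_rounds: int = 3,  # hypothetical retry limit; the issue does not give one
) -> List[str]:
    """Steps 1-3: copy every region, retrying failed regions in later rounds.

    Returns the regions that still failed after max_rounds attempts.
    """
    pending = list(regions)
    for _ in range(max_rounds):
        if not pending:
            break
        # Keep only the regions whose copy failed; they are retried next round.
        pending = [r for r in pending
                   if not copy_region(r, list_hfiles, copy_file)]
    return pending
```

Note that in this sketch a retried region copies all of its HFiles again, including any that succeeded before the failure. Whether the real tool skips files already copied is not stated in the issue.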

            People

              Assignee: Unassigned
              Reporter: Karthik Ranganathan (karthik.ranga)
              Votes: 0
              Watchers: 5
