HBase / HBASE-3472

New import/export jobs based on HFiles

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.92.0
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      Not sure if I'll have time to work on this, but the old SequenceFile-based import/export jobs are not as efficient as ones based on HFiles would be. The idea is:

      • hbase exporthf: TableInputFormat -> Identity -> HFileOutputFormat (should also accept time ranges, etc., for incremental backups)
      • hbase importhf: perhaps just an improvement to the existing completeBulkLoad so that it splits in parallel when the input HFiles don't "fit" the target regions (right now the splitting is single-threaded)
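
      To make the importhf idea concrete, here is a minimal, HBase-free sketch of what "fit" and parallel splitting could mean. Everything in it is an assumption for illustration: the class ParallelSplitSketch and its methods are hypothetical, an input file is modeled as a sorted list of row keys, and regions are modeled by their sorted start keys (the first being the empty string). It does not use or reflect the actual bulk-load code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical illustration only; not part of HBase.
class ParallelSplitSketch {

    // Region i covers [regionStartKeys[i], regionStartKeys[i + 1]).
    // The array must be sorted and its first element must be "".
    static int regionIndexFor(String rowKey, String[] regionStartKeys) {
        int pos = Arrays.binarySearch(regionStartKeys, rowKey);
        // A miss returns -(insertionPoint) - 1; the owning region is the one before it.
        return pos >= 0 ? pos : -pos - 2;
    }

    // A sorted, non-empty file "fits" if its first and last keys land in the same region.
    static boolean fits(List<String> sortedKeys, String[] regionStartKeys) {
        String first = sortedKeys.get(0);
        String last = sortedKeys.get(sortedKeys.size() - 1);
        return regionIndexFor(first, regionStartKeys) == regionIndexFor(last, regionStartKeys);
    }

    // Splits one file into per-region pieces, keyed by region index.
    static Map<Integer, List<String>> splitByRegion(List<String> sortedKeys, String[] regionStartKeys) {
        Map<Integer, List<String>> pieces = new TreeMap<>();
        for (String key : sortedKeys) {
            int region = regionIndexFor(key, regionStartKeys);
            pieces.computeIfAbsent(region, r -> new ArrayList<>()).add(key);
        }
        return pieces;
    }

    // Splits all files concurrently, one task per file, instead of one file at a time.
    static List<Map<Integer, List<String>>> splitAll(
            List<List<String>> files, String[] regionStartKeys, int threads)
            throws InterruptedException, ExecutionException {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Callable<Map<Integer, List<String>>>> tasks = new ArrayList<>();
            for (List<String> file : files) {
                tasks.add(() -> splitByRegion(file, regionStartKeys));
            }
            List<Map<Integer, List<String>>> results = new ArrayList<>();
            // invokeAll returns futures in the same order as the submitted tasks.
            for (Future<Map<Integer, List<String>>> f : pool.invokeAll(tasks)) {
                results.add(f.get());
            }
            return results;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        String[] regionStartKeys = {"", "g", "p"};
        List<List<String>> files = Arrays.asList(
                Arrays.asList("a", "b", "f"),
                Arrays.asList("e", "h", "q"));
        for (List<String> file : files) {
            System.out.println(file + " fits: " + fits(file, regionStartKeys));
        }
        System.out.println(splitAll(files, regionStartKeys, 2));
    }
}
```

      In this toy model a file that fits comes back as a single piece, while a file that straddles region boundaries is broken into one piece per region. A real implementation would operate on HFiles and region metadata rather than in-memory key lists.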

      This would be a good project for a new contributor!

        Activity

        stack added a comment -

        Moving out of 0.92.0. Pull it back in if you think different.


          People

          • Assignee: Unassigned
          • Reporter: Todd Lipcon
          • Votes: 0
          • Watchers: 3
