Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-50

Snapshot of table

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Minor
    • Resolution: Invalid
    • 0.95.2
    • 0.95.0
    • None
    • 0.96notable

    Description

      Havening an option to take a snapshot of a table would be vary useful in production.

      What I would like to see this option do is do a merge of all the data into one or more files stored in the same folder on the dfs. This way we could save data in case of a software bug in hadoop or user code.

      The other advantage would be to be able to export a table to multi locations. Say I had a read_only table that must be online. I could take a snapshot of it when needed and export it to a separate data center and have it loaded there and then i would have it online at multi data centers for load balancing and failover.

      I understand that hadoop takes the need out of havening backup to protect from failed servers, but this does not protect use from software bugs that might delete or alter data in ways we did not plan. We should have a way we can roll back a dataset.

      Attachments

        1. Snapshot Class Diagram.png
          91 kB
          Li Chongxin
        2. HBase Snapshot Implementation Plan.pdf
          176 kB
          Li Chongxin
        3. HBase Snapshot Design Report V3.pdf
          98 kB
          Li Chongxin
        4. HBase Snapshot Design Report V2.pdf
          54 kB
          Li Chongxin

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            lichongxin Li Chongxin
            viper799 Billy Pearson
            Votes:
            2 Vote for this issue
            Watchers:
            31 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment