Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-18693

adding an option to restore_snapshot to move mob files from archive dir to working dir

Log workAgile BoardRank to TopRank to BottomAttach filesAttach ScreenshotBulk Copy AttachmentsBulk Move AttachmentsAdd voteVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • 2.0.0-alpha-2
    • None
    • mob
    • None

    Description

      Today, there is a single mob region where mob files for all user regions are saved. There could be many files (one million) in a single mob directory. When one mob table is restored or cloned from snapshot, links are created for these mob files. This creates a scaling issue for mob compaction. In mob compaction's select() logic, for each hFileLink, it needs to call NN's getFileStatus() to get the size of the linked hfile. Assume that one such call takes 20ms, 20ms * 1000000 = 6 hours.

      To avoid this overhead, we want to add an option so that restore_snapshot can move mob files from archive dir to working dir. clone_snapshot is more complicated as it can clone a snapshot to a different table so moving that can destroy the snapshot. No option will be added for clone_snapshot.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            huaxiang Hua Xiang Assign to me
            huaxiang Hua Xiang

            Dates

              Created:
              Updated:

              Slack

                Issue deployment