Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-17992

The snapShot TimeoutException causes the cleanerChore thread to fail to complete the archive correctly

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 0.98.10, 1.3.0
    • Fix Version/s: None
    • Component/s: snapshots
    • Labels:
      None

      Description

      The problem is that when the snapshot occurs TimeoutException or other Exceptions, there is no correct delete /hbase/.hbase-snapshot/tmp, which causes the cleanerChore to fail to complete the archive correctly.
      Modifying the configuration parameter (hbase.snapshot.master.timeout.millis = 600000) only reduces the probability of the problem occurring.

      So the solution to the problem is: multi-Threaded exceptions or TimeoutExceptions, the Main-thread must wait until all the tasks are finished or canceled, the Main-thread can be cleared /hbase/.hbase-snapshot/tmp/snapshotName.Otherwise the task is likely to write /hbase/.hbase-snapshot/tmp/snapshotName/region - mainfest

      The problem exists in disabledTableSnapshot and enabledTableSnapshot, because I'm currently using the disabledTableSnapshot, so I provide the patch of disabledTableSnapshot

        Attachments

        1. hbase-17992.patch
          18 kB
          Bo Cui
        2. hbase-17992-0.98.patch
          22 kB
          Bo Cui
        3. hbase-17992-1.3.patch
          23 kB
          Bo Cui
        4. hbase-17992-master.patch
          24 kB
          Bo Cui

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                Bo Cui Bo Cui
              • Votes:
                0 Vote for this issue
                Watchers:
                10 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: