HBase / HBASE-4855

SplitLogManager hangs on cluster restart due to batch.installed doubly counted

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.92.0
    • Fix Version/s: 0.92.0
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      Steps to reproduce:

        1. Start a master and an RS.
        2. Kill the RS (kill -9).
        3. Wait for ServerShutdownHandler to create the splitlog nodes. Because no RS is running, the nodes cannot be processed.
        4. Restart the master and bring up an RS.
        5. The master hangs in SplitLogManager.waitforTasks().

      I suspect that batch.done is not being incremented properly. I have not yet dug into it fully.

      This may be the reason for occasional failure of TestDistributedLogSplitting.testWorkerAbort().
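
      To illustrate how a doubly counted batch.installed could produce this hang, here is a minimal sketch. It is not the HBase code: the TaskBatch class, the waitForTasks helper, and the simulate method are hypothetical stand-ins, and it assumes the wait loop returns only when done + error equals installed. The sketch adds a timeout so that the stuck case can be observed instead of blocking forever.

```java
class BatchCountSketch {
    /** Hypothetical stand-in for the batch counters named in this issue. */
    static class TaskBatch {
        int installed;
        int done;
        int error;
    }

    /**
     * Assumed wait condition: block until done + error == installed.
     * Returns false if the timeout expires first (a real wait without a
     * timeout would hang at this point).
     */
    static boolean waitForTasks(TaskBatch batch, long timeoutMs) throws InterruptedException {
        long deadline = System.currentTimeMillis() + timeoutMs;
        synchronized (batch) {
            while (batch.done + batch.error != batch.installed) {
                long remaining = deadline - System.currentTimeMillis();
                if (remaining <= 0) {
                    return false;
                }
                batch.wait(remaining);
            }
        }
        return true;
    }

    /** Installs `tasks` tasks, completes each exactly once, then waits for the batch. */
    static boolean simulate(int tasks, boolean doubleCountInstalled) throws InterruptedException {
        TaskBatch batch = new TaskBatch();
        for (int i = 0; i < tasks; i++) {
            batch.installed++;
            if (doubleCountInstalled) {
                // Simulated bug: the same task is counted as installed a second time.
                batch.installed++;
            }
        }
        Thread worker = new Thread(() -> {
            for (int i = 0; i < tasks; i++) {
                synchronized (batch) {
                    batch.done++;       // each task finishes exactly once
                    batch.notifyAll();  // wake the waiter so it re-checks the condition
                }
            }
        });
        worker.start();
        boolean finished = waitForTasks(batch, 200);
        worker.join();
        return finished;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("correct count finished: " + simulate(3, false)); // true
        System.out.println("double count finished:  " + simulate(3, true));  // false
    }
}
```

      With the double count, installed ends at 6 while done can only reach 3, so the wait condition is never satisfied. Under the stated assumption, that matches the symptom of the master never leaving the wait.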

      Attachments

        1. HBASE-4855.patch
          0.5 kB
          ramkrishna.s.vasudevan

          People

            Assignee: ram_krish ramkrishna.s.vasudevan
            Reporter: ram_krish ramkrishna.s.vasudevan
            Votes: 0
            Watchers: 2
