HBase / HBASE-24292

A "stuck" master should not idle as active without taking action

Details

    • Type: Bug
    • Status: Open
    • Priority: Critical
    • Resolution: Unresolved
    • Affects Version/s: 2.3.0
    • Fix Version/s: None
    • Component/s: master, Region Assignment
    • Labels: None

    Description

      The master schedules an SCP (ServerCrashProcedure) for the region server hosting meta. However, due to a misconfiguration, the cluster cannot make progress. After fixing the configuration issue and restarting, the cluster still cannot make progress. After the configured period (15 minutes), the master enters a "holding pattern" in which it retains active master status but takes no action.

      This "brown-out" state is toxic. The master should either keep trying to make progress or abort. Staying up and doing nothing is the wrong behavior.
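
      The requested behavior (keep retrying, or abort, but never idle as active) could be sketched roughly as follows. This is only an illustration under stated assumptions, not HBase's actual master code: the class `ProgressOrAbort`, its `run` method, and the attempt-count limit are all hypothetical names and choices, and a real fix would likely be driven by the configured timeout rather than a fixed number of attempts.

```java
import java.util.function.BooleanSupplier;

/**
 * Hypothetical sketch: retry an operation a bounded number of times and
 * abort explicitly if it never succeeds, instead of idling as active.
 * None of these names come from the HBase code base.
 */
final class ProgressOrAbort {

    enum Outcome { PROGRESSED, ABORTED }

    /**
     * @param attempt     returns true once the cluster has made progress
     * @param maxAttempts assumed retry budget (stand-in for a configured timeout)
     * @param pauseMillis pause between failed attempts
     * @param abort       callback that stops the process (for example, gives up active status)
     */
    static Outcome run(BooleanSupplier attempt, int maxAttempts,
                       long pauseMillis, Runnable abort) {
        for (int i = 0; i < maxAttempts; i++) {
            if (attempt.getAsBoolean()) {
                return Outcome.PROGRESSED;
            }
            // Only pause when another attempt will follow.
            if (i < maxAttempts - 1) {
                try {
                    Thread.sleep(pauseMillis);
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    break; // treat interruption as "cannot make progress"
                }
            }
        }
        // Budget exhausted: abort rather than stay up doing nothing.
        abort.run();
        return Outcome.ABORTED;
    }
}
```

      The point of the sketch is that both exits are explicit: the method returns either because progress was made or because the abort callback ran, so there is no third path in which the caller stays active while doing nothing.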

          People

            Assignee: Rahul Kumar (rkrahul324)
            Reporter: Nick Dimiduk (ndimiduk)
            Votes: 0
            Watchers: 9

            Dates

              Created:
              Updated: