Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-12450

Unbalance chaos monkey might kill all region servers without starting them back

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 0.98.8, 0.99.2
    • None
    • None
    • Reviewed

    Description

      UnbalanceKillAndRebalanceAction does kill, balance and then start of region servers. But if the balance fails exception is thrown causing the region servers to not start. For me, the balance always kept on failing with socket timeout (default 1 min) as master runs one iteration of balance for 5 mins (default config). Eventually all servers are killed but never started back.

      Attachments

        1. HBASE-12450-0.98.patch
          2 kB
          Virag Kothari
        2. HBASE-12450.patch
          2 kB
          Virag Kothari
        3. HBASE-12450.patch
          2 kB
          Virag Kothari

        Activity

          People

            virag Virag Kothari
            virag Virag Kothari
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: