Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-9740

A corrupt HFile could cause endless attempts to assign the region without a chance of success

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • 0.94.16
    • None
    • None
    • None
    • A corrupt HFile or HFIle missed could cause endless attempts to assign the region without a chance of success

    Description

      As described in HBASE-9737, a corrupt HFile in a region could lead to an assignment storm in the cluster since the Master will keep trying to assign the region to each region server one after another and obviously none will succeed.

      The region server, upon detecting such a scenario should mark the region as "RS_ZK_REGION_FAILED_ERROR" (or something to the effect) in the Zookeeper which should indicate the Master to stop assigning the region until the error has been resolved (via an HBase shell command, probably "assign"?)

      Attachments

        1. patch-9740_0.94.txt
          5 kB
          Ping
        2. HBase-9749_0.94_v3.patch
          14 kB
          Ping
        3. HBase-9749_0.94_v2.patch
          13 kB
          Ping
        4. HBase-9740_0.94_v4.patch
          15 kB
          Ping

        Issue Links

          Activity

            People

              Unassigned Unassigned
              adityakishore Aditya Kishore
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: