Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-5094

The META can hold an entry for a region with a different server name from the one actually in the AssignmentManager thus making the region inaccessible.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 0.92.0
    • 0.92.0
    • None
    • None

    Description

      RegionState rit = this.services.getAssignmentManager().isRegionInTransition(e.getKey());
                  ServerName addressFromAM = this.services.getAssignmentManager()
                      .getRegionServerOfRegion(e.getKey());
                  if (rit != null && !rit.isClosing() && !rit.isPendingClose()) {
                    // Skip regions that were in transition unless CLOSING or
                    // PENDING_CLOSE
                    LOG.info("Skip assigning region " + rit.toString());
                  } else if (addressFromAM != null
                      && !addressFromAM.equals(this.serverName)) {
                    LOG.debug("Skip assigning region "
                          + e.getKey().getRegionNameAsString()
                          + " because it has been opened in "
                          + addressFromAM.getServerName());
                    }
      

      In ServerShutDownHandler we try to get the address in the AM. This address is initially null because it is not yet updated after the region was opened .i.e. the CAll back after node deletion is not yet done in the master side.
      But removal from RIT is completed on the master side. So this will trigger a new assignment.
      So there is a small window between the online region is actually added in to the online list and the ServerShutdownHandler where we check the existing address in AM.

      Attachments

        1. HBASE-5094_1.patch
          2 kB
          ramkrishna.s.vasudevan

        Activity

          People

            ram_krish ramkrishna.s.vasudevan
            ram_krish ramkrishna.s.vasudevan
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: