Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-8783

RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong server name

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 0.94.8, 0.95.1
    • 0.98.0, 0.95.2, 0.94.9
    • snapshots
    • None

    Description

      The ZKProcedureMemberRpcs of the RegionServerSnapshotManager may be initialized with the wrong memberName.

      2013-06-21 05:03:41,732 DEBUG org.apache.hadoop.hbase.regionserver.HRegionServer: Initialize Snapshot Manager
      ...
      2013-06-21 05:03:41,875 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: Master passed us hostname to use. Was=0.0.0.0, Now=srv-5.test.cloudera.com
      

      The Region Server Name is used as memberName, but since the snapshot manger is initialized before the RS receives the server name used by the master, the zkprocedure will use the wrong name (0.0.0.0).
      This will case the snapshot to fail with a TimeoutException since the master will not receive the expected RS

      Master:
      ZKProcedureCoordinatorRpcs: Watching for acquire node:/hbase/online-snapshot/acquired/foo23/srv-5.test.cloudera.com,60020,1371813451915
      
      RS:
      ZKProcedureMemberRpcs: Member: '0.0.0.0,60020,1371814996779' joining acquired barrier for procedure (foo23) in zk
      
      ...
      org.apache.hadoop.hbase.errorhandling.TimeoutException: Timeout elapsed! Source:Timeout caused Foreign Exception Start:1371798732141, End:1371798792141, diff:60000, max:60000 ms
      

      Attachments

        1. HBASE-8783-0.94-v0.patch
          12 kB
          Matteo Bertozzi
        2. HBASE-8783-v0.patch
          13 kB
          Matteo Bertozzi
        3. HBASE-8783-v1.patch
          12 kB
          Matteo Bertozzi
        4. HBASE-8783-0.94-v1.patch
          11 kB
          Matteo Bertozzi

        Activity

          People

            mbertozzi Matteo Bertozzi
            mbertozzi Matteo Bertozzi
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: