Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-14682

CM restore functionality for regionservers is broken

    XMLWordPrintableJSON

Details

    • Reviewed

    Description

      In the DistributedHBaseCluster.restoreClusterStatus(), we are starting a regionserver for backup masters. Seems to be a copy-paste error from HBASE-12429.

      This is causing further issues on our test rig since we are starting a regionserver where only a backup master were before:

      INFO  [main] hbase.HBaseClusterManager: Executing remote command: ps aux | grep proc_master | grep -v grep | tr -s ' ' | cut -d ' ' -f2 , hostname:10.0.0.99
      INFO  [main] util.Shell: Executing full command [/usr/bin/ssh  -o StrictHostKeyChecking=no 10.0.0.99 "sudo su - hbase -c \"ps aux | grep proc_master | grep -v grep | tr -s ' ' | cut -d ' ' -f2\""]
      INFO  [main] hbase.HBaseClusterManager: Executed remote command, exit code:0 , output:13244
      
      INFO  [main] hbase.HBaseClusterManager: Executing remote command: ps aux | grep proc_regionserver | grep -v grep | tr -s ' ' | cut -d ' ' -f2 , hostname:10.0.0.99
      INFO  [main] util.Shell: Executing full command [/usr/bin/ssh  -o StrictHostKeyChecking=no 10.0.0.99 "sudo su - hbase -c \"ps aux | grep proc_regionserver | grep -v grep | tr -s ' ' | cut -d ' ' -f2\""]
      INFO  [main] hbase.HBaseClusterManager: Executed remote command, exit code:0 , output:
      INFO  [main] hbase.HBaseCluster: Restoring cluster - starting initial region server: 10.0.0.99:16000
      INFO  [main] hbase.HBaseCluster: Starting RS on: 10.0.0.99
      INFO  [main] hbase.HBaseClusterManager: Executing remote command: /usr/hdp/2.3.3.0-2971/hbase/bin/hbase-daemon.sh --config /tmp/hbaseConf start regionserver , hostname:10.0.0.99
      INFO  [main] util.Shell: Executing full command [/usr/bin/ssh  -o StrictHostKeyChecking=no 10.0.0.99 "sudo su - hbase -c \"/usr/hdp/2.3.3.0-2971/hbase/bin/hbase-daemon.sh --config /tmp/hbaseConf start reg
      INFO  [main] hbase.HBaseClusterManager: Executed remote command, exit code:0 , output:starting regionserver, logging to /var/log/hbase/hbase-hbase-regionserver-zk0-hb23-h.out
      
      INFO  [main] hbase.HBaseClusterManager: Executing remote command: ps aux | grep proc_regionserver | grep -v grep | tr -s ' ' | cut -d ' ' -f2 , hostname:10.0.0.126
      INFO  [main] util.Shell: Executing full command [/usr/bin/ssh  -o StrictHostKeyChecking=no 10.0.0.126 "sudo su - hbase -c \"ps aux | grep proc_regionserver | grep -v grep | tr -s ' ' | cut -d ' ' -f2\""]
      INFO  [main] hbase.HBaseClusterManager: Executed remote command, exit code:0 , output:
      INFO  [main] hbase.HBaseCluster: Added new HBaseAdmin
      INFO  [main] hbase.HBaseCluster: Restoring cluster - done
      DEBUG [main] hbase.IntegrationTestBase: Done restoring the cluster
      

      Attachments

        1. hbase-14682_v1.patch
          1.0 kB
          Enis Soztutar

        Activity

          People

            enis Enis Soztutar
            enis Enis Soztutar
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: