Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-14420 Zombie Stomping Session
  3. HBASE-14819

hbase-it tests failing with OOME; permgen

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.2.0, 1.3.0, 2.0.0
    • test
    • None

    Description

      Let me up the heap used when failsafe forks.

      Here is example OOME doing ITBLL:

      2015-11-16 03:09:15,073 INFO  [Thread-694] actions.BatchRestartRsAction(69): Starting region server:asf905.gq1.ygridcore.net
      2015-11-16 03:09:15,099 INFO  [Thread-694] client.ConnectionUtils(104): regionserver/asf905.gq1.ygridcore.net/67.195.81.149:0 server-side HConnection retries=350
      2015-11-16 03:09:15,099 INFO  [Thread-694] ipc.SimpleRpcScheduler(128): Using deadline as user call queue, count=1
      2015-11-16 03:09:15,101 INFO  [Thread-694] ipc.RpcServer$Listener(607): regionserver/asf905.gq1.ygridcore.net/67.195.81.149:0: started 3 reader(s) listening on port=36114
      2015-11-16 03:09:15,103 INFO  [Thread-694] fs.HFileSystem(252): Added intercepting call to namenode#getBlockLocations so can do block reordering using class class org.apache.hadoop.hbase.fs.HFileSystem$ReorderWALBlocks
      2015-11-16 03:09:15,104 INFO  [Thread-694] zookeeper.RecoverableZooKeeper(120): Process identifier=regionserver:36114 connecting to ZooKeeper ensemble=localhost:50139
      2015-11-16 03:09:15,117 DEBUG [Thread-694-EventThread] zookeeper.ZooKeeperWatcher(554): regionserver:361140x0, quorum=localhost:50139, baseZNode=/hbase Received ZooKeeper Event, type=None, state=SyncConnected, path=null
      2015-11-16 03:09:15,118 DEBUG [Thread-694] zookeeper.ZKUtil(492): regionserver:361140x0, quorum=localhost:50139, baseZNode=/hbase Set watcher on existing znode=/hbase/master
      2015-11-16 03:09:15,119 DEBUG [Thread-694] zookeeper.ZKUtil(492): regionserver:361140x0, quorum=localhost:50139, baseZNode=/hbase Set watcher on existing znode=/hbase/running
      2015-11-16 03:09:15,119 DEBUG [Thread-694-EventThread] zookeeper.ZooKeeperWatcher(638): regionserver:36114-0x1510e2c6f1d0029 connected
      2015-11-16 03:09:15,120 INFO  [RpcServer.responder] ipc.RpcServer$Responder(926): RpcServer.responder: starting
      2015-11-16 03:09:15,121 INFO  [RpcServer.listener,port=36114] ipc.RpcServer$Listener(738): RpcServer.listener,port=36114: starting
      2015-11-16 03:09:15,121 DEBUG [Thread-694] ipc.RpcExecutor(115): B.default Start Handler index=0 queue=0
      2015-11-16 03:09:15,121 DEBUG [Thread-694] ipc.RpcExecutor(115): B.default Start Handler index=1 queue=0
      2015-11-16 03:09:15,121 DEBUG [Thread-694] ipc.RpcExecutor(115): B.default Start Handler index=2 queue=0
      2015-11-16 03:09:15,122 DEBUG [Thread-694] ipc.RpcExecutor(115): B.default Start Handler index=3 queue=0
      2015-11-16 03:09:15,122 DEBUG [Thread-694] ipc.RpcExecutor(115): B.default Start Handler index=4 queue=0
      2015-11-16 03:09:15,122 DEBUG [Thread-694] ipc.RpcExecutor(115): Priority Start Handler index=0 queue=0
      2015-11-16 03:09:15,123 DEBUG [Thread-694] ipc.RpcExecutor(115): Priority Start Handler index=1 queue=1
      2015-11-16 03:09:15,123 DEBUG [Thread-694] ipc.RpcExecutor(115): Priority Start Handler index=2 queue=0
      2015-11-16 03:09:15,123 DEBUG [Thread-694] ipc.RpcExecutor(115): Priority Start Handler index=3 queue=1
      2015-11-16 03:09:15,124 DEBUG [Thread-694] ipc.RpcExecutor(115): Priority Start Handler index=4 queue=0
      2015-11-16 03:09:15,124 DEBUG [Thread-694] ipc.RpcExecutor(115): Replication Start Handler index=0 queue=0
      2015-11-16 03:09:15,124 DEBUG [Thread-694] ipc.RpcExecutor(115): Replication Start Handler index=1 queue=0
      2015-11-16 03:09:15,124 DEBUG [Thread-694] ipc.RpcExecutor(115): Replication Start Handler index=2 queue=0
      2015-11-16 03:09:15,761 DEBUG [RS:0;asf905:36114] client.ConnectionManager$HConnectionImplementation(715): connection construction failed
      java.io.IOException: java.lang.OutOfMemoryError: PermGen space
      	at org.apache.hadoop.hbase.client.RegistryFactory.getRegistry(RegistryFactory.java:43)
      	at org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.setupRegistry(ConnectionManager.java:886)
      	at org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.<init>(ConnectionManager.java:692)
      	at org.apache.hadoop.hbase.client.ConnectionUtils$2.<init>(ConnectionUtils.java:154)
      	at org.apache.hadoop.hbase.client.ConnectionUtils.createShortCircuitConnection(ConnectionUtils.java:154)
      	at org.apache.hadoop.hbase.regionserver.HRegionServer.createClusterConnection(HRegionServer.java:689)
      	at org.apache.hadoop.hbase.regionserver.HRegionServer.setupClusterConnection(HRegionServer.java:720)
      	at org.apache.hadoop.hbase.regionserver.HRegionServer.preRegistrationInitialization(HRegionServer.java:733)
      	at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:889)
      	at org.apache.hadoop.hbase.MiniHBaseCluster$MiniHBaseClusterRegionServer.runRegionServer(MiniHBaseCluster.java:156)
      	at org.apache.hadoop.hbase.MiniHBaseCluster$MiniHBaseClusterRegionServer.access$000(MiniHBaseCluster.java:108)
      	at org.apache.hadoop.hbase.MiniHBaseCluster$MiniHBaseClusterRegionServer$1.run(MiniHBaseCluster.java:140)
      	at java.security.AccessController.doPrivileged(Native Method)
      	at javax.security.auth.Subject.doAs(Subject.java:356)
      	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1594)
      	at org.apache.hadoop.hbase.security.User$SecureHadoopUser.runAs(User.java:334)
      	at org.apache.hadoop.hbase.MiniHBaseCluster$MiniHBaseClusterRegionServer.run(MiniHBaseCluster.java:138)
      	at java.lang.Thread.run(Thread.java:745)
      Caused by: java.lang.OutOfMemoryError: PermGen space
      	at sun.misc.Unsafe.defineClass(Native Method)
      	at sun.reflect.ClassDefiner.defineClass(ClassDefiner.java:63)
      	at sun.reflect.MethodAccessorGenerator$1.run(MethodAccessorGenerator.java:399)
      	at sun.reflect.MethodAccessorGenerator$1.run(MethodAccessorGenerator.java:396)
      	at java.security.AccessController.doPrivileged(Native Method)
      	at sun.reflect.MethodAccessorGenerator.generate(MethodAccessorGenerator.java:395)
      	at sun.reflect.MethodAccessorGenerator.generateConstructor(MethodAccessorGenerator.java:94)
      	at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:48)
      	at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
      	at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
      	at java.lang.Class.newInstance(Class.java:383)
      	at org.apache.hadoop.hbase.client.RegistryFactory.getRegistry(RegistryFactory.java:41)
      	... 17 more
      

      Attachments

        1. 14819.branch-1.txt
          2 kB
          Michael Stack
        2. 14819.total.master.changes.patch
          8 kB
          Michael Stack
        3. more.heap.patch
          1 kB
          Michael Stack
        4. 14819.add.maxpermsize.patch
          1 kB
          Michael Stack
        5. 14819.addendum.patch
          0.8 kB
          Michael Stack
        6. 14819v2.txt
          2 kB
          Michael Stack
        7. itag.txt
          1 kB
          Michael Stack
        8. Screen Shot 2015-11-16 at 11.37.41 PM.png
          189 kB
          Michael Stack

        Issue Links

          Activity

            People

              stack Michael Stack
              stack Michael Stack
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: