Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
0.90.6
-
None
-
None
-
Incompatible change, Reviewed
-
Set hbase.regionserver.catalog.timeout and hbase.master.catalog.timeout default down from Integer.MAX_VALUE to 10 minutes.
Description
We are seeing the timeout value of 2147483647ms which is ~24days. That seems a little high for not talking to META.
2012-08-27 21:57:04,572 INFO org.apache.hadoop.hbase.regionserver.CompactSplitThread: Running rollback/cleanup of failed split of table,PGE:3659323005:READ:\x7F\xFF\xFE\xC
6\xC9\x9DS\x7F,1346030679280.94cf5ab361b0e7d92b0b263ffb995852.; Timed out (2147483647ms)
org.apache.hadoop.hbase.NotAllMetaRegionsOnlineException: Timed out (2147483647ms)
at org.apache.hadoop.hbase.catalog.CatalogTracker.waitForMeta(CatalogTracker.java:390)
at org.apache.hadoop.hbase.catalog.CatalogTracker.waitForMetaServerConnectionDefault(CatalogTracker.java:422)
at org.apache.hadoop.hbase.catalog.MetaEditor.offlineParentInMeta(MetaEditor.java:109)
at org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:290)
at org.apache.hadoop.hbase.regionserver.CompactSplitThread.split(CompactSplitThread.java:156)
at org.apache.hadoop.hbase.regionserver.CompactSplitThread.run(CompactSplitThread.java:87)
2012-08-27 21:57:04,585 FATAL org.apache.hadoop.hbase.regionserver.HRegionServer: ABORTING region server serverName=hdpr002.va.cust.it,60020,1345819108288, load=(requests=102, regions=187, usedHeap=4699, maxHeap=12281): Abort; we got an error after point-of-no-return