diff --git a/src/main/docbkx/ops_mgt.xml b/src/main/docbkx/ops_mgt.xml index cd6562f..81be6c7 100644 --- a/src/main/docbkx/ops_mgt.xml +++ b/src/main/docbkx/ops_mgt.xml @@ -1122,17 +1122,164 @@ $ for i in `cat conf/regionservers|sort`; do ./bin/graceful_stop.sh --restart -- +
+ Most Important Master Metrics + Note: Counts are usually over the last metrics reporting interval. + + + hbase.master.numRegionServers + Number of live regionservers + + + hbase.master.numDeadRegionServers + Number of dead regionservers + + + hbase.master.ritCount + The number of regions in transition + + + hbase.master.ritCountOverThreshold + The number of regions that have been in transition longer than + a threshold time (default: 60 seconds) + + + hbase.master.ritOldestAge + The age of the longest region in transition, in milliseconds + + + +
Most Important RegionServer Metrics - Previously, this section contained a list of the most important RegionServer metrics. - However, the list was extremely out of date. In some cases, the name of a given metric has - changed. In other cases, the metric seems to no longer be exposed. An effort is underway to - create automatic documentation for each metric based upon information pulled from its - implementation. + Note: Counts are usually over the last metrics reporting interval. + + + hbase.regionserver.regionCount + The number of regions hosted by the regionserver + + + hbase.regionserver.storeFileCount + The number of store files on disk currently managed by the + regionserver + + + hbase.regionserver.storeFileSize + Aggregate size of the store files on disk + + + hbase.regionserver.hlogFileCount + The number of write ahead logs not yet archived + + + hbase.regionserver.totalRequestCount + The total number of requests received + + + hbase.regionserver.readRequestCount + The number of read requests received + + + hbase.regionserver.writeRequestCount + The number of write requests received + + + hbase.regionserver.numOpenConnections + The number of open connections at the RPC layer + + + hbase.regionserver.numActiveHandler + The number of RPC handlers actively servicing requests + + + hbase.regionserver.numCallsInGeneralQueue + The number of currently enqueued user requests + + + hbase.regionserver.numCallsInReplicationQueue + The number of currently enqueued operations received from + replication + + + hbase.regionserver.numCallsInPriorityQueue + The number of currently enqueued priority (internal housekeeping) + requests + + + hbase.regionserver.flushQueueLength + Current depth of the memstore flush queue. If increasing, we are falling + behind with clearing memstores out to HDFS. + + + hbase.regionserver.updatesBlockedTime + Number of milliseconds updates have been blocked so the memstore can be + flushed + + + hbase.regionserver.compactionQueueLength + Current depth of the compaction request queue. If increasing, we are + falling behind with storefile compaction. + + + hbase.regionserver.blockCacheHitCount + The number of block cache hits + + + hbase.regionserver.blockCacheMissCount + The number of block cache misses + + + hbase.regionserver.blockCacheExpressHitPercent + The percent of the time that requests with the cache turned on hit the + cache + + + hbase.regionserver.percentFilesLocal + Percent of store file data that can be read from the local DataNode, + 0-100 + + + hbase.regionserver.<op>_<measure> + Operation latencies, where <op> is one of Append, Delete, Mutate, + Get, Replay, Increment; and where <measure> is one of min, max, mean, median, + 75th_percentile, 95th_percentile, 99th_percentile + + + hbase.regionserver.slow<op>Count + The number of operations we thought were slow, where <op> is one + of the list above + + + hbase.regionserver.GcTimeMillis + Time spent in garbage collection, in milliseconds + + + hbase.regionserver.GcTimeMillisParNew + Time spent in garbage collection of the young generation, in + milliseconds + + + hbase.regionserver.GcTimeMillisConcurrentMarkSweep + Time spent in garbage collection of the old generation, in + milliseconds + + + hbase.regionserver.authenticationSuccesses + Number of client connections where authentication succeeded + + + hbase.regionserver.authenticationFailures + Number of client connection authentication failures + + + hbase.regionserver.mutationsWithoutWALCount + Count of writes submitted with a flag indicating they should bypass the + write ahead log + +
-
HBase Monitoring