Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-16947

Some improvements for DumpReplicationQueues tool

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Hide
      Add more output for DumpReplicationQueues tool.

      1. Show the dead regionservers under replication/rs znode.
      2. Make a summary about all the queues that belong to peer has been deleted.
      3. Aggregate all regionservers' size of replication queue.
      4. Show the total WALs size and how many WALs which can not found on hdfs.
      Show
      Add more output for DumpReplicationQueues tool. 1. Show the dead regionservers under replication/rs znode. 2. Make a summary about all the queues that belong to peer has been deleted. 3. Aggregate all regionservers' size of replication queue. 4. Show the total WALs size and how many WALs which can not found on hdfs.

    Description

      Recently we met too many replication WALs problem in our production cluster. We need the DumpReplicationQueues tool to analyze the replication queues info in zookeeper. So I backport HBASE-16450 to our branch based 0.98 and did some improvements for it.

      1. Show the dead regionservers under replication/rs znode. When there are too many WALs under znode, it can't be atomic transferred to new rs znode. So the dead rs znode will be leaved on zookeeper.
      2. Make a summary about all the queues that belong to peer has been deleted.
      3. Aggregate all regionservers' size of replication queue. Now the regionserver report ReplicationLoad to master, but there were not a aggregate metrics for replication.
      4. Show how many WALs which can not found on hdfs. But the reason (WAL Not Found) need more time to dig.

      Attachments

        1. HBASE-16947.patch
          15 kB
          Guanghao Zhang
        2. HBASE-16947-branch-1.patch
          15 kB
          Michael Stack
        3. HBASE-16947-branch-1.patch
          15 kB
          Guanghao Zhang
        4. HBASE-16947-branch-1.patch
          15 kB
          Guanghao Zhang
        5. HBASE-16947-v1.patch
          15 kB
          Guanghao Zhang

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            zghao Guanghao Zhang
            zghao Guanghao Zhang
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment