Details
- Bug
- Status: Resolved
- Critical
- Resolution: Fixed
- None
- None
- Reviewed
- master
Description
Thousands of spawned threads make a safepoint take 80+ seconds in our Master JVM process.
2019-08-15,19:35:35,861 INFO [main-SendThread(zjy-hadoop-prc-zk02.bj:11000)] org.apache.zookeeper.ClientCnxn: Client session timed out, have not heard from server in 82260ms for sessionid 0x1691332e2d3aae5, closing socket connection and attempting reconnect
The safepoint statistics on the JVM's stdout show 9126 threads in total and a sync time of 80+ seconds:
vmop [threads: total initially_running wait_to_block] [time: spin block sync cleanup vmop] page_trap_count
32358.859: ForceAsyncSafepoint [ 9126 67 474 ] [ 1 28 86596 87 101 ] 0
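As a rough illustration of how to read that row, the sketch below extracts the two bracketed groups and picks out the total thread count and the sync time. The class and method names are hypothetical, and the time columns are assumed to be milliseconds, which is consistent with the 80+ seconds reported above.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class SafepointLine {
    // Matches a bracketed group of whitespace-separated integers, e.g. "[ 9126 67 474 ]".
    private static final Pattern GROUP = Pattern.compile("\\[([\\s\\d]+)\\]");

    /** Returns the integer groups found in one statistics row, in order of appearance. */
    public static List<long[]> parseGroups(String row) {
        List<long[]> groups = new ArrayList<>();
        Matcher m = GROUP.matcher(row);
        while (m.find()) {
            String[] parts = m.group(1).trim().split("\\s+");
            long[] values = new long[parts.length];
            for (int i = 0; i < parts.length; i++) {
                values[i] = Long.parseLong(parts[i]);
            }
            groups.add(values);
        }
        return groups;
    }

    public static void main(String[] args) {
        String row = "32358.859: ForceAsyncSafepoint [ 9126 67 474 ] [ 1 28 86596 87 101 ] 0";
        List<long[]> g = parseGroups(row);
        long totalThreads = g.get(0)[0]; // threads: total
        long syncMillis = g.get(1)[2];   // time: sync (assumed to be milliseconds)
        System.out.println("threads=" + totalThreads + " syncMs=" + syncMillis);
        // prints "threads=9126 syncMs=86596"
    }
}
```

The sync column is the time spent waiting for all threads to reach the safepoint, so it is the value to watch when the thread count grows.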
We also captured a jstack dump:
$ cat 31162.stack.1 | grep 'ForkJoinPool-1-worker' | wc -l
8648
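The same count can be approximated from inside a running JVM, without a jstack dump. The following is a minimal sketch, not the check used in this issue: the class and method names are hypothetical, and it counts only the threads that are alive at the moment of the call.

```java
public class ThreadPrefixCount {
    /** Counts live threads in this JVM whose name starts with the given prefix. */
    public static long countByPrefix(String prefix) {
        // getAllStackTraces() returns a snapshot keyed by every live thread.
        return Thread.getAllStackTraces().keySet().stream()
                .filter(t -> t.getName().startsWith(prefix))
                .count();
    }

    public static void main(String[] args) {
        // The prefix below is the one grepped for in the jstack dump above.
        long workers = countByPrefix("ForkJoinPool-1-worker");
        System.out.println("ForkJoinPool-1 workers: " + workers);
    }
}
```

Note that `Thread.getAllStackTraces()` collects a stack trace for every thread, so calling it frequently on a process that already has thousands of threads adds load of its own.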
This is a dangerous bug, so I am marking it as a blocker.
Attachments
Issue Links
- is caused by: HBASE-18309 Support multi threads in CleanerChore (Resolved)
- is related to: HBASE-22912 [Backport] HBASE-22867 to branch-1 to avoid ForkJoinPool to spawn thousands of threads (Resolved)
- relates to: HBASE-22871 Move the DirScanPool out and do not use static field (Resolved)
- links to