Details
- Bug
- Status: Closed
- Major
- Resolution: Fixed
- 2.0.0.0-incubating
- None
Description
One backend process on the master had been running for several days and could not be terminated.
The session is idle on all segments; only the master instance is still busy.
pstack output (backtrace) of the backend process:
[gpadmin@alpmdwgp1prd ~]$ pstack 423984
Thread 2 (Thread 0x7f0457844700 (LWP 424026)):
#0 0x00007f04756670d3 in poll () from /lib64/libc.so.6
#1 0x0000000000b90114 in rxThreadFunc ()
#2 0x00007f0475e889d1 in start_thread () from /lib64/libpthread.so.0
#3 0x00007f04756708fd in clone () from /lib64/libc.so.6
Thread 1 (Thread 0x7f047862b720 (LWP 423984)):
#0 0x00007f047568005e in __lll_lock_wait_private () from /lib64/libc.so.6
#1 0x00007f0475604dc0 in _L_lock_5199 () from /lib64/libc.so.6
#2 0x00007f047560071b in _int_free () from /lib64/libc.so.6
#3 0x0000000000b1be91 in gp_free2 ()
#4 0x0000000000b10acc in AllocSetDelete ()
#5 0x0000000000b1468b in MemoryContextDeleteImpl ()
#6 0x0000000000aaf0f1 in RelationDestroyRelation ()
#7 0x0000000000ab60f2 in RelationCacheInvalidate ()
#8 0x0000000000aa9453 in InvalidateSystemCaches ()
#9 0x0000000000937eeb in ReceiveSharedInvalidMessages ()
#10 0x000000000093c295 in LockRelationOid ()
#11 0x00000000004d8afd in heap_open ()
#12 0x0000000000aa46d4 in SearchCatCache ()
#13 0x00000000005c6512 in caql_getnext ()
#14 0x0000000000749153 in sql_exec_error_callback ()
#15 0x0000000000ad6e5a in errfinish ()
#16 0x0000000000ad8ed9 in elog_finish ()
#17 0x0000000000944e6b in handle_sig_alarm ()
#18 <signal handler called>
#19 0x00007f047560168f in _int_malloc () from /lib64/libc.so.6
#20 0x00007f04756026b1 in malloc () from /lib64/libc.so.6
#21 0x0000000000b1c2c1 in gp_malloc ()
#22 0x0000000000b1259c in AllocSetAlloc ()
#23 0x0000000000b15f5d in MemoryContextAllocZeroImpl ()
#24 0x0000000000b6cb4f in initMotionLayerStructs ()
#25 0x00000000007275e0 in ExecutorStart ()
#26 0x0000000000749a2e in fmgr_sql ()
#27 0x000000000072e316 in ExecMakeFunctionResultNoSets ()
#28 0x000000000072e129 in ExecMakeFunctionResultNoSets ()
#29 0x0000000000733312 in ExecProject ()
#30 0x00000000007602c7 in ExecHashJoin ()
#31 0x000000000072ca84 in ExecProcNode ()
#32 0x000000000076bf38 in ExecSort ()
#33 0x000000000072caa6 in ExecProcNode ()
#34 0x000000000072199c in ExecutePlan ()
#35 0x00000000007221a8 in ExecutorRun ()
#36 0x0000000000971e09 in PortalRun ()
#37 0x0000000000966968 in exec_simple_query ()
#38 0x0000000000969ab9 in PostgresMain ()
#39 0x00000000008c707e in ServerLoop ()
#40 0x00000000008c9e20 in PostmasterMain ()
#41 0x00000000007c85af in main ()
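Reading Thread 1 of the trace: the backend was inside malloc() when SIGALRM arrived, and handle_sig_alarm() then went through elog_finish() and the error callback into _int_free(), which is waiting on a libc lock in __lll_lock_wait_private(). That lock is presumably the one the interrupted malloc() still holds, so the process appears to be deadlocked against itself. malloc() and free() are not async-signal-safe, which makes this a known hazard of doing real work inside a signal handler. The following is a minimal sketch in C of the usual mitigation: the handler only sets a flag, and the work is done later from normal context. It is an illustration under assumptions, not the fix that was applied in HAWQ; the names on_alarm, check_for_timeout and run_demo are hypothetical.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <signal.h>
#include <stdlib.h>
#include <string.h>

/* Set by the handler, consumed by normal (non-signal) code. */
volatile sig_atomic_t timeout_pending = 0;

/* Hypothetical handler: records the event and nothing else.
 * No malloc/free, no logging, no catalog access. */
void on_alarm(int signo)
{
    (void) signo;
    timeout_pending = 1;
}

/* Hypothetical safe point, called from the main code path.
 * Returns 1 if a pending timeout was handled, 0 if none was pending,
 * -1 on allocation failure. */
int check_for_timeout(void)
{
    char *msg;

    if (!timeout_pending)
        return 0;
    timeout_pending = 0;

    /* We are not inside malloc/free here, so allocating is safe. */
    msg = malloc(32);
    if (msg == NULL)
        return -1;
    strcpy(msg, "timeout handled");
    free(msg);
    return 1;
}

/* Installs the handler, raises SIGALRM once, then handles it at the
 * safe point. Returns the result of check_for_timeout(). */
int run_demo(void)
{
    struct sigaction sa;

    memset(&sa, 0, sizeof(sa));
    sa.sa_handler = on_alarm;
    sigemptyset(&sa.sa_mask);
    if (sigaction(SIGALRM, &sa, NULL) != 0)
        return -1;

    assert(check_for_timeout() == 0);  /* nothing pending yet */
    raise(SIGALRM);                    /* handler runs before raise() returns */
    return check_for_timeout();
}
```

Calling run_demo() from a main() function exercises the whole path. The cost of this design is latency: the timeout is only acted on when the main path next reaches a safe point, so such checks need to be placed often enough in long-running code.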
Issue Links
- relates to HAWQ-1324 Query cancel cause segment to go into Crash recovery (Closed)