Uploaded image for project: 'Apache HAWQ'
  1. Apache HAWQ
  2. HAWQ-978

long running query got hang on master and can't be terminated

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 2.0.0.0-incubating
    • 2.1.0.0-incubating
    • Query Execution
    • None

    Description

      One backend process on master had been running for several days and can't be terminated.
      The session is idle on all segments but master instance.

      pstack/strace/back trace of the backend process.

      [gpadmin@alpmdwgp1prd ~]$ pstack 423984
      Thread 2 (Thread 0x7f0457844700 (LWP 424026)):
      #0  0x00007f04756670d3 in poll () from /lib64/libc.so.6
      #1  0x0000000000b90114 in rxThreadFunc ()
      #2  0x00007f0475e889d1 in start_thread () from /lib64/libpthread.so.0
      #3  0x00007f04756708fd in clone () from /lib64/libc.so.6
      Thread 1 (Thread 0x7f047862b720 (LWP 423984)):
      #0  0x00007f047568005e in __lll_lock_wait_private () from /lib64/libc.so.6
      #1  0x00007f0475604dc0 in _L_lock_5199 () from /lib64/libc.so.6
      #2  0x00007f047560071b in _int_free () from /lib64/libc.so.6
      #3  0x0000000000b1be91 in gp_free2 ()
      #4  0x0000000000b10acc in AllocSetDelete ()
      #5  0x0000000000b1468b in MemoryContextDeleteImpl ()
      #6  0x0000000000aaf0f1 in RelationDestroyRelation ()
      #7  0x0000000000ab60f2 in RelationCacheInvalidate ()
      #8  0x0000000000aa9453 in InvalidateSystemCaches ()
      #9  0x0000000000937eeb in ReceiveSharedInvalidMessages ()
      #10 0x000000000093c295 in LockRelationOid ()
      #11 0x00000000004d8afd in heap_open ()
      #12 0x0000000000aa46d4 in SearchCatCache ()
      #13 0x00000000005c6512 in caql_getnext ()
      #14 0x0000000000749153 in sql_exec_error_callback ()
      #15 0x0000000000ad6e5a in errfinish ()
      #16 0x0000000000ad8ed9 in elog_finish ()
      #17 0x0000000000944e6b in handle_sig_alarm ()
      #18 <signal handler called>
      #19 0x00007f047560168f in _int_malloc () from /lib64/libc.so.6
      #20 0x00007f04756026b1 in malloc () from /lib64/libc.so.6
      #21 0x0000000000b1c2c1 in gp_malloc ()
      #22 0x0000000000b1259c in AllocSetAlloc ()
      #23 0x0000000000b15f5d in MemoryContextAllocZeroImpl ()
      #24 0x0000000000b6cb4f in initMotionLayerStructs ()
      #25 0x00000000007275e0 in ExecutorStart ()
      #26 0x0000000000749a2e in fmgr_sql ()
      #27 0x000000000072e316 in ExecMakeFunctionResultNoSets ()
      #28 0x000000000072e129 in ExecMakeFunctionResultNoSets ()
      #29 0x0000000000733312 in ExecProject ()
      #30 0x00000000007602c7 in ExecHashJoin ()
      #31 0x000000000072ca84 in ExecProcNode ()
      #32 0x000000000076bf38 in ExecSort ()
      #33 0x000000000072caa6 in ExecProcNode ()
      #34 0x000000000072199c in ExecutePlan ()
      #35 0x00000000007221a8 in ExecutorRun ()
      #36 0x0000000000971e09 in PortalRun ()
      #37 0x0000000000966968 in exec_simple_query ()
      #38 0x0000000000969ab9 in PostgresMain ()
      #39 0x00000000008c707e in ServerLoop ()
      #40 0x00000000008c9e20 in PostmasterMain ()
      #41 0x00000000007c85af in main ()
      

      Attachments

        Issue Links

          Activity

            People

              mli Ming Li
              mli Ming Li
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: