Uploaded image for project: 'Traffic Server'
  1. Traffic Server
  2. TS-1151

in some strange situation, cop will crash

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 3.3.4, 3.2.4, 3.0.4
    • 3.3.5, 3.1.4
    • Cop
    • None

    Description

      we get some strange crash, the manager & cop may die, we are not sure what that is, but I'd like to start one Issue here if we have other same issue.

      here is the log in /var/log/messages

      Mar 19 10:08:24 cache172.cn77 kernel:: [1553138.961401] [ET_NET 2][17949]: segfault at 2aadf1387937 ip 0000003c5bc7bdbe sp 00000000410f3188 error 4 in libc-2.5.so[3c5bc00000+14d000]
      Mar 19 10:08:27 cache172.cn77 traffic_manager[17935]: {0x7ff0c8d51720} FATAL: [LocalManager::pollMgmtProcessServer] Error in read (errno: 104)
      Mar 19 10:08:27 cache172.cn77 traffic_manager[17935]: {0x7ff0c8d51720} FATAL:  (last system error 104: Connection reset by peer)
      Mar 19 10:08:27 cache172.cn77 traffic_manager[17935]: {0x7ff0c8d51720} ERROR: [LocalManager::sendMgmtMsgToProcesses] Error writing message
      Mar 19 10:08:27 cache172.cn77 traffic_manager[17935]: {0x7ff0c8d51720} ERROR:  (last system error 32: Broken pipe)
      Mar 19 10:08:33 cache172.cn77 traffic_cop[17933]: cop received child status signal [17935 2816]
      Mar 19 10:08:33 cache172.cn77 traffic_cop[17933]: traffic_manager not running, making sure traffic_server is dead
      Mar 19 10:08:33 cache172.cn77 traffic_cop[17933]: spawning traffic_manager
      Mar 19 10:08:40 cache172.cn77 traffic_manager[2760]: NOTE: --- Manager Starting ---
      Mar 19 10:08:40 cache172.cn77 traffic_manager[2760]: NOTE: Manager Version: Apache Traffic Server - traffic_manager - 3.0.2 - (build # 299 on Mar  9 2012 at 09:55:44)
      Mar 19 10:08:40 cache172.cn77 traffic_manager[2760]: {0x7fd03d265720} STATUS: opened /var/log/trafficserver/manager.log
      Mar 19 10:08:46 cache172.cn77 traffic_cop[17933]: (cli test) unable to retrieve manager_binary
      Mar 19 10:08:54 cache172.cn77 traffic_server[2789]: NOTE: --- Server Starting ---
      Mar 19 10:08:54 cache172.cn77 traffic_server[2789]: NOTE: Server Version: Apache Traffic Server - traffic_server - 3.0.2 - (build # 299 on Mar  9 2012 at 09:56:00)
      Mar 19 10:09:00 cache172.cn77 traffic_server[2789]: {0x2b5a8ef03970} STATUS: opened /var/log/trafficserver/diags.log
      Mar 19 10:14:02 cache172.cn77 kernel:: [1553476.364204] [ET_NET 0][2789]: segfault at 2aab1fa99ce3 ip 0000003c5bc7bdbe sp 00007fff39743fa8 error 4 in libc-2.5.so[3c5bc00000+14d000]
      Mar 19 10:14:03 cache172.cn77 traffic_manager[2760]: {0x7fd03d265720} FATAL: [LocalManager::pollMgmtProcessServer] Error in read (errno: 104)
      Mar 19 10:14:03 cache172.cn77 traffic_manager[2760]: {0x7fd03d265720} FATAL:  (last system error 104: Connection reset by peer)
      Mar 19 10:14:03 cache172.cn77 traffic_manager[2760]: {0x7fd03d265720} ERROR: [LocalManager::sendMgmtMsgToProcesses] Error writing message
      Mar 19 10:14:03 cache172.cn77 traffic_manager[2760]: {0x7fd03d265720} ERROR:  (last system error 32: Broken pipe)
      

      here is the message in traffic.out

      Mar 19 10:11:06 cache162.cn77 kernel:: [2510081.212455] [ET_NET 3][319]: segfault at 2aaae6e986bc ip 0000003f7f27bdbe sp 0000000040be2188 error 4 in libc-2.5.so[3f7f200000+14d000]
      Mar 19 10:11:09 cache162.cn77 traffic_manager[305]: {0x7fd3a665c720} FATAL: [LocalManager::pollMgmtProcessServer] Error in read (errno: 104)
      Mar 19 10:11:09 cache162.cn77 traffic_manager[305]: {0x7fd3a665c720} FATAL:  (last system error 104: Connection reset by peer)
      Mar 19 10:11:09 cache162.cn77 traffic_manager[305]: {0x7fd3a665c720} ERROR: [LocalManager::sendMgmtMsgToProcesses] Error writing message
      Mar 19 10:11:09 cache162.cn77 traffic_manager[305]: {0x7fd3a665c720} ERROR:  (last system error 32: Broken pipe)
      Mar 19 10:11:09 cache162.cn77 traffic_cop[303]: cop received child status signal [305 2816]
      Mar 19 10:11:09 cache162.cn77 traffic_cop[303]: traffic_manager not running, making sure traffic_server is dead
      Mar 19 10:11:09 cache162.cn77 traffic_cop[303]: spawning traffic_manager
      Mar 19 10:11:16 cache162.cn77 traffic_manager[1227]: NOTE: --- Manager Starting ---
      Mar 19 10:11:16 cache162.cn77 traffic_manager[1227]: NOTE: Manager Version: Apache Traffic Server - traffic_manager - 3.0.2 - (build # 299 on Mar  9 2012 at 09:55:44)
      Mar 19 10:11:16 cache162.cn77 traffic_manager[1227]: {0x7f8ae2f48720} STATUS: opened /var/log/trafficserver/manager.log
      Mar 19 10:11:23 cache162.cn77 traffic_cop[303]: (cli test) unable to retrieve manager_binary
      Mar 19 10:11:39 cache162.cn77 traffic_server[1260]: NOTE: --- Server Starting ---
      Mar 19 10:11:39 cache162.cn77 traffic_server[1260]: NOTE: Server Version: Apache Traffic Server - traffic_server - 3.0.2 - (build # 299 on Mar  9 2012 at 09:56:00)
      Mar 19 10:11:46 cache162.cn77 traffic_server[1260]: {0x2ad4afd3d970} STATUS: opened /var/log/trafficserver/diags.log
      Mar 19 10:15:06 cache162.cn77 kernel:: [2510320.713808] [ET_NET 3][1277]: segfault at 2aab1cfa6a03 ip 0000003f7f27bdbe sp 000000004141c188 error 4 in libc-2.5.so[3f7f200000+14d000]
      Mar 19 10:15:06 cache162.cn77 traffic_manager[1227]: {0x7f8ae2f48720} ERROR: [LocalManager::pollMgmtProcessServer] Server Process terminated due to Sig 11: Segmentation fault
      Mar 19 10:15:06 cache162.cn77 traffic_manager[1227]: {0x7f8ae2f48720} ERROR:  (last system error 2: No such file or directory)
      Mar 19 10:15:06 cache162.cn77 traffic_manager[1227]: {0x7f8ae2f48720} ERROR: [Alarms::signalAlarm] Server Process was reset
      Mar 19 10:15:06 cache162.cn77 traffic_manager[1227]: {0x7f8ae2f48720} ERROR:  (last system error 2: No such file or directory)
      Mar 19 10:15:08 cache162.cn77 traffic_server[2412]: NOTE: --- Server Starting ---
      Mar 19 10:15:08 cache162.cn77 traffic_server[2412]: NOTE: Server Version: Apache Traffic Server - traffic_server - 3.0.2 - (build # 299 on Mar  9 2012 at 09:56:00)
      Mar 19 10:15:08 cache162.cn77 traffic_server[2412]: {0x2af4c2ad5970} STATUS: opened /var/log/trafficserver/diags.log
      Mar 19 10:54:53 cache162.cn77 ops.hdmon.power: [ OK ] Power Unit PSU 1: OK;Power Unit PSU 2: OK.
      

      Attachments

        Issue Links

          Activity

            People

              portl4t portl4t
              zym Zhao Yongming
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: