Uploaded image for project: 'Traffic Server'
  1. Traffic Server
  2. TS-394

taffic_server process sig abort in full cluster mode

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • None
    • 2.1.4
    • Core
    • None
    • ATS in full cluster mode is unusable, the traffic_server process will get sig abort by every request. code in trunk tested.

    Description

      I am trying to setup full cluster mode, but geting connection abort during every request. after tcpdump, it seems that ATS got the correct source file from backend, but do not send out the full file( with tcp reset during http transfer to client), then i am trying to figure out the root cause.

      with debug log enabled in records.config:
      CONFIG proxy.config.diags.debug.enabled INT 1
      CONFIG proxy.config.diags.debug.tags STRING http.|cluster.

      I got the following log from traffic.out:

      [Jun 22 15:19:11.296] Manager

      {139809315796768} ERROR: [LocalManager::pollMgmtProcessServer] Server Process terminated due to Sig 6: Aborted
      [Jun 22 15:19:11.296] Manager {139809315796768}

      ERROR: (last system error 2: No such file or directory)
      [Jun 22 15:19:11.296] Manager

      {139809315796768} ERROR: [Alarms::signalAlarm] Server Process was reset
      [Jun 22 15:19:11.296] Manager {139809315796768}

      ERROR: (last system error 2: No such file or directory)

      after strace traffic_server, I got the following info:

      [pid 19830] 0.000306 <... epoll_wait resumed> {}, 32768, 10) = 0
      [pid 19830] 0.000031 gettimeofday(

      {1277256740, 532763}

      , NULL) = 0
      [pid 19830] 0.000181 write(2, "FATAL: ClusterHandler.cc:2047: failed assert `ntodo >= 0`\n", 58) = 58
      [pid 19830] 0.000076 gettimeofday(

      {1277256740, 533020}

      , NULL) = 0
      [pid 19830] 0.000071 socket(PF_FILE, SOCK_DGRAM|SOCK_CLOEXEC, 0) = 101
      [pid 19830] 0.000062 connect(101,

      {sa_family=AF_FILE, path="/dev/log"}

      , 110) = -1 ENOENT (No such file or directory)
      [pid 19830] 0.000074 close(101) = 0
      [pid 19830] 0.000056 write(2, "/usr/bin/traffic_server", 23) = 23
      [pid 19830] 0.000050 write(2, " - STACK TRACE: \n", 17) = 17
      [pid 19830] 0.000289 futex(0x2b3a08e6a5b0, FUTEX_WAKE_PRIVATE, 2147483647) = 0
      [pid 19830] 0.000251 futex(0x2b3a08b11190, FUTEX_WAKE_PRIVATE, 2147483647) = 0
      [pid 19830] 0.000656 writev(2, [

      {"/usr/bin/traffic_server", 23}

      ,

      {"(", 1}

      ,

      {"ink_fatal_va", 12}

      ,

      {"+0x", 3}

      ,

      {"ab", 2}

      ,

      {")", 1}

      ,

      {"[0x", 3}

      ,

      {"6dcdab", 6}

      ,

      {"]\n", 2}

      ], 9) = 53

      I have fix the bug by comment out ClusterHandler.cc:2047. patch will followed.

      Attachments

        Activity

          People

            zwoop Leif Hedstrom
            zym Zhao Yongming
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: