Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-7921

ProcessManager::resume sometimes crashes accessing EventQueue.

    XMLWordPrintableJSON

Details

    Description

      The following segfault is found on ASF in MesosContainerizerSlaveRecoveryTest.ResourceStatistics but it's flaky and shows up in other tests and environments (with or without --enable-lock-free-event-queue) as well.

      Configuration
      ./bootstrap '&&' ./configure --verbose '&&' make -j6 distcheck
      
      *** Aborted at 1503937885 (unix time) try "date -d @1503937885" if you are using GNU date ***
      PC: @     0x2b9e2581caa0 process::EventQueue::Consumer::empty()
      *** SIGSEGV (@0x8) received by PID 751 (TID 0x2b9e31978700) from PID 8; stack trace: ***
          @     0x2b9e29d26330 (unknown)
          @     0x2b9e2581caa0 process::EventQueue::Consumer::empty()
          @     0x2b9e25800a40 process::ProcessManager::resume()
          @     0x2b9e2580f891 process::ProcessManager::init_threads()::$_9::operator()()
          @     0x2b9e2580f7d5 _ZNSt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvE3$_9vEE9_M_invokeIJEEEvSt12_Index_tupleIJXspT_EEE
          @     0x2b9e2580f7a5 std::_Bind_simple<>::operator()()
          @     0x2b9e2580f77c std::thread::_Impl<>::_M_run()
          @     0x2b9e29fe5a60 (unknown)
          @     0x2b9e29d1e184 start_thread
          @     0x2b9e2a851ffd (unknown)
      make[3]: *** [CMakeFiles/check] Segmentation fault (core dumped)
      

      A builds@mesos.apache.org query shows many such instances: https://lists.apache.org/list.html?builds@mesos.apache.org:lte=1M:process%3A%3AEventQueue%3A%3AConsumer%3A%3Aempty

      Attachments

        Issue Links

          Activity

            People

              bmahler Benjamin Mahler
              xujyan Yan Xu
              Benjamin Hindman Benjamin Hindman
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: