Uploaded image for project: 'Mesos'
  1. Mesos
  2. MESOS-2531

Libmesos terminates JVM

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • 0.23.0
    • None
    • java api
    • None

    Description

      I have build Mesos from scratch using code available on GitHub, revision #a12242b.

      My Mesos cluster runs on MacOS and consists of one master and three slaves - all running on the same computer but on different ports. ZooKeeper runs also on the same computer.

      Later on I compiled Marathon also using latest version from GitHub, revision #6decf76. Marathon uses same ZooKeeper instance and successfully connects to Mesos cluster.

      After deploying simple application that runs sleep command for 120 seconds and scaling that application to ten my Marathon crushed killed by JVM after SIGSEGV in libmesos-0.23.0.dylib.

      Log

      [2015-03-23 15:47:17,872] INFO Computed new deployment plan: DeploymentPlan(2015-03-23T14:47:17.823Z, (Step(List(Scale(App(/bar, Some(sleep 120))), 10))))) (mesosphere.marathon.upgrade.DeploymentPlan$:263)
      [2015-03-23 15:47:17,876] INFO Deployment acknowledged. Waiting to get processed: DeploymentPlan(2015-03-23T14:47:17.823Z, (Step(List(Scale(App(/bar, Some(sleep 120))), 10))))) (mesosphere.marathon.state.GroupManager:142)
      [2015-03-23 15:47:17,877] INFO 127.0.0.1 -  -  [23/mar/2015:14:47:17 +0000] "PUT /v2/apps//bar HTTP/1.1" 200 92 "http://127.0.0.1:8080/" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
      [2015-03-23 15:47:17,918] INFO 127.0.0.1 -  -  [23/mar/2015:14:47:17 +0000] "GET /v2/apps//bar/versions HTTP/1.1" 200 68 "http://127.0.0.1:8080/" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
      [2015-03-23 15:47:20,722] INFO 127.0.0.1 -  -  [23/mar/2015:14:47:20 +0000] "GET /v2/apps HTTP/1.1" 200 592 "http://127.0.0.1:8080/" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
      [2015-03-23 15:47:20,782] INFO Received status update for task bar.82501637-d16b-11e4-b7fa-aa4dda3d2dbb: TASK_RUNNING () (mesosphere.marathon.MarathonScheduler:149)
      [2015-03-23 15:47:20,790] INFO 127.0.0.1 -  -  [23/mar/2015:14:47:20 +0000] "GET /v2/deployments HTTP/1.1" 200 256 "http://127.0.0.1:8080/" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
      #
      # A fatal error has been detected by the Java Runtime Environment:
      #
      #  SIGSEGV (0xb) at pc=0x000000012ec946f7, pid=98294, tid=27651
      #
      # JRE version: Java(TM) SE Runtime Environment (8.0-b132) (build 1.8.0-b132)
      # Java VM: Java HotSpot(TM) 64-Bit Server VM (25.0-b70 mixed mode bsd-amd64 compressed oops)
      # Problematic frame:
      # C  [libmesos-0.23.0.dylib+0x7836f7]  process::Future<mesos::internal::state::Variable>::isFailed() const+0x17
      #
      # Failed to write core dump. Core dumps have been disabled. To enable core dumping, try "ulimit -c unlimited" before starting Java again
      #
      # An error report file with more information is saved as:
      # /Users/mkiedys/Downloads/MESOS/marathon/hs_err_pid98294.log
      #
      # If you would like to submit a bug report, please visit:
      #   http://bugreport.sun.com/bugreport/crash.jsp
      # The crash happened outside the Java Virtual Machine in native code.
      # See problematic frame for where to report the bug.
      #
      Abort trap: 6
      

      Java

      java version "1.8.0"
      Java(TM) SE Runtime Environment (build 1.8.0-b132)
      Java HotSpot(TM) 64-Bit Server VM (build 25.0-b70, mixed mode)

      System Software Overview

      • System Version: OS X 10.10.2 (14C109)
      • Kernel Version: Darwin 14.1.0
      • Secure Virtual Memory: Enabled
      • Time since boot: 13 days 11:02

      Attachments

        1. hs_err_pid98294.log
          52 kB
          Michał Kiędyś

        Issue Links

          Activity

            People

              Unassigned Unassigned
              mkiedys Michał Kiędyś
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: