Uploaded image for project: 'Ignite'
  1. Ignite
  2. IGNITE-10015

Sporadic JVM crash due to restart nodes

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • 2.7
    • None
    • general
    • None

    Description

      1. Start 4 node cluster with pre-configured TTL caches.
      2. Some 4 node may crash:

      [22:43:01,485][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_002, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,005][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_013, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,005][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_001, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,005][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_012, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,005][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_004, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,005][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_015, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,005][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_003, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,006][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_014, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,007][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=ignite-sys-cache, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,007][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_011, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,007][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_010, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,008][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_009, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,008][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_006, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,008][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_005, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,008][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_016, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,008][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_008, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,008][INFO][sys-#73][GridCacheProcessor] Finish proxy initialization, cacheName=cache_007, localNodeId=e8ad11f2-c6dd-4683-b449-44a726d715fd
      [22:43:02,012][INFO][db-checkpoint-thread-#68][GridCacheDatabaseSharedManager] Checkpoint started [checkpointId=214d43f2-6096-4b42-ab0f-52b7f98078f4, startPtr=FileWALPointer [idx=0, fileOff=513096, len=16483], checkpointLockWait=0ms, checkpointLockHoldTime=23ms, walCpRecordFsyncDuration=880ms, pages=238, reason='timeout']
      #
      # A fatal error has been detected by the Java Runtime Environment:
      #
      #  SIGBUS (0x7) at pc=0x00007f0aa29d8522, pid=12344, tid=0x00007f08b15f5700
      #
      # JRE version: Java(TM) SE Runtime Environment (8.0_161-b12) (build 1.8.0_161-b12)
      # Java VM: Java HotSpot(TM) 64-Bit Server VM (25.161-b12 mixed mode linux-amd64 compressed oops)
      # Problematic frame:
      # C  [libzip.so+0x12522]  newEntry+0x62
      #
      # Core dump written. Default location: /var/lib/teamcity/data/work/dd4d79acf76cc870/i2test/suites/core or core.12344
      #
      # An error report file with more information is saved as:
      # /var/lib/teamcity/data/work/dd4d79acf76cc870/i2test/suites/hs_err_pid12344.log
      Compiled method (nm)    7845  558     n 0       java.util.zip.ZipFile::getEntry (native)
       total in heap  [0x00007f0a8d3d1850,0x00007f0a8d3d1bc0] = 880
       relocation     [0x00007f0a8d3d1978,0x00007f0a8d3d19c0] = 72
       main code      [0x00007f0a8d3d19c0,0x00007f0a8d3d1bc0] = 512
      [thread 139675315439360 also had an error]
      #
      # If you would like to submit a bug report, please visit:
      #   http://bugreport.java.com/bugreport/crash.jsp
      # The crash happened outside the Java Virtual Machine in native code.
      # See problematic frame for where to report the bug.
      #
      /var/lib/teamcity/data/work/dd4d79acf76cc870/i2test/var/suite-cache/gg-pro-fab/bin/ignite.sh: line 200: 12344 Aborted                 (core dumped) "$JAVA" ${JVM_OPTS} ${QUIET} "${RESTART_SUCCESS_OPT}" ${JMX_MON} -DIGNITE_HOME="${IGNITE_HOME}" -DIGNITE_PROG_NAME="$0" ${JVM_XOPTS} -cp "${CP}" ${MAIN_CLASS} "${CONFIG}"
      

      or

      [22:28:53,182][INFO][sys-#73][GridDhtPartitionsExchangeFuture] Finish exchange future [startVer=AffinityTopologyVersion [topVer=4, minorTopVer=1], resVer=AffinityTopologyVersion [topVer=4, minorTopVer=1], err=null]
      [22:28:53,206][INFO][sys-#76][GridCachePartitionExchangeManager] Delay process full message without exchange id (there is exchange in progress) [nodeId=88aedbe3-b8b0-41b5-a321-d71001f5d98a]
      #
      # A fatal error has been detected by the Java Runtime Environment:
      #
      #  SIGBUS (0x7) at pc=0x00007f47d5fee522, pid=2264, tid=0x00007f45e5079700
      #
      # JRE version: Java(TM) SE Runtime Environment (8.0_161-b12) (build 1.8.0_161-b12)
      # Java VM: Java HotSpot(TM) 64-Bit Server VM (25.161-b12 mixed mode linux-amd64 compressed oops)
      # Problematic frame:
      # C  [libzip.so+0x12522]  newEntry+0x62
      #
      # Core dump written. Default location: /var/lib/teamcity/data/work/dd4d79acf76cc870/i2test/suites/core or core.2264
      #
      # An error report file with more information is saved as:
      # /var/lib/teamcity/data/work/dd4d79acf76cc870/i2test/suites/hs_err_pid2264.log
      Compiled method (nm)    7347  553     n 0       java.util.zip.ZipFile::getEntry (native)
       total in heap  [0x00007f47c13d0410,0x00007f47c13d0780] = 880
       relocation     [0x00007f47c13d0538,0x00007f47c13d0580] = 72
       main code      [0x00007f47c13d0580,0x00007f47c13d0780] = 512
      #
      # If you would like to submit a bug report, please visit:
      #   http://bugreport.java.com/bugreport/crash.jsp
      # The crash happened outside the Java Virtual Machine in native code.
      # See problematic frame for where to report the bug.
      #
      /var/lib/teamcity/data/work/dd4d79acf76cc870/i2test/var/suite-cache/gg-pro-fab/bin/ignite.sh: line 200:  2264 Aborted                 (core dumped) "$JAVA" ${JVM_OPTS} ${QUIET} "${RESTART_SUCCESS_OPT}" ${JMX_MON} -DIGNITE_HOME="${IGNITE_HOME}" -DIGNITE_PROG_NAME="$0" ${JVM_XOPTS} -cp "${CP}" ${MAIN_CLASS} "${CONFIG}"
      

      Attachments

        1. hs_err_pid9126.log
          120 kB
          Sergey Kozlov

        Activity

          People

            skozlov Sergey Kozlov
            skozlov Sergey Kozlov
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: