  1. Hive
  2. HIVE-16136

LLAP: Before SIGKILL, collect diagnostic information before daemon goes down

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.2.0
    • Fix Version/s: None
    • Component/s: llap
    • Labels: None

    Description

      Sometimes LLAP daemons are killed by YARN's pmem (physical memory) monitor, which issues a kill (SIGTERM) followed by a kill -9 (SIGKILL) 250 ms later. That window is too short to collect anything useful.
      SIGKILL cannot be trapped in Java, so nothing can run once it arrives.
      One option is to increase the delay between kill and kill -9 in YARN and, during that window, have a shutdown hook collect diagnostic information (heap dump, jstack output, JMX output, etc.) into a non-container directory, so that it is still available after the container is cleaned up.
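
      A minimal sketch of the shutdown hook idea, under stated assumptions. The class DiagnosticsHook and its methods are hypothetical and are not part of Hive or LLAP. The sketch uses only standard JDK management beans; the heap dump relies on the HotSpot-specific HotSpotDiagnosticMXBean and so assumes a HotSpot-based JVM. The YARN-side delay is, as far as I know, controlled by yarn.nodemanager.sleep-delay-before-sigkill.ms (default 250), but that should be checked against the YARN version in use.

```java
import com.sun.management.HotSpotDiagnosticMXBean;
import java.io.IOException;
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

/** Hypothetical helper for illustration only; not part of Hive or LLAP. */
final class DiagnosticsHook {

    /** Writes a jstack-like dump of all live threads into outDir and returns the file. */
    static Path writeThreadDump(Path outDir) throws IOException {
        Files.createDirectories(outDir);
        StringBuilder sb = new StringBuilder();
        // dumpAllThreads(lockedMonitors, lockedSynchronizers)
        for (ThreadInfo info : ManagementFactory.getThreadMXBean().dumpAllThreads(true, true)) {
            sb.append('"').append(info.getThreadName()).append("\" ")
              .append(info.getThreadState()).append('\n');
            for (StackTraceElement frame : info.getStackTrace()) {
                sb.append("    at ").append(frame).append('\n');
            }
            sb.append('\n');
        }
        Path file = outDir.resolve("threads-" + System.currentTimeMillis() + ".txt");
        Files.write(file, sb.toString().getBytes(StandardCharsets.UTF_8));
        return file;
    }

    /** Writes an .hprof heap dump of live objects into outDir (HotSpot JVMs only). */
    static Path writeHeapDump(Path outDir) throws IOException {
        Files.createDirectories(outDir);
        Path file = outDir.resolve("heap-" + System.currentTimeMillis() + ".hprof");
        HotSpotDiagnosticMXBean bean =
                ManagementFactory.getPlatformMXBean(HotSpotDiagnosticMXBean.class);
        // dumpHeap fails if the target file already exists, hence the timestamped name.
        bean.dumpHeap(file.toString(), true);
        return file;
    }

    /** Registers a shutdown hook that runs on SIGTERM (never on SIGKILL). */
    static Thread install(Path outDir) {
        Thread hook = new Thread(() -> {
            try {
                writeThreadDump(outDir); // cheap, so do it first
                writeHeapDump(outDir);   // slow on large heaps; may be cut off by SIGKILL
            } catch (IOException | RuntimeException e) {
                System.err.println("Diagnostics collection failed: " + e);
            }
        }, "diagnostics-shutdown-hook");
        Runtime.getRuntime().addShutdownHook(hook);
        return hook;
    }
}
```

      A daemon would call DiagnosticsHook.install(...) once at startup with a directory outside the container's working directory. The thread dump is taken first because a heap dump of a large daemon is unlikely to finish within the default 250 ms, which is why the YARN delay would need to be raised for this approach to be useful.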

          People

            Assignee: Prasanth Jayachandran (prasanth_j)
            Reporter: Prasanth Jayachandran (prasanth_j)
            Votes: 0
            Watchers: 4

            Dates

              Created:
              Updated: