Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-3114

Add a pause monitor for impala processes.

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • Impala 2.5.0
    • Impala 2.6.0
    • Backend
    • None

    Description

      In various scenarios we have seen impalad process getting blocked by OS for various reasons like swapping, hardware faults, system hangs etc. Typically the whole process gets stalled for a while and it is difficult to figure that out what happened based on logs. Logging such an event can give us an idea to correlate it with other events happening around that time. For example, we see something like this in the dmesg when the OS blocks it.

      INFO: task impalad:21378 blocked for more than 120 seconds. 
      

      This was originally observed in the hadoop daemons like namenode/datanode and a JVM pause monitor was implemented [1]. JVM had other issues like long GCs too and helped in such cases. Idea here is to implement something for Impala daemons too, to make debugging easier.

      [1] https://github.com/cloudera/hadoop-common/blob/cdh5-2.3.0_5.1.0/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/util/JvmPauseMonitor.java

      Attachments

        Activity

          People

            bharathv Bharath Vissapragada
            bharathv Bharath Vissapragada
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: