Bigtop
  1. Bigtop
  2. BIGTOP-436

flume-node stop seems to mistarget some other java process on lucid

    Details

    • Type: Bug Bug
    • Status: Open
    • Priority: Critical Critical
    • Resolution: Unresolved
    • Affects Version/s: 0.3.0
    • Fix Version/s: None
    • Component/s: Debian
    • Labels:
      None

      Description

      It seems that the following bit of code from flume-node reliably kills our jenkins slave process on lucid:

        # FLUME-919 will put an end to such extreme violence
        FLUME_PID=`cat $FLUME_PID_FILE`
        if [ -n $FLUME_PID ]; then
          FLUME_PID_GROUP=$(ps -o pgrp -p ${FLUME_PID} h)
      
          if [ -n $FLUME_PID_GROUP ]; then
            kill -TERM -${FLUME_PID_GROUP} &>/dev/null
            sleep 5
            kill -KILL -${FLUME_PID_GROUP} &>/dev/null
      
            rm -f $LOCKFILE $FLUME_PID_FILE
          fi
        fi
        return 0
      

      Here's how it happens:
      http://bigtop01.cloudera.org:8080/view/Bigtop-trunk/job/Bigtop-trunk-packagetest-lucid/label=lucid-slave/1/console

      We need to investigate and possibly fix this.

        Activity

        Hide
        Bruno Mahé added a comment -

        Assigning to you Patrick since you expressed some interest regarding looking into it.
        Have fun

        Show
        Bruno Mahé added a comment - Assigning to you Patrick since you expressed some interest regarding looking into it. Have fun
        Hide
        Roman Shaposhnik added a comment -

        This seems to be only happening in Jenkins env

        Show
        Roman Shaposhnik added a comment - This seems to be only happening in Jenkins env

          People

          • Assignee:
            Patrick Taylor Ramsey
            Reporter:
            Roman Shaposhnik
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:

              Development