Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-9086

Enforce process singleton rules through an exclusive write lock on a file, not a pid file +kill -0,

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • 1.1.1, 2.0.3-alpha
    • None
    • scripts, util
    • None
    • Unix/Linux.

    Description

      the hadoop-daemon.sh script (and other liveness monitors) probe the existence of a daemon service by a kill -0 of a process id picked up from a pid file.
      This is flawed

      1. pid file locations may change with installations.
      2. Linux and Unix recycle pids, leading to false positives -the scripts think the process is running, when another process is.
      3. doesn't work on windows.

      Having the processes acquire an exclusive write-lock on a known file would delegate lock management and implicitly liveness to the OS itself. when the process dies, the lock is relased (on Unixes)

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              stevel@apache.org Steve Loughran
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: