Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-7386

Investigate providing some supervisor support for znode deletion

    XMLWordPrintableJSON

    Details

    • Type: Task
    • Status: Patch Available
    • Priority: Blocker
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: 3.0.0
    • Component/s: master, regionserver, scripts
    • Labels:
      None

      Description

      There a couple of JIRAs for deleting the znode on a process failure:
      HBASE-5844 (RS)
      HBASE-5926 (Master)
      which are pretty neat; on process failure, they delete the znode of the underlying process so HBase can recover faster.

      These JIRAs were implemented via the startup scripts; i.e. the script hangs around and waits for the process to exit, then deletes the znode.

      There are a few problems associated with this approach, as listed in the below JIRAs:
      1) Hides startup output in script
      https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463401&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463401
      2) two hbase processes listed per launched daemon
      https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463409&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463409
      3) Not run by a real supervisor
      https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463409&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463409
      4) Weird output after kill -9 actual process in standalone mode
      https://issues.apache.org/jira/browse/HBASE-5926?focusedCommentId=13506801&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13506801
      5) Can kill existing RS if called again
      https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463401&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463401
      6) Hides stdout/stderr[6]
      https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13506832&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13506832

      I suspect running in via something like supervisor.d can solve these issues if we provide the right support.

        Attachments

        1. HBASE-7386-master-01.patch
          42 kB
          Samir Ahmic
        2. HBASE-7386-master-00.patch
          39 kB
          Samir Ahmic
        3. HBASE-7386-conf-v3.patch
          22 kB
          Samir Ahmic
        4. HBASE-7386-bin-v3.patch
          40 kB
          Samir Ahmic
        5. HBASE-7386-conf-v2.patch
          22 kB
          Samir Ahmic
        6. HBASE-7386-bin-v2.patch
          37 kB
          Samir Ahmic
        7. HBASE-7386-src.patch
          2 kB
          Samir Ahmic
        8. HBASE-7386-conf.patch
          22 kB
          Samir Ahmic
        9. HBASE-7386-bin.patch
          36 kB
          Samir Ahmic
        10. supervisordconfigs-v0.patch
          2 kB
          Gregory Chanan
        11. HBASE-7386-v0.patch
          3 kB
          Gregory Chanan

          Issue Links

            Activity

              People

              • Assignee:
                stack stack
                Reporter:
                gchanan Gregory Chanan
              • Votes:
                0 Vote for this issue
                Watchers:
                10 Start watching this issue

                Dates

                • Created:
                  Updated: