Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-18045

Add ' -o ConnectTimeout=10' to the ssh command we use in ITBLL chaos monkeys

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Trivial
    • Resolution: Fixed
    • None
    • 3.0.0-alpha-4
    • integration tests
    • None
    • Reviewed

    Description

      Monkeys hang on me in long running tests. I've not spent too much time on it since it rare enough but I just went through a spate of them. When monkey kill ssh hangs, all killing stops which can give a false sense of victory when you wake up in the morning and your job 'passed'. I also see monkeys kill all servers in a cluster and fail to bring them back which causes job fail as no one is serving data. The latter may actually be another issue but for the former, I've had some success adding -o ConnectTimeout=10 as an option on ssh. You can do it easily enough via config but this issue is to suggest that we add it in code.

      Here is how you add it via config if interested:

      <property >
      <name>hbase.it.clustermanager.ssh.opts</name>
      <value> -o ConnectTimeout=10 </value>
      </property >

      Attachments

        Issue Links

          Activity

            People

              mlnsharma Narasimha Sharma
              stack Michael Stack
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: