  1. Apache Whirr (retired)
  2. WHIRR-268

Whirr hangs when '$HOME/.ssh/known_hosts' contains an obsolete host key for a node's IP address.

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.3.0
    • Fix Version/s: 0.5.0
    • Component/s: core
    • Labels: None

    Description

      My properties file is:

      $ cat cluster.properties 
      whirr.cluster-name=mycluster
      whirr.instance-templates=1 jt+nn,10 dn+tt
      whirr.provider=ec2
      whirr.identity=XXXXXXXXXXXXXXXXXXXX
      whirr.credential=XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
      whirr.private-key-file=${sys:user.home}/.ssh/id_rsa
      whirr.public-key-file=${sys:user.home}/.ssh/id_rsa.pub
      whirr.location-id=us-east-1d
      #whirr.hardware-id=m1.small
      whirr.hardware-id=c1.medium
      whirr.service-name=hadoop
      # for m1.small
      #whirr.image-id=us-east-1/ami-2caa5845
      whirr.image-id=us-east-1/ami-7000f019
      
      $ whirr/bin/whirr launch-cluster --config cluster.properties
      Bootstrapping cluster
      Configuring template
      Starting 10 node(s) with roles [tt, dn]
      Configuring template
      Starting 1 node(s) with roles [jt, nn]
      Nodes started: [[id=us-east-1/i-ba63a7d5, providerId=i-ba63a7d5, tag=mycluster, name=null, 
      location=[id=us-east-1a, scope=ZONE, description=us-east-1a, parent=us-east-1], uri=null, 
      imageId=us-east-1/ami-7000f019, os=[name=null, family=ubuntu, version=10.04, arch=paravirtual, 
      is64Bit=false, description=ubuntu-images-us/ubuntu-lucid-10.04-i386-server-
      20110201.1.manifest.xml], userMetadata={}, state=RUNNING, privateAddresses=[10.245.106.99], 
      publicAddresses=[184.72.166.132], hardware=[id=c1.medium, providerId=c1.medium, name=c1.medium, 
      processors=[[cores=2.0, speed=2.5]], ram=1740, volumes=[[id=null, type=LOCAL, size=10.0, 
      device=/dev/sda1, durable=false, isBootDevice=true], [id=null, type=LOCAL, size=340.0, 
      device=/dev/sda2, durable=false, isBootDevice=false]], supportsImage=Not(is64Bit())]]]
      

      As the output above shows, Whirr started a node whose private IP address is 10.245.106.99, but it then hangs and never starts the Hadoop service. I therefore tried to log in to 10.245.106.99 over ssh:

      hadoop@domU-12-31-39-00-A5-21:~$ ssh ubuntu@10.245.106.99
      @@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
      @    WARNING: REMOTE HOST IDENTIFICATION HAS CHANGED!     @
      @@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
      IT IS POSSIBLE THAT SOMEONE IS DOING SOMETHING NASTY!
      Someone could be eavesdropping on you right now (man-in-the-middle attack)!
      It is also possible that the RSA host key has just been changed.
      The fingerprint for the RSA key sent by the remote host is
      b1:62:ad:fd:3f:a7:29:df:7f:0c:91:ca:ed:66:8e:3a.
      Please contact your system administrator.
      Add correct host key in /home/hadoop/.ssh/known_hosts to get rid of this message.
      Offending key in /home/hadoop/.ssh/known_hosts:8
      RSA host key for 10.245.106.99 has changed and you have requested strict checking.
      Host key verification failed.
      

      I suspect that Whirr hangs because '$HOME/.ssh/known_hosts' contains an obsolete entry for this IP address. This is probably rare, but anyone who launches many EC2 instances will eventually run into it, because private IP addresses are reused across instances. Whirr should not hang when it encounters an obsolete ssh host key.
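
      As a workaround until Whirr handles this itself, the stale entry can be removed by hand. With OpenSSH the usual way is `ssh-keygen -R 10.245.106.99`. The Python sketch below illustrates the same idea for anyone who wants to script the cleanup. It is not part of Whirr or of either attached patch; the function name `drop_host_entries` and the sample key data are hypothetical, and it assumes plain (unhashed) known_hosts entries.

```python
def drop_host_entries(known_hosts_text, host):
    """Return (new_text, removed_line_numbers) with entries for `host` removed.

    Assumption: only plain (unhashed) entries are matched. Hashed entries,
    which start with "|1|", never equal `host` and are left untouched.
    """
    kept, removed = [], []
    for lineno, line in enumerate(known_hosts_text.splitlines(), start=1):
        fields = line.split()
        # Blank lines and comments are preserved as-is.
        if not fields or fields[0].startswith("#"):
            kept.append(line)
            continue
        # An optional marker such as @cert-authority precedes the host list.
        if fields[0].startswith("@") and len(fields) > 1:
            host_field = fields[1]
        else:
            host_field = fields[0]
        # The host list is comma-separated; a match drops the whole line,
        # including any other names listed on it.
        if host in host_field.split(","):
            removed.append(lineno)
        else:
            kept.append(line)
    new_text = "\n".join(kept) + ("\n" if kept else "")
    return new_text, removed


# Hypothetical file contents; the key blobs are placeholders, not real keys.
sample = (
    "# build hosts\n"
    "10.245.106.98 ssh-rsa AAAAB3NzaOLD1\n"
    "example-host,10.245.106.99 ssh-rsa AAAAB3NzaOLD2\n"
)
new_text, removed = drop_host_entries(sample, "10.245.106.99")
print(removed)  # [3]
```

      The returned line numbers correspond to the "Offending key in ... known_hosts:N" line that ssh prints, so they can be checked against that message before the file is overwritten.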

      Attachments

        1. WHIRR-268.patch
          0.7 kB
          Andrei Savu
        2. WHIRR-268.patch
          3 kB
          Adrian Cole

          People

            Assignee: Andrei Savu (savu.andrei)
            Reporter: Hyunsik Choi (hyunsik.choi)