HBASE-1961: HBase EC2 scripts

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Not A Problem
    • Environment: Amazon AWS EC2

    Description

      The attached tarball is a clone of the Hadoop EC2 scripts, modified significantly to start up an HBase storage-only cluster on top of HDFS backed by instance storage.

      Tested with the HBase 0.20 branch; it should also work with trunk. Only the AMI creation and cluster launch scripts have been tested, and they bring up a functioning HBase cluster.

      Run "create-hbase-image c1.xlarge" to create an x86_64 AMI, or "create-hbase-image c1.medium" to create an i386 AMI. Public Hadoop/HBase 0.20.1 AMIs are available:
      i386: ami-c644a7af
      x86_64: ami-f244a79b
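
      The mapping above, from instance type to AMI architecture to public AMI, can be sketched as a pair of shell helpers. This is only an illustration: the function names are hypothetical and are not taken from the attached scripts, and create-hbase-image may derive the architecture differently. Only the two instance types named in this description are covered.

```shell
# Hypothetical helper (not from the attached tarball): map an EC2 instance
# type to the AMI architecture, per the description above.
arch_for_instance_type() {
  case "$1" in
    c1.medium) echo "i386" ;;
    c1.xlarge) echo "x86_64" ;;
    *) echo "unknown instance type: $1" >&2; return 1 ;;
  esac
}

# Hypothetical helper: look up the public Hadoop/HBase 0.20.1 AMI listed
# in this issue for a given architecture.
public_ami_for_arch() {
  case "$1" in
    i386) echo "ami-c644a7af" ;;
    x86_64) echo "ami-f244a79b" ;;
    *) echo "no public AMI listed for arch: $1" >&2; return 1 ;;
  esac
}
```

      For instance, public_ami_for_arch "$(arch_for_instance_type c1.xlarge)" yields the x86_64 AMI ID listed above.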

      launch-hbase-cluster brings up the cluster in three stages. First, a small dedicated ZK quorum of configurable size (default 3). Then, the master instance, which runs the DFS namenode (formatted on first boot), one datanode, and the HBase master. Finally, a configurable number of slaves, each an instance running a DFS datanode and an HBase region server. For example:

          launch-hbase-cluster testcluster 100 5
      

      would bring up a cluster with 100 slaves supported by a 5-node ZK ensemble.
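
      The launch order can be illustrated with a dry-run sketch that only prints the stages instead of calling EC2. The function name plan_launch is hypothetical and the argument order (cluster name, slave count, ZK node count) is inferred from the example above; the real launch-hbase-cluster script may be structured differently.

```shell
# Dry-run sketch only: prints the launch stages, makes no EC2 calls.
# plan_launch is a hypothetical name, not a function from the attached scripts.
plan_launch() {
  cluster="$1"
  slaves="$2"
  zk_nodes="${3:-3}"   # the description gives 3 as the default quorum size

  if [ -z "$cluster" ] || [ -z "$slaves" ]; then
    echo "usage: plan_launch <cluster> <num-slaves> [num-zk-nodes]" >&2
    return 1
  fi

  # Stage order follows the description: ZK quorum, then master, then slaves.
  echo "1. $cluster: start $zk_nodes ZooKeeper node(s)"
  echo "2. $cluster: start master (DFS namenode, one datanode, HBase master)"
  echo "3. $cluster: start $slaves slave(s) (DFS datanode, HBase region server)"
}
```

      Calling plan_launch testcluster 100 5 prints the three stages for the example above; omitting the last argument falls back to a 3-node quorum.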

      We must colocate a datanode with the namenode because the HBase master currently won't tolerate a brand new DFS that has only a namenode and no datanodes up yet; see HBASE-1960. By default the launch scripts provision ZooKeeper nodes as c1.medium and the HBase master and region servers as c1.xlarge. The result is an HBase cluster supported by a ZooKeeper ensemble. ZK ensembles are not dynamic, but an HBase cluster can be grown simply by starting more slaves, just as with Hadoop.
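
      One way to express these per-role defaults in shell is with overridable environment variables. The variable and function names below are assumptions made for illustration; the attached scripts may name and organize these settings differently.

```shell
# Hypothetical variable names; each default matches the instance type
# stated in the description and can be overridden from the environment.
ZK_INSTANCE_TYPE="${ZK_INSTANCE_TYPE:-c1.medium}"
MASTER_INSTANCE_TYPE="${MASTER_INSTANCE_TYPE:-c1.xlarge}"
SLAVE_INSTANCE_TYPE="${SLAVE_INSTANCE_TYPE:-c1.xlarge}"

# Hypothetical helper: print the instance type used for a cluster role.
instance_type_for_role() {
  case "$1" in
    zookeeper) echo "$ZK_INSTANCE_TYPE" ;;
    master)    echo "$MASTER_INSTANCE_TYPE" ;;
    slave)     echo "$SLAVE_INSTANCE_TYPE" ;;
    *) echo "unknown role: $1" >&2; return 1 ;;
  esac
}
```

      With nothing set in the environment, instance_type_for_role zookeeper prints c1.medium, while the master and slave roles print c1.xlarge.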

      hbase-ec2-init-remote.sh can be trivially edited to bring up a jobtracker on the master node and task trackers on the slaves.
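
      As a sketch of what such an edit amounts to, the helper below lists the daemons each role would run, with an optional switch that adds the MapReduce daemons. It does not reproduce hbase-ec2-init-remote.sh: the function name, the role names, and the yes/no switch are all hypothetical.

```shell
# Hypothetical helper: list the daemons for a role. The second argument
# ("yes" or "no", default "no") is an assumed switch for MapReduce.
daemons_for_role() {
  role="$1"
  mapreduce="${2:-no}"
  case "$role" in
    zookeeper)
      daemons="zookeeper" ;;
    master)
      # The master colocates a datanode with the namenode (see HBASE-1960).
      daemons="namenode datanode hbase-master"
      if [ "$mapreduce" = "yes" ]; then daemons="$daemons jobtracker"; fi ;;
    slave)
      daemons="datanode hbase-regionserver"
      if [ "$mapreduce" = "yes" ]; then daemons="$daemons tasktracker"; fi ;;
    *)
      echo "unknown role: $role" >&2; return 1 ;;
  esac
  echo "$daemons"
}
```

      For example, daemons_for_role slave yes adds tasktracker to the storage-only slave daemons.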

      Sub-Tasks

        1. [EC2] Automate the configuration and launch of Stargate and Thrift (Sub-task; Closed; Unassigned)
        2. [EC2] document how to set up elastic load balancing across Stargate instances (Sub-task; Closed; Unassigned)
        3. [EC2] On demand monitoring and auto scaling (Sub-task; Closed; Unassigned)
        4. [EC2] EBS support (Sub-task; Closed; Andrew Kyle Purtell)
        5. [EC2] Add support for running a mapreduce job automatically once the cluster is up (Sub-task; Closed; Andrew Kyle Purtell)
        6. [EC2] Support optionally launching as spot instances (Sub-task; Closed; Unassigned)
        7. [EC2] Build a HA cluster (Sub-task; Closed; Unassigned)
        8. [EC2] Wiki page (Sub-task; Closed; Unassigned)
        9. [EC2] Periodic automatic backup to S3 (Sub-task; Closed; Andrew Kyle Purtell)
        10. [EC2] Support multivolume local instance storage (Sub-task; Closed; Andrew Kyle Purtell)
        11. [EC2] handle potentially large and uneven instance startup times (Sub-task; Closed; Andrew Kyle Purtell)
        12. [EC2] Add support for installation of user packages when instances start up (Sub-task; Closed; Andrew Kyle Purtell)
        13. [EC2] Prompt for and remember credentials if not configured (Sub-task; Closed; Andrew Kyle Purtell)
        14. [EC2] support mapreduce subsystem (Sub-task; Closed; Andrew Kyle Purtell)
        15. [EC2] LZO support (Sub-task; Closed; Andrew Kyle Purtell)
        16. Set up convenient ssh between instances (Sub-task; Closed; Andrew Kyle Purtell)
        17. [EC2] HDFS DataNode no longer required on master (Sub-task; Closed; Andrew Kyle Purtell)
        18. [EC2] JAVA_HOME handling broken (Sub-task; Closed; Andrew Kyle Purtell)
        19. [EC2] Adjust fs.file-max (Sub-task; Closed; Andrew Kyle Purtell)
        20. [EC2] pull version from build (Sub-task; Closed; Andrew Kyle Purtell)
        21. [EC2] Mount data volumes as xfs, noatime (Sub-task; Closed; Andrew Kyle Purtell)
        22. [EC2] Rebuild AMIs with jdk6u18 (Sub-task; Closed; Andrew Kyle Purtell)
        23. [EC2] Downgrade JDK to 6u17 and rebuild AMIs (Sub-task; Closed; Andrew Kyle Purtell)
        24. [EC2] mapreduce fixups for PE (Sub-task; Closed; Andrew Kyle Purtell)
        25. [EC2] Allocate elastic IP addresses for ZK and master nodes (Sub-task; Closed; Andrew Kyle Purtell)
        26. [EC2] ec2-describe-instances returning account number instead of bucket name (Sub-task; Closed; Andrew Kyle Purtell)

          People

            Assignee: Andrew Kyle Purtell
            Reporter: Andrew Kyle Purtell
            Votes: 1
            Watchers: 7
