  1. HBase
  2. HBASE-2341

Suite of test scripts that a) load a cluster with a verifiable dataset and b) randomly kill regionservers and datanodes in a small cluster

Details

    • Type: Task
    • Status: Closed
    • Priority: Major
    • Resolution: Duplicate

    Description

      We just filed HBASE-2340, but discussion up on IRC has it that we need something more hardcore than pussy-footing inside a single JVM, as that issue does. The point was made (tlipcon) that it's hard to be sure real recovery works if everything runs in the one JVM.

      So, this issue is about scripts that can:

      + load a cluster with a dataset that we can 'verify', meaning we can tell whether it has holes in it, i.e. whether data has been lost
      + kill a random node at random times
      + check the cluster for data loss

      All of the above should work while the cluster is under load.
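
One way to make a dataset 'verifiable' is sketched below, as an illustration only and not the approach taken in the attached VerifiableEditor.java. It assumes rows are keyed by a sequence number and that each value is derived from its key, so a checker can find both holes and corrupted rows without any side channel. The plain dict stands in for the cluster table, and the names `expected_value`, `load` and `verify` are hypothetical.

```python
import hashlib


def expected_value(seq):
    """Derive the value from the key so any row can be checked on its own."""
    return hashlib.sha1(("row-%d" % seq).encode("utf-8")).hexdigest()


def load(table, start, count):
    """Write rows start..start+count-1 and return the next unused sequence number.

    `table` is a plain dict standing in for the real cluster table (assumption).
    """
    for seq in range(start, start + count):
        table[seq] = expected_value(seq)
    return start + count


def verify(table, acked_upto):
    """Return (missing, corrupt) sequence numbers among 0..acked_upto-1."""
    missing, corrupt = [], []
    for seq in range(acked_upto):
        if seq not in table:
            missing.append(seq)       # a hole: an acknowledged edit is gone
        elif table[seq] != expected_value(seq):
            corrupt.append(seq)       # the row exists but holds the wrong value
    return missing, corrupt


# Usage: load 1000 rows, then simulate one lost and one damaged edit.
table = {}
acked = load(table, 0, 1000)
del table[417]
table[12] = "garbage"
missing, corrupt = verify(table, acked)
print("missing:", missing, "corrupt:", corrupt)
```

Because the loader only counts a row as written once the write has been acknowledged, any sequence number below `acked` that the checker cannot find is a genuine lost edit rather than a write that was still in flight.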

      The above would not sit under JUnit.
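
Since this would live outside the unit-test harness, the random-kill piece could be a standalone driver. The sketch below shows one possible shape, assuming a seeded random generator so that a failing run can be replayed with the same kill order. The function names, the node names and the `kill_node` callback are hypothetical; how a regionserver or datanode is actually killed is left to the caller.

```python
import random


def kill_schedule(nodes, kills, min_gap_s, max_gap_s, seed):
    """Build a list of (time_offset_seconds, node) pairs.

    Both the gaps between kills and the victims are random, but the seed
    makes the whole schedule reproducible.
    """
    rng = random.Random(seed)
    schedule = []
    t = 0.0
    for _ in range(kills):
        t += rng.uniform(min_gap_s, max_gap_s)
        schedule.append((t, rng.choice(nodes)))
    return schedule


def run(schedule, kill_node, sleep):
    """Walk the schedule, sleeping between kills.

    `kill_node` and `sleep` are injected so the driver can be dry-run
    without touching a real cluster.
    """
    now = 0.0
    for at, node in schedule:
        sleep(at - now)
        now = at
        kill_node(node)


# Usage: dry run against five made-up node names with no real sleeping.
nodes = ["node1", "node2", "node3", "node4", "node5"]
schedule = kill_schedule(nodes, kills=5, min_gap_s=60, max_gap_s=600, seed=2341)
killed = []
run(schedule, kill_node=killed.append, sleep=lambda seconds: None)
print(killed)
```

In a real run, `sleep` would be `time.sleep` and `kill_node` would do whatever the test environment uses to take a node down.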

      This looks like a suite that we'd want to run up in EC2 using Andrew's scripts and our donated AWS credits.

      16:12 < tlipcon> here's my goal: we have a 5 node cluster in the back room. I want to run hbase on that at near full load for a week straight while some process goes around screwing with it
      16:12 < tlipcon> then I want to verify that I didn't lose a single edit over that week
      

      Attachments

        1. count-slaves.rb
          0.2 kB
          Andrew Kyle Purtell
        2. HBASE-2341-0.20.3.patch
          44 kB
          Karthik Ranganathan
        3. test.sh
          3 kB
          Andrew Kyle Purtell
        4. VerifiableEditor.java
          10 kB
          Todd Lipcon
        5. VerifiableEditor.java
          9 kB
          Todd Lipcon

            People

              Assignee: Unassigned
              Reporter: Michael Stack (stack)
              Votes: 0
              Watchers: 3
