When testing on our local dev environments, our HBase tests rely upon the fact that we presplit the data by running ./testdata/bin/split-hbase.sh during the data load phase. As-is, this script doesn't work against a remote cluster, and without it, remote cluster tests involving hbase will fail.
To make hbase tests run against a remote cluster, we need to either:
- Not rely on pre-splitting (although this has the problem of probably losing coverage, e.g. for testing HBase region pruning)
- Come up with a more reliable split method, or somehow synthesize the HBase metadata