Noah, I've been doing work in HADOOP-9361 both with trying to specify more
I'd be happy to! One of the biggest things we have run into is figuring out
what the contract is. Inferring it from the behavior of applications has
been our only real resource for that information. We have done some ad-hoc
stuff in which we have adapted HDFS tests to run against Ceph, but it is
pretty ugly and difficult to maintain.
One thing that we have run into recently with a user is diagnosing some
write performance problems. HDFS is performing well, and we think small
writes might be a culprit if Ceph isn't doing write buffering.
Understanding what that contract is, especially in terms of write safety, is
another area, slightly different from just unit tests, that we are interested in.
Ahh, that would be cool. Certainly easier to merge changes to that than
accept an entire new file system upstream.
Let me know how I can get started helping out.