Hi Rahul! I meant "git format-patch --find-copies-harder master". I tried that locally and found the problem: Our shakes.txt has CRLF line endings while the one you add has LF line endings (which is correct). Without --find-copies-harder, the patch just adds the file, with --find-copies-harder, the patch makes a copy and then changes line endings on every single line. So that's bad luck, sorry about the confusion!
As for the patch: Right now it doesn't compile for me, and I'd also put everything in crunch-contrib into the Java package "org.apache.crunch.contrib" so it's grouped nicely when we create aggregated Javadoc. Speaking of Javadoc, could you add some, along with a package-info.java file for o.a.c.contrib and o.a.c.contrib.bloomfilter?