Produce a maven-shaded hadoop-cloudstorage JAR for dowstream use so that
- Hadoop dependency choices don't control their decisions
- Little/No risk of their JAR changes breaking Hadoop bits they depend on
This JAR would pull in the shaded hadoop-client JAR, and the aws-sdk-bundle JAR, neither of which would be unshaded (so yes, upgrading aws-sdks would be a bit risky, but double shading a pre-shaded 30MB JAR is excessive on multiple levels.
Metrics of success: Spark, Tez, Flink etc can pick up and use, and all are happy