We have a use case where YARN applications need to localize resources from Artifactory. Putting the resources on HDFS is not ideal, because we would like to use Artifactory to manage the different versions of those resources.
It would be nice to have something like an HttpFileSystem that implements the Hadoop FileSystem API and reads from an HTTP endpoint.
Note that Samza has already implemented this proposal on its own:
The downside of that approach is that the YARN cluster has to put the Samza jar on the classpath of every NodeManager (NM).
It would be much nicer for Hadoop to have this feature built-in.
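To make the request concrete, here is a rough sketch of the read path such a file system would need. It uses only the JDK, and everything in it is an assumption for illustration: the class name HttpReadSketch, the method open, and the timeout values are hypothetical and are not taken from Hadoop or Samza. A real implementation would extend org.apache.hadoop.fs.FileSystem and adapt this stream to whatever its open() is expected to return.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.HttpURLConnection;
import java.net.URI;

/** Hypothetical helper (not a Hadoop class): read-only access to an HTTP(S) resource. */
final class HttpReadSketch {
    private HttpReadSketch() {}

    /**
     * Opens a stream over the body of the resource at the given URI.
     * The caller is responsible for closing the returned stream.
     */
    static InputStream open(URI uri) throws IOException {
        // Only http and https are handled; anything else is rejected up front.
        String scheme = uri.getScheme();
        if (!"http".equalsIgnoreCase(scheme) && !"https".equalsIgnoreCase(scheme)) {
            throw new IllegalArgumentException("Unsupported scheme: " + scheme);
        }

        HttpURLConnection conn = (HttpURLConnection) uri.toURL().openConnection();
        conn.setRequestMethod("GET");
        conn.setConnectTimeout(10_000); // assumed values, in milliseconds
        conn.setReadTimeout(30_000);

        // Fail fast on anything other than 200 so a missing artifact
        // surfaces as an error instead of an HTML error page in the stream.
        int status = conn.getResponseCode();
        if (status != HttpURLConnection.HTTP_OK) {
            conn.disconnect();
            throw new IOException("GET " + uri + " returned HTTP " + status);
        }
        return conn.getInputStream();
    }
}
```

Since resource localization only needs to read, the write-side operations of such a file system (create, rename, delete, mkdirs) could presumably just throw UnsupportedOperationException.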