Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Description
After speaking with wesmckinn, I think it would be really helpful to have Kerberos support in our HDFS logic. This should be straightforward: I would just need to switch the shim to connect through hdfsBuilderConnect().
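To make the proposed switch concrete, here is a rough sketch of the builder call sequence, driven from Python via ctypes purely for illustration; the actual change would live in the C shim. The helper names declare_builder_api and connect_with_kerberos are hypothetical, and the libhdfs function names and signatures are taken from hdfs.h as I understand it, so please check them against the header in use.

```python
import ctypes


def declare_builder_api(lib):
    """Declare ctypes signatures for the libhdfs builder functions used below."""
    charp, voidp = ctypes.c_char_p, ctypes.c_void_p
    signatures = {
        "hdfsNewBuilder": (voidp, []),
        "hdfsBuilderSetNameNode": (None, [voidp, charp]),
        "hdfsBuilderSetNameNodePort": (None, [voidp, ctypes.c_uint16]),
        "hdfsBuilderSetUserName": (None, [voidp, charp]),
        "hdfsBuilderSetKerbTicketCachePath": (None, [voidp, charp]),
        "hdfsBuilderConnect": (voidp, [voidp]),
    }
    for name, (restype, argtypes) in signatures.items():
        func = getattr(lib, name)
        func.restype = restype
        func.argtypes = argtypes
    return lib


def connect_with_kerberos(lib, host, port, user, ticket_cache):
    """Connect through hdfsBuilderConnect(); return the hdfsFS handle or None."""
    # Keep the encoded strings in locals until the connect call returns.
    # Assumption: the builder stores the pointers it is given instead of
    # copying the strings, so they must stay alive that long.
    host_b = host.encode()
    user_b = user.encode()
    cache_b = ticket_cache.encode()

    builder = lib.hdfsNewBuilder()
    if not builder:
        return None
    lib.hdfsBuilderSetNameNode(builder, host_b)
    lib.hdfsBuilderSetNameNodePort(builder, port)
    lib.hdfsBuilderSetUserName(builder, user_b)
    lib.hdfsBuilderSetKerbTicketCachePath(builder, cache_b)
    # Per hdfs.h, hdfsBuilderConnect() frees the builder whether or not the
    # connection succeeds, so there is no separate hdfsFreeBuilder() call here.
    fs = lib.hdfsBuilderConnect(builder)
    return fs or None
```

With a real installation this would be called roughly as `lib = declare_builder_api(ctypes.CDLL("libhdfs.so"))` followed by `connect_with_kerberos(lib, "namenode", 8020, "alice", "/tmp/krb5cc_1000")`. The host, user, and ticket cache path are placeholders, and the call assumes libhdfs.so is on the loader path with a JVM and the Hadoop CLASSPATH available.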
On a side note, is there a reason we aren't using Pivotal's libhdfs3? It speaks the HDFS RPC protocol natively rather than calling into Java through JNI.
https://github.com/Pivotal-Data-Attic/pivotalrd-libhdfs3
Dask has Python wrappers for this.