Details
Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Won't Do
Description
Unlike TensorFlow, PyTorch does not support HDFS as a data source.
I found this related issue, but it contains nothing meaningful: https://github.com/pytorch/pytorch/issues/5867
I think we should make PyTorch support HDFS by contributing to PyTorch's source code.
For reference, here is the TensorFlow implementation of the HDFS connector: https://github.com/tensorflow/tensorflow/tree/17e49b339b2b9a58ed967c69b7acb714dcd9b465/tensorflow/core/platform/hadoop
Any other ideas for approaching this problem are welcome!
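
One possible alternative, offered only as a rough sketch and not as what this ticket proposes: rather than changing PyTorch's source, wrap file access in a map-style dataset (an object with `__len__` and `__getitem__`) at the Python level. The class name `RemoteFileDataset`, the `open_fn` parameter, and the in-memory `_FAKE_STORE` below are all hypothetical names made up for illustration; the fake store only stands in for an HDFS cluster so the snippet runs anywhere.

```python
import io
from typing import BinaryIO, Callable, List


class RemoteFileDataset:
    """Map-style dataset that yields the raw bytes of one file per index.

    Hypothetical helper: `open_fn` is any callable that takes a path and
    returns a readable binary stream usable as a context manager.
    """

    def __init__(self, paths: List[str], open_fn: Callable[[str], BinaryIO]):
        self.paths = list(paths)
        self.open_fn = open_fn

    def __len__(self) -> int:
        return len(self.paths)

    def __getitem__(self, index: int) -> bytes:
        # Open lazily so each sample is fetched only when requested.
        with self.open_fn(self.paths[index]) as stream:
            return stream.read()


# In-memory stand-in for a remote filesystem (assumption, for demonstration only).
_FAKE_STORE = {"/data/a.bin": b"alpha", "/data/b.bin": b"beta"}


def fake_open(path: str) -> BinaryIO:
    """Return a binary stream over the fake store's content for `path`."""
    return io.BytesIO(_FAKE_STORE[path])


dataset = RemoteFileDataset(sorted(_FAKE_STORE), fake_open)
print(len(dataset), dataset[0])
```

With a real cluster, `open_fn` could presumably be something like the `open_input_stream` method of a `pyarrow.fs.HadoopFileSystem` instance, and the dataset could then be handed to `torch.utils.data.DataLoader` after decoding the bytes into tensors. I have not tested that combination here, so treat it as an assumption.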