Details
-
New Feature
-
Status: Resolved
-
P2
-
Resolution: Implemented
-
None
-
None
Description
Currently Java SDK has HDFS support but Python SDK does not. With current portability efforts other runners may soon be able to use Python SDK. Having HDFS support will allow these runners to execute large scale jobs without using GCS.
Following suggests some libraries that can be used to connect to HDFS from Python.
http://wesmckinney.com/blog/python-hdfs-interfaces/
Attachments
Issue Links
- is blocked by
-
BEAM-3600 Do not ignore FileSystem errors and document expected behavior
- Resolved
- links to
(1 links to)