Details
-
New Feature
-
Status: Resolved
-
P2
-
Resolution: Fixed
-
None
-
None
Description
Currently, the text source implementation available for Python SDK [1] is a Dataflow native source which only works efficiently for Dataflow runner. We should add a custom text source on top of custom file-based source framework [2] so that other runner implementations can potentially use the same text source implementation.
Custom text source implementation for Java SDK is at [3].
[1] https://github.com/apache/incubator-beam/blob/python-sdk/sdks/python/apache_beam/io/fileio.py#L70
[2] https://github.com/apache/incubator-beam/blob/python-sdk/sdks/python/apache_beam/io/filebasedsource.py
[3] https://github.com/apache/incubator-beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/io/TextIO.java#L745
Attachments
Issue Links
- links to