Details
Type: Improvement
Status: Resolved
Priority: P3
Resolution: Fixed
Description
At the moment, Beam provides only HadoopInputFormatIO. To support writing to sinks that are not yet natively supported in Beam (for example, Apache ORC or HBase bulk load), it would make sense to add HadoopOutputFormatIO as well. It will support both batch and streaming processing.
Afterwards, HadoopInputFormatIO and HadoopOutputFormatIO should be merged into a single module called HadoopFormatIO, and the old HadoopInputFormatIO should be deprecated.