Details
-
Improvement
-
Status: Resolved
-
P3
-
Resolution: Fixed
-
None
-
None
Description
For the moment, there is only HadoopInputFormatIO in Beam. To provide a support of different writing IOs, that are not yet natively supported in Beam (for example, Apache Orc or HBase bulk load), it would make sense to add HadoopOutputFormatIO as well. It will incorporate support of batching and streaming processing.
After, HadoopInputFormatIO and HadoopOutputFormatIO should be merged into one module, called HadoopFormatIO. Old HadoopInputFormatIO should become deprecated.
Attachments
1.
|
Add batching support for HadoopOutputFormatIO | Resolved | Alexey Romanenko |
|
||||||||
2.
|
Add streaming support for HadoopOutputFormatIO | Resolved | David Hrbacek |
|
||||||||
3.
|
Deprecate HadoopInputFormatIO | Resolved | Alexey Romanenko |
|
||||||||
4.
|
Add HadoopFormatIO ITs to Jenkins | Resolved | Alexey Romanenko | |||||||||
5.
|
Update website documentation regarding using new HadoopFormatIO | Resolved | Alexey Romanenko |
|