Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-5310

Add support of HadoopOutputFormatIO

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.10.0
    • Component/s: io-java-hadoop
    • Labels:
      None

      Description

      For the moment, there is only HadoopInputFormatIO in Beam. To provide a support of different writing IOs, that are not yet natively supported in Beam (for example, Apache Orc or HBase bulk load), it would make sense to add HadoopOutputFormatIO as well. It will incorporate support of batching and streaming processing.

      After, HadoopInputFormatIO and HadoopOutputFormatIO should be merged into one module, called HadoopFormatIO. Old HadoopInputFormatIO should become deprecated.

        Attachments

          Activity

            People

            • Assignee:
              aromanenko Alexey Romanenko
              Reporter:
              aromanenko Alexey Romanenko
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 25h 40m
                25h 40m