Beam / BEAM-371

Backport HDFS IO enhancements from Scio

Details

    Description

      The beam-sdks-java-io-hdfs module currently implements only HDFSFileSource, which has a known issue with reading Avro files:
      https://github.com/GoogleCloudPlatform/DataflowJavaSDK/issues/102

      At Spotify we have implemented HDFS sinks, a specialized source and sink for Avro, and simple authentication, and we would like to port them back to Beam:

      https://github.com/apache/incubator-beam/pull/485

            People

              sinisa_lyh Neville Li