Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-2500

Add support for S3 as a Apache Beam FileSystem

Details

    • New Feature
    • Status: Resolved
    • P3
    • Resolution: Fixed
    • None
    • 2.3.0
    • io-java-aws
    • None

    Description

      Note that this is for providing direct integration with S3 as an Apache Beam FileSystem.

      There is already support for using the Hadoop S3 connector by depending on the Hadoop File System module[1], configuring HadoopFileSystemOptions[2] with a S3 configuration[3].

      1: https://github.com/apache/beam/tree/master/sdks/java/io/hadoop-file-system
      2: https://github.com/apache/beam/blob/master/sdks/java/io/hadoop-file-system/src/main/java/org/apache/beam/sdk/io/hdfs/HadoopFileSystemOptions.java#L53
      3: https://wiki.apache.org/hadoop/AmazonS3

      Attachments

        1. hadoop_fs_patch.patch
          7 kB
          Guillaume Balaine

        Issue Links

          Activity

            People

              jmarble Jacob Marble
              lcwik Luke Cwik
              Votes:
              3 Vote for this issue
              Watchers:
              12 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 40m
                  40m