Spark / SPARK-49508

Optimize hadoop-aws dependency: aws-java-sdk-bundle jar is too large


Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 4.0.0, 3.5.2
    • Fix Version/s: None
    • Component/s: Build
    • Labels: None

    Description

      The aws-java-sdk-bundle jar is too large: it doubles the size of the Spark image. hadoop-aws only requires aws-java-sdk-s3 and aws-java-sdk-dynamodb, so the bundle can be excluded and those two modules declared directly:

       

      <!-- Pull in hadoop-aws but exclude the full AWS SDK bundle -->
      <dependency>
          <groupId>org.apache.hadoop</groupId>
          <artifactId>hadoop-aws</artifactId>
          <version>${hadoop.version}</version>
          <exclusions>
              <exclusion>
                  <groupId>com.amazonaws</groupId>
                  <artifactId>aws-java-sdk-bundle</artifactId>
              </exclusion>
          </exclusions>
      </dependency>
      <!-- Add back only the two AWS SDK v1 modules that, per this report, hadoop-aws needs -->
      <dependency>
          <groupId>com.amazonaws</groupId>
          <artifactId>aws-java-sdk-s3</artifactId>
          <version>${awssdk.v1.version}</version>
      </dependency>
      <dependency>
          <groupId>com.amazonaws</groupId>
          <artifactId>aws-java-sdk-dynamodb</artifactId>
          <version>${awssdk.v1.version}</version>
      </dependency>
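
      The snippet above references ${awssdk.v1.version} without defining it. A minimal sketch of the missing property, assuming it is declared in the <properties> section of the same pom.xml. The property name is taken from the snippet; the version number is only an illustrative placeholder and should be matched to the AWS SDK v1 release that the chosen hadoop-aws version was built against:

      <properties>
          <!-- Assumption: placeholder value, not taken from this issue;
               align it with the SDK version your hadoop-aws release expects -->
          <awssdk.v1.version>1.12.262</awssdk.v1.version>
      </properties>

      Keeping both SDK modules on a single shared property should help avoid mixing different AWS SDK v1 versions on the classpath.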


          People

            Assignee: Unassigned
            Reporter: melin
            Votes: 0
            Watchers: 3
