Hadoop Common / HADOOP-13681

Reduce Kafka dependencies in hadoop-kafka module

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.0.0-alpha1
    • Fix Version/s: 3.0.0-alpha2
    • Component/s: metrics
    • Labels: None
    • Hadoop Flags: Incompatible change
    • Release Note: Changed the Apache Kafka dependency from kafka_2.10 to kafka-clients in the hadoop-kafka module.

    Description

      The newly added Kafka module defines the Kafka dependency as:

      ...
      <groupId>org.apache.kafka</groupId>
      <artifactId>kafka_2.10</artifactId>
      <version>${kafka.version}</version>
      ...
      

      This is unfavorable because it uses the server dependency, which pulls in the client jars only transitively. The server dependency includes all of the server code and some larger transitive dependencies such as Scala and ZooKeeper.

      Instead, the pom file should depend only on the clients jar, which has a much smaller footprint:

      ...
      <groupId>org.apache.kafka</groupId>
      <artifactId>kafka-clients</artifactId>
      <version>${kafka.version}</version>
      ...
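
      For reference, a minimal sketch of how the complete declaration might look. It assumes the elided lines are only the standard Maven <dependency> wrapper; the actual patch may differ (for example, it could also set a scope or exclusions).

      ```xml
      <!-- Sketch only: the <dependency> wrapper is standard Maven syntax;
           the snippet above elides its surrounding lines. -->
      <dependency>
        <groupId>org.apache.kafka</groupId>
        <artifactId>kafka-clients</artifactId>
        <!-- kafka.version is the property already referenced in the snippet above -->
        <version>${kafka.version}</version>
      </dependency>
      ```

      Running mvn dependency:tree in the module directory before and after the change is one way to check that Scala and ZooKeeper no longer appear among the transitive dependencies.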
      

      Attachments

        1. HADOOP-13681-1.patch
          0.6 kB
          Grant Henke

            People

              Assignee: Grant Henke (granthenke)
              Reporter: Grant Henke (granthenke)