Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
2.0.0
-
None
Description
I use kafka 0.10.1 and java code with the following dependencies:
<dependency>
<groupId>org.apache.kafka</groupId>
<artifactId>kafka_2.11</artifactId>
<version>0.10.1.1</version>
</dependency>
<dependency>
<groupId>org.apache.kafka</groupId>
<artifactId>kafka-clients</artifactId>
<version>0.10.1.1</version>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-streaming_2.11</artifactId>
<version>2.0.0</version>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-streaming-kafka-0-10_2.11</artifactId>
<version>2.0.0</version>
</dependency>
The code tries to read the whole topic using:
kafkaParams.put("auto.offset.reset", "earliest");
Using 5 second batches:
jssc = new JavaStreamingContext(conf, Durations.seconds(5));
Each batch returns empty.
I debugged the code I noticed that KafkaUtils.fixKafkaParams is called that overrides "earliest" with "none".
Whether this is related or not, when I used kafka 0.8 on the client with kafka 0.10.1 on the server, I could read the whole topic.
Attachments
Attachments
Issue Links
- is duplicated by
-
SPARK-20037 impossible to set kafka offsets using kafka 0.10 and spark 2.0.0
- Resolved
- links to