[SPARK-12693] OffsetOutOfRangeException caused by retention - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Minor
Resolution: Not A Problem
Affects Version/s: 1.6.0
Fix Version/s: None
Component/s: DStreams
Labels:
- kafka
Environment:

Ubuntu 64bit, Intel i7

Description

I am running a Kafka server locally with an extremely low retention of 3 seconds and 1-second log segments. I create a direct Kafka stream with auto.offset.reset = smallest.

With bad luck (which actually happens quite often in my case), the smallest offset retrieved during stream initialization no longer exists by the time streaming actually starts.

Complete source code of the Spark Streaming application is here:
https://github.com/pygmalios/spark-checkpoint-experience/blob/cb27ab83b7a29e619386b56e68a755d7bd73fc46/src/main/scala/com/pygmalios/sparkCheckpointExperience/spark/SparkApp.scala

The application ends up in an endless loop trying to fetch that non-existent offset and has to be killed. See the attached logs from Spark and from the Kafka server.
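
The race described above can be illustrated with a small toy model. This is only a sketch: `ToyLog`, `OffsetOutOfRange`, and `simulate` are hypothetical names invented for illustration, not Spark or Kafka APIs, and the numbers are made up. It shows the start offset being resolved once, retention deleting records before the first fetch, repeated fetches of the same offset failing, and a fetch succeeding only after the earliest offset is looked up again (which skips the expired records).

```python
from dataclasses import dataclass


class OffsetOutOfRange(Exception):
    """Toy stand-in for the broker's out-of-range error (hypothetical)."""


@dataclass
class ToyLog:
    """Toy model of one partition; NOT the real Kafka broker API."""
    earliest: int = 0  # first offset still retained
    latest: int = 0    # next offset to be written

    def append(self, n: int) -> None:
        # Producer writes n new records.
        self.latest += n

    def expire(self, n: int) -> None:
        # Retention deletes the n oldest records.
        self.earliest = min(self.earliest + n, self.latest)

    def fetch(self, offset: int) -> list:
        # A fetch is valid only inside the retained range.
        if offset < self.earliest or offset > self.latest:
            raise OffsetOutOfRange(
                f"offset {offset} not in [{self.earliest}, {self.latest}]"
            )
        return list(range(offset, self.latest))


def simulate():
    log = ToyLog()
    log.append(10)

    # "smallest" is resolved once, at stream initialization.
    start = log.earliest

    # Retention runs before the first batch is fetched.
    log.expire(4)
    log.append(5)

    # Retrying the same stale offset can never succeed.
    failures = 0
    for _ in range(3):
        try:
            log.fetch(start)
        except OffsetOutOfRange:
            failures += 1

    # Assumed workaround: look up the earliest offset again.
    # Records that expired in the meantime are silently skipped.
    restart = max(start, log.earliest)
    records = log.fetch(restart)
    return failures, restart, records
```

With retention this short, any window between resolving the offset and fetching from it is enough for the resolved offset to expire, which is consistent with the failure being frequent in the setup reported here.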

Attachments

- kafka-log.txt (4 kB), 07/Jan/16 13:55, Rado Buransky
- log.txt (9 kB), 07/Jan/16 13:51, Rado Buransky

People

Assignee: Unassigned

Reporter: Rado Buransky

Votes: 0

Watchers: 2

Dates

Created: 07/Jan/16 13:49

Updated: 08/Jan/16 18:41

Resolved: 08/Jan/16 15:06