Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Incomplete
-
2.0.0, 2.0.1, 2.0.2, 2.1.0, 2.1.1, 2.1.2, 2.1.3, 2.2.0, 2.2.1, 2.2.2, 2.3.0, 2.3.1
-
None
-
Important
Description
A checkpointed DStream is supposed to cut the lineage to its parent(s) such that any persisted RDDs for the parent(s) are removed. However, combined with the issue in SPARK-25302, they result in the Input Stream RDDs being persisted a lot longer than they are actually required.
See also related bug SPARK-25302.