Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
3.2.1
Description
We have identified an issue where the RocksDB state store iterator will not pick up store updates made after its creation. As a result of this, the timeoutProcessorIter in
will not pick up state changes made during newDataProcessorIter input processing. The user observed behavior is that a group state may receive input records and also be called with timeout in the same micro batch. This contradics the public documentation for GroupState -
https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/streaming/GroupState.html
- The timeout is reset every time the function is called on a group, that is, when the group has new data, or the group has timed out. So the user has to set the timeout duration every time the function is called, otherwise, there will not be any timeout set.