Details
Description
Running the stateful network word count example in Python (on one local node):
https://github.com/apache/spark/blob/master/examples/src/main/python/streaming/stateful_network_wordcount.py
At the beginning, when no data is streamed, empty status outputs are generated, only decorated by the current Time, e.g.:
-------------------------------------------
Time: 2015-01-14 17:58:20
-------------------------------------------
-------------------------------------------
Time: 2015-01-14 17:58:21
-------------------------------------------
As soon as I stream some data via netcat, no new status updates will show. Instead, one line saying
[Stage <number>:====================================================> (2 + 0) / 3]
where <number> is some integer number, e.g. 132. There is no further output on stdout.