[SPARK-5252] Streaming StatefulNetworkWordCount example hangs

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.2.0
    • Fix Version/s: 1.3.0
    • Component/s: DStreams, Examples, PySpark
    • Labels: None
    • Environment: Ubuntu Linux

    Description

      Running the stateful network word count example in Python (on one local node):
      https://github.com/apache/spark/blob/master/examples/src/main/python/streaming/stateful_network_wordcount.py

      At first, while no data is being streamed, each batch prints an empty status block containing only the current time, e.g.:
      -------------------------------------------
      Time: 2015-01-14 17:58:20
      -------------------------------------------

      -------------------------------------------
      Time: 2015-01-14 17:58:21
      -------------------------------------------

      As soon as I stream some data via netcat, no further status updates appear. Instead, a single progress line is shown:

      [Stage <number>:====================================================> (2 + 0) / 3]

      where <number> is an integer, e.g. 132. After that there is no further output on stdout.
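
For reference, the running per-word count that the example is expected to print after each batch can be sketched without Spark. This is a plain-Python illustration only, not the example's code: `update_func` is modeled on the update function the linked script passes to `updateStateByKey`, and `apply_batch` is a hypothetical helper that stands in for one streaming batch.

```python
# Sketch only: mimics the per-key running-count logic of a stateful word
# count in plain Python, without Spark. `apply_batch` is a hypothetical
# helper introduced for illustration.

def update_func(new_values, last_sum):
    """Add this batch's counts for one word to its previous running total."""
    return sum(new_values) + (last_sum or 0)


def apply_batch(state, lines):
    """Return the new word -> count state after one batch of text lines."""
    # Collect a 1 per occurrence under each word, as a map + group-by-key would.
    grouped = {}
    for line in lines:
        for word in line.split(" "):
            grouped.setdefault(word, []).append(1)
    # Every key seen so far is updated, including keys absent from this batch.
    new_state = {}
    for word in set(state) | set(grouped):
        new_state[word] = update_func(grouped.get(word, []), state.get(word))
    return new_state


state = {}
state = apply_batch(state, ["hello world"])
state = apply_batch(state, ["hello spark"])
print(sorted(state.items()))
```

With a working stream, each batch would be expected to print the Time header followed by counts like these, rather than stopping at the progress line.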

      Attachments

        1. debug.txt (321 kB, Lutz Buech)

          People

            Assignee: Davies Liu (davies)
            Reporter: Lutz Buech (LutzBuech)
