Uploaded image for project: 'Apache Storm'
  1. Apache Storm
  2. STORM-2761

JoinBolt.java 's paradigm is new model of stream join?

    XMLWordPrintableJSON

Details

    • Question
    • Status: Resolved
    • Critical
    • Resolution: Not A Problem
    • None
    • None
    • storm-client
    • None

    Description

      Hi, I am a researcher from University of Toronto and I am studying acceleration on stream processing platform. I have a question about the model of window-based stream join used in the JoinBolt.java. From my understanding, when a new tuple arrived, we join this new tuple with all the tuples in the window of the opposite stream. However, in the JoinBolt.java, not only the new tuple, but the tuples in the entire local window will join with the window of the opposite stream. This actually produces a lot of duplicated results, since most of the old tuples in the local window have joined before. I don't know if this is a new paradigm or the storm's team misunderstood the model of stream join. Can someone help me to clarify this question?

      Attachments

        Activity

          People

            Unassigned Unassigned
            panfei Fei Pan
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: