Uploaded image for project: 'Apache Storm'
  1. Apache Storm
  2. STORM-1964

Unexpected behavior when using count window together with timestamp extraction

    XMLWordPrintableJSON

    Details

      Description

      I launched a topology applying a tumbling count window of size 2 (watermark interval 200ms, lag 1s) with the following input (timestamp,value):

      (10,10)
      (10,20)
      (11,30)
      (12,40)
      (12,50)
      (12,60)
      (12,70)
      (13,80)
      (14,90)
      (15,100)
      

      And I got these windows as output:

      [(10,10), (10,20)]
      [(12,60), (12,70)]
      [(12,60), (12,70)]    // why (60, 70) twice?
      [(13,80), (14,90)]
      

      I would expect something like:

      [(10,10), (10,20)]
      [(11,30), (12,40)]
      [(12,50), (12,60)]
      [(12,70), (13,80)]
      [(14,90), (15,100)]
      

      It seems like that timestamp extraction and count windows does not fit each other.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                arunmahadevan Arun Mahadevan
                Reporter:
                affo Lorenzo Affetti
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: