Uploaded image for project: 'Apache Storm'
  1. Apache Storm
  2. STORM-1964

Unexpected behavior when using count window together with timestamp extraction

    XMLWordPrintableJSON

Details

    Description

      I launched a topology applying a tumbling count window of size 2 (watermark interval 200ms, lag 1s) with the following input (timestamp,value):

      (10,10)
      (10,20)
      (11,30)
      (12,40)
      (12,50)
      (12,60)
      (12,70)
      (13,80)
      (14,90)
      (15,100)
      

      And I got these windows as output:

      [(10,10), (10,20)]
      [(12,60), (12,70)]
      [(12,60), (12,70)]    // why (60, 70) twice?
      [(13,80), (14,90)]
      

      I would expect something like:

      [(10,10), (10,20)]
      [(11,30), (12,40)]
      [(12,50), (12,60)]
      [(12,70), (13,80)]
      [(14,90), (15,100)]
      

      It seems like that timestamp extraction and count windows does not fit each other.

      Attachments

        Issue Links

          Activity

            People

              arunmahadevan Arun Mahadevan
              affo Lorenzo Affetti
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: