Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-10816 EventTime based sessionization (session window)
  3. SPARK-34889

Introduce MergingSessionsIterator merging elements directly which belong to the same session

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.2.0
    • 3.2.0
    • Structured Streaming
    • None

    Description

      This issue tracks effort on introducing MergingSessionsIterator, which enables to merge elements belong to the same session directly. This would be quite performant compared to UpdatingSessionIterator. Note that MergingSessionsIterator can only apply to the cases aggregation can be applied altogether, so there're still rooms for UpdatingSessionIterator to be used.

      This issue also introduces MergingSessionsExec which is the physical node on leveraging MergingSessionsIterator to sort the input rows and aggregate rows according to the session windows.

      Attachments

        Activity

          People

            kabhwan Jungtaek Lim
            kabhwan Jungtaek Lim
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: