Uploaded image for project: 'MRQL'
  1. MRQL
  2. MRQL-92

Use outer-joins for incremental queries in Spark streaming mode

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 0.9.8
    • None
    • Run-Time/Spark, Streaming
    • None

    Description

      Currently, incremental queries use Spark's coGroup to merge the current state with the results of processing the new data in the stream. With this patch, the merge is done with a special outer join that doesn't shuffle the state again (it only shuffles the results from the new data).

      Attachments

        Issue Links

          Activity

            People

              fegaras Leonidas Fegaras
              fegaras Leonidas Fegaras
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: