Spark / SPARK-23397

Scheduling delay causes Spark Streaming to miss batches.

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Not A Problem
    • Affects Version/s: 2.2.1
    • Fix Version/s: None
    • Component/s: DStreams
    • Labels: None

    Description

      • In complex Spark (Scala) DStream-based applications that create many jobs per batch (for example, 40), batches are not created at their scheduled times. For example, if I start a Spark Streaming application with a batch interval of 20 seconds and the application creates about 40 jobs per batch, the next batch is not created 20 seconds after the previous batch's job creation time.
      • This happens because job creation is single-threaded: if the job creation delay is greater than the batch interval, batch execution misses its schedule.
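
      The reported behaviour can be illustrated with a minimal sketch under a simplified model. This is not Spark's scheduler code: it only assumes that one thread generates the jobs for each batch in order, and that generation takes a fixed time per batch. The names BatchDelaySketch, schedulingDelays, batchIntervalMs and jobCreationMs are hypothetical and chosen for illustration.

```scala
object BatchDelaySketch {

  /** Delay (ms) between each batch's scheduled time and the moment a single
    * generator thread can actually start creating its jobs.
    * Assumption: batch k is due at k * batchIntervalMs, and creating the jobs
    * for one batch always takes jobCreationMs.
    */
  def schedulingDelays(batchIntervalMs: Long, jobCreationMs: Long, batches: Int): Seq[Long] = {
    var generatorFreeAt = 0L // time at which the single generator thread is next idle
    (1 to batches).map { k =>
      val dueAt = k * batchIntervalMs
      // A batch cannot start before it is due, nor before the previous one finishes.
      val startAt = math.max(dueAt, generatorFreeAt)
      generatorFreeAt = startAt + jobCreationMs
      startAt - dueAt
    }
  }

  def main(args: Array[String]): Unit = {
    // 20 s batch interval, 25 s to create the jobs: the delay grows every batch.
    println(schedulingDelays(20000L, 25000L, 4))
    // 20 s batch interval, 5 s to create the jobs: batches stay on schedule.
    println(schedulingDelays(20000L, 5000L, 4))
  }
}
```

      In this model, with a 20-second interval and 25 seconds of job creation, the first four batches start 0, 5, 10 and 15 seconds late. When job creation takes less than the interval, every delay is 0.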

          People

            Assignee: Unassigned
            Reporter: Shahbaz Hussain (mhussain@informatica.com)
            Votes: 0
            Watchers: 2
