Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-21352

Memory Usage in Spark Streaming

    XMLWordPrintableJSON

Details

    • Important

    Description

      I am trying to figure out the memory used by executors for a Spark Streaming job. For data I am using the rest endpoint for Spark AllExecutors and just summing up the metrics totalDuration * spark.executor.memory for every executor and then emitting the final sum as the memory usage.

      But this is coming out to be very small for application which ran whole day , is something wrong with the logic.Also I am using dynamic allocation and executorIdleTimeout is 5 seconds.

      Also I am also assuming that if some executor was removed for due to idle timeout and then was allocated to some other task then its totalDuration will be increased by the amount of time took by the executor to execute this new task.

      https://stackoverflow.com/questions/44995212/spark-streaming-memory-usage-doubts

      Attachments

        Activity

          People

            Unassigned Unassigned
            shubhamIN29 Shubham Gupta
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: