SPARK-39957: Delay onDisconnected so that the Driver receives the ExecutorExitCode

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.4.0
    • Fix Version/s: 3.4.0
    • Component/s: Spark Core
    • Labels: None

    Description

      There are two ways to detect executor loss. First, when the RPC connection fails, onDisconnected is triggered. Second, when an executor exits with an ExecutorExitCode, the exit code is passed from the ExecutorRunner to the Driver. These two paths may classify the same failure differently, depending on which one reaches the Driver first. We want to categorize the ExecutorLossReason by ExecutorExitCode, so this PR aims to ensure that the Driver receives the ExecutorExitCode before onDisconnected is called.
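
The race described above can be illustrated with a small, deterministic toy model. This is only a sketch under stated assumptions, not Spark's implementation: `ToyDriver`, `simulate`, `grace_period`, the tick-based clock, and the reason strings are all hypothetical names invented for illustration, and the issue text does not say how the delay is actually implemented. The sketch assumes the Driver postpones its reaction to a disconnect for a fixed number of ticks, so that an exit code arriving within that window decides the loss reason.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

# (tick, kind, executor_id, exit_code); kind is "disconnect" or "exit".
Event = Tuple[int, str, str, Optional[int]]


@dataclass
class ToyDriver:
    """Hypothetical driver model; NOT Spark's actual scheduler code."""
    grace_period: int  # assumed delay (in ticks) before acting on a disconnect
    pending: Dict[str, int] = field(default_factory=dict)  # executor -> deadline
    reasons: Dict[str, str] = field(default_factory=dict)  # executor -> loss reason

    def on_exit_code(self, executor_id: str, code: int) -> None:
        # Detection path 2: the exit code is forwarded to the driver.
        if executor_id in self.reasons:
            return  # already classified; the first conclusion wins
        self.reasons[executor_id] = f"exited with code {code}"
        self.pending.pop(executor_id, None)

    def on_disconnected(self, executor_id: str, now: int) -> None:
        # Detection path 1: the RPC connection to the executor failed.
        if executor_id in self.reasons:
            return
        if self.grace_period == 0:
            self.reasons[executor_id] = "disconnected"
        else:
            # Delay the conclusion so an exit code can still arrive.
            self.pending[executor_id] = now + self.grace_period

    def tick(self, now: int) -> None:
        # Conclude "disconnected" for executors whose grace period expired.
        for executor_id, deadline in list(self.pending.items()):
            if now >= deadline:
                del self.pending[executor_id]
                self.reasons.setdefault(executor_id, "disconnected")


def simulate(grace_period: int, events: List[Event]) -> Dict[str, str]:
    """Replay events on a simulated clock and return the loss reasons."""
    driver = ToyDriver(grace_period=grace_period)
    last_tick = max(t for t, _, _, _ in events) + grace_period
    for now in range(last_tick + 1):
        for t, kind, executor_id, code in events:
            if t != now:
                continue
            if kind == "disconnect":
                driver.on_disconnected(executor_id, now)
            else:
                driver.on_exit_code(executor_id, code)
        driver.tick(now)
    return driver.reasons


# The disconnect is seen at tick 0; the exit code (arbitrary value 1) at tick 2.
events = [(0, "disconnect", "exec-1", None), (2, "exit", "exec-1", 1)]
print(simulate(0, events))  # {'exec-1': 'disconnected'}
print(simulate(3, events))  # {'exec-1': 'exited with code 1'}
```

With no delay, the disconnect wins and the exit code is ignored; with a delay longer than the gap between the two events, the same failure is classified by its exit code. A delay shorter than the gap (for example 1 tick here) still yields "disconnected", which is why the length of the delay matters in this model.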

          People

            Assignee: Kai-Hsun Chen (khchen)
            Reporter: Kai-Hsun Chen (khchen)