Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-29298

Separate block manager heartbeat endpoint from driver endpoint

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.0.0
    • 3.0.0
    • Spark Core
    • None

    Description

      Executor's heartbeat will send synchronously to BlockManagerMaster to let it know that the block manager is still alive. In a heavy cluster, it will timeout and cause block manager re-register unexpected.
      This improvement will separate a heartbeat endpoint from the driver endpoint. In our production environment, this is really helpful to prevent executors from unstable up and down.

      Attachments

        Issue Links

          Activity

            People

              cltlfcjin Lantao Jin
              cltlfcjin Lantao Jin
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: