Hive / HIVE-15860

RemoteSparkJobMonitor may hang when RemoteDriver exits abnormally

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.3.0
    • Component/s: None
    • Labels: None

    Description

      The hang occurs when RemoteDriver crashes between the JobStarted and JobSubmitted messages, e.g. when it is killed with kill -9. RemoteSparkJobMonitor considers the job started, but it cannot get the job info because it has never received the JobId. As a result, the monitor loops forever.
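
The failure mode described above can be sketched as a polling loop that, once the job is reported as started, also checks whether the remote side is still alive. This is only an illustrative sketch, not the code in the attached patch: the class MonitorSketch, the JobState enum, the monitor method and its parameters (state, jobId, remoteActive) are hypothetical names invented for this example and are not Hive APIs.

```java
import java.util.function.BooleanSupplier;
import java.util.function.Supplier;

// Hypothetical sketch; none of these names come from Hive itself.
public class MonitorSketch {
    enum JobState { QUEUED, STARTED, SUCCEEDED, FAILED }

    /**
     * Polls until the job reaches a terminal state.
     * Returns 0 on success, 1 on job failure, 2 if the remote driver died
     * after JobStarted but before the job id (JobSubmitted) was received.
     */
    static int monitor(Supplier<JobState> state,
                       Supplier<Integer> jobId,       // null until JobSubmitted arrives
                       BooleanSupplier remoteActive,  // false once the driver is gone
                       long pollMillis) throws InterruptedException {
        while (true) {
            switch (state.get()) {
                case SUCCEEDED:
                    return 0;
                case FAILED:
                    return 1;
                case STARTED:
                    // Without this check the loop would spin forever:
                    // the state stays STARTED and the job id never arrives.
                    if (jobId.get() == null && !remoteActive.getAsBoolean()) {
                        return 2;
                    }
                    break;
                default:
                    break;
            }
            Thread.sleep(pollMillis);
        }
    }

    public static void main(String[] args) throws InterruptedException {
        // Simulates a driver killed between JobStarted and JobSubmitted.
        int rc = monitor(() -> JobState.STARTED, () -> null, () -> false, 10);
        System.out.println("exit code: " + rc); // prints "exit code: 2"
    }
}
```

The liveness check is one possible way to break the loop; a timeout on the wait for the job id would be another, at the cost of choosing a suitable limit.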

      Attachments

        1. HIVE-15860.2.patch
          2 kB
          Rui Li
        2. HIVE-15860.2.patch
          2 kB
          Rui Li
        3. HIVE-15860.1.patch
          2 kB
          Rui Li

          People

            Assignee: Rui Li (lirui)
            Reporter: Rui Li (lirui)
            Votes: 0
            Watchers: 5
