Uploaded image for project: 'Tajo (Retired)'
  1. Tajo (Retired)
  2. TAJO-1507

Resource leak when a worker does not response KILLED message

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.11.0
    • Resource Manager, Worker
    • None

    Description

      Terminology

      • QM - query master
      • TM - tajo master

      The query kill mechanism is as follows:

      • Client sends kill command to TM.
      • TM forwards the kill command to QM.
      • QM disseminates the kill command to all workers.
      • Corresponding workers kill tasks and response KILLED to QM.

      BTW, Some workers cannot response KILLED message to QM due to its node failure, temporary network problem, or worker restart. In this case, TM cannot retrieve allocated resources even though the workers become normal.

      Attachments

        Activity

          People

            Unassigned Unassigned
            hyunsik Hyunsik Choi
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: