Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-16728

Taskmanager dies after job got stuck and canceling fails

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Not A Bug
    • 1.10.0
    • None
    • Runtime / Task
    • None

    Description

      At some point I noticed that a few jobs got stuck (they basically stopped processing the messages, I could detect this watching the expected output), so I tried to cancel them.

      The cancel operation failed, complaining that the job got stuck at 

      StreamTaskActionExecutor$SynchronizedStreamTaskActionExecutor.run(StreamTaskActionExecutor.java:86)

      and then the whole taskmanager shut down.

      See the attached log.

      This is actually happening practically every day in our staging environment where we are testing Flink 1.10.0.

      Attachments

        1. taskmanager.log.20200323.gz
          10 kB
          Leonid Ilyevsky

        Activity

          People

            Unassigned Unassigned
            lilyevsky Leonid Ilyevsky
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: