Flink / FLINK-35160

Enhance Thread Dump to provide a convenient way to display thread deadlocks in tasks

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.16.0, 1.17.1, 1.19.0, 1.18.1
    • Fix Version/s: None
    • Component/s: Runtime / REST
    • Labels: None

    Description

      After the business side reported performance issues in their tasks, we investigated and found that the tasks were suffering from thread deadlocks. However, the Thread Dump entry on the Flink page only shows thread stacks. Because the users are not very familiar with Java stack traces, they could not tell that the deadlocks were caused by their own business logic code and mistakenly attributed them to the Flink framework.

      The JVM's jstack command reports thread deadlocks clearly, but unfortunately the business team does not have permission to log into the machines. Here is the jstack log.

      Flame graphs are excellent for visualizing performance bottlenecks and hotspots when profiling an application, but they are not designed to pinpoint the exact lines of code where thread deadlocks occur.

      Perhaps we could enhance the Thread Dump feature to display thread deadlocks, similar to what the jstack command provides.
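
      To illustrate the idea, here is a minimal sketch of how a jstack-like deadlock summary could be produced from inside the JVM, assuming the detection is built on the JDK's standard ThreadMXBean. The class name DeadlockReport and the method describeDeadlocks are hypothetical and are not part of Flink; how the result would be wired into the REST API and the web UI is left open.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;

// Hypothetical helper, not an existing Flink class.
final class DeadlockReport {

    /** Returns a jstack-like deadlock summary, or an empty string if no deadlock is found. */
    static String describeDeadlocks() {
        ThreadMXBean threads = ManagementFactory.getThreadMXBean();

        // Detects cycles over object monitors (synchronized) and ownable
        // synchronizers (e.g. ReentrantLock); returns null when there is none.
        long[] ids = threads.findDeadlockedThreads();
        if (ids == null) {
            return "";
        }

        StringBuilder report = new StringBuilder();
        report.append("Found ").append(ids.length).append(" deadlocked threads:\n");

        for (ThreadInfo info : threads.getThreadInfo(ids, true, true)) {
            if (info == null) {
                continue; // the thread terminated between the two calls
            }
            report.append('"').append(info.getThreadName()).append('"')
                  .append(" is waiting for ").append(info.getLockName())
                  .append(", held by \"").append(info.getLockOwnerName()).append("\"\n");
            // The frames show which user code acquired and requested the locks.
            for (StackTraceElement frame : info.getStackTrace()) {
                report.append("    at ").append(frame).append('\n');
            }
        }
        return report.toString();
    }
}
```

      A summary like this could be shown alongside the existing thread stacks, so that users who are not familiar with Java stack traces can see directly which threads block each other and in which code.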

       

      Attachments

        1. image-2024-04-18-20-57-52-440.png
          280 kB
          elon_X
        2. image-2024-04-18-20-58-09-872.png
          735 kB
          elon_X
        3. image-2024-04-18-21-01-22-881.png
          610 kB
          elon_X
        4. image-2024-04-18-21-34-41-014.png
          1.13 MB
          elon_X
        5. image-2024-04-20-14-43-36-939.png
          1.63 MB
          elon_X

        Activity

          People

            Assignee: Unassigned
            Reporter: elon_X
            Votes: 1
            Watchers: 2

            Dates

              Created:
              Updated: