  1. Flink
  2. FLINK-34573

The task gets stuck under high pressure


Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Won't Fix
    • Affects Version/s: 1.14.3
    • Fix Version/s: None
    • Component/s: Runtime / Network
    • Labels: None

    Description

      We have a Flink job with just one TaskManager.

      When the source data arrives under high pressure, the job gets stuck. Sometimes it runs for 1 day before this happens, sometimes for only 30 minutes.

      For example: the TaskManager rebooted at 13:30, ran for 30 minutes, and then got stuck.

      We tested 3 cases:

      1: Low pressure (1200 eps): it runs for 30 min or 1 day.

      2: Checkpointing disabled, high pressure (1800 eps): it ran for 3 days and did not get stuck.

      3: Original manager memory doubled: it still gets stuck; only the time until the stall appears changed, from 30 minutes to 3 days.

       

      Thread dump taken under high pressure, with CPU at 90%~100%:

      tm-thread-dump-chk-0123[1].json

      Thread dump taken in the normal state, under low pressure:
      tm-thread-dump-no-lock-0123[1].json
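
One way to compare the two dumps is to count threads per JVM state and list the blocked ones. The sketch below assumes each file is a JSON object with a `threadInfos` array whose entries carry `threadName` and `stringifiedThreadInfo` (the text produced by `java.lang.management.ThreadInfo.toString()`). That layout is an assumption and should be checked against the attached files. The helper names `count_thread_states`, `blocked_threads`, and `load_dump` are hypothetical.

```python
import json
import re
from collections import Counter

# Matches the thread state that follows "Id=<n>" in ThreadInfo.toString() output,
# e.g. '"name" Id=42 BLOCKED on java.lang.Object@1a2b owned by "other" Id=7'.
STATE_RE = re.compile(
    r"Id=\d+ (NEW|RUNNABLE|BLOCKED|WAITING|TIMED_WAITING|TERMINATED)\b"
)


def _state_of(info):
    """Return the JVM thread state of one entry, or "UNKNOWN" if not found."""
    match = STATE_RE.search(info.get("stringifiedThreadInfo", ""))
    return match.group(1) if match else "UNKNOWN"


def count_thread_states(dump):
    """Count threads per state in a parsed dump (assumed layout, see above)."""
    return Counter(_state_of(info) for info in dump.get("threadInfos", []))


def blocked_threads(dump):
    """Return the names of all threads reported as BLOCKED."""
    return [
        info.get("threadName", "?")
        for info in dump.get("threadInfos", [])
        if _state_of(info) == "BLOCKED"
    ]


def load_dump(path):
    """Read a thread dump JSON file from disk."""
    with open(path, encoding="utf-8") as fh:
        return json.load(fh)
```

For instance, calling `count_thread_states(load_dump("tm-thread-dump-chk-0123[1].json"))` and the same for `tm-thread-dump-no-lock-0123[1].json` gives two counters that can be compared directly. A noticeably larger `BLOCKED` or `WAITING` count in the first file would suggest where to look next, though it would not by itself prove the cause.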


      Attachments

        1. rate.PNG
          165 kB
          LSZ
        2. stuck.PNG
          91 kB
          LSZ
        3. tm-thread-dump-chk-0123[1].json
          369 kB
          LSZ
        4. tm-thread-dump-no-lock-0123[1].json
          372 kB
          LSZ

          People

            Assignee: Unassigned
            Reporter: LSZ (liusz)
            Votes: 0
            Watchers: 2

              Time Tracking

                Estimated: 120h
                Remaining: 120h
                Logged: Not Specified