Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-21352 FLIP-158: Generalized incremental checkpoints
  3. FLINK-24864

Release TaskManagerJobMetricGroup with the last slot rather than task

    XMLWordPrintableJSON

Details

    • TM-per-job metrics are now released upon last slot release rather than task, i.e. may stay on TM longer.

    Description

      // Beware that this ticket FLINK-24864 clones FLINK-23486.

       

      FLINK-23486 adds metrics for Changelog Uploader.

      Since that uploader is shared among tasks of the same job on a TM, the right level for metrics is TaskManagerJobMetricGroup (see design doc).

       

      However, the lifecycle of TaskManagerJobMetricGroup differs from that of StateChangelogStorage: the former is released on last task unregister; the latter - on last job slot release.

      This causes problem that an old TMJMG can be used by Storage.

       

      Releasing Storage on last task unregister has some disadvantages, including overhead, thread-safety and logical inconsistency.

      Releasing TMJMG on last slot release doesn't have major disadvantages.

      This ticket proposes the former change.

      Attachments

        Issue Links

          Activity

            People

              roman Roman Khachatryan
              roman Roman Khachatryan
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: