Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-25318 Improvement of scheduler and execution for Flink OLAP
  3. FLINK-25586

ExecutionGraphInfoStore in session cluster should split failed and successful jobs

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.12.7, 1.13.5, 1.14.2
    • None
    • Runtime / Coordination
    • None

    Description

      In flink session cluster, jobs are stored in `FileExecutionGraphInfoStore`. When the count of jobs in it reaches `jobstore.cache-size` or the live time of jobs reaches `jobstore.expiration-time`, the specify jobs will be removed. We can't holds too many jobs for performance reason, but we should hold failed jobs for longer time to trace the cause of failure. So it's better to split failed and successful jobs in `FileExecutionGraphInfoStore` and support independent max-capacity for them.

      Attachments

        Activity

          People

            Zhanghao Chen Zhanghao Chen
            zjureel Fang Yong
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated: