Flink / FLINK-24048

Move changeLog inference out of optimizing phase

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.14.0
    • Fix Version/s: 2.0.0
    • Component/s: Table SQL / Planner
    • Labels: None

    Description

      Currently, when a SQL job has multiple sinks, the DAG is split into multiple RelNode blocks. Because changelog inference runs in the optimizing phase, the changelog mode has to be propagated among blocks so that each block can generate an accurate physical plan.

      In the current solution, the DAG is optimized 3 times in order to propagate the changelog mode, which is inefficient. Instead, we can optimize the DAG once, expand it into a physical node tree, and then infer the changelog mode on that tree. This way, the DAG is optimized only once.

      (Similarly, the mini-batch interval can be inferred in the same way.)
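
The idea of inferring the changelog mode on the already expanded physical DAG can be illustrated with a small toy model. This is only a sketch under simplifying assumptions, not the Flink planner's code: `Node`, `produces_updates`, `needs_update_before`, and `infer_changelog` are hypothetical names, and every operator is assumed to pass its consumers' requirements straight through to its inputs. The point it shows is that once all sinks share one node graph, a shared node can see the requirements of all its consumers in a single traversal, so no re-optimization is needed to propagate them.

```python
from dataclasses import dataclass, field


@dataclass(eq=False)  # eq=False keeps identity-based hashing for use in dicts/sets
class Node:
    """Hypothetical physical node; not a Flink planner class."""
    name: str
    inputs: list = field(default_factory=list)
    produces_updates: bool = False     # e.g. a non-windowed aggregate
    needs_update_before: bool = False  # e.g. a sink that consumes retractions
    # Results filled in by infer_changelog().
    emits_updates: bool = False
    emits_update_before: bool = False


def topo_order(sinks):
    """Return every reachable node once, inputs before their consumers."""
    order, seen = [], set()

    def visit(node):
        if node in seen:
            return
        seen.add(node)
        for child in node.inputs:
            visit(child)
        order.append(node)

    for sink in sinks:
        visit(sink)
    return order


def infer_changelog(sinks):
    """Infer changelog properties over the whole DAG in one go."""
    order = topo_order(sinks)

    # Bottom-up: a node emits updates if it creates them or receives them.
    for node in order:
        node.emits_updates = node.produces_updates or any(
            child.emits_updates for child in node.inputs
        )

    # Top-down: collect the demand of ALL consumers before deciding
    # whether a (possibly shared) node has to emit UPDATE_BEFORE.
    demand = {node: False for node in order}
    for node in reversed(order):  # consumers are visited before their inputs
        node.emits_update_before = node.emits_updates and demand[node]
        wanted_from_inputs = node.needs_update_before or demand[node]
        for child in node.inputs:
            demand[child] = demand[child] or wanted_from_inputs
    return order


# Usage: one aggregate shared by two sinks with different requirements.
source = Node("source")
aggregate = Node("aggregate", inputs=[source], produces_updates=True)
upsert_sink = Node("upsert_sink", inputs=[aggregate])
retract_sink = Node("retract_sink", inputs=[aggregate], needs_update_before=True)

infer_changelog([upsert_sink, retract_sink])
print(aggregate.emits_updates, aggregate.emits_update_before)
```

In this toy setup the shared aggregate ends up emitting UPDATE_BEFORE because one of its two consumers asks for it. With per-sink blocks, that requirement would have to be carried from one block to another, which is the propagation the description wants to avoid.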

            People

              Assignee: Unassigned
              Reporter: Shuo Cheng (icshuo)
              Votes: 0
              Watchers: 17