Flink / FLINK-10566

Flink Planning is exponential in the number of stages

    Details

      Description

      This makes it nearly impossible to run graphs with 100 or more stages. (The execution itself is still sub-second, but the job submission takes increasingly long.)

      I can reproduce this with the following pipeline, which resembles my real-world workloads (with depth up to 10 and width up to, and past, 50). On Flink, getting width beyond 10 seems problematic (job submission times out after hours). Note the log scale for time on the attached chart.

       

        import java.util.Collection;

        import org.apache.flink.api.common.functions.RichMapFunction;
        import org.apache.flink.api.java.DataSet;
        import org.apache.flink.api.java.ExecutionEnvironment;
        import org.apache.flink.api.java.tuple.Tuple2;
        import org.apache.flink.configuration.Configuration;

        public static void runPipeline(int depth, int width) throws Exception {
          final ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();

          DataSet<String> input = env.fromElements("a", "b", "c");
          DataSet<String> stats = null;

          // Each level narrows: level i contributes width / (i + 1) + 1 branches.
          for (int i = 0; i < depth; i++) {
            stats = analyze(input, stats, width / (i + 1) + 1);
          }

          stats.writeAsText("out.txt");
          env.execute("depth " + depth + " width " + width);
        }

        public static DataSet<String> analyze(DataSet<String> input, DataSet<String> stats, int branches) {
          System.out.println("analyze " + branches);
          for (int i = 0; i < branches; i++) {
            final int ii = i;

            if (stats != null) {
              // Feed the accumulated stats back into the next stage as a broadcast set.
              input = input.map(new RichMapFunction<String, String>() {
                  @Override
                  public void open(Configuration parameters) throws Exception {
                    Collection<String> broadcastSet = getRuntimeContext().getBroadcastVariable("stats");
                  }
                  @Override
                  public String map(String value) throws Exception {
                    return value;
                  }
                }).withBroadcastSet(stats.map(s -> "(" + s + ").map"), "stats");
            }

            DataSet<String> branch = input
                                     .map(s -> new Tuple2<Integer, String>(0, s + ii))
                                     .groupBy(0)
                                     .minBy(1)
                                     .map(kv -> kv.f1);
            if (stats == null) {
              stats = branch;
            } else {
              stats = stats.union(branch);
            }
          }
          return stats.map(s -> "(" + s + ").stats");
        }
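      For a sense of plan size (not planning cost, which is the bug), the branch arithmetic from the loop above can be tallied in isolation. This is a sketch; `PlanSize`, `branchesAtLevel`, and `totalBranches` are illustrative names not in the report, and the counts below only measure how many map/groupBy/minBy/union branches the repro builds, not Flink's optimizer work.

```java
// Counts the branches the repro constructs: level i of the pipeline
// calls analyze(input, stats, width / (i + 1) + 1), and each branch
// adds a map -> groupBy -> minBy -> map chain plus a union.
public class PlanSize {
  static int branchesAtLevel(int width, int i) {
    return width / (i + 1) + 1;
  }

  static int totalBranches(int depth, int width) {
    int total = 0;
    for (int i = 0; i < depth; i++) {
      total += branchesAtLevel(width, i);
    }
    return total;
  }

  public static void main(String[] args) {
    // depth 10, width 50 -- the shape of the reporter's real workloads.
    System.out.println(PlanSize.totalBranches(10, 50)); // prints 154
  }
}
```

      So a depth-10, width-50 run builds only 154 branches, i.e. a few hundred operators; the DAG itself is small, which is why the exponential submission time points at planning rather than graph size.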
      
      

        Attachments

        1. chart.png (12 kB, Robert Bradshaw)

              People

              • Assignee:
                mxm Maximilian Michels
                Reporter:
                robertwb Robert Bradshaw
              • Votes:
                0 Vote for this issue
                Watchers:
                6 Start watching this issue


        Time Tracking

        • Estimated: Not Specified
        • Remaining: 0h
        • Logged: 10m