Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-15325

Ignores the input locations of a ConsumePartitionGroup if the corresponding ConsumerVertexGroup is too large

    XMLWordPrintableJSON

Details

    Description

      When running TPC-DS jobs in a session cluster, we observed that sometimes tasks are not evenly distributed in TMs. The root cause turned out to be that the downstream tasks tend to be TM or host local with its input tasks. This helps to reduce network shuffle.
      However, in certain cases, like the topology presented in the attached image, jamming the input task's TM and machine with downstream tasks would affect the performance. In this case, respecting input location preferences is causing troubles more than bringing benefits.
      So I'm wondering whether we should introduce a config so that users can disable input location preferences?

      Attachments

        Issue Links

          Activity

            People

              zhuzh Zhu Zhu
              zhuzh Zhu Zhu
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: