FLINK-24300

MultipleInputOperator is running much more slowly in TPCDS


Details

    Description

      When running TPCDS with release 1.14, we found that jobs using MultipleInputOperator run much more slowly than before. Bisecting the commits suggests that the issue may have been introduced by FLINK-23408.

      At commit 64570e4c56955713ca599fd1d7ae7be891a314c6, the TPCDS job runs normally (see the attachment 64570e4c56955713ca599fd1d7ae7be891a314c6.png).

      At commit e3010c16947ed8da2ecb7d89a3aa08dacecc524a, the job for q2.sql gets stuck for more than half an hour (see the attachment e3010c16947ed8da2ecb7d89a3aa08dacecc524a.png).

      The details of the job are shown in the attachment detail-of-the-job.png.

      The job uses a MultipleInputOperator with one normal input and two chained FileSources. It has finished reading the normal input and has started reading the chained sources. Each chained source has one source data fetcher.

      We captured a jstack of the stuck tasks and attached it below. From jstack.txt we can see that the main thread is blocked waiting for a lock, and that the lock is held by a source data fetcher. The source data fetcher is still running, but its stack trace stays in CompletableFuture.cleanStack.
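
      The lock relationship described above (one thread blocked, another thread owning the lock) can also be checked programmatically. The following is a minimal sketch using the JDK's ThreadMXBean, not code from Flink: the class name LockOwnerProbe and the thread names "hypothetical-source-fetcher" and "hypothetical-task-thread" are made up for illustration, and the sketch assumes the contended lock is visible to the JVM's thread management interface, as a plain monitor is.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;
import java.util.concurrent.CountDownLatch;

// Illustrative only: reproduces "thread A blocked on a lock held by thread B"
// and asks the JVM which thread owns the lock, as a jstack dump would show.
class LockOwnerProbe {

    /** Returns the name of the thread owning the lock that `blocked` waits on, or null. */
    static String lockOwnerOf(Thread blocked) {
        ThreadMXBean mx = ManagementFactory.getThreadMXBean();
        ThreadInfo info = mx.getThreadInfo(blocked.getId());
        return info == null ? null : info.getLockOwnerName();
    }

    static String demo() throws InterruptedException {
        final Object lock = new Object();
        final CountDownLatch held = new CountDownLatch(1);
        final CountDownLatch release = new CountDownLatch(1);

        // Stand-in for the source data fetcher: takes the lock and keeps it.
        Thread fetcher = new Thread(() -> {
            synchronized (lock) {
                held.countDown();
                try {
                    release.await();
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            }
        }, "hypothetical-source-fetcher");

        // Stand-in for the task's main thread: needs the same lock.
        Thread task = new Thread(() -> {
            synchronized (lock) {
                // nothing to do; acquiring the lock is the point
            }
        }, "hypothetical-task-thread");

        fetcher.start();
        held.await();                 // the fetcher now owns the lock
        task.start();
        while (task.getState() != Thread.State.BLOCKED) {
            Thread.sleep(10);         // wait until the task is really blocked
        }

        String owner = lockOwnerOf(task);

        release.countDown();          // let both threads finish
        fetcher.join();
        task.join();
        return owner;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("lock owner: " + demo());
    }
}
```

      In a real investigation the same information comes from the "waiting to lock" and "locked" lines of the jstack output; the sketch only shows how the owner of a contended lock can be identified.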

      This issue happens in a batch job. However, judging from where the task gets blocked, it seems that streaming jobs are affected as well.

      For reference, the TPCDS code we are running is located at https://github.com/ververica/flink-sql-benchmark/tree/dev.

      Attachments

        1. jstack-2.txt
          96 kB
          Yun Gao
        2. jstack.txt
          96 kB
          Zhilong Hong
        3. e3010c16947ed8da2ecb7d89a3aa08dacecc524a.png
          265 kB
          Zhilong Hong
        4. detail-of-the-job.png
          624 kB
          Zhilong Hong
        5. 64570e4c56955713ca599fd1d7ae7be891a314c6.png
          329 kB
          Zhilong Hong

            People

              dwysakowicz Dawid Wysakowicz
              Thesharing Zhilong Hong