Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-5068

Support cow flink batch create fs input split asynchronously

    XMLWordPrintableJSON

Details

    Description

      CopyOnWriteInputFormat#createInputSplits is invoked by org.apache.flink.runtime.executiongraph.ExecutionJobVertex in JobManager to create file input splits synchronously.

      It's found that in batch mode this will occupy the largest share of job submission time.

      So in this PR it will be optimized by creating input splits in thread pool executor asynchronously.

      Attachments

        Issue Links

          Activity

            People

              chenshizhi Shizhi Chen
              chenshizhi Shizhi Chen
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: