Apache Hudi / HUDI-6864

Auto-tune dedup parallelism without affecting write parallelism

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: 1.1.0
    • Component/s: None
    • Labels: None

    Description

      See HUDI-6863 and https://github.com/apache/hudi/pull/6802#issuecomment-1455802492

      I think we need to make sure that the dedup parallelism applies only to the dedup stage and does not carry over to subsequent stages. This may require better parallelism control, for example by repartitioning with the right parallelism before workload profiling.
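
      To illustrate the intent, here is a minimal pure-Python sketch, not Hudi or Spark code. It assumes records are (key, ordering value, payload) tuples and models each stage as a list of partitions. The helper names `hash_partition`, `dedup`, and `repartition`, and the parameters `dedup_parallelism` and `write_parallelism`, are hypothetical and chosen only for this example. The point it tries to show is that the dedup stage leaves its output in `dedup_parallelism` partitions, so an explicit repartition is needed if later stages should run with a different parallelism.

```python
def hash_partition(records, num_partitions, key_fn):
    """Distribute records into num_partitions buckets by hash of their key."""
    partitions = [[] for _ in range(num_partitions)]
    for rec in records:
        partitions[hash(key_fn(rec)) % num_partitions].append(rec)
    return partitions


def dedup(records, dedup_parallelism):
    """Hypothetical dedup stage: keep the record with the largest ordering
    value per key. The result still has dedup_parallelism partitions."""
    out = []
    for part in hash_partition(records, dedup_parallelism, lambda r: r[0]):
        latest = {}
        for key, ordering, payload in part:
            if key not in latest or ordering > latest[key][1]:
                latest[key] = (key, ordering, payload)
        out.append(list(latest.values()))
    return out


def repartition(partitions, write_parallelism):
    """Hypothetical step before workload profiling: redistribute the deduped
    records so later stages use write_parallelism, not dedup_parallelism."""
    flat = [rec for part in partitions for rec in part]
    return hash_partition(flat, write_parallelism, lambda r: r[0])


if __name__ == "__main__":
    records = [("a", 1, "x"), ("a", 2, "y"), ("b", 1, "z"), ("c", 5, "w"), ("c", 3, "v")]
    deduped = dedup(records, dedup_parallelism=2)
    for_write = repartition(deduped, write_parallelism=8)
    print(len(deduped), len(for_write))
```

      In this sketch the dedup parallelism can be tuned freely (here 2) while the downstream stages still see the write parallelism (here 8). Whether the extra shuffle is acceptable in the real write path is a separate question that this example does not address.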

          People

            Assignee: Unassigned
            Reporter: Ethan Guo (guoyihua)
            Votes: 0
            Watchers: 1