Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-35602 [Umbrella] Test Flink Release 1.20
  3. FLINK-35690

Release Testing: Verify FLIP-459: Support Flink hybrid shuffle integration with Apache Celeborn

    XMLWordPrintableJSON

Details

    Description

      Follow up the test for https://issues.apache.org/jira/browse/FLINK-35533

      In Flink 1.20,  we proposed integrating Flink's Hybrid Shuffle with Apache Celeborn through a pluggable remote tier interface. To verify this feature, you should reference these main two steps.

      1. Implement Celeborn tier.

      • Implement a new tier factory and tier for Celeborn, including these APIs, including TierFactory/TierMasterAgent/TierProducerAgent/TierConsumerAgent.
      • The implementations should support granular data management at the Segment level for both client and server sides.

      2. Use the implemented tier to shuffle data.

      • Compile Flink and Celeborn.
      • Deploy Celeborn service
      • Add the compiled flink plugin jar (celeborn-client-flink-xxx.jar) to Flink classpath.
      • Configure the options to enable the feature.
        • Configure the option taskmanager.network.hybrid-shuffle.external-remote-tier-factory.class to the new Celeborn tier classes. Except for this option, the following options should also be added.
      execution.batch-shuffle-mode: ALL_EXCHANGES_HYBRID_FULL 
      celeborn.master.endpoints: <the celeborn endpoint address>
      celeborn.client.shuffle.partition.type: MAP
      • Run some test examples(e.g., WordCount) to verify the feature.

       

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              tanyuxin Yuxin Tan
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: