  1. Flink
  2. FLINK-32726 [Umbrella] Test Flink Release 1.18
  3. FLINK-32779

Release Testing: Verify FLIP-301: Hybrid Shuffle supports Remote Storage

Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Resolved
    • Affects Version/s: 1.18.0
    • Fix Version/s: 1.18.0
    • Component/s: Tests
    • Labels: None

    Description

      This ticket aims to verify https://issues.apache.org/jira/browse/FLINK-31634.

      The verification consists of two parts.

      Part 1. Run without remote storage.
      This part verifies that the new mode can use the Memory tier and the Disk tier dynamically when shuffling.
      Set the shuffle mode to the new hybrid shuffle mode (execution.batch-shuffle-mode: ALL_EXCHANGES_HYBRID_SELECTIVE) and run a simple job, for example TPC-DS q1.sql. When enough resources are available, the upstream and downstream tasks should be able to run at the same time.
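
      As a minimal sketch, the Part 1 setting could be placed in the cluster configuration as follows. The option key and value come from the description above; the file name flink-conf.yaml is an assumption about how the test cluster is configured.

      ```yaml
      # flink-conf.yaml (fragment; file name is an assumption)
      # Enable the new hybrid shuffle mode. No remote path is set,
      # so only the Memory and Disk tiers are exercised in this part.
      execution.batch-shuffle-mode: ALL_EXCHANGES_HYBRID_SELECTIVE
      ```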

      Part 2. Run with remote storage.
      This part verifies that the new mode can use the Memory tier, the Disk tier, and the Remote tier dynamically when shuffling.
      2.1 Set the shuffle mode to the new hybrid shuffle mode (execution.batch-shuffle-mode: ALL_EXCHANGES_HYBRID_SELECTIVE).
      2.2 Set the remote storage path with the option taskmanager.network.hybrid-shuffle.remote.path: oss://flink-runtime/runtime/shuffle. Note that the path oss://flink-runtime/runtime/shuffle must already exist in OSS.
      2.3 Change the constant TieredStorageConfiguration#DEFAULT_MIN_RESERVE_DISK_SPACE_FRACTION to 1 (which should leave the Disk tier with no usable space), compile the package, then run a simple job, for example TPC-DS q1.sql. Check that the shuffle data is written to the remote storage under oss://flink-runtime/runtime/shuffle.
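
      Steps 2.1 and 2.2 might be expressed as the configuration fragment below. Both option keys and the OSS path are taken from the steps above; the file name flink-conf.yaml is an assumption, and the sketch also assumes that an OSS filesystem plugin and its credentials are already set up for the cluster, which this ticket does not describe. Step 2.3 is a source-code change and is not covered by this fragment.

      ```yaml
      # flink-conf.yaml (fragment; file name is an assumption)
      # 2.1 Enable the new hybrid shuffle mode.
      execution.batch-shuffle-mode: ALL_EXCHANGES_HYBRID_SELECTIVE
      # 2.2 Remote tier base path. The path must already exist in OSS.
      taskmanager.network.hybrid-shuffle.remote.path: oss://flink-runtime/runtime/shuffle
      ```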

          People

            Assignee: Weijie Guo
            Reporter: Qingsheng Ren (renqs)