Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-3665

Range partitioning lacks support to define sort orders

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Implemented
    • 1.0.0
    • 1.1.0
    • API / DataSet
    • None

    Description

      DataSet.partitionByRange() does not allow to specify the sort order of fields. This is fine if range partitioning is used to reduce skewed partitioning.
      However, it is not sufficient if range partitioning is used to sort a data set in parallel.

      Since DataSet.partitionByRange() is @Public API and cannot be easily changed, I propose to add a method withOrders(Order... orders) to PartitionOperator. The method should throw an exception if the partitioning method of PartitionOperator is not range partitioning.

      Attachments

        Activity

          People

            dwysakowicz Dawid Wysakowicz
            fhueske Fabian Hueske
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: