Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-1214

Prevent partitioning pushdown unless partitions fields match exactly

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.8.0
    • 0.8.0
    • API / DataSet
    • None

    Description

      Consider an operation grouped on fields (A, B), followed by an operation grouped on field (A).

      Right now, the optimizer can push down the partitioning on (A), which serves both operations (the first step locally still groups by A and B). This may however by a bad idea for the cases where the field A has a low cardinality, or the value distribution is skewed.

      Since we cannot determine that robustly yet, I suggest to disable this optimization for now.

      Attachments

        Activity

          People

            sewen Stephan Ewen
            sewen Stephan Ewen
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: