  1. Flink
  2. FLINK-815

Improve partitioning hash function

Details

    Description

      Right now, the partitioner (`OutputEmitter`) directly uses the hash code produced by the partitioning elements. Types like `Integer` have very weak hash functions (`Integer.hashCode()` simply returns the value itself), so hash partitioning is very susceptible to skew whenever the keys follow a regular pattern.
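
The skew described above can be illustrated with a minimal sketch. This is not the actual `OutputEmitter` code: the class `HashPartitionSketch` and its methods are hypothetical names invented for this example, and the bit-mixing step (the 32-bit finalizer from MurmurHash3) is only one possible way to scramble a weak hash code before the channel is chosen.

```java
// Hypothetical illustration, not Flink's OutputEmitter implementation.
final class HashPartitionSketch {

    // Naive selection: the key's hashCode() is used as-is.
    static int naiveChannel(Object key, int numChannels) {
        return Math.floorMod(key.hashCode(), numChannels);
    }

    // MurmurHash3 32-bit finalizer: spreads every input bit over the output bits.
    static int mix(int h) {
        h ^= h >>> 16;
        h *= 0x85ebca6b;
        h ^= h >>> 13;
        h *= 0xc2b2ae35;
        h ^= h >>> 16;
        return h;
    }

    // Improved selection: scramble the hash code before taking the modulo.
    static int mixedChannel(Object key, int numChannels) {
        return Math.floorMod(mix(key.hashCode()), numChannels);
    }

    // Counts records per channel for the keys 0, n, 2n, ... (n = numChannels),
    // a pattern that is the worst case for the naive scheme.
    static int[] histogram(boolean useMixing, int numChannels, int numKeys) {
        int[] counts = new int[numChannels];
        for (int i = 0; i < numKeys; i++) {
            Integer key = i * numChannels;
            int channel = useMixing
                    ? mixedChannel(key, numChannels)
                    : naiveChannel(key, numChannels);
            counts[channel]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        System.out.println("naive: " + java.util.Arrays.toString(histogram(false, 8, 1000)));
        System.out.println("mixed: " + java.util.Arrays.toString(histogram(true, 8, 1000)));
    }
}
```

With 8 channels and `Integer` keys that are all multiples of 8, the naive variant sends every record to channel 0, because `Integer.hashCode()` is the identity. The mixed variant no longer ties the channel to the low bits of the key, so the same keys are spread over several channels.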

      ---------------- Imported from GitHub ----------------
      Url: https://github.com/stratosphere/stratosphere/issues/815
      Created by: StephanEwen
      Labels: runtime
      Milestone: Release 0.6 (unplanned)
      Created at: Wed May 14 21:40:36 CEST 2014
      State: open

          People

            Unassigned
            GitHub Import (github-import)