Apache Nemo / NEMO-327

Fix skew handling for multi shuffle edge receiver

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.2

      Description

      Currently, if a vertex receives multiple shuffle edges (for example, in a join), DataSkewPolicy optimizes the partitioning of each edge independently.

      As a result, two key-value elements with an identical key can be assigned to different tasks, and the result will then differ from the expected one.

      We need to collect the data metrics for the two edges in a single metric aggregation vertex and optimize both edges at once (as in the Dryad paper).
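The difference between per-edge and joint optimization can be illustrated with a small sketch. This is not Nemo's implementation: it assumes a simplified model in which keys are hashed into a fixed number of buckets, each edge reports a byte count per bucket, and contiguous bucket ranges are greedily assigned to tasks. The names `balance_ranges` and `aggregate` and the sample sizes are hypothetical.

```python
from typing import List


def balance_ranges(bucket_sizes: List[int], num_tasks: int) -> List[int]:
    """Assign contiguous hash buckets to tasks, aiming at total/num_tasks per task.

    Returns a list mapping bucket index -> task index.
    Simplified greedy model, assumed for illustration only.
    """
    target = sum(bucket_sizes) / num_tasks
    assignment = []
    task, acc = 0, 0
    for size in bucket_sizes:
        # Start a new task when this bucket would push a non-empty task past its share.
        if acc > 0 and acc + size > target and task < num_tasks - 1:
            task += 1
            acc = 0
        assignment.append(task)
        acc += size
    return assignment


def aggregate(per_edge_sizes: List[List[int]]) -> List[int]:
    """Sum the per-bucket sizes of all incoming shuffle edges (the single aggregation step)."""
    return [sum(sizes) for sizes in zip(*per_edge_sizes)]


# Hypothetical per-bucket data sizes observed on the two shuffle edges of a join.
left_edge = [90, 5, 3, 2]
right_edge = [2, 3, 5, 90]
NUM_TASKS = 2

# Per-edge optimization: each edge gets its own bucket-to-task mapping.
left_plan = balance_ranges(left_edge, NUM_TASKS)
right_plan = balance_ranges(right_edge, NUM_TASKS)
split_buckets = [b for b, (l, r) in enumerate(zip(left_plan, right_plan)) if l != r]

# Joint optimization: one mapping computed from the aggregated metric, shared by both edges.
joint_plan = balance_ranges(aggregate([left_edge, right_edge]), NUM_TASKS)

print("left plan: ", left_plan)
print("right plan:", right_plan)
print("buckets sent to different tasks:", split_buckets)
print("joint plan:", joint_plan)
```

With these sample sizes, the per-edge plans disagree on the two middle buckets, so records with matching keys in those buckets would never meet in the same task. The joint plan is shared by both edges, so every key lands in exactly one task.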

              People

              • Assignee: Sanha Lee (sanha)
              • Reporter: Sanha Lee (sanha)

                  Time Tracking

                  • Estimated: Not Specified
                  • Remaining: 0h
                  • Logged: 20m