Uploaded image for project: 'Samza'
  1. Samza
  2. SAMZA-906

Host Affinity - Minimize task reassignment when container count changes

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.10.1
    • Component/s: None
    • Labels:
      None

      Description

      When the job container count changes the default taskname grouper (GroupByContainerCount) will reassign all the tasks among the new container list in a round-robin fashion. This causes many of the tasks to shift to new containers. The shifted tasks will be unable to restore state from local disk because the new containers may not be assigned to the same host that the task's original container was assigned to.

      This ticket is to implement a task-to-container affinity, which complements the container-to-host affinity in the current implementation. The implementation will include a mapping which is persisted to the coordinator stream and is used as the basis for the new task-to-container mapping (ContainerModel).

      If the container count doesn't change, the old task mapping will be used. (Note that this will allow tools to inject custom mappings by writing to the coordinator stream).

      If the container count changes, a minimal number of tasks will be reassigned from the persisted mapping, in order to "balance" the containers.

        Attachments

        1. SAMZA-906_2.patch
          98 kB
          Jake Maes
        2. SAMZA-906_3.patch
          100 kB
          Jake Maes
        3. SAMZA-906_3.patch
          101 kB
          Jake Maes
        4. SAMZA-906_4.patch
          100 kB
          Jake Maes
        5. SAMZA-906_5.patch
          98 kB
          Jake Maes
        6. SAMZA-906.patch
          91 kB
          Jake Maes

          Issue Links

            Activity

              People

              • Assignee:
                jmakes Jake Maes
                Reporter:
                jmakes Jake Maes
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: