[SAMZA-882] Detect partition count changes in input streams - ASF JIRA

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 0.10.0
Fix Version/s: 0.10.1
Component/s: None
Labels: None

Description

This is a known issue: any change in the partition count of an upstream input stream affects the Samza job, which then needs to be restarted. In such scenarios, we experience data loss or incorrect processing because the application logic depends on the partitioning strategy. The problem is made worse by the fact that we do not have a good mechanism to detect such a change.

As a first step towards detection, I propose that we modify the stream metadata cache maintained in Samza so that, when there is a change in partition count, we increment a gauge metric. This way we can at least attach a hook to monitor when this happens and take the necessary actions.
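
The proposal above can be illustrated with a minimal, language-agnostic sketch. It is not the code in the attached patches and does not use Samza's actual metadata cache or metrics classes: `Gauge`, `PartitionCountMonitor`, `fetch_partition_count`, and the metric name are all hypothetical names chosen for illustration. The idea is only that the cache remembers the last partition count seen per stream and bumps a gauge whenever a refresh observes a different count.

```python
from typing import Callable, Dict, Iterable, List


class Gauge:
    """Minimal stand-in for a metrics gauge (hypothetical, not Samza's class)."""

    def __init__(self, name: str, value: int = 0) -> None:
        self.name = name
        self.value = value


class PartitionCountMonitor:
    """Caches partition counts per stream and counts observed changes.

    `fetch_partition_count` is an assumed callable that returns the current
    partition count for a stream name, e.g. by querying the broker.
    """

    def __init__(
        self,
        fetch_partition_count: Callable[[str], int],
        streams: Iterable[str],
    ) -> None:
        self._fetch = fetch_partition_count
        # Seed the cache with the counts seen at startup.
        self._cached: Dict[str, int] = {s: fetch_partition_count(s) for s in streams}
        # Hypothetical metric name; a monitoring hook would alert on value > 0.
        self.changes = Gauge("partition-count-changes")

    def refresh(self) -> List[str]:
        """Re-fetch counts, bump the gauge per changed stream, return their names."""
        changed = []
        for stream, previous in self._cached.items():
            current = self._fetch(stream)
            if current != previous:
                self.changes.value += 1
                self._cached[stream] = current
                changed.append(stream)
        return changed
```

In this sketch, a periodic task would call `refresh()` and an external monitor would watch the gauge; what to do once a change is detected (for example, restarting the job) is left to the operator, as in the proposal.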

However, in the long term, we need to come up with a better strategy for handling this.

Attachments

SAMZA-882-0.patch, 35 kB, 04/Mar/16 20:16, Navina Ramesh
SAMZA-882-1.patch, 35 kB, 25/Mar/16 01:27, Navina Ramesh

Issue Links

is part of

SAMZA-917 Dynamic rebalancing upon upstream changes

Open

People

Assignee: Navina Ramesh

Reporter: Navina Ramesh

Votes: 0

Watchers: 4

Dates

Created: 29/Feb/16 20:10

Updated: 25/Mar/16 02:37

Resolved: 25/Mar/16 02:37