[SAMZA-348] Configure Samza jobs through a stream - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 0.7.0
Fix Version/s: None
Component/s: None
Labels:
- design
- project

Description

Samza's existing config setup is problematic for a number of reasons:

It's completely immutable once a job starts. This prevents any dynamic reconfiguration and auto-scaling. It is debatable whether we want these feature or not, but our existing implementation actively prevents it. See SAMZA-334 for discussion.
We pass existing configuration through environment variables. YARN exports environment variables in a shell script, which limits the size to the varargs length on the machine. This is usually ~128KB. See SAMZA-333 and ~~SAMZA-337~~ for details.
User-defined configuration (the Config object) and programmatic configuration (checkpoints and TaskName:State mappings (see ~~SAMZA-123~~)) are handled differently. It's debatable whether this makes sense.

In ~~SAMZA-123~~, jghoman and I propose implementing a ConfigLog. This log would replace both the checkpoint topic and the existing config environment variables in SamzaContainer and Samza's YARN AM.

I'd like to keep this ticket's scope limited to just the implementation of the ConfigLog, and not re-designing how Samza's config is used in the code (SAMZA-40). We should, however, discuss how this feature would affect dynamic reconfiguration/auto-scaling.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

DESIGN-SAMZA-348-1.pdf
23/Sep/14 07:29
304 kB
Chris Riccomini
DESIGN-SAMZA-348-1.md
23/Sep/14 07:29
45 kB
Chris Riccomini
DESIGN-SAMZA-348-0.pdf
12/Sep/14 20:59
220 kB
Chris Riccomini
DESIGN-SAMZA-348-0.md
12/Sep/14 20:59
30 kB
Chris Riccomini

Issue Links

incorporates

SAMZA-798 Performance and stability issue after combining checkpoint and coordinator stream

Resolved

is related to

SAMZA-40 Refactor Samza configuration

Open

SAMZA-42 Add a job setup phase to Samza

Open

SAMZA-333 Large samza configurations results in yarn job failure

Open

SAMZA-374 Need to be able to change SSP Grouper

Resolved

SAMZA-406 Hot standby containers

Open

relates to

SAMZA-375 Investigate Mesos Job Support

Open

SAMZA-416 Samza Configuration DSL

Open

supercedes

SAMZA-237 Consider implementing job control topic to support dynamic inputs, capacity changes, etc.

Resolved

(1 is related to, 2 relates to, 1 supercedes)

Sub-Tasks

1.	Integrate CoordinatorStream to use SystemConsumers and SystemProducers	Open	Unassigned
2.	Optimize CoordinatorStream's bootstrap mechanism	Open	Unassigned
3.	Explicit restart containers to pick up dynamic JobModel changes	Open	Alex Buck

Activity

People

Assignee:: Chris Riccomini

Reporter:: Chris Riccomini

Votes:: 0 Vote for this issue

Watchers:: 15 Start watching this issue

Dates

Created:: 18/Jul/14 16:10

Updated:: 31/Mar/16 17:44