Details

• Type: Improvement
    • Status: Resolved
• Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: v1.5.0
    • Component/s: None
• Labels: None

Description

File Channel has scaled so well that people now run channels holding hundreds of millions of events. It turns out that replay can be extremely slow at this scale, even between checkpoints, because the remove() method in FlumeEventQueue moves every pointer that follows the one being removed (one remove causes 99 million+ moves for a channel of 100 million!). There are several ways of improving this. One is to move pointers only at the end of replay - essentially a compaction. Another is to exploit the fact that all removes happen from the top of the queue: move the first "k" events out to a hash set and remove from there - we can find k using the write ID of the last checkpoint and the current one.
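To make the cost concrete, here is a minimal sketch of the O(N) remove described above (a hypothetical array-based layout, not the actual FlumeEventQueue code):

    // Removing one pointer shifts every pointer that follows it, so a
    // single remove near the head of a 100-million-event queue moves
    // roughly 100 million entries.
    long remove(long[] queue, int size, long pointer) {
      for (int i = 0; i < size; i++) {
        if (queue[i] == pointer) {
          System.arraycopy(queue, i + 1, queue, i, size - i - 1);
          return pointer;
        }
      }
      return -1; // not found: the O(N) scan was wasted work
    }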

Attachments

1. FLUME-2155.5.patch
        37 kB
        Brock Noland
      2. FLUME-2155.4.patch
        35 kB
        Brock Noland
      3. FLUME-2155.2.patch
        34 kB
        Brock Noland
      4. FLUME-FC-SLOW-REPLAY-FIX-1.patch
        4 kB
        Brock Noland
      5. FLUME-FC-SLOW-REPLAY-1.patch
        11 kB
        Brock Noland
      6. FLUME-2155.patch
        12 kB
        Hari Shreedharan
      7. FLUME-2155-initial.patch
        2 kB
        Hari Shreedharan
      8. SmartReplay1.1.pdf
        86 kB
        Hari Shreedharan
      9. SmartReplay.pdf
        72 kB
        Hari Shreedharan
      10. fc-test.patch
        12 kB
        Hari Shreedharan
      11. 100000-110000
        5.70 MB
        Hari Shreedharan
      12. 10000-20000
        5.69 MB
        Hari Shreedharan
      13. 300000-310000
        5.70 MB
        Hari Shreedharan
      14. 700000-710000
        5.70 MB
        Hari Shreedharan

          Activity

          Hudson added a comment -

          SUCCESS: Integrated in flume-trunk #527 (See https://builds.apache.org/job/flume-trunk/527/)
          FLUME-2155. Index the Flume Event Queue during replay to improve replay time. (hshreedharan: http://git-wip-us.apache.org/repos/asf/flume/repo?p=flume.git&a=commit&h=6373032a620bdc687b6d03b12726713d08c71a10)

          • flume-ng-channels/flume-file-channel/src/main/java/org/apache/flume/channel/file/LogFile.java
          • flume-ng-channels/flume-file-channel/src/test/java/org/apache/flume/channel/file/TestEventQueueBackingStoreFactory.java
          • flume-ng-channels/flume-file-channel/pom.xml
          • flume-ng-channels/flume-file-channel/src/main/java/org/apache/flume/channel/file/Serialization.java
          • flume-ng-channels/flume-file-channel/src/test/java/org/apache/flume/channel/file/TestCheckpoint.java
          • flume-ng-channels/flume-file-channel/src/test/java/org/apache/flume/channel/file/TestCheckpointRebuilder.java
          • flume-ng-channels/flume-file-channel/src/main/java/org/apache/flume/channel/file/ReplayHandler.java
          • flume-ng-channels/flume-file-channel/src/main/java/org/apache/flume/channel/file/FlumeEventQueue.java
          • flume-ng-channels/flume-file-channel/src/test/java/org/apache/flume/channel/file/TestFlumeEventQueue.java
          • flume-ng-channels/flume-file-channel/src/main/java/org/apache/flume/channel/file/FileChannel.java
          • flume-ng-channels/flume-file-channel/src/main/java/org/apache/flume/channel/file/Log.java
          • flume-ng-channels/flume-file-channel/src/main/java/org/apache/flume/channel/file/CheckpointRebuilder.java
          • pom.xml
          • flume-ng-channels/flume-file-channel/src/main/java/org/apache/flume/channel/file/EventQueueBackingStoreFile.java
          Hari Shreedharan added a comment -

          Committed, rev: 6373032a620bdc687b6d03b12726713d08c71a10. Thanks Brock!

          Hari Shreedharan added a comment -

          +1. The latest patch looks good. I am running tests and committing it.

          Hari Shreedharan added a comment -

          Adding "queueset" to Log.EXCLUDES and changing EventQueueBackingStoreFile:169 to:

    if (Log.EXCLUDES.contains(origFile.getName())) {

          fixes this issue.
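For context, a sketch of the kind of check involved; the surrounding loop is assumed for illustration, not copied from EventQueueBackingStoreFile:

    // Skip internal bookkeeping files (now including the "queueset"
    // files) instead of treating them as checkpoint state.
    for (File origFile : checkpointDir.listFiles()) {
      if (Log.EXCLUDES.contains(origFile.getName())) {
        continue; // excluded file, not checkpoint data
      }
      // ... process the checkpoint file ...
    }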

Brock Noland added a comment -

RB item: https://reviews.apache.org/r/16107/
          Hari Shreedharan added a comment -

          Assigning to Brock.

          Brock Noland added a comment -

Sounds good. I am updating it a bit, as I think we only want to add items to the queueSet during replay.

          Hari Shreedharan added a comment -

Brock Noland - I think your patch looks good. If you add a couple of tests to that patch (to make sure it works as expected, maybe using the total search counts), I will commit it.

Brock Noland added a comment - edited

Yes, we could use it to store indexes. However, as I noted, I was unable to find a case where something was actually removed and wasn't at index 0, so I think most real removes are very fast. Additionally, I think we should limit MapDB's use for now; once we are more comfortable with it we can expand its use. For example, the overwrite map would be a good place.

          Hari Shreedharan added a comment -

Nice work! That makes sense - and nice find too. I was indeed looking at an off-heap data structure as a possible solution, but didn't know of one or have any experience using one. I don't have a channel where I can see the copy being terribly slow, but I think we can take it one step at a time. If we fix the issue you found - where searching for takes without puts takes too long - then we can actually tell whether copy is problematic (once this patch is in, we'd still see slowness in replay if copy is an issue).

I am also wondering if MapDB can be used to make searches faster too. The patch you provided fixes the specific issue of takes with missing puts (possibly because the old files got deleted), but if we use MapDB as an index, we can find the index of the FEP in the queue and remove it from the queue too (and then copy or compact at the end, either way is fine). It looks like MapDB can give us a map instead of a set, which could be used here to make searches faster.
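A minimal sketch of that map-as-index idea, with hypothetical names (MapDB would supply the map off-heap; a plain HashMap is shown only to illustrate the logic):

    static final long REMOVED_MARKER = -1L;  // assumed sentinel
    long[] queue;                            // the event pointer queue
    Map<Long, Integer> index = new HashMap<Long, Integer>();

    void replayTake(long pointer) {
      Integer pos = index.remove(pointer);   // O(1) instead of an O(N) scan
      if (pos != null) {
        queue[pos] = REMOVED_MARKER;         // compact once at end of replay
      }
    }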

          Brock Noland added a comment -

          Hi Hari,

          In my testing this patch does look to improve the performance of copy! Nice work!

However, I was unable to find a scenario where copy took a significant amount of time during replay.
I think we should find a case where copy takes a large amount of time during a file channel replay
before considering the change to the copy code.

However, in testing this patch, I believe I have found the scenario I have most often seen
cause long replays. In addition, this is the scenario I believe played out in FLUME-2118. In the
thread dump attached to that ticket you see the code in FEQ.remove():

          "lifecycleSupervisor-1-0" prio=10 tid=0x00007fea505f7000 nid=0x279e runnable [0x00007fe84240d000]
             java.lang.Thread.State: RUNNABLE
            at org.apache.flume.channel.file.FlumeEventQueue.remove(FlumeEventQueue.java:195)
            - locked <0x00007fe84d0007b8> (a org.apache.flume.channel.file.FlumeEventQueue)
            at org.apache.flume.channel.file.ReplayHandler.processCommit(ReplayHandler.java:404)
            at org.apache.flume.channel.file.ReplayHandler.replayLog(ReplayHandler.java:327)
            at org.apache.flume.channel.file.Log.doReplay(Log.java:503)
            at org.apache.flume.channel.file.Log.replay(Log.java:430)
            at org.apache.flume.channel.file.FileChannel.start(FileChannel.java:301)
          

In the attached patch FLUME-FC-SLOW-REPLAY-1.patch, which is for demonstration purposes, there is a
new file, TestFileChannelReplayPerformance.java. The test in that file demonstrates the issue.

The issue is that when there are many takes in files that don't have associated puts, we search
the entire queue, which is O(N) for each take in the file.

As you noted, a fast-lookup data structure would solve this. However, if it were a memory-based
data structure it would also consume large amounts of memory where it did not previously.
Specifically, your example of 100 million capacity would result in roughly a 1.5 GB data structure
(100 million * 16 bytes - I use 16 bytes because we have to store Long objects, assuming a 64-bit JVM).

I think we need a fast off-heap data structure to perform these lookups. There is a project
called MapDB (formerly JDBM) that I have used in the past which provides such a data structure.

In the attached patch, FLUME-FC-SLOW-REPLAY-FIX-1.patch, I have used it to provide an off-heap Set
which mirrors the FEQ. Without FLUME-FC-SLOW-REPLAY-FIX-1.patch, TestFileChannelReplayPerformance
takes 50 minutes to replay; with the fix it takes only 6.5 minutes.

          Without fix:

          FlumeEventQueue.logTimings Search Count = 669000.0, Search Time = 3044957.0, Copy Count = 321014.0, Copy Time = 1103.0
TestFileChannelReplayPerformance.testReplayPerformance Total Replay Time = 3500624
          

          With fix:

           FlumeEventQueue.logTimings Search Count = 274449.0, Search Time = 1080.0, Copy Count = 274449.0, Copy Time = 1012.0
           TestFileChannelReplayPerformance.testReplayPerformance Total Replay Time = 396338
          

          NOTE: FLUME-FC-SLOW-REPLAY-1.patch and FLUME-FC-SLOW-REPLAY-FIX-1.patch are not for commit.
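A minimal sketch of the mirror-set logic, with hypothetical method names (the actual patch uses a MapDB off-heap Set; java.util.HashSet is shown only to illustrate the idea):

    Set<Long> queueSet = new HashSet<Long>(); // mirrors the FEQ contents

    void replayPut(long pointer) {
      queue.addTail(pointer);
      queueSet.add(pointer);
    }

    boolean replayTake(long pointer) {
      if (!queueSet.contains(pointer)) {
        return false; // take with no matching put: skip the O(N) scan
      }
      queueSet.remove(pointer);
      return queue.remove(pointer); // pointer is known to be present
    }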

          Roshan Naik added a comment -

Actually, it is not clear to me why this patch will help improve perf. My thinking: since the takes always happen from the head, there should not be a need to shift n/2 items during the replay of every take.

          Roshan Naik added a comment -

A couple of questions:

• Does 'use-fast-replay' need to be disabled for this to kick in? Are any other config changes required? (See the config sketch below.)
• Are there any measurements of how much improvement this provides?
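For reference, a file channel configuration excerpt showing the property in question (agent and channel names are made up):

    agent.channels = c1
    agent.channels.c1.type = file
    agent.channels.c1.checkpointDir = /var/flume/checkpoint
    agent.channels.c1.dataDirs = /var/flume/data
    # fast replay (replay without the checkpoint) is off by default
    agent.channels.c1.use-fast-replay = false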
          Hari Shreedharan added a comment -

          Yes, it is.

          Roshan Naik added a comment -

Curious - is this patch complete?

          Hari Shreedharan added a comment -

Patch that addresses this. Removed the old replay; only compact mode is now available (it is no longer a configurable option, since keeping two code paths would make this riskier). All current tests pass with this patch.

          Hari Shreedharan added a comment -

Looks like my last patch was incomplete (I had created a branch locally and probably took the diff against something other than trunk, sorry!). I will add more tests and update the patch.

          Hari Shreedharan added a comment -

Initial working patch. There are several fixes I want to implement (like integrating this into fast replay and ensuring it does not run when it is a full replay).

I also need to add several tests to make sure things work as intended. With the current patch, all current tests pass, and the batch puts and removes are what get invoked, since that is the default config (verified by running it through a debugger).

          Hari Shreedharan added a comment -

Interestingly, I think we may be able to get rid of CheckpointRebuilder etc. In the case of a full replay, we can optimize by not going to the queue to do removes at all, thus not requiring a compaction: we simply remove from the putBuffer, and if we don't find it there, buffer it (such takes are buffered in normal replay too, as pendingTakes). This would perform at least as well as fast replay but would require less memory, because we are reading data in the order it was written. It also requires less memory than normal replay (surprisingly!), because the puts are buffered inside the queue anyway and takes not in the queue are buffered as pendingTakes. In the normal case it requires more memory than normal replay (since we buffer the takes), but this is likely to be acceptable.

Brock Noland - does that make sense? Remove CheckpointRebuilder and make this the fast replay. I think this can also be made the default, because the memory usage is not substantially higher.
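A minimal sketch of that single-scan replay, with hypothetical names (putBuffer and pendingTakes as plain sets; the real structures may differ):

    Set<Long> putBuffer = new HashSet<Long>();    // puts seen so far
    Set<Long> pendingTakes = new HashSet<Long>(); // takes with no put yet

    void replayPut(long pointer) {
      putBuffer.add(pointer);
    }

    void replayTake(long pointer) {
      if (!putBuffer.remove(pointer)) {
        pendingTakes.add(pointer); // its put may be in a deleted file
      }
    }
    // After the single scan, putBuffer holds the surviving events -
    // no queue search and no compaction is needed.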

          Hari Shreedharan added a comment -

          Updated doc, with remove even when singleScanReplay is not set.

          Hari Shreedharan added a comment -

Document updated to terminate the linear scan once all takes have been marked as removed.

Hari Shreedharan added a comment - edited

          Relevant numbers from each run:
          10000-20000:

          Time in searches: 7489
          Time in moves: 14021
          Total # of moves: 99999999
          

          100000-110000:

          Time in searches: 68949
          Time in moves: 134158
          Total # of moves: 1000089999
          

          300000-310000:

          Time in searches: 206698
          Time in moves: 433623
          Total # of moves: 3000289999
          

          700000-710000:

          Time in searches: 461028
          Time in moves: 425380
          Total # of moves: 2950295000
          

As you can see, searches and moves are both expensive (moves are really still traversals doing some extra work). The reason the searches are more expensive than the moves in the 700K-710K run is that searches traverse more of the queue each time, while the moves are smarter because we just pull events up from the bottom. The files I attached have details of how much time each search and move takes.

The new algorithm reduces the traversals to two (one to mark the events and one to compact) and the number of moves to at most the total number of events in the channel (in practice fewer - really the highest index of a removed element).

          Hari Shreedharan added a comment -

Each of these queues had 1 million FlumeEventPointers. Each file name shows the lower and upper limits of the removes called; for example, 700000-710000 is the log for remove(700000) through remove(710000).

The patch shows the code change in the test and the logging added to FEQ to produce these logs.

          Hari Shreedharan added a comment -

I did some testing earlier today to total up the time required for scans and moves. I will post it soon.

          Hari Shreedharan added a comment -

By the way, the number of scans is also reduced to one (by buffering the takes right up to the end and then marking them in one shot).

          Hari Shreedharan added a comment -

Ah, ok. Yes, I think we should be able to do that. The only issue is that removes from the top of the queue are inexpensive; it is only when we remove from somewhere in the middle that we hit an issue. Let me create a unit test which does this.

          Brock Noland added a comment -

          Hari,

Thanks for the numbers! I agree that the new algorithm could significantly reduce copy time. I am sorry - I am afraid I was not clear enough earlier. The document makes the assumption that by improving the copy cost we'll significantly improve replay times. What I'd like to see is empirical evidence of that. I think this could be achieved by placing timers around the remove method and separating out search time from copy time.

          Does this make sense?

          Hari Shreedharan added a comment -

Here are some numbers for just the current remove algorithm:
Assume a queue with 1 million elements, where elements 400,000 to 450,000 are removed.
Each remove causes every element above it to get moved down. So removing element #400,000 causes 399,999 moves, removing 400,001 causes another 399,999 (since there are only 399,999 elements before it now), removing 400,002 causes another 399,999, and so on.
Total number of copy operations = 399,999 * 50,000 = 19,999,950,000 - this will probably take forever to complete.

With the new algorithm, all 50,000 elements are simply marked as removed. We start from the 1 millionth element and move up, finally hitting a REMOVED_MARKER at 450,000 (the first point where writePos stops advancing with readPos) - so the first copy is from 399,999 to 450,000, and the total number of moves is 399,999. I noticed that there is a bug in the compact pseudo-code; I will fix and update.
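A minimal sketch of that mark-and-compact pass, with hypothetical names (indices follow the example above, with the head of the queue at index 0):

    static final long REMOVED_MARKER = -1L; // assumed sentinel value

    // One bottom-up traversal: each surviving pointer is copied down
    // over the marked slots, leaving the free space at the head.
    void compact(long[] queue, int size) {
      int writePos = size - 1;
      for (int readPos = size - 1; readPos >= 0; readPos--) {
        if (queue[readPos] != REMOVED_MARKER) {
          if (writePos != readPos) {
            queue[writePos] = queue[readPos]; // a real move
          }
          writePos--;
        }
      }
      // slots 0..writePos are now free; the number of real moves is
      // bounded by the count of live elements above the deepest
      // removed slot (399,999 in the example above)
    }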

          Hari Shreedharan added a comment -

Also, I meant non-trivial: depending on the number of takes, it can end up holding several million elements, which can be several tens to hundreds of MB in the worst case.

          Hari Shreedharan added a comment -

          Sure. I will add some worst-case complexity numbers.

          Brock Noland added a comment -

          Hari,

Thank you for taking this on! Before we hack on this, can we put a few counters in to show the cost of the linear search versus the move?

          "it is likely to be non-trivial"

          I assume you mean trivial?

          Hari Shreedharan added a comment -

          Minor clarification of the new algorithms

          Hari Shreedharan added a comment -

Updated version of the design document, now with pseudo-code for the algorithms.

          Hari Shreedharan added a comment -

In fact, since all of the takes would be at the top of the queue, it might be smarter to move events down rather than up. Another idea would be to start from the middle of the queue and pull the first half of the events down and the second half up (this would complicate things, and I am not entirely sure it would yield a significant advantage over moving things down - if there are takes in the second half of the queue, the queue is likely small enough that the first approach would not cause any real latency).

          Hari Shreedharan added a comment -

          Initial design doc for compaction in replay


People

• Assignee: Brock Noland
• Reporter: Hari Shreedharan
• Votes: 0
• Watchers: 7
