[ACCUMULO-4066] Conditional mutation processing performance could be improved. - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 1.6.4, 1.7.0
Fix Version/s: 1.6.5, 1.7.1, 1.8.0
Component/s: tserver
Labels:
None

Description

When processing conditional mutations tablets reads are done. The way the current implementation does tablet reads has a lot of overhead. For each condition the following is done :

Opens and reserves iterators files.
Parse table iterators from table config (involves scanning and filtering entire table config)
Merges condition iterators and table iterators
Constructs iterator stack.

I created a branch where these operations (except for constructing iterator stack) are done per tablet and/or per batch of conditional mutations. Doing this I am seeing a 3x speed up in conditional mutation processing rates when data is cached.

Attachments

Issue Links

is blocked by

ACCUMULO-4098 ConditionalWriterIT is failing

Resolved

links to

conditional mutation performance test tool

Work in progress branch on Github

Activity

People

Assignee:: Keith Turner

Reporter:: Keith Turner

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 20/Nov/15 23:12

Updated:: 09/Feb/16 21:22

Resolved:: 09/Feb/16 21:22

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

0.5h