[ACCUMULO-1950] Reduce the number of calls to hsync - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.7.0
Component/s: None
Labels:
None

Description

As mutations written to a tablet server its buffered and once this buffer exceeds a certain size the data is dumped to the walog and then inserted into an in memory sorted map. These walog buffers are per a client and the max size is determined by tserver.mutation.queue.max.

Accumulo 1.5 and 1.6 call hsync() in hadoop 2 which ensures data is flushed to disk. This introduces a fixed delay when flushing walog buffers. The smaller tserver.mutation.queue.max is, the more frequently the walog buffers are flushed. With many clients writing to a tserver, this is not much of a concern because all of their walog buffers are flushed using group commit. This results in high throughput because large batches of data being written before hsync is called. However if a few client writing to a tserver there will be a lot more calls to hsync. It would be nice the # of calls to hsync was a function of the amount of data written regardless of the number of concurrent clients. Currently as the number of concurrent clients goes down, the number of calls to hsync goes up.

In 1.6 and 1.5 this can be mitigated by increasing tserver.mutation.queue.max, however this is multiplied by the number of concurrent writers. So increasing it can improve performance of a single writer but increases the chances of many concurrent writers exhausting memory.

Attachments

Issue Links

is related to

ACCUMULO-1905 WALog is slow

Resolved

Activity

People

Assignee:: Eric C. Newton

Reporter:: Keith Turner

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 02/Dec/13 20:24

Updated:: 14/Sep/14 22:20

Resolved:: 09/Sep/14 18:19

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

10m