[ACCUMULO-3710] Scanning with many singleton ranges crashes tserver - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Abandoned
Affects Version/s: 1.6.1
Fix Version/s: None
Component/s: client, tserver
Labels:
None

Description

Setup: single-node standalone 1.6.1 Accumulo instance.
Use case: scan ~1M individual rows, scattered across a ~15GB table.
The following steps crash the TabletServer:

1. Gather a List of Range objects, each one a singleton range spanning an entire row.
2. Create a BatchScanner with one read thread.
3. Set the ranges via BatchScanner.setRanges()
4. Start iterating through the scanner.

One solution is to batch the reads into groups of ~10k ranges idea.

Comment from Josh Elser:

Taking a quick glance at the code, it looks like this would be a good place to do some optimization in the BatchScanner's impl (TabletServerBatchReaderImpl). The BatchScanner will bin the ranges to the tablets and the servers hosting those tablets. Normally, this would be spread out, but, in your single server case, all 1M rows would all go to a single TabletServer in one RPC call.

I'm guessing a good optimization here would be to check the size of a batch of Ranges for a single tabletserver, and when above a certain threshold, split the batch in half and try to reprocess each half (the recursion would naturally keep splitting until we get down to some high-watermark).

Point being, if your client VM constructed the Ranges without issue, the BatchScanner impl should be smart enough to not knock over a TabletServer.

Verified to cause an OOME via tserver_localhost.out:

#

java.lang.OutOfMemoryError: Java heap space

-XX:OnOutOfMemoryError="kill -9 %p"

Executing /bin/sh -c "kill -9 12833"...

Attachments

Issue Links

is duplicated by

ACCUMULO-3813 single row look-ups are slow

Resolved

relates to

ACCUMULO-3602 BatchScanner optimization for AccumuloInputFormat

Resolved

Activity

People

Assignee:: Unassigned

Reporter:: Shana Hutchison

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 03/Apr/15 05:14

Updated:: 02/Nov/22 19:16

Resolved:: 02/Nov/22 19:16