[ACCUMULO-4778] Resolving table name to table id is expensive - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 1.7.3, 1.8.1
Fix Version/s: 1.9.0
Component/s: None
Labels:
- pull-request-available

Description

I was running a Fluo test application and profiling the tablet server and Fluo worker. The Fluo worker does lots small scans against Accumulo. Resolving table names to ids (which is done for each scan) was expensive enough to make a significant showing in the profiling data.

I looked that the 1.8 code and it does the following to resolve a table name :

reads over all cached table ids in zookeeper putting them in a treemap
does a lookup in the treemap

Ideally the client code would keep a cache of name to id mappings and invalidate them when something changes in zookeeper. The data in zookeeper is stored by id, so it does need to be inverted to lookup by name.

Attachments

Issue Links

relates to

ACCUMULO-1833 MultiTableBatchWriterImpl.getBatchWriter() is not performant for multiple threads

Resolved

links to

GitHub Pull Request #364

Activity

People

Assignee:: Michael Miller

Reporter:: Keith Turner

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 11/Jan/18 19:39

Updated:: 23/Apr/19 16:13

Resolved:: 31/Jan/18 17:23

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

4h 50m