[TEPHRA-299] Executing a large batch delete is very slow - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.15.0-incubating
Fix Version/s: 0.16.0
Component/s: None
Labels: None

Description

I noticed that batch deletes are quite slow. The profiler shows that almost all of the time is spent in org.apache.hadoop.hbase.regionserver.wal.FSHLog.blockOnSync().

Looking at TransactionProcessor.preDelete it is obvious why:

The batch delete is translated into individual puts that are added to the region one by one, so the WAL is flushed once for every put instead of once for the whole batch.
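
To illustrate why one WAL sync per put dominates the runtime, here is a minimal, self-contained sketch. It is a toy model, not HBase or Tephra code: `ToyWal`, `syncsOneByOne`, and `syncsBatched` are hypothetical names, and `sync()` only counts calls where a real WAL would block on durable storage. It makes no claim about how this issue was actually fixed.

```java
import java.util.ArrayList;
import java.util.List;

public class WalSyncSketch {

    /** Toy write-ahead log that only counts syncs. Hypothetical, not HBase's FSHLog. */
    static final class ToyWal {
        private final List<String> entries = new ArrayList<>();
        private int syncCount = 0;

        void append(String entry) {
            entries.add(entry);
        }

        /** Stands in for a blocking sync to durable storage. */
        void sync() {
            syncCount++;
        }

        int syncCount() {
            return syncCount;
        }
    }

    /** Models the slow path: every delete becomes its own put, followed by its own sync. */
    static int syncsOneByOne(List<String> rows) {
        ToyWal wal = new ToyWal();
        for (String row : rows) {
            wal.append("delete-marker:" + row);
            wal.sync();
        }
        return wal.syncCount();
    }

    /** Models a batched path: append all puts first, then sync once. */
    static int syncsBatched(List<String> rows) {
        ToyWal wal = new ToyWal();
        if (rows.isEmpty()) {
            return 0; // nothing appended, nothing to sync
        }
        for (String row : rows) {
            wal.append("delete-marker:" + row);
        }
        wal.sync();
        return wal.syncCount();
    }

    public static void main(String[] args) {
        List<String> rows = new ArrayList<>();
        for (int i = 0; i < 1000; i++) {
            rows.add("row-" + i);
        }
        System.out.println("one by one: " + syncsOneByOne(rows) + " syncs");
        System.out.println("batched:    " + syncsBatched(rows) + " syncs");
    }
}
```

In this model a 1000-row delete costs 1000 syncs on the one-by-one path and a single sync on the batched path, which matches the profile being dominated by blockOnSync().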

Attachments

299-DOES-NOT_WORK.txt (14/Apr/19 00:09, 6 kB, Lars Hofhansl)
299-client.txt (14/Apr/19 01:23, 9 kB, Lars Hofhansl)
299-client-v2.txt (14/Apr/19 04:05, 9 kB, Lars Hofhansl)
299-client-v3.txt (14/Apr/19 07:36, 6 kB, Lars Hofhansl)
299-complete.txt (24/May/19 20:39, 17 kB, Lars Hofhansl)

Issue Links

relates to: HBASE-22235 OperationStatus.{SUCCESS|FAILURE|NOT_RUN} are not visible to 3rd party coprocessors (Resolved)

People

Assignee: Lars Hofhansl
Reporter: Lars Hofhansl
Votes: 0
Watchers: 5

Dates

Created: 13/Apr/19 19:13
Updated: 04/Dec/20 04:46
Resolved: 09/Jun/19 23:28