Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
None
-
None
-
None
-
None
Description
The batch scanner currently does the following.
- bin ranges to tablets servers and tabelts
- if any ranges could not be binned (e.g. a tablet had no location) goto 1
- queue up work for tablet servers on a thread pool
- wait for thread pool to complete all work
- if there were any failures goto 1
In the face of failures (tablets not assigned because of migration, tablet servers dying) it would be better if the batch scanner worked on what it could and immediately requeued failures for processing immediately. The ConditionalWriter and BatchWriter have failure queues and increase the retry time if something keeps failing.