Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Implemented
-
1.0.0
-
None
-
None
Description
Scanning medium sized rows with multiple concurrent scanners exhibits interesting 'ceiling' properties. A server runs at about 6.7k ops a second using 450% of possible 1600% of CPUs when 4 clients each with 10 threads doing scan 1000 rows. If I add '--filterAll' argument (do not return results), then we run at 1450% of possible 1600% possible but we do 8k ops a second.
Let me attach flame graphs for two cases. Unfortunately, there is some frustrating dark art going on. Let me try figure it... Filing issue in meantime to keep score in.
Attachments
Attachments
Issue Links
- relates to
-
HBASE-5979 Non-pread DFSInputStreams should be associated with scanners, not HFile.Readers
- Closed
-
HBASE-9102 HFile block pre-loading for large sequential scan
- Closed