Details
Description
A few of us were brainstorming this morning about what the default region size should be. There were a few general points made:
- in some ways it's better to be too-large than too-small, since you can always split a table further, but you can't merge regions currently
- with HFile v2 and multithreaded compactions there are fewer reasons to avoid very-large regions (10GB+)
- for small tables you may want a small region size just so you can distribute load better across a cluster
- for big tables, multi-GB is probably best
Attachments
Attachments
Issue Links
- is related to
-
HBASE-5386 [usability] Soft limit for eager region splitting of young tables
- Closed