Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
From the discussion from HBASE-16894 there is a set of use cases where writing to multiple regions in a single reducer can be helpful to reduce the overhead of MR jobs when a large number of regions exist in an HBase cluster and some regions can present a data skew, e.g. 100s or 1000s of regions with a very small number of rows vs. regions with 10s or millions or rows as part of the same job. And merging regions is not an option for the use case.
Attachments
Issue Links
- is duplicated by
-
HBASE-19226 Limit the reduce tasks number of incremental load
- Patch Available
- relates to
-
HBASE-16894 Create more than 1 split per region, generalize HBASE-12590
- Resolved