[HBASE-14339] HBase Bulk Load and super wide rows - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Minor
Resolution: Duplicate
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
None

Description

This may not be a huge issues but it does come up. If the number of columns in a row are to many then KeyValueSortReducer will blow up with a out of memory exception, because it uses a TreeMap to sort the columns with in the memory of the reducer.

A solution would be to add the column family and qualifier to the key so the shuffle would handle the sort.

The partitioner would only partition on the rowKey but ordering would apply to the RowKey, Column Family, and Column Qualifier.

Look at the Spark Bulk load as an example. ~~HBASE-14150~~

Attachments

Issue Links

duplicates

HBASE-7743 Replace *SortReducers with Hadoop Secondary Sort

Closed

Activity

People

Assignee:: Unassigned

Reporter:: Theodore michael Malaska

Votes:: 0 Vote for this issue

Watchers:: 7 Start watching this issue

Dates

Created:: 30/Aug/15 21:51

Updated:: 16/Jun/22 01:24

Resolved:: 05/Sep/15 23:04