HBASE-68

[hbase] HStoreFiles needlessly store the column family name in every entry

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Not a Problem
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: regionserver
    • Labels: None

      Description

      Today, HStoreFiles keep the entire serialized HStoreKey around for every cell in the HStore. Since HStores are 1-1 with column families, this is really unnecessary - you can always infer the column family from the HStore the file belongs to (this information would ostensibly come from the file name or a header section). This means we could drop the column family portion of the HStoreKeys we write into the HStoreFile, reducing the size of the stored data. This would save space by removing redundant data, and could also improve speed, since there would be less data to scan over in memory and less data to transfer over the network.
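
      To make the redundancy concrete, here is a minimal sketch; the class, field names, and key values are illustrative only, not the actual HStoreKey serialization format:

      // Illustrative only - every cell in an HStore's files carries a full key,
      // so the family prefix ("info" below) is repeated once per cell even though
      // the whole file belongs to that single family.
      class CellKeySketch {
          byte[] row;        // e.g. "row-0001"
          byte[] column;     // e.g. "info:name", "info:email", "info:age"
          long   timestamp;  // cell version
      }
      // Keys as stored today:          After dropping the family:
      //   row-0001/info:name/ts          row-0001/name/ts
      //   row-0001/info:email/ts         row-0001/email/ts
      //   row-0001/info:age/ts           row-0001/age/ts
      // The "info:" prefix could be recovered from the HStore itself
      // (file name or a header section), as the description suggests.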


          Activity

          Jim Kellerman added a comment -

          Remember that an HStoreKey contains both the family name and the member name. You could have entries for 'contents:' (just the family name), 'contents:member1', 'contents:member2', etc., and they all get stored in the same HStoreFile.

          So unless you want to create a new object type to be the key, and then add the necessary logic to transform to/from HStoreKeys, I'd say that trading off a little space for time is a benefit, not a fault.

          -1 on this proposal.

          Bryan Duxbury added a comment -

          Jim, I am aware that multiple qualified cells per row show up in the same HStoreFile. I'm just suggesting that the part that comes before the qualified name is unnecessary.

          I understand that changing this would necessitate adding a new key type and transform logic, but I'm not convinced that the translation would actually cost that much more time. You have to recognize that even though the data is precomputed, it is probably coming off of a disk on another computer through the network in 64MB blocks. I have to think that the added transmission time of all the redundant data in aggregate is at least as much as the added time it would take to do translation, and possibly more.
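
          As a back-of-envelope illustration of that aggregate cost (the cell count and family name below are made-up numbers, not measurements):

          // Hypothetical numbers, purely to frame the trade-off.
          public class FamilyOverheadEstimate {
              public static void main(String[] args) {
                  long cells = 100_000_000L;               // cells in a store, hypothetical
                  int familyBytes = "contents:".length();  // 9 bytes repeated per cell
                  long overhead = cells * familyBytes;
                  System.out.printf("redundant family bytes: ~%d MB%n",
                      overhead / (1024 * 1024));           // ~858 MB of repeated prefix
              }
          }
          // The question is whether shipping and scanning that redundant data
          // costs more than re-attaching the family during key translation.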

          Jim Kellerman added a comment -

          Point taken. However since we have done little to no performance analysis to date, I would say that this would be a premature optimization. Let's see where the hot spots are first and address them.

          stack added a comment -

          Jim: I've been profiling while you've been on holidays. Looks like most of the low-hanging fruit has been picked: e.g. RPC serializations and gratuitous object creations in hbase. Apart from updates to memcache – SortedMaps are 'expensive' – the bulk of our time/resources are now in appending and nexting over MapFiles/SequenceFiles whether updating, reading, compacting or flushing (The latter two take up the bulk of CPU during writes at least). Anything we can do to improve our i/o story here will make for a win.

          As to dropping the family name when we go to the fs, I like the idea, especially as it's making keys (slightly) smaller... but yeah, let's measure first to see if these seemingly small savings even show up on the size/speed register.

          Jim Kellerman added a comment -

          Fixed with implementation of HFile

          stack added a comment -

          Should we close this? We still store family with every entry.

          Jim Kellerman added a comment -

          Oops! I thought we had changed that. Sorry.

          Jonathan Gray added a comment -

          Though not fixed/solved, I think we should close this issue as invalid.

          KVs must always contain their families because they are self-contained. Moving forward if we ever do locality groups, we'll definitely need them.

          By making them self-contained, we never have to rewrite/reallocate the data; i.e., our zero-copy reads pass along KV references to the actual HFile block we read in. Our Result is nothing but a List<KV> and we do not care whether they are all the same family or multiple families or whatever. If our KV no longer stores the family, we will have to undo the new optimizations (of not building a big Map as we build the Result) and start to track everything per family as we build the Result.

          All other issues outlined above like gratuitous object creations are also gone. This optimization would only undo them.

          +1 here for closing issue
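
          A minimal sketch of the "view" idea Jonathan describes above; the class is hypothetical, not the real KeyValue API:

          // A KV "view" is just an offset/length into the block buffer read from
          // the HFile, so no per-cell copy or reallocation is needed.
          class KeyValueView {
              final byte[] block;  // the HFile block as read in
              final int offset;    // where this cell's bytes start
              final int length;    // how many bytes belong to this cell

              KeyValueView(byte[] block, int offset, int length) {
                  this.block = block;
                  this.offset = offset;
                  this.length = length;
              }
              // Because the family bytes already sit inside `block`, the view is
              // self-contained; stripping the family from the on-disk format would
              // force a rewrite (a copy) to put it back before returning results.
          }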

          Bryan Duxbury added a comment -

          The idea of locality groups seems speculative, and clearly if we did that then this issue would be invalid from the get-go. However, I don't see why KVs couldn't be reconstituted partly from the store file and partly from the store file metadata when they are created, rather than writing that data to HDFS. Those values are actually constants, too, so each KV could just keep a reference to the constant object to use when writing in response to client requests.

          I think it would at least be interesting to measure the potential impact of this change. For people with lots of cells, lots of versions, or both, I could see this saving a substantial amount of disk and memory space.
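
          A rough sketch of the reconstitution idea above, with hypothetical class names: the family is read once per store file and shared by reference rather than stored per cell.

          class StoreFileMeta {
              final byte[] family;  // read once, e.g. from the file name or header
              StoreFileMeta(byte[] family) { this.family = family; }
          }

          class ReconstitutedCell {
              final byte[] rowQualifierTimestamp;  // the per-cell bytes actually on disk
              final StoreFileMeta meta;            // shared constant, one per store file

              ReconstitutedCell(byte[] cellBytes, StoreFileMeta meta) {
                  this.rowQualifierTimestamp = cellBytes;
                  this.meta = meta;                // reference only, no per-cell copy
              }

              byte[] family() { return meta.family; }  // re-attached on demand
          }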

          Jonathan Gray added a comment -

          Locality groups are something we should do. But I agree we can treat them separately from this issue, as we are not doing that now.

          KVs cannot be reconstituted without making every read copy all the data twice. We have to read blocks in from HDFS. At that point, we can just pass the entire buffer along and make KV "views" against the big block. Or, we could rewrite the block again, reconstituting larger KVs. That could be done when reading in, or when building results. In either case, we are no longer doing zero-copy reads.

          I don't see any way to do this without going backwards towards how things used to work... all the massive improvements we see are because of this consistent, explicit, and immutable KV.

          Jonathan Gray added a comment -

          Debates ensued on IRC. Agreed to punt for now.

          One new idea discussed that we might explore is using codes instead of storing the entire string. Client could rebuild by looking at HTD (which would contain the mapping from code -> family name), or we could send along a little header at the beginning of a Result.
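
          A minimal sketch of what such a code -> family mapping might look like (hypothetical class, not an existing API); as ryan notes in the next comment, the codes would have to stay stable for the life of the table:

          import java.util.HashMap;
          import java.util.Map;

          class FamilyCodec {
              private final Map<Byte, String> codeToFamily = new HashMap<>();
              private final Map<String, Byte> familyToCode = new HashMap<>();

              void register(byte code, String family) {
                  codeToFamily.put(code, family);
                  familyToCode.put(family, code);
              }

              byte encode(String family) { return familyToCode.get(family); }  // stored per cell
              String decode(byte code)   { return codeToFamily.get(code); }    // rebuilt via the HTD mapping
          }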

          ryan rawson added a comment -

          I give this issue a big -1 (or -2 or whatever).

          Right now we are 'needlessly' storing the column family... but in 0.21 I hope to be able to introduce locality groups, which will require us to keep the column family.

          Another thing is that we don't have to expand/patch up the reply during regionserver processing of scan/get, which helps quite a bit. Even a code -> string translation would cost us, and a code-based solution would make things more brittle, as we couldn't change or reorder the codes without invalidating an entire table.

          With block compression and LZO, we get amazing compression... I have seen 2-4x with production data. This helps mitigate the on-disk storage cost of duplicating the column family.

          Jim Kellerman added a comment -

          +1 on closing this as "won't fix"

          stack added a comment -

          Moved from 0.21 to 0.22 just after merge of old 0.20 branch into TRUNK.

          stack added a comment -

          Moving out of 0.92. Move it back in if you think differently.

          Andrew Purtell added a comment -

          Use block encoding
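
          For reference, data block encodings avoid writing repeated key components (row, family, qualifier prefixes) verbatim within each block. A minimal sketch of enabling one on a column family via the Java client; the table and family names here are placeholders:

          import org.apache.hadoop.hbase.HColumnDescriptor;
          import org.apache.hadoop.hbase.HTableDescriptor;
          import org.apache.hadoop.hbase.TableName;
          import org.apache.hadoop.hbase.io.encoding.DataBlockEncoding;

          public class EnableBlockEncoding {
              public static HTableDescriptor describe() {
                  HTableDescriptor table = new HTableDescriptor(TableName.valueOf("my_table"));
                  HColumnDescriptor family = new HColumnDescriptor("info");
                  family.setDataBlockEncoding(DataBlockEncoding.FAST_DIFF);  // elides repeated key parts
                  table.addFamily(family);
                  return table;
              }
          }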


            People

            • Assignee: Unassigned
            • Reporter: Bryan Duxbury
            • Votes: 0
            • Watchers: 2
