Details
-
New Feature
-
Status: Open
-
Major
-
Resolution: Unresolved
-
0.6.0
-
None
-
None
-
MacOSX, Java
Description
Current Chukwa HBase schema looks like this:
<timestamp>-<primaryKey> <columnFamily>:<cell>...
Monotonic increasing timestamp can not evenly distribute across region servers without special handle and care periodically.
It is time to revise the schema, and proposed schema looks like this:
<hhddmmyyyy>-<primaryId> cf:<cell>...
Timestamp is stored with cell, row key helps to split data by hour, and a full hour of metrics is stored on the same row. PrimaryKey is replaced with hash id of the primary key. Metrics tables to aggregate metrics:
chukwaMetrics -> chukwaMetricsMonthly -> chukwaMetricsYearly
Attachments
1.
|
Support custom hbase schema | Open | Unassigned |