[HBASE-8693] DataType: provide extensible type API - ASF JIRA

Details

Type: Sub-task
Status: Closed
Priority: Blocker
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.98.0, 0.95.2
Component/s: Client
Labels:
None

Hadoop Flags:

Reviewed
Release Note:

Hide
This patch introduces an extensible data types API for HBase. It is inspired by the following systems:

- PostgreSQL. Postgres has a user-extensible data type API, which has been used to great effect by it's user community (ie, PostGIS). The desire is for HBase to expose an equally extensible data type API. One aspect of the Postgres data type is the ability to provide equivalence functions for index operations. This appears to be of critical performance utility for its execution engine.
- Orderly. Orderly handles the issue of compound rowkeys by providing convenience classes for handling these kinds of data types. This influence is reflected in the Struct and Union family of classes.
- Phoenix. The PDataType enum used in Phoenix provides type hints, similar Postgres's equivalence functions. These appear to be used during query execution for numerical type promotion.

This patch introduces an interface, DataType, along with a number of data type implementations based on the Bytes encoding. Also included are Struct and Union types, demonstrating simple implementations of compound types. Helper classes around the Struct implementation are also provided.

This patch does not address the type compatibility concerns expressed by Phoenix's PDataType API (ie, isComparableTo, isCoercibleTo); these will be addressed in ~~HBASE-8863~~.

This patch also provides DataType implementations based on the OrderedBytes encoding from ~~HBASE-8201~~.

Show
This patch introduces an extensible data types API for HBase. It is inspired by the following systems: - PostgreSQL. Postgres has a user-extensible data type API, which has been used to great effect by it's user community (ie, PostGIS). The desire is for HBase to expose an equally extensible data type API. One aspect of the Postgres data type is the ability to provide equivalence functions for index operations. This appears to be of critical performance utility for its execution engine. - Orderly. Orderly handles the issue of compound rowkeys by providing convenience classes for handling these kinds of data types. This influence is reflected in the Struct and Union family of classes. - Phoenix. The PDataType enum used in Phoenix provides type hints, similar Postgres's equivalence functions. These appear to be used during query execution for numerical type promotion. This patch introduces an interface, DataType, along with a number of data type implementations based on the Bytes encoding. Also included are Struct and Union types, demonstrating simple implementations of compound types. Helper classes around the Struct implementation are also provided. This patch does not address the type compatibility concerns expressed by Phoenix's PDataType API (ie, isComparableTo, isCoercibleTo); these will be addressed in HBASE-8863 . This patch also provides DataType implementations based on the OrderedBytes encoding from HBASE-8201 .
Tags:
0.96notable

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

0001-HBASE-8693-Extensible-data-types-API.patch
07/Aug/13 23:41
136 kB
Nick Dimiduk
0001-HBASE-8693-Extensible-data-types-API.patch
07/Aug/13 22:45
136 kB
Nick Dimiduk
0001-HBASE-8693-Extensible-data-types-API.patch
05/Aug/13 01:47
131 kB
Nick Dimiduk
0001-HBASE-8693-Extensible-data-types-API.patch
02/Aug/13 21:40
127 kB
Nick Dimiduk
0001-HBASE-8693-Extensible-data-types-API.patch
02/Aug/13 20:43
128 kB
Nicolas Liochon
0001-HBASE-8693-Extensible-data-types-API.patch
02/Aug/13 01:21
128 kB
Nick Dimiduk
0001-HBASE-8693-Extensible-data-types-API.patch
31/Jul/13 03:40
100 kB
Nick Dimiduk
0001-HBASE-8693-Extensible-data-types-API.patch
23/Jul/13 22:57
137 kB
Nick Dimiduk
0001-HBASE-8693-Extensible-data-types-API.patch
23/Jul/13 00:33
130 kB
Nick Dimiduk
0001-HBASE-8693-Extensible-data-types-API.patch
16/Jul/13 23:23
127 kB
Nick Dimiduk
0001-HBASE-8693-Extensible-data-types-API.patch
11/Jul/13 18:29
93 kB
Nick Dimiduk
0001-HBASE-8693-Extensible-data-types-API.patch
24/Jun/13 22:08
50 kB
Nick Dimiduk
0002-HBASE-8693-example-Use-DataType-API-to-build-regionN.patch
16/Jul/13 23:37
9 kB
Nick Dimiduk
KijiFormattedEntityId.java
18/Jul/13 22:48
2 kB
Nick Dimiduk

Issue Links

depends upon

HBASE-8201 OrderedBytes: an ordered encoding strategy

Closed

is depended upon by

HBASE-8593 Type support in ImportTSV tool

Closed

is related to

HIVE-6150 Take advantage of Native HBase Compound keys

Open

DataType: provide extensible type API

Details

Attachments

Attachments

Issue Links

Activity

People

Dates