[HBASE-18067] Support a default converter for data read shell commands - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.0.0
Component/s: shell
Labels:
None

Hadoop Flags:

Reviewed

Description

The get and scan shell commands have the ability to specify some complicated syntax on how to encode the bytes read from HBase on a per-column basis. By default, bytes falling outside of a limited range of ASCII are just printed as hex.

It seems like the intent of these converts was to support conversion of certain numeric columns as a readable string (e.g. 1234).

However, if non-ascii encoded bytes are stored in the table (e.g. UTF-8 encoded bytes), we may want to treat all data we read as UTF-8 instead (e.g. if row+column+value are in Chinese). It would be onerous to require users to enumerate every column they're reading to parse as UTF-8 instead of the limited ascii range. We can provide an option to encode all values retrieved by the command.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HBASE-18067.001.patch
18/May/17 17:25
16 kB
Josh Elser
HBASE-18067.002.patch
19/May/17 16:49
16 kB
Josh Elser
HBASE-18067.003.patch
19/May/17 17:25
16 kB
Josh Elser

Issue Links

causes

HBASE-21178 [BC break] : Get and Scan operation with a custom converter_class not working

Resolved

Activity

People

Assignee:: Josh Elser

Reporter:: Josh Elser

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 17/May/17 21:42

Updated:: 18/Oct/18 15:02

Resolved:: 22/May/17 02:32