[HBASE-1991] Architectural overview of HBase internals with description of conceptual gulf between HBase and HDFS - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Duplicate
Affects Version/s: 0.20.0
Fix Version/s: None
Component/s: documentation
Labels:
None

Release Note:
Implemented in a recent book.xml fix (FAQ section)

Description

One of the conceptual gulfs that needs addressing in HBase documentation is that if people are looking at the Hadoop website, they will read about HDFS that it is for (paraphrasing) "high throughput but does not promise low latency and is not suited for random reads."

HBase runs on top of HDFS, and it promises both low-latency and random reads.

How?

I'm not disputing that HBase does it... but not much is written down anywhere other than references to "caching."

Lars George put together a great page on some of the HBase file structures as they are stored in HDFS. Information like that would be useful to have in the HBase documentation, etc.

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Doug Meil

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 18/Nov/09 22:00

Updated:: 11/Jun/22 23:15

Resolved:: 14/Apr/11 20:35