[IMPALA-8102] Impala/HBase recommendations need update - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: Impala 3.2.0
Component/s: Docs
Labels:
None

Target Version:

Impala 3.2.0
Epic Color:
ghx-label-8

Description

https://impala.apache.org/docs/build/html/topics/impala_hbase.html hasn't been updated for a while. The recommendations are a bit out of date - generally HBase is not the best format for analytic workloads yet that page seems to encourage using it.

E.g.

If you have join queries that do aggregation operations on large fact tables and join the results against small dimension tables, consider using Impala for the fact tables and HBase for the dimension tables.

Assigning to myself to figure out what the best practice is, but I think we need to include:

A statement Kudu offers significantly better performance for analytical workloads with mutable data
A statement that HDFS tables are also preferable unless data is frequently mutated
A pointer to the Kudu docs

Attachments

Activity

People

Assignee:: Tim Armstrong

Reporter:: Tim Armstrong

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 23/Jan/19 19:56

Updated:: 08/Feb/19 01:22

Resolved:: 08/Feb/19 00:19