Details
Description
column based storage has been proven a better storage layout for OLAP.
Hive does a great job on raw row oriented storage. In this issue, we will enhance hive to support column based storage.
Acctually we have done some work on column based storage on top of hdfs, i think it will need some review and refactoring to port it to Hive.
Any thoughts?
Attachments
Attachments
Issue Links
- blocks
-
HIVE-461 Optimize RCFile reading by using column pruning results
- Closed
- is blocked by
-
HIVE-337 LazySimpleSerDe should support multi-level nested array, map, struct types
- Closed
-
HIVE-360 Generalize the FileFormat Interface in Hive
- Closed
- relates to
-
AVRO-806 add a column-major file format
- Closed
-
HIVE-2097 Explore mechanisms for better compression with RC Files
- Open
-
HIVE-460 Improve ColumnPruner to prune more aggressively and keep column information for input tables
- Closed
-
HIVE-756 performance improvement for RCFile and ColumnarSerDe in Hive
- Closed