Details
Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Description
We have run into several problems with FileInputFormat/OutputFormat in Hive.
The requirements are:
R1. We want to support HBase: HIVE-806
R2. We want to selectively include files based on file names: HIVE-951
R3. We want to optionally recurse into the directory structure: HIVE-1083
R4. We want to pass the filter condition into the storage layer (very useful for HBase and indexed data formats)
R5. We want to pass the column selection information into the storage layer (already done as part of RCFile, but we can do it better)
We need to organize these requirements and the code in a way that keeps it extensible.
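To make R2 and R3 concrete, here is a minimal sketch of how file-name filtering and optional recursion could be combined behind one small options object. It uses only java.nio from the JDK, not Hadoop or Hive classes. The class name InputSelection and the method listInputFiles are hypothetical, chosen for illustration; they are not existing Hive APIs or a proposed patch.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.regex.Pattern;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical options object; none of these names exist in Hive. */
final class InputSelection {
    private final Pattern namePattern; // R2: null means "accept every file name"
    private final boolean recursive;   // R3: descend into sub-directories

    InputSelection(String nameRegex, boolean recursive) {
        this.namePattern = nameRegex == null ? null : Pattern.compile(nameRegex);
        this.recursive = recursive;
    }

    /** Returns the regular files under root that satisfy the selection, sorted by path. */
    List<Path> listInputFiles(Path root) throws IOException {
        // Depth 1 visits only root and its direct children.
        int depth = recursive ? Integer.MAX_VALUE : 1;
        try (Stream<Path> paths = Files.walk(root, depth)) {
            return paths
                .filter(Files::isRegularFile)
                // The regex must match the whole file name, not the full path.
                .filter(p -> namePattern == null
                    || namePattern.matcher(p.getFileName().toString()).matches())
                .sorted()
                .collect(Collectors.toList());
        }
    }
}
```

For example, new InputSelection(".*\\.log", true).listInputFiles(dir) would return every .log file anywhere below dir, while passing false restricts the listing to dir itself. Keeping both switches in one object is only one possible design; the filter and column information from R4 and R5 could be carried in the same way.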
Issue Links
blocks:
- HIVE-806 Hive with HBase as data store to support MapReduce and direct query (Resolved)
- HIVE-951 Selectively include EXTERNAL TABLE source files via REGEX (Open)
- HIVE-1083 allow sub-directories for an external table/partition (Open)
- HIVE-1222 in metastore, do not store names of inputformat/outputformat/serde for non-native tables (Open)
- HIVE-1226 support filter pushdown against non-native tables (Closed)
is related to:
- HIVE-3660 Improve OutputFormat for Hive (Open)
relates to:
- HIVE-705 Hive HBase Integration (umbrella) (Closed)