Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
We should document all the differences between managed and external tables. I need everyone's help for that though.
This is what I know:
- ARCHIVE/UNARCHIVE - DDLTask - Only works for managed tables
- TRUNCATE - DDLSemanticAnalyzer - Only works for managed tables
- MERGE/CONCATENATE - HiveRelOpMaterializationValidator - Only works for managed tables
- Constraints - DDLSemanticAnalyzer - (NOT NULL, DEFAULT, CHECK, only RELY ist allowed)
- IMPORT - ImportSemanticAnalyzer - This has some wild restrictions I didn't follow for external tables
- Query Results Caching - https://issues.apache.org/jira/browse/HIVE-18513 SemanticAnalyzer
Hortonworks has extra documentation listing these things: - Query cache
- Materialized views, except in a limited way
- Default statistics gathering
- Compute queries using statistics
- Automatic runtime filtering
- File merging after insert
It'd be great if someone (from Hortonworks or otherwise) could elaborate on those.