Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-20287

Document the differences between managed and external tables

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • None
    • Documentation
    • None

    Description

      We should document all the differences between managed and external tables. I need everyone's help for that though.

      This is what I know:

      • ARCHIVE/UNARCHIVE - DDLTask - Only works for managed tables
      • TRUNCATE - DDLSemanticAnalyzer - Only works for managed tables
      • MERGE/CONCATENATE - HiveRelOpMaterializationValidator - Only works for managed tables
      • Constraints - DDLSemanticAnalyzer -  (NOT NULL, DEFAULT, CHECK, only RELY ist allowed)
      • IMPORT - ImportSemanticAnalyzer - This has some wild restrictions I didn't follow for external tables
      • Query Results Caching - https://issues.apache.org/jira/browse/HIVE-18513 SemanticAnalyzer
         
        Hortonworks has extra documentation listing these things:
      • Query cache
      • Materialized views, except in a limited way
      • Default statistics gathering
      • Compute queries using statistics
      • Automatic runtime filtering
      • File merging after insert
         
        It'd be great if someone (from Hortonworks or otherwise) could elaborate on those.

      Attachments

        Activity

          People

            larsfrancke Lars Francke
            larsfrancke Lars Francke
            Votes:
            1 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: