Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-10978

Document fs.trash.interval wrt Hive and HDFS Encryption

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • 1.2.0
    • None
    • Documentation, Security

    Description

      This should be documented in 1.2.1 Release Notes

      When HDFS is encrypted (TDE is enabled), DROP TABLE and DROP PARTITION have unexpected behavior when Hadoop Trash feature is enabled.
      The later is enabled by setting fs.trash.interval > 0 in core-site.xml.
      When Trash is enabled, the data file for the table, should be "moved" to Trash bin. If the table is inside an Encryption Zone, this "move" operation is not allowed.
      There are 2 ways to deal with this:
      1. use PURGE, as in DROP TABLE blah PURGE. This skips the Trash bin even if enabled.
      2. set fs.trash.interval = 0. It is critical that this config change is done in core-site.xml. Setting it in hive-site.xml may lead to very strange behavior where the table metadata is deleted but the data file remains. This will lead to data corruption if a table with the same name is later created.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              ekoifman Eugene Koifman
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: