Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.4.0
    • Fix Version/s: None
    • Component/s: hdfs-client, namenode
    • Labels:
      None
    • Tags:
      ttl

      Description

       In production environments we often have a scenario like this: we want to back up files on HDFS for some time and then have them deleted automatically. For example, we keep only one day's logs on local disk due to limited disk space, but we need to keep about one month's logs in order to debug program bugs, so we keep all the logs on HDFS and delete any logs older than one month. This is a typical scenario for HDFS TTL, so we propose that HDFS support TTL.

      Following are some details of this proposal:
       1. HDFS can support a TTL on a specified file or directory
       2. If a TTL is set on a file, the file will be deleted automatically after the TTL expires
       3. If a TTL is set on a directory, its child files and directories will be deleted automatically after the TTL expires
       4. A child file/directory's TTL configuration overrides its parent directory's
       5. A global configuration option is needed to control whether deleted files/directories go to the trash
       6. A global configuration option is needed to control whether a directory with a TTL is deleted once the TTL mechanism has emptied it

      1. HDFS-TTL-Design.pdf
        106 kB
        Zesheng Wu
      2. HDFS-TTL-Design -2.pdf
        115 kB
        Zesheng Wu
      3. HDFS-TTL-Design-3.pdf
        122 kB
        Zesheng Wu

        Issue Links

          Activity

          Zesheng Wu added a comment -

           Thanks Doug Cutting.
           I will move the trash emptier into the TTL daemon after HDFS-6525 and HDFS-6526 are resolved.

          Doug Cutting added a comment -

          But the trash emptier runs inside NN as a daemon thread, instead of a separate daemon process.

          The trash emptier was embedded in the NN mostly just to avoid making folks have to manage another daemon process. However, embedding the emptier has many of the hazards that Chris and Colin described above for embedding TTL. So, if we add a separate daemon process for TTL, then we might also have that process empty the trash and remove the embedded emptier.

          Zesheng Wu added a comment -

           Hi guys, I've uploaded initial implementations to HDFS-6525 and HDFS-6526 separately; I hope you can take a look, and any comments will be appreciated. Thanks in advance.

          Zesheng Wu added a comment -

           Updated the document according to the implementation.

          Steve Loughran added a comment -

          It's sort of ridiculous to require YARN running for what is fundamentally a file system problem. It simply doesn't work in the real world.

          Allen, that's like saying "it's ridiculous to require bash scripts to perform what is fundamentally a unix filesystem problem". One is data, the other is the mechanism to run code near the data. I don't try and hide any local /tmp cleanup init.d scripts inside an ext3 plugin, after all.
           YARN:

          1. handles security by having you include kerberos tickets in the launch.
          2. stops you having to choose a specific server to run this thing (hence point of failure).
          3. lets you scale up when needed.
          Colin Patrick McCabe added a comment -

          Plus, in the places that need this the most, one has to deal with getting what essentially becomes a critical part of uptime getting scheduled, competing with all of the other things running.... and, to remind you, to just delete files. It's sort of ridiculous to require YARN running for what is fundamentally a file system problem. It simply doesn't work in the real world.

          In the examples you give, you're already using YARN for Hive and Pig, so it's already a critical part of the infrastructure. Anyway, you should be able to put the cleanup job in a different queue. It's not like YARN is strictly FIFO.

          One eventually gets to the point that the auto cleaner job is now running hourly just so /tmp doesn't overrun the rest of HDFS. Because these run outside of HDFS, they are slow and tedious and generally fall in the lap of teams that don't do Java so end up doing all sorts of squirrely things to make these jobs work. This also sucks.

          Well, presumably the implementation in this JIRA won't be done by a team that "doesn't do Java" so we should skip that problem, right?

          The comments about /tmp are, I think, another example of how this needs to be highly configurable. Rather than modifying Hive or Pig to set TTLs on things, we probably want to be able to configure the scanner to look at everything under /tmp. Perhaps the scanner should attach a TTL to things in /tmp that don't already have one.
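
           A rough sketch of that last idea - a scanner attaching a default TTL to /tmp entries that don't already carry one - assuming TTLs live in a "user.ttl" xattr; the xattr name and the "7d" default are illustrative only, not an agreed format:

           import java.io.IOException;
           import java.nio.charset.StandardCharsets;
           import org.apache.hadoop.conf.Configuration;
           import org.apache.hadoop.fs.FileStatus;
           import org.apache.hadoop.fs.FileSystem;
           import org.apache.hadoop.fs.Path;

           public class TmpDefaultTtl {
             public static void main(String[] args) throws IOException {
               FileSystem fs = FileSystem.get(new Configuration());
               byte[] defaultTtl = "7d".getBytes(StandardCharsets.UTF_8);  // hypothetical default
               for (FileStatus status : fs.listStatus(new Path("/tmp"))) {
                 // Only tag entries that do not carry a TTL policy yet.
                 if (fs.getXAttrs(status.getPath()).get("user.ttl") == null) {
                   fs.setXAttr(status.getPath(), "user.ttl", defaultTtl);
                 }
               }
             }
           }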

          Running this under YARN has an intuitive appeal to the upstream developers, since YARN is a scheduler. If we write our own scheduler for this inside HDFS, we're kind of duplicating some of that work, including the monitoring, logging, etc. features. I think Steve's comments (and a lot of the earlier comments) reflect that. Of course, to users not already using YARN, a standalone daemon might seem more appealing.

          The proposal to put this in the balancer seems like a reasonable compromise. We can reuse some of the balancer code, and that way, we're not adding another daemon to manage. I wonder if we could have YARN run the balancer periodically? That might be interesting.

          Zesheng Wu added a comment -

          Steve Loughran Thanks for your feedback.
           We have discussed whether to use an MR job or a standalone daemon, and most people upstream have come to an agreement that a standalone daemon is reasonable and acceptable. You can go through the earlier discussion.

          Allen Wittenauer Thanks for your feedback.
           Your suggestion is really valuable and strengthens our confidence in implementing it as a standalone daemon.

          Allen Wittenauer added a comment -

          Let's take a step back and give a more concrete example: /tmp

           /tmp is the plague of Hadoop operations teams everywhere. Between Hive and Pig leaving bits hanging around because their clients can't properly handle signals, and frameworks that by default leave (seemingly) as much crap lying around as they possibly can because someone somewhere might want to debug it someday, /tmp is the cesspool of HDFS. Every competent admin ends up writing some sort of auto-/tmp-cleaner because of these issues. At scale, /tmp can have hundreds of TB and millions of objects in it in less than 24 hours. It sucks.

          One eventually gets to the point that the auto cleaner job is now running hourly just so /tmp doesn't overrun the rest of HDFS. Because these run outside of HDFS, they are slow and tedious and generally fall in the lap of teams that don't do Java so end up doing all sorts of squirrely things to make these jobs work. This also sucks.

           Now, I can see why using an MR job is appealing (easy!), but it isn't very effective. For one, we've already been here once and the result was distch. Hell, there was a big fight just to get distch written and that - years and years later! - still isn't documented because of how slow it works. Throw in directories like /tmp that simply have WAY too much churn and one can see that depending upon MR to work here just isn't viable. Plus, in the places that need this the most, one has to deal with getting what essentially becomes a critical part of uptime getting scheduled, competing with all of the other things running.... and, to remind you, to just delete files. It's sort of ridiculous to require YARN running for what is fundamentally a file system problem. It simply doesn't work in the real world.

           While at Hadoop Summit, a bunch of us sat around a table and were talking about this issue with regards specifically to /tmp. (We didn't know about this JIRA, BTW.) The solution we came up with was basically a service that would bootstrap by reading fsimage and then reading the edits stream by sending the audit information to Kafka. One of the big advantages of this is that we can get near real-time updates of the parts of the file system we need to operate on. Since we only care about a subsection of the file system, the memory requirements are significantly lower and it might be possible to coalesce deletes in a smart way to cut back on RPCs. I suspect it wouldn't be hard to generalize this type of solution to handle multiple use cases. But for me, this is critical admin functionality that HDFS needs desperately and throwing the problem to MR just isn't workable.

          Steve Loughran added a comment -

          My comments

          1. this can be done as an MR job.
          2. If you are worried about excessive load, start exactly one mapper, and consider throttling requests. As some object stores throttle heavy load & reject on a very high DELETE rate, throttling is going to be needed for anything that works against them.
           3. you can then use Oozie as the scheduler.
           4. MR restart handles failures: you just re-enumerate the directories and deleted files don't show up.
           5. If you really, really can't do it as MR, write it as a one-node YARN app, for which I'd recommend Apache Twill as the starting point. In fact, this project would make for a nice example.

           Don't rush to write a new service here for an intermittent job; that just adds a new cost: "a service to install and monitor". Especially when you consider that this new service will need:

          1. a launcher entry point
          2. tests
          3. commitment from the HDFS team to maintain it

           We can implement TTL within a MapReduce job that is similar to DistCp. We could run this MapReduce job over and over again, or nightly or weekly, to delete the expired files and directories.

           Yes, and schedule with Oozie

           (1) Advantages:
           The major advantage of the MapReduce framework is concurrency control; if we want to run multiple tasks concurrently, choosing a MapReduce approach will ease concurrency control.

          There are other advantages

          1. The MR job will be simple to write and can be submitted remotely.
          2. it's trivial to test and therefore maintain.
          3. no need to wait for a new version of Hadoop. You can evolve it locally.
          4. different users, submitting jobs with different kerberos tickets can work on their own files securely.
          5. there's no need to install and maintain a new service.

           (2) Disadvantages:
           For implementing the TTL functionality, one task is enough; multiple tasks will put too much contention and load on the NameNode.

          1. Demonstrate this by writing an MR job and assessing its load when you have a throttled executor.

             On the other hand, using a MapReduce job will introduce additional dependencies and additional overheads.

          1. additional dependencies? In a cluster with MapReduce installed? The only additional dependency is the JAR with the mapper and the reducer.
          2. What "additional overheads"? Are they really any less than running another service in your cluster, with its own classpath, failure modes, security needs?

          My recommendation, before writing a single line of a new service, is to write it as an MR job. You will find it easy to write and maintain; server load is handled by making sleep time a configurable parameter.

           If you can then actually demonstrate that this is inadequate on a large cluster, then consider a service. But start with MapReduce first. If you haven't written an MR job before, don't worry - it doesn't take that long to learn, and having done it you'll understand your users' workflow better.
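
           For reference, a minimal sketch of the kind of single-mapper cleanup job described above, assuming the expired paths have already been enumerated into the job's input (one path per line); the sleep-based throttle and the "ttl.delete.sleep.ms" key are illustrative, not an existing configuration:

           import java.io.IOException;
           import org.apache.hadoop.conf.Configuration;
           import org.apache.hadoop.fs.FileSystem;
           import org.apache.hadoop.fs.Path;
           import org.apache.hadoop.io.LongWritable;
           import org.apache.hadoop.io.NullWritable;
           import org.apache.hadoop.io.Text;
           import org.apache.hadoop.mapreduce.Mapper;

           /** Deletes each path in its input split, sleeping between deletes to throttle NN load. */
           public class TtlDeleteMapper extends Mapper<LongWritable, Text, Text, NullWritable> {
             private FileSystem fs;
             private long sleepMs;

             @Override
             protected void setup(Context context) throws IOException {
               Configuration conf = context.getConfiguration();
               fs = FileSystem.get(conf);
               sleepMs = conf.getLong("ttl.delete.sleep.ms", 10);  // illustrative key
             }

             @Override
             protected void map(LongWritable key, Text value, Context context)
                 throws IOException, InterruptedException {
               Path expired = new Path(value.toString().trim());
               if (fs.delete(expired, true)) {              // recursive delete
                 context.write(value, NullWritable.get());  // record what was removed
               }
               Thread.sleep(sleepMs);                       // crude rate limiting
             }
           }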

          Zesheng Wu added a comment -

          I filed two sub-tasks to track the development of this feature.

          Zesheng Wu added a comment -

          Tsz Wo Nicholas Sze, Thanks for your valuable suggestions.

          Using xattrs for TTL is a good idea. Do we really need ttl in milliseconds? Do you think that the daemon could guarantee such accuracy? We don't want to waste namenode memory space to store trailing zeros/digits for each ttl. How about supporting symbolic ttl notation, e.g. 10h, 5d?

           Yes, I agree with you that the daemon can't guarantee millisecond accuracy, and in fact there's no need to guarantee such accuracy. As you suggested, we can use encoded bytes to save NN memory.

          The name "Supervisor" sounds too general. How about calling it "TtlManager" for the moment? If there are more new features added to the tool, we may change the name later.

          OK, "TtlManager" is more suitable for the moment.

           For setting ttl on a directory foo, write permission on the parent directory of foo is not enough. Namenode also checks rwx for all subdirectories of foo for recursive delete.

           Nice catch. If we want to conform to the delete semantics mentioned by Colin, we should check the subdirectories recursively.

          BTW, permission could be changed from time to time. A user may be able to delete a file/dir at the time of setting TTL but the same user may not have permission to delete the same file/dir when the ttl expires.

           The deletion will be done by a superuser (which the "TtlManager" runs as), so it seems this is not a problem?

          I suggest not to check additional permission requirement on setting ttl but run as the particular user when deleting the file. Then we need to add username to the ttl xattr.

           Good point, but adding the username to the ttl xattr requires more NN memory; we should weigh whether it's worth doing.

          Tsz Wo Nicholas Sze added a comment -

          > How about supporting symbolic ttl notation, e.g. 10h, 5d?

          The actual value in xattr could be encoded as two bytes – 3 bits for the unit (year, month, week, day, hour, minute, second, millisecond) and 13 bits for value. If we store it using milliseconds, it probably needs eight bytes.
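
           For illustration, a sketch of that two-byte packing - 3 bits of unit, 13 bits of value; the unit ordering and bit layout here are assumptions, not a committed format:

           public final class TtlCodec {
             public enum Unit { MILLISECOND, SECOND, MINUTE, HOUR, DAY, WEEK, MONTH, YEAR }

             /** Packs a unit and a 13-bit value into two bytes: uuuv vvvv vvvv vvvv. */
             public static byte[] encode(Unit unit, int value) {
               if (value < 0 || value > 0x1FFF) {
                 throw new IllegalArgumentException("value must fit in 13 bits: " + value);
               }
               int packed = (unit.ordinal() << 13) | value;
               return new byte[] { (byte) (packed >>> 8), (byte) packed };
             }

             public static Unit unit(byte[] encoded) {
               return Unit.values()[(encoded[0] & 0xFF) >>> 5];          // top 3 bits
             }

             public static int value(byte[] encoded) {
               return ((encoded[0] & 0x1F) << 8) | (encoded[1] & 0xFF);  // low 13 bits
             }
           }

           For example, encode(Unit.DAY, 30) stores a 30-day TTL ("30d") in two bytes instead of the eight bytes a millisecond count would need.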

          Tsz Wo Nicholas Sze added a comment -

          Checked the design doc. It looks good. Some comments:

          • "Standalone Daemon Approach ... To Implement a completely new standalone daemon can rarely reuse existing code, will need lots of work to do."
            I don't agree. We may refactor Balancer or other tools if necessary.
          • Using xattrs for TTL is a good idea. Do we really need ttl in milliseconds? Do you think that the daemon could guarantee such accuracy? We don't want to waste namenode memory space to store trailing zeros/digits for each ttl. How about supporting symbolic ttl notation, e.g. 10h, 5d?
          • The name "Supervisor" sounds too general. How about calling it "TtlManager" for the moment? If there are more new features added to the tool, we may change the name later.
           • For setting ttl on a directory foo, write permission on the parent directory of foo is not enough. Namenode also checks rwx for all subdirectories of foo for recursive delete. BTW, permission could be changed from time to time. A user may be able to delete a file/dir at the time of setting TTL but the same user may not have permission to delete the same file/dir when the ttl expires.
            I suggest not to check additional permission requirement on setting ttl but run as the particular user when deleting the file. Then we need to add username to the ttl xattr.
          Zesheng Wu added a comment -

           Updated the documents to address Colin's suggestions.
           Thanks, Colin, for your valuable suggestions.

          Zesheng Wu added a comment -

          Even if it's not implemented at first, we should think about the configuration required here. I think we want the ability to email the admins when things go wrong. Possibly the notifier could be pluggable or have several policies. There was nothing in the doc about configuration in general, which I think we need to fix. For example, how is rate limiting configurable? How do we notify admins that the rate is too slow to finish in the time given?

          OK, I will update the document and post a new version soon.

          You can't delete a file in HDFS unless you have write permission on the containing directory. Whether you have write permission on the file itself is not relevant. So I would expect the same semantics here (probably enforced by setfacl itself).

           That's reasonable; I'll spell it out clearly in the document.

          Colin Patrick McCabe added a comment -

           You mean that we scan the whole namespace first and then split it into 5 pieces according to the hash of the path? Why don't we just complete the work during the first scan? If I misunderstood your meaning, please point it out.

          You need to make one RPC for each file or directory you delete. In contrast, when listing a directory you make only one RPC for every dfs.ls.limit elements (by default 1000). So if you have 5 workers all listing all directories, but only calling delete on some of the files, you still might come out ahead in terms of number of RPCs, provided you had a high ratio of files to directories.
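
           To put rough, purely illustrative numbers on that: with 1,000,000 files spread over 2,000 directories and dfs.ls.limit at its default of 1000, a full listing costs on the order of a few thousand RPCs, so even five workers each doing their own full listing add only around ten to fifteen thousand listing RPCs; if 100,000 of those files are expired, the 100,000 delete RPCs dominate either way, and spreading those deletes across workers is where the parallelism pays off.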

          There are other ways to partition the namespace which are smarter, but rely on some knowledge of what is in it, which you'd have to keep track of.

           A single-node design will work for now, though, considering that you probably want rate-limiting anyway.

           For simplicity, in the initial version we will use logs to record which files/directories are deleted by TTL, and any errors during the deletion process.

          Even if it's not implemented at first, we should think about the configuration required here. I think we want the ability to email the admins when things go wrong. Possibly the notifier could be pluggable or have several policies. There was nothing in the doc about configuration in general, which I think we need to fix. For example, how is rate limiting configurable? How do we notify admins that the rate is too slow to finish in the time given?

           It doesn't need to be an administrator command; users can only setTtl on files/directories that they have write permission on, and can getTtl on files/directories that they have read permission on.

          You can't delete a file in HDFS unless you have write permission on the containing directory. Whether you have write permission on the file itself is not relevant. So I would expect the same semantics here (probably enforced by setfacl itself).

          Zesheng Wu added a comment -

          Thanks Colin Patrick McCabe for your feedback.

          For the MR strategy, it seems like this could be parallelized fairly easily. For example, if you have 5 MR tasks, you can calculate the hash of each path, and then task 1 can do all the paths that are 0 mod 5, task 2 can do all the paths that are 1 mod 5, and so forth. MR also doesn't introduce extra dependencies since HDFS and MR are packaged together.

           You mean that we scan the whole namespace first and then split it into 5 pieces according to the hash of the path? Why don't we just complete the work during the first scan? If I misunderstood your meaning, please point it out.

          I don't understand what you mean by "the mapreduce strategy will have additional overheads." What overheads are you foreseeing?

           Possible overheads: starting a MapReduce job needs to split the input, start an AppMaster, and collect results from random machines (perhaps 'overheads' is not the proper word here).

          I don't understand what you mean by this. What will be done automatically?

          Here "automatically" means we do not have to rely on external tools, the daemon itself can manage the work well.

          How are you going to implement HA for the standalone daemon?

           Good point. As you suggested, one approach is to save the state in HDFS and simply restart the daemon when it fails. But managing the state is complex work, and I am considering how to simplify this. One simpler approach is to treat the daemon as stateless and simply restart it when it fails: we needn't checkpoint, and can just scan from the beginning when it restarts. Because we can require that the work the daemon does is idempotent, starting from the beginning is harmless. Possible drawbacks of the latter approach are that it may waste some time and may delay the work, but they are acceptable.

          I don't see a lot of discussion of logging and monitoring in general. How is the user going to become aware that a file was deleted because of a TTL? Or if there is an error during the delete, how will the user know?

           For simplicity, in the initial version we will use logs to record which files/directories are deleted by TTL, and any errors during the deletion process.

          Does this need to be an administrator command?

           It doesn't need to be an administrator command; users can only setTtl on files/directories that they have write permission on, and can getTtl on files/directories that they have read permission on.

          Colin Patrick McCabe added a comment -

          For the MR strategy, it seems like this could be parallelized fairly easily. For example, if you have 5 MR tasks, you can calculate the hash of each path, and then task 1 can do all the paths that are 0 mod 5, task 2 can do all the paths that are 1 mod 5, and so forth. MR also doesn't introduce extra dependencies since HDFS and MR are packaged together.
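
           A sketch of that partitioning, assuming each of N workers walks the same listing and claims only its own share of paths; the class and names are illustrative:

           import org.apache.hadoop.fs.Path;

           /** Task i of numTasks claims a path iff the path's hash lands in bucket i. */
           public final class PathPartitioner {
             private final int taskIndex;
             private final int numTasks;

             public PathPartitioner(int taskIndex, int numTasks) {
               this.taskIndex = taskIndex;
               this.numTasks = numTasks;
             }

             public boolean mine(Path path) {
               // Keep the modulo non-negative even when hashCode() is negative.
               int bucket = (path.toString().hashCode() % numTasks + numTasks) % numTasks;
               return bucket == taskIndex;
             }
           }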

          I don't understand what you mean by "the mapreduce strategy will have additional overheads." What overheads are you forseeing?

          It is true that you need to avoid overloading the NameNode. But this is a concern with any approach, not just the MR one. It would be good to see a section on this. I think the simplest way to do it is to rate-limit RPCs to the NameNode to a configurable rate.
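
           One way to realize that configurable rate limit, sketched with Guava's RateLimiter (already on Hadoop's classpath); the wrapper class and the permits-per-second figure are illustrative:

           import java.io.IOException;
           import com.google.common.util.concurrent.RateLimiter;
           import org.apache.hadoop.fs.FileSystem;
           import org.apache.hadoop.fs.Path;

           /** Wraps delete calls so the NameNode sees at most maxRpcPerSecond requests. */
           public class ThrottledDeleter {
             private final FileSystem fs;
             private final RateLimiter limiter;

             public ThrottledDeleter(FileSystem fs, double maxRpcPerSecond) {
               this.fs = fs;
               this.limiter = RateLimiter.create(maxRpcPerSecond);  // e.g. 100.0
             }

             public boolean delete(Path path) throws IOException {
               limiter.acquire();             // blocks until a permit is available
               return fs.delete(path, true);  // recursive delete, one NN RPC
             }
           }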

          [for the standalone daemon] The major advantage of this approach is that we don’t need any extra work to finish the TTL work, all will be done in the daemon automatically.

          I don't understand what you mean by this. What will be done automatically?

          How are you going to implement HA for the standalone daemon? I suppose if all the state is kept in HDFS, you can simply restart it when it fails. However, it seems like you need to checkpoint how far along in the FS you are, so that if you die and later get restarted, you don't have to redo the whole FS scan. This implies reading directories in alphabetical order, or similar. You also need to somehow record when the last scan was, perhaps in a file in HDFS.
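
           A small sketch of the "record when the last scan was, perhaps in a file in HDFS" idea; writing to a temporary file and renaming keeps the update close to atomic. The marker paths are assumptions:

           import java.io.IOException;
           import java.nio.charset.StandardCharsets;
           import org.apache.hadoop.fs.FSDataOutputStream;
           import org.apache.hadoop.fs.FileSystem;
           import org.apache.hadoop.fs.Path;

           /** Persists the completion time of the last full scan under a well-known path. */
           public class ScanMarker {
             private static final Path MARKER = new Path("/system/ttl/last-scan");         // assumed location
             private static final Path MARKER_TMP = new Path("/system/ttl/last-scan.tmp");

             public static void recordScanFinished(FileSystem fs, long finishedAtMillis) throws IOException {
               try (FSDataOutputStream out = fs.create(MARKER_TMP, true)) {
                 out.write(Long.toString(finishedAtMillis).getBytes(StandardCharsets.UTF_8));
               }
               fs.delete(MARKER, false);       // the old marker may not exist yet; ignore the result
               fs.rename(MARKER_TMP, MARKER);  // near-atomic publish of the new timestamp
             }
           }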

          I don't see a lot of discussion of logging and monitoring in general. How is the user going to become aware that a file was deleted because of a TTL? Or if there is an error during the delete, how will the user know? Logging is one choice here. Creating a file in HDFS is another.

          The setTtl command seems reasonable. Does this need to be an administrator command?

          Zesheng Wu added a comment -

          An initial version of design doc.

          Hangjun Ye added a comment -

           Thanks Colin. We will start drafting a design doc and ask for you guys' help reviewing it.

           Yes, xattrs take away the big burden of storing the policy; the major question left is where to run the logic.

           Besides these 3 options, another related piece might be the "trash". Currently trash is implemented as a client-side capability: the trash cleanup logic (trash emptier) depends on FileSystem to operate on the namespace and is basically a client-side function. But the trash emptier runs inside NN as a daemon thread, instead of a separate daemon process. I guess it interacts with NN via RPC even though it runs inside NN.

           We can observe some similarities between trash, the balancer, and the proposed TTL: they mainly need data from the NN; they could be implemented as client-side capabilities (via RPC); and they need to run periodically. So could we unify all of these in one framework/daemon? It also echoes Haohui's earlier points. And if it's implemented cleanly enough, the user could optionally run it inside the NN as a daemon thread to have fewer processes to maintain, as long as the user is willing to take the risk of running additional logic inside the NN (without changing the NN's logic for this, as it still interacts with the NN like a client).

           That's just a preliminary idea; we might still want to have the TTL as a separate daemon at first, as that's the most straightforward. Let's discuss more after we have the design doc.

          Colin Patrick McCabe added a comment -

          The xattrs branch was merged to trunk two weeks ago. Since trunk is where development happens anyway, you should be able to start now if you like.

          Maybe post a design doc first if you want feedback. It seems like the big question to be answered is: where is this going to live? We have had proposals for doing this as an MR job, a separate daemon, or part of the balancer. They all have pros and cons... it would be good to write down the benefits and disadvantages of each option before making a choice.

          I think any of these 3 options is possible and I wouldn't vote against any of them. It's up to you. If it's a separate daemon, at minimum, we can put it in contrib/. But you may find that some options have a higher maintenance burden on you. I also think that users don't like running more daemons if they can help it. But perhaps there is something I haven't thought of that makes a separate daemon a good choice.

          Hangjun Ye added a comment -

           Thanks Colin! That's exactly what we want. It seems it's on the way to being merged to branch-2; we will wait for it.

          Colin Patrick McCabe added a comment -

           So I'm just wondering if it's possible to add an "opaque feature" in INode to store arbitrary bytes? The NN just stores it and doesn't interpret it. As an analogy, HBase supports "tags" to store arbitrary metadata at a cell: https://issues.apache.org/jira/browse/HBASE-8496

          It sounds like extended attributes (xattrs) might work here. They were recently implemented in HDFS-2006 and subtasks. They basically let you associate some arbitrary key/value pairs with each inode. Check out https://issues.apache.org/jira/secure/attachment/12644341/HDFS-XAttrs-Design-3.pdf
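
           A minimal illustration of those xattr calls (added in HDFS-2006), just to show where a TTL policy could live; the "user.ttl" key, the "30d" value, and the path are examples, not a settled format:

           import java.nio.charset.StandardCharsets;
           import org.apache.hadoop.conf.Configuration;
           import org.apache.hadoop.fs.FileSystem;
           import org.apache.hadoop.fs.Path;

           public class TtlXAttrExample {
             public static void main(String[] args) throws Exception {
               FileSystem fs = FileSystem.get(new Configuration());
               Path dir = new Path("/backup/logs");  // hypothetical path

               // Attach a TTL policy as a user-namespace extended attribute.
               fs.setXAttr(dir, "user.ttl", "30d".getBytes(StandardCharsets.UTF_8));

               // Later, a scanner reads it back to decide whether the entry has expired.
               String ttl = new String(fs.getXAttr(dir, "user.ttl"), StandardCharsets.UTF_8);
               System.out.println(dir + " has ttl " + ttl);
             }
           }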

          Hangjun Ye added a comment -

          Thanks Haohui and Colin.

           The balancer or a balancer-like standalone daemon sounds like a feasible approach to us. A special requirement of the TTL cleanup is that we need persistent storage to hold all the TTL policies set by users, which the balancer and DistCp don't require. It would be nice if the namenode could store such information so we don't have to find somewhere else.

           So I'm just wondering if it's possible to add an "opaque feature" in INode to store arbitrary bytes? The NN just stores it and doesn't interpret it. As an analogy, HBase supports "tags" to store arbitrary metadata at a cell: https://issues.apache.org/jira/browse/HBASE-8496

           Then we could have external tools/daemons let end-users set their TTL policies and do the cleanup logic. The only change to the NN is to add the new feature and expose APIs to set/get it; the complicated and volatile logic (metadata encoding, interpretation, cleanup) is done outside the NN. And the change might have much broader usage than just TTL.

          Any thoughts?

          Colin Patrick McCabe added a comment -

          I'm -1 on the idea of putting this in the NameNode. Let's see if we can work together to figure out where the best place for it is, though. Can you comment on why MR is not an option for you? I am concerned that there will be a lot of wheel reinvention if we don't use MR (authentication, resource management, scheduling, etc. etc.) Why not do as DistCp does? As Haohui said, another option is the balancer or a completely standalone daemon.

          Haohui Mai added a comment -

          I think the comments against implementing it in NN are legit. Popping up one level, I'm wondering what is the best approach to meet the following requirements:

           1. Fine-tune the behavior of HDFS, which requires information from HDFS's internal data structures.
           2. Perform the above task without MapReduce, to simplify the operations of the cluster.

           To meet the above requirements, today it looks to me like there is no way other than making massive changes in HDFS.

           What I'm wondering is whether it is possible to architect the system to make things easier. For example, is it possible to generalize the architecture of the balancer we have today to accomplish these types of tasks? From a very high level it looks to me that most of the code can sit outside of the NN while meeting the above requirements. Since this is aiming for advanced usages, there is more freedom in the design of the architecture. For instance, the architecture might choose to expose the details of the implementation and not guarantee compatibility (like an Exokernel type of system).

          Thoughts?

          Hangjun Ye added a comment -

          Thanks Colin for your summarization, I'd like to try to address some of your concerns and questions:

          security / correctness concerns: it's easy to make a mistake that could bring down the NameNode or entire FS

          I agree that's the cost, the developer must be careful and guarantee the code quality.

          non-generality to systems using s3 or another FS in addition to HDFS

Yes, it's only applicable to HDFS. I guess snapshots are only applicable to HDFS too (I could be wrong here, as I haven't read the snapshot code), so it shouldn't bring much confusion.

          issues with federation (which NN does the cleanup? How do you decide?)

Each NN only takes care of the cleanup of files/directories in its own namespace. Let's consider TTL as an attribute attached to files/directories; there's not much difference between federated and non-federated configurations.

          complexities surrounding our client-side Trash implementation and our server-side snapshots

Not much difference whether it's implemented inside or outside the NN.

          configuration burden on sysadmins

We need to think about the total cost of ownership. Implementing it inside the NN increases HDFS's own configuration burden for sure, but implementing it in a separate system just moves the burden from HDFS to a new system, which would have a higher total cost in general.

          inability to change the cleanup code without restarting the NameNode

Yes, that's the cost, but it should be minor. Users might change TTL policies frequently according to their requirements, but the cleanup code shouldn't change frequently (unless the implementation is crappy).

          HA concerns (need to avoid split-brain or lost updates)

That's a good question. We haven't thought this over; it seems the cleanup code should only run on the active NN, as the standby doesn't have the latest updates and can't initiate edits.
It shouldn't introduce split-brain, as it doesn't change the NN's core flow, but it should be implemented carefully anyway.

          error handling (where do users find out about errors?)

I haven't thought of any runtime errors (at the cleanup stage) that the end users need to be notified of. It should be the sysadmin who cares about errors at this stage, and he/she could find them in the logs. For errors when users set a TTL through the command line or APIs, the users should be notified directly.

          semantics: disappearing or time-limited files is an unfamiliar API, not like the traditional FS APIs we usually implement

First, there's not much difference whether it's implemented inside or outside the NN. Moreover, as long as users have the requirement of TTL-based cleanup, it shouldn't be difficult for them to accept such an API.

          Making this pluggable doesn't fix any of those problems, and it adds some more:

The motivation isn't to fix possible problems of implementing the TTL policy on the server side; it's to separate the mechanism from specific jobs. It provides an elegant approach to implementing such an extension to the NN and makes the common part of such extensions reusable.

          The only points I've seen raised in favor of doing this in the NameNode are:...

          IMHO, the major points for doing this in NN are:

1. it's a more natural way for end-users: in most cases they don't have to interact with HDFS directly, but they would have to resort to another system for the TTL requirement.
2. lower maintenance cost (and possibly lower implementation cost too, but that depends on the current state of the NN).

          To the second point, HBase doesn't use coprocessors for cleanup jobs... it uses them for things like secondary indices, a much better-defined problem.

The HBase coprocessor is just an analogy... possibly not a good one, but I can't think of a better one right now. HBase could use coprocessors for cleanup jobs. HBase's default cleanup policies are "Number of Versions" and "TTL", which are configured per Column Family. If you have a special requirement to clean up cells based on their content, for example using the value of a specific column as the "Number of Versions" to keep, you could do it using a coprocessor. You could do the same thing in an MR job for sure. I'm not saying using coprocessors is a good practice in general, but for some use cases it might be.

A little bit of background about what we are doing: both Zesheng and I are from Xiaomi, a fast-growing mobile internet company in China. We are on a team that supports the company's data infrastructure using the open-source Hadoop ecosystem, and our role might be similar to that of some teams at Facebook. We make improvements to the open-source software per the requirements of our products and would like to contribute our improvements back to the community. We have contributed quite a few patches to the HBase community, and two members of our team, Liang Xie and Honghua Feng, became HBase committers recently. We improve HDFS at the same time and are also happy to collaborate with the community.

For this specific feature proposal, a NN-side TTL implementation and a general NN extension mechanism, the feasibility isn't very clear to us, as it's just an idea so far. We'd like to spend time investigating its feasibility further. It's still preferable if feasible. If we encounter insurmountable technical challenges, we will give up for sure. So how about keeping this JIRA issue open for now (we might open another JIRA issue to track the general NN extension mechanism), and we will get back after we do the investigation? Whatever approach we choose eventually, we always appreciate you guys' help in working out the solution.

          Colin Patrick McCabe added a comment -

          Chris, Andrew, and I have brought up a lot of reasons why this probably doesn't make sense in the NameNode.

          Just to summarize:

          • security / correctness concerns: it's easy to make a mistake that could bring down the NameNode or entire FS
          • non-generality to systems using s3 or another FS in addition to HDFS
          • issues with federation (which NN does the cleanup? How do you decide?)
          • complexities surrounding our client-side Trash implementation and our server-side snapshots
          • configuration burden on sysadmins
          • inability to change the cleanup code without restarting the NameNode
          • HA concerns (need to avoid split-brain or lost updates)
          • error handling (where do users find out about errors?)
          • semantics: disappearing or time-limited files is an unfamiliar API, not like the traditional FS APIs we usually implement

          Making this pluggable doesn't fix any of those problems, and it adds some more:

          • API stability issues (the INode and Feature classes have changed a lot, and we make no guarantees there)
          • CLASSPATH issues (if I want to send an email about a cleanup job with the FooEmailer library, how do I get that into the NameNode's CLASSPATH?) How do I avoid jar conflicts?

          The only points I've seen raised in favor of doing this in the NameNode are:

          • the NameNode already has an authorization system which this could use.
          • HBase has coprocessors which also allow loading arbitrary code.

          To the first point, there are lots of other ways to deal with authorization, like by using YARN (which also has authorization), or configuring the cleanup using files in HDFS.

          To the second point, HBase doesn't use coprocessors for cleanup jobs... it uses them for things like secondary indices, a much better-defined problem. The functionality you want is not something that should be implemented as a coprocessor, even if we had those.

          Hangjun Ye added a comment -

BTW: TTL is one of the applications that could benefit from a general mechanism. Haohui gave several nice use cases that would benefit as well.

          Hangjun Ye added a comment -

I think we have two discussions here now: a TTL cleanup policy (implemented inside or outside the NN), and a general mechanism to help implement such a policy easily inside the NN.

I've been convinced that a specific TTL cleanup policy implementation is NOT likely to fly directly in the NN's core code; I'm more interested in pursuing a mechanism that enables such a policy implementation.

Consider HBase's coprocessors (https://blogs.apache.org/hbase/entry/coprocessor_introduction): people can extend the functionality easily (w/o extending the base classes), for things such as counting rows or secondary indexes. We could argue that most of these usages do NOT necessarily need to be implemented on the server side, but having such a mechanism gives users an opportunity to choose what is most suitable for their requirements.

If the NN had such an extensible mechanism (as Haohui suggested earlier), we could implement a TTL cleanup policy in the NN in an elegant way (w/o touching the base classes). The NN has already abstracted out "INode.Feature", so we could implement a TTLFeature to hold the metadata. The policy implementation doesn't have to go into the community's codebase if it's too specific; we could keep it in our private branch. But building on a general mechanism (w/o touching the base classes) makes it easy to maintain (considering we upgrade to new Hadoop releases regularly).
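
To make the TTLFeature idea concrete, a very rough sketch is below. It assumes INode.Feature remains the plain marker interface it is today; that interface is a private, unstable NameNode API (which is exactly the stability concern raised elsewhere in this thread), so this is illustrative only, not an agreed design.

{code:java}
// Rough sketch only. INode.Feature is a private, unstable NameNode API; this
// assumes it stays a plain marker interface and is not an agreed design.
import org.apache.hadoop.hdfs.server.namenode.INode;

public final class TtlFeature implements INode.Feature {
  /** Absolute expiration time, in milliseconds since the epoch. */
  private final long expiryTimeMs;

  public TtlFeature(long expiryTimeMs) {
    this.expiryTimeMs = expiryTimeMs;
  }

  public long getExpiryTimeMs() {
    return expiryTimeMs;
  }

  public boolean isExpired(long nowMs) {
    return nowMs >= expiryTimeMs;
  }
}
{code}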

          If you guys think such a general mechanism deserves to be considered, we are happy to contribute some efforts.

          Andrew Wang added a comment -

Even if implementing security were a major hurdle (and I really don't think it is as hard as you think, considering we have quite a few examples of Hadoop auth besides the NN), the rest of Chris's points still stand. I also think that, semantically, a TTL is not an expected type of file attribute for those of us with a Unix background, which leads to TOCTOU issues like the one Chris pointed out, even if just because of user expectations.

          So, at this point, I think there are strong technical reasons not to implement this in the NN, and strong reasons to do this type of data lifecycle management externally.

          Colin Patrick McCabe added a comment -

One approach, as you suggested, is that we implement a separate cleanup platform: users submit their policy to this platform, and we do the real cleanup on HDFS on behalf of users (as a superuser or other powerful user). But the separate platform has to implement an authentication/authorization mechanism to make sure the user is who they claim to be and has the permission (authentication is a must; authorization might be optional, but it would be better to have it). It duplicates what the NameNode has already done with Kerberos/ACLs.... If it's implemented inside the NameNode, we could leverage the NameNode's authentication/authorization mechanism.

          YARN / MR / etc already have authentication frameworks that you can use. For example, you can set up a YARN queue with certain permissions so that only certain users or groups can submit to it.

          Another idea is to have an HDFS directory where each group (or user) puts their files containing the cleanup policies they want, and let HDFS take care of permissions.
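
As a rough illustration of that last idea (not an agreed design), a cleanup tool could read per-user policy files from such a directory and let the directory's HDFS permissions control who may write policies. The /cleanup-policies location and the one-policy-per-line "path maxAgeMillis" format below are assumptions made only for this sketch.

{code:java}
// Purely illustrative: policies live as plain files under an HDFS directory
// whose permissions control who may write them. Location and line format are
// assumptions for this sketch.
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class PolicyFileCleanup {
  public static void main(String[] args) throws IOException {
    FileSystem fs = FileSystem.get(new Configuration());
    Path policyDir = new Path("/cleanup-policies");      // hypothetical location
    long now = System.currentTimeMillis();

    for (FileStatus policyFile : fs.listStatus(policyDir)) {
      if (policyFile.isDirectory()) {
        continue;                                        // flat layout assumed
      }
      try (BufferedReader in = new BufferedReader(new InputStreamReader(
          fs.open(policyFile.getPath()), StandardCharsets.UTF_8))) {
        String line;
        while ((line = in.readLine()) != null) {
          String[] parts = line.trim().split("\\s+");
          if (parts.length != 2) {
            continue;                                    // skip malformed lines
          }
          Path target = new Path(parts[0]);
          long maxAgeMs = Long.parseLong(parts[1]);
          if (!fs.exists(target)) {
            continue;
          }
          for (FileStatus st : fs.listStatus(target)) {
            if (!st.isDirectory()
                && now - st.getModificationTime() > maxAgeMs) {
              // Normal permission checks apply to whoever runs this tool.
              fs.delete(st.getPath(), false);
            }
          }
        }
      }
    }
  }
}
{code}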

          Hangjun Ye added a comment -

          This logic is subject to time of check/time of use race conditions, possibly resulting in incorrect deletion of data. For example, imagine the following sequence: ...

It doesn't sound like a "race condition" to me. We could consider TTL an "independent" attribute of a file, just like the file owner and the replication factor. In the above scenario, it seems to work as expected: the admin only changes the owner of /file1 but leaves the TTL attribute as is, so the TTL should still be effective. If the admin doesn't want that to happen, he/she should "unset" the TTL attribute (i.e. set it to infinity) first, before changing the owner of /file1, and the new owner of /file1 could set a new TTL attribute later if needed.

          Chris Nauroth added a comment -

          The implemented mechanism inside the NameNode would (maybe periodically) execute all policies specified by users, and it would do it as a superuser safely, as authentication/authorization have been done when user set their policies to the NameNode.

          This logic is subject to time of check/time of use race conditions, possibly resulting in incorrect deletion of data. For example, imagine the following sequence:

          1. A user calls the setttl command on /file1. Authentication is successful, and the authenticated user is the file owner, so NN decides the user is authorized to set a TTL.
          2. An admin changes the owner of /file1 in order to revoke the user's access.
          3. Now the NN's background expiration thread/job starts running. It finds a TTL on /file1 and deletes it. Since this is running as the HDFS super-user, nothing blocks the delete, even though the user who set the TTL really no longer has permission to delete.

          With an external process, authentication and authorization are enforced at the time of delete for the specific user, so there is no time of check/time of use race condition, and there is no chance of an incorrect delete.

          Running some code as a privileged user might look expedient in some ways, but it also compromises the file system permissions model somewhat.
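
To illustrate the enforcement-at-delete-time point, the sketch below shows one way an external cleanup service could issue deletes as the policy's owner via proxy-user impersonation, so the NameNode's permission check applies at the moment of the delete. It assumes the service principal is configured as a proxy user (hadoop.proxyuser.* in core-site.xml); user and path values are placeholders, and running deletes directly as each user is equally possible.

{code:java}
// Sketch: issue the delete as the user who owns the policy (here via
// proxy-user impersonation) instead of as the HDFS super-user, so the NN
// enforces permissions at the moment of the delete.
import java.security.PrivilegedExceptionAction;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.security.UserGroupInformation;

public class DeleteAsPolicyOwner {
  public static boolean deleteAs(String user, Path path, Configuration conf)
      throws Exception {
    UserGroupInformation proxy = UserGroupInformation.createProxyUser(
        user, UserGroupInformation.getLoginUser());
    // If the owner's access was revoked (as in the scenario above), this
    // delete now fails instead of silently succeeding as the super-user.
    return proxy.doAs((PrivilegedExceptionAction<Boolean>) () -> {
      FileSystem fs = FileSystem.get(conf);
      return fs.delete(path, true);
    });
  }
}
{code}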

          Hangjun Ye added a comment -

          Thanks Chris and Colin for your valuable comments, I'd like to address your concern about the "security" problem.

First, our scenario is as follows:
We have a Hadoop cluster shared by multiple teams for their storage and computation requirements, and "we" are the dev/supporting team that ensures the functionality and availability of the cluster. The cluster is security-enabled to ensure every team can only access the files that it should. So every team is a common user of the cluster, and "we" own the superuser.

Currently several teams have the requirement to clean up files based on a TTL policy. Obviously they could run cron jobs to do that themselves, but that would mean a lot of repeated work, so we'd better have a mechanism that lets them specify/implement their policy easily.

One approach, as you suggested, is that we implement a separate cleanup platform: users submit their policy to this platform, and we do the real cleanup on HDFS on behalf of users (as a superuser or other powerful user). But the separate platform has to implement an authentication/authorization mechanism to make sure the user is who they claim to be and has the permission (authentication is a must; authorization might be optional, but it would be better to have it). It duplicates what the NameNode has already done with Kerberos/ACLs.

If it's implemented inside the NameNode, we could leverage the NameNode's authentication/authorization mechanism. For example, we could provide a "./bin/hdfs dfs -setttl <path/file>" command (just like -setrep). Users could specify their policy with it, and the NameNode would persist it somewhere, maybe as an attribute of the file like the replication factor. The mechanism implemented inside the NameNode would (maybe periodically) execute all policies specified by users, and it could do so safely as a superuser, as authentication/authorization were already done when the users set their policies on the NameNode.

          To address several detailed concerns you raised:

          • "buggy or malicious code": The proposed concept (actually Haohui proposed) should be pretty similar to HBase's coprocessor (http://hbase.apache.org/book.html#cp), it's a plug-in or extension of NameNode and most likely enabled at deployment time. A common user can't submit it, the cluster owner could do. So the code is not arbitrary and the quality/safety could be guaranteed.
          • "Who exactly is the effective user running the delete, and how do we manage their login and file permission enforcement": the extension is run as superuser/system, a specific extension implementation could do any permission enforcement if needed. For the "TTL-based cleanup policy executor", no permission enforcement is needed at this stage as authentication/authorization have been done when user set policy.

I think the idea proposed by Haohui is to have an extensible mechanism in the NameNode to run jobs that depend heavily on namespace data, while keeping the specific job's code as decoupled from the NameNode's core code as possible. It's certainly not easy, as Chris pointed out several problems, like HA and concurrency, but it might deserve to be thought about.
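
As a hedged aside on the "-setttl" idea above: on releases that already support extended attributes, a per-file TTL could be persisted today with no NameNode change, since the NN authenticates and authorizes the setXAttr call when the user sets the value, and an external scanner can read it back later. The "user.ttl" xattr name and the millisecond encoding below are assumptions for this sketch, not a proposed interface.

{code:java}
// Not the proposed '-setttl' command: a sketch of persisting a per-file TTL
// via extended attributes, readable later by an external cleanup scanner.
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class XAttrTtl {
  private static final String TTL_XATTR = "user.ttl";

  /** The NN authenticates and authorizes this call for the calling user. */
  public static void setTtl(FileSystem fs, Path path, long ttlMs)
      throws IOException {
    fs.setXAttr(path, TTL_XATTR,
        Long.toString(ttlMs).getBytes(StandardCharsets.UTF_8));
  }

  /** Returns the TTL in milliseconds, or -1 if none is set. */
  public static long getTtl(FileSystem fs, Path path) throws IOException {
    try {
      byte[] raw = fs.getXAttr(path, TTL_XATTR);
      return Long.parseLong(new String(raw, StandardCharsets.UTF_8));
    } catch (IOException e) {
      return -1;                     // xattr absent, or xattrs unsupported
    }
  }

  public static void main(String[] args) throws IOException {
    FileSystem fs = FileSystem.get(new Configuration());
    setTtl(fs, new Path(args[0]), 30L * 24 * 60 * 60 * 1000);   // 30 days
  }
}
{code}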

          Colin Patrick McCabe added a comment -

          I agree with Chris' comments here. There are just so many advantages to running outside the NameNode, that I think that's the design we should start with. If we later find something that would work better with NN support, we can think about it then.

          Hangjun Ye wrote:

Another benefit to having it inside the NN is that we don't have to handle the authentication/authorization problem in a separate system. For example, we have a shared HDFS cluster for many internal users, and we don't want someone to set a TTL policy on someone else's files. The NN could handle it easily with its own authentication/authorization mechanism.

          The client handles authentication/authorization very well, actually. You can choose to run your cleanup job as superuser (can do anything) or some other less powerful user who is limited (safer). But when you run inside the NameNode, there are no safeguards... everything is effectively superuser. And you can destroy or corrupt the entire filesystem very easily that way, especially if your cleanup code is buggy.

          Chris Nauroth added a comment -

          ...run a job (maybe periodically) over the namespace inside the NN...

          Please correct me if I misunderstood, but this sounds like execution of arbitrary code inside the NN process. If so, this opens the risk of resource exhaustion at the NN by buggy or malicious code. Even if there is a fork for process isolation, it's still sharing machine resources with the NN process. If the code is running as the HDFS super-user, then it has access to sensitive resources like the fsimage file. If multiple such "in-process jobs" are submitted concurrently, then it would cause resource contention with the main work of the NN. Multiple concurrent jobs also gets into the realm of scheduling. There are lots of tough problems here that would increase the complexity of the NN.

          Even putting that aside, I see multiple advantages in implementing this externally instead of embedded inside the NN. Here is a list of several problems that an embedded design would need to solve, and which I believe are already easily addressed by an external design. This includes/expands on issues brought up by others in earlier comments too.

          • Trash: The description mentions trash capability as a requirement. Trash functionality is currently implemented as a client-side capability.
            • Embedded: We'd need to reimplement trash inside the NN, or heavily refactor for code sharing.
  • External: The client already has the trash capability, so this problem is already solved (see the sketch after this list).
          • Integration: Many Hadoop deployments use an alternative file system like S3 or Azure storage. In these deployments, there is no NameNode.
            • Embedded: The feature is only usable for HDFS-based deployments. Users of alternative file systems can't use the feature.
            • External: The client already has the capability to target any Hadoop file system implementation, so this problem is already solved.
          • HA: In the event of a failover, we must guarantee that the former active NN does not drive any expiration activity.
            • Embedded: Any background thread or "in-process jobs" running inside the NN must coordinate shutdown during a failover.
            • External: Thanks to our client-side retry policies, an external process automatically transitions to the new active NN after a failover, and there is no risk of split-brain scenario, so this problem is already solved.
          • Authentication/Authorization: Who exactly is the effective user running the delete, and how do we manage their login and file permission enforcement?
            • Embedded: You mention there is an advantage to running embedded, but I didn't quite understand. Are you suggesting running the deletes inside a UserGroupInformation#doAs for the specific user?
            • External: The client already knows how to authenticate RPC, and the NN already knows how to enforce authorization on files for that authenticated user, so this problem is already solved.
          • Error Handling: How do users find out when the deletes don't work?
            • Embedded: There is no mechanism for asynchronous user notification inside the NN. As others have mentioned, there is a lot of complexity in this area. If it's email, then you need to solve the problem of reliable email delivery (i.e. retries if SMTP gateways are down). If it's monitoring/alerting, then you need to expose new monitoring endpoints to publish sufficient information.
            • External: The client's exception messages are sufficient to identify file paths that failed during synchronous calls, and the NN audit log is another source of troubleshooting information, so this problem is already solved.
          • Federation: With federation, the HDFS namespace is split across multiple NameNodes.
            • Embedded: The design needs to coordinate putting the right expiration work on the right NN hosting that part of the namespace.
            • External: The client has the capability to configure a client-side mount table that joins together multiple federated namespaces, and ViewFileSystem then routes RPC to the correct NN depending on the target file path, so this problem is already solved.
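
Illustrating the Trash point in the list above: an external cleanup process gets trash behaviour by calling the existing client-side helper rather than deleting outright, so nothing needs to be reimplemented in the NN. Whether to use trash at all would be the tool's own configuration choice; the class below is only a sketch.

{code:java}
// Sketch: use the existing client-side trash helper, falling back to a plain
// delete when trash is disabled or the move fails.
import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.fs.Trash;

public class ExpireToTrash {
  public static void expire(FileSystem fs, Path path, Configuration conf,
      boolean useTrash) throws IOException {
    if (useTrash && Trash.moveToAppropriateTrash(fs, path, conf)) {
      return;                        // moved into the user's .Trash directory
    }
    fs.delete(path, true);           // fall back to a plain recursive delete
  }
}
{code}
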
          Hangjun Ye added a comment -

          Thanks Haohui, that's clear to us now.
          That's interesting and we'd like to pursue the more general approach.
          We will take time to work out a rough design and ask you guys to review.

          Haohui Mai added a comment -

          Your suggestion is that we'd better have a general mechanism/framework to run a job (maybe periodically) over the namespace inside the NN, and the TTL policy is just a specific job that might be implemented by user?

          This is correct. There are a couple additional use cases that might be useful to keep in mind:

1. Archiving data. TTL is one of the use cases here.
2. Backing up or syncing data between clusters. It's nice to back up / sync data between clusters for disaster recovery, without running an MR job.
          3. Balancing data between data nodes.

          A mechanism that can support the above use cases can be quite powerful and improve the state of the art. I'm happy to collaborate if this is the direction you guys want to pursue.

We are heavy users of Hadoop and also make some in-house improvements per our business requirements. We definitely want to contribute the improvements back to the community.

          This is great to hear. Patches are welcome.

          Hangjun Ye added a comment -

          Thanks Haohui for your reply.

          Let me confirm I got your point. Your suggestion is that we'd better have a general mechanism/framework to run a job (maybe periodically) over the namespace inside the NN, and the TTL policy is just a specific job that might be implemented by user?
          That's an interesting direction, we will think about it.

We are heavy users of Hadoop and also make some in-house improvements per our business requirements. We definitely want to contribute the improvements back to the community, as long as it's helpful for the community.

          Haohui Mai added a comment -

TTL is a very simple (but general) policy, and we might even consider it an attribute of a file, like the number of replicas. It seems it wouldn't introduce much complexity to handle it in the NN.

Another benefit to having it inside the NN is that we don't have to handle the authentication/authorization problem in a separate system. For example, we have a shared HDFS cluster for many internal users, and we don't want someone to set a TTL policy on someone else's files. The NN could handle it easily with its own authentication/authorization mechanism.

I agree that running jobs over the namespace without MR should be the direction to go. However, I think the main holdback here is that the design mixes the mechanism (running jobs over the namespace without MR) and the policy (TTL) together.

As Colin Patrick McCabe pointed out earlier, every user has his/her own policy. Given that HDFS has a wide range of users, this type of design/implementation is unlikely to fly in the ecosystem.

Currently HDFS does not have the above mechanism; you're more than welcome to contribute a patch.

          Hangjun Ye added a comment -

Implementing it outside the NN is definitely another option, and I agree with Colin that it's not feasible to implement a complex cleanup policy (like one based on storage space) inside the NN.

TTL is a very simple (but general) policy, and we might even consider it an attribute of a file, like the number of replicas. It seems it wouldn't introduce much complexity to handle it in the NN.

Another benefit to having it inside the NN is that we don't have to handle the authentication/authorization problem in a separate system. For example, we have a shared HDFS cluster for many internal users, and we don't want someone to set a TTL policy on someone else's files. The NN could handle it easily with its own authentication/authorization mechanism.

So far a TTL-based cleanup policy is good enough for our scenario (Zesheng and I are from the same company, and we are supporting our company's internal usage of Hadoop), and it would be nice to have a simple and workable solution in HDFS.

          Jian Wang added a comment -

I think it is better for you to provide a (backup & cleanup) platform for your users; you can implement a lot of cleanup strategies for the users in your company.
This can reduce a lot of repeated work.

          Colin Patrick McCabe added a comment -

Why do you think that putting the cleanup mechanism into the NameNode seems questionable? Can you point out some details?

          Andrew and Chris commented about this earlier. See:
          https://issues.apache.org/jira/browse/HDFS-6382?focusedCommentId=13998933&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13998933

          I would add to that:

• Every user of this is going to want a slightly different deletion policy. It's just way too much configuration for the NameNode to reasonably handle. Much easier to do it in a user process. For example, maybe you want to keep at least 100 GB of logs, 100 GB of "foo" data, and 1000 GB of "bar" data. It's easy to handle this complexity in a user process, but incredibly complex and frustrating to handle it in the NameNode.
          • Your nightly MR job (or whatever) also needs to be able to do things like email sysadmins when the disks are filling up, which the NameNode can't reasonably be expected to do.
          • I don't see a big advantage to doing this in the NameNode, and I see a lot of disadvantages (more complexity to maintain, difficult configuration, need to restart to update config)

          Maybe I could be convinced otherwise, but so far the only argument that I've seen for doing it in the NN is that it would be re-usable. And this could just as easily apply to an implementation outside the NN. For example, as I pointed out earlier, DistCp is reusable, without being in the NameNode.
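
As a sketch of the kind of per-dataset policy described in the first bullet above, handled entirely in a user process: delete the oldest files under a directory until the total size fits a byte budget. The /logs path and the 100 GB figure are just example values, not anything proposed here.

{code:java}
// Sketch: a size-budget retention policy running as an ordinary HDFS client.
import java.io.IOException;
import java.util.Arrays;
import java.util.Comparator;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class KeepAtMost {
  public static void enforce(FileSystem fs, Path dir, long budgetBytes)
      throws IOException {
    FileStatus[] files = fs.listStatus(dir);
    Arrays.sort(files,
        Comparator.comparingLong(FileStatus::getModificationTime));

    long total = 0;
    for (FileStatus st : files) {
      total += st.getLen();
    }
    // Drop oldest-first until we are back under budget.
    for (FileStatus st : files) {
      if (total <= budgetBytes) {
        break;
      }
      if (!st.isDirectory() && fs.delete(st.getPath(), false)) {
        total -= st.getLen();
      }
    }
  }

  public static void main(String[] args) throws IOException {
    FileSystem fs = FileSystem.get(new Configuration());
    enforce(fs, new Path("/logs"), 100L * 1024 * 1024 * 1024);  // keep <= 100 GB
  }
}
{code}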

          Zesheng Wu added a comment -

          Like I said, we should write such a tool and add it to the base Hadoop distribution. This is similar to what we did with DistCp. Then users would not need to write their own versions of this stuff.

          Sure, this is another good option.

          It's important to distinguish between creating a tool to handle deleting old files (which we all agree we should do), and putting this into the NameNode (which seems questionable).

Why do you think that putting the cleanup mechanism into the NameNode seems questionable? Can you point out some details?

          Colin Patrick McCabe added a comment -

But if there's no internal cleanup mechanism in HDFS, all users (across companies) need to write their own cleanup tools, which means a lot of repeated work.

          Like I said, we should write such a tool and add it to the base Hadoop distribution. This is similar to what we did with DistCp. Then users would not need to write their own versions of this stuff.

          It's important to distinguish between creating a tool to handle deleting old files (which we all agree we should do), and putting this into the NameNode (which seems questionable).
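
          To make concrete what "such a tool" shipped with the base Hadoop distribution might look like, here is a minimal, purely illustrative skeleton using the standard Tool/ToolRunner pattern that DistCp itself follows. The class name and the "-olderThan" option mentioned in the comments are assumptions for illustration, not anything proposed on this issue.

            import org.apache.hadoop.conf.Configuration;
            import org.apache.hadoop.conf.Configured;
            import org.apache.hadoop.util.Tool;
            import org.apache.hadoop.util.ToolRunner;

            /**
             * Skeleton of a shared cleanup utility packaged as a standard Hadoop Tool
             * (the same pattern DistCp uses), so that every site does not have to
             * reinvent the same script.  Hypothetical sketch only.
             */
            public class CleanupTool extends Configured implements Tool {
              @Override
              public int run(String[] args) throws Exception {
                // Parse options such as a hypothetical "-olderThan 30d" plus a target
                // path, then list and delete expired files through the FileSystem API
                // (see the FileSystem-based sketch further down in this thread).
                return 0;
              }

              public static void main(String[] args) throws Exception {
                System.exit(ToolRunner.run(new Configuration(), new CleanupTool(), args));
              }
            }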

          Zesheng Wu added a comment -

          Thanks Colin Patrick McCabe. It's true that an outside cleanup tool is not too difficult or complex to implement, and there are many ways to satisfy the requirements. But if there's no internal cleanup mechanism in HDFS, all users (across companies) need to write their own cleanup tools, which is a lot of repeated work. If HDFS supported an internal cleanup mechanism, that would clearly be more convenient; do you agree?

          Regarding snapshots, I think a snapshotted file that is deleted by the TTL mechanism behaves exactly the same as a snapshotted file that is deleted manually by a user.

          Colin Patrick McCabe added a comment -

          I don't think a nightly (or weekly) cleanup job that lives outside HDFS is that difficult or complex to write. If it were done as a MapReduce job, it could easily work on the whole cluster. This is something we could consider putting upstream.

          Another issue to consider here is snapshots. Deleting files is not going to free space if they exist in a snapshot.
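
          As a rough sketch of what the core of such a nightly or weekly cleanup job could look like, assuming a made-up path and retention period and using only the public FileSystem API (a real tool would recurse, apply per-directory policy, and report what it did):

            import org.apache.hadoop.conf.Configuration;
            import org.apache.hadoop.fs.FileStatus;
            import org.apache.hadoop.fs.FileSystem;
            import org.apache.hadoop.fs.Path;

            /** Sketch: delete files whose modification time is older than a retention period. */
            public class OldFileCleaner {
              public static void main(String[] args) throws Exception {
                Path root = new Path("/backup/logs");           // illustrative path
                long retentionMs = 30L * 24 * 60 * 60 * 1000;   // e.g. keep 30 days
                long cutoff = System.currentTimeMillis() - retentionMs;

                FileSystem fs = FileSystem.get(new Configuration());
                for (FileStatus status : fs.listStatus(root)) {
                  if (status.isFile() && status.getModificationTime() < cutoff) {
                    // As noted above, this frees no space for blocks still held by a snapshot.
                    fs.delete(status.getPath(), false);
                  }
                }
                fs.close();
              }
            }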

          Zesheng Wu added a comment -

          Thanks Chris Nauroth. I agree with your example MapReduce scenario and the risk, but this risk can't be avoided even if we use outside tools. For example, suppose we use a nightly cron job as Andrew mentioned: a MapReduce job gets submitted, we derive input splits from a file, and then the file is deleted by the cron job after input split calculation but before the map tasks start running and reading the blocks; the risk is exactly the same. My point is that TTL is just a convenient way to accomplish the tasks described in the proposal; users should learn how to use it correctly, rather than rely on a more complicated approach that offers no obvious advantage.

          Chris Nauroth added a comment -

          I agree with Andrew's opinion that this is better implemented outside the file system. An automatic delete based on a TTL introduces a high risk of concurrency bugs for applications. For example, imagine a MapReduce job gets submitted, we derive input splits from a file, and then the file expires after input split calculation but before the map tasks start running and reading the blocks. Overall, I think it's preferable to put delete into the hands of the calling application for explicit control.

          Zesheng Wu added a comment -

          Thanks Andrew Wang. Of course a nightly cron job can do this for us, but our users have various kinds of data backup requirements, of which log backup is just one; we want to provide a more convenient way for them to satisfy these requirements. Imagine that we have many backup requirements, each with a different TTL configuration. One way to handle this is for each user to maintain his own cron job; another is for the cluster administrator to maintain all the cron jobs for all users. Neither way is very convenient, and both require a lot of manual operational work. If HDFS supports TTL as proposed above, these requirements can be satisfied very easily.

          Andrew Wang added a comment -

          This is just my opinion, but isn't this something better done in userspace? A nightly cron job could do this for you, and log files are typically even already timestamped for easy parsing and removal.
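
          For instance, assuming logs are laid out in date-stamped directories such as /logs/2014-06-01 (an illustrative convention, not something defined by HDFS), the cron-driven job could simply parse the directory name and remove anything older than the retention window. A minimal sketch:

            import java.text.ParseException;
            import java.text.SimpleDateFormat;
            import java.util.Date;
            import org.apache.hadoop.conf.Configuration;
            import org.apache.hadoop.fs.FileStatus;
            import org.apache.hadoop.fs.FileSystem;
            import org.apache.hadoop.fs.Path;

            /** Sketch: remove date-stamped log directories older than 30 days. */
            public class LogDirCleaner {
              public static void main(String[] args) throws Exception {
                SimpleDateFormat fmt = new SimpleDateFormat("yyyy-MM-dd");
                long cutoff = System.currentTimeMillis() - 30L * 24 * 60 * 60 * 1000;

                FileSystem fs = FileSystem.get(new Configuration());
                for (FileStatus status : fs.listStatus(new Path("/logs"))) {  // illustrative layout
                  if (!status.isDirectory()) {
                    continue;
                  }
                  Date stamp;
                  try {
                    stamp = fmt.parse(status.getPath().getName());  // e.g. "2014-06-01"
                  } catch (ParseException e) {
                    continue;  // not a date-stamped directory, leave it alone
                  }
                  if (stamp.getTime() < cutoff) {
                    fs.delete(status.getPath(), true);  // recursive; FileSystem.delete bypasses trash
                  }
                }
                fs.close();
              }
            }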


            People

            • Assignee:
              Zesheng Wu
            • Reporter:
              Zesheng Wu
            • Votes:
              2
            • Watchers:
              28
