[HDFS-14694] Call recoverLease on DFSOutputStream close exception - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 3.4.0
Fix Version/s: 3.4.0
Component/s: hdfs-client
Labels:
None

Target Version/s:

3.4.0, 3.3.2
Hadoop Flags:

Reviewed

Description

HDFS uses file-lease to manage opened files, when a file is not closed normally, NN will recover lease automatically after hard limit exceeded. But for a long running service(e.g. HBase), the hdfs-client will never die and NN don't have any chances to recover the file.

Usually client program needs to handle exceptions by themself to avoid this condition(e.g. HBase automatically call recover lease for files that not closed normally), but in our experience, most services (in our company) don't process this condition properly, which will cause lots of files in abnormal status or even data loss.

This Jira propose to add a feature that call recoverLease operation automatically when DFSOutputSteam close encounters exception. It should be disabled by default, but when somebody builds a long-running service based on HDFS, they can enable this option.

We've add this feature to our internal Hadoop distribution for more than 3 years, it's quite useful according our experience.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HDFS-14694.001.patch
02/Aug/19 08:08
8 kB
Chen Zhang
HDFS-14694.002.patch
02/Aug/19 08:19
7 kB
Chen Zhang
HDFS-14694.003.patch
22/Aug/19 00:04
8 kB
Chen Zhang
HDFS-14694.004.patch
22/Aug/19 09:10
8 kB
Chen Zhang
HDFS-14694.005.patch
25/Aug/20 10:20
8 kB
Lisheng Sun
HDFS-14694.006.patch
26/Aug/20 02:32
8 kB
Lisheng Sun
HDFS-14694.007.patch
30/Aug/20 07:17
8 kB
Lisheng Sun
HDFS-14694.008.patch
01/Sep/20 15:10
7 kB
Lisheng Sun
HDFS-14694.009.patch
02/Sep/20 06:47
7 kB
Lisheng Sun
HDFS-14694.010.patch
04/Sep/20 01:57
8 kB
Lisheng Sun
HDFS-14694.011.patch
04/Sep/20 01:58
8 kB
Lisheng Sun
HDFS-14694.012.patch
07/Sep/20 03:15
9 kB
Lisheng Sun
HDFS-14694.013.patch
07/Sep/20 07:56
9 kB
Lisheng Sun
HDFS-14694.014.patch
07/Sep/20 13:16
9 kB
Lisheng Sun

Issue Links

incorporates

HDFS-15684 EC: Call recoverLease on DFSStripedOutputStream close exception

Resolved

HDFS-15559 Complement initialize member variables in TestHdfsConfigFields#initializeMemberVariables

Resolved

relates to

HDFS-15858 Backport HDFS-14694 to branch-3.1/3.2/3.3

Patch Available

Activity

People

Assignee:: Lisheng Sun

Reporter:: Chen Zhang

Votes:: 0 Vote for this issue

Watchers:: 7 Start watching this issue

Dates

Created:: 02/Aug/19 04:49

Updated:: 27/Jan/24 03:29

Resolved:: 09/Sep/20 13:51