Details
Description
There are two existing ways of getting used space, DU and DF, and both are insufficient.
- Running DU across lots of disks is very expensive, and running all of those processes at the same time creates a noticeable IO spike.
- Running DF is inaccurate when the disk is shared by multiple DataNodes or other services.
Computing the HDFS used space from the ReplicaInfo objects already held in memory in FsDatasetImpl#volumeMap is both cheap and accurate, as sketched below.
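A minimal sketch of the idea, assuming a simplified replica record and map (the names ReplicaSnapshot, addReplica, and getUsed are illustrative, not the actual HDFS API): instead of shelling out to du or statting the filesystem with df, the used space is derived by summing the on-disk block and meta sizes recorded for each replica in an in-memory map analogous to FsDatasetImpl#volumeMap.

```java
import java.util.concurrent.ConcurrentHashMap;

/**
 * Sketch: compute a volume's used space from in-memory replica records
 * instead of running "du" (expensive, causes IO spikes) or "df" (inaccurate
 * when the disk is shared). Names below are hypothetical, not Hadoop's API.
 */
public class ReplicaMapSpaceUsed {

  /** Minimal stand-in for a replica record kept in the volume map. */
  static final class ReplicaSnapshot {
    final long blockBytes;  // bytes of the block file on disk
    final long metaBytes;   // bytes of the checksum/meta file on disk

    ReplicaSnapshot(long blockBytes, long metaBytes) {
      this.blockBytes = blockBytes;
      this.metaBytes = metaBytes;
    }
  }

  /** blockId -> replica record, analogous to FsDatasetImpl#volumeMap. */
  private final ConcurrentHashMap<Long, ReplicaSnapshot> volumeMap =
      new ConcurrentHashMap<>();

  void addReplica(long blockId, long blockBytes, long metaBytes) {
    volumeMap.put(blockId, new ReplicaSnapshot(blockBytes, metaBytes));
  }

  /**
   * Used space = sum of block + meta bytes over all cached replicas.
   * A pure in-memory walk: no du process, no filesystem stat calls.
   */
  long getUsed() {
    long used = 0L;
    for (ReplicaSnapshot r : volumeMap.values()) {
      used += r.blockBytes + r.metaBytes;
    }
    return used;
  }

  public static void main(String[] args) {
    ReplicaMapSpaceUsed volume = new ReplicaMapSpaceUsed();
    volume.addReplica(1001L, 128L * 1024 * 1024, 1_048_583L);
    volume.addReplica(1002L, 64L * 1024 * 1024, 524_295L);
    System.out.println("used bytes = " + volume.getUsed());
  }
}
```

Iterating a live replica map while the DataNode is adding and removing blocks needs care; HDFS-14986 (linked below) tracks a ConcurrentModificationException hit by exactly this kind of cached computation.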
Attachments
Issue Links
- causes
  - HDFS-14986 ReplicaCachingGetSpaceUsed throws ConcurrentModificationException (Resolved)
- is related to
  - HDFS-15174 Optimize ReplicaCachingGetSpaceUsed by reducing unnecessary io operations (Resolved)
  - HDFS-15039 Cache meta file length of FinalizedReplica to reduce call File.length() (Resolved)
- relates to
  - HADOOP-12973 make DU pluggable (Resolved)
  - HADOOP-12974 Create a CachingGetSpaceUsed implementation that uses df (Resolved)
  - HADOOP-9884 Hadoop calling du -sk is expensive (Resolved)