Details
- Type: Improvement
- Status: Resolved
- Priority: Minor
- Resolution: Duplicate
- Affects Version/s: 2.3.0
- Fix Version/s: None
Description
When running a DataNode on a machine with a large disk volume, we found that the du operations from org.apache.hadoop.fs.DU's DURefreshThread consume a significant amount of disk I/O.
Since we use the whole disk for HDFS storage, the volume usage could instead be calculated with the "df" command, which reads filesystem-level counters rather than walking the block directories. Would it make sense to add a "df" option for usage calculation in HDFS (org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.BlockPoolSlice)?
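To illustrate why df is cheaper: du must traverse every block file under the volume, while df-style accounting only reads the partition's capacity and free-space counters. Below is a minimal sketch of that idea using the standard `java.io.File` space methods; `dfUsed` is a hypothetical helper for illustration, not part of the Hadoop API, and it assumes the whole partition is dedicated to HDFS storage (otherwise df overcounts usage by other files on the same partition).

```java
import java.io.File;

public class DfUsageSketch {
    // Hypothetical helper (not Hadoop API): bytes in use on the
    // partition containing `dir`, computed df-style from the
    // filesystem counters. Cost is constant, regardless of how
    // many block files the volume holds.
    static long dfUsed(File dir) {
        return dir.getTotalSpace() - dir.getFreeSpace();
    }

    public static void main(String[] args) {
        File volume = new File(args.length > 0 ? args[0] : ".");
        long used = dfUsed(volume);
        System.out.println("used bytes on partition: " + used);
    }
}
```

A du-style implementation would instead recursively sum `File.length()` over every file in the tree, which is what makes DURefreshThread expensive on large volumes with many blocks. HADOOP-12974 (linked below) introduced a df-based `CachingGetSpaceUsed` implementation along these lines.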
Attachments
Issue Links
- duplicates
  - HADOOP-12974 Create a CachingGetSpaceUsed implementation that uses df (Resolved)
- is related to
  - HDFS-8791 block ID-based DN storage layout can be very slow for datanode on ext4 (Resolved)