Hadoop Common > HADOOP-13065

Add a new interface for retrieving FS and FC Statistics

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.8.0, 3.0.0-alpha1
    • Component/s: fs
    • Labels: None
    • Target Version/s:

      Description

      Currently FileSystem.Statistics exposes the following statistics:
      BytesRead
      BytesWritten
      ReadOps
      LargeReadOps
      WriteOps

      These are in turn exposed as job counters by MapReduce and other frameworks. There is logic within DfsClient to map operations to these counters, which can be confusing; for instance, mkdirs counts as a writeOp.

      Proposed enhancement:
      Add a statistic for each DfsClient operation including create, append, createSymlink, delete, exists, mkdirs, rename and expose them as new properties on the Statistics object. The operation-specific counters can be used for analyzing the load imposed by a particular job on HDFS.
      For example, we can use them to identify jobs that end up creating a large number of files.

      Once this information is available in the Statistics object, the app frameworks like MapReduce can expose them as additional counters to be aggregated and recorded as part of job summary.
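
      The proposal above can be sketched with a thread-safe map of per-operation counters. The following is a self-contained illustration of the idea; the class and field names are hypothetical stand-ins, not the actual Hadoop API:

      ```java
      import java.util.Map;
      import java.util.concurrent.ConcurrentHashMap;
      import java.util.concurrent.atomic.AtomicLong;

      // Illustrative model of per-operation statistics (not real Hadoop code).
      public class PerOpCountersDemo {
          // One counter per client operation name, created lazily.
          static final Map<String, AtomicLong> OPS = new ConcurrentHashMap<>();

          // Called by the client once per operation, e.g. record("mkdirs").
          static void record(String op) {
              OPS.computeIfAbsent(op, k -> new AtomicLong()).incrementAndGet();
          }

          public static void main(String[] args) {
              // A job that creates many files would stand out by its "create" count.
              for (int i = 0; i < 1000; i++) record("create");
              record("mkdirs");
              record("rename");
              System.out.println("create=" + OPS.get("create")); // create=1000
              System.out.println("mkdirs=" + OPS.get("mkdirs")); // mkdirs=1
          }
      }
      ```

      A framework could then walk the map and emit each entry as a job counter.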

      Attachments

      1. HADOOP-13065.013.patch
        79 kB
        Mingliang Liu
      2. HADOOP-13065.012.patch
        79 kB
        Mingliang Liu
      3. HADOOP-13065.011.patch
        79 kB
        Mingliang Liu
      4. HADOOP-13065.010.patch
        77 kB
        Mingliang Liu
      5. HADOOP-13065.009.patch
        76 kB
        Mingliang Liu
      6. HADOOP-13065.008.patch
        73 kB
        Mingliang Liu
      7. HADOOP-13065-007.patch
        20 kB
        Colin P. McCabe
      8. HDFS-10175.006.patch
        8 kB
        Colin P. McCabe
      9. HDFS-10175.005.patch
        53 kB
        Mingliang Liu
      10. HDFS-10175.004.patch
        49 kB
        Mingliang Liu
      11. TestStatisticsOverhead.java
        3 kB
        Colin P. McCabe
      12. HDFS-10175.003.patch
        49 kB
        Mingliang Liu
      13. HDFS-10175.002.patch
        47 kB
        Mingliang Liu
      14. HDFS-10175.001.patch
        48 kB
        Mingliang Liu
      15. HDFS-10175.000.patch
        51 kB
        Mingliang Liu


          Activity

          fabbri Aaron Fabbri added a comment -

          Oops, not sure how I accidentally clicked Assign To Me. Thanks for fixing that, Hitesh Shah.

          mingma Ming Ma added a comment -

          Mingliang Liu, Colin P. McCabe: yeah, the perf could be an issue. Regarding moving it to ReadStatistics, the problem is that there is no easy way for the MR framework to get hold of the HDFS streams used by the application.

          stevel@apache.org Steve Loughran added a comment -

          Looks like this is breaking Jenkins in HDFS-10418; can someone take a look at it? Thanks.

          cmccabe Colin P. McCabe added a comment -

          I filed HADOOP-13140 to track the effort.

          Thanks, Mingliang Liu.

          Ming Ma wrote: BTW, are the network distance metrics general to all file systems, or more specific to HDFS? For example, the local file system doesn't need them. If they are HDFS-specific, I wonder if we should move them to HDFS-specific metrics.

          I agree that it's more conceptually consistent to put the distance-related metrics in HDFS-specific code. However, we would have to develop an optimized thread-local mechanism to do this, to avoid causing a performance regression in HDFS stream performance. Perhaps it would be better to simply move this to HDFS's existing per-stream ReadStatistics for now.

          liuml07 Mingliang Liu added a comment -

          Thanks Ming Ma for the kind reminder.

          IMHO the topology-aware read stats are HDFS-only. You worked on the related jiras and can give a more informed opinion. If so, we can create a new StorageStatistics, say RackAwareReadStorageStatistics, for the stats in HDFS-9579. We can track this effort in HADOOP-13031 after I update its description. For the implementation, I'm not quite sure whether we can simply use AtomicLongs instead of thread-local optimizations.
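
          As context for the AtomicLong-vs-thread-local question above: the JDK already offers a middle ground, java.util.concurrent.atomic.LongAdder, which stripes increments across cells to reduce contention and only sums them when the statistic is read. A self-contained sketch (not Hadoop code) of that pattern:

          ```java
          import java.util.concurrent.atomic.LongAdder;

          // LongAdder: cheaper than a single contended AtomicLong under
          // heavy multi-threaded increments; the cells are summed on read.
          public class StripedCounterDemo {
              static long countWith(int nThreads, int perThread) throws InterruptedException {
                  LongAdder counter = new LongAdder();
                  Thread[] threads = new Thread[nThreads];
                  for (int i = 0; i < nThreads; i++) {
                      threads[i] = new Thread(() -> {
                          for (int j = 0; j < perThread; j++) counter.increment();
                      });
                      threads[i].start();
                  }
                  for (Thread t : threads) t.join();  // wait, then sum all cells
                  return counter.sum();
              }

              public static void main(String[] args) throws InterruptedException {
                  System.out.println(countWith(4, 10_000)); // 40000
              }
          }
          ```

          Whether this is fast enough to avoid the stream-performance regression Colin mentions would of course need benchmarking.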

          mingma Ming Ma added a comment -

          A minor note, given HDFS-9579 is only available in release 2.9 while this improvement is added to release 2.8, the network distance metrics for HDFS won't be useful until 2.9.

          BTW, are the network distance metrics general to all file systems, or more specific to HDFS? For example, the local file system doesn't need them. If they are HDFS-specific, I wonder if we should move them to HDFS-specific metrics.

          stevel@apache.org Steve Loughran added a comment -

          I'm just hooking this up to S3A, with the actual data being retained in the S3AInstrumentation.

          One thing I'd like is to be confident that there are no retained instances when an FS gets deleted. Does that happen? Or, put differently: how can I get the storage statistics lifecycle to match that of the specific FS instance?

          liuml07 Mingliang Liu added a comment -

          Thank you Brahma Reddy Battula for reporting this error. Jenkins did not report it, and I overlooked that test.

          As Steve Loughran suggested, we need to guard against the null scheme and throw a meaningful error if needed.

          I filed HADOOP-13140 to track the effort.

          stevel@apache.org Steve Loughran added a comment -

          Ouch. That means the key is null, which can happen if URI.getScheme() == null, and that's what's happening in the test:

          fs.initialize(new URI("/"), conf);
          

          Given the test can be fixed to have a valid URI, it's probably not enough of a breakage to merit a (big) rollback. But alongside the test fix, the GlobalStorageStatistics code needs to be set up to handle null schemes with at least a meaningful error.
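
          The meaningful-error guard suggested above could look roughly like this. It is a standalone sketch: the class, method, and message are hypothetical, not the actual GlobalStorageStatistics code. The underlying cause is that TreeMap rejects null keys outright:

          ```java
          import java.net.URI;
          import java.net.URISyntaxException;
          import java.util.TreeMap;

          public class NullSchemeGuardDemo {
              private static final TreeMap<String, String> STATS = new TreeMap<>();

              // TreeMap.get(null) throws a bare NullPointerException, so reject
              // a null scheme up front with a descriptive message instead.
              static String get(String scheme) {
                  if (scheme == null) {
                      throw new IllegalArgumentException(
                          "URI has no scheme; cannot look up storage statistics");
                  }
                  return STATS.get(scheme);
              }

              public static void main(String[] args) throws URISyntaxException {
                  String scheme = new URI("/").getScheme(); // null for a scheme-less URI
                  try {
                      get(scheme);
                  } catch (IllegalArgumentException e) {
                      System.out.println(e.getMessage());
                  }
              }
          }
          ```

          With a guard like this, a test initializing a filesystem with `new URI("/")` fails with an explanatory message rather than an opaque NPE from inside TreeMap.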

          brahmareddy Brahma Reddy Battula added a comment -

          AFAIK the following started failing because of this jira. If you agree, I will raise another jira to track it. The scheme is null here:

          java.lang.NullPointerException: null
          	at java.util.TreeMap.getEntry(TreeMap.java:342)
          	at java.util.TreeMap.get(TreeMap.java:273)
          	at org.apache.hadoop.fs.GlobalStorageStatistics.put(GlobalStorageStatistics.java:73)
          	at org.apache.hadoop.fs.FileSystem.getStatistics(FileSystem.java:3598)
          	at org.apache.hadoop.fs.FileSystem.initialize(FileSystem.java:214)
          	at org.apache.hadoop.fs.RawLocalFileSystem.initialize(RawLocalFileSystem.java:101)
          	at org.apache.hadoop.yarn.server.applicationhistoryservice.TestFileSystemApplicationHistoryStore.initAndStartStore(TestFileSystemApplicationHistoryStore.java:70)
          	at org.apache.hadoop.yarn.server.applicationhistoryservice.TestFileSystemApplicationHistoryStore.setup(TestFileSystemApplicationHistoryStore.java:64)
          
          hudson Hudson added a comment -

          FAILURE: Integrated in Hadoop-trunk-Commit #9748 (See https://builds.apache.org/job/Hadoop-trunk-Commit/9748/)
          HADOOP-13065. Add a new interface for retrieving FS and FC Statistics (cmccabe: rev 687233f20d24c29041929dd0a99d963cec54b6df)

          • hadoop-hdfs-project/hadoop-hdfs-client/src/main/java/org/apache/hadoop/hdfs/DistributedFileSystem.java
          • hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/StorageStatistics.java
          • hadoop-hdfs-project/hadoop-hdfs-client/src/main/java/org/apache/hadoop/hdfs/web/WebHdfsFileSystem.java
          • hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/EmptyStorageStatistics.java
          • hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDistributedFileSystem.java
          • hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/fs/TestHarFileSystem.java
          • hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/UnionStorageStatistics.java
          • hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/FileSystem.java
          • hadoop-hdfs-project/hadoop-hdfs-client/src/main/java/org/apache/hadoop/hdfs/DFSOpsCountStatistics.java
          • hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/FileSystemStorageStatistics.java
          • hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/fs/TestFilterFileSystem.java
          • hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/GlobalStorageStatistics.java
          liuml07 Mingliang Liu added a comment -

          Big thank you, Colin P. McCabe, for your insightful discussion, contributions to the new stats design, and review of the patch. Actually, I'd prefer a commit message like "contributed by Colin Patrick McCabe and Mingliang".

          cmccabe Colin P. McCabe added a comment -

          committed to 2.8. Thanks, Mingliang Liu.

          cmccabe Colin P. McCabe added a comment -

          +1 for version 13. Thanks, Mingliang Liu.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 10s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 3 new or modified test files.
          0 mvndep 1m 25s Maven dependency ordering for branch
          +1 mvninstall 7m 12s trunk passed
          +1 compile 6m 46s trunk passed with JDK v1.8.0_91
          +1 compile 7m 26s trunk passed with JDK v1.7.0_95
          +1 checkstyle 1m 39s trunk passed
          +1 mvnsite 2m 35s trunk passed
          +1 mvneclipse 0m 41s trunk passed
          +1 findbugs 5m 15s trunk passed
          +1 javadoc 2m 24s trunk passed with JDK v1.8.0_91
          +1 javadoc 3m 16s trunk passed with JDK v1.7.0_95
          0 mvndep 0m 14s Maven dependency ordering for patch
          +1 mvninstall 1m 59s the patch passed
          +1 compile 6m 17s the patch passed with JDK v1.8.0_91
          -1 javac 6m 17s root-jdk1.8.0_91 with JDK v1.8.0_91 generated 8 new + 663 unchanged - 0 fixed = 671 total (was 663)
          +1 compile 6m 50s the patch passed with JDK v1.7.0_95
          -1 javac 6m 50s root-jdk1.7.0_95 with JDK v1.7.0_95 generated 8 new + 672 unchanged - 0 fixed = 680 total (was 672)
          -1 checkstyle 1m 28s root: The patch generated 3 new + 435 unchanged - 2 fixed = 438 total (was 437)
          +1 mvnsite 2m 21s the patch passed
          +1 mvneclipse 0m 40s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 5m 58s the patch passed
          +1 javadoc 2m 32s the patch passed with JDK v1.8.0_91
          +1 javadoc 3m 21s the patch passed with JDK v1.7.0_95
          +1 unit 9m 1s hadoop-common in the patch passed with JDK v1.8.0_91.
          +1 unit 1m 9s hadoop-hdfs-client in the patch passed with JDK v1.8.0_91.
          -1 unit 64m 10s hadoop-hdfs in the patch failed with JDK v1.8.0_91.
          -1 unit 7m 45s hadoop-common in the patch failed with JDK v1.7.0_95.
          +1 unit 1m 1s hadoop-hdfs-client in the patch passed with JDK v1.7.0_95.
          -1 unit 55m 57s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          +1 asflicense 0m 26s The patch does not generate ASF License warnings.
          211m 37s



          Reason Tests
          JDK v1.8.0_91 Failed junit tests hadoop.hdfs.server.namenode.TestEditLog
            hadoop.hdfs.server.datanode.TestDataNodeUUID
            hadoop.hdfs.server.balancer.TestBalancer
          JDK v1.7.0_95 Failed junit tests hadoop.net.TestDNS
            hadoop.hdfs.server.balancer.TestBalancer
            hadoop.metrics2.sink.TestRollingFileSystemSinkWithSecureHdfs



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:cf2ee45
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12803315/HADOOP-13065.013.patch
          JIRA Issue HADOOP-13065
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux dd25552accec 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 6b1c1cb
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_91 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          javac https://builds.apache.org/job/PreCommit-HADOOP-Build/9361/artifact/patchprocess/diff-compile-javac-root-jdk1.8.0_91.txt
          javac https://builds.apache.org/job/PreCommit-HADOOP-Build/9361/artifact/patchprocess/diff-compile-javac-root-jdk1.7.0_95.txt
          checkstyle https://builds.apache.org/job/PreCommit-HADOOP-Build/9361/artifact/patchprocess/diff-checkstyle-root.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9361/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_91.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9361/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9361/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HADOOP-Build/9361/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_91.txt https://builds.apache.org/job/PreCommit-HADOOP-Build/9361/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt https://builds.apache.org/job/PreCommit-HADOOP-Build/9361/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HADOOP-Build/9361/testReport/
          modules C: hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs-client hadoop-hdfs-project/hadoop-hdfs U: .
          Console output https://builds.apache.org/job/PreCommit-HADOOP-Build/9361/console
          Powered by Apache Yetus 0.3.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          liuml07 Mingliang Liu added a comment -

          Thanks Colin P. McCabe for the review.

          The v13 patch addresses the null return value problem. Yes, we should return null in this case, indicating the operation is not tracked. Zero is for the case where the operation is tracked but the counter has not yet been incremented. Good catch.

          For this use case, the per-operation stats are per-job and thus shared among different filesystem instances. Based on the current stats design, we can further support per-instance stats in follow-on jiras if new use cases appear.

          cmccabe Colin P. McCabe added a comment - - edited

          Thanks, Mingliang Liu. DFSOpsCountStatistics is a nice implementation. It's nice to have this for webhdfs as well.

          @Override
          public Long getLong(String key) {
            final OpType type = OpType.fromSymbol(key);
            return type == null ? 0L : opsCount.get(type).get();
          }


          I think this should return null in the case where type == null, right? Indicating that there is no such statistic.
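
          A self-contained sketch of that suggested fix; the OpType values and class here are illustrative stand-ins, not the real DFSOpsCountStatistics enum:

          ```java
          import java.util.EnumMap;
          import java.util.concurrent.atomic.AtomicLong;

          public class GetLongFixDemo {
              // Minimal stand-in for the DFSOpsCountStatistics internals.
              enum OpType {
                  CREATE("op_create"), MKDIRS("op_mkdirs");
                  final String symbol;
                  OpType(String symbol) { this.symbol = symbol; }
                  static OpType fromSymbol(String s) {
                      for (OpType t : values()) if (t.symbol.equals(s)) return t;
                      return null;
                  }
              }

              private final EnumMap<OpType, AtomicLong> opsCount = new EnumMap<>(OpType.class);
              { for (OpType t : OpType.values()) opsCount.put(t, new AtomicLong()); }

              // Corrected: null means "no such statistic";
              // 0L means "tracked but not yet incremented".
              public Long getLong(String key) {
                  final OpType type = OpType.fromSymbol(key);
                  return type == null ? null : opsCount.get(type).get();
              }

              public static void main(String[] args) {
                  GetLongFixDemo stats = new GetLongFixDemo();
                  System.out.println(stats.getLong("op_mkdirs")); // 0
                  System.out.println(stats.getLong("op_bogus"));  // null
              }
          }
          ```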

          storageStatistics = (DFSOpsCountStatistics) GlobalStorageStatistics.INSTANCE
              .put(DFSOpsCountStatistics.NAME,
                  new StorageStatisticsProvider() {
                    @Override
                    public StorageStatistics provide() {
                      return new DFSOpsCountStatistics();
                    }
                  });
          

          Hmm, I wonder if these StorageStatistics objects should be per-FS-instance rather than per-class? I guess let's do that in a follow-on, though, after this gets committed.

          +1 for HADOOP-13065.012.patch once the null thing is fixed

          liuml07 Mingliang Liu added a comment -

          The test failures seem unrelated. Jitendra Nath Pandey and Colin P. McCabe, can you kindly comment on the current patch? I'd be happy to work on the follow-up jiras after this one is committed. Thanks.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 12s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 3 new or modified test files.
          0 mvndep 0m 52s Maven dependency ordering for branch
          +1 mvninstall 8m 26s trunk passed
          +1 compile 9m 19s trunk passed with JDK v1.8.0_91
          +1 compile 8m 48s trunk passed with JDK v1.7.0_95
          +1 checkstyle 1m 49s trunk passed
          +1 mvnsite 3m 3s trunk passed
          +1 mvneclipse 0m 48s trunk passed
          +1 findbugs 6m 20s trunk passed
          +1 javadoc 3m 7s trunk passed with JDK v1.8.0_91
          +1 javadoc 4m 13s trunk passed with JDK v1.7.0_95
          0 mvndep 0m 17s Maven dependency ordering for patch
          +1 mvninstall 2m 31s the patch passed
          +1 compile 9m 9s the patch passed with JDK v1.8.0_91
          -1 javac 9m 9s root-jdk1.8.0_91 with JDK v1.8.0_91 generated 8 new + 662 unchanged - 0 fixed = 670 total (was 662)
          +1 compile 8m 49s the patch passed with JDK v1.7.0_95
          -1 javac 8m 49s root-jdk1.7.0_95 with JDK v1.7.0_95 generated 8 new + 672 unchanged - 0 fixed = 680 total (was 672)
          -1 checkstyle 1m 48s root: The patch generated 3 new + 435 unchanged - 2 fixed = 438 total (was 437)
          +1 mvnsite 2m 59s the patch passed
          +1 mvneclipse 0m 48s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 7m 10s the patch passed
          +1 javadoc 3m 1s the patch passed with JDK v1.8.0_91
          +1 javadoc 4m 7s the patch passed with JDK v1.7.0_95
          +1 unit 10m 29s hadoop-common in the patch passed with JDK v1.8.0_91.
          +1 unit 1m 11s hadoop-hdfs-client in the patch passed with JDK v1.8.0_91.
          -1 unit 77m 51s hadoop-hdfs in the patch failed with JDK v1.8.0_91.
          +1 unit 10m 4s hadoop-common in the patch passed with JDK v1.7.0_95.
          +1 unit 1m 12s hadoop-hdfs-client in the patch passed with JDK v1.7.0_95.
          -1 unit 77m 38s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          +1 asflicense 0m 34s The patch does not generate ASF License warnings.
          268m 33s



          Reason Tests
          JDK v1.8.0_91 Failed junit tests hadoop.hdfs.TestFileAppend
            hadoop.hdfs.TestDFSUpgradeFromImage
            hadoop.hdfs.TestAsyncDFSRename
            hadoop.hdfs.server.namenode.TestEditLog
            hadoop.hdfs.server.datanode.TestDataNodeLifeline
            hadoop.hdfs.server.datanode.TestDirectoryScanner
          JDK v1.7.0_95 Failed junit tests hadoop.hdfs.server.namenode.TestNameNodeMetadataConsistency
            hadoop.hdfs.TestAsyncDFSRename
            hadoop.hdfs.server.namenode.TestEditLog
            hadoop.hdfs.server.datanode.TestDirectoryScanner
            hadoop.hdfs.security.TestDelegationTokenForProxyUser



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:cf2ee45
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12802784/HADOOP-13065.012.patch
          JIRA Issue HADOOP-13065
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux de715d737572 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 6957e45
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_91 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          javac https://builds.apache.org/job/PreCommit-HADOOP-Build/9320/artifact/patchprocess/diff-compile-javac-root-jdk1.8.0_91.txt
          javac https://builds.apache.org/job/PreCommit-HADOOP-Build/9320/artifact/patchprocess/diff-compile-javac-root-jdk1.7.0_95.txt
          checkstyle https://builds.apache.org/job/PreCommit-HADOOP-Build/9320/artifact/patchprocess/diff-checkstyle-root.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9320/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_91.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9320/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HADOOP-Build/9320/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_91.txt https://builds.apache.org/job/PreCommit-HADOOP-Build/9320/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HADOOP-Build/9320/testReport/
          modules C: hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs-client hadoop-hdfs-project/hadoop-hdfs U: .
          Console output https://builds.apache.org/job/PreCommit-HADOOP-Build/9320/console
          Powered by Apache Yetus 0.3.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          liuml07 Mingliang Liu added a comment -

          The test failures are unrelated. The javac warnings are caused by deprecating the getStatistics() API, and are therefore expected. The v12 patch removes unused imports.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 11s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 3 new or modified test files.
          0 mvndep 0m 17s Maven dependency ordering for branch
          +1 mvninstall 7m 1s trunk passed
          +1 compile 5m 54s trunk passed with JDK v1.8.0_91
          +1 compile 7m 1s trunk passed with JDK v1.7.0_95
          +1 checkstyle 1m 7s trunk passed
          +1 mvnsite 2m 27s trunk passed
          +1 mvneclipse 0m 41s trunk passed
          +1 findbugs 5m 12s trunk passed
          +1 javadoc 2m 25s trunk passed with JDK v1.8.0_91
          +1 javadoc 3m 14s trunk passed with JDK v1.7.0_95
          0 mvndep 0m 14s Maven dependency ordering for patch
          +1 mvninstall 1m 58s the patch passed
          +1 compile 5m 58s the patch passed with JDK v1.8.0_91
          -1 javac 5m 58s root-jdk1.8.0_91 with JDK v1.8.0_91 generated 8 new + 662 unchanged - 0 fixed = 670 total (was 662)
          +1 compile 6m 46s the patch passed with JDK v1.7.0_95
          -1 javac 6m 46s root-jdk1.7.0_95 with JDK v1.7.0_95 generated 8 new + 672 unchanged - 0 fixed = 680 total (was 672)
          -1 checkstyle 1m 8s root: The patch generated 2 new + 206 unchanged - 2 fixed = 208 total (was 208)
          +1 mvnsite 2m 21s the patch passed
          +1 mvneclipse 0m 41s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 5m 52s the patch passed
          +1 javadoc 2m 25s the patch passed with JDK v1.8.0_91
          +1 javadoc 3m 23s the patch passed with JDK v1.7.0_95
          +1 unit 7m 49s hadoop-common in the patch passed with JDK v1.8.0_91.
          +1 unit 0m 53s hadoop-hdfs-client in the patch passed with JDK v1.8.0_91.
          -1 unit 58m 17s hadoop-hdfs in the patch failed with JDK v1.8.0_91.
          +1 unit 7m 45s hadoop-common in the patch passed with JDK v1.7.0_95.
          +1 unit 0m 59s hadoop-hdfs-client in the patch passed with JDK v1.7.0_95.
          -1 unit 55m 28s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          +1 asflicense 0m 26s The patch does not generate ASF License warnings.
          199m 33s



          Reason Tests
          JDK v1.8.0_91 Failed junit tests hadoop.hdfs.shortcircuit.TestShortCircuitCache
            hadoop.hdfs.server.datanode.fsdataset.impl.TestLazyPersistReplicaRecovery
            hadoop.hdfs.server.datanode.fsdataset.impl.TestFsDatasetImpl
          JDK v1.7.0_95 Failed junit tests hadoop.hdfs.server.datanode.fsdataset.impl.TestFsDatasetImpl



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:cf2ee45
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12802738/HADOOP-13065.011.patch
          JIRA Issue HADOOP-13065
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux d4e0f53b0cfe 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 1c5bbf6
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_91 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          javac https://builds.apache.org/job/PreCommit-HADOOP-Build/9314/artifact/patchprocess/diff-compile-javac-root-jdk1.8.0_91.txt
          javac https://builds.apache.org/job/PreCommit-HADOOP-Build/9314/artifact/patchprocess/diff-compile-javac-root-jdk1.7.0_95.txt
          checkstyle https://builds.apache.org/job/PreCommit-HADOOP-Build/9314/artifact/patchprocess/diff-checkstyle-root.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9314/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_91.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9314/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HADOOP-Build/9314/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_91.txt https://builds.apache.org/job/PreCommit-HADOOP-Build/9314/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HADOOP-Build/9314/testReport/
          modules C: hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs-client hadoop-hdfs-project/hadoop-hdfs U: .
          Console output https://builds.apache.org/job/PreCommit-HADOOP-Build/9314/console
          Powered by Apache Yetus 0.3.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          liuml07 Mingliang Liu added a comment -

          Thanks Colin P. McCabe for the suggestion.

          ...the old API would become deprecated in branch-2, and removed in branch-3.

          That makes perfect sense.

          There isn't any need to change the annotation since we don't plan to modify the interface, just remove it.

          The v11 patch removes the Stable annotation from GlobalStorageStatistics and StorageStatistics.

          ...so that they could be used in both FileContext and FileSystem.

          This is a valid point that I missed before. Will track this in the HADOOP-13031.

          cmccabe Colin P. McCabe added a comment -

          One quick question: some of the storage statistics classes (e.g. GlobalStorageStatistics) are annotated as Stable; do we have to be a bit more conservative by making them Unstable before ultimately removing the Statistics?

          Good question. I think that what would happen is that the old API would become deprecated in branch-2, and removed in branch-3. There isn't any need to change the annotation since we don't plan to modify the interface, just remove it.

          As follow-on work, 1. We can move the rack-awareness read bytes to a separate storage statistics as it's only used by HDFS, and 2. We can remove Statistics API, but keep the thread local implementation in FileSystemStorageStatistics class.

          That makes sense. One thing that we've talked about doing in the past is moving these statistics to a separate java file, so that they could be used in both FileContext and FileSystem. Maybe we could call them something like ThreadLocalFsStatistics or something?

          liuml07 Mingliang Liu added a comment -

          Thanks Colin P. McCabe for the comment. The v10 patch deprecates the getStatistics() API, and fixes the simple checkstyle warning.

          One quick question: some of the storage statistics classes (e.g. GlobalStorageStatistics) are annotated as Stable; do we have to be a bit more conservative and mark them Unstable before ultimately removing the Statistics?

          As follow-on work,

          1. We can move the rack-awareness read bytes to a separate storage statistics class, as it's only used by HDFS.
          2. We can remove the Statistics API, but keep the thread-local implementation in the FileSystemStorageStatistics class.

          I will update the previously filed jiras HADOOP-13032 and HADOOP-13031 accordingly after this patch is in the trunk.
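The thread-local counter pattern mentioned in item 2 above can be sketched as follows. This is a minimal illustration with hypothetical names (ThreadLocalCounter is not the actual FileSystemStorageStatistics code): each thread increments its own counter cell without contention, and a reader aggregates by summing across all registered cells.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

public class ThreadLocalCounter {
    // Every thread's live counter cell, registered so readers can sum them.
    // (A real implementation would also unregister cells of dead threads.)
    private final Set<long[]> allCells = ConcurrentHashMap.newKeySet();
    private final ThreadLocal<long[]> cell = ThreadLocal.withInitial(() -> {
        long[] c = new long[1];
        allCells.add(c);
        return c;
    });

    void increment() {
        cell.get()[0]++;  // uncontended write to this thread's own cell
    }

    long sum() {
        // Aggregate across all threads' cells.
        long total = 0;
        for (long[] c : allCells) {
            total += c[0];
        }
        return total;
    }

    public static void main(String[] args) throws InterruptedException {
        ThreadLocalCounter counter = new ThreadLocalCounter();
        Thread t = new Thread(() -> {
            for (int i = 0; i < 5; i++) {
                counter.increment();
            }
        });
        t.start();
        t.join();
        counter.increment();              // one increment from the main thread
        System.out.println(counter.sum()); // 6
    }
}
```

The benefit is that per-operation increments never synchronize; only the (rare) aggregation pass touches other threads' cells.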

          cmccabe Colin P. McCabe added a comment -

          Thanks for the reviews.

          In FileSystem.getStatistics(), for performance, you could try using a ConcurrentMap for the map, and only if the entry is not present create the objects and call putIfAbsent() (or use a synchronized block to create and update the maps, with a second lookup there to eliminate the small race condition). This will eliminate the sync point on a simple lookup when the entry exists.

          Hmm. I don't think that we really need to optimize this function. When using the new API, the only time this function gets called is when a new FileSystem object is created, which should be very rare.
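For reference, the ConcurrentMap pattern suggested above would look roughly like this. This is a generic sketch with illustrative names (StatsRegistry and the long[] stand-in are not the actual FileSystem code): the common case is a plain get() with no sync point, only a miss pays for object creation, and putIfAbsent() resolves the race where two threads create the entry concurrently.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

public class StatsRegistry {
    private static final ConcurrentMap<String, long[]> MAP =
        new ConcurrentHashMap<>();

    static long[] getOrCreate(String scheme) {
        long[] stats = MAP.get(scheme);  // fast path: no locking
        if (stats == null) {
            long[] created = new long[1];  // stand-in for a statistics object
            long[] raced = MAP.putIfAbsent(scheme, created);
            // If another thread won the race, keep its entry instead of ours.
            stats = (raced == null) ? created : raced;
        }
        return stats;
    }

    public static void main(String[] args) {
        long[] a = getOrCreate("hdfs");
        long[] b = getOrCreate("hdfs");
        System.out.println(a == b);  // repeated lookups return the same entry
    }
}
```

The trade-off noted in the reply stands: this only helps when the lookup is hot, which is not the case if the method runs once per FileSystem instantiation.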

          For testing, a way to reset/remove an entry could be handy.

          We do have some tests that zero out the existing statistics objects. I'm not sure if removing the entry really gets us more coverage than we have now, since we know that it was created by this code path (therefore the code path was tested).

          That said, can we first deprecate FileSystem#getStatistics()?

          Agree.
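The deprecation plan agreed here can be sketched as follows. This is an illustrative stand-alone class, not the real FileSystem source; the method names mirror the discussion but the bodies are simplified. Marking the old accessor @Deprecated is what produces the expected new javac warnings reported by the QA runs above, while the replacement lookup becomes the supported path.

```java
import java.util.HashMap;
import java.util.Map;

public class DeprecationSketch {
    private static final Map<String, Long> STATS = new HashMap<>();

    /** @deprecated use {@link #getStorageStatistics(String)} instead. */
    @Deprecated
    static Map<String, Long> getStatistics() {
        // Old API: exposes the whole mutable map to callers.
        return STATS;
    }

    /** Replacement API: per-scheme lookup instead of exposing the map. */
    static Long getStorageStatistics(String scheme) {
        return STATS.getOrDefault(scheme, 0L);
    }

    public static void main(String[] args) {
        STATS.put("hdfs", 42L);
        System.out.println(getStorageStatistics("hdfs"));
    }
}
```

Callers compiling against the deprecated method keep working in branch-2 but see a warning, which matches the branch-2 deprecate / branch-3 remove plan described above.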

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 12s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 3 new or modified test files.
          0 mvndep 0m 25s Maven dependency ordering for branch
          +1 mvninstall 6m 30s trunk passed
          +1 compile 5m 39s trunk passed with JDK v1.8.0_91
          +1 compile 6m 36s trunk passed with JDK v1.7.0_95
          +1 checkstyle 1m 6s trunk passed
          +1 mvnsite 2m 24s trunk passed
          +1 mvneclipse 0m 41s trunk passed
          +1 findbugs 5m 6s trunk passed
          +1 javadoc 2m 18s trunk passed with JDK v1.8.0_91
          +1 javadoc 3m 12s trunk passed with JDK v1.7.0_95
          0 mvndep 0m 14s Maven dependency ordering for patch
          +1 mvninstall 1m 56s the patch passed
          +1 compile 5m 54s the patch passed with JDK v1.8.0_91
          +1 javac 5m 54s the patch passed
          +1 compile 6m 52s the patch passed with JDK v1.7.0_95
          +1 javac 6m 52s the patch passed
          -1 checkstyle 1m 8s root: The patch generated 1 new + 207 unchanged - 1 fixed = 208 total (was 208)
          +1 mvnsite 2m 22s the patch passed
          +1 mvneclipse 0m 40s the patch passed
          +1 whitespace 0m 0s The patch has no whitespace issues.
          +1 findbugs 5m 49s the patch passed
          +1 javadoc 2m 20s the patch passed with JDK v1.8.0_91
          +1 javadoc 3m 14s the patch passed with JDK v1.7.0_95
          +1 unit 7m 50s hadoop-common in the patch passed with JDK v1.8.0_91.
          +1 unit 0m 51s hadoop-hdfs-client in the patch passed with JDK v1.8.0_91.
          -1 unit 57m 9s hadoop-hdfs in the patch failed with JDK v1.8.0_91.
          -1 unit 7m 40s hadoop-common in the patch failed with JDK v1.7.0_95.
          +1 unit 1m 1s hadoop-hdfs-client in the patch passed with JDK v1.7.0_95.
          -1 unit 54m 36s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          -1 asflicense 0m 26s The patch generated 1 ASF License warnings.
          195m 49s



          Reason Tests
          JDK v1.8.0_91 Failed junit tests hadoop.hdfs.web.TestWebHDFS
            hadoop.hdfs.TestHFlush
            hadoop.hdfs.server.datanode.fsdataset.impl.TestFsDatasetImpl
            hadoop.hdfs.TestDFSUpgradeFromImage
          JDK v1.7.0_95 Failed junit tests hadoop.net.TestDNS
            hadoop.hdfs.server.datanode.fsdataset.impl.TestFsDatasetImpl



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:cf2ee45
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12802355/HADOOP-13065.009.patch
          JIRA Issue HADOOP-13065
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 8b7e00deed59 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 1268cf5
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_91 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HADOOP-Build/9285/artifact/patchprocess/diff-checkstyle-root.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9285/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_91.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9285/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9285/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HADOOP-Build/9285/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_91.txt https://builds.apache.org/job/PreCommit-HADOOP-Build/9285/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt https://builds.apache.org/job/PreCommit-HADOOP-Build/9285/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HADOOP-Build/9285/testReport/
          asflicense https://builds.apache.org/job/PreCommit-HADOOP-Build/9285/artifact/patchprocess/patch-asflicense-problems.txt
          modules C: hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs-client hadoop-hdfs-project/hadoop-hdfs U: .
          Console output https://builds.apache.org/job/PreCommit-HADOOP-Build/9285/console
          Powered by Apache Yetus 0.3.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          liuml07 Mingliang Liu added a comment -

          Thanks Steve Loughran very much for your comment. The v9 patch addresses your second comment on the test (a very nice catch) and the checkstyle and findbugs warnings, along with a simple fix for the related test failures. Again, thanks Colin P. McCabe for the new design.

          As to your concurrent map comment, I shall update the patch once I understand your point. By "the map", I suppose we're talking about GlobalStorageStatistics#map. My concern is that FileSystem.getStatistics() itself is static and synchronized, and GlobalStorageStatistics#map is only looked up and updated if there is no entry in FileSystem#statisticsTable. So if an entry exists, there should be a corresponding entry in FileSystem#statisticsTable, and no lookup is issued. Ideally, as Colin P. McCabe suggested, we should remove FileSystem#Statistics as a public interface, but that would mean heavily refactoring this part of the code.
          That said, could we first deprecate FileSystem#getStatistics()?

          stevel@apache.org Steve Loughran added a comment -

          I like this patch, especially the isTracked() probe.

          in FileSystem.getStatistics()

          1. For performance, you could try using a ConcurrentMap for the map, and only when the entry is not present create the objects and call putIfAbsent() (or use a synchronized block to create and update the maps, with a second lookup there to eliminate the small race condition). This will eliminate the sync point on a simple lookup when the entry exists.
          2. For testing, a way to reset/remove an entry could be handy.
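A minimal sketch of suggestion 1, with a hypothetical StatsRegistry/StatsEntry standing in for FileSystem's real statistics table (only the locking strategy is the point here):

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

public final class StatsRegistry {
  // Hypothetical stand-in for FileSystem.Statistics
  public static final class StatsEntry {
    public final String scheme;
    StatsEntry(String scheme) { this.scheme = scheme; }
  }

  private final ConcurrentMap<String, StatsEntry> map = new ConcurrentHashMap<>();

  public StatsEntry get(String scheme) {
    StatsEntry entry = map.get(scheme);            // lock-free fast path
    if (entry == null) {
      StatsEntry created = new StatsEntry(scheme);
      // Atomic slow path: only one thread's entry wins the race
      StatsEntry existing = map.putIfAbsent(scheme, created);
      entry = (existing != null) ? existing : created;
    }
    return entry;
  }
}
```

Once the entry exists, subsequent lookups never contend on a monitor, unlike a static synchronized getter.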

          In testConcurrentStatistics()

          in the runnables, line 737, there's a fail("Child failed with exception: " + t)

          1. tests shouldn't lose the inner stack trace. Just let it pass through
          2. and, as it will fail in a separate thread, it isn't going to fail the test anyway, as far as I can tell

          Better to catch each exception and store it in a list; once the allDone.await() checkpoint is reached, look at that list, and if it is non-empty, log all the exceptions and then throw the first one. That will promote it to a failure on the test thread.
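A hedged sketch of that pattern (the names runWorkers and ConcurrentTestHarness are illustrative, not from the patch): workers record any throwable in a thread-safe list instead of calling fail(); after the latch trips, the test thread logs them all and rethrows the first, preserving the stack trace on the right thread.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.CountDownLatch;

public final class ConcurrentTestHarness {
  public static void runWorkers(int numThreads, Runnable work) throws Exception {
    final List<Throwable> failures = new CopyOnWriteArrayList<>();
    final CountDownLatch allDone = new CountDownLatch(numThreads);
    for (int i = 0; i < numThreads; i++) {
      new Thread(() -> {
        try {
          work.run();
        } catch (Throwable t) {
          failures.add(t);         // don't fail() here: wrong thread
        } finally {
          allDone.countDown();
        }
      }).start();
    }
    allDone.await();
    if (!failures.isEmpty()) {
      for (Throwable t : failures) {
        t.printStackTrace();       // log every failure...
      }
      // ...then promote the first one to a failure on the test thread
      throw new AssertionError("worker failed", failures.get(0));
    }
  }
}
```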

          stevel@apache.org Steve Loughran added a comment -

          AtomicLong has contention under load; it doesn't compile down to a simple x86 LOCK XADD $address, value, which is all you need. The updater stuff is designed to get closer to this, with generated code eventually looking something like

          address = &object + offset-to-field
          LOCK XADD $address, value
          

          (no, I can't code x86 ASM. I just spent lots of time staring at it trying to debug windows C++ code)

          The Java 9 stuff is actually intended to do something really profound: provide the same operations against arrays, so address = array + (aligned) offset, C's *(array+offset)++, as if the array were an atomic long[].

          The fencing stuff is just there to let people who think they understand memory models, CPU and compiler re-ordering and the like write fast code. I've only ever seen anyone argue for doing that in user-level code once (Twitter's eventually consistent data structures), and their code only worked because they didn't understand that in Java 5+, volatile reads are non-reorderable fences on all accesses (that is, he gets the explanation of why things work wrong). Nobody should be going near that in the Hadoop code at all.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 8m 10s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 1 new or modified test files.
          0 mvndep 1m 33s Maven dependency ordering for branch
          +1 mvninstall 6m 59s trunk passed
          +1 compile 6m 28s trunk passed with JDK v1.8.0_91
          +1 compile 6m 48s trunk passed with JDK v1.7.0_95
          +1 checkstyle 1m 6s trunk passed
          +1 mvnsite 2m 24s trunk passed
          +1 mvneclipse 0m 45s trunk passed
          +1 findbugs 5m 5s trunk passed
          +1 javadoc 2m 14s trunk passed with JDK v1.8.0_91
          +1 javadoc 3m 15s trunk passed with JDK v1.7.0_95
          0 mvndep 0m 25s Maven dependency ordering for patch
          +1 mvninstall 1m 59s the patch passed
          +1 compile 6m 7s the patch passed with JDK v1.8.0_91
          +1 javac 6m 7s the patch passed
          +1 compile 6m 46s the patch passed with JDK v1.7.0_95
          +1 javac 6m 46s the patch passed
          -1 checkstyle 1m 9s root: The patch generated 33 new + 208 unchanged - 1 fixed = 241 total (was 209)
          +1 mvnsite 2m 18s the patch passed
          +1 mvneclipse 0m 39s the patch passed
          -1 whitespace 0m 0s The patch has 1 line(s) that end in whitespace. Use git apply --whitespace=fix.
          -1 findbugs 1m 51s hadoop-common-project/hadoop-common generated 1 new + 0 unchanged - 0 fixed = 1 total (was 0)
          +1 javadoc 2m 18s the patch passed with JDK v1.8.0_91
          +1 javadoc 3m 7s the patch passed with JDK v1.7.0_95
          -1 unit 6m 26s hadoop-common in the patch failed with JDK v1.8.0_91.
          +1 unit 0m 47s hadoop-hdfs-client in the patch passed with JDK v1.8.0_91.
          -1 unit 58m 18s hadoop-hdfs in the patch failed with JDK v1.8.0_91.
          -1 unit 6m 54s hadoop-common in the patch failed with JDK v1.7.0_95.
          +1 unit 0m 58s hadoop-hdfs-client in the patch passed with JDK v1.7.0_95.
          -1 unit 56m 59s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          -1 asflicense 0m 28s The patch generated 1 ASF License warnings.
          207m 51s



          Reason Tests
          FindBugs module:hadoop-common-project/hadoop-common
            Should org.apache.hadoop.fs.FileSystemStorageStatistics$LongStatisticIterator be a static inner class? At FileSystemStorageStatistics.java:inner class? At FileSystemStorageStatistics.java:[lines 51-78]
          JDK v1.8.0_91 Failed junit tests hadoop.fs.TestFilterFileSystem
            hadoop.fs.TestHarFileSystem
            hadoop.hdfs.TestFileAppend
            hadoop.hdfs.TestHFlush
          JDK v1.7.0_95 Failed junit tests hadoop.fs.TestFilterFileSystem
            hadoop.fs.TestHarFileSystem
            hadoop.hdfs.TestHFlush
            hadoop.hdfs.server.datanode.fsdataset.impl.TestFsDatasetImpl



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:cf2ee45
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12802071/HADOOP-13065.008.patch
          JIRA Issue HADOOP-13065
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 4b827a907233 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / ed54f5f
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_91 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/diff-checkstyle-root.txt
          whitespace https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/whitespace-eol.txt
          findbugs https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/new-findbugs-hadoop-common-project_hadoop-common.html
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_91.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_91.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_91.txt https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_91.txt https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/testReport/
          asflicense https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/artifact/patchprocess/patch-asflicense-problems.txt
          modules C: hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs-client hadoop-hdfs-project/hadoop-hdfs U: .
          Console output https://builds.apache.org/job/PreCommit-HADOOP-Build/9268/console
          Powered by Apache Yetus 0.3.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          liuml07 Mingliang Liu added a comment -

          Hi Colin P. McCabe, I was implementing the initial use case on top of the newly designed API. Two questions about the v8 patch baffle me. 1) How do we maintain a shared op->count storage statistic for all file system objects and threads? I think our use case does not need per-FileSystem stats like S3A; my first idea was to register a single instance with the global storage statistics. 2) How do we implement the single counter for each operation? As we need atomic increments across threads, I'm wondering how a volatile long comes into play. I agree with your previous comment that the thread-local implementation is not ideal for this use case, as the RPC call will generally dominate the total overhead anyway. If so, an AtomicLong would work just fine.

          Do you have any quick comments about this? Thanks.
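A minimal sketch of the idea in questions 1) and 2) above, assuming one shared op -> count table of AtomicLong counters registered once globally (the class and enum names here are hypothetical; the patch's actual classes may differ):

```java
import java.util.EnumMap;
import java.util.Map;
import java.util.concurrent.atomic.AtomicLong;

public final class DfsOpCounters {
  // Hypothetical subset of DfsClient operations
  public enum OpType { CREATE, APPEND, DELETE, EXISTS, MKDIRS, RENAME }

  private final Map<OpType, AtomicLong> counts = new EnumMap<>(OpType.class);

  public DfsOpCounters() {
    for (OpType op : OpType.values()) {
      counts.put(op, new AtomicLong(0));   // table is effectively immutable after this
    }
  }

  public void increment(OpType op) {
    counts.get(op).incrementAndGet();      // atomic; the RPC cost dwarfs this anyway
  }

  public long get(OpType op) {
    return counts.get(op).get();
  }
}
```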

          cmccabe Colin P. McCabe added a comment -

          Interesting post... I wasn't aware that AtomicLong etc. had performance issues.

          However, I don't think we need an API for updating metrics. We only need an API for reading metrics. The current read API in this patch supports reading primitive longs, which should work well with AtomicLongFieldUpdater, or whatever else we want to use.

          stevel@apache.org Steve Loughran added a comment -

          Related to this, I've come across some details about how the reflection-based AtomicLongFieldUpdater is going to get a lot closer in performance to non-atomic volatile long++ updates: http://shipilev.net/blog/2015/faster-atomic-fu/

          This argues in favour of using this operation for updating metrics directly... it'd mean that the variables could just be simple volatile long fields, with a static AtomicLongFieldUpdater used to update them all:

          import java.util.concurrent.atomic.AtomicLongFieldUpdater;

          public final class ReadMetrics {

            private static final AtomicLongFieldUpdater<ReadMetrics> bytesReadUpdater =
                AtomicLongFieldUpdater.newUpdater(ReadMetrics.class, "bytesRead");
            private volatile long bytesRead;

            public long incBytesRead(long count) {
              return bytesReadUpdater.addAndGet(this, count);
            }
          }

          HBase uses this in org.apache.hadoop.hbase.util.Counter, where it's described as "High scalable counter. Thread safe."

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 10s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          -1 test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
          +1 mvninstall 6m 48s trunk passed
          +1 compile 6m 9s trunk passed with JDK v1.8.0_91
          +1 compile 7m 1s trunk passed with JDK v1.7.0_95
          +1 checkstyle 0m 23s trunk passed
          +1 mvnsite 1m 4s trunk passed
          +1 mvneclipse 0m 15s trunk passed
          +1 findbugs 1m 41s trunk passed
          +1 javadoc 0m 52s trunk passed with JDK v1.8.0_91
          +1 javadoc 1m 4s trunk passed with JDK v1.7.0_95
          +1 mvninstall 0m 42s the patch passed
          +1 compile 6m 11s the patch passed with JDK v1.8.0_91
          -1 javac 6m 11s root-jdk1.8.0_91 with JDK v1.8.0_91 generated 1 new + 738 unchanged - 1 fixed = 739 total (was 739)
          +1 compile 7m 0s the patch passed with JDK v1.7.0_95
          -1 javac 7m 0s root-jdk1.7.0_95 with JDK v1.7.0_95 generated 1 new + 735 unchanged - 1 fixed = 736 total (was 736)
          -1 checkstyle 0m 24s hadoop-common-project/hadoop-common: The patch generated 32 new + 128 unchanged - 1 fixed = 160 total (was 129)
          +1 mvnsite 0m 58s the patch passed
          +1 mvneclipse 0m 13s the patch passed
          -1 whitespace 0m 0s The patch has 4 line(s) that end in whitespace. Use git apply --whitespace=fix.
          -1 findbugs 1m 55s hadoop-common-project/hadoop-common generated 1 new + 0 unchanged - 0 fixed = 1 total (was 0)
          +1 javadoc 0m 53s the patch passed with JDK v1.8.0_91
          +1 javadoc 1m 5s the patch passed with JDK v1.7.0_95
          -1 unit 6m 43s hadoop-common in the patch failed with JDK v1.8.0_91.
          -1 unit 7m 4s hadoop-common in the patch failed with JDK v1.7.0_95.
          +1 asflicense 0m 25s The patch does not generate ASF License warnings.
          60m 14s



          Reason Tests
          FindBugs module:hadoop-common-project/hadoop-common
            Should org.apache.hadoop.fs.FileSystemStorageStatistics$LongStatisticIterator be a static inner class? At FileSystemStorageStatistics.java:inner class? At FileSystemStorageStatistics.java:[lines 51-78]
          JDK v1.8.0_91 Failed junit tests hadoop.fs.TestFilterFileSystem
            hadoop.fs.TestHarFileSystem
          JDK v1.7.0_95 Failed junit tests hadoop.fs.TestFilterFileSystem
            hadoop.fs.TestHarFileSystem



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:cf2ee45
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12801162/HADOOP-13065-007.patch
          JIRA Issue HADOOP-13065
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 97485597795f 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 7da540d
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_91 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          javac https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/diff-compile-javac-root-jdk1.8.0_91.txt
          javac https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/diff-compile-javac-root-jdk1.7.0_95.txt
          checkstyle https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/diff-checkstyle-hadoop-common-project_hadoop-common.txt
          whitespace https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/whitespace-eol.txt
          findbugs https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/new-findbugs-hadoop-common-project_hadoop-common.html
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_91.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_91.txt https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/testReport/
          modules C: hadoop-common-project/hadoop-common U: hadoop-common-project/hadoop-common
          Console output https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/console
          Powered by Apache Yetus 0.3.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          Show
          hadoopqa Hadoop QA added a comment -
          -1 overall

          Vote Subsystem Runtime Comment
          0 reexec 0m 10s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          -1 test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
          +1 mvninstall 6m 48s trunk passed
          +1 compile 6m 9s trunk passed with JDK v1.8.0_91
          +1 compile 7m 1s trunk passed with JDK v1.7.0_95
          +1 checkstyle 0m 23s trunk passed
          +1 mvnsite 1m 4s trunk passed
          +1 mvneclipse 0m 15s trunk passed
          +1 findbugs 1m 41s trunk passed
          +1 javadoc 0m 52s trunk passed with JDK v1.8.0_91
          +1 javadoc 1m 4s trunk passed with JDK v1.7.0_95
          +1 mvninstall 0m 42s the patch passed
          +1 compile 6m 11s the patch passed with JDK v1.8.0_91
          -1 javac 6m 11s root-jdk1.8.0_91 with JDK v1.8.0_91 generated 1 new + 738 unchanged - 1 fixed = 739 total (was 739)
          +1 compile 7m 0s the patch passed with JDK v1.7.0_95
          -1 javac 7m 0s root-jdk1.7.0_95 with JDK v1.7.0_95 generated 1 new + 735 unchanged - 1 fixed = 736 total (was 736)
          -1 checkstyle 0m 24s hadoop-common-project/hadoop-common: The patch generated 32 new + 128 unchanged - 1 fixed = 160 total (was 129)
          +1 mvnsite 0m 58s the patch passed
          +1 mvneclipse 0m 13s the patch passed
          -1 whitespace 0m 0s The patch has 4 line(s) that end in whitespace. Use git apply --whitespace=fix.
          -1 findbugs 1m 55s hadoop-common-project/hadoop-common generated 1 new + 0 unchanged - 0 fixed = 1 total (was 0)
          +1 javadoc 0m 53s the patch passed with JDK v1.8.0_91
          +1 javadoc 1m 5s the patch passed with JDK v1.7.0_95
          -1 unit 6m 43s hadoop-common in the patch failed with JDK v1.8.0_91.
          -1 unit 7m 4s hadoop-common in the patch failed with JDK v1.7.0_95.
          +1 asflicense 0m 25s The patch does not generate ASF License warnings.
          60m 14s

          Reason Tests
          FindBugs module:hadoop-common-project/hadoop-common
            Should org.apache.hadoop.fs.FileSystemStorageStatistics$LongStatisticIterator be a static inner class? At FileSystemStorageStatistics.java: [lines 51-78]
          JDK v1.8.0_91 Failed junit tests
            hadoop.fs.TestFilterFileSystem
            hadoop.fs.TestHarFileSystem
          JDK v1.7.0_95 Failed junit tests
            hadoop.fs.TestFilterFileSystem
            hadoop.fs.TestHarFileSystem

          Subsystem Report/Notes
          Docker Image yetus/hadoop:cf2ee45
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12801162/HADOOP-13065-007.patch
          JIRA Issue HADOOP-13065
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 97485597795f 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 7da540d
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_91 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          javac https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/diff-compile-javac-root-jdk1.8.0_91.txt
          javac https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/diff-compile-javac-root-jdk1.7.0_95.txt
          checkstyle https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/diff-checkstyle-hadoop-common-project_hadoop-common.txt
          whitespace https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/whitespace-eol.txt
          findbugs https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/new-findbugs-hadoop-common-project_hadoop-common.html
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_91.txt
          unit https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_91.txt https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/testReport/
          modules C: hadoop-common-project/hadoop-common U: hadoop-common-project/hadoop-common
          Console output https://builds.apache.org/job/PreCommit-HADOOP-Build/9233/console
          Powered by Apache Yetus 0.3.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.
          cmccabe Colin P. McCabe added a comment -

          Thanks, Mingliang Liu.

          Based on the discussion today, it sounds like we would like to have both global statistics per FS class, and per-instance statistics for an individual FS or FC instance. The rationale for this is that in some cases we might want to differentiate between, say, the stats when talking to one s3 bucket, and another s3 bucket. Or another example is the stats talking to one HDFS FS versus another HDFS FS (if we are using federation, or just multiple HDFS instances).

          We talked a bit about metrics2, but there were several things that made it not a good fit for this statistics interface. One issue is that metrics2 assumes that statistics are permanent once created. Effectively, it keeps them around until the JVM terminates. metrics2 also tends to use a fair amount of memory and require a fair amount of boilerplate code compared to other solutions. Finally, because it is global, it can't do per-instance stats very effectively.

          It would be nice for the new statistics interface to provide the same stats which are currently provided by FileSystem#Statistics. This would allow us to deprecate and eventually remove FileSystem#Statistics as a public interface (although we might keep the implementation). This could be done only in a new release of Hadoop, of course. We also talked about the benefits of providing an iterator over all statistics rather than a map of all statistics. Relatedly, we talked about the desire to have a new interface that was abstract enough to accommodate new, more efficient implementations in the future.

          For now, the new interface will deal with per-FS stats, but not per-stream ones. We should revisit per-stream statistics later.
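The iterator-oriented design summarized above is what landed as StorageStatistics in the attached patches. As a rough illustration only (simplified names, not the actual Hadoop API), such an interface might look like this; note how the iterator produces each statistic on demand instead of materializing a map:

```java
import java.util.Iterator;
import java.util.Map;
import java.util.NoSuchElementException;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicLong;

/** Hypothetical sketch of an iterator-based per-instance statistics API. */
abstract class StorageStatisticsSketch {
  /** A single (name, value) statistic pair. */
  public static final class LongStatistic {
    private final String name;
    private final long value;
    public LongStatistic(String name, long value) {
      this.name = name;
      this.value = value;
    }
    public String getName() { return name; }
    public long getValue() { return value; }
  }

  /** Iterate over all statistics without building a snapshot map. */
  public abstract Iterator<LongStatistic> getLongStatistics();
}

/** Toy implementation backed by a concurrent map of counters. */
class CounterStatistics extends StorageStatisticsSketch {
  private final ConcurrentHashMap<String, AtomicLong> counters =
      new ConcurrentHashMap<>();

  public void increment(String name) {
    counters.computeIfAbsent(name, k -> new AtomicLong()).incrementAndGet();
  }

  @Override
  public Iterator<LongStatistic> getLongStatistics() {
    final Iterator<Map.Entry<String, AtomicLong>> it =
        counters.entrySet().iterator();
    // Each element is produced lazily, so memory use is O(1) in the
    // number of statistics, no matter how many counters exist.
    return new Iterator<LongStatistic>() {
      @Override public boolean hasNext() { return it.hasNext(); }
      @Override public LongStatistic next() {
        if (!it.hasNext()) throw new NoSuchElementException();
        Map.Entry<String, AtomicLong> e = it.next();
        return new LongStatistic(e.getKey(), e.getValue().get());
      }
    };
  }
}
```

Per-instance statistics then fall out naturally: each FileSystem or FileContext instance holds its own CounterStatistics-like object, while a global view can aggregate across instances of the same FS class.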

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 0s Docker mode activated.
          -1 docker 0m 3s Docker failed to build yetus/hadoop:7b1c37a.



          Subsystem Report/Notes
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12799860/HDFS-10175.006.patch
          JIRA Issue HADOOP-13065
          Console output https://builds.apache.org/job/PreCommit-HADOOP-Build/9194/console
          Powered by Apache Yetus 0.3.0-SNAPSHOT http://yetus.apache.org

          This message was automatically generated.

          liuml07 Mingliang Liu added a comment -

          Per offline discussion, moved from HDFS project to Hadoop Common project.

          cmccabe Colin P. McCabe added a comment -

          BTW, sorry for the last-minute-ness of this scheduling, Mingliang Liu and Steve Loughran.

          Webex here at 10:30:

          HDFS-10175 webex
          Wednesday, April 27, 2016
          10:30 am | Pacific Daylight Time (San Francisco, GMT-07:00) | 1 hr

          JOIN WEBEX MEETING
          https://cloudera.webex.com/cloudera/j.php?MTID=mebca25435f158dec71b2589561e71b29
          Meeting number: 294 963 170 Meeting password: 1234

          JOIN BY PHONE 1-650-479-3208 Call-in toll number (US/Canada) Access code: 294 963 170 Global call-in numbers: https://cloudera.webex.com/cloudera/globalcallin.php?serviceType=MC&ED=45642173&tollFree=0

          cmccabe Colin P. McCabe added a comment -

          Great. Let me add a webex

          stevel@apache.org Steve Loughran added a comment -

          One thing that is a bit concerning about metrics2 is that I think people feel that this interface should be stable (i.e. don't remove or alter things once they're in), which would be a big constraint on us.

          Ops teams don't like metrics they rely on being taken away; they also view published metrics as the API. As the compatibility docs say "Metrics should preserve compatibility within the major release."

          Perhaps we could document that per-fs stats were @Public @Evolving rather than stable

          +1 to that, though it'll be important not to break binary compatibility with external filesystems.

          FWIW, the issue I have with metrics2, apart from "steve doesn't understand the design fully", is its preference for singletons registered with JMX. This makes sense in deployed services, not for tests.

          Do we have any ideas about how Spark will consume these metrics in the longer term?

          Spark is a Coda Hale instrumented codebase, as are many other apps these days. Integration between hadoop metrics of any form and the Coda Hale libraries would be something to address, but not here.

          stevel@apache.org Steve Loughran added a comment -

          Wednesday, April 26, 10:30 AM PST; 18:30 UK, 17:30 GMT works for me.

          Webex binding?

          if you don't specify one, mine is at https://hortonworks.webex.com/meet/stevel

          liuml07 Mingliang Liu added a comment -

          Either tomorrow or next week will work for me. Thanks.

          cmccabe Colin P. McCabe added a comment -

          Hi Steve Loughran, does 10:30AM work tomorrow? Unfortunately I'll be out on Thursday and most of Friday, so if we can't do tomorrow we'd have to do Friday afternoon or early next week.

          cmccabe Colin P. McCabe added a comment -

          I see that, but stream-level counters are essential at least for the tests which verify forward and lazy seeks. Which means that yes, they do have to go into the 2.8.0 release. What I can do is set up the scope so that they are package private, then, in the test code, implement the assertions about metric-derived state into that package.

          I guess my hope here is that whatever mechanism we come up with is something that could easily be integrated into the upcoming 2.8 release. Since we have talked about requiring our new metrics to not modify existing stable public interfaces, that seems very reasonable.

          One thing that is a bit concerning about metrics2 is that I think people feel that this interface should be stable (i.e. don't remove or alter things once they're in), which would be a big constraint on us. Perhaps we could document that per-fs stats were @Public @Evolving rather than stable?

          Regarding the metrics2 instrumentation in HADOOP-13028, I'm aggregating the stream statistics back into the metrics 2 data. That's something which isn't needed for the hadoop tests, but which I'm logging in spark test runs, such as (formatted for readability):

          Do we have any ideas about how Spark will consume these metrics in the longer term? Do they prefer to go through metrics2, for example? I definitely don't object to putting this kind of stuff in metrics2, but if we go that route, we have to accept that we'll just get global (or at best per-fs-type) statistics, rather than per-fs-instance statistics. Is that acceptable? So far, nobody has spoken up strongly in favor of per-fs-instance statistics.

          cmccabe Colin P. McCabe added a comment -

          I prefer earlier (being in UK time and all); I could do the first half hour of the webex [at 12:30pm]

          How about 10:30AM PST to noon on tomorrow on Wednesday?

          stevel@apache.org Steve Loughran added a comment -

          One piece of background here is what I'm currently exploring in terms of Metrics-first testing, that is, instrumenting code and using its observed state in both unit and system tests. I've done it in Slider (SLIDER-82) and Spark (SPARK-7889) and found it highly effective for writing deterministic tests which also provide information parseable by test runners for better analysis of test runs.

          I understand your eagerness to get the s3 stats in, but I would rather not proliferate more statistics interfaces if possible. Once they're in, we really can't get rid of them, and it becomes very confusing and clunky.

          I see that, but stream-level counters are essential at least for the tests which verify forward and lazy seeks. Which means that yes, they do have to go into the 2.8.0 release. What I can do is set up the scope so that they are package private, then, in the test code, implement the assertions about metric-derived state into that package.

          Regarding the metrics2 instrumentation in HADOOP-13028, I'm aggregating the stream statistics back into the metrics 2 data. That's something which isn't needed for the hadoop tests, but which I'm logging in spark test runs, such as (formatted for readability):

          2016-04-26 12:08:25,901  executor.Executor Running task 0.0 in stage 0.0 (TID 0)
          2016-04-26 12:08:25,924  rdd.HadoopRDD Input split: s3a://landsat-pds/scene_list.gz:0+20430493
          2016-04-26 12:08:26,107  compress.CodecPool - Got brand-new decompressor [.gz]
          2016-04-26 12:08:32,304  executor.Executor Finished task 0.0 in stage 0.0 (TID 0). 
            2643 bytes result sent to driver
          2016-04-26 12:08:32,311  scheduler.TaskSetManager Finished task 0.0 in stage 0.0 (TID 0)
            in 6434 ms on localhost (1/1)
          2016-04-26 12:08:32,312  scheduler.TaskSchedulerImpl Removed TaskSet 0.0, whose tasks
            have all completed, from pool 
          2016-04-26 12:08:32,315  scheduler.DAGScheduler ResultStage 0 finished in 6.447 s
          2016-04-26 12:08:32,319  scheduler.DAGScheduler Job 0 finished took 6.560166 s
          2016-04-26 12:08:32,320  s3.S3aIOSuite  size of s3a://landsat-pds/scene_list.gz = 464105
            rows read in 6779125000 nS
          
          2016-04-26 12:08:32,324 s3.S3aIOSuite Filesystem statistics
            S3AFileSystem{uri=s3a://landsat-pds,
            workingDir=s3a://landsat-pds/user/stevel,
            partSize=104857600, enableMultiObjectsDelete=true,
            multiPartThreshold=2147483647,
            statistics {
              20430493 bytes read,
               0 bytes written,
               3 read ops,
               0 large read ops,
               0 write ops},
               metrics {{Context=S3AFileSystem}
                {FileSystemId=29890500-aed6-4eb8-bb47-0c896a66aac2-landsat-pds}
                {fsURI=s3a://landsat-pds/scene_list.gz}
                {streamOpened=1}
                {streamCloseOperations=1}
                {streamClosed=1}
                {streamAborted=0}
                {streamSeekOperations=0}
                {streamReadExceptions=0}
                {streamForwardSeekOperations=0}
                {streamBackwardSeekOperations=0}
                {streamBytesSkippedOnSeek=0}
                {streamBytesRead=20430493}
                {streamReadOperations=1488}
                {streamReadFullyOperations=0}
                {streamReadOperationsIncomplete=1488}
                {files_created=0}
                {files_copied=0}
                {files_copied_bytes=0}
                {files_deleted=0}
                {directories_created=0}
                {directories_deleted=0}
                {ignored_errors=0} 
                }}
          

          The spark code isn't accessing these metrics, though it could if it tried hard (went to the metric registry).

          It's publishing those stream level operations which I think you are most concerned about; the other metrics are roughly a subset of those already in Azure's metrics2 instrumentation. Accordingly, I will modify the S3 instrumentation to not register the stream operations as metrics2 counters, retaining them internally and in the toString value.

          I hope that's enough to satisfy your concerns while still retaining the information I need in s3a functionality and testing.

          stevel@apache.org Steve Loughran added a comment -

          I prefer earlier (being in UK time and all); I could do the first half hour of the webex

          liuml07 Mingliang Liu added a comment -

          Thanks for proposing the discussion. The time works for me.

          cmccabe Colin P. McCabe added a comment -

          We already have three statistics interfaces:
          1. FileSystem#Statistics
          2. DFSInputStream#ReadStatistics
          3. metrics2 etc.

          #1 existed for a very long time and is tied into MR in the ways discussed above. I didn't create it, but I did implement the thread-local optimization, based on some performance issues we were having.

          I have to take the blame for adding #2, in HDFS-4698. At the time, the main focus was on ensuring we were doing short-circuit reads, which didn't really fit into #1. And like you, I felt that it was "very low-level stream behavior" that was decoupled from the rest of the stats.

          Of course #3 has been around a while, and is used more generally than just in our storage code.

          I understand your eagerness to get the s3 stats in, but I would rather not proliferate more statistics interfaces if possible. Once they're in, we really can't get rid of them, and it becomes very confusing and clunky.

          Are you guys free for a webex on Wednesday afternoon? Maybe 12:30pm to 2pm?

          stevel@apache.org Steve Loughran added a comment -

          yes, could have a talk. I actually think this is both related to and different from the s3a stuff... that is, it can go in without this, as it is currently more about some overall metrics and some very low level counters for testing stream behaviour (rather than for direct reporting, though that could come)

          1. some JMX-able stats for each s3a instance (taken from Azure, not per-thread)
          2. per-stream stats (volatile, merged with s3a stats above when stream is close()d)
          3. tests to verify behaviour of the FS (e.g. lazy seek doesn't open files, read-ahead skips bytes, etc)

          I agree about consistency; for the stream ones I've done the fields are tagged as volatile. These are merged into the JMX-layer instrumentation at close time, so sync costs are low
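The pattern described here, volatile per-stream fields written by the stream's own thread and merged into instance-level aggregates only at close time, might be sketched roughly as follows (class and field names are made up for illustration):

```java
import java.util.concurrent.atomic.AtomicLong;

/**
 * Sketch: per-stream counters kept as plain volatile fields (only the
 * stream's own thread writes them, so no atomic RMW is needed; volatile
 * gives other threads visibility), merged into shared aggregates on close.
 */
class StreamStatisticsSketch {
  // Aggregate counters shared by all streams of one FS instance.
  static final AtomicLong totalBytesRead = new AtomicLong();
  static final AtomicLong totalSeekOps = new AtomicLong();

  // Per-stream counters; cheap to update on the hot read path.
  volatile long bytesRead;
  volatile long seekOps;

  void onRead(int n) { bytesRead += n; }
  void onSeek() { seekOps++; }

  /** Merge this stream's counters into the aggregates, as on close(). */
  void close() {
    totalBytesRead.addAndGet(bytesRead);
    totalSeekOps.addAndGet(seekOps);
  }
}
```

Synchronization cost is paid once per stream lifetime rather than once per read, which is why the sync costs stay low.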

          cmccabe Colin P. McCabe added a comment - - edited

          Thanks for commenting, Steve Loughran. It's great to see work on s3 stats. s3a has needed love for a while.

          Did you get a chance to look at my HDFS-10175.006.patch on this JIRA? It seems to address all of your concerns. It provides a standard API that every FileSystem can implement (not just s3 or HDFS, etc.). Once we adopt jdk8, we can easily implement this API using java.util.concurrent.atomic.LongAdder if that proves to be more readable and/or efficient.
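For context, the LongAdder option mentioned here trades a slightly stale sum() for much cheaper concurrent increments than AtomicLong under contention. A minimal sketch (hypothetical wrapper class, not from the patch):

```java
import java.util.concurrent.atomic.LongAdder;

/**
 * Sketch: a write-heavy statistics counter backed by LongAdder (jdk8+).
 * LongAdder stripes updates across internal cells, so many threads can
 * increment concurrently without contending on a single CAS loop.
 */
class LongAdderCounter {
  private final LongAdder value = new LongAdder();

  void increment() { value.increment(); }  // cheap, contention-tolerant
  void add(long delta) { value.add(delta); }
  long get() { return value.sum(); }       // sums all stripes; not a snapshot
}
```

The sum() result is not atomic with respect to concurrent increments, which fits the no-snapshot-consistency position taken later in this thread.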

          Don't break any existing filesystem code by adding new params to existing methods, etc.

          I agree. My patch doesn't add new params to any existing methods.

          Move the new code out of FileSystem

          I agree. That's why I separated StorageStatistics.java from FileSystem.java. FileContext should be able to use this API as well, simply by returning a StorageStatistics instance just like FileSystem does.

          Use an int rather than an enum; lets filesystems add their own counters. I hereby reserve 0x200-0x255 for object store operations.

          Hmm. I'm not sure I follow. My patch identifies counters by name (string), not by an int, enum, or byte. This is necessary because different storage backends will want to track different things (s3a wants to track s3 PUTs, HDFS wants to track genstamp bump ops, etc. etc.). We should not try to create the "statistics type enum of doom" in some misguided attempt at space optimization. Consider the case of out-of-tree Filesystem implementations as well... they are not going to add entries to some enum of doom in hadoop-common.
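The point about string-named counters can be made concrete: each backend registers whatever names it needs, with no shared enum in hadoop-common. A toy sketch (counter names below are invented for illustration):

```java
import java.util.Map;
import java.util.TreeMap;

/**
 * Sketch: counters keyed by name, so each storage backend (including
 * out-of-tree FileSystem implementations) can define its own statistics.
 */
class NamedCounters {
  private final Map<String, Long> counters = new TreeMap<>();

  void add(String name, long delta) {
    counters.merge(name, delta, Long::sum);  // create-or-accumulate
  }

  Long get(String name) { return counters.get(name); }
}

/** s3a-flavored stats; "putRequests" is a hypothetical name. */
class S3aStats extends NamedCounters {
  void recordPut() { add("putRequests", 1); }
}

/** HDFS-flavored stats; "genstampBumps" is a hypothetical name. */
class HdfsStats extends NamedCounters {
  void recordGenstampBump() { add("genstampBumps", 1); }
}
```

Consumers that only know about strings (MapReduce counters, test harnesses) can report every statistic they find without compiling against any backend-specific type.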

          public interface StatisticsSource { Map<String, Long> snapshot(); }

          I don't think an API that returns a map is the right approach for statistics. That map could get quite large. We already know that people love adding just one more statistic to things (and often for quite valid reasons). It's very difficult to -1 a patch just because it bloats the statistics map more. Once this API exists, the natural progression will be people adding tons and tons of new entries to it. We should be prepared for this and use an API that doesn't choke if we have tons of stats. We shouldn't have to materialize everything all the time-- an iterator approach is smarter because it can be O(1) in terms of memory, no matter how many entries we have.

          I also don't think we need snapshot consistency for stats. It's a heavy burden for an implementation to carry (it basically requires some kind of materialization into a map, and probably synchronization to stop the world while the materialization is going on). And there is no user demand for it... the current FileSystem#Statistics interface doesn't have it, and nobody has asked for it.

          It seems like you are focusing on the ability to expose new stats to our metrics2 subsystem, while this JIRA originally focused on adding metrics that MapReduce could read at the end of a job. I think these two use-cases can be covered by the same API. We should try to hammer that out (probably as a HADOOP JIRA rather than an HDFS JIRA, as well).

          Do you think we should have a call about this or something? I know some folks who might be interested in testing the s3 metrics stuff, if there was a reasonable API to access it.

          stevel@apache.org Steve Loughran added a comment -

          +one more thing that would be nice would be for all classes which collect stats to implement an interface to publish those stats:

          public interface StatisticsSource {
            Map<String, Long> snapshot();
          }
          

          Why? It's something which filesystem subclasses can implement, as can IO streams, etc. The FSData streams would implement it and pass through to wrapped streams, or return an empty map.

          Having a uniform mechanism like this would make it easy for tests to grab the before/after data, look for diffs, display counts, etc. By avoiding any enums in the returned snapshot, it'd be easy for applications that aren't built against a specific Hadoop version (Spark, etc.) to grab all stats available to the app, without knowing the exact mappings, and present them to users in reports.
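For instance, a test could take a snapshot before and after an operation and diff the two maps. A minimal sketch, assuming the Map<String, Long> shape proposed above (StatsDiff and the counter keys are hypothetical helpers, not part of any patch):

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper: given two snapshots from a StatisticsSource,
// compute the per-key deltas so a test can see what an operation cost.
public class StatsDiff {
  public static Map<String, Long> diff(Map<String, Long> before,
                                       Map<String, Long> after) {
    Map<String, Long> result = new HashMap<>();
    for (Map.Entry<String, Long> e : after.entrySet()) {
      long prev = before.getOrDefault(e.getKey(), 0L);
      long delta = e.getValue() - prev;
      if (delta != 0) {
        result.put(e.getKey(), delta);
      }
    }
    return result;
  }
}
```

A test would then assert on the deltas ("this job did N stream aborts") without knowing the full set of keys in advance.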

          stevel@apache.org Steve Loughran added a comment -
          1. Don't break any existing filesystem code by adding new params to existing methods, etc.
          2. Add the new code outside of FileSystem
          3. Use an int rather than an enum; that lets filesystems add their own counters. I hereby reserve 0x200-0x255 for object store operations.

          With an open int rather than an enum, the map size is dependent upon the active ops, not the possible set. An initial hashmap using the int value as key should work, maybe set the default capacity to that of the "standard" FS ops. entry creation would have to be on demand.

          Alternatively, do fix the #of operations at compile time, and store in an array of volatile[], so per-thread storage is 4 bytes * op * thread, lookup O(1). With the 46 opcodes in the patch, that's 184 bytes/fs/thread.

          Here the increment operation returns the new value, 0 when logging is disabled, or -1 for an unknown opcode. An out-of-range opcode costs an exception raise; the no-counters check costs only the probability and penalty of a branch misprediction.

          public long inc(int opcode, long count) {
            try {
              return counters != null ? counters[opcode] += count : 0;
            } catch (ArrayIndexOutOfBoundsException e) {
              return -1;
            }
          }
          

          In this situation, the #of opcodes is fixed in the hadoop version; I'll just pre-reserve some of the values for object store operations.

          stevel@apache.org Steve Loughran added a comment -

          Yes, contention is an issue, especially against filesystems which respond fast. But contention is not mandatory.

          Coda Hale counters use a com.codahale.metrics.LongAdder class which queues up addition ops under load so threads don't block:

          Under low update contention, the two classes have similar characteristics. But under high contention, expected throughput of this class is significantly higher, at the expense of higher space consumption.

          This class is now built into Java 8 as java.util.concurrent.atomic.LongAdder, alongside java.util.concurrent.atomic.LongAccumulator.

          Even with code built against Java 7, whatever is done here should be designed so that the switch to java 8 should be seamless and transparent. That is: the specific counter implementation hidden. I'd almost advocate using the coda hale one except that it would add a new dependency everywhere; it's only used in a couple of modules right now. And for trunk we may as well switch to Java 8.

          (Life would be so much easier if volatiles implemented atomic add/inc ops the way CPUs allow.)
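The "hide the specific counter implementation" idea above can be sketched as a tiny interface with two interchangeable backings, so switching from AtomicLong to LongAdder needs no caller changes (the names here are illustrative, not from any patch):

```java
import java.util.concurrent.atomic.AtomicLong;
import java.util.concurrent.atomic.LongAdder;

// Callers see only Counter; the backing can change without callers noticing.
interface Counter {
  void add(long delta);
  long value();
}

// Java 7 era backing: a single contended AtomicLong.
class AtomicLongCounter implements Counter {
  private final AtomicLong val = new AtomicLong();
  public void add(long delta) { val.addAndGet(delta); }
  public long value() { return val.get(); }
}

// Java 8 backing: LongAdder spreads updates across cells under contention.
class LongAdderCounter implements Counter {
  private final LongAdder val = new LongAdder();
  public void add(long delta) { val.add(delta); }
  // LongAdder.sum() is not a point-in-time snapshot under concurrent
  // updates, which is acceptable here: stats don't need snapshot consistency.
  public long value() { return val.sum(); }
}
```

Either implementation can back a stats registry; the trade-off is exactly the one quoted above: similar behavior at low contention, higher throughput but more space for LongAdder at high contention.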

          cmccabe Colin P. McCabe added a comment -

          Can I also note that as the @Public @Stable FileSystem is widely subclassed, with its protected statistics field accessed in those subclasses, nobody is allowed to take it or its current methods away. Thanks.

          Yeah, I agree. I would like to see us get more cautious about adding new things to FileSystem#Statistics, though, since I think it's not a good match for most of the new stats we're proposing here.

          There's no per-thread tracking; it's collecting overall stats, rather than trying to add up the cost of a single execution, which is what per-thread stuff would presumably do. This is lower cost but still permits microbenchmark-style analysis of performance problems against S3a. It doesn't directly let you get results of a job, "34MB of data, 2000 stream aborts, 1998 backward seeks", which are the kind of things I'm curious about.

          Overall stats are lower cost in terms of memory consumption, and the cost to read (as opposed to update) a metric. They are higher cost in terms of the CPU consumed for each update of the metric. In particular, for applications that do a lot of stream operations from many different threads, updating an AtomicLong can become a performance bottleneck.

          One of the points that I was making above is that I think it's appropriate for some metrics to be tracked per-thread, but for others, we probably want to use AtomicLong or similar. I would expect that anything that led to an s3 RPC would be appropriate to be tracked by an AtomicLong very easily, since the overhead of the network activity would dwarf the AtomicLong update overhead. And we should have a common interface for getting this information that MR and stats consumers can use.

          Maybe, and this would be nice, whatever is implemented here is (a) extensible to support some duration type too, at least in parallel,

          The interface here supports storing durations as 64-bit numbers of milliseconds, which seems good. It is up to the implementor of the statistic to determine what the 64-bit long represents (average duration in ms, median duration in ms, number of RPCs, etc. etc.) This is similar to metrics2 and jmx, etc. where you have basic types that can be used in a few different ways.

          and (b) could be used as a back end by both Metrics2 and Coda Hale metrics registries. That way the slightly more expensive metric systems would have access to this more raw data.

          Sure. The difficult question is how metrics2 hooks up to metrics which are per FS or per-stream. Should the output of metrics2 reflect the union of all existing FS and stream instances? Some applications open a very large number of streams, so it seems impractical for metrics2 to include all those streams in its output.

          stevel@apache.org Steve Loughran added a comment -

          Can I also note that as the @Public @Stable FileSystem is widely subclassed, with its protected statistics field accessed in those subclasses, nobody is allowed to take it or its current methods away. Thanks.

          stevel@apache.org Steve Loughran added a comment -

          In HADOOP-13028 I've just implemented something for S3A lifted out of Azure's Metrics2 integration; essentially I've got a per-FS-instance set of metrics2 metrics, which are incremented in the FS or input stream as things happen.

          There's no per-thread tracking; it's collecting overall stats, rather than trying to add up the cost of a single execution, which is what per-thread stuff would presumably do. This is lower cost but still permits microbenchmark-style analysis of performance problems against S3a. It doesn't directly let you get results of a job, "34MB of data, 2000 stream aborts, 1998 backward seeks", which are the kind of things I'm curious about.

          I think adding some duration tracking for the blobstore ops would be good too; I used that in [Swift], where it helped show that one of the public endpoints was throttling delete calls and so timing out tests.

          Again, that points more to the classic metrics stuff, or, even better, Coda Hale histograms.

          Maybe, and this would be nice, whatever is implemented here is (a) extensible to support some duration type too, at least in parallel, and (b) could be used as a back end by both Metrics2 and Coda Hale metrics registries. That way the slightly more expensive metric systems would have access to this more raw data.

          mingma Ming Ma added a comment -

          This is a great idea. It also addresses an issue on the consumption side. MR iterates through these counters in all FileSystems regardless of whether they are implemented or not, which will increase the number of MR Counters, causing unnecessary overhead at the MR layer such as rpc, history file, webUI, etc.

          As part of the patch evaluation, it might be useful to try this out with MR end to end. Ideally MR doesn't need to change code each time a new method is added to HDFS ClientProtocol. If so, MR File System Counter needs to be more dynamic; maybe the more descriptive File System Counter string should come from FileSystem, etc. It might help to flush out some of the details.

          liuml07 Mingliang Liu added a comment -

          Hi Colin P. McCabe, I think you proposed an innovative idea for supporting the per-operation stats. I'm willing to change the current design if your idea makes more sense in the long term. I'll prepare a full patch after reviewing your sample code.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 15s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          -1 test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
          +1 mvninstall 6m 45s trunk passed
          +1 compile 5m 44s trunk passed with JDK v1.8.0_77
          +1 compile 6m 38s trunk passed with JDK v1.7.0_95
          +1 checkstyle 0m 23s trunk passed
          +1 mvnsite 0m 57s trunk passed
          +1 mvneclipse 0m 14s trunk passed
          +1 findbugs 1m 34s trunk passed
          +1 javadoc 0m 56s trunk passed with JDK v1.8.0_77
          +1 javadoc 1m 3s trunk passed with JDK v1.7.0_95
          +1 mvninstall 0m 41s the patch passed
          +1 compile 5m 53s the patch passed with JDK v1.8.0_77
          +1 javac 5m 53s the patch passed
          +1 compile 6m 42s the patch passed with JDK v1.7.0_95
          +1 javac 6m 42s the patch passed
          -1 checkstyle 0m 23s hadoop-common-project/hadoop-common: patch generated 29 new + 131 unchanged - 0 fixed = 160 total (was 131)
          +1 mvnsite 0m 55s the patch passed
          +1 mvneclipse 0m 14s the patch passed
          -1 whitespace 0m 0s The patch has 2 line(s) that end in whitespace. Use git apply --whitespace=fix.
          -1 findbugs 1m 52s hadoop-common-project/hadoop-common generated 1 new + 0 unchanged - 0 fixed = 1 total (was 0)
          +1 javadoc 0m 53s the patch passed with JDK v1.8.0_77
          +1 javadoc 1m 4s the patch passed with JDK v1.7.0_95
          -1 unit 7m 37s hadoop-common in the patch failed with JDK v1.8.0_77.
          -1 unit 7m 44s hadoop-common in the patch failed with JDK v1.7.0_95.
          +1 asflicense 0m 24s Patch does not generate ASF License warnings.
          60m 0s



          Reason Tests
          FindBugs module:hadoop-common-project/hadoop-common
            Should org.apache.hadoop.fs.FileSystemStorageStatistics$LongStatisticIterator be a static inner class? At FileSystemStorageStatistics.java:[lines 50-74]
          JDK v1.8.0_77 Failed junit tests hadoop.fs.TestFilterFileSystem
            hadoop.fs.TestHarFileSystem
          JDK v1.7.0_95 Failed junit tests hadoop.fs.TestFilterFileSystem
            hadoop.fs.TestHarFileSystem



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:fbe3e86
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12799860/HDFS-10175.006.patch
          JIRA Issue HDFS-10175
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 7311e739f2f9 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 63ac2db
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_77 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/15230/artifact/patchprocess/diff-checkstyle-hadoop-common-project_hadoop-common.txt
          whitespace https://builds.apache.org/job/PreCommit-HDFS-Build/15230/artifact/patchprocess/whitespace-eol.txt
          findbugs https://builds.apache.org/job/PreCommit-HDFS-Build/15230/artifact/patchprocess/new-findbugs-hadoop-common-project_hadoop-common.html
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15230/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_77.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15230/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/15230/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_77.txt https://builds.apache.org/job/PreCommit-HDFS-Build/15230/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/15230/testReport/
          modules C: hadoop-common-project/hadoop-common U: hadoop-common-project/hadoop-common
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/15230/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          cmccabe Colin P. McCabe added a comment -

          I posted patch 006 as an example of what I was thinking about. The idea would be that some FS subclasses like DistributedFileSystem would override FileSystem#getStorageStatistics to return a StorageStatistics object with more details. In the case of DistributedFileSystem (HDFS) these would be mostly about the number of various RPCs that were done. We would simply use volatile longs to store this information.
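A rough sketch of the shape of API being described, simplified and with illustrative names rather than the actual patch contents (and using AtomicLong in place of the volatile longs mentioned above, to keep the example runnable): an iterator over named long counters, so reading the stats stays O(1) in memory no matter how many are tracked.

```java
import java.util.Iterator;
import java.util.Map;
import java.util.TreeMap;
import java.util.concurrent.atomic.AtomicLong;

// Simplified sketch of the iterator-based stats API under discussion;
// class and method names are illustrative, not the committed patch.
abstract class StorageStatisticsSketch {
  static final class LongStatistic {
    private final String name;
    private final long value;
    LongStatistic(String name, long value) { this.name = name; this.value = value; }
    String getName() { return name; }
    long getValue() { return value; }
  }
  // Returns an iterator rather than materializing a map, so the caller's
  // memory use does not grow with the number of tracked statistics.
  abstract Iterator<LongStatistic> getLongStatistics();
}

// A toy implementation an FS subclass might return from getStorageStatistics.
class ToyFsStats extends StorageStatisticsSketch {
  private final Map<String, AtomicLong> counters = new TreeMap<>();
  void incr(String name) {
    counters.computeIfAbsent(name, k -> new AtomicLong()).incrementAndGet();
  }
  Iterator<LongStatistic> getLongStatistics() {
    final Iterator<Map.Entry<String, AtomicLong>> it =
        counters.entrySet().iterator();
    return new Iterator<LongStatistic>() {
      public boolean hasNext() { return it.hasNext(); }
      public LongStatistic next() {
        Map.Entry<String, AtomicLong> e = it.next();
        return new LongStatistic(e.getKey(), e.getValue().get());
      }
    };
  }
}
```

Since statistics are identified by name, each backend can track whatever it likes (HDFS RPC counts, s3a PUT/GET counts) without a shared enum in hadoop-common.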

          cmccabe Colin P. McCabe added a comment -

          I thought about this a little bit more, and I don't think that FileSystem#Statistics#StatisticsData is the best place to add these new statistics. There are a few reasons.

          Firstly, the statistics that we're interested in are inherently filesystem-specific. For HDFS, we're interested in the number of RPCs to the NameNode-- calls like primitiveCreate, getBytesWithFutureGS, or concat. For something like s3a, we're interested in how many PUT and GET requests we've done to Amazon S3. s3 doesn't even support genstamps or the concat operation. Local filesystems have their own operations which are important.

          Secondly, the thread-local-data mechanism is not really that appropriate for most operations. Thread-local data is a big performance win when reading or writing bytes of data from or to a stream, since most such operations don't involve making an RPC. We have big client-side buffers which mean that most reads and writes can return immediately. In contrast, operations like mkdir, rename, delete, etc. always end up making at least one RPC, since these operations cannot be buffered on the client. In that case, the CPU overhead of doing an atomic increment is negligible. But the overhead of storing all that thread-local data is significant.

          I think what we should do is add an API to the FileSystem and FileContext base classes, which different types of FS can implement as appropriate.

          Show
          cmccabe Colin P. McCabe added a comment - I thought about this a little bit more, and I don't think that FileSystem#Statistics#StatisticsData is the best place to add these new statistics. There are a few reasons. Firstly, the statistics that we're interested in are inherently filesystem-specific. For HDFS, we're interested in the number of RPCs to the NameNode-- calls like primitiveCreate, getBytesWithFutureGS, or concat. For something like s3a, we're interested in how many PUT and GET requests we've done to Amazon S3. s3 doesn't even support genstamps or the concat operation. Local filesystems have their own operations which are important. Secondly, the thread-local-data mechanism is not really that appropriate for most operations. Thread-local data is a big performance win when reading or writing bytes of data from or to a stream, since most such operations don't involve making an RPC. We have big client-side buffers which mean that most reads and writes can return immediately. In constrast, operations like mkdir, rename, delete, etc. always end up making at least one RPC, since these operations cannot be buffered on the client. In that case, the CPU overhead of doing an atomic increment is negligable. But the overhead of storing all that thread-local data is significant. I think what we should do is add an API to the FileSystem and FileContext base classes, which different types of FS can implement as appropriate.
          cmccabe Colin P. McCabe added a comment -

          I'll take a look tomorrow, Mingliang Liu. Thanks for working on this.

          liuml07 Mingliang Liu added a comment -

          Hi Colin P. McCabe, would you kindly have a look at the current patch please? Is it OK according to our discussion?

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 19s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 1 new or modified test files.
          0 mvndep 0m 55s Maven dependency ordering for branch
          +1 mvninstall 10m 21s trunk passed
          +1 compile 12m 24s trunk passed with JDK v1.8.0_77
          +1 compile 10m 36s trunk passed with JDK v1.7.0_95
          +1 checkstyle 1m 38s trunk passed
          +1 mvnsite 3m 33s trunk passed
          +1 mvneclipse 1m 8s trunk passed
          +1 findbugs 7m 3s trunk passed
          +1 javadoc 3m 48s trunk passed with JDK v1.8.0_77
          +1 javadoc 5m 2s trunk passed with JDK v1.7.0_95
          0 mvndep 0m 24s Maven dependency ordering for patch
          +1 mvninstall 3m 1s the patch passed
          +1 compile 12m 4s the patch passed with JDK v1.8.0_77
          +1 javac 12m 4s the patch passed
          +1 compile 10m 32s the patch passed with JDK v1.7.0_95
          +1 javac 10m 32s the patch passed
          -1 checkstyle 1m 35s root: patch generated 1 new + 211 unchanged - 1 fixed = 212 total (was 212)
          +1 mvnsite 3m 26s the patch passed
          +1 mvneclipse 1m 6s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          +1 findbugs 7m 57s the patch passed
          +1 javadoc 3m 42s the patch passed with JDK v1.8.0_77
          +1 javadoc 4m 54s the patch passed with JDK v1.7.0_95
          -1 unit 11m 9s hadoop-common in the patch failed with JDK v1.8.0_77.
          +1 unit 1m 31s hadoop-hdfs-client in the patch passed with JDK v1.8.0_77.
          -1 unit 76m 36s hadoop-hdfs in the patch failed with JDK v1.8.0_77.
          -1 unit 10m 59s hadoop-common in the patch failed with JDK v1.7.0_95.
          +1 unit 1m 26s hadoop-hdfs-client in the patch passed with JDK v1.7.0_95.
          -1 unit 76m 25s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          +1 asflicense 0m 59s Patch does not generate ASF License warnings.
          287m 8s



          Reason Tests
          JDK v1.8.0_77 Failed junit tests hadoop.ipc.TestRPCWaitForProxy
            hadoop.net.TestDNS
            hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes
            hadoop.hdfs.server.datanode.TestDirectoryScanner
            hadoop.hdfs.server.namenode.TestEditLog
            hadoop.hdfs.server.namenode.TestDecommissioningStatus
            hadoop.hdfs.server.namenode.TestReconstructStripedBlocks
            hadoop.hdfs.server.namenode.ha.TestEditLogTailer
            hadoop.hdfs.server.datanode.TestDataNodeUUID
            hadoop.hdfs.server.datanode.TestDataNodeMultipleRegistrations
            hadoop.hdfs.shortcircuit.TestShortCircuitLocalRead
            hadoop.hdfs.security.TestDelegationTokenForProxyUser
            hadoop.hdfs.TestFileAppend
            hadoop.hdfs.server.datanode.fsdataset.impl.TestFsDatasetImpl
          JDK v1.8.0_77 Timed out junit tests org.apache.hadoop.hdfs.TestDFSClientRetries
            org.apache.hadoop.hdfs.server.balancer.TestBalancer
            org.apache.hadoop.hdfs.TestReadStripedFileWithDecoding
            org.apache.hadoop.hdfs.server.datanode.TestDirectoryScanner
          JDK v1.7.0_95 Failed junit tests hadoop.ipc.TestRPCWaitForProxy
            hadoop.hdfs.server.datanode.TestDirectoryScanner
            hadoop.hdfs.shortcircuit.TestShortCircuitLocalRead
            hadoop.hdfs.server.blockmanagement.TestPendingInvalidateBlock
            hadoop.hdfs.server.datanode.fsdataset.impl.TestFsDatasetImpl
          JDK v1.7.0_95 Timed out junit tests org.apache.hadoop.hdfs.TestParallelShortCircuitLegacyRead
            org.apache.hadoop.hdfs.TestAclsEndToEnd
            org.apache.hadoop.hdfs.TestDFSStorageStateRecovery
            org.apache.hadoop.hdfs.TestFileAppend2



          Subsystem Report/Notes
          Docker Image: yetus/hadoop:fbe3e86
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12799057/HDFS-10175.005.patch
          JIRA Issue HDFS-10175
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 7868aa0a2d5d 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / cab9cba
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_77 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/15177/artifact/patchprocess/diff-checkstyle-root.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15177/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_77.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15177/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15177/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15177/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/15177/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_77.txt https://builds.apache.org/job/PreCommit-HDFS-Build/15177/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77.txt https://builds.apache.org/job/PreCommit-HDFS-Build/15177/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt https://builds.apache.org/job/PreCommit-HDFS-Build/15177/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/15177/testReport/
          modules C: hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs-client hadoop-hdfs-project/hadoop-hdfs U: .
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/15177/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          liuml07 Mingliang Liu added a comment -

          Thanks Colin P. McCabe very much for the discussion.

          1. The v5 patch makes the detailed statistics optional. As the statistics object is shared among file system objects, it's hard to enable/disable it using a per-file-system config key. The patch adds a new API to Statistics to enable/disable this feature according to the tradeoff between reduced cost and detailed per-op counters. The extra overhead is avoided if the enum map is not constructed.
          2. I filed HADOOP-13031 to track the discussion and effort of refactoring the code that maintains rack-aware counters. Specifically, I also think it's not good to expose the internal composite data structure of distance-aware bytes read. Use cases that iterate over all the distances will call getBytesReadByDistance(int distance) multiple times, which internally triggers the aggregation of all threads' statistics data multiple times. To address this, they can use getData() to get all the statistics data at once. I reviewed the current patch of MAPREDUCE-6660, which employs the bytes-read-by-distance counters, and found that it uses getData() as I expected.
          3. Based on the current FileSystem design, we see no better choice than putting these counters in FileSystem$Statistics, whether for the distance-aware read counters (HDFS-specific) or the per-operation counters (many of which are HDFS-specific). For now, when a detailed statistic is missing (e.g. S3AFileSystem#append()), we treat it as zero. If some operations' statistics differ, the concrete file systems can update the statistics accordingly (e.g. S3AFileSystem#rename), as the counters are populated in the concrete file system operations. Note that the existing readOps/writeOps counters face a similar scenario (and challenges). We will file follow-up JIRAs if we have specific cases to handle.
          4. I created a new JIRA, HADOOP-13032, to track the effort of moving the Statistics class out of FileSystem for shorter source code and a simpler class structure, though it is an incompatible change.
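          A hedged sketch of the opt-in design described in point 1 (class, enum, and method names here are illustrative, not the patch's actual API): the per-operation counters live in an EnumMap that is only allocated when the feature is enabled, so callers that leave it disabled pay neither the memory nor the increment cost, and a missing statistic reads as zero.

```java
import java.util.EnumMap;
import java.util.concurrent.atomic.AtomicLong;

/**
 * Sketch of optional detailed per-operation statistics backed by an
 * EnumMap. When disabled, the map is never constructed and increments
 * are no-ops, matching the "extra overhead should be avoided" goal.
 */
public class DetailedStats {
    enum OpType { CREATE, APPEND, DELETE, MKDIRS, RENAME }

    private volatile EnumMap<OpType, AtomicLong> opCounts; // null => disabled

    synchronized void setDetailedEnabled(boolean enabled) {
        if (enabled && opCounts == null) {
            EnumMap<OpType, AtomicLong> m = new EnumMap<>(OpType.class);
            for (OpType op : OpType.values()) {
                m.put(op, new AtomicLong());
            }
            opCounts = m;
        } else if (!enabled) {
            opCounts = null; // drop the map; the overhead disappears
        }
    }

    void increment(OpType op) {
        EnumMap<OpType, AtomicLong> m = opCounts;
        if (m != null) { // no-op while detailed stats are off
            m.get(op).incrementAndGet();
        }
    }

    long get(OpType op) {
        EnumMap<OpType, AtomicLong> m = opCounts;
        return m == null ? 0 : m.get(op).get(); // missing stats read as zero
    }

    public static void main(String[] args) {
        DetailedStats ds = new DetailedStats();
        ds.increment(OpType.MKDIRS); // ignored: feature disabled
        ds.setDetailedEnabled(true);
        ds.increment(OpType.MKDIRS);
        System.out.println(ds.get(OpType.MKDIRS)); // prints 1
    }
}
```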
          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 9s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 1 new or modified test files.
          0 mvndep 0m 34s Maven dependency ordering for branch
          +1 mvninstall 6m 48s trunk passed
          +1 compile 6m 0s trunk passed with JDK v1.8.0_77
          +1 compile 6m 45s trunk passed with JDK v1.7.0_95
          +1 checkstyle 1m 7s trunk passed
          +1 mvnsite 2m 19s trunk passed
          +1 mvneclipse 0m 39s trunk passed
          +1 findbugs 5m 1s trunk passed
          +1 javadoc 2m 21s trunk passed with JDK v1.8.0_77
          +1 javadoc 3m 13s trunk passed with JDK v1.7.0_95
          0 mvndep 0m 27s Maven dependency ordering for patch
          +1 mvninstall 1m 57s the patch passed
          +1 compile 5m 58s the patch passed with JDK v1.8.0_77
          +1 javac 5m 58s the patch passed
          +1 compile 6m 42s the patch passed with JDK v1.7.0_95
          +1 javac 6m 42s the patch passed
          -1 checkstyle 1m 6s root: patch generated 1 new + 211 unchanged - 1 fixed = 212 total (was 212)
          +1 mvnsite 2m 20s the patch passed
          +1 mvneclipse 0m 40s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          +1 findbugs 5m 54s the patch passed
          +1 javadoc 2m 20s the patch passed with JDK v1.8.0_77
          +1 javadoc 3m 15s the patch passed with JDK v1.7.0_95
          +1 unit 7m 58s hadoop-common in the patch passed with JDK v1.8.0_77.
          +1 unit 0m 52s hadoop-hdfs-client in the patch passed with JDK v1.8.0_77.
          -1 unit 57m 14s hadoop-hdfs in the patch failed with JDK v1.8.0_77.
          +1 unit 7m 45s hadoop-common in the patch passed with JDK v1.7.0_95.
          +1 unit 0m 58s hadoop-hdfs-client in the patch passed with JDK v1.7.0_95.
          -1 unit 54m 6s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          +1 asflicense 0m 25s Patch does not generate ASF License warnings.
          196m 32s



          Reason Tests
          JDK v1.8.0_77 Failed junit tests hadoop.fs.contract.hdfs.TestHDFSContractSeek
            hadoop.hdfs.server.balancer.TestBalancer
            hadoop.hdfs.TestHFlush
            hadoop.hdfs.TestDFSUpgradeFromImage
          JDK v1.7.0_95 Failed junit tests hadoop.fs.contract.hdfs.TestHDFSContractSeek
            hadoop.hdfs.TestHFlush



          Subsystem Report/Notes
          Docker Image: yetus/hadoop:fbe3e86
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12798136/HDFS-10175.004.patch
          JIRA Issue HDFS-10175
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 4e7da1085823 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 44bbc50
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_77 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/15137/artifact/patchprocess/diff-checkstyle-root.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15137/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/15137/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/15137/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_77.txt https://builds.apache.org/job/PreCommit-HDFS-Build/15137/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/15137/testReport/
          modules C: hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs-client hadoop-hdfs-project/hadoop-hdfs U: .
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/15137/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          cmccabe Colin P. McCabe added a comment - edited

          Thanks, Ming Ma. I agree that if we are going with a map-based approach, we can simplify the implementation of HDFS-9579. This may also save a small amount of space... for example, if FileSystem threads don't have any bytes read at distance 3 or 4, they will no longer need to store space for bytesReadDistanceOfThreeOrFour.

          It also seems like a good idea to make the detailed statistics optional. Then clients could opt-in to paying the overheads, rather than getting the overheads imposed whether they want them or not.

          Mingliang Liu wrote:

          2) Move long getBytesReadByDistance(int distance) from Statistics to StatisticsData. If the user is getting bytes read for all distances, she can call getData() and then iterate the map/array, in which case the getData() will be called only once. For cases of 1K client threads, this may save the effort of aggregation. Colin Patrick McCabe may have different comments about this?

          Hmm. I'm not sure I see much of a benefit to this. Users should not be iterating over internal data structures. It exposes implementation details that we don't want to expose. I also don't see how it's more efficient, since you have to iterate over it in either case.

          For newly supported APIs, adding an entry in the map and one line of increment in the new method does the trick. From the point of view of the file system APIs, the public methods are not evolving rapidly.

          I agree that adding new statistics is relatively easy with the current patch. My comment was more that currently, many of these statistics are HDFS-specific. Since MapReduce needs to work with alternate filesystems, it will need to handle the case where these detailed statistics are missing or slightly different.

          Another dimension would be needed for cross-DC analysis, but based on the current use case I don't think that dimension is heavily needed. One point is that all file systems of the same kind share the statistics data among threads, regardless of whether the HDFS clusters are remote or local.

          (revised earlier comment about per-class stats) Yes, it does appear that stats are per-class.
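          The space-saving point above can be sketched with a plain map (a hypothetical illustration, not the HDFS-9579 code): when bytes read are keyed by distance, a thread that never reads at distance 3 or 4 simply stores no entry for them, whereas a dedicated field like bytesReadDistanceOfThreeOrFour always occupies space.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Sketch of sparse per-distance byte counters: only distances that
 * were actually seen consume storage, unlike one hard-coded field
 * per distance bucket.
 */
public class SparseDistanceBytes {
    private final Map<Integer, Long> bytesByDistance = new HashMap<>();

    void add(int distance, long bytes) {
        bytesByDistance.merge(distance, bytes, Long::sum);
    }

    long get(int distance) {
        return bytesByDistance.getOrDefault(distance, 0L);
    }

    int storedDistances() {
        return bytesByDistance.size(); // only distances actually seen
    }

    public static void main(String[] args) {
        SparseDistanceBytes s = new SparseDistanceBytes();
        s.add(2, 100);
        s.add(2, 50);
        System.out.println(s.get(2));            // prints 150
        System.out.println(s.storedDistances()); // prints 1
    }
}
```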

          liuml07 Mingliang Liu added a comment -

          Thank you very much Ming Ma for the comments.

          Yes, the NN also has an audit log that tracks DFS operations; the missing DFSClient name makes it less useful for the user. Your point about the motivation and the map vs. inline implementation of HDFS-9579 makes a lot of sense to me. I see no performance benefit from the map approach. I was wondering whether using a composite data structure (e.g. an enum map or array) to manage the distance->bytesRead mapping makes the code simpler.
          0) StatisticsData should be a bit shorter by delegating the operations to the composite data structure.
          1) The incrementBytesReadByDistance(int distance, long newBytes) and getBytesReadByDistance(int distance) methods, which switch-case over hard-coded variables, may be simplified, as we can set/get the bytesRead by distance directly from the map/array.
          2) Move long getBytesReadByDistance(int distance) from Statistics to StatisticsData. If the user is getting bytes read for all distances, she can call getData() and then iterate the map/array, in which case the getData() will be called only once. For cases of 1K client threads, this may save the effort of aggregation.
          Colin P. McCabe may have different comments about this?

          For newly supported APIs, adding an entry in the map and one line of increment in the new method does the trick. From the point of view of the file system APIs, the public methods are not evolving rapidly. Another dimension would be needed for cross-DC analysis, but based on the current use case I don't think that dimension is heavily needed. One point is that all file systems of the same kind share the statistics data among threads, regardless of whether the HDFS clusters are remote or local.

          Jitendra Nath Pandey also suggested making this feature optional/configurable. I found that hard, largely because different file system objects have individual configurations. As they share the statistics data by FS class type (scheme), it will be confusing if one FS disables this feature while another FS enables it. Is there any easy approach to handling this case?

          p.s. the v4 patch is to address the HAS_NEXT/LIST_STATUS.

          liuml07 Mingliang Liu added a comment - Thank you very much Ming Ma for the comments.
          mingma Ming Ma added a comment -

          Interesting work.

          • Seems like a useful feature to provide per-job counters for this. Note that NN also has the data, except nntop or audit log don't track them on per DFSClient client name basis. For the HDFS-9579 scenario, besides the per-job counters functionality, another motivation is the extra work required to track and aggregate on DN side (something we still want to experiment to track hot block, or remote read or write for all HDFS apps, not just MR).
          • Question about using inline longs or a map object. The original patch in HDFS-9579 used a map as I tried to make it more dynamic in two aspects: no need to hard-code network distances, and no memory allocation for FileSystems that don't generate such data. But the small extra memory for static longs and the mostly static network distance definition make the map approach less interesting.
          • Growth of the metrics. It seems each time we add an RPC method, we need to add another entry. Also, if someone wants to track cross-DC mkdir and intra-DC mkdir separately, that adds another dimension to it.
          • Making it optional is a good idea.
          liuml07 Mingliang Liu added a comment -

          Thanks for your test to quantify the overhead of the per-op stats map. The data you got is similar to our offline analysis. As to what the ~1.5K overhead per FileSystem/thread is trading for, the basic idea is to provide a file system operation summary at a fine granularity. Before this, we had FS operation counters such as read ops, write ops, and largeReadOps, which are not very useful for offline load analysis. One simple use case: a directory is polluted by 1K small files because a job forgets to delete its temporary files after using them. The create/delete counters will help users/admins locate the bad job very easily, whereas writeOps is not very indicative as it counts many other operations. I believe Hitesh Shah can show us more examples if you find his comment above not quite clear.

          As the Statistics class is used by other file systems (e.g. S3A) besides HDFS, the per-operation counters can be supported by those file systems as well. FileSystem itself has several high-level operations that are not supported by all concrete file systems, in which case a zero counter value for an unsupported operation seems OK. Maybe Jitendra Nath Pandey has more comments about this problem.

          Yes, HAS_NEXT is not really a metric that is interesting to users. I'll update the patch for iterative listStatus. This is a very good catch.

          In the very early stage of the HDFS-9579, it used a map (perhaps because it's more straightforward). I think the idea of moving hard-coded statistics longs into maps in this case is good. I'll file new jira to address this. Ping Ming Ma for more input.

          By the way, I think it's good to move the Statistics class out of FileSystem. However, it's an incompatible change, as the import statements would have to be updated in downstream applications.

          cmccabe Colin P. McCabe added a comment -

          I wrote a test named TestStatisticsOverhead to quantify the overheads more precisely. On my machine, the difference between running with this patch and running without it is about 3MB, as quantified by getHeapMemoryUsage. This is with 2000 threads. Since this is a per-FS overhead, some applications that use 3 or 4 filesystem objects may experience 3x or 4x that memory overhead.

          I agree that this level of overhead is not a deal-breaker, but I would like to understand what we are getting in exchange. Can you be a little bit clearer about the goals for this? Right now it seems to support only HDFS. And many of the operations are things that only HDFS will ever support, such as getBytesWithFutureGenerationStamps. Some operations are not used at all, such as COPY_FROM_LOCAL_FILE. It's a mishmash of high-level operations such as copy_from_local_file, which involve multiple FSes, and very low-level things like whether we called mkdirs or primitiveMkdir. Are you planning to add this statistics support to other filesystems as well?

          HAS_NEXT does not seem like a metric that users would be interested in. If we do another listStatus RPC, we should increment listStatus.

          If we are going to have these maps, we should move the existing hard-coded statistics longs into the maps for simplicity's sake, or at least things like bytesReadDistanceOfOneOrTwo and bytesReadDistanceOfThreeOrFour.

          jnp Jitendra Nath Pandey added a comment -

          Colin P. McCabe, Are you ok to withdraw your "-1" because Hitesh Shah has explained the use case and the memory overhead seems minimal?

          One more option is to add a configuration parameter at the client side so that per-operation statistics can be disabled if client doesn't want to track them. Mingliang Liu, I believe it should be easy to add a configuration to disable the per-op stats?
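          A minimal sketch of what such a client-side switch might look like (the class, flag, and key name "fs.client.per.operation.statistics.enable" are hypothetical, not from the patch; the real feature would read the flag from the Hadoop Configuration):

```java
import java.util.EnumMap;

/** Hypothetical sketch: per-operation counters guarded by a config flag,
 *  so clients that don't want the stats pay neither memory nor update cost. */
public class GuardedStats {
  enum Op { CREATE, DELETE }

  private final boolean perOpEnabled;
  private final EnumMap<Op, Long> counters;

  GuardedStats(boolean perOpEnabled) {
    this.perOpEnabled = perOpEnabled;
    // The map is not even allocated when the feature is disabled.
    this.counters = perOpEnabled ? new EnumMap<>(Op.class) : null;
  }

  void increment(Op op) {
    if (perOpEnabled) {
      counters.merge(op, 1L, Long::sum);
    }
  }

  long get(Op op) {
    return perOpEnabled ? counters.getOrDefault(op, 0L) : 0L;
  }

  public static void main(String[] args) {
    GuardedStats s = new GuardedStats(true);
    s.increment(Op.CREATE);
    System.out.println(s.get(Op.CREATE)); // prints 1
  }
}
```

          This does not answer Mingliang's concern above, though: since statistics are shared per FS scheme, the flag would have to be resolved once per scheme rather than per FileSystem instance to stay consistent.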

          jnp Jitendra Nath Pandey added a comment -

          Each entry in the map should be around 24 bytes (8 bytes for key enum pointer, 8 bytes for value object pointer and 8 bytes for the long counter). The EnumMap uses arrays unlike a hashmap, and enum keys would be singleton. Therefore overheads are negligible.
          For 48 entries it is 1.1KB per FS, and even for 1000 threads it is only 1.1MB. Triple that and it's 3.3MB. For Java processes that create 1000 threads this is a trivial amount of memory.
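          As a sanity check of these figures (taking the quoted 24 bytes per entry and 48 entries at face value; real JVM overhead varies with object headers and boxing):

```java
/** Back-of-the-envelope check of the EnumMap overhead numbers quoted above. */
public class OverheadEstimate {
  static final int BYTES_PER_ENTRY = 24; // key ref + value ref + long payload (approximate)
  static final int ENTRIES = 48;

  static long perFsBytes() {
    return (long) BYTES_PER_ENTRY * ENTRIES;
  }

  static long totalBytes(int threads, int fileSystems) {
    // The map exists per thread per FileSystem, so overheads multiply.
    return perFsBytes() * threads * fileSystems;
  }

  public static void main(String[] args) {
    System.out.println(perFsBytes());        // prints 1152, i.e. ~1.1 KB per FS per thread
    System.out.println(totalBytes(1000, 1)); // prints 1152000, i.e. ~1.1 MB for 1000 threads
    System.out.println(totalBytes(1000, 3)); // prints 3456000, i.e. ~3.3 MB for 3 filesystems
  }
}
```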

          cmccabe Colin P. McCabe added a comment -

          It's not a map with 48 entries, it's a map with 48 entries per thread per FS. So if you have 1000 threads, that's 48,000 entries. Double or triple that if you have more than one FS (for example, if you have a local filesystem plus an HDFS filesystem).

          hitesh Hitesh Shah added a comment -

          Colin P. McCabe Cluster admins who keep tabs on Hive queries or Pig scripts, to see what kind of ops they are doing, are using the job counters today. Unfortunately, the current job counters (read ops, write ops, etc.) are quite old and obsolete; they are very high level and cannot be used to find bad jobs which, say, create 1000s of files (for partitions, etc.). The approach Jitendra and Mingliang seem to be proposing integrates well with MapReduce and similar frameworks, where counters could be created to map to the filesystem stats. Additionally, these counters would be available for historical analysis as needed via the job history. This is something which would likely be needed for all jobs.

          To some extent I do agree that all the 50-odd metrics being tracked may not be useful for all use-cases. However, enabling this via HTrace and turning on HTrace for all jobs is also not an option, as it does not allow selective tracking of only certain info. Enabling HTrace for a certain job means very in-depth profiling, which is not really needed unless required for very specific investigations. Do you have any suggestions on how the app layers above HDFS can obtain more in-depth stats, but at the same time have all of these stats be accessible for all jobs?

          jnp Jitendra Nath Pandey added a comment -

          This proposal is very simple and lightweight to collect statistics for various file system operations, as compared to HTrace. HTrace is significantly more complicated to setup and operate. The counters in mapreduce have been in use for a long time and are very useful to gather job level information, which is critical to analyze job behaviors.
          The proposed map has only 48 entries. Being an enum-map it will be significantly optimized. I think this is a very simple patch and a good low hanging fruit for more detailed analysis of job behaviors.

          cmccabe Colin P. McCabe added a comment -

          I'm still -1 on this change. v3 creates a very large map per FileSystem per thread. For an application with lots of threads, this overhead is unacceptable. HTrace seems like a better way to get this information. I don't see a clear use-case for this.

          jnp Jitendra Nath Pandey added a comment -

          Colin P. McCabe, and Andrew Wang the latest patch addresses the concerns about synchronization. Do you think it is ok to commit now?

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 11s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 1 new or modified test files.
          0 mvndep 0m 14s Maven dependency ordering for branch
          +1 mvninstall 6m 46s trunk passed
          +1 compile 6m 27s trunk passed with JDK v1.8.0_74
          +1 compile 7m 21s trunk passed with JDK v1.7.0_95
          +1 checkstyle 1m 6s trunk passed
          +1 mvnsite 2m 35s trunk passed
          +1 mvneclipse 0m 41s trunk passed
          +1 findbugs 5m 36s trunk passed
          +1 javadoc 2m 31s trunk passed with JDK v1.8.0_74
          +1 javadoc 3m 21s trunk passed with JDK v1.7.0_95
          0 mvndep 0m 15s Maven dependency ordering for patch
          +1 mvninstall 2m 1s the patch passed
          +1 compile 6m 16s the patch passed with JDK v1.8.0_74
          +1 javac 6m 16s the patch passed
          +1 compile 7m 3s the patch passed with JDK v1.7.0_95
          +1 javac 7m 3s the patch passed
          -1 checkstyle 1m 7s root: patch generated 1 new + 211 unchanged - 1 fixed = 212 total (was 212)
          +1 mvnsite 2m 27s the patch passed
          +1 mvneclipse 0m 39s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          +1 findbugs 6m 1s the patch passed
          +1 javadoc 2m 21s the patch passed with JDK v1.8.0_74
          +1 javadoc 3m 17s the patch passed with JDK v1.7.0_95
          -1 unit 7m 1s hadoop-common in the patch failed with JDK v1.8.0_74.
          +1 unit 0m 52s hadoop-hdfs-client in the patch passed with JDK v1.8.0_74.
          +1 unit 58m 52s hadoop-hdfs in the patch passed with JDK v1.8.0_74.
          -1 unit 7m 16s hadoop-common in the patch failed with JDK v1.7.0_95.
          +1 unit 1m 0s hadoop-hdfs-client in the patch passed with JDK v1.7.0_95.
          -1 unit 55m 38s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          -1 asflicense 0m 26s Patch generated 2 ASF License warnings.
          201m 1s



          Reason Tests
          JDK v1.8.0_74 Timed out junit tests org.apache.hadoop.util.TestNativeLibraryChecker
          JDK v1.7.0_95 Failed junit tests hadoop.security.ssl.TestReloadingX509TrustManager
            hadoop.hdfs.TestHFlush
            hadoop.hdfs.server.namenode.TestDecommissioningStatus
          JDK v1.7.0_95 Timed out junit tests org.apache.hadoop.util.TestNativeLibraryChecker



          Subsystem Report/Notes
          Docker Image:yetus/hadoop:fbe3e86
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12795359/HDFS-10175.003.patch
          JIRA Issue HDFS-10175
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux e423bb48277a 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 2c268cc
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_74 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/14937/artifact/patchprocess/diff-checkstyle-root.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/14937/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_74.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/14937/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/14937/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/14937/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_74.txt https://builds.apache.org/job/PreCommit-HDFS-Build/14937/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt https://builds.apache.org/job/PreCommit-HDFS-Build/14937/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/14937/testReport/
          asflicense https://builds.apache.org/job/PreCommit-HDFS-Build/14937/artifact/patchprocess/patch-asflicense-problems.txt
          modules C: hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs-client hadoop-hdfs-project/hadoop-hdfs U: .
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/14937/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          liuml07 Mingliang Liu added a comment -

          Thanks Colin P. McCabe for providing comments.

          I agree with you that the synchronization overhead may be a concern on the client side. The v3 patch addresses this, following the convention in the current statistics implementation.

          As the description explained, the operation-specific counters can be used for analyzing the load imposed by a particular job on HDFS. Adding the per-operation counter in Statistics object is the most straightforward and simple approach for collecting and exposing this information.

          cmccabe Colin P. McCabe added a comment -

          -1. As Andrew Wang mentioned, this change imposes heavy synchronization overheads on everyone. It would be better to gather this sort of information through HTrace so that it could be turned on only for applications being traced.

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 14s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 1s The patch appears to include 1 new or modified test files.
          0 mvndep 0m 32s Maven dependency ordering for branch
          +1 mvninstall 6m 44s trunk passed
          +1 compile 6m 5s trunk passed with JDK v1.8.0_74
          +1 compile 6m 46s trunk passed with JDK v1.7.0_95
          +1 checkstyle 1m 7s trunk passed
          +1 mvnsite 2m 21s trunk passed
          +1 mvneclipse 0m 41s trunk passed
          +1 findbugs 5m 5s trunk passed
          +1 javadoc 2m 22s trunk passed with JDK v1.8.0_74
          +1 javadoc 3m 15s trunk passed with JDK v1.7.0_95
          0 mvndep 0m 15s Maven dependency ordering for patch
          +1 mvninstall 2m 2s the patch passed
          +1 compile 5m 53s the patch passed with JDK v1.8.0_74
          +1 javac 5m 53s the patch passed
          +1 compile 6m 51s the patch passed with JDK v1.7.0_95
          +1 javac 6m 51s the patch passed
          +1 checkstyle 1m 6s root: patch generated 0 new + 211 unchanged - 1 fixed = 211 total (was 212)
          +1 mvnsite 2m 20s the patch passed
          +1 mvneclipse 0m 41s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          +1 findbugs 5m 46s the patch passed
          +1 javadoc 2m 29s the patch passed with JDK v1.8.0_74
          +1 javadoc 3m 20s the patch passed with JDK v1.7.0_95
          -1 unit 6m 47s hadoop-common in the patch failed with JDK v1.8.0_74.
          +1 unit 0m 52s hadoop-hdfs-client in the patch passed with JDK v1.8.0_74.
          -1 unit 69m 17s hadoop-hdfs in the patch failed with JDK v1.8.0_74.
          -1 unit 7m 6s hadoop-common in the patch failed with JDK v1.7.0_95.
          +1 unit 1m 1s hadoop-hdfs-client in the patch passed with JDK v1.7.0_95.
          -1 unit 70m 9s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          -1 asflicense 0m 26s Patch generated 2 ASF License warnings.
          223m 10s



          Reason Tests
          JDK v1.8.0_74 Failed junit tests hadoop.hdfs.TestHFlush
            hadoop.hdfs.server.datanode.TestDataNodeHotSwapVolumes
            hadoop.hdfs.server.namenode.ha.TestInitializeSharedEdits
          JDK v1.8.0_74 Timed out junit tests org.apache.hadoop.util.TestNativeLibraryChecker
          JDK v1.7.0_95 Failed junit tests hadoop.hdfs.TestHFlush
            hadoop.hdfs.server.namenode.snapshot.TestOpenFilesWithSnapshot
            hadoop.hdfs.server.datanode.fsdataset.impl.TestFsDatasetImpl
            hadoop.hdfs.TestRollingUpgrade
          JDK v1.7.0_95 Timed out junit tests org.apache.hadoop.util.TestNativeLibraryChecker



          Subsystem Report/Notes
          Docker Image: yetus/hadoop:fbe3e86
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12795132/HDFS-10175.002.patch
          JIRA Issue HDFS-10175
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux 99b9034ee428 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / 19b645c
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_74 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/14916/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_74.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/14916/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_74.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/14916/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/14916/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/14916/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_74.txt https://builds.apache.org/job/PreCommit-HDFS-Build/14916/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_74.txt https://builds.apache.org/job/PreCommit-HDFS-Build/14916/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt https://builds.apache.org/job/PreCommit-HDFS-Build/14916/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/14916/testReport/
          asflicense https://builds.apache.org/job/PreCommit-HDFS-Build/14916/artifact/patchprocess/patch-asflicense-problems.txt
          modules C: hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs-client hadoop-hdfs-project/hadoop-hdfs U: .
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/14916/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          liuml07 Mingliang Liu added a comment -

          Hi Colin P. McCabe, would you kindly review the patch? I'm not sure I understand the point of using thread-local variables to improve read/write performance. Thanks!

          hadoopqa Hadoop QA added a comment -
          -1 overall



          Vote Subsystem Runtime Comment
          0 reexec 0m 9s Docker mode activated.
          +1 @author 0m 0s The patch does not contain any @author tags.
          +1 test4tests 0m 0s The patch appears to include 1 new or modified test files.
          0 mvndep 0m 41s Maven dependency ordering for branch
          +1 mvninstall 6m 35s trunk passed
          +1 compile 5m 56s trunk passed with JDK v1.8.0_74
          +1 compile 6m 41s trunk passed with JDK v1.7.0_95
          +1 checkstyle 1m 7s trunk passed
          +1 mvnsite 2m 21s trunk passed
          +1 mvneclipse 0m 40s trunk passed
          +1 findbugs 5m 9s trunk passed
          +1 javadoc 2m 20s trunk passed with JDK v1.8.0_74
          +1 javadoc 3m 17s trunk passed with JDK v1.7.0_95
          0 mvndep 0m 14s Maven dependency ordering for patch
          +1 mvninstall 1m 59s the patch passed
          +1 compile 5m 47s the patch passed with JDK v1.8.0_74
          +1 javac 5m 47s the patch passed
          +1 compile 6m 40s the patch passed with JDK v1.7.0_95
          +1 javac 6m 40s the patch passed
          -1 checkstyle 1m 5s root: patch generated 2 new + 211 unchanged - 1 fixed = 213 total (was 212)
          +1 mvnsite 2m 19s the patch passed
          +1 mvneclipse 0m 39s the patch passed
          +1 whitespace 0m 0s Patch has no whitespace issues.
          +1 findbugs 5m 53s the patch passed
          +1 javadoc 2m 18s the patch passed with JDK v1.8.0_74
          +1 javadoc 3m 14s the patch passed with JDK v1.7.0_95
          -1 unit 6m 33s hadoop-common in the patch failed with JDK v1.8.0_74.
          +1 unit 0m 49s hadoop-hdfs-client in the patch passed with JDK v1.8.0_74.
          -1 unit 56m 31s hadoop-hdfs in the patch failed with JDK v1.8.0_74.
          -1 unit 7m 9s hadoop-common in the patch failed with JDK v1.7.0_95.
          +1 unit 0m 59s hadoop-hdfs-client in the patch passed with JDK v1.7.0_95.
          -1 unit 59m 50s hadoop-hdfs in the patch failed with JDK v1.7.0_95.
          -1 asflicense 0m 28s Patch generated 2 ASF License warnings.
          199m 2s



          Reason Tests
          JDK v1.8.0_74 Failed junit tests hadoop.net.TestDNS
            hadoop.hdfs.TestDFSUpgradeFromImage
            hadoop.hdfs.TestErasureCodeBenchmarkThroughput
          JDK v1.8.0_74 Timed out junit tests org.apache.hadoop.util.TestNativeLibraryChecker
          JDK v1.7.0_95 Failed junit tests hadoop.net.TestDNS
            hadoop.hdfs.server.datanode.TestFsDatasetCache
            hadoop.hdfs.server.balancer.TestBalancer
          JDK v1.7.0_95 Timed out junit tests org.apache.hadoop.util.TestNativeLibraryChecker



          Subsystem Report/Notes
          Docker Image: yetus/hadoop:fbe3e86
          JIRA Patch URL https://issues.apache.org/jira/secure/attachment/12794633/HDFS-10175.001.patch
          JIRA Issue HDFS-10175
          Optional Tests asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle
          uname Linux c9a982eed6ac 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
          Build tool maven
          Personality /testptch/hadoop/patchprocess/precommit/personality/provided.sh
          git revision trunk / e7ed05e
          Default Java 1.7.0_95
          Multi-JDK versions /usr/lib/jvm/java-8-oracle:1.8.0_74 /usr/lib/jvm/java-7-openjdk-amd64:1.7.0_95
          findbugs v3.0.0
          checkstyle https://builds.apache.org/job/PreCommit-HDFS-Build/14886/artifact/patchprocess/diff-checkstyle-root.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/14886/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_74.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/14886/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_74.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/14886/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt
          unit https://builds.apache.org/job/PreCommit-HDFS-Build/14886/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          unit test logs https://builds.apache.org/job/PreCommit-HDFS-Build/14886/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.8.0_74.txt https://builds.apache.org/job/PreCommit-HDFS-Build/14886/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.8.0_74.txt https://builds.apache.org/job/PreCommit-HDFS-Build/14886/artifact/patchprocess/patch-unit-hadoop-common-project_hadoop-common-jdk1.7.0_95.txt https://builds.apache.org/job/PreCommit-HDFS-Build/14886/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs-jdk1.7.0_95.txt
          JDK v1.7.0_95 Test Results https://builds.apache.org/job/PreCommit-HDFS-Build/14886/testReport/
          asflicense https://builds.apache.org/job/PreCommit-HDFS-Build/14886/artifact/patchprocess/patch-asflicense-problems.txt
          modules C: hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs-client hadoop-hdfs-project/hadoop-hdfs U: .
          Console output https://builds.apache.org/job/PreCommit-HDFS-Build/14886/console
          Powered by Apache Yetus 0.2.0 http://yetus.apache.org

          This message was automatically generated.

          liuml07 Mingliang Liu added a comment -

          Thanks for your comment, Andrew Wang. I was aware of the thread-local statistics data structure and was in favor of following the same approach. The new operation map is still per-thread. The ConcurrentHashMap was used because, when aggregating, we have to make sure the map is not modified. Its functionality is similar to the "volatile" keyword used for the other primitive statistics.

          Anyway, I will revise the code and update the patch if the ConcurrentHashMap turns out to be unnecessary, for the sake of performance. Before that, the next patch will first resolve the conflicts with trunk caused by HDFS-9579.

          andrew.wang Andrew Wang added a comment -

          Hi Mingliang Liu, we spent some effort optimizing the Statistics data structure to make it per-thread to avoid synchronization overheads, it showed up for apps like Spark that do multithreaded FileSystem access. I noticed this change simply uses a ConcurrentHashMap. Can we please stick with the per-thread counters and aggregation style?
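For readers unfamiliar with the pattern being requested, here is a minimal sketch of per-thread counters with cold-path aggregation. The class and field names are illustrative only, not the actual FileSystem.Statistics code: each thread increments its own counter object without contention, and a reader sums across the registered per-thread objects.

```java
import java.util.ArrayList;
import java.util.List;

public class PerThreadStats {
    static class ThreadData {
        // volatile so the aggregating thread sees updates; only the
        // owning thread ever writes, so the ++ is safe despite not
        // being atomic.
        volatile long readOps;
    }

    // every per-thread counter object ever created, kept for aggregation
    private final List<ThreadData> allData = new ArrayList<>();

    private final ThreadLocal<ThreadData> threadData =
        ThreadLocal.withInitial(() -> {
            ThreadData d = new ThreadData();
            // registration happens once per thread, so locking here is cheap
            synchronized (allData) {
                allData.add(d);
            }
            return d;
        });

    // hot path: no shared-state contention
    public void incrementReadOps() {
        threadData.get().readOps++;
    }

    // cold path: sum the counters of every thread that ever touched us
    public long getReadOps() {
        long total = 0;
        synchronized (allData) {
            for (ThreadData d : allData) {
                total += d.readOps;
            }
        }
        return total;
    }

    public static void main(String[] args) throws InterruptedException {
        PerThreadStats stats = new PerThreadStats();
        Thread t1 = new Thread(() -> { for (int i = 0; i < 1000; i++) stats.incrementReadOps(); });
        Thread t2 = new Thread(() -> { for (int i = 0; i < 500; i++) stats.incrementReadOps(); });
        t1.start(); t2.start();
        t1.join(); t2.join();
        System.out.println(stats.getReadOps()); // prints 1500
    }
}
```

The design trades slower, lock-guarded reads for uncontended writes, which is the right trade when increments vastly outnumber aggregations, as they do for multithreaded FileSystem clients.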

          liuml07 Mingliang Liu added a comment -

          Thanks for creating this jira, Ram Venkatesh.

          The v0 patch is the initial effort according to the offline discussion with Jitendra Nath Pandey. More specifically, it:

          1. Creates an enum OpStatistic in the FileSystem#Statistics class, along with supporting methods
          2. Updates the enum for every operation call in DistributedFileSystem
          3. Tries to cover the S3A file system, though probably not correctly
          4. Updates the unit tests to cover the new statistics

          Any comments or discussion are welcome.
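          As a rough illustration of the enum-per-operation idea (the OpStats enum and its members here are hypothetical, not the patch's actual API), each client-side operation increments a dedicated counter keyed by an enum constant:

```java
import java.util.EnumMap;
import java.util.Map;
import java.util.concurrent.atomic.AtomicLong;

public class OpStatistics {
    // hypothetical operation names; the real patch enumerates every
    // DfsClient operation (create, append, delete, mkdirs, rename, ...)
    enum OpStats { CREATE, DELETE, MKDIRS, RENAME }

    private final Map<OpStats, AtomicLong> counters = new EnumMap<>(OpStats.class);

    public OpStatistics() {
        for (OpStats op : OpStats.values()) {
            counters.put(op, new AtomicLong());
        }
    }

    // called once per client-side operation, e.g. from mkdirs()
    public void increment(OpStats op) {
        counters.get(op).incrementAndGet();
    }

    public long get(OpStats op) {
        return counters.get(op).get();
    }
}
```

          An EnumMap of AtomicLongs keeps increments lock-free; note this sketch deliberately ignores the per-thread optimization discussed later in the thread.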


            People

            • Assignee: liuml07 Mingliang Liu
            • Reporter: venkateshrin Ram Venkatesh
            • Votes: 0
            • Watchers: 14