Hadoop HDFS / HDFS-16203

Discover datanodes with unbalanced block pool usage by the standard deviation


Details

    • Hadoop Flags: Reviewed

    Description

      Discover datanodes with unbalanced volume usage by the standard deviation.

      In some scenarios, datanode disk usage can become unbalanced:
      1. A damaged disk is repaired and brought back online.
      2. New disks are added to some datanodes.
      3. Some disks are damaged, resulting in slow data writing.
      4. A custom volume choosing policy is in use.

      When disk usage is unbalanced, a sudden increase in datanode write traffic may lead to busy disk I/O even though volume usage is low, which decreases throughput across datanodes.

      We need to find these nodes in time so that we can run the disk balancer or take other action. Based on the volume usage of each datanode, we can calculate the standard deviation of its volume usage: the more unbalanced the volumes, the higher the standard deviation.
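
      As a rough illustration, the following minimal sketch (plain Java; the VolumeUsageStdDev class and its inputs are assumptions for this example, not the actual patch or an HDFS API) shows how the metric could be computed for a single datanode from its per-volume usage ratios:

      {code:java}
      import java.util.Arrays;

      /**
       * Minimal sketch: standard deviation of per-volume usage for one datanode.
       * Inputs are assumed to be usage ratios (dfsUsed / capacity), one per volume.
       */
      public class VolumeUsageStdDev {

        /** Population standard deviation of the per-volume usage ratios. */
        static double stdDev(double[] volumeUsageRatios) {
          if (volumeUsageRatios == null || volumeUsageRatios.length == 0) {
            return 0.0;
          }
          double mean = Arrays.stream(volumeUsageRatios).average().orElse(0.0);
          double variance = Arrays.stream(volumeUsageRatios)
              .map(u -> (u - mean) * (u - mean))
              .average()
              .orElse(0.0);
          return Math.sqrt(variance);
        }

        public static void main(String[] args) {
          // Balanced datanode: every volume around 60% used, std dev close to 0.
          double[] balanced = {0.58, 0.60, 0.62, 0.59};
          // Unbalanced datanode: a freshly added, nearly empty disk next to full ones.
          double[] unbalanced = {0.05, 0.85, 0.88, 0.90};
          System.out.printf("balanced   stddev = %.4f%n", stdDev(balanced));
          System.out.printf("unbalanced stddev = %.4f%n", stdDev(unbalanced));
        }
      }
      {code}

      Compared with a simple max-minus-min range, the standard deviation reflects every volume on the node rather than only the two extremes, which makes it a reasonable single number to sort by.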

      We can display the result on the namenode web UI and sort by it to quickly find the nodes whose volume usage is unbalanced.

      This interface is only used to obtain metrics and does not adversely affect namenode performance.

       

            People

              Assignee: Tao Li (tomscut)
              Reporter: Tao Li (tomscut)
              Votes: 0
              Watchers: 4

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 6h 20m