[HDFS-10967] Add configuration for BlockPlacementPolicy to avoid near-full DataNodes - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Patch Available
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: namenode
Labels:
- balancer

Description

Large production clusters are likely to have heterogeneous nodes in terms of storage capacity, memory, and CPU cores. It is not always possible to proportionally ingest data into DataNodes based on their remaining storage capacity. Therefore it's possible for a subset of DataNodes to be much closer to full capacity than the rest.

This heterogeneity is most likely rack-by-rack – i.e. m whole racks of low-storage nodes and n whole racks of high-storage nodes. So It'd be very useful if we can lower the chance for those near-full DataNodes to become destinations for the 2nd and 3rd replicas.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HDFS-10967.03.patch
10/Oct/16 21:48
26 kB
Zhe Zhang
HDFS-10967.02.patch
10/Oct/16 21:17
27 kB
Zhe Zhang
HDFS-10967.01.patch
10/Oct/16 18:25
11 kB
Zhe Zhang
HDFS-10967.00.patch
07/Oct/16 22:00
4 kB
Zhe Zhang

Issue Links

relates to

HDFS-8041 Consider remaining space during block blockplacement if dfs space is highly utilized

Resolved

HDFS-8131 Implement a space balanced block placement policy

Resolved

Activity

People

Assignee:: Zhe Zhang

Reporter:: Zhe Zhang

Votes:: 1 Vote for this issue

Watchers:: 15 Start watching this issue

Dates

Created:: 06/Oct/16 00:06

Updated:: 04/Dec/17 23:40