[HADOOP-8468] Umbrella of enhancements to support different failure and locality topologies - ASF JIRA

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 1.0.0, 2.0.0-alpha
Fix Version/s: None
Component/s: ha, io
Labels: None
Target Version/s: 2.1.0-beta

Description

The current Hadoop network topology (described in earlier issues such as HADOOP-692) works well for the classic three-tier network it was designed for. However, it does not take into account other failure models or infrastructure changes that affect network bandwidth efficiency, such as virtualization.
A virtualized platform has the following characteristics, which the Hadoop topology should not ignore when scheduling tasks, placing replicas, balancing, or fetching blocks for reading:
1. VMs on the same physical host are affected by the same hardware failure. To match the reliability of a physical deployment, replicating data across two virtual machines on the same host should be avoided.
2. The network between VMs on the same physical host has higher throughput and lower latency, and does not consume any physical switch bandwidth.
Thus, we propose to make the Hadoop network topology extensible and to introduce a new level in the hierarchical topology, a node group level, which maps well onto infrastructure based on a virtualized environment.
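
The two observations above, and the extra node group level, can be illustrated with a small standalone sketch. It does not use Hadoop's actual NetworkTopology classes: the class NodeGroupTopologySketch, its Node type, and all method names are hypothetical, and the placement and read rules shown are only an assumption about how a node group level could be consulted.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch of a 4-layer topology: rack -> node group (physical host) -> node (VM).
public class NodeGroupTopologySketch {

    /** Locality levels ordered from closest to farthest. */
    enum Locality { LOCAL_NODE, SAME_NODE_GROUP, SAME_RACK, OFF_RACK }

    /** A VM identified by its rack, its node group (physical host), and its own name. */
    static final class Node {
        final String rack;
        final String nodeGroup;
        final String name;

        Node(String rack, String nodeGroup, String name) {
            this.rack = rack;
            this.nodeGroup = nodeGroup;
            this.name = name;
        }

        @Override
        public String toString() {
            return "/" + rack + "/" + nodeGroup + "/" + name;
        }
    }

    /** Compares two nodes level by level, from the rack down to the node itself. */
    static Locality locality(Node a, Node b) {
        if (!a.rack.equals(b.rack)) return Locality.OFF_RACK;
        if (!a.nodeGroup.equals(b.nodeGroup)) return Locality.SAME_RACK;
        if (!a.name.equals(b.name)) return Locality.SAME_NODE_GROUP;
        return Locality.LOCAL_NODE;
    }

    /** Point 1: reject a target that shares a node group with an existing replica. */
    static boolean isGoodReplicaTarget(Node candidate, List<Node> existingReplicas) {
        for (Node replica : existingReplicas) {
            if (locality(candidate, replica).compareTo(Locality.SAME_NODE_GROUP) <= 0) {
                return false;
            }
        }
        return true;
    }

    /** Point 2: prefer the replica that is closest to the reader. */
    static Node closestReplica(Node reader, List<Node> replicas) {
        Node best = null;
        for (Node replica : replicas) {
            if (best == null
                    || locality(reader, replica).compareTo(locality(reader, best)) < 0) {
                best = replica;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        Node vm1 = new Node("rack1", "host1", "vm1");
        Node vm2 = new Node("rack1", "host1", "vm2");
        Node vm3 = new Node("rack1", "host2", "vm3");
        Node vm4 = new Node("rack2", "host3", "vm4");

        System.out.println(locality(vm1, vm2));
        System.out.println(isGoodReplicaTarget(vm2, Arrays.asList(vm1)));
        System.out.println(isGoodReplicaTarget(vm3, Arrays.asList(vm1)));
        System.out.println(closestReplica(vm1, Arrays.asList(vm4, vm3, vm2)));
    }
}
```

In this sketch, a three-layer topology would see vm1 and vm2 only as two distinct nodes on rack1; the node group level is what lets the placement check tell that they share a physical host.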

Attachments

Proposal for enchanced failure and locality topologies.pdf	04/Jun/12 04:09	260 kB	Junping Du
HADOOP-8468-total.patch	04/Jun/12 04:31	259 kB	Junping Du
HADOOP-8468-total-v3.patch	04/Jun/12 14:37	259 kB	Junping Du
Proposal for enchanced failure and locality topologies (revised-1.0).pdf	16/Jun/12 10:13	269 kB	Junping Du
HVE_Hadoop World Meetup 2012.pptx	24/Oct/12 13:16	954 kB	Junping Du
HVE User Guide on branch-1(draft ).pdf	30/Oct/12 17:43	380 kB	Junping Du

Issue Links

incorporates

HDFS-4240	In nodegroup-aware case, make sure nodes are avoided to place replica if some replica are already under the same nodegroup	Closed
YARN-18	Configurable Hierarchical Topology for YARN	Open
YARN-19	4-layer topology (with NodeGroup layer) implementation of Container Assignment and Task Scheduling (for YARN)	Open
HDFS-3495	Update Balancer to support new NetworkTopology with NodeGroup	Closed
HDFS-3601	Implementation of ReplicaPlacementPolicyNodeGroup to support 4-layer network topology	Closed
HDFS-3498	Make Replica Removal Policy pluggable and ReplicaPlacementPolicyDefault extensible for reusing code in subclass	Closed

is related to

HDFS-4886	Override verifyBlockPlacement() API in BlockPlacementPolicyWithNodeGroup	Resolved
HDFS-4898	BlockPlacementPolicyWithNodeGroup.chooseRemoteRack() fails to properly fallback to local rack	Closed

relates to

HADOOP-11326	documentation for configuring HVE: dfs.block.replicator.classname should be org.apache.hadoop.hdfs.server.blockmanagement.BlockPlacementPolicyWithNodeGroup	Resolved
HDFS-3564	Design enhancements to the pluggable blockplacementpolicy	Resolved

Sub-Tasks

1.	Make NetworkTopology class pluggable	Closed	Junping Du
2.	Implementation of 4-layer subclass of NetworkTopology (NetworkTopologyWithNodeGroup)	Closed	Junping Du
3.	Backport Network Topology Extension for Virtualization (HADOOP-8468) to branch-1	Closed	Junping Du
4.	Document usage of node-group layer topology	Resolved	Unassigned
5.	CLONE - Backport Network Topology Extension for Virtualization (HADOOP-8468) to branch-1	Resolved	Unassigned

People

Assignee: Junping Du
Reporter: Junping Du
Votes: 6
Watchers: 63

Dates

Created: 04/Jun/12 04:08
Updated: 22/Feb/19 10:44
Resolved: 22/Feb/19 10:44