[HDFS-14637] Namenode may not replicate blocks to meet the policy after enabling upgradeDomain - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 3.3.0
Fix Version/s: 3.3.0, 3.1.4, 3.2.2
Component/s: namenode
Labels:
None

Target Version/s:

3.3.0

Description

After changing the network topology or placement policy on a cluster and restarting the namenode, the namenode will scan all blocks on the cluster at startup, and check if they meet the current placement policy. If they do not, they are added to the replication queue and the namenode will arrange for them to be replicated to ensure the placement policy is used.

If you start with a cluster with no UpgradeDomain, and then enable UpgradeDomain, then on restart the NN does notice all the blocks violate the placement policy and it adds them to the replication queue. I believe there are some issues in the logic that prevents the blocks from replicating depending on the setup:

With UD enabled, but no racks configured, and possible on a 2 rack cluster, the queued replication work never makes any progress, as in blockManager.validateReconstructionWork(), it checks to see if the new replica increases the number of racks, and if it does not, it skips it and tries again later.

DatanodeStorageInfo[] targets = rw.getTargets();
if ((numReplicas.liveReplicas() >= requiredRedundancy) &&
    (!isPlacementPolicySatisfied(block)) ) {
  if (!isInNewRack(rw.getSrcNodes(), targets[0].getDatanodeDescriptor())) {
    // No use continuing, unless a new rack in this case
    return false;
  }
  // mark that the reconstruction work is to replicate internal block to a
  // new rack.
  rw.setNotEnoughRack();
}

Additionally, in blockManager.scheduleReconstruction() is there some logic that sets the number of new replicas required to one, if the live replicas >= requiredReduncancy:

int additionalReplRequired;
if (numReplicas.liveReplicas() < requiredRedundancy) {
  additionalReplRequired = requiredRedundancy - numReplicas.liveReplicas()
      - pendingNum;
} else {
  additionalReplRequired = 1; // Needed on a new rack
}

With UD, it is possible for 2 new replicas to be needed to meet the block placement policy, if all existing replicas are on nodes with the same domain. For traditional '2 rack redundancy', only 1 new replica would ever have been needed in this scenario.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HDFS-14637.001.patch
10/Jul/19 10:54
17 kB
Stephen O'Donnell
HDFS-14637.002.patch
22/Jul/19 14:16
27 kB
Stephen O'Donnell
HDFS-14637.003.patch
22/Jul/19 20:27
28 kB
Stephen O'Donnell
HDFS-14637.004.patch
23/Jul/19 09:26
28 kB
Stephen O'Donnell
HDFS-14637.005.patch
24/Jul/19 08:40
28 kB
Stephen O'Donnell
HDFS-14637.branch-3.1.patch
04/Oct/19 05:30
28 kB
Wei-Chiu Chuang
HDFS-14637.branch-3.2.patch
04/Oct/19 05:30
28 kB
Wei-Chiu Chuang

Issue Links

relates to

HDFS-7541 Upgrade Domains in HDFS

Resolved

Activity

People

Assignee:: Stephen O'Donnell

Reporter:: Stephen O'Donnell

Votes:: 0 Vote for this issue

Watchers:: 8 Start watching this issue

Dates

Created:: 08/Jul/19 19:28

Updated:: 04/Oct/19 05:30

Resolved:: 04/Oct/19 05:30