[HDFS-8147] Mover should not schedule two replicas to the same DN storage - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 2.6.0
Fix Version/s: 2.8.0, 2.7.1, 3.0.0-alpha1
Component/s: balancer & mover
Labels: None
Hadoop Flags: Reviewed

Description

Scenario:
1. A three-DN cluster with the following storage types (all DNs are in the same rack):
DN1 : DISK,ARCHIVE
DN2 : DISK
DN3 : DISK,ARCHIVE
2. One file with two replicas (on DN1 and DN2).
3. Set the file's storage policy to COLD.
4. Run the Mover.

Expected Result: The file's blocks should move to DN1:ARCHIVE and DN3:ARCHIVE.

Actual Result: chooseTargetInSameNode() moves the DN1:DISK replica to DN1:ARCHIVE, but in the next iteration chooseTarget() for the same rack again selects DN1:ARCHIVE as the target, even though a replica of the same block already exists on DN1.
Neither chooseTargetInSameNode() nor chooseTarget() should select as a target a node that already holds a replica of the same block. Otherwise the Dispatcher fails to move the block, as shown in the log below, and the Mover tries again in the next iteration.
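
The rule stated above can be illustrated with a minimal, self-contained Java sketch. It is not the attached patch and does not use the real Mover or Dispatcher classes: `MoverTargetSketch`, `Storage`, the four-argument `chooseTarget` signature and the `replicaNodes` set are all hypothetical names introduced here for illustration. The assumption is that a move within the source DN stays allowed, while any other DN that already holds a replica of the block is skipped.

```java
import java.util.List;
import java.util.Set;

// Hypothetical model of the target-selection rule; not Hadoop's actual classes.
class MoverTargetSketch {

    /** Stand-in for one storage on a DataNode, e.g. DN1:ARCHIVE. */
    static final class Storage {
        final String node;
        final String type;

        Storage(String node, String type) {
            this.node = node;
            this.type = type;
        }

        @Override
        public String toString() {
            return node + ":" + type;
        }
    }

    /**
     * Returns the first candidate of the wanted storage type, skipping any
     * DataNode other than the source that already holds a replica of the
     * block. Returns null when no candidate qualifies.
     */
    static Storage chooseTarget(String sourceNode, String wantedType,
                                List<Storage> candidates, Set<String> replicaNodes) {
        for (Storage s : candidates) {
            // A same-node move (DISK to ARCHIVE on the source DN) is still allowed.
            boolean holdsOtherReplica =
                replicaNodes.contains(s.node) && !s.node.equals(sourceNode);
            if (s.type.equals(wantedType) && !holdsOtherReplica) {
                return s;
            }
        }
        return null;
    }
}
```

With the scenario's ARCHIVE candidates (DN1:ARCHIVE, DN3:ARCHIVE) and replicas on DN1 and DN2, this sketch picks DN1:ARCHIVE for the replica whose source is DN1 and DN3:ARCHIVE for the replica whose source is DN2, which matches the expected result.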

Logs

15/04/15 10:47:17 WARN balancer.Dispatcher: Failed to move blk_1073741852_1028 with size=11990 from 10.19.92.74:50010:DISK to 10.19.92.73:50010:ARCHIVE through 10.19.92.73:50010: Got error, status message opReplaceBlock BP-1258709199-10.19.92.74-1428292615636:blk_1073741852_1028 received exception org.apache.hadoop.hdfs.server.datanode.ReplicaAlreadyExistsException: Replica FinalizedReplica, blk_1073741852_1028, FINALIZED

Attachments

HDFS-8147_1.patch (20/Apr/15 06:49, 1 kB, Surendra Singh Lilhore)
HDFS-8147_2.patch (22/Apr/15 01:08, 3 kB, Walter Su)
HDFS-8147_3.patch (22/Apr/15 08:48, 5 kB, Surendra Singh Lilhore)
HDFS-8147_4.patch (23/Apr/15 13:25, 4 kB, Surendra Singh Lilhore)
HDFS-8147.patch (17/Apr/15 11:44, 2 kB, Surendra Singh Lilhore)

Issue Links

is broken by

HDFS-7310 Mover can give first priority to local DN if it has target storage type available in local DN (Closed)

is related to

HDFS-8143 HDFS Mover tool should exit after some retry when failed to move blocks. (Closed)

HDFS-8204 Mover/Balancer should not schedule two replicas to the same DN (Closed)

is required by

HDFS-7621 Erasure Coding: update the Balancer/Mover data migration logic (Resolved)

People

Assignee: Surendra Singh Lilhore

Reporter: Surendra Singh Lilhore

Votes: 0

Watchers: 7

Dates

Created: 15/Apr/15 05:17

Updated: 06/Jan/17 01:33

Resolved: 23/Apr/15 18:59