Details
- Type: Bug
- Status: Closed
- Priority: Blocker
- Resolution: Fixed
- Fix Version/s: 0.20-append
- Affects Version/s: None
- Component/s: None
- Labels: hbase
Description
Before 0.18, when a Datanode restarted, it deleted the files under the data-dir/tmp directory, since those files were no longer valid. In 0.18, however, it incorrectly moves these files to the normal directory, making them valid blocks. Either of the following would work:
- remove the tmp files during upgrade, or
- if the files under tmp are in the pre-0.18 format (i.e. no generation stamp), delete them.
Currently the effect of this bug is that these files end up failing block verification and eventually get deleted, but before that they cause incorrect over-replication at the namenode.
Also, it looks like our policy regarding the treatment of files under tmp needs to be defined better. Right now there are probably one or two more bugs with it. Dhruba, please file them if you remember.
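The second proposed fix above — delete tmp files that are in the pre-0.18 format, identified by the absence of a generation stamp — can be sketched as a startup cleanup pass. This is a minimal illustration, not Hadoop's actual Java implementation; the file-naming convention assumed here (pre-0.18 meta files named `blk_<id>.meta`, 0.18+ meta files named `blk_<id>_<genstamp>.meta`) is stated for the example and should be checked against the DataNode's real layout.

```python
import os
import re
import tempfile

# Assumed naming convention (illustrative, not a verified Hadoop constant):
# pre-0.18 meta files are blk_<id>.meta; from 0.18 on they carry a
# generation stamp, blk_<id>_<genstamp>.meta.
PRE_18_META = re.compile(r"^blk_(-?\d+)\.meta$")

def purge_pre18_tmp_blocks(tmp_dir):
    """On Datanode restart, delete tmp block files left in the pre-0.18
    format (no generation stamp) instead of promoting them to the normal
    data directory. Returns the names of the files removed."""
    removed = []
    for name in os.listdir(tmp_dir):
        m = PRE_18_META.match(name)
        if m is None:
            continue  # 0.18+ format (has generation stamp) or not a meta file
        block_id = m.group(1)
        # Remove both the meta file and its companion block file, if present.
        for victim in (name, "blk_" + block_id):
            path = os.path.join(tmp_dir, victim)
            if os.path.exists(path):
                os.remove(path)
                removed.append(victim)
    return removed
```

Files whose meta name carries a generation stamp are left alone, so valid 0.18-format blocks in tmp are unaffected by the cleanup.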
Attachments
Issue Links
- depends upon
  - HDFS-101 DFS write pipeline : DFSClient sometimes does not detect second datanode failure (Closed)
  - HDFS-793 DataNode should first receive the whole packet ack message before it constructs and sends its own ack message for the packet (Closed)
  - HDFS-988 saveNamespace race can corrupt the edits log (Closed)
  - HDFS-606 ConcurrentModificationException in invalidateCorruptReplicas() (Closed)
  - HDFS-826 Allow a mechanism for an application to detect that datanode(s) have died in the write pipeline (Closed)
- incorporates
  - HADOOP-4997 workaround for tmp file handling on DataNodes in 0.18 (HADOOP-4663) (Closed)
- relates to
  - HDFS-57 A Datanode's datadir could have lots of blocks in the top-level directory (Resolved)
  - HADOOP-4702 Failed block replication leaves an incomplete block in receiver's tmp data directory (Closed)
  - HADOOP-4810 Data lost at cluster startup time (Closed)
  - HDFS-29 In Datanode, update block may fail due to length inconsistency (Closed)