[HDFS-11486] Client close() should not fail fast if the last block is being decommissioned - ASF JIRA

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.6.0
Fix Version/s: 2.9.0, 2.7.4, 3.0.0-alpha4, 2.8.2
Component/s: None
Labels: None
Target Version/s: 2.9.0, 2.7.4, 3.0.0-alpha4, 2.8.1

Description

If a DFS client closes a file while replicas of its last block are on DataNodes being decommissioned, close() may fail if decommissioning of the block does not complete within a few seconds.

When a DataNode is being decommissioned, the NameNode marks the DN's state as DECOMMISSION_INPROGRESS, and blocks with replicas on that DataNode become under-replicated immediately. A close() call, which attempts to complete the last open block, will fail if the number of live replicas is below the minimum replication factor because too many replicas reside on decommissioning DataNodes.
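
The replica accounting described above can be illustrated with a toy model. This is a minimal sketch, not Hadoop code: the class `LastBlockCheck`, its enum, and its methods are hypothetical names invented for illustration. It contrasts counting only replicas on normal DataNodes (the behavior this report describes) with also counting replicas on decommissioning DataNodes (the behavior this report argues for). It makes no claim about how the attached patches implement the fix.

```java
import java.util.Arrays;
import java.util.List;

// Toy model only; every name in this class is hypothetical.
final class LastBlockCheck {
    enum AdminState { NORMAL, DECOMMISSION_INPROGRESS, DECOMMISSIONED }

    // Behavior described in the report: only replicas on NORMAL DataNodes count as live.
    static int liveReplicas(List<AdminState> replicaHosts) {
        int live = 0;
        for (AdminState s : replicaHosts) {
            if (s == AdminState.NORMAL) {
                live++;
            }
        }
        return live;
    }

    // Alternative argued for in the report: replicas on decommissioning
    // DataNodes still hold the data, so they are counted as usable.
    static int usableReplicas(List<AdminState> replicaHosts) {
        int usable = 0;
        for (AdminState s : replicaHosts) {
            if (s == AdminState.NORMAL || s == AdminState.DECOMMISSION_INPROGRESS) {
                usable++;
            }
        }
        return usable;
    }

    // The last block can be completed only if enough replicas are counted.
    static boolean canComplete(int replicas, int minReplication) {
        return replicas >= minReplication;
    }

    public static void main(String[] args) {
        // Three replicas, two of them on DataNodes being decommissioned.
        List<AdminState> hosts = Arrays.asList(
                AdminState.NORMAL,
                AdminState.DECOMMISSION_INPROGRESS,
                AdminState.DECOMMISSION_INPROGRESS);
        int minReplication = 2;
        System.out.println("live-only check passes: "
                + canComplete(liveReplicas(hosts), minReplication));
        System.out.println("usable check passes: "
                + canComplete(usableReplicas(hosts), minReplication));
    }
}
```

With a minimum replication factor of 2 and two of three replicas on decommissioning DataNodes, the live-only check fails while the usable check passes, which is the situation the report describes.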

The client internally tries to complete the last open block up to 5 times by default, which takes roughly 12 seconds. After that, close() throws an exception like the following, which callers typically do not handle properly.

java.io.IOException: Unable to close file because the last blockBP-33575088-10.0.0.200-1488410554081:blk_1073741827_1003 does not have enough number of replicas.

	at org.apache.hadoop.hdfs.DFSOutputStream.completeFile(DFSOutputStream.java:864)
	at org.apache.hadoop.hdfs.DFSOutputStream.closeImpl(DFSOutputStream.java:827)
	at org.apache.hadoop.hdfs.DFSOutputStream.close(DFSOutputStream.java:793)
	at org.apache.hadoop.fs.FSDataOutputStream$PositionCache.close(FSDataOutputStream.java:72)
	at org.apache.hadoop.fs.FSDataOutputStream.close(FSDataOutputStream.java:101)
	at org.apache.hadoop.hdfs.TestDecommission.testCloseWhileDecommission(TestDecommission.java:708)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:498)
	at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
	at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
	at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
	at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
	at org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
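
The "roughly 12 seconds" retry budget mentioned above is consistent with exponential backoff. The following sketch assumes an initial delay of 400 ms that doubles after each failed attempt. Neither value is stated in this report; they are believed to match the client defaults (dfs.client.block.write.locateFollowingBlock.initial.delay.ms, with the retry count from dfs.client.block.write.locateFollowingBlock.retries) and should be checked against the Hadoop version in use. `CloseRetryBudget` is a hypothetical name used only for this arithmetic.

```java
// Arithmetic sketch only; the 400 ms initial delay and the doubling are assumptions.
final class CloseRetryBudget {
    // Total time slept across all retries when the delay doubles after each one.
    static long totalSleepMs(int retries, long initialDelayMs) {
        long total = 0;
        long delay = initialDelayMs;
        for (int i = 0; i < retries; i++) {
            total += delay;
            delay *= 2;
        }
        return total;
    }

    public static void main(String[] args) {
        // 400 + 800 + 1600 + 3200 + 6400 = 12400 ms
        System.out.println(totalSleepMs(5, 400) + " ms");
    }
}
```

Under the same assumptions, one additional retry roughly doubles the budget (25200 ms for 6 retries), so raising the retry count only lengthens the window in which decommissioning must finish; it does not remove the failure mode.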

Once the exception is thrown, the client usually does not attempt to close the file again, so the file remains open and the last block remains under-replicated.

Subsequently, an administrator ran the recoverLease tool to salvage the file, but the attempt failed because the block remained under-replicated. It is not clear why the block was never replicated. The administrators nevertheless concluded that the file was corrupt, because fsck -openforwrite still reported it as open and its modification time was hours old.

In summary, I do not think close() should fail because the last block is being decommissioned. The block has a sufficient number of replicas; it is just that some of them are on DataNodes being decommissioned. Decommissioning should be transparent to clients.

This issue seems to be more prominent on very large clusters with the minimum replication factor set to 2.

Attachments

- HDFS-11486-branch-2.8.003.patch, 28/Mar/17 02:02, 3 kB, Masatake Iwasaki
- HDFS-11486.test-inmaintenance.patch, 03/Mar/17 11:29, 5 kB, Yiqun Lin
- HDFS-11486.003.patch, 27/Mar/17 18:44, 3 kB, Wei-Chiu Chuang
- HDFS-11486.002.patch, 09/Mar/17 21:03, 2 kB, Wei-Chiu Chuang
- HDFS-11486.001.patch, 09/Mar/17 16:43, 7 kB, Wei-Chiu Chuang
- HDF-11486.test.patch, 02/Mar/17 18:20, 2 kB, Wei-Chiu Chuang

Issue Links

duplicates: HDFS-11499 Decommissioning stuck because of failing recovery (Resolved)
relates to: HDFS-11499 Decommissioning stuck because of failing recovery (Resolved)

People

Assignee: Wei-Chiu Chuang
Reporter: Wei-Chiu Chuang
Votes: 0
Watchers: 15

Dates

Created: 02/Mar/17 18:16
Updated: 02/Oct/19 17:14
Resolved: 28/Mar/17 09:56