Details
-
Bug
-
Status: Closed
-
Critical
-
Resolution: Fixed
-
0.1.0
-
None
-
None
-
195 node cluster running redhat linux (2.6 kernel)
Description
In some failure modes, the pending creates list is not cleaned up and prevents that file from ever being created.
When I try to create the file after the first job was killed (hours previously), I get:
060404 084619 Cannot start file because pendingCreates is non-null. src=/user/oom/rand/part000118
060404 084619 Server handler 0 on 8020 call error: java.io.IOException: Cannot create file /user/oom/rand/part000118 on client DFSClient_-1656137458
java.io.IOException: Cannot create file /user/oom/rand/part000118 on client DFSClient_-1656137458
at org.apache.hadoop.dfs.NameNode.create(NameNode.java:147)
at sun.reflect.GeneratedMethodAccessor8.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)