Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
0.20-append
-
None
Description
the stallReplicationWork member in FSNamesystem is accessed by multiple threads without synchronization, but isn't marked volatile. I believe this is responsible for about 1% failure rate on TestFileAppend4.testAppendSyncChecksum* on my 8-core test boxes (looking at logs I see replication happening even though we've supposedly disabled it)