Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
None
-
None
-
None
Description
The test TestFLush frequently fails in recent patchs. I looked for the failed log records. I found there were two reason lead this test TestFLush#testHFlushInterrupted to be failed. And this method failure is the main reason that TestFLush not pass test. The two failed reasons is below:
Tests run: 14, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 30.325 sec <<< FAILURE! - in org.apache.hadoop.hdfs.TestHFlush testHFlushInterrupted(org.apache.hadoop.hdfs.TestHFlush) Time elapsed: 0.864 sec <<< ERROR! java.io.IOException: The stream is closed at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:118) at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:82) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:140) at java.io.DataOutputStream.flush(DataOutputStream.java:123) at java.io.FilterOutputStream.close(FilterOutputStream.java:158) at org.apache.hadoop.hdfs.DataStreamer.closeStream(DataStreamer.java:877) at org.apache.hadoop.hdfs.DataStreamer.closeInternal(DataStreamer.java:726) at org.apache.hadoop.hdfs.DataStreamer.run(DataStreamer.java:721)
testHFlushInterrupted(org.apache.hadoop.hdfs.TestHFlush) Time elapsed: 0.862 sec <<< ERROR!
java.nio.channels.ClosedByInterruptException: null
at java.nio.channels.spi.AbstractInterruptibleChannel.end(AbstractInterruptibleChannel.java:202)
at sun.nio.ch.SocketChannelImpl.write(SocketChannelImpl.java:501)
at org.apache.hadoop.net.SocketOutputStream$Writer.performIO(SocketOutputStream.java:63)
at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:142)
at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:159)
at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:117)
at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:82)
at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:140)
at java.io.DataOutputStream.flush(DataOutputStream.java:123)
at org.apache.hadoop.hdfs.DataStreamer.run(DataStreamer.java:653)
I analysed them, can be simplify described as below:
- The IOException happens when the stream is closed but the stream write operation continues.
- The ClosedByInterruptException happens when stream do hfulsh operations and thread interrupt happens.
So we should catch these exceptions in stream hflush and write operations.
Attachments
Attachments
Issue Links
- relates to
-
HDFS-2043 TestHFlush failing intermittently
- Closed