Description
SPARK-25871 introduced a regression in the streaming WAL – it no longer makes all the parent dirs, so you may see an exception like this in cases that used to work:
18/11/09 03:31:48 ERROR util.FileBasedWriteAheadLog_ReceiverSupervisorImpl: Failed to write to write ahead log after 3 failures ... org.apache.spark.SparkException: Exception thrown in awaitResult: at org.apache.spark.util.ThreadUtils$.awaitResult(ThreadUtils.scala:226) at org.apache.spark.streaming.receiver.WriteAheadLogBasedBlockHandler.storeBlock(ReceivedBlockHandler.scala:210) ... Caused by: java.io.FileNotFoundException: Parent directory doesn't exist: /tmp/__spark__1e8ba184-d323-47eb-b857-0e6285409424/88992/checkpoints/receivedData/0 at org.apache.hadoop.hdfs.server.namenode.FSDirectory.verifyParentDir(FSDirectory.java:1923)
Attachments
Issue Links
- is caused by
-
SPARK-25871 Streaming WAL should not use hdfs erasure coding, regardless of FS defaults
- Resolved
- links to