Details
-
Bug
-
Status: Closed
-
Minor
-
Resolution: Duplicate
-
1.9.0
-
None
Description
The task cannot fail itself because it's stuck on the disposal of RocksDB, which also affects the job. I saw this for several times in recent months, most of the errors come from the broken disk. But I think we should also do something to deal with it more elegantly from Flink's perspective.
"LookUp_Join -> Sink_Unnamed (898/1777)- execution # 4" #411 prio=5 os_prio=0 tid=0x00007fc9b0286800 nid=0xff6fc runnable [0x00007fc966cfc000] java.lang.Thread.State: RUNNABLE at org.rocksdb.RocksDB.disposeInternal(Native Method) at org.rocksdb.RocksObject.disposeInternal(RocksObject.java:37) at org.rocksdb.AbstractImmutableNativeReference.close(AbstractImmutableNativeReference.java:57) at org.apache.flink.util.IOUtils.closeQuietly(IOUtils.java:263) at org.apache.flink.contrib.streaming.state.RocksDBKeyedStateBackend.dispose(RocksDBKeyedStateBackend.java:349) at org.apache.flink.streaming.api.operators.AbstractStreamOperator.dispose(AbstractStreamOperator.java:371) at org.apache.flink.streaming.api.operators.AbstractUdfStreamOperator.dispose(AbstractUdfStreamOperator.java:124) at org.apache.flink.streaming.runtime.tasks.StreamTask.disposeAllOperators(StreamTask.java:618) at org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:517) at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:733) at org.apache.flink.runtime.taskmanager.Task.run(Task.java:539) at java.lang.Thread.run(Thread.java:748)
Attachments
Issue Links
- duplicates
-
FLINK-5463 RocksDB.disposeInternal does not react to interrupts, blocks task cancellation
- Reopened