Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
1.5.0
-
None
-
None
-
commit.id=c9dbfbd
2 nodes with 32 cores and 32GB of max direct memory for drill
Description
After running the following queries (it's the same reproduction from DRILL-2274):
set planner.memory.max_query_memory_per_node=8589934592; select sub1.uid from `all2274.json` sub1 inner join `all2274.json` sub2 on sub1.uid = sub2.uid order by sub1.uid;
After the query starts returning results, I cancelled the query from sqlline. This caused the query to hang in a CANCELLATION_REQUESTED state.
Looking at jstack (attached) the root fragment is blocked waiting for Ack from the client.
The the foreman node (which also runs Zookeeper) runs out of disk space once the query finishes spilling, which seems to contribute to the issue. Once I changed the spill directory to nfs I no longer so the issue.
Attachments
Attachments
Issue Links
- is related to
-
DRILL-5599 Notify StatusHandlerListener that batch sending has failed even if channel is still open
- Resolved