Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
Description
Currently, if a tablet's data directory group runs out of space during a MRS or DMS flush, the operation will fail, and the tserver will crash, as MRS and DMS flush failures are fatal without the proper care. For disk failures, this "care" meant ensuring that upon failing the op, the tablet has started the process of shutting down and being failed so it can be replicated elsewhere. No such handling currently exists for full disks, although it wouldn't be unreasonable to apply the same or similar steps.
Attachments
Issue Links
- is duplicated by
-
KUDU-761 TS seg fault after short log append
- Resolved
- is related to
-
KUDU-3114 tserver writes core dump when reporting 'out of space'
- Reopened
-
KUDU-2405 Sanity-check tablet copies for full disks
- Open
-
KUDU-2628 Allow hot-swapping of failed data directories
- Open
-
KUDU-1172 Enable deleting tablets before or while they bootstrap
- Open
-
KUDU-2795 Prevent cascading failures by detecting that disks are full and rejecting attempts to add additional replicas to a tablet server
- Open
- relates to
-
KUDU-2577 Support rebalancing data allocation across directories when adding a new data dir
- Open