Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Won't Do
-
None
-
None
-
None
Description
When OM statemachine receives any unknown failure (other than IOException type), it results in termination of OM.
Its observed that same failure is applied to other om nodes, while applyTransaction, as raft log is already replicated to minimum number of nodes in quorum before applyTransaction.
Till recovery method is not applied, OM remains down. So as to provide read only service, can move OM to read only mode, where read operation is allowed over leader.
Few points in consideration:
- Need block applyTransaction of raft log in readonly mode to avoid running transaction on previous failed operation
- Need support Read operation over read only mode (as leader election will happen, but leader will not be ready till latest transaction is updated)
Attachments
Issue Links
- links to