Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
2.1.0-beta
-
None
-
Reviewed
Description
FileSystemRMStateStore writes directly to the destination file when storing state. However if the RM were to crash in the middle of the write, the recovery method could encounter a partially-written file and either outright crash during recovery or silently load incomplete state.
To avoid this, the data should be written to a temporary file and renamed to the destination file afterwards.
Attachments
Attachments
Issue Links
- is duplicated by
-
YARN-1255 RM fails to start up with Failed to load/recover state error in a HA setup
- Resolved