Details
-
Sub-task
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
YARN-2928
-
Reviewed
Description
I have the RM recovery enabled. I see that upon restart the RM populates records into flow activity and flow runs but with wrong timestamps. What I mean by the timestamp is the part of the row key:
- flow activity: row created with the day of the RM restart
- flow run: row created with the RM start time as the "run id"
The following illustrates an example flow run:
metrics: [ ], events: [ ], id: "sjlee@Sleep job/1463433569917", type: "YARN_FLOW_RUN", createdtime: 1463422860987, info: { UID: "yarn_cluster!sjlee!Sleep job!1463433569917", SYSTEM_INFO_FLOW_RUN_ID: 1463433569917, SYSTEM_INFO_FLOW_NAME: "Sleep job", SYSTEM_INFO_FLOW_RUN_END_TIME: 1463422865033, SYSTEM_INFO_USER: "sjlee" }, isrelatedto: { }, relatesto: { }
The created time and the end time are correct (i.e. original time), whereas the timestamp in the row key (= run id: 1463433569917) is actually later than the end time and coincides with the RM restart.