Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
1.2.0
-
None
-
None
Description
Webhcat "jobs" api gets the list of jobs from RM and then gets details from history server.
RM has a policy of retaining fixed number of jobs to accommodate for the memory it has, while HistoryServer retains jobs based on their age. As a result, jobs that RM returns might not be present in HistoryServer and can result in a failure. HistoryServer also ends up retrying on failures even if they happen because the job actually does not exist.
The retries to get details from HistoryServer in such cases is too aggressive.
Attachments
Attachments
Issue Links
- mentioned in
-
Wiki Page Loading...