Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-11008

webhcat GET /jobs retries on getting job details from history server is too agressive

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.2.0
    • Fix Version/s: None
    • Component/s: WebHCat
    • Labels:
      None

      Description

      Webhcat "jobs" api gets the list of jobs from RM and then gets details from history server.

      RM has a policy of retaining fixed number of jobs to accommodate for the memory it has, while HistoryServer retains jobs based on their age. As a result, jobs that RM returns might not be present in HistoryServer and can result in a failure. HistoryServer also ends up retrying on failures even if they happen because the job actually does not exist.
      The retries to get details from HistoryServer in such cases is too aggressive.

        Attachments

        1. HIVE-11008.1.patch
          2 kB
          Thejas Nair

          Issue Links

            Activity

              People

              • Assignee:
                thejas Thejas Nair
                Reporter:
                thejas Thejas Nair
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated: