Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-17481 LLAP workload management
  3. HIVE-18078

WM getSession needs some retry logic

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 3.0.0
    • None
    • None

    Description

      When we get a bad session (e.g. no registry info because AM has gone catatonic), the failure by the timeout future fails the getSession call.
      The retry model in TezTask is that it would get a session (which in original model can be completely unusable, but we still get the object), and then retry (reopen) if it's a lemon. If the reopen fails, we fail.
      getSession is not covered by this retry scheme, and should thus do its own retries (or the retry logic needs to be changed)

      Attachments

        1. HIVE-18078.01.patch
          87 kB
          Sergey Shelukhin
        2. HIVE-18078.01.patch
          88 kB
          Sergey Shelukhin
        3. HIVE-18078.04.patch
          9 kB
          Sergey Shelukhin
        4. HIVE-18078.05.patch
          9 kB
          Sergey Shelukhin
        5. HIVE-18078.05.patch
          9 kB
          Sergey Shelukhin
        6. HIVE-18078.only.patch
          10 kB
          Sergey Shelukhin
        7. HIVE-18078.patch
          87 kB
          Sergey Shelukhin

        Issue Links

          Activity

            People

              sershe Sergey Shelukhin
              sershe Sergey Shelukhin
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: