Uploaded image for project: 'Oozie'
  1. Oozie
  2. OOZIE-2467

Oozie can shutdown itself on long GC pause

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.3.0
    • Component/s: None
    • Labels:
      None

      Description

      There are two issues.
      1. Oozie doesn't set ZK session timeout. So, curator uses default session timeout which is 1 min.
      If there is GC pause for more than 1 min, or frequent GC pause because of which curator can't send heartbeat, the connection will be dropped and Oozie will shutdown itself.

      2. Oozie uses ExponentialBackoffRetry with initial wait = 1sec and max retries = 3.
      so Oozie will only retry 3 times, with duration 1sec, 2 sec and 4 sec. We need increase max retries count to at least 10.

        Attachments

        1. OOZIE-2467-V2.patch
          7 kB
          Purshotam Shah
        2. OOZIE-2467-V1.patch
          7 kB
          Purshotam Shah

          Issue Links

            Activity

              People

              • Assignee:
                puru Purshotam Shah
                Reporter:
                puru Purshotam Shah
              • Votes:
                0 Vote for this issue
                Watchers:
                4 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: