Uploaded image for project: 'Oozie'
  1. Oozie
  2. OOZIE-3156

Retry SSH action check when cannot connect to remote host

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 5.0.0
    • Fix Version/s: 5.1.0
    • Component/s: action
    • Labels:
      None
    • Flags:
      Patch

      Description

      When check() method of SshActionExecutor gets invoked, oozie will ssh connect to the host and check whether the pid of the process that ssh action started is still there (by checking the returned value of command "ssh <host-ip> ps -p <pid>" ) to determine whether ssh action completes or not.

      However, we found cases where oozie fails to connect to host during action status check (e.g., the host is under heavy load, or network is bad etc.).

      In such cases, the return value of command "ssh <host-ip> ps -p <pid>" will be 255 (ssh command exits with the exit status of the remote command or with 255 if an error occurred.).

      According the current logic of method getActionStatus() in SshActionExecutor, the action status will be determined as OK which may not be correct.

        Attachments

        1. ssh-check-bug.patch
          3 kB
          TIAN XING
        2. OOZIE-3156-v6.patch
          17 kB
          TIAN XING
        3. OOZIE-3156-v5.patch
          17 kB
          TIAN XING
        4. OOZIE-3156-v4.patch
          7 kB
          TIAN XING
        5. OOZIE-3156-v3.patch
          6 kB
          TIAN XING
        6. OOZIE-3156-v2.patch
          6 kB
          TIAN XING
        7. OOZIE-3156-v1.patch
          6 kB
          TIAN XING
        8. amend-OOZIE-3156.patch
          0.6 kB
          Mate Juhasz

          Activity

            People

            • Assignee:
              txsing TIAN XING
              Reporter:
              txsing TIAN XING
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: