Uploaded image for project: 'Ambari'
  1. Ambari
  2. AMBARI-12867

Do Not Automatically Abort Stack Repository Installation When A Host Timed Out

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 2.1.0
    • Fix Version/s: 2.1.2
    • Component/s: ambari-server
    • Labels:
      None

      Description

      On 1000 node RU I had 2.3.0.0-2557 installed with some 20 hosts down with heartbeat lost. Then I registered 2.3.2.0-2664 and when I proceeded to install, it would always get aborted with no logs in server or agents.

      Turns out that whenever we install, we do so in stages containing 100 hosts each. If any of the host failed or timed out etc., the rest of the stages are aborted. So in this case the first stage had 1 host timeout, which resulted in that and other stages being aborted.

      I cannot install a version without all hosts being alive. Workaround seems to be to delete lost hosts from Ambari.

        Issue Links

          Activity

          Hide
          hudson Hudson added a comment -

          ABORTED: Integrated in Ambari-trunk-Commit #3318 (See https://builds.apache.org/job/Ambari-trunk-Commit/3318/)
          AMBARI-12867 - Do Not Automatically Abort Stack Repository Installation When A Host Timed Out (jonathanhurley (jhurley: http://git-wip-us.apache.org/repos/asf?p=ambari.git&a=commit&h=bba679959b3edc16db507faeacd84d33167bbcf4)

          • ambari-server/src/main/java/org/apache/ambari/server/controller/internal/ClusterStackVersionResourceProvider.java
          • ambari-server/src/test/java/org/apache/ambari/server/controller/internal/ClusterStackVersionResourceProviderTest.java
          • ambari-server/src/main/java/org/apache/ambari/server/Role.java
          Show
          hudson Hudson added a comment - ABORTED: Integrated in Ambari-trunk-Commit #3318 (See https://builds.apache.org/job/Ambari-trunk-Commit/3318/ ) AMBARI-12867 - Do Not Automatically Abort Stack Repository Installation When A Host Timed Out (jonathanhurley (jhurley: http://git-wip-us.apache.org/repos/asf?p=ambari.git&a=commit&h=bba679959b3edc16db507faeacd84d33167bbcf4 ) ambari-server/src/main/java/org/apache/ambari/server/controller/internal/ClusterStackVersionResourceProvider.java ambari-server/src/test/java/org/apache/ambari/server/controller/internal/ClusterStackVersionResourceProviderTest.java ambari-server/src/main/java/org/apache/ambari/server/Role.java
          Hide
          hudson Hudson added a comment -

          FAILURE: Integrated in Ambari-branch-2.1 #418 (See https://builds.apache.org/job/Ambari-branch-2.1/418/)
          AMBARI-12867 - Do Not Automatically Abort Stack Repository Installation When A Host Timed Out (jonathanhurley) (jhurley: http://git-wip-us.apache.org/repos/asf?p=ambari.git&a=commit&h=20346a34c497dcfb8ef3bd5cbcd9c867dd2ec474)

          • ambari-server/src/test/java/org/apache/ambari/server/controller/internal/ClusterStackVersionResourceProviderTest.java
          • ambari-server/src/main/java/org/apache/ambari/server/controller/internal/ClusterStackVersionResourceProvider.java
          • ambari-server/src/main/java/org/apache/ambari/server/Role.java
          Show
          hudson Hudson added a comment - FAILURE: Integrated in Ambari-branch-2.1 #418 (See https://builds.apache.org/job/Ambari-branch-2.1/418/ ) AMBARI-12867 - Do Not Automatically Abort Stack Repository Installation When A Host Timed Out (jonathanhurley) (jhurley: http://git-wip-us.apache.org/repos/asf?p=ambari.git&a=commit&h=20346a34c497dcfb8ef3bd5cbcd9c867dd2ec474 ) ambari-server/src/test/java/org/apache/ambari/server/controller/internal/ClusterStackVersionResourceProviderTest.java ambari-server/src/main/java/org/apache/ambari/server/controller/internal/ClusterStackVersionResourceProvider.java ambari-server/src/main/java/org/apache/ambari/server/Role.java

            People

            • Assignee:
              jonathan.hurley Jonathan Hurley
              Reporter:
              jonathan.hurley Jonathan Hurley
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development