AMBARI-17198

Failure in Mahout package installation on retry is not reported correctly, causing EU to fail


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 2.4.0
    • Fix Version/s: 2.4.0
    • Component/s: ambari-server
    • Labels: None

    Description

      Steps
      1. With an Ambari 2.2.2 build, deploy an HDP 2.4.0.0 cluster.
      2. Register the bits for HDP-2.4.2.0-195 and start package installation.
      3. Observe an error on the first package installation attempt on one of the hosts.

      
      

      stderr: /var/lib/ambari-agent/data/errors-560.txt

      No handlers could be found for logger "root"
      2016-04-14 01:22:09,756 - Caught signal 15, will handle it gracefully. Compute the actual version if possible before exiting.
      2016-04-14 01:22:09,785 - Package Manager failed to install packages. Error: (4, 'Interrupted system call')
      Traceback (most recent call last):
        File "/var/lib/ambari-agent/cache/custom_actions/scripts/install_packages.py", line 386, in install_packages
          retry_count=agent_stack_retry_count)
        File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 154, in __init__
          self.env.run()
        File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 160, in run
          self.run_action(resource, action)
        File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 124, in run_action
          provider_action()
        File "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/__init__.py", line 54, in action_install
          self.install_package(package_name, self.resource.use_repos, self.resource.skip_repos)
        File "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/zypper.py", line 45, in install_package
          active_base_repos = self.get_active_base_repos()
        File "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/zypper.py", line 73, in get_active_base_repos
          (code, output) = self.call_with_retries(LIST_ACTIVE_REPOS_CMD)
        File "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/__init__.py", line 80, in call_with_retries
          return self._call_with_retries(cmd, is_checked=False, **kwargs)
        File "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/__init__.py", line 91, in _call_with_retries
          code, out = func(cmd, **kwargs)
        File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 70, in inner
          result = function(command, **kwargs)
        File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 105, in call
          tries=tries, try_sleep=try_sleep)
        File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 140, in _call_wrapper
          result = _call(command, **kwargs_copy)
        File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 240, in _call
          ready, _, _ = select.select(read_set, [], [], 1)
      error: (4, 'Interrupted system call')

      Python script has been killed due to timeout after waiting 1800 secs
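
      The final frame of the traceback above is a select.select call that was interrupted by a signal (errno 4, EINTR). On Python 2.6 this surfaces as select.error, so the caller has to decide explicitly whether to retry or to report a failure. The following is a minimal sketch of a retrying wrapper, not the change made in the AMBARI-17198 patches; the function name select_retry_eintr and the select_fn and clock parameters are hypothetical and exist only for illustration and testing. Whether retrying is appropriate after signal 15 depends on what the signal handler intends, which this ticket does not state. Python 3.5 and later already retry interrupted system calls automatically (PEP 475) unless the signal handler raises.

```python
import errno
import select
import time


def select_retry_eintr(read_set, timeout, select_fn=select.select, clock=time.time):
    """Wait for readable descriptors, retrying if a signal interrupts the call.

    Hypothetical helper: select_fn and clock are injectable so the
    behavior can be exercised without real file descriptors.
    """
    deadline = clock() + timeout
    while True:
        # Shrink the timeout on each retry so the total wait stays bounded.
        remaining = max(0.0, deadline - clock())
        try:
            return select_fn(read_set, [], [], remaining)
        except (select.error, OSError) as err:
            # select.error carries the errno as its first argument on
            # Python 2; on Python 3 it is an alias of OSError.
            if err.args[0] != errno.EINTR:
                raise  # any other failure is a real error, report it
```

      A caller would replace the direct call with ready, _, _ = select_retry_eintr(read_set, 1). Any error other than EINTR still propagates, so a genuine failure is not hidden by the retry.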

      Attachments

        1. AMBARI-17198.patch.1
          8 kB
          Dmytro Grinenko
        2. AMBARI-17198.patch
          14 kB
          Dmytro Grinenko

            People

              dmitriusan Dmitry Lysnichenko
              hapylestat Dmytro Grinenko
              Votes: 0
              Watchers: 2
