Uploaded image for project: 'Ambari'
  1. Ambari
  2. AMBARI-22864

Agent commands hang even after freeing up disk space on the host

    XMLWordPrintableJSON

Details

    Description

      STR

      1. Install a cluster with Ambari-2.6.2 and HDP-2.6.4.0
      2. Go the host (say host1) running Nimbus component and restart Nimbus
      3. Fill up the disk space on host1 (in my test, the disk space was filled up on the host running Nimbus component)
      4. Try to restart Nimbus. Nimbus restart expectedly fails with error:
        Caught an exception while executing custom service command: <type 'exceptions.IOError'>: [Errno 28] No space left on device; [Errno 28] No space left on device
        
      5. Now free up the disk space on host1 and try to restart Nimbus

       

      Result
      Nimbus restart command hangs and eventually times out

      Looks like the issue is because the action queue is unable to create new command for Nimbus restart.

      Attachments

        1. AMBARI-22864.patch
          1 kB
          Andrew Onischuk
        2. AMBARI-22864.patch
          1 kB
          Andrew Onischuk
        3. AMBARI-22864.patch
          1 kB
          Andrew Onischuk
        4. AMBARI-22864.patch
          1 kB
          Andrew Onischuk

        Activity

          People

            aonishuk Andrew Onischuk
            shavi71 Vivek Sharma
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 1h 10m
                1h 10m