SLIDER-202 (sub-task of SLIDER-69 Uber JIRA: Slider apps to withstand failures)

add integral/configurable chaos monkey to slider AM


Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: Slider 0.40
    • Fix Version/s: None
    • Component/s: appmaster, test
    • Labels: None
    • Sprint: Slider August #1, Slider August #2, Slider September #1

    Description

      The Slider funtests trigger container and AM failures from the test code via the RPC calls "killContainer" and "amSuicide". This works for standalone tests, but complicates functional test deployments where we can't easily run such logic client-side.

      Proposed: a built-in chaos monkey service in the AM which, when enabled, checks at a configured interval whether a container or AM restart should be triggered, based on preconfigured probabilities.
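
      To make the proposal concrete, here is a minimal sketch of the periodic check, written in Python for brevity rather than as Slider code. The class name `ChaosMonkey` and the methods `add_target` and `tick` are hypothetical; the sketch assumes each target is rolled independently on every tick, and that the caller invokes `tick()` once per configured interval.

```python
import random
from typing import Callable, List, Optional, Tuple


class ChaosMonkey:
    """Hypothetical sketch: on each tick, roll once per registered target."""

    def __init__(self, rng: Optional[random.Random] = None) -> None:
        # An injectable RNG keeps the behaviour reproducible in tests.
        self.rng = rng or random.Random()
        self.targets: List[Tuple[str, float, Callable[[], None]]] = []

    def add_target(self, name: str, probability: float,
                   action: Callable[[], None]) -> None:
        """Register a failure action that fires with the given probability."""
        if not 0.0 <= probability <= 1.0:
            raise ValueError(f"probability out of range: {probability}")
        self.targets.append((name, probability, action))

    def tick(self) -> List[str]:
        """Run one check; return the names of the targets that fired."""
        fired = []
        for name, probability, action in self.targets:
            # random() is in [0, 1), so 0.0 never fires and 1.0 always fires.
            if self.rng.random() < probability:
                action()
                fired.append(name)
        return fired
```

      In a real AM the actions would presumably do what the existing "killContainer" and "amSuicide" calls do; here they are plain callables so the sketch stays self-contained.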

      This will be embedded and can be used in production if desired.
      e.g., short test:

      interval = 60s, P(AM)=0.1, P(Container)=1.0
      

      long test:

      interval = 600s, P(AM)=0.01, P(Container)=0.5
      

      production:

      interval = 24h, P(AM)=0., P(Container)=0.1
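
      As a rough guide to what these three settings imply, the sketch below computes the expected number of triggered failures per day for each one. It assumes one independent roll per interval, that restarts do not delay the schedule, and that "P(AM)=0." means 0.0; the names `PROFILES` and `expected_per_day` are illustrative only.

```python
DAY_SECONDS = 24 * 60 * 60

# The three example settings from the description, with intervals in seconds.
PROFILES = {
    "short test": {"interval_s": 60, "p_am": 0.1, "p_container": 1.0},
    "long test": {"interval_s": 600, "p_am": 0.01, "p_container": 0.5},
    "production": {"interval_s": DAY_SECONDS, "p_am": 0.0, "p_container": 0.1},
}


def expected_per_day(interval_s: float, probability: float) -> float:
    """Expected triggers per day: checks per day times the per-check probability."""
    return (DAY_SECONDS / interval_s) * probability


if __name__ == "__main__":
    for name, p in PROFILES.items():
        am = expected_per_day(p["interval_s"], p["p_am"])
        container = expected_per_day(p["interval_s"], p["p_container"])
        print(f"{name}: AM restarts/day={am:.2f}, container kills/day={container:.2f}")
```

      Under these assumptions the short test expects about 144 AM restarts and 1440 container kills per day, the long test about 1.44 and 72, and the production setting no AM restarts and one container kill every ten days on average.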
      


            People

              Assignee: Steve Loughran (stevel@apache.org)
              Reporter: Steve Loughran (stevel@apache.org)
              Votes: 0
              Watchers: 2


                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 4h