[HBASE-20519] [Chaos] Add more chaos options - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Umbrella
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: integration tests
Labels:
- beginner

Tags:
beginner

Description

Our Chaos menu is "drawing room polite" given the variety of failures available out in the wild world of deploys.

Other possible items to add (could do as subtasks of this umbrella) taken from a recent interesting read on how TiDB does its chaos:

Send SIGSTOP to hang or SIGCONT to resume the process.
Use `renice` to adjust the process priority or use `setpriority` for the threads of the process.
Max out the CPU.
Use `iptables` or `tc` to drop or reject the network packages or delay the network packages.
Use `tc` to reorder the network packages and use a proxy to reorder the gRPC requests.
Use `iperf` to take all network throughput.
Use `libfuse` to mount a file system and do the I/O fault injection.
Link `libfiu` to do the I/O fault injection.
Use `rm -rf` forcbily to remove all data.
Use `echo 0 > file` to damage a file.
Copy a huge file to create the `NoSpace` problem.

The article includes other interesting possibilities: exploiting the kernels fault injection mechanism or scripting systemtap to mess with nodes. It also describes how they automate their chaos-making.

Attachments

Sub-Tasks

1.	Send SIGSTOP to hang or SIGCONT to resume rs and add graceful rolling restart	Resolved	Szabolcs Bukros
2.	Network and Data related Actions	Resolved	Szabolcs Bukros
3.	Add an option to Actions to filter out meta RS	Resolved	Szabolcs Bukros

Activity

People

Assignee:: Unassigned

Reporter:: Michael Stack

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 02/May/18 16:51

Updated:: 02/May/18 16:51