[HADOOP-1989] Add support for simulated Data Nodes - helpful for testing and performance benchmarking of the Name Node without having a large cluster - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: 0.16.0
Fix Version/s: 0.16.0
Component/s: None
Labels:
None

Description

Proposal is to add an implementation for a Simulated Data Node.
This will

allow one to test certain parts of the system (especially the Name Node, protocols) much more easily and efficiently.
allow one to run performance benchmarks on the Name node without having a large cluster.
Inject faults for testing (e.g. one can add random faults based probability parameters).

The idea is that the Simulated Data Node will

discard any data written to blocks (but remember the blocks and their sizes)
generate fixed data on the fly when blocks are read (e.g. block is fixed set of bytes or repeated sequence of strings).

The Simulated Data Node can also be used for fault injection.
The data node can be parameterized with probabilities that allow one to control:

Delays on reads and writes, creates, etc
IO Exceptions
Loss of blocks
Failures

Attachments

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

SimulatedStoragePatchSubmit.txt
17/Oct/07 17:04
70 kB
Sanjay Radia
SimulatedStoragePatchSubmit5.txt
05/Nov/07 23:55
84 kB
Sanjay Radia
SimulatedStoragePatchSubmit6.txt
07/Nov/07 21:56
86 kB
Sanjay Radia
SimulatedStoragePatchSubmit7.txt
09/Nov/07 02:17
86 kB
Sanjay Radia
SimulatedStoragePatchSubmit8.txt
15/Nov/07 00:31
86 kB
Sanjay Radia
SimulatedStoragePatchSubmit9.patch
15/Jan/08 08:04
1.0 kB
Sanjay Radia

Activity

People

Assignee:: Sanjay Radia

Reporter:: Sanjay Radia

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 03/Oct/07 17:55

Updated:: 08/Jul/09 16:42

Resolved:: 17/Jan/08 01:25