[HADOOP-2423] The codes in FSDirectory.mkdirs(...) is inefficient. - ASF JIRA

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.15.1
Fix Version/s: 0.17.0
Component/s: None
Labels: None

Release Note:
Improved FSDirectory.mkdirs(...) performance. In NNThroughputBenchmark-create, ops per second improved by ~54%.

Description

FSDirectory.mkdirs(...) creates a List<String> v to store every directory along the path, e.g.

//Suppose 
src = "/foo/bar/bas/"
//Then,
v = {"/", "/foo", "/foo/bar", "/foo/bar/bas"}

For each directory string cur in v, whether or not cur already exists, it tries to call unprotectedMkdir(cur, ...). Then cur is parsed into a byte[][] in INodeDirectory.addNode(...).
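
To make the cost concrete, here is a minimal sketch, not the actual FSDirectory code. The class PrefixListSketch and both methods are hypothetical names. It builds the same prefix list as the example above and counts how many path components get parsed when every prefix is split again on its own (only non-empty names are counted, which is an assumption of this sketch).

```java
import java.util.ArrayList;
import java.util.List;

public class PrefixListSketch {
    /** Builds every ancestor path of src, root first (hypothetical helper). */
    static List<String> prefixes(String src) {
        List<String> v = new ArrayList<>();
        v.add("/");
        StringBuilder cur = new StringBuilder();
        for (String part : src.split("/")) {
            if (part.isEmpty()) {
                continue; // skip the empty segment before the leading "/"
            }
            cur.append('/').append(part);
            v.add(cur.toString());
        }
        return v;
    }

    /** Counts the components parsed if each prefix is split independently. */
    static int componentsParsed(List<String> v) {
        int total = 0;
        for (String p : v) {
            for (String part : p.split("/")) {
                if (!part.isEmpty()) {
                    total++;
                }
            }
        }
        return total;
    }

    public static void main(String[] args) {
        List<String> v = prefixes("/foo/bar/bas/");
        System.out.println(v);                   // [/, /foo, /foo/bar, /foo/bar/bas]
        System.out.println(componentsParsed(v)); // 6, versus 3 for a single parse
    }
}
```

For a path of depth n, re-parsing every prefix touches n(n+1)/2 components in this sketch, while parsing the full path once touches n.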

We don't need to parse each string in v. Instead, the parsed byte[][] should be stored and reused. Also, the loop should stop once it finds an existing subdirectory.
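
One way to read this proposal is sketched below. It is an illustration under stated assumptions, not the attached patch: MkdirsSketch, Dir, parse and mkdirs are hypothetical names, and the in-memory map stands in for the real INodeDirectory tree. The path is parsed into byte[][] once, the walk skips over components that already exist, and only the missing tail is created.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class MkdirsSketch {
    /** Hypothetical stand-in for a directory inode; not the real INodeDirectory. */
    static final class Dir {
        // ByteBuffer keys compare by content, unlike raw byte[] keys.
        final Map<ByteBuffer, Dir> children = new HashMap<>();
    }

    /** Parses the path once into its components, skipping empty segments. */
    static byte[][] parse(String src) {
        List<byte[]> parts = new ArrayList<>();
        for (String s : src.split("/")) {
            if (!s.isEmpty()) {
                parts.add(s.getBytes(StandardCharsets.UTF_8));
            }
        }
        return parts.toArray(new byte[0][]);
    }

    /** Creates the missing directories on src and returns how many were created. */
    static int mkdirs(Dir root, String src) {
        byte[][] components = parse(src); // single parse, reused below
        Dir cur = root;
        int i = 0;
        // Follow components that already exist without trying to create them.
        while (i < components.length) {
            Dir next = cur.children.get(ByteBuffer.wrap(components[i]));
            if (next == null) {
                break;
            }
            cur = next;
            i++;
        }
        // Create only the remaining, missing components.
        int created = 0;
        for (; i < components.length; i++) {
            Dir d = new Dir();
            cur.children.put(ByteBuffer.wrap(components[i]), d);
            cur = d;
            created++;
        }
        return created;
    }

    public static void main(String[] args) {
        Dir root = new Dir();
        System.out.println(mkdirs(root, "/foo/bar/bas/")); // 3
        System.out.println(mkdirs(root, "/foo/bar/bas/")); // 0
        System.out.println(mkdirs(root, "/foo/bar/qux"));  // 1
    }
}
```

The sketch walks down from the root; scanning from the deepest prefix upward and stopping at the first existing directory would be an equally valid reading of the description.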

Attachments

2423_20080311.patch   11/Mar/08 18:05   7 kB    Tsz-wo Sze
2423_20080310.patch   10/Mar/08 22:54   7 kB    Tsz-wo Sze
2423_20080304d.patch  04/Mar/08 22:16   8 kB    Tsz-wo Sze
2423_20080304c.patch  04/Mar/08 20:20   8 kB    Tsz-wo Sze
2423_20080304b.patch  04/Mar/08 19:29   8 kB    Tsz-wo Sze
2423_20080304.patch   04/Mar/08 18:10   5 kB    Tsz-wo Sze
2423_20080303.patch   03/Mar/08 19:13   13 kB   Tsz-wo Sze
2423_20080130.patch   30/Jan/08 23:36   14 kB   Tsz-wo Sze

Issue Links

is related to

HDFS-1832 FSNamesystem#startFileInternal unnecessarily traverses the directory multiple times (Open)

relates to

HDFS-78 Eliminate redundant searches in the namespace directory tree. (Open)

People

Assignee: Tsz-wo Sze

Reporter: Tsz-wo Sze

Votes: 0

Watchers: 0

Dates

Created: 13/Dec/07 23:45

Updated: 12/Apr/11 23:38

Resolved: 13/Mar/08 19:51