[HADOOP-1283] Eliminate internal UTF8 to String and vice versa conversions in the name-node. - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.12.0
Fix Version/s: 0.14.0
Component/s: None
Labels:
None

Description

We have internal conversions of those two types inside name-node code. One example:
NameNode.complete(String src, String clientName)
then it calls
FSNamesystem.completeFile(new UTF8(src), new UTF8(clientName));
which in turn finally calls
FSDirectory.addNode(path.toString(), newNode )
and in another place
FSDirectory.getNode(src.toString())

So we have several conversions of the same parameter back and forth during computation.
We should keep the parameter type consistent within different methods.

The question is, which type should be used: String or Text.
From previous discussions I remember that Text is more efficient in space and time for non ASCII
data. Here we mostly deal with file names and network addresses, which are ASCII.
Does it make sense to use Text in this case?

UTF8 is also used as a key in two maps: pendingCreates and leases.
This should be replaced too.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

EliminateUTF8.patch
15/Jun/07 23:32
41 kB
Konstantin Shvachko
EliminateUTF8-2.patch
21/Jun/07 02:27
43 kB
Konstantin Shvachko

Issue Links

is blocked by

HDFS-120 All remaining UTF8 data structures in HDFS code should be removed

Open

Activity

People

Assignee:: Konstantin Shvachko

Reporter:: Konstantin Shvachko

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 21/Apr/07 00:20

Updated:: 08/Jul/09 16:42

Resolved:: 21/Jun/07 18:10