Issue Details

Key: HADOOP-2585
Type: New Feature
Status: Closed
Resolution: Fixed
Priority: Major
Assignee: Konstantin Shvachko
Reporter: Konstantin Shvachko
Votes: 0
Watchers: 1

Hadoop Common

Automatic namespace recovery from the secondary image.

Created: 12/Jan/08 12:22 AM   Updated: 08/Jul/09 04:42 PM
Component/s: None
Affects Version/s: 0.16.0
Fix Version/s: 0.18.0

Time Tracking:
Not Specified

File Attachments (text files, licensed for inclusion in ASF works):
  File                    Date                 Author               Size
  SecondaryStorage.patch  2008-04-11 10:31 AM  Konstantin Shvachko  68 kB
  SecondaryStorage.patch  2008-04-11 02:46 AM  Konstantin Shvachko  68 kB
  SecondaryStorage.patch  2008-04-02 08:19 PM  Konstantin Shvachko  70 kB
Issue Links:
Incorporates
 

Hadoop Flags: Incompatible change, Reviewed
Release Note:
Improved management of replicas of the namespace image. If all replicas on the name-node are lost, the latest checkpoint can be loaded from the secondary name-node. Use the parameter "-importCheckpoint" and specify the location with "fs.checkpoint.dir". The directory structure on the secondary name-node has changed to match the primary name-node.
Resolution Date: 11/Apr/08 09:19 PM


 Description
Hadoop has three-way (configuration-controlled) protection against losing the namespace image:
  1. the image can be replicated on different hard drives of the same node;
  2. the image can be replicated on an NFS-mounted drive on an independent node;
  3. a stale replica of the image is created during periodic checkpointing and stored on the secondary name-node.

Currently, during startup the name-node examines all configured storage directories, selects the
most up-to-date image, reads it, merges it with the corresponding edits, and writes the new image back
into all storage directories. Everything is done automatically.
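
The startup sequence described above can be sketched roughly as follows. This is an illustration only, not the code in FSImage.java: the classes StorageDirSketch and StartupSketch, the checkpointTime and readable fields, and the "+path" / "-path" edit encoding are all hypothetical stand-ins chosen to keep the example self-contained.

```java
import java.util.List;
import java.util.TreeSet;

// Hypothetical stand-in for one configured storage directory (not a Hadoop class).
final class StorageDirSketch {
    final String path;
    final long checkpointTime; // assumed timestamp of the image stored here
    final boolean readable;    // false models a failed or unmounted drive

    StorageDirSketch(String path, long checkpointTime, boolean readable) {
        this.path = path;
        this.checkpointTime = checkpointTime;
        this.readable = readable;
    }
}

final class StartupSketch {
    // Pick the readable directory holding the newest image; null if none is readable.
    static StorageDirSketch selectLatest(List<StorageDirSketch> dirs) {
        StorageDirSketch latest = null;
        for (StorageDirSketch dir : dirs) {
            if (!dir.readable) {
                continue;
            }
            if (latest == null || dir.checkpointTime > latest.checkpointTime) {
                latest = dir;
            }
        }
        return latest;
    }

    // Replay edits over an image. The image is modeled as a set of paths and
    // each edit as "+path" (create) or "-path" (delete); both are assumptions.
    static TreeSet<String> mergeEdits(Iterable<String> image, List<String> edits) {
        TreeSet<String> merged = new TreeSet<>();
        for (String p : image) {
            merged.add(p);
        }
        for (String edit : edits) {
            String path = edit.substring(1);
            if (edit.charAt(0) == '+') {
                merged.add(path);
            } else {
                merged.remove(path);
            }
        }
        return merged;
    }
}
```

In this model the merged result would then be written back to every configured directory, so that all replicas agree again after startup.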

If, due to multiple hardware failures, none of those images on mounted hard drives (local or remote)
is available, the secondary image, although stale (up to one hour old by default), can still be
used to recover the majority of the file system data.
Currently one can reconstruct a valid name-node image from the secondary one only manually.
It would be nice to support automatic recovery.
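
One way to picture the requested fallback is the decision sketch below. It is a guess at the shape of the logic, not the committed implementation: RecoverySketch, its Source enum, and the chooseSource parameters are hypothetical names, and the rule that the import happens only when explicitly requested is an assumption based on the "-importCheckpoint" parameter mentioned in the release note.

```java
import java.util.List;

// Hypothetical decision helper; none of these names come from Hadoop.
final class RecoverySketch {
    enum Source { PRIMARY, SECONDARY_CHECKPOINT, NONE }

    // primaryReadable: one flag per configured name-node storage directory.
    // checkpointReadable: whether the secondary checkpoint directory can be read.
    // importRequested: assumed to mirror starting the name-node with -importCheckpoint.
    static Source chooseSource(List<Boolean> primaryReadable,
                               boolean checkpointReadable,
                               boolean importRequested) {
        for (boolean ok : primaryReadable) {
            if (ok) {
                return Source.PRIMARY; // a current image exists; no recovery needed
            }
        }
        if (importRequested && checkpointReadable) {
            return Source.SECONDARY_CHECKPOINT; // stale but usable image
        }
        return Source.NONE; // nothing to load; startup cannot proceed
    }
}
```

Any changes made after the last checkpoint would not be present in an image recovered this way, which is why the primary replicas are preferred whenever one is readable.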



 Subversion Commits
Repository Revision Date User Message
ASF #647313 Fri Apr 11 21:18:29 UTC 2008 shv HADOOP-2585. Name-node imports namespace data from a recent checkpoint accessible via a NFS mount. Contributed by Konstantin Shvachko.
Files Changed
MODIFY /hadoop/core/trunk/src/test/org/apache/hadoop/dfs/TestCheckpoint.java
ADD /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/CheckpointSignature.java
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/FSNamesystem.java
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/FSImage.java
MODIFY /hadoop/core/trunk/build.xml
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/FSEditLog.java
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/ClientProtocol.java
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/TransferFsImage.java
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/FSConstants.java
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/io/WritableComparable.java
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/SecondaryNameNode.java
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/GetImageServlet.java
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/FSDirectory.java
MODIFY /hadoop/core/trunk/CHANGES.txt
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/util/StringUtils.java
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/Storage.java
MODIFY /hadoop/core/trunk/conf/hadoop-default.xml
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/conf/Configuration.java
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/NameNode.java

Repository Revision Date User Message
ASF #647338 Fri Apr 11 22:33:26 UTC 2008 shv Increment ClientProtocol.versionID missed by HADOOP-2585. Contributed by Konstantin Shvachko.
Files Changed
MODIFY /hadoop/core/trunk/CHANGES.txt
MODIFY /hadoop/core/trunk/src/java/org/apache/hadoop/dfs/ClientProtocol.java