Issue Details (XML | Word | Printable)

Key: HADOOP-5193
Type: Bug Bug
Status: Closed Closed
Resolution: Fixed
Priority: Major Major
Assignee: Konstantin Shvachko
Reporter: Konstantin Shvachko
Votes: 0
Watchers: 1
Operations

If you were logged in you would be able to see more operations.
Hadoop Common

SecondaryNameNode does not rollImage because of incorrect calculation of edits modification time.

Created: 06/Feb/09 07:21 PM   Updated: 08/Jul/09 04:43 PM
Return to search
Component/s: None
Affects Version/s: 0.19.0
Fix Version/s: 0.19.1

Time Tracking:
Not Specified

File Attachments:
  Size
Text File Licensed for inclusion in ASF works EditsLength.patch 2009-02-06 07:32 PM Konstantin Shvachko 2 kB

Hadoop Flags: Reviewed
Resolution Date: 08/Feb/09 09:45 PM


 Description  « Hide
Secondary name-node cannot complete the second phase of the checkpoint because getFsEditsTime() returns the mod time of edits.new rather than edits file.
The difference is that edits remains unchanged during the whole checkpoint process an therefore can serve as an invariant. On the contrary edits.new is changing all the time since it is the target of the edits log during checkpoint. So comparison of the mod time of edits.new before and after checkpoint fail and name-node does not upload new image file from the secondary node and does not truncate edits files.

 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
Konstantin Shvachko added a comment - 06/Feb/09 07:32 PM
This changes calculation of the edits length.
I also removed length() method from{{ EditLogOutputStream}}, because this was the only use case for it.

Boris Shkolnik added a comment - 06/Feb/09 08:14 PM
+1, looks good

note. you actually removed lastModified() and not length().


Konstantin Shvachko added a comment - 06/Feb/09 08:17 PM
Yes I meant lastModified() not length(). Thanks.

Hadoop QA added a comment - 07/Feb/09 03:04 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12399671/EditsLength.patch
against trunk revision 741762.

+1 @author. The patch does not contain any @author tags.

-1 tests included. The patch doesn't appear to include any new or modified tests.
Please justify why no tests are needed for this patch.

+1 javadoc. The javadoc tool did not generate any warning messages.

+1 javac. The applied patch does not increase the total number of javac compiler warnings.

+1 findbugs. The patch does not introduce any new Findbugs warnings.

+1 Eclipse classpath. The patch retains Eclipse classpath integrity.

+1 release audit. The applied patch does not increase the total number of release audit warnings.

-1 core tests. The patch failed core unit tests.

-1 contrib tests. The patch failed contrib unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3809/testReport/
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3809/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3809/artifact/trunk/build/test/checkstyle-errors.html
Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3809/console

This message is automatically generated.


Konstantin Shvachko added a comment - 08/Feb/09 09:14 PM
TestAgentConfig.testInitAdaptors_vs_Checkpoint fails because of HADOOP-5172.
TestTaskLimits timed out. Probably one of those cases that some previous test did not stop some servers.
Anyway it passes in my own builds and it has nothing to do with SecondaryNameNode at all.

Konstantin Shvachko added a comment - 08/Feb/09 09:45 PM
I just committed this.

Nigel Daley added a comment - 09/Feb/09 03:33 AM

-1 tests included. The patch doesn't appear to include any new or modified tests.
Please justify why no tests are needed for this patch.

Konstantin, why no regression test?

Boris, why didn't you ask about this in the code review? Our checklist is here: http://wiki.apache.org/hadoop/CodeReviewChecklist


Hudson added a comment - 16/Feb/09 05:00 PM