|
[
Permlink
| « Hide
]
Robert Chansler added a comment - 22/Jan/08 07:00 PM
Promoted for consideration in 16.
This is a new feature. Feature freeze for 0.16 has past. Assigning this to 0.17.
I'm disappointed this didn't go in for Hadoop 0.16, despite having been set as target for 0.16. I understand that this is a pretty big change, but I really want to be sure we get something for Hadoop 0.17.
While we wait for this, any old Hadoop DFS' with corrupted files will need to sit, waiting for their owners to have a way to retrieve the files. For that time fsck will always return corrupt. The inability to do anything with these files (except delete them) could be masking us from being able to detect other hadoop issues. There is a hack : you can delete (or better save) .meta file for a corrupted block and try to read the file. It should work.
I don't think this is very large feature. How about adding an option "-f" to "fs -get" so that it forces getting the corrupted files?
2063_20080220.patch: added a junit test.
Initial Comments:
Thanks, Raghu!
> I think it might not be useful to sent it to stdout
hmm.. it is just file data right? > The checksum feature is not available for some other FileSystem. So I am not sure whether we should make it as a FileSystem property. I will think about this. Though this is not available in other filesystems, it is used by the generic FsShell layer as an argument to generic command. In that sense the flag might belong in FileSystem. LocalFileSystem can certainly support it. I think it also makes the implementation simpler. 2063_20080221.patch
1,2,5: done +1. looks good.
Minor suggestions:
2063_20080222.patch:
> though getSrcFileSystem is private, it will be nice to have JavaDoc. FsShell.java is read by a lot of developers. > There are a few code clean up changes. Please double check these changes. Do we need extra output in TestDFSShell or was it a debug change? hudson problem? My last submission is still not tested for 3 days. Try again.
+1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12376266/2063_20080222.patch against trunk revision 619744. @author +1. The patch does not contain any @author tags. tests included +1. The patch appears to include 7 new or modified tests. javadoc +1. The javadoc tool did not generate any warning messages. javac +1. The applied patch does not generate any new javac compiler warnings. release audit +1. The applied patch does not generate any new release audit warnings. findbugs +1. The patch does not introduce any new Findbugs warnings. core tests +1. The patch passed core unit tests. contrib tests +1. The patch passed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/1837/testReport/ This message is automatically generated. I just committed this. Thanks Nicholas.
Integrated in Hadoop-trunk #415 (See http://hudson.zones.apache.org/hudson/job/Hadoop-trunk/415/
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||