|
Abdul Qadeer made changes - 23/Aug/08 03:40 AM
Abdul Qadeer made changes - 23/Aug/08 03:46 AM
Abdul Qadeer made changes - 23/Aug/08 03:49 AM
Abdul Qadeer made changes - 23/Aug/08 03:50 AM
Abdul Qadeer made changes - 23/Aug/08 03:51 AM
[
Permlink
| « Hide
]
Abdul Qadeer added a comment - 22/Nov/08 07:58 AM
Patch to add bzip2 splitting support.
Abdul Qadeer made changes - 22/Nov/08 07:58 AM
Abdul Qadeer made changes - 22/Nov/08 07:59 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12394477/Hadoop-4012-version1.patch against trunk revision 719787. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 9 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. -1 findbugs. The patch appears to introduce 2 new Findbugs warnings. +1 Eclipse classpath. The patch retains Eclipse classpath integrity. -1 core tests. The patch failed core unit tests. +1 contrib tests. The patch passed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3639/testReport/ This message is automatically generated. I looked at the test failures and they do not seem to be related to this
patch (they don't even run the fair scheduler).
Abdul Qadeer made changes - 24/Nov/08 02:09 AM
The following change was done in this new patch. Before this change, getPos()
was returning values one less than what it should be. Similarly available() method was returning -1 because the value of count becomes -1 at the end of the chunk. I hope it will resolve at least two of the 4 bugs. @Override @Override
Abdul Qadeer made changes - 24/Nov/08 02:16 AM
Abdul Qadeer made changes - 24/Nov/08 02:16 AM
Looks like Hudson is stuck on my patch. Should I cancel patch and re-submit?
Since Hudson queue seems stuck, I am canceling this patch.
Abdul Qadeer made changes - 25/Nov/08 08:28 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12394526/Hadoop-4012-version2.patch against trunk revision 720162. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 9 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs warnings. +1 Eclipse classpath. The patch retains Eclipse classpath integrity. -1 core tests. The patch failed core unit tests. +1 contrib tests. The patch passed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3644/testReport/ This message is automatically generated.
Abdul Qadeer made changes - 26/Nov/08 02:29 AM
Abdul Qadeer made changes - 26/Nov/08 02:30 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12394714/Hadoop-4012-version3.patch against trunk revision 720930. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 9 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. -1 findbugs. The patch appears to introduce 2 new Findbugs warnings. +1 Eclipse classpath. The patch retains Eclipse classpath integrity. +1 core tests. The patch passed core unit tests. +1 contrib tests. The patch passed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3663/testReport/ This message is automatically generated. This patch fixes the warning found by findbug
Abdul Qadeer made changes - 28/Nov/08 05:45 AM
Abdul Qadeer made changes - 28/Nov/08 05:45 AM
Abdul Qadeer made changes - 28/Nov/08 05:46 AM
Chris Douglas made changes - 30/Nov/08 02:46 AM
Should this change be part of a separate issue, then? I'm not sure what you mean by "two of the 4 bugs", but bug fixes shouldn't be part of large, new features if the fix is unaffected by the feature.
This affects too many core components to make the feature freeze for 0.20 (Fri).
Chris Douglas made changes - 04/Dec/08 04:22 AM
Ah, I see. Sorry, I hadn't understood.
Marking this issue as depending on or blocked by the related issues, and raising them as PA again, is better than creating a composite patch. If you feel you've addressed the points made by the commenter, it's appropriate to resubmit. There's a lot of traffic, and some responses will inevitably be overlooked.
LZO used to be a counterexample, but it has since been removed ( Thanks for your patience with this. I like the idea of avoiding seeking back 1 byte in LineRecordReader. With this change, each mapper will usually open only 2 data blocks, while the old approach makes each mapper open 3 data blocks. Considering all the costs about it (opening TCP connection to the remote data node, disk seek to read 1 byte, etc) this seems to be a reasonable improvement.
Chris Douglas:
I understand that any change to e.g. LineRecordReader (LRR) should be carried out with care because it the most frequently used code. It might not have been impossible to implement bzip2 codecs with the older implementation of LRR but with performance penalty for bzip2 (because it then had to decompress one more block per mapper) and the code in the LRR constructor and the next() method would have been a mess. HADOOP-4010 was separately created before finishing bzip2 work but unfortunately that couldn't get in. Owen O'Malley mentioned some concerns , to which I provided some logical hand waiving. I guess Owen has been busy or is not convinced with those arguments but he didn't said anything. I plan to ask him again that should I provide some kind of test cases just to make sure that his concerns are addressed. So going systematically I first plan to get 4010 approved and then move on by doing recommended changes. I have successfully used the bzip2 codecs for a binary compressed bzip2 files. So I hope by going step by step, I will be able to get this work approved. This patch incorporates changes suggested by Chris Douglas.
Abdul Qadeer made changes - 31/Mar/09 02:39 PM
Suhas Gogate made changes - 31/Mar/09 07:35 PM
support for concatenated bzip2 compressed file. Throw error until this new feature supported to let user know about incompatible input.
Suhas Gogate made changes - 31/Mar/09 07:35 PM
Now BZip2 can be invoked in either of the two modes;
CONTINOUS BYBLOCK is the mode which makes it possible to work with Hadoop Split input. And it should work fine for concatenation of many BZip2 compressed files as well. Otherwise in normal old mode (i.e CONTINOUS) its behavior is the same as it used to be. Currently the default mode is set to BYBLOCK. So problem mentioned in HADOOP-5601 should be solved.
Abdul Qadeer made changes - 02/Apr/09 01:28 PM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12404241/Hadoop-4012-version5.patch against trunk revision 761082. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. -1 findbugs. The patch appears to introduce 1 new Findbugs warnings. +1 Eclipse classpath. The patch retains Eclipse classpath integrity. -1 release audit. The applied patch generated 663 release audit warnings (more than the trunk's current 661 warnings). -1 core tests. The patch failed core unit tests. +1 contrib tests. The patch passed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-minerva.apache.org/100/testReport/ This message is automatically generated. Hudson's complaints about findbugs and release audit warnings are taken care of in this new patch.
Abdul Qadeer made changes - 06/Apr/09 11:09 AM
Abdul Qadeer made changes - 06/Apr/09 11:15 AM
Abdul Qadeer made changes - 06/Apr/09 11:16 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12404711/Hadoop-4012-version6.patch against trunk revision 762216. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs warnings. +1 Eclipse classpath. The patch retains Eclipse classpath integrity. -1 release audit. The applied patch generated 662 release audit warnings (more than the trunk's current 659 warnings). -1 core tests. The patch failed core unit tests. +1 contrib tests. The patch passed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/154/testReport/ This message is automatically generated.
Abdul Qadeer made changes - 16/Apr/09 11:31 AM
One of my test case, which makes different length data, compresses and decompresses at different split points depends on
Abdul Qadeer made changes - 14/May/09 12:27 PM
Abdul Qadeer made changes - 15/May/09 06:22 AM
Patch was no appearing in Hudson queue so I am trying to re-submit the patch.
Abdul Qadeer made changes - 15/May/09 06:23 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12408129/Hadoop-4012-version7.patch against trunk revision 776148. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. -1 findbugs. The patch appears to introduce 3 new Findbugs warnings. +1 Eclipse classpath. The patch retains Eclipse classpath integrity. -1 release audit. The applied patch generated 494 release audit warnings (more than the trunk's current 486 warnings). -1 core tests. The patch failed core unit tests. -1 contrib tests. The patch failed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/356/testReport/ This message is automatically generated. Looks like there is some problem with Hudson, atleast as far as core and contrib test are concerned. If you see Hudson queue (http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/
http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/362/#showFailuresLink http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/361/#showFailuresLink http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/360/#showFailuresLink http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/358/#showFailuresLink Similarly though Hudson complains about test failures but does not list the failed tests (like in this JIRA's case and there are many others) I will test my patch on a local box and will post the results here.
Johan Oskarsson made changes - 22/May/09 01:49 PM
Abdul Qadeer made changes - 27/May/09 06:19 AM
This patch fixes audit release and find bug warnings.
Abdul Qadeer made changes - 27/May/09 06:20 AM
Abdul Qadeer made changes - 27/May/09 06:21 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12409127/Hadoop-4012-version8.patch against trunk revision 779807. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs warnings. +1 Eclipse classpath. The patch retains Eclipse classpath integrity. -1 release audit. The applied patch generated 501 release audit warnings (more than the trunk's current 493 warnings). -1 core tests. The patch failed core unit tests. -1 contrib tests. The patch failed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/422/testReport/ This message is automatically generated. OK, it looks like this patch is in reasonable shape now. The failing tests seem to be the ones failing for everybody else.
We've been using an older version of the patch for some time now with a custom input format and have had consistent results. Not to mention the speedup due to parallelism. Can one of the committers please have a look? 1) Patch to fix audit release warnings.
Abdul Qadeer made changes - 01/Jun/09 11:08 AM
Abdul Qadeer made changes - 02/Jun/09 08:55 AM
Abdul Qadeer made changes - 02/Jun/09 08:55 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12409554/Hadoop-4012-version9.patch against trunk revision 781602. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs warnings. +1 Eclipse classpath. The patch retains Eclipse classpath integrity. -1 release audit. The applied patch generated 500 release audit warnings (more than the trunk's current 492 warnings). -1 core tests. The patch failed core unit tests. -1 contrib tests. The patch failed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/459/testReport/ This message is automatically generated. (1) Running test-patch target on my local box does not produce release audit warnings:
[exec] (2) Test org.apache.hadoop.hdfs.server.datanode.TestBlockReplacement.testBlockReplacement does not fail on my local box and These 1 or 2 errors are occurring for other patches as well on Hudson. Additionally they are un-related as far as bzip2 is concerned. Can the Hadoop committers please review the patch so that I can complete this work. Thank you.
Chris Douglas made changes - 17/Jun/09 09:48 PM
Abdul, I think at this point it's OK to assume that Chris has accepted your explanation. Can you please post your latest patch with all the fixes. Beware that repository has changed: you'll need to do switch to
http://svn.apache.org/repos/asf/hadoop/common/trunk/ Guys, can we please wrap this up? It's taking way too long for this patch to make it. Please. Thanks. -1 to changing LineRecordReader. In particular, you've undone changes that were made by other jiras. This is very very touchy code that the current version of this patch breaks.
I really think that this is better done by using a separate BzipTextInputFormat and BzipLineRecordReader.
Can you please be more specific: what changes were undone? Can you please elaborate on what exactly this patch breaks (perhaps we need more tests?)? The code indeed is very heavily used, but I think it's also very well beaten upon by various tests.
I think the point of having splitting built in - is that all readers/formats can avoid re-implementing common things. We are currently using a binary format reader that works just fine with this patch with only minor changes. Moreover, the idea is to work out a common framework where other block-compressed formats could be processed in a similar manner. The alternative that you're suggesting is to have BzipTextInputFormat, XXXBlockCompressInputFormat, YYYBlockCompressInputFormat, and so on.
It was the feature from
This patch only seems to support TextInputFormat and Bzip. I don't see abstractions that would make this portable to other codecs or input formats. Maybe combining your binary format with BzipTextInputFormat makes sense? For binary files, SequenceFiles is already splittable with bzip and no changes. I totally agree with Owen that LineRecordReader (LRR) is a heavily used code in Hadoop and changes to such a code should be very carefully done. But that doesn't mean that, LRR code is closed for any improvements and feature enhancements. This LRR deals with Text data and Hadoop uses it as a default because probably Text is the most frequently used kind on input. I think same is true for BZip2 compressed files. If not all, most of them are Text compressed data. This patch provides the following feature to the end user.
User puts his BZip2 compressed text files in an input directory and submits the Hadoop job. Soon he gets the result in the output directory. That is it! He hadn't had to write or mention any input format Now comes the correctness of LRR code. testFormat() test case in org.apache.hadoop.mapred.TestTextInputFormat is a very stringent test case to ensure the correctness of LRR. And since this test case is frequently run whenever someone submits and tests a patch, so I dont think that any correctness problem in LRR can escape. Now comes the things like ........................................................ If we make a separate reader for BZip2 e.g BZip2LineRecordReader, in that class most of the code will be the replica of LRR. And doing that way, GZip support should also come out of LRR and there will be a GZipLineRecordReader, again mostly the code a replica of LRR. Whenever there comes a new codec in Hadoop we will make a new reader whose 99% code is the replica of LRR. So in my view it makes more sense to handle Line related text (be it plain or compressed) in LRR. When we were adding splitting support for BZip2, we felt that there might be situations when codecs want to change/manipulate the split start or end. So we added support for that. Similarly instead LRR counting read bytes itself, it now asks the stream about the position. This feature makes it possible for a codec to manipulate indirectly that how long a reader should keep on reading. We think these features can help for other block based codecs to do splitting easily without further changes in LRR. So in summary this patch might need relatively stringent reviewing by the committers due to its heavy usage but it does add useful functionality in a seamless way for the end user. I am making a new patch to accommodate concerns raised by Chris and Owen. Few files are in the "common" sub-project while the rest are in the "mapreduce". Since the older Hadoop project has been split into 3 sub-projects, how should I make the new patch?
Should I make two patches or merge them to make one?
Abdul Qadeer made changes - 04/Aug/09 10:19 AM
The older patch has been split into two (due to the new Hadoop sub-projects: common, mapreduce and HDFS). This patch has the work related to common while the portion going into MapReduce will be added soon on a separate MapReduce JIRA.
Abdul Qadeer made changes - 04/Aug/09 10:23 AM
Abdul Qadeer made changes - 04/Aug/09 10:24 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12415479/Hadoop-4012-version10.patch against trunk revision 800919. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. -1 javadoc. The javadoc tool appears to have generated 1 warning messages. -1 javac. The applied patch generated 118 javac compiler warnings (more than the trunk's current 116 warnings). +1 findbugs. The patch does not introduce any new Findbugs warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. +1 core tests. The patch passed core unit tests. +1 contrib tests. The patch passed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/591/testReport/ This message is automatically generated. Fixes javadoc and javac warnings.
Abdul Qadeer made changes - 06/Aug/09 05:49 PM
Abdul Qadeer made changes - 06/Aug/09 07:41 PM
The MapReduce related stuff from the original
Abdul Qadeer made changes - 08/Aug/09 11:00 AM
Re-submitting because the patch is not appearing in the Hudson queue.
Abdul Qadeer made changes - 08/Aug/09 11:02 AM
I am out of office until Aug 14th. I will be checking my email
intermittently. If this is urgent then please call my cell phone, otherwise I will reply to your email when I get back. Thanks for your patience, – amr +1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12415766/Hadoop-4012-version11.patch against trunk revision 802224. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. +1 core tests. The patch passed core unit tests. +1 contrib tests. The patch passed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/594/testReport/ This message is automatically generated. This is a reminder for the Hadoop committers that
Gentle Hadoop folk,
may I ask what we are waiting for while not committing this patch? If somebody is unhappy with this patch, it would be good to know their concern(s); likewise, if there are some internal discussions going on, it would be wonderful if somebody occasionally filled us in on that. This patch has been in the queue for a long time and now that Hudson is finally happy with it (and I know for various reasons it is hard to satisfy I think by delaying it, we're risking running into conflicts yet again and creating unnecessary work for ourselves. Thank you! Fair point.
I will try to complete a review this weekend. This is what we were looking for. Thank you.
Chris Douglas made changes - 30/Aug/09 09:47 PM
Does the code in LineRecordReader is executed by multiple threads? I found some discussion about it here http://issues.apache.org/jira/browse/HADOOP-3554 Even if it is run by multiple threads, I am not able to see the race conditions because start / end is changed only once in the constructor (I am assuming that LineRecordReader constructor is not called by multiple threads simultaneously) The only problem I see is that LineReacordReader should not forget to call getStart() method after calling createInputStream() method. Or saying another way, any one using SplittableCompressionCodec must call getStart() / get End() methods after creating the stream. So I am little confused about the race condition comments. Can you please help me understand? Thanks. True, there isn't threading issue with LineRecordReader. However, this is the only codec with state, SplittableCompressionCodec won't just be used in LineRecordReader and BZip2Codec may not be the last splittable codec. Though I don't think most codecs are accessed by multiple threads, synchronization is cheap, and the first user to implement a splittable reader with a shared codec will almost certainly cargo-cult this implementation and miss the race condition. If you'd rather leave it this way, then OK.
There was another part of the patch, in BZip2Codec::createInputStream: + if(in.getPos() <= start){
+ ((Seekable)seekableIn).seek(start);
+ in = this.createInputStream(seekableIn, readMode);
+
+
+ }
This drops the stream it created above, which wrapped the stream passed in with a CBZip2InputStream and BufferedInputStream. It's not clear why the stream is being re-created, either... particularly since the start stored in the codec is left alone. What case is being handled here? I tried a version of this using a supertype of CompressionInputStream instead of the semantics tried so far (voiding the synchronization discussion). It doesn't incorporate the other changes discussed.
Chris Douglas made changes - 02/Sep/09 12:06 AM
The reason to re-create the stream for the case when in.getPos() <= start is to tackle the cases like the following: Assume [BBBBBB] represents a BZip2 maker and d is a single compressed data element (this can happen There is some extra information at the start of stream i.e. BZ0h ^ indicates where currently the stream is: [BZh0BBBBBB]d[BBBBBB]d[BBBBBB]d[BBBBBB] _______________________________ ^ I go back 10 bytes in the stream before finding a marker. The reason is that the first 'maker' is 10 bytes long, all others are 6 bytes long. So after going backwards the stream position is as follows: [BZh0BBBBBB]d[BBBBBB]d[BBBBBB]d[BBBBBB] __________________ ^ Now finding next marker might align us with the wrong marker as follows: [BZh0BBBBBB]d[BBBBBB]d[BBBBBB]d[BBBBBB] ______________________ ^ So for such cases the code mentioned above works. But you rightly mentioned that I should have done this.start = start at the end of above code as well.
The new version looks fine to me. Let me incorporate the other changes you mentioned in it and to put the new patch on the JIRA Abdul, a comment explaining that case (or summarizing it and pointing at the JIRA) seems important, rather than just creating the stream with no motiviation.
All the issues discussed have been resolved and merged into Chris' changes in C4012-12.patch except making
BZip2InputStream::updateReportedByteCount (...) method protected. This method is called from private static class BZip2CompressionInputStream extends SplitCompressionInputStream So I have left it public. The corresponding patch in
Abdul Qadeer made changes - 03/Sep/09 09:04 AM
Abdul Qadeer made changes - 05/Sep/09 06:16 AM
+1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12418490/C4012-13.patch against trunk revision 811572. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. +1 core tests. The patch passed core unit tests. +1 contrib tests. The patch passed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-h4.grid.sp2.yahoo.net/18/testReport/ This message is automatically generated. Added a unit test for LineReader/SplittableCompressionCodec to TestCodec.
Chris Douglas made changes - 08/Sep/09 05:13 AM
Chris Douglas made changes - 08/Sep/09 05:13 AM
Chris Douglas made changes - 08/Sep/09 05:13 AM
It should be noted that "Spli tt ableCompressionCodec" chooses a different spelling than FileInputFormat::isSpli t able. Popular search engines return more results for the former than the latter, but if anyone cares about consistency on this point then now's the time to object.
+1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12418879/C4012-14.patch against trunk revision 812317. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. +1 core tests. The patch passed core unit tests. +1 contrib tests. The patch passed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-h4.grid.sp2.yahoo.net/23/testReport/ This message is automatically generated. +1
I committed this. Thanks, Abdul!
Chris Douglas made changes - 10/Sep/09 08:52 PM
Integrated in Hadoop-Common-trunk-Commit #25 (See http://hudson.zones.apache.org/hudson/job/Hadoop-Common-trunk-Commit/25/
. Provide splitting support for bzip2 compressed files. Contributed by Abdul Qadeer Integrated in Hadoop-Mapreduce-trunk-Commit #29 (See http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk-Commit/29/
. Provide splitting support for bzip2 compressed files. Contributed by Abdul Qadeer Integrated in Hadoop-Hdfs-trunk-Commit #26 (See http://hudson.zones.apache.org/hudson/job/Hadoop-Hdfs-trunk-Commit/26/
. Provide splitting support for bzip2 compressed files. Contributed by Abdul Qadeer Integrated in Hadoop-Common-trunk #92 (See http://hudson.zones.apache.org/hudson/job/Hadoop-Common-trunk/92/
. Provide splitting support for bzip2 compressed files. Contributed by Abdul Qadeer Integrated in Hadoop-Hdfs-trunk #80 (See http://hudson.zones.apache.org/hudson/job/Hadoop-Hdfs-trunk/80/
. Provide splitting support for bzip2 compressed files. Contributed by Abdul Qadeer Integrated in Hdfs-Patch-h5.grid.sp2.yahoo.net #26 (See http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/26/
Integrated in Hdfs-Patch-h2.grid.sp2.yahoo.net #6 (See http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h2.grid.sp2.yahoo.net/6/
. Provide splitting support for bzip2 compressed files. Contributed by Abdul Qadeer Editorial pass over all release notes prior to publication of 0.21.
Robert Chansler made changes - 08/Oct/09 04:04 AM
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||