existing Bzip2Codec supported in hadoop 0.19/0.20 skipps the input records when input bzip2 compressed files is made up of concatinating multiple .bz2 files.
Created: 31/Mar/09 07:30 PM
Updated: 12/Sep/09 08:31 AM
Until the Bzip2Codec supports concatenated compressed bzip files as input it should detect it and throw an error to indicate input is not compatible...
Until the Bzip2Codec supports concatenated compressed bzip files as input it should detect it and throw an error to indicate input is not compatible...
(see the related JIRA https://issues.apache.org/jira/browse/HADOOP-5601)