Description
Somehow bzip2 does not work with SequenceFile:
String codec = "org.apache.hadoop.io.compress.BZip2Codec"; SequenceFile.Writer writer = SequenceFile.createWriter(fs, conf, new Path(output), reader.getKeyClass(), reader.getValueClass(), CompressionType.BLOCK, (CompressionCodec)Class.forName(codec).newInstance());
The stack trace is here:
java.lang.UnsupportedOperationException at org.apache.hadoop.io.compress.BZip2Codec.getCompressorType(BZip2Codec.java:80) at org.apache.hadoop.io.compress.CodecPool.getCompressor(CodecPool.java:98) at org.apache.hadoop.io.SequenceFile$Writer.init(SequenceFile.java:914) at org.apache.hadoop.io.SequenceFile$BlockCompressWriter.<init>(SequenceFile.java:1198) at org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:401) at org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:329) at org.apache.hadoop.mapred.TestSequenceFileBZip.main(TestSequenceFileBZip.java:43) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.util.RunJar.main(RunJar.java:165) at org.apache.hadoop.mapred.JobShell.run(JobShell.java:54) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79) at org.apache.hadoop.mapred.JobShell.main(JobShell.java:68)
Attachments
Attachments
Issue Links
- is blocked by
-
HADOOP-5213 BZip2CompressionOutputStream NullPointerException
- Closed
- relates to
-
HADOOP-3646 Providing bzip2 as codec
- Closed