Details
- Improvement
- Status: Closed
- Major
- Resolution: Fixed
- 0.23.1
- None
- Reviewed
- bzip2 codec native-code
Description
The bzip2 codec supplied with Hadoop is currently available only as a Java implementation. A version that uses the system bzip2 library can provide improved performance and a smaller memory footprint. It would also make it feasible to use alternative bzip2 libraries that may perform better for specific jobs.