Details
- Type: Improvement
- Status: Resolved
- Priority: Major
- Resolution: Won't Fix
Description
When running MapReduce with concatenated gzip files as input, only the first part ("member" in gzip spec parlance, http://www.ietf.org/rfc/rfc1952.txt) is read; the remainder is silently ignored. As a first step toward fixing that, this issue will add a configurable option to throw an error in such cases.
MAPREDUCE-469 is the tracker for the more complete fix/feature, whenever that occurs.
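The behavior described above can be reproduced outside Hadoop. The following is a minimal sketch using only Python's standard library, not Hadoop's actual code: `read_first_member`, `fail_on_concatenation`, and `ConcatenatedGzipError` are hypothetical names chosen for illustration. It reads only the first gzip member, as the current input path does, and optionally raises an error when further bytes follow that member.

```python
import gzip
import zlib


class ConcatenatedGzipError(ValueError):
    """Raised when bytes follow the first gzip member (hypothetical name)."""


def read_first_member(data: bytes, fail_on_concatenation: bool = False) -> bytes:
    """Decompress only the first gzip member of `data`.

    With fail_on_concatenation=True (a hypothetical option), raise instead of
    silently dropping whatever follows the first member.
    """
    # wbits=31 makes zlib expect a gzip header and trailer.
    decompressor = zlib.decompressobj(31)
    payload = decompressor.decompress(data) + decompressor.flush()
    # Bytes past the end of the first member are left in unused_data.
    trailing = decompressor.unused_data
    if fail_on_concatenation and trailing:
        raise ConcatenatedGzipError(
            f"{len(trailing)} bytes follow the first gzip member"
        )
    return payload


# Two independently compressed members, concatenated byte for byte.
blob = gzip.compress(b"first\n") + gzip.compress(b"second\n")

# Default mode mirrors the silent truncation: only the first member is returned.
first_only = read_first_member(blob)

# Strict mode surfaces the problem instead of losing data.
try:
    read_first_member(blob, fail_on_concatenation=True)
    strict_raised = False
except ConcatenatedGzipError:
    strict_raised = True
```

For comparison, Python's `gzip.decompress(blob)` reads every member, which corresponds to the more complete fix tracked separately.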
Issue Links
- is related to
  - HADOOP-6335 Support reading of concatenated gzip and bzip2 files (Resolved)
  - PIG-42 Pig should be able to split Gzip files like it can split Bzip files (Resolved)
  - HADOOP-6835 Support concatenated gzip files (Closed)