Hadoop Common / HADOOP-8900

BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1-win, 2.0.1-alpha
    • Fix Version/s: 1.2.0, 2.0.3-alpha
    • Component/s: None
    • Labels:
      None
    • Environment:

      Encountered failure when processing large GZIP file

    • Hadoop Flags:
      Reviewed

      Description

      Encountered failure when processing large GZIP file
      • Gz: Failed after 1 hr, 13 min, 57 sec with the error:
      java.io.IOException: IO error in map input file hdfs://localhost:9000/Halo4/json_m/gz/NewFileCat.txt.gz
      at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:242)
      at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.next(MapTask.java:216)
      at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
      at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:435)
      at org.apache.hadoop.mapred.MapTask.run(MapTask.java:371)
      at org.apache.hadoop.mapred.Child$4.run(Child.java:266)
      at java.security.AccessController.doPrivileged(Native Method)
      at javax.security.auth.Subject.doAs(Subject.java:415)
      at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
      at org.apache.hadoop.mapred.Child.main(Child.java:260)
      Caused by: java.io.IOException: stored gzip size doesn't match decompressed size
      at org.apache.hadoop.io.compress.zlib.BuiltInGzipDecompressor.executeTrailerState(BuiltInGzipDecompressor.java:389)
      at org.apache.hadoop.io.compress.zlib.BuiltInGzipDecompressor.decompress(BuiltInGzipDecompressor.java:224)
      at org.apache.hadoop.io.compress.DecompressorStream.decompress(DecompressorStream.java:82)
      at org.apache.hadoop.io.compress.DecompressorStream.read(DecompressorStream.java:76)
      at java.io.InputStream.read(InputStream.java:102)
      at org.apache.hadoop.util.LineReader.readLine(LineReader.java:134)
      at org.apache.hadoop.mapred.LineRecordReader.next(LineRecordReader.java:136)
      at org.apache.hadoop.mapred.LineRecordReader.next(LineRecordReader.java:40)
      at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:66)
      at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:32)
      at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.next(HiveContextAwareRecordReader.java:67)
      at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:236)
      ... 9 more
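
A possible explanation, not confirmed by the report itself: RFC 1952 defines the gzip trailer's ISIZE field as the uncompressed size modulo 2^32, so a check that compares ISIZE against a full 64-bit byte count would fail for any stream that inflates to 4 GiB or more. The sketch below illustrates only that arithmetic. The class and method names are hypothetical and are not taken from BuiltInGzipDecompressor or from the attached patches.

```java
public class GzipTrailerCheck {

    /** Naive check: treats ISIZE as if it were the full decompressed length. */
    public static boolean naiveMatch(long storedIsize, long bytesWritten) {
        return storedIsize == bytesWritten;
    }

    /** RFC 1952 check: ISIZE holds the decompressed length modulo 2^32. */
    public static boolean moduloMatch(long storedIsize, long bytesWritten) {
        return storedIsize == (bytesWritten & 0xffffffffL);
    }

    public static void main(String[] args) {
        // Hypothetical input: 5 GiB of decompressed data.
        long bytesWritten = 5L * 1024 * 1024 * 1024;
        // Value a gzip writer would store in the 32-bit ISIZE trailer field.
        long storedIsize = bytesWritten & 0xffffffffL;

        System.out.println(naiveMatch(storedIsize, bytesWritten));  // false
        System.out.println(moduloMatch(storedIsize, bytesWritten)); // true
    }
}
```

For inputs smaller than 4 GiB both checks agree, which would be consistent with the failure showing up only on a large file.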

      1. hadoop8900.txt
        4 kB
        Andy Isaacson
      2. BuiltInGzipDecompressor2.patch
        1 kB
        Slavik Krassovsky
      3. hadoop8900-2.txt
        5 kB
        Andy Isaacson
      4. hadoop-8900.branch-1.patch
        4 kB
        Suresh Srinivas

            People

            • Assignee:
              Andy Isaacson
            • Reporter:
              Slavik Krassovsky
            • Votes:
              0
            • Watchers:
              8
