Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-779

Hadoop streaming does not work with gzipped input

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.9.0
    • 0.9.1
    • None
    • None

    Description

      When input files are gzipped, StreamLineRecordReader does not take the corect OutputStream to fetch the next record. Instead of using a GzipOutputStream, it uses a FSOutputStream. So input files are read as uncompressed plain text.

      Attachments

        1. GzipIn.patch
          9 kB
          Hairong Kuang

        Activity

          People

            hairong Hairong Kuang
            hairong Hairong Kuang
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: