Hadoop Common / HADOOP-437

support gzip input files


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.6.0
    • Component/s: None
    • Labels: None

    Description

      To specify that line-oriented input is in gzip format, pass:
      -jobconf stream.recordreader.compression=gzip

      Effect:
      1. Each FileSplit is forced to cover a whole file, since a gzip stream cannot be decompressed starting from an arbitrary offset.
      2. A GZIPInputStream filter is applied in the RecordReader.
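
The second effect can be illustrated with a minimal, standalone Java sketch. It is not the Hadoop streaming RecordReader; the class name GzipLineReaderSketch and its methods gzip and readLines are hypothetical names chosen for illustration. It only shows the general idea of wrapping the raw byte stream of one whole file in a java.util.zip.GZIPInputStream before reading it line by line.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical illustration only; not the actual Hadoop streaming code.
public class GzipLineReaderSketch {

    // Compress text in memory so the example needs no input file.
    static byte[] gzip(String text) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (GZIPOutputStream out = new GZIPOutputStream(buffer)) {
            out.write(text.getBytes(StandardCharsets.UTF_8));
        }
        return buffer.toByteArray();
    }

    // Apply a GZIPInputStream filter to the raw stream of one whole file,
    // then read the decompressed bytes as line-oriented records.
    static List<String> readLines(InputStream rawWholeFile) throws IOException {
        List<String> lines = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(new InputStreamReader(
                new GZIPInputStream(rawWholeFile), StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                lines.add(line);
            }
        }
        return lines;
    }

    public static void main(String[] args) throws IOException {
        byte[] compressed = gzip("first record\nsecond record\n");
        for (String line : readLines(new ByteArrayInputStream(compressed))) {
            System.out.println(line);
        }
    }
}
```

The reader starts at the first byte of the compressed data, which is why the first effect (one split per file) is needed: the decompressor must see the stream from its beginning.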

          People

            Assignee: Unassigned
            Reporter: Michel Tourn (michel_tourn)