Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-151

Stream compression support

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.2
    • parser
    • None

    Description

      Tika should automatically detect and decode stream compression formats like gzip or bzip2. When parsing, such compression should be mentioned in the resulting metadata (compression=gzip), but should not otherwise affect the result of the parsing. In other words, the extracted text content should be the same regardless of whether the input stream has been compressed or not.

      Attachments

        Activity

          People

            jukkaz Jukka Zitting
            jukkaz Jukka Zitting
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: