Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-781

Update Tika to v0.6 for the MimeType detection

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.1
    • None
    • None

    Description

      [from annoucement]

      Apache Tika, a subproject of Apache Lucene, is a toolkit for detecting and
      extracting metadata and structured text content from various documents using
      existing parser libraries.

      Apache Tika 0.6 contains a number of improvements and bug fixes. Details can
      be found in the changes file:

      http://www.apache.org/dist/lucene/tika/CHANGES-0.6.txt

      Attachments

        Activity

          People

            jnioche Julien Nioche
            jnioche Julien Nioche
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: