  1. Jackrabbit Content Repository
  2. JCR-1691

Include the old (OpenOffice.org 1.0) MIME types that OpenOfficeTextExtractor can handle


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version: 1.4
    • Fix Version: 1.5
    • None
    • OS: Linux Debian kernel 2.6.18
      java version "1.5.0_14"

    Description

      The following patch adds the old OpenOffice.org 1.0 MIME types to OpenOfficeTextExtractor, so that the contents of files in those formats are extracted as well.
      I tested it with simple files and it worked here.

      $ cat OpenOfficeTextExtractor-mimetype.patch
      --- jackrabbit-1.4/jackrabbit-text-extractors/src/main/java/org/apache/jackrabbit/extractor/OpenOfficeTextExtractor.java 2007-12-19 12:57:58.000000000 -0200
      +++ jackrabbit-1.4-modified/jackrabbit-text-extractors/src/main/java/org/apache/jackrabbit/extractor/OpenOfficeTextExtractor.java 2008-07-24 15:01:08.000000000 -0300
      @@ -54,7 +54,11 @@
                   "application/vnd.oasis.opendocument.graphics",
                   "application/vnd.oasis.opendocument.presentation",
                   "application/vnd.oasis.opendocument.spreadsheet",
      -            "application/vnd.oasis.opendocument.text"});
      +            "application/vnd.oasis.opendocument.text",
      +            "application/vnd.sun.xml.calc",
      +            "application/vnd.sun.xml.draw",
      +            "application/vnd.sun.xml.impress",
      +            "application/vnd.sun.xml.writer"});
           }
       
       //-------------------------------------------------------< TextExtractor >
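      To illustrate what the patch changes in practice, here is a minimal, self-contained sketch of a content-type check over the MIME types visible in the hunk. It is not the Jackrabbit class: ContentTypeSketch and supports() are hypothetical names, and the normalization (dropping parameters such as "; charset=..." and ignoring case) is an assumption made for the example, not behavior taken from the patch.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

/** Hypothetical stand-in (not the Jackrabbit class): models only the MIME type check. */
final class ContentTypeSketch {

    // The OASIS types visible in the patch context, plus the four
    // OpenOffice.org 1.0 types that the patch adds.
    private static final Set<String> SUPPORTED = Collections.unmodifiableSet(
            new HashSet<String>(Arrays.asList(
                    "application/vnd.oasis.opendocument.graphics",
                    "application/vnd.oasis.opendocument.presentation",
                    "application/vnd.oasis.opendocument.spreadsheet",
                    "application/vnd.oasis.opendocument.text",
                    "application/vnd.sun.xml.calc",      // .sxc
                    "application/vnd.sun.xml.draw",      // .sxd
                    "application/vnd.sun.xml.impress",   // .sxi
                    "application/vnd.sun.xml.writer"))); // .sxw

    private ContentTypeSketch() {
    }

    /**
     * Returns true if the given content type is in the supported list.
     * Assumption of this sketch: parameters after ';' are ignored and
     * the comparison is case-insensitive.
     */
    static boolean supports(String contentType) {
        if (contentType == null) {
            return false;
        }
        int semicolon = contentType.indexOf(';');
        String base = semicolon >= 0 ? contentType.substring(0, semicolon) : contentType;
        return SUPPORTED.contains(base.trim().toLowerCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        // One legacy type added by the patch, and one unrelated type.
        System.out.println(supports("application/vnd.sun.xml.writer"));
        System.out.println(supports("application/msword"));
    }
}
```

      Without the four vnd.sun.xml entries, a check like this would reject .sxw, .sxc, .sxd and .sxi files before any extraction is attempted, which is the gap the patch closes.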

          People

            Assignee: Unassigned
            Reporter: Leslie Tsang