  1. Jackrabbit Content Repository
  2. JCR-1887

msoffice text extractor for office 2007 files

Details

    Description

      I created a patch that provides an MS text extractor (mstextextractor) for Jackrabbit. The patch entirely replaces
      all existing MS extractors and can be applied as soon as poi-3.5 is available. The extractor supports: doc, docx,
      ppt, pptx, xls, xlsx. The patch is not fully tested and uses POI code that is not yet available in the Maven repo
      (POI needs to be built locally).
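
      To make the list of supported formats concrete, here is a minimal sketch of how the six file extensions could be
      mapped to the content types such an extractor would have to accept. This is not code from the attached patch: the
      class name OfficeContentTypes and its methods are hypothetical, and only the MIME type strings are the standard
      registered ones. It uses the JDK only and does not call POI.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, not part of the patch: maps the extensions named in
// this issue to their registered MIME types.
final class OfficeContentTypes {

    // Common prefix of all Office 2007 (OOXML) content types.
    private static final String OOXML_PREFIX =
            "application/vnd.openxmlformats-officedocument.";

    private static final Map<String, String> TYPES = new LinkedHashMap<>();

    static {
        // Binary (Office 97-2003) formats.
        TYPES.put("doc", "application/msword");
        TYPES.put("ppt", "application/vnd.ms-powerpoint");
        TYPES.put("xls", "application/vnd.ms-excel");
        // Office 2007 (OOXML) formats.
        TYPES.put("docx", OOXML_PREFIX + "wordprocessingml.document");
        TYPES.put("pptx", OOXML_PREFIX + "presentationml.presentation");
        TYPES.put("xlsx", OOXML_PREFIX + "spreadsheetml.sheet");
    }

    private OfficeContentTypes() {
    }

    /** Returns the MIME type for a file name, or null if the extension is not supported. */
    static String contentTypeFor(String fileName) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return null; // no extension at all
        }
        String ext = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        return TYPES.get(ext);
    }

    /** Returns true if the content type belongs to an Office 2007 format. */
    static boolean isOoxml(String contentType) {
        return contentType != null && contentType.startsWith(OOXML_PREFIX);
    }

    /** Returns all supported content types in insertion order. */
    static String[] supportedContentTypes() {
        return TYPES.values().toArray(new String[0]);
    }

    public static void main(String[] args) {
        for (String name : new String[] {"a.doc", "b.DOCX", "c.txt"}) {
            System.out.println(name + " -> " + contentTypeFor(name));
        }
    }
}
```

      A real extractor would still need to hand the stream to the matching POI extractor for each type; the sketch only
      covers the lookup step.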

      Attachments

        1. mstextextractor.patch
          6 kB
          Philipp Koch

        Activity

          People

            jukkaz Jukka Zitting
            pkoch Philipp Koch
            Votes: 1
            Watchers: 1
