Tika / TIKA-1030

Page extraction for Word and Excel documents

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Fix
    • Environment: For use with Solr

    Description

      I would like to extract individual pages from Word documents and Excel sheets.

      Reason: I'm using Solr to search files and return page-level hits. For PDF files I used PDFBox for page extraction. Now I would like to index other document types, but I can't find paging support for them.
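
      As a rough illustration of the page-hit workflow described above, the sketch below assumes the per-page text has already been extracted (for PDFs, by a PDFBox loop) and shows one way to turn it into one search record per page. PageRecord, build_page_records, and find_pages are hypothetical names, and the in-memory substring search only stands in for Solr.

```python
from dataclasses import dataclass


@dataclass
class PageRecord:
    """One searchable unit: a single page of a single file (hypothetical schema)."""
    id: str    # unique key built from file name and page number
    file: str
    page: int  # 1-based page number
    text: str


def build_page_records(file_name, page_texts):
    """Turn already-extracted per-page text into one record per page."""
    return [
        PageRecord(id=f"{file_name}#{n}", file=file_name, page=n, text=text)
        for n, text in enumerate(page_texts, start=1)
    ]


def find_pages(records, term):
    """Return (file, page) pairs whose text contains the term, ignoring case."""
    needle = term.lower()
    return [(r.file, r.page) for r in records if needle in r.text.lower()]


# Assumption: these strings came from a per-page extractor; they are made-up sample data.
pages = ["Introduction to Tika", "Solr indexing setup", "Tika and Solr together"]
records = build_page_records("guide.pdf", pages)
hits = find_pages(records, "solr")
```

      The same record shape would apply to Word or Excel input only if a per-page (or per-sheet) extractor existed for those formats, which is what this issue asks for.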

          People

            Assignee: Unassigned
            Reporter: David vandendriessche (davidvdd)
            Votes: 0
            Watchers: 2
