Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-905

Embedded text boxes and shapes with text not supported

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • 1.0
    • 1.2
    • parser
    • Windows 7

    Description

      This is similar to TIKA-904 but for normal word processing documents. In those, text contained in text boxes and shapes is not extracted.

      Attachments

        1. testPagesEmbeddedJIRA.pages
          1.01 MB
          Gabriel Valencia

        Activity

          People

            Unassigned Unassigned
            gvalenc Gabriel Valencia
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: