Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-872

Tika --extract fails for RTF

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Duplicate
    • 1.0
    • None
    • general
    • None
    • Windows 7 with Java v1.6

    Description

      A file that is embedded in an RTF file doesn't get extracted to disk.

      To "embed" a file into an RTF, simply drag-drop it into an RTF document when using MS-Word 2010. It will then create an EMF of the embedded file's preview.

      See attached file "embedded.rtf.zip" for an example input file that fails with Tika v1.0.

      Attachments

        1. embedded.rtf.zip
          623 kB
          Albert L.

        Issue Links

          Activity

            People

              Unassigned Unassigned
              albertlaw Albert L.
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: