[TIKA-1633] Can't extract .png images from pdf document - ASF JIRA

Details

Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 1.8
Fix Version/s: None
Component/s: server
Labels: None

Description

Hello,
I am running the Tika server with:

java -jar tika-server-1.8.jar

Then, to extract the images from a PDF document, I use:

curl -X PUT -H "Accept: application/zip" -T /home/damiano/html_images.pdf http://localhost:9998/unpack/all > content.zip

In content.zip I only see:

_METADATA_
_TEXT_

and nothing else: no extracted images!
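
For anyone trying to reproduce this, here is a minimal sketch of one way to check whether the archive holds anything besides the two bookkeeping entries. The function name `embedded_entries` is hypothetical and not part of Tika. The pattern assumes the bookkeeping entries are named METADATA and TEXT wrapped in one or more underscores, so it should match the names as shown above even if the real entry names carry a different number of underscores.

```shell
# Hypothetical helper: reads archive entry names on stdin (one per line)
# and prints only those that are not the METADATA/TEXT bookkeeping entries.
# Assumption: bookkeeping entries look like _METADATA_ / _TEXT_, with one
# or more underscores on each side.
embedded_entries() {
    # grep exits non-zero when nothing is left to print; "|| true" keeps
    # that from being treated as a failure under "set -e".
    grep -v -E '^_+(METADATA|TEXT)_+$' || true
}
```

With Info-ZIP's unzip installed, `unzip -Z1 content.zip | embedded_entries` should print the extracted attachments, if any. For the archive described here, it would presumably print nothing.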

People

Assignee: Unassigned

Reporter: Damiano

Votes: 0

Watchers: 2

Dates

Created: 20/May/15 13:58

Updated: 29/May/15 18:36