Details
- Bug
- Status: Resolved
- Major
- Resolution: Implemented
- 1.4.1
- None
Description
Working with the latest Solr trunk code, it seems that the Tika handlers for Solr Cell (ExtractingDocumentLoader.java) and the Data Import Handler (TikaEntityProcessor.java) again fail to index the contents of zip files.
Only the file names inside the archive are indexed.
This issue was addressed late last year but seems to have reappeared with the latest code.
The Jira issue for the Data Import Handler part, with the patch and the test case, is https://issues.apache.org/jira/browse/SOLR-2332.
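To make the symptom concrete, here is a minimal sketch of the difference between indexing only entry names and indexing entry contents. It uses only java.util.zip and is not Solr or Tika code; the class name ZipSymptomSketch, the entry names, and the entry bodies are all made up for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Illustration only: hypothetical class, not part of Solr or Tika.
class ZipSymptomSketch {

    // Builds a small in-memory zip with two made-up text entries.
    static byte[] buildZip() {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            zip.putNextEntry(new ZipEntry("a.txt"));
            zip.write("alpha".getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
            zip.putNextEntry(new ZipEntry("b.txt"));
            zip.write("beta".getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    // Mirrors the reported behavior: only the entry names are collected.
    static List<String> namesOnly(byte[] zipBytes) {
        List<String> out = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(new ByteArrayInputStream(zipBytes))) {
            for (ZipEntry e = zip.getNextEntry(); e != null; e = zip.getNextEntry()) {
                out.add(e.getName());
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out;
    }

    // Mirrors the expected behavior: each entry's body is read as well.
    static List<String> namesAndContents(byte[] zipBytes) {
        List<String> out = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(new ByteArrayInputStream(zipBytes))) {
            byte[] buf = new byte[4096];
            for (ZipEntry e = zip.getNextEntry(); e != null; e = zip.getNextEntry()) {
                // read() returns -1 at the end of the current entry, not the whole archive.
                ByteArrayOutputStream body = new ByteArrayOutputStream();
                for (int n = zip.read(buf); n != -1; n = zip.read(buf)) {
                    body.write(buf, 0, n);
                }
                out.add(e.getName() + ": " + new String(body.toByteArray(), StandardCharsets.UTF_8));
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out;
    }
}
```

A search over documents built from namesOnly would match "a.txt" but not "alpha", which is the behavior described above; documents built from namesAndContents would match both.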
Attachments
Issue Links
- relates to
  - SOLR-2332 TikaEntityProcessor retrieves only File Names from Zip extraction (Resolved)
  - SOLR-10335 Upgrade to Tika 1.16 when available (Closed)