1. Solr

contrib - Solr Cell (Tika extraction)



Extract content from rich documents using Tika

Issues: Unresolved

Key Summary Due Date
Wish SOLR-1605 ExtractingRequestHandler does not embed original document
Improvement SOLR-1645 Add human content-type
Bug SOLR-1847 Solrj doesn't know if PDF was actually parsed by Tika

View Issues

Issues: Updated recently

Key Summary Updated
Bug SOLR-7139 SolrContentHandler for TIKA is broken by TikaOCR (caused by multiple startDocument events)
Bug SOLR-6856 regression in /update/extract ? ref guide examples of fmap & xpath don't seem to be working
Improvement SOLR-6488 Upgrade to TIKA 1.6

View Issues

Versions: Unreleased

Name Release date
Unreleased 4.10.5  
Unreleased Trunk  
Unreleased 5.1  
Unreleased 5.0.1