Details
- Bug
- Status: Closed
- Blocker
- Resolution: Fixed
- 3.1
- None
Description
I updated this page to reflect the new bin/solr and example/exampledocs structure/contents...
https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika
However, I noticed that several of the examples listed on that page no longer seem to work, notably...
- examples using "fmap" don't seem to create the fields they say they will
- examples using "xpath" don't seem to create any docs at all
Specific examples I had problems with...
curl "http://localhost:8983/solr/techproducts/update/extract?literal.id=doc2&captureAttr=true&defaultField=text&fmap.div=foo_t&capture=div&commit=true" -F "sample=@example/exampledocs/sample.html"
curl "http://localhost:8983/solr/techproducts/update/extract?literal.id=doc3&captureAttr=true&defaultField=text&capture=div&fmap.div=foo_t&boost.foo_t=3&commit=true" -F "sample=@example/exampledocs/sample.html"
curl "http://localhost:8983/solr/techproducts/update/extract?literal.id=doc4&captureAttr=true&defaultField=text&capture=div&fmap.div=foo_t&boost.foo_t=3&literal.blah_s=Bah&commit=true" -F "sample=@example/exampledocs/sample.html"
curl "http://localhost:8983/solr/techproducts/update/extract?literal.id=doc5&captureAttr=true&defaultField=text&capture=div&fmap.div=foo_t&boost.foo_t=3&literal.id=id&xpath=/xhtml:html/xhtml:body/xhtml:div/descendant:node()&commit=true" -F "sample=@example/exampledocs/sample.html"
...none of these example commands produced an error, but they also didn't seem to create the fields/docs they said they would (i.e., no "foo_t" field was created).
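One way to confirm the symptom is to fetch each document back and look for the mapped field. The following is only a rough sketch, not part of the original report: it assumes the techproducts core answers on the standard /select handler with wt=json, and that "foo_t" would be a stored field (so it would show up in the returned document). The helper names select_url, missing_fields and check are hypothetical. doc5 is left out because its command supplies literal.id twice.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Base URL taken from the curl commands in the report.
SOLR_BASE = "http://localhost:8983/solr/techproducts"


def select_url(doc_id, base=SOLR_BASE):
    """Build a /select URL that fetches one document by id as JSON.

    Assumption: the core exposes the usual /select handler.
    """
    params = {"q": f"id:{doc_id}", "fl": "*", "wt": "json"}
    return f"{base}/select?{urlencode(params)}"


def missing_fields(response, expected):
    """Return the expected field names absent from the first matching doc.

    If no document matched at all, every expected field is reported missing.
    Assumption: the expected fields are stored, so they would be returned.
    """
    docs = response.get("response", {}).get("docs", [])
    if not docs:
        return list(expected)
    return [name for name in expected if name not in docs[0]]


def check(doc_id, expected):
    """Query a running Solr and report which expected fields are missing."""
    with urlopen(select_url(doc_id)) as resp:
        return missing_fields(json.load(resp), expected)


if __name__ == "__main__":
    # Requires a running Solr with the example commands already executed.
    for doc_id in ("doc2", "doc3", "doc4"):
        print(doc_id, "missing:", check(doc_id, ["foo_t"]))
```

Run after the curl commands, a non-empty "missing" list for a document would match the behaviour described above; an empty list would mean the field was created after all.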
Attachments
Issue Links
- relates to SOLR-7027 ExtractingRequestHandler indiscriminantly dumps all source HTML attributes into the catch-all field when captureAttr=false, but it should be more selective, something like only href, title, alt, etc. attributes (Open)