Description
A multivalued field gets the wrong norm when the field value is tokenized, the field or document is boosted, and the field is not the source of a copyField.
$ java -jar start.jar &
$ echo '{ "add": { "doc": { "id":"no-boosted", "features": ["a","b","c"], "dyn_not_copied_txt": ["a","b","c"] } }, "add": { "boost": 10, "doc": { "id":"boosted", "features": ["a","b","c"], "dyn_not_copied_txt": ["a","b","c"] } }}' > test.json
$ curl 'http://localhost:8983/solr/update/json?commit=true' -H 'Content-type:application/json' --data-binary @test.json
{"responseHeader":{"status":0,"QTime":41}}
$ curl 'http://localhost:8983/solr/select' -d 'omitHeader=true&wt=json&indent=on&q=*:*&fl=id,norm(features),norm(dyn_not_copied_txt)'
{
  "response":{"numFound":2,"start":0,"docs":[
      {
        "id":"no-boosted",
        "norm(features)":0.5,
        "norm(dyn_not_copied_txt)":0.5},
      {
        "id":"boosted",
        "norm(features)":5.0,
        "norm(dyn_not_copied_txt)":512.0}]
  }}
In the above example, "features" is the source of a copyField, whereas "dyn_not_copied_txt" is not.
"features" and "dyn_not_copied_txt" have the same type attribute (type="text_general"), the same values ( ["a","b","c"] ) and the same boost. So both fields should have the same norm in the document.
However, in the boosted document only, the field that is not copied has a far larger norm (512.0 instead of 5.0).
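The reported values can be checked by hand. The sketch below assumes the default similarity computes the norm as boost / sqrt(number of terms) and stores it in one byte; the two encoding helpers are re-implemented here from memory of Lucene's SmallFloat 3-mantissa-bit/zero-exponent-15 format, and the class and method names (NormSketch, storedNorm) are made up for illustration. Under those assumptions, 512.0 is what a boost of 10 * 10 * 10 = 1000 would produce, which would be consistent with the boost being applied once per value of the multivalued field. This is only an interpretation of the numbers, not a confirmed diagnosis.

```java
public class NormSketch {
    // One-byte float encoding, re-implemented for illustration (assumed to
    // match Lucene's SmallFloat.floatToByte315).
    static byte floatToByte315(float f) {
        int bits = Float.floatToRawIntBits(f);
        int smallfloat = bits >> (24 - 3);
        if (smallfloat <= ((63 - 15) << 3)) {
            // Zero, negative, or too small: clamp to 0 or the smallest code.
            return (bits <= 0) ? (byte) 0 : (byte) 1;
        }
        if (smallfloat >= ((63 - 15) << 3) + 0x100) {
            return -1; // overflow: largest representable value
        }
        return (byte) (smallfloat - ((63 - 15) << 3));
    }

    // Inverse of the encoding above.
    static float byte315ToFloat(byte b) {
        if (b == 0) {
            return 0.0f;
        }
        int bits = (b & 0xff) << (24 - 3);
        bits += (63 - 15) << 24;
        return Float.intBitsToFloat(bits);
    }

    // Assumed norm formula: boost / sqrt(numTerms), then a round trip
    // through the lossy one-byte encoding.
    static float storedNorm(float boost, int numTerms) {
        float raw = boost * (float) (1.0 / Math.sqrt(numTerms));
        return byte315ToFloat(floatToByte315(raw));
    }

    public static void main(String[] args) {
        System.out.println(storedNorm(1f, 3));              // no boost
        System.out.println(storedNorm(10f, 3));             // boost applied once
        System.out.println(storedNorm(10f * 10f * 10f, 3)); // boost applied per value
    }
}
```

With three terms, the three calls print 0.5, 5.0 and 512.0, matching the "no-boosted" document, the "features" field of the boosted document, and the "dyn_not_copied_txt" field of the boosted document respectively.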
Attachments
Issue Links
- is broken by: SOLR-6259 Performance issue with large number of fields and values when using copyFields (Resolved)