[SOLR-4656] Add hl.maxMultiValuedToExamine to limit the number of multiValued entries examined while highlighting - ASF JIRA

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: 4.3, 6.0
Fix Version/s: 4.3, 6.0
Component/s: highlighter
Labels: None

Description

I'm looking at an admittedly pathological case: a multiValued field with many, many entries. I'm trying to implement a way to limit the number of entries examined while highlighting, analogous to maxAnalyzedChars. See the attached patch.
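
To make the proposal concrete, here is a minimal sketch of how a request using the new parameter might be assembled. Only hl.maxMultiValuedToExamine and maxAnalyzedChars come from this issue; the class name, the field name "comments", and the numeric limits are hypothetical, and the values are assumed to be URL-safe so no encoding is shown.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.stream.Collectors;

public class HighlightParamsSketch {

    // Builds a query string for a highlighting request.
    // Assumption: all values are URL-safe, so no encoding is applied here.
    static String buildQuery(String q, String field, int maxValues, int maxChars) {
        Map<String, String> params = new LinkedHashMap<>(); // keeps insertion order
        params.put("q", q);
        params.put("hl", "true");
        params.put("hl.fl", field);
        // The parameter proposed in this issue: cap on multiValued entries examined.
        params.put("hl.maxMultiValuedToExamine", Integer.toString(maxValues));
        // The existing per-character limit the new parameter is modeled on.
        params.put("hl.maxAnalyzedChars", Integer.toString(maxChars));
        return params.entrySet().stream()
                .map(e -> e.getKey() + "=" + e.getValue())
                .collect(Collectors.joining("&"));
    }

    public static void main(String[] args) {
        // "comments", 100, and 51200 are illustrative values only.
        System.out.println(buildQuery("solr", "comments", 100, 51200));
    }
}
```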

Along the way, I noticed that we do what looks like unnecessary copying of the fields to be examined. We call Document.getFields, which copies all of the fields and values into the returned array. Then we copy all of those into another array, converting them to Strings. Only then do we actually examine them. This (a) doesn't seem very efficient and (b) reduces the benefit of limiting the number of multiValued entries examined.

So the attached patch does two things:
1> attempts to eliminate the redundant copying described above
2> implements hl.maxMultiValuedToExamine
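
The two changes can be pictured together with a small sketch. This is not the code from the patch, and it deliberately avoids the Lucene/Solr classes: the class and method names below are hypothetical, and a plain Iterable<String> stands in for the stored field values. The idea it illustrates is walking the values once and stopping at the limit, rather than copying everything into intermediate arrays first.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class LimitedValueScan {

    // Collects at most maxValues entries in a single pass.
    // Values past the limit are never touched, so a field with a huge
    // number of entries costs no more than the limit allows.
    static List<String> valuesToExamine(Iterable<String> storedValues, int maxValues) {
        List<String> result = new ArrayList<>();
        if (maxValues <= 0) {
            return result; // nothing to examine
        }
        for (String value : storedValues) {
            result.add(value);
            if (result.size() >= maxValues) {
                break; // limit reached: stop before reading further values
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> values = Arrays.asList("first", "second", "third", "fourth");
        // With a limit of 2, only the first two entries are handed to the highlighter.
        System.out.println(valuesToExamine(values, 2));
    }
}
```

Taking an Iterable rather than an array is the point of the sketch: the caller does not need a fully materialized copy of every value before the limit is applied.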

I'd really love it if someone who knows the highlighting code would take a peek at the fix to see whether I've messed anything up. The changes are actually pretty minimal.

Attachments

SOLR-4656-4x.patch, 31/Mar/13 16:07, 13 kB, Erick Erickson
SOLR-4656-4x.patch, 31/Mar/13 02:17, 14 kB, Erick Erickson
SOLR-4656-trunk.patch, 31/Mar/13 02:17, 14 kB, Erick Erickson
SOLR-4656.patch, 30/Mar/13 14:00, 5 kB, Erick Erickson

Issue Links

relates to: SOLR-6692 hl.maxAnalyzedChars should apply cumulatively on a multi-valued field (Closed)

People

Assignee: Erick Erickson
Reporter: Erick Erickson
Votes: 0
Watchers: 2

Dates

Created: 30/Mar/13 13:36
Updated: 09/May/16 18:57
Resolved: 02/Apr/13 17:07