[SOLR-380] There's no way to convert search results into page-level hits of a "structured document". - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Minor
Resolution: Won't Fix
Affects Version/s: None
Fix Version/s: 4.9, 6.0
Component/s: search
Labels:
None

Description

"Paged-Text" FieldType for Solr

A chance to dig into the guts of Solr. The problem: If we index a monograph in Solr, there's no way to convert search results into page-level hits. The solution: have a "paged-text" fieldtype which keeps track of page divisions as it indexes, and reports page-level hits in the search results.

The input would contain page milestones: <page id="234"/>. As Solr processed the tokens (using its standard tokenizers and filters), it would concurrently build a structural map of the item, indicating which term position marked the beginning of which page: <page id="234" firstterm="14324"/>. This map would be stored in an unindexed field in some efficient format.

At search time, Solr would retrieve term positions for all hits that are returned in the current request, and use the stored map to determine page ids for each term position. The results would imitate the results for highlighting, something like:

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

SOLR-380-XmlPayload.patch
12/Nov/07 07:43
155 kB
Tricia Jenkins
SOLR-380-XmlPayload.patch
16/Nov/07 03:38
92 kB
Tricia Jenkins
xmlpayload-src.jar
24/Apr/08 00:09
5.74 MB
Tricia Jenkins
xmlpayload.jar
24/Apr/08 00:09
10 kB
Tricia Jenkins
xmlpayload-example.zip
24/Apr/08 00:18
8.55 MB
Tricia Jenkins

Issue Links

relates to

SOLR-532 WordDelimiterFilter ignores payloads

Closed

SOLR-4722 Highlighter which generates a list of query term position(s) for each item in a list of documents, or returns null if highlighting is disabled.

Open

SOLR-522 analysis.jsp doesn't show payloads created/modified by tokenizers and tokenfilters

Closed

Activity

People

Assignee:: Unassigned

Reporter:: Tricia Jenkins

Votes:: 4 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 16/Oct/07 03:50

Updated:: 27/Jul/16 05:01

Resolved:: 27/Jul/16 05:01