[LUCENE-1058] New Analyzer for buffering tokens - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.3
Component/s: modules/analysis
Labels:
None

Description

In some cases, it would be handy to have Analyzer/Tokenizer/TokenFilters that could siphon off certain tokens and store them in a buffer to be used later in the processing pipeline.

For example, if you want to have two fields, one lowercased and one not, but all the other analysis is the same, then you could save off the tokens to be output for a different field.

Patch to follow, but I am still not sure about a couple of things, mostly how it plays with the new reuse API.

See http://www.gossamer-threads.com/lists/lucene/java-dev/54397?search_string=BufferingAnalyzer;#54397

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-1058.patch
19/Nov/07 13:52
13 kB
Grant Ingersoll
LUCENE-1058.patch
21/Nov/07 02:51
17 kB
Grant Ingersoll
LUCENE-1058.patch
27/Nov/07 01:57
28 kB
Grant Ingersoll
LUCENE-1058.patch
27/Nov/07 14:10
29 kB
Grant Ingersoll
LUCENE-1058.patch
27/Nov/07 17:30
29 kB
Grant Ingersoll
LUCENE-1058.patch
28/Nov/07 15:16
11 kB
Grant Ingersoll
LUCENE-1058.patch
28/Nov/07 15:57
11 kB
Grant Ingersoll

Issue Links

relates to

SOLR-330 Use new Lucene Token APIs (reuse and char[] buff)

Resolved

Activity

People

Assignee:: Grant Ingersoll

Reporter:: Grant Ingersoll

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 19/Nov/07 13:45

Updated:: 28/Aug/22 11:42

Resolved:: 29/Nov/07 15:18