[LUCENE-7318] Graduate StandardAnalyzer out of analyzers module into core - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Blocker
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 6.2, 6.2.1, 7.0
Component/s: None
Labels:
None

Lucene Fields:

New

Description

Spinoff from ~~LUCENE-7314~~:

StandardAnalyzer has progressed substantially since we broke out the analyzers module ... it now follows a real Unicode standard (UAX #29 Unicode Text Segmentation). It's also much faster than it used to be, since it switched to JFlex a while back. Many bug fixes, etc.

I think it would make a good default for most Lucene users, and we should graduate it from the analyzers module into core, and make it the default for IndexWriter.

It's really quite crazy that users must go digging in the analyzers module to get started with Lucene ... we don't make them dig through the codecs module to find a good default codec ...

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-7318.patch
12/Jun/16 16:09
1.34 MB
Michael McCandless
LUCENE-7318-backwards.patch
12/Sep/16 18:28
38 kB
Uwe Schindler
LUCENE-7318-backwards.patch
12/Sep/16 12:59
37 kB
Uwe Schindler
LUCENE-7318-backwards.patch
11/Sep/16 19:51
35 kB
Uwe Schindler
LUCENE-7318-backwards.patch
11/Sep/16 18:30
28 kB
Uwe Schindler

Issue Links

is related to

LUCENE-7444 Remove English stopwords default from StandardAnalyzer in Lucene-Core

Closed

Activity

People

Assignee:: Michael McCandless

Reporter:: Michael McCandless

Votes:: 0 Vote for this issue

Watchers:: 10 Start watching this issue

Dates

Created:: 07/Jun/16 15:13

Updated:: 28/Aug/22 14:59

Resolved:: 12/Sep/16 18:29