Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-7318

Graduate StandardAnalyzer out of analyzers module into core

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 6.2, 6.2.1, 7.0
    • Component/s: None
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      Spinoff from LUCENE-7314:

      StandardAnalyzer has progressed substantially since we broke out the analyzers module ... it now follows a real Unicode standard (UAX #29 Unicode Text Segmentation). It's also much faster than it used to be, since it switched to JFlex a while back. Many bug fixes, etc.

      I think it would make a good default for most Lucene users, and we should graduate it from the analyzers module into core, and make it the default for IndexWriter.

      It's really quite crazy that users must go digging in the analyzers module to get started with Lucene ... we don't make them dig through the codecs module to find a good default codec ...

        Attachments

        1. LUCENE-7318-backwards.patch
          38 kB
          Uwe Schindler
        2. LUCENE-7318-backwards.patch
          37 kB
          Uwe Schindler
        3. LUCENE-7318-backwards.patch
          35 kB
          Uwe Schindler
        4. LUCENE-7318-backwards.patch
          28 kB
          Uwe Schindler
        5. LUCENE-7318.patch
          1.34 MB
          Michael McCandless

          Issue Links

            Activity

              People

              • Assignee:
                mikemccand Michael McCandless
                Reporter:
                mikemccand Michael McCandless
              • Votes:
                0 Vote for this issue
                Watchers:
                10 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: