Lucene - Core / LUCENE-3730

Improved Kuromoji search mode segmentation/decompounding

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.6, 4.0-ALPHA
    • Fix Version/s: 3.6, 4.0-ALPHA
    • Component/s: modules/analysis
    • Labels: None

    Description

      Kuromoji has a segmentation mode for search that uses a heuristic to promote additional segmentation of long candidate tokens to get a decompounding effect. This heuristic has been improved. Patch is coming up.
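
      The issue text does not spell out the heuristic itself, so the following is only a rough sketch of how a length-based penalty could push a lattice-style segmenter toward splitting long compounds such as 関西国際空港 into 関西 / 国際 / 空港. The class name, the threshold and penalty constants, and the flat per-token base cost are all assumptions made for illustration; they are not taken from the attached patch.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch of a search-mode length penalty. Not the code from the patch.
final class SearchModePenaltySketch {
    // Assumed, illustrative thresholds and weights.
    static final int KANJI_LENGTH = 2;     // kanji-only tokens longer than this are penalized
    static final int KANJI_PENALTY = 3000; // extra cost per kanji beyond the threshold
    static final int OTHER_LENGTH = 7;     // other tokens longer than this are penalized
    static final int OTHER_PENALTY = 1700; // extra cost per character beyond the threshold

    // True if the token is non-empty and every code point is in the Han script.
    static boolean isAllKanji(String token) {
        return !token.isEmpty()
            && token.codePoints()
                    .allMatch(cp -> Character.UnicodeScript.of(cp) == Character.UnicodeScript.HAN);
    }

    // Extra cost added to a candidate token based only on its length and script.
    static int penalty(String token) {
        int length = token.codePointCount(0, token.length());
        if (isAllKanji(token)) {
            return Math.max(0, length - KANJI_LENGTH) * KANJI_PENALTY;
        }
        return Math.max(0, length - OTHER_LENGTH) * OTHER_PENALTY;
    }

    // Total cost of one candidate segmentation, using a flat, made-up base cost per token.
    static int pathCost(List<String> tokens, int baseCostPerToken) {
        int total = 0;
        for (String token : tokens) {
            total += baseCostPerToken + penalty(token);
        }
        return total;
    }

    public static void main(String[] args) {
        // "Kansai kokusai kuukou" (Kansai International Airport), written with Unicode escapes.
        String whole = "\u95A2\u897F\u56FD\u969B\u7A7A\u6E2F";
        List<String> single = Arrays.asList(whole);
        List<String> split = Arrays.asList("\u95A2\u897F", "\u56FD\u969B", "\u7A7A\u6E2F");

        int baseCost = 5000; // assumed base cost per token
        System.out.println("single token cost: " + pathCost(single, baseCost));
        System.out.println("split tokens cost: " + pathCost(split, baseCost));
    }
}
```

      With these assumed numbers, the single long token pays a penalty for its four extra kanji, so the three-token path becomes the cheaper one. Without the penalty, the single token would win on base cost alone, which is the behaviour a search-oriented mode wants to avoid.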

      Attachments

        1. LUCENE-3730_trunk.patch
          13 kB
          Christian Moen

          People

            Assignee: Robert Muir (rcmuir)
            Reporter: Christian Moen (cm)
            Votes: 0
            Watchers: 1