Lucene - Core / LUCENE-10416

Update Korean Dictionary for Nori

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 10.0 (main)
    • Component/s: modules/analysis
    • Labels: None
    • Lucene Fields: New

    Description

      Nori, the Korean analyzer, uses a Korean dictionary named mecab-ko-dic, which is available under an Apache license here: https://bitbucket.org/eunjeon/mecab-ko-dic

       

      The copy of the dictionary used by Nori has not been updated, although upstream has since published updates that give better analysis results. Releases can be downloaded here: https://bitbucket.org/eunjeon/mecab-ko-dic/downloads

       

      The latest release differs from the version currently used in the following ways (change log: https://bitbucket.org/eunjeon/mecab-ko-dic/src/master/CHANGES.md):

      • New feature: added semantic classes for NNG (general nouns): 장소 (place), 행위 (action), 상태변화 (change of state), 정적상태 (static state)
      • Fix: corrected an unexpectedly large cost on NNG/장소 (place) entries
      • New words
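
To make the first two changes concrete, here is a minimal sketch of how the semantic class and word cost of a dictionary entry could be inspected. It is not part of the patch. The column order (surface, left context id, right context id, word cost, POS tag, semantic class, ...) is an assumption about the mecab-ko-dic CSV layout, and every row, id, and cost in SAMPLE is invented for illustration. The names Entry, parse_entries, and high_cost are hypothetical helpers.

```python
import csv
from dataclasses import dataclass
from io import StringIO


@dataclass
class Entry:
    """The leading columns of one dictionary row (assumed layout)."""
    surface: str
    left_id: int
    right_id: int
    cost: int
    pos_tag: str
    semantic_class: str  # "*" when no semantic class is assigned


def parse_entries(text):
    """Parse CSV text into Entry objects, skipping rows with too few columns."""
    entries = []
    for row in csv.reader(StringIO(text)):
        if len(row) < 6:
            continue
        entries.append(
            Entry(row[0], int(row[1]), int(row[2]), int(row[3]), row[4], row[5])
        )
    return entries


def high_cost(entries, threshold):
    """Return the entries whose word cost is strictly above the threshold."""
    return [e for e in entries if e.cost > threshold]


# Hypothetical rows: the ids and costs are made up, not taken from mecab-ko-dic.
SAMPLE = """\
학교,1780,3534,2000,NNG,장소,F,학교,*,*,*,*
공부,1780,3533,1500,NNG,행위,F,공부,*,*,*,*
나무,1780,3533,1800,NNG,*,F,나무,*,*,*,*
공원,1780,3534,30000,NNG,장소,T,공원,*,*,*,*
"""

if __name__ == "__main__":
    entries = parse_entries(SAMPLE)
    for entry in entries:
        print(entry.surface, entry.pos_tag, entry.semantic_class, entry.cost)
    # List entries whose cost is far above the rest of the sample.
    print([e.surface for e in high_cost(entries, 10000)])
```

Since the analyzer prefers lower-cost paths when segmenting text, an entry with a cost far above its peers is rarely selected, which is the kind of problem the NNG/장소 fix addresses.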

       

      With the updated dictionary, the :lucene:analysis:nori:test task passes and a new binary builds without issues.

      Attachments

        1. LUCENE-10416.patch (5 kB, Uihyun Kim)

          People

            Assignee: Unassigned
            Reporter: Uihyun Kim
            Votes: 1
            Watchers: 4
