Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-8817

Combine Nori and Kuromoji DictionaryBuilder

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None
    • New

    Description

      This issue is related to LUCENE-8816.

      Currently Nori and Kuromoji Analyzer use the same dictionary structure. (MeCab)
      If we make combine DictionaryBuilder, we can reduce the code size.
      But this task may have a dependency on the language.
      (like HEADER string in BinaryDictionary and CharacterDefinition, methods in BinaryDictionaryWriter, ...)
      On the other hand, there are many overlapped classes.

      The purpose of this patch is to provide users of Nori and Kuromoji with the same system dictionary generator.

      It may take some time because there is a little workload.
      The work will be based on the latest master, and if the LUCENE-8816 is finished first, I will pull the latest code and proceed.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              danmuzi Namgyu Kim
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated: