Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-8904

Enhance Nori DictionaryBuilder tool

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 8.x, 9.0
    • None
    • None
    • New

    Description

      It is the Nori version of sokolov's LUCENE-8863.
      This patch has two changes.
      1) Improve exception handling
      2) Enable external dictionary for testing

      Overall, it is the same as LUCENE-8863.

      But there are some differences between Nori and Kuromoji.
      These can be slightly different on the code.
      1) CSV field size
      Nori : 12
      Kuromoji : 13
      2) left context ID == right context ID
      Nori : can be different
      Kuromoji : always same
      3) Dictionary Type
      Nori : just one type
      Kuromoji : IPADIC, UNIDIC

      After this job, I'll apply LUCENE-8866 and LUCENE-8871 to Nori.

      Attachments

        Issue Links

          Activity

            People

              danmuzi Namgyu Kim
              danmuzi Namgyu Kim
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1h 50m
                  1h 50m