Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-1786

improve performance of contrib/TestCompoundWordTokenFilter

Details

    • Test
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 4.0-ALPHA
    • modules/analysis
    • None
    • New, Patch Available

    Description

      contrib/analyzers/compound has some tests that use a hyphenation grammar file.

      The tests are currently for german, and they actually are nice, they show how the combination of the hyphenation rules and dictionary work in tandem.
      The issue is that the german grammar file is not apache licensed: http://offo.sourceforge.net/hyphenation/licenses.html
      So the test must download the entire offo zip file from sourceforge to execute.

      I happen to think the test is a great example of how this thing works (with a language where it matters), but we could consider using a different grammar file, for a language that is apache licensed.
      This way it could be included in the source with the test and would be more practical.

      Attachments

        1. LUCENE-1786.patch
          21 kB
          Robert Muir

        Issue Links

          Activity

            People

              rcmuir Robert Muir
              rcmuir Robert Muir
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: