Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-4311

HunspellStemFilter returns another values than Hunspell in console / command line with same dictionaries.

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 3.5, 4.0-ALPHA, 3.6.1
    • Fix Version/s: None
    • Component/s: core/other
    • Labels:
      None
    • Environment:

      Apache Solr 3.5 - 4.0, Apache Tomcat 7.0

    • Lucene Fields:
      New

      Description

      When I used HunspellStemFilter for stemming the czech language text, it returns me bad results.

      For example word "praha" returns "praha" and "prahnout", what is not correct.

      So I try the same in my console (Hunspell command line) with exactly same dictionaries and it returns only "praha" and this is correct.

      Can somebody help me?

        Attachments

        1. cs_CZ.dic
          2.36 MB
          Jan Rieger
        2. cs_CZ.aff
          101 kB
          Jan Rieger

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                jrx Jan Rieger
              • Votes:
                0 Vote for this issue
                Watchers:
                7 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: