Uploaded image for project: 'Lucene.Net'
  1. Lucene.Net
  2. LUCENENET-544

Turkish stemmer causes an IndexOutOfRange

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • Lucene.Net 4.8.0
    • Lucene.Net Contrib
    • None
    • Windows 7 x64, .NET 4.0

    Description

      Certain words in the Turkish stemmer is causing SnowballProgram to throw an index out of range exception in the method "find_among_b(Among[] v, int v_size)".

      Line 353
      diff = current[c - 1 - common] - w.s[i2];

      current[c - 1 - common] 'current[c - 1 - common]' threw an exception of type 'System.IndexOutOfRangeException' char

      {System.IndexOutOfRangeException}
      • current {ı}

        System.Text.StringBuilder
        Length 1 int

      c 8 int
      common 0 int

      Attachments

        1. 2015.12.24.TurkishStemmerFixTests.zip
          41 kB
          Türkay Kaynak
        2. Program.cs
          2 kB
          Türkay Kaynak
        3. Turkish.txt
          12 kB
          Peter Wright
        4. Turkish3.txt
          0.0 kB
          Peter Wright
        5. TurkishStemmer.TemporarySolution.cs
          84 kB
          Türkay Kaynak
        6. Visual Studio with exception details.png
          181 kB
          Peter Wright

        Issue Links

          Activity

            People

              Unassigned Unassigned
              pnwright Peter Wright
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: