Lucene.Net / LUCENENET-544

Turkish stemmer causes an IndexOutOfRange


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: Lucene.Net 4.8.0
    • Component/s: Lucene.Net Contrib
    • Labels: None
    • Environment: Windows 7 x64, .NET 4.0

    Description

      Certain words cause the Turkish stemmer to throw a System.IndexOutOfRangeException from SnowballProgram, in the method "find_among_b(Among[] v, int v_size)".

      Line 353
      diff = current[c - 1 - common] - w.s[i2];

      Debugger watch values at the point of failure (name, value, type):

      current[c - 1 - common]   'current[c - 1 - common]' threw an exception of type 'System.IndexOutOfRangeException'   char {System.IndexOutOfRangeException}
      current                   {ı}   System.Text.StringBuilder
        Length                  1     int
      c                         8     int
      common                    0     int
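
      With the values reported above, the expression on line 353 reads index c - 1 - common = 7 from a StringBuilder whose Length is 1. The following is a minimal sketch that reproduces only that out-of-range read, using the values from the debugger output. The class CursorGuardSketch and the helper TryCharBefore are hypothetical names introduced for illustration; they are not part of Lucene.Net, and this is not the fix that was applied to the stemmer.

```csharp
using System;
using System.Text;

public static class CursorGuardSketch
{
    // Hypothetical helper (not part of Lucene.Net): computes the same index as
    // line 353, current[c - 1 - common], but reports failure instead of throwing.
    public static bool TryCharBefore(StringBuilder current, int c, int common, out char ch)
    {
        int index = c - 1 - common;
        if (index < 0 || index >= current.Length)
        {
            ch = '\0';
            return false;
        }
        ch = current[index];
        return true;
    }

    public static void Main()
    {
        // Values taken from the debugger output in the report.
        var current = new StringBuilder("\u0131"); // dotless i, Length 1
        int c = 8;
        int common = 0;

        try
        {
            // Index 7 in a buffer of length 1: the StringBuilder indexer throws on read.
            char unused = current[c - 1 - common];
            Console.WriteLine("no exception: " + unused);
        }
        catch (IndexOutOfRangeException)
        {
            Console.WriteLine("IndexOutOfRangeException at index " + (c - 1 - common));
        }

        char ch;
        // Guarded read with the reported values: the index is rejected.
        Console.WriteLine(TryCharBefore(current, c, common, out ch));
        // Guarded read with a cursor inside the buffer: index 0 is valid.
        Console.WriteLine(TryCharBefore(current, 1, 0, out ch));
    }
}
```

      A bounds check like this would only mask the symptom. The reported values suggest that the cursor c is out of step with the buffer contents by the time find_among_b runs, so the underlying cause presumably lies in whatever leaves c at 8 while current holds a single character.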

      Attachments

        1. Turkish3.txt
          0.0 kB
          Peter Wright
        2. Turkish.txt
          12 kB
          Peter Wright
        3. Visual Studio with exception details.png
          181 kB
          Peter Wright
        4. Program.cs
          2 kB
          Türkay Kaynak
        5. TurkishStemmer.TemporarySolution.cs
          84 kB
          Türkay Kaynak
        6. 2015.12.24.TurkishStemmerFixTests.zip
          41 kB
          Türkay Kaynak


          People

            Assignee: Unassigned
            Reporter: Peter Wright (pnwright)
            Votes: 0
            Watchers: 6
