|
|
|
LUCENE-9842
|
LUCENE-9687
Hunspell suggestions: speed up for some non-Latin scripts
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9833
|
LUCENE-9687
Hunspell: AssertionError in WordStorage.lookupWord
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9832
|
LUCENE-9687
Hunspell: SIOOBE in GeneratingSuggester.expandRoot
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9831
|
LUCENE-9687
Hunspell GeneratingSuggester: faster flag & case checks, less allocations
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9830
|
LUCENE-9687
Hunspell: store word length for faster dictionary lookup/enumeration
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9825
|
LUCENE-9687
Hunspell: reverse the "words" trie for faster word lookup/suggestions
|
Unassigned
|
Peter Gromov
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9824
|
LUCENE-9687
Hunspell suggestions: speed up ngram score calculation for each dictionary entry
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9812
|
LUCENE-9687
Hunspell: honor empty stripping affixes when generating suggestions
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9811
|
LUCENE-9687
Hunspell suggestions: speed up ngram calculation by not searching for substrings in impossible places
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9810
|
LUCENE-9687
Hunspell: when generating suggestions, skip too deep word FST subtrees
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9808
|
LUCENE-9687
Hunspell suggestions: consider space/dash-separated words for each case variation
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9806
|
LUCENE-9687
Hunspell: speed up affix condition checking
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9805
|
LUCENE-9687
Hunspell: fix space + mixed case heuristics on suggestions
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9804
|
LUCENE-9687
Hunspell: fix most similar dictionary entry search by reversing the comparator
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9803
|
LUCENE-9687
Hunspell: don't check second stage suffixes if the first stage flag only occurs in prefixes
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9801
|
LUCENE-9687
Hunspell suggestions: speed up expandWord by enumerating only applicable affixes
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9800
|
LUCENE-9687
Hunspell: put a time limit on suggestion calculation
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9799
|
LUCENE-9687
Hunspell: don't check second-level affixes when the first level isn't a continuation
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9792
|
LUCENE-9687
Add automation for running regression tests
|
Dawid Weiss
|
Dawid Weiss
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9790
|
LUCENE-9687
Hunspell: avoid slow dictionary lookup if the word's hash isn't there
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9787
|
LUCENE-9687
Hunspell: speed up suggesting a bit by not creating a huge TreeSet
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9786
|
LUCENE-9687
Hunspell suggestions: try moving the last character into the middle
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9785
|
LUCENE-9687
Hunspell: don't check case in compound middle and end
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9784
|
LUCENE-9687
Hunspell suggestions: use US keyboard in absence of KEY option
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9783
|
LUCENE-9687
Hunspell: don't suggest more than 4 ngram corrections by default
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9782
|
LUCENE-9687
Hunspell suggestions: split by space (but not dash) also before last char
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9779
|
LUCENE-9687
Hunspell: add an API to interrupt long computations
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9778
|
LUCENE-9687
Hunspell: speed up input conversion
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9776
|
LUCENE-9687
Hunspell: allow to inflect the last part of COMPOUNDRULE compound
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9775
|
LUCENE-9687
Hunspell: make FORCEUCASE work when the first compound word is inherently title-case
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9772
|
LUCENE-9687
Hunspell: CHECKCOMPOUNDCASE shouldn't prohibit dash-separated uppercase compounds
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9771
|
LUCENE-9687
Hunspell: don't lookup word roots unnecessarily to check flags
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9770
|
LUCENE-9687
Hunspell: don't perform compound check recursively when looking for space-separated word pairs
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9769
|
LUCENE-9687
Hunspell: KEEPCASE should take precedence over affixed forms
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9766
|
LUCENE-9687
Hunspell: add API for retrieving dictionary morphological data and stemming
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9765
|
LUCENE-9687
Hunspell: rename SpellChecker to Hunspell, fix test name, update javadoc and CHANGES.txt
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9764
|
LUCENE-9687
Hunspell: try title case as FORCEUCASE misspelled word suggestions
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9763
|
LUCENE-9687
Hunspell: fix FORBIDDENWORD support
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9761
|
LUCENE-9687
Hunspell: check that FLAG and SET don't occur too far in the file, cleanup
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9760
|
LUCENE-9687
Hunspell: print total memory usage in TestAllDictionaries, cleanup
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9759
|
LUCENE-9687
Hunspell: add more to TestHunspellRepositoryTestCases.EXPECTED_FAILURES
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9758
|
LUCENE-9687
Hunspell: support NOSUGGEST option
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9753
|
LUCENE-9687
Hunspell: disallow compounds with parts present in dictionary space-separated
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9752
|
LUCENE-9687
Hunspell Stemmer: reduce parameter count
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9750
|
LUCENE-9687
Hunspell: improve suggestions for mixed-case misspelled words
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9749
|
LUCENE-9687
Hunspell: apply output conversion (OCONV) to the suggestions
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9748
|
LUCENE-9687
Hunspell: suggest inflected dictionary entries similar to the misspelled word
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9746
|
LUCENE-9687
Hunspell: unify case variation logic in Stemmer and SpellChecker
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9745
|
LUCENE-9687
Hunspell: tolerate more aff/dic file typos
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9743
|
LUCENE-9687
Hunspell: ignore original tests which are out of scope for now
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9742
|
LUCENE-9687
Hunspell: suggest dictionary entries similar to the misspelled word
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9740
|
LUCENE-9687
Avoid buffering and double-scan of flags in *.aff file
|
Dawid Weiss
|
Dawid Weiss
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9739
|
LUCENE-9687
Hunspell: speed up numeric flag parsing
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9736
|
LUCENE-9687
Hunspell: support MAP-based suggestions for groups of similar letters
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9735
|
LUCENE-9687
Hunspell: speed up flag checks by avoiding allocations
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9734
|
LUCENE-9687
Hunspell: support suggestions based on "ph" morphological data
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9733
|
LUCENE-9687
Hunspell: exception when loading dictionaries with mixed-case words and aliased flags
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9732
|
LUCENE-9687
Hunspell: support dictionary entries starting with slash
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9730
|
LUCENE-9687
Clean up temporary folder management in Dictionary
|
Dawid Weiss
|
Dawid Weiss
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9729
|
LUCENE-9687
Hunspell: support CHECKCOMPOUNDREP flags
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9728
|
LUCENE-9687
Hunspell: add a performance test
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9727
|
LUCENE-9687
Add build-side support for running full validation checks against hunspell repos
|
Dawid Weiss
|
Dawid Weiss
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9726
|
LUCENE-9687
Hunspell: speed up spellchecking by stopping at a single found stem
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9724
|
LUCENE-9687
Hunspell: tolerate existing aff/dic file typos
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9723
|
LUCENE-9687
Hunspell: update sanity tests that load all dictionaries
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9721
|
LUCENE-9687
Hunspell: disallow ONLYINCOMPOUND suffixes at the very end of compound words
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9720
|
LUCENE-9687
Hunspell: more ways to vary misspelled word variations for suggestions
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9717
|
LUCENE-9687
Hunspell: support CHECKCOMPOUNDPATTERN
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9716
|
LUCENE-9687
Hunspell: support flag usage before its format is even specified
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9710
|
LUCENE-9687
Hunspell: support minor compounding-related flags
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9709
|
LUCENE-9687
Hunspell: no special dotted i treatment outside tr/az languages
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9708
|
LUCENE-9687
Hunspell: support FLAG UTF-8 in absence of SET UTF-8
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9707
|
LUCENE-9687
Hunspell: check Lucene's implementation against Hunspell's test data
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9706
|
LUCENE-9687
Hunspell: support NEEDAFFIX flag on affixes
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9704
|
LUCENE-9687
Hunspell: support capitalization for German ß
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9703
|
LUCENE-9687
Hunspell: prohibit FORBIDDENWORD words and their case variations
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9702
|
LUCENE-9687
Hunspell: support alternate casing for short language codes
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9701
|
LUCENE-9687
Hunspell: implement simple REP-based suggestion algorithm
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9700
|
LUCENE-9687
Hunspell: support words with trailing dots
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9699
|
LUCENE-9687
Support German-like compound words
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9698
|
LUCENE-9687
Hunspell: reuse char[] when possible when stripping affix
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9697
|
LUCENE-9687
Hunspell Stemmer: use the same FST.BytesReader on all recursion levels
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9693
|
LUCENE-9687
Hunspell: check that all flags are > 0 and fit char
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9692
|
LUCENE-9687
Hunspell: extract Stemmer.stripAffix from similar code in prefix/suffix processing
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9691
|
LUCENE-9687
Hunspell: support trailing comments on aff option lines
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9690
|
LUCENE-9687
Hunspell: support special title-case for words with apostrophe
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9688
|
LUCENE-9687
Hunspell: consider prefix's continuation flags when applying suffix
|
Unassigned
|
Peter Gromov
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9685
|
LUCENE-9687
Hunspell: prefix condition is only checked on suffix, not stem
|
Dawid Weiss
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9684
|
LUCENE-9687
Hunspell: support COMPOUNDRULE
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9682
|
LUCENE-9687
Hunspell spellchecker: support numbers with separators
|
Unassigned
|
Peter Gromov
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9681
|
LUCENE-9687
Hunspell: deduplicate decodeFlags+hasFlag checks
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9678
|
LUCENE-9687
Hunspell: fix off-by-one error to support prefixes of word.length - 1
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9677
|
LUCENE-9687
Hunspell: simplify Dictionary.affixData storage
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9676
|
LUCENE-9687
Hunspell: improve stemming of all-caps words
|
Dawid Weiss
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9671
|
LUCENE-9687
Hunspell: shorten Stemmer.applyAffix
|
Unassigned
|
Peter Gromov
|
|
Closed |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9667
|
LUCENE-9687
Hunspell: add a spellchecker, support BREAK and FORBIDDENWORD affix rules
|
Unassigned
|
Peter Gromov
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9665
|
LUCENE-9687
Hunspell: support default encoding
|
Unassigned
|
Peter Gromov
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
LUCENE-9664
|
LUCENE-9687
Hunspell support: fix most IntelliJ warnings, cleanup
|
Unassigned
|
Peter Gromov
|
|
Resolved |
Fixed
|
|
|
|
|