Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
Patch Available
-
Yes
Description
SpanishMinimalStemmerFilter is less aggressive stemmer than SpanishLightStemmerFilter
Ex:
input tokens -> output tokens
1. camiseta niños -> camiseta and nino
2. camisas -> camisa
camisetas and camisas are t-shirts and shirts respectively.
Stemming both of the tokens to camis will match both tokens and returns both t-shirts and shirts for query camisas(shirts). SpanishMinimalStemmerFilter will help handling these cases.
And importantly It will preserve gender context with tokens.
Ex: niños ,niñas chicos and chicas are stemmed to nino, nina, chico and chica