Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
-
New
Description
The current implementation of the disjunction interval produced by Intervals.or is a direct implementation of the OR operator from the Vigna paper. This produces minimal intervals, meaning that (a) is preferred over (a b), and (b) also over (a b). This has advantages when it comes to counting intervals for scoring, but also has drawbacks when it comes to matching. For example, a phrase query for ((a OR (a b)) BLOCK (c)) will not match the document (a b c), because (a) will be preferred over (a b), and (a c) does not match.
This ticket is to discuss the best way of dealing with disjunctions.
Attachments
Attachments
Issue Links
- links to