[LUCENE-6896] Fix/document various Similarity bugs around extreme norm values - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 5.5, 6.0
Component/s: None
Labels:
None

Lucene Fields:

New

Description

Spinoff from ~~LUCENE-6818~~:

iorixxx found problems with every Similarity (except ClassicSimilarity) when trying to test how they behave on every possible norm value, to ensure they are robust for all index-time boosts.

There are several problems:
1. buggy normalization decode that causes the smallest possible norm value (0) to be treated as an infinitely long document. These values are intended to be encoded as non-negative finite values, but going to infinity breaks everything.
2. various problems in the less practical functions that already have documented warnings that they do bad things for extreme values. These impact DFR models D, Be, and P and IB distribution SPL.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-6896.patch
15/Nov/15 14:45
12 kB
Robert Muir

Activity

People

Assignee:: Unassigned

Reporter:: Robert Muir

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 15/Nov/15 14:39

Updated:: 28/Aug/22 14:45

Resolved:: 18/Jan/16 08:09