Details
- New Feature
- Status: Open
- Major
- Resolution: Unresolved
- 0.9.0-incubating
- None
- None
Description
The "confidence-level" is intended to make it easier for clients to
decide how to process Enhancements. It would not use a numerical range
but four distinct values:
- confident: Meaning that a match is very likely, indicating that
those annotations can typically be accepted automatically (e.g. if the
EntityLinking engine finds a single Entity that exactly matches the
text selected by a text annotation)
- ambiguous: Meaning that there are several possibilities but it is
still likely that one of them matches (e.g. Paris, Paris (Texas))
- suggestion: Meaning that the match is not completely certain, but
there are not several options (e.g. Germans -> Germany)
- uncertain: Meaning that Entities do match, but the probability of a
match is rather speculative (e.g. John -> Elton John)
IMHO this classification would fit a lot of engines much better
than the numeric "fise:confidence" property, as it does not raise the
expectation in users that confidence values are on a ratio scale
(e.g. an Enhancement with a confidence of "0.8" is not twice as
likely as one with "0.4").
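
To illustrate how a client might consume such a classification, here is a minimal Java sketch. All names in it (ConfidenceLevel, ConfidenceLevelSketch, Action, actionFor) are hypothetical and not part of the Stanbol API. Only the automatic acceptance of "confident" matches comes from the description above; the actions chosen for the other three levels are assumptions made for the sake of the example.

```java
/** Hypothetical enum mirroring the four proposed values; not a Stanbol type. */
enum ConfidenceLevel {
    CONFIDENT,   // a match is very likely
    AMBIGUOUS,   // several candidates, one of them likely matches
    SUGGESTION,  // a single candidate, but not completely certain
    UNCERTAIN    // the match is rather speculative
}

/** Hypothetical client-side helper that decides how to process an Enhancement. */
final class ConfidenceLevelSketch {

    /** Possible client reactions; all but ACCEPT are assumptions of this sketch. */
    enum Action { ACCEPT, ASK_USER_TO_CHOOSE, ASK_USER_TO_CONFIRM, IGNORE }

    private ConfidenceLevelSketch() {
        // static helper, no instances
    }

    /** Maps each level to exactly one action, with no numeric thresholds involved. */
    static Action actionFor(ConfidenceLevel level) {
        switch (level) {
            case CONFIDENT:
                return Action.ACCEPT;               // stated in the description
            case AMBIGUOUS:
                return Action.ASK_USER_TO_CHOOSE;   // assumed: let the user disambiguate
            case SUGGESTION:
                return Action.ASK_USER_TO_CONFIRM;  // assumed: one candidate to confirm
            case UNCERTAIN:
                return Action.IGNORE;               // assumed: too speculative to show
            default:
                throw new IllegalArgumentException("Unknown level: " + level);
        }
    }
}
```

A client would call ConfidenceLevelSketch.actionFor(level) once per Enhancement. Because the values are categorical, the client never has to compare or threshold numbers, which is the point made above about "fise:confidence".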
Attachments
Issue Links
- is related to STANBOL-612 Add helper for validating the Stanbol EnhancementStructure to the Enhancer test module (Closed)