Stanbol / STANBOL-1411

Make the MLT disambiguation engine work with the FST linking engine

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.12.0, 1.0.0
    • Fix Version/s: 1.0.0, 0.12.1
    • Component/s: Enhancer
    • Labels:
      None

      Description

      The Solr MLT disambiguation engine uses the entityhub:site property to check whether a suggested Entity originates from the Entityhub Site it disambiguates against.

      The EntityhubLinkingEngine writes entityhub:site, but the FST linking engine does not: it uses a SolrCore directly and, strictly speaking, does not know about (or depend on) any Entityhub site.

      However, STANBOL-1391 added a feature to the FST linking engine that provides origin information for annotations. This information is written using the fise:origin property.

      This issue is about extending the MLT disambiguation engine to consider fise:origin as a fallback for entityhub:site. Users who want to combine MLT disambiguation with FST linking can then do so by configuring the name of the Entityhub site as the value of the enhancer.engines.linking.lucenefst.origin property of the FST linking engine.
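
      The fallback described above can be sketched roughly as follows. This is a minimal illustration and not the code from the attached patch: it models a suggestion as a plain Map of property names to values instead of using the Stanbol RDF API, and the class name OriginFallback, the methods resolveOrigin and belongsToSite, and the site name "dbpedia" are all hypothetical.

```java
import java.util.Map;
import java.util.Optional;

// Hypothetical sketch: a suggestion is modelled as a plain property map,
// not as the RDF enhancement structure Stanbol actually uses.
public class OriginFallback {

    // Property names taken from the issue description.
    static final String ENTITYHUB_SITE = "entityhub:site";
    static final String FISE_ORIGIN = "fise:origin";

    // Prefer entityhub:site; fall back to fise:origin when it is absent.
    static Optional<String> resolveOrigin(Map<String, String> suggestion) {
        String site = suggestion.get(ENTITYHUB_SITE);
        if (site != null) {
            return Optional.of(site);
        }
        return Optional.ofNullable(suggestion.get(FISE_ORIGIN));
    }

    // True if the suggestion originates from the given site (assumed name).
    static boolean belongsToSite(Map<String, String> suggestion, String siteName) {
        return resolveOrigin(suggestion).map(siteName::equals).orElse(false);
    }

    public static void main(String[] args) {
        // Suggestion as an Entityhub-based engine would annotate it.
        Map<String, String> fromEntityhub = Map.of(ENTITYHUB_SITE, "dbpedia");
        // Suggestion from FST linking, with the origin configured as "dbpedia".
        Map<String, String> fromFst = Map.of(FISE_ORIGIN, "dbpedia");
        // Suggestion without any origin information.
        Map<String, String> unknown = Map.of();

        System.out.println(belongsToSite(fromEntityhub, "dbpedia")); // true
        System.out.println(belongsToSite(fromFst, "dbpedia"));       // true
        System.out.println(belongsToSite(unknown, "dbpedia"));       // false
    }
}
```

      In this sketch entityhub:site takes precedence when both properties are present, which matches the "fallback" wording of the description. Whether the actual engine resolves such a conflict the same way is an assumption.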

        Attachments

        1. fst_disamb.patch
          4 kB
          Edi Bice

            People

            • Assignee: Rupert Westenthaler (rwesten)
            • Reporter: Rupert Westenthaler (rwesten)
            • Votes: 1
            • Watchers: 3

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                • Original Estimate: 2h
                • Remaining Estimate: 2h
                • Time Spent: Not Specified