Details
-
Task
-
Status: Closed
-
Major
-
Resolution: Pending Closed
-
None
-
None
-
None
Description
Entity Disambiguation using FOAF correlation is implemented as a new Stanbol Enhancement Engine. The project includes 2 sub-tasks.
1. Entityhub site with substantial FOAF entities :
FOAF datasources are dispersed all over the internet, and it's essential to aggregate a sufficient FOAF dataset in-order to perform co-reference disambiguation techniques over data. foaf-site is the entityhub site created by indexing btc2012 dataset including FOAF data. For more information on the foaf-site integration in Stanbol please refer the github project at [1]. The README.md contains the steps to integrate btc2012 dataset as an entityhub-site in Stanbol.
2. Disambiguation Engine using FOAF correlation techniques :
The main algorithm calculates disambiguated confidence of an entity by analyzing the the connected-ness of the entity. This is calculated by processing the number of correlated URI References with other suggested entities in the content. Correlated URI links between entities derive their connected-ness and the highest connected entity will have high disambiguated confidence. The other disambiguation algorithm uses literal-matching of selected-text with foaf:name value of the entity. If matched, the confidence is increased. The foaf disambiguation engine project can be found at the github project [2]. To install the engine and configure Stanbol enhancement-chain please refer the README.md of the project.
[1] https://github.com/dileepajayakody/FOAFSite
[2] https://github.com/dileepajayakody/foaf-disambiguation