[OPENNLP-758] Unsupervised WSD techniques - ASF JIRA

Details

Type: New Feature
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: wsd
Labels:
- gsoc
- gsoc2015
- java
- nlp
- wsd

Description

The objective of Word Sense Disambiguation (WSD) is to determine which sense of a word is meant in a particular context. Therefore, WSD is a classification task, where the classes are the different senses of the ambiguous word.

Many techniques have been proposed in the academic literature; they fall mainly into two categories: supervised and unsupervised.

This component focuses on unsupervised techniques: these methods rely only on unlabeled data and do not use any manually sense-tagged data.

The objective of this project is to create a WSD solution (for English) that implements some unsupervised techniques. For example:

- Context Clustering
- Word Clustering
- Co-occurrence Graphs
- Overlap of Sense Definitions
- Selectional Preferences
- Structural Approaches
- Etc.
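
To make the "Overlap of Sense Definitions" idea concrete, here is a minimal sketch of a simplified Lesk-style disambiguator: it picks the sense whose definition (gloss) shares the most content words with the surrounding context. The class name `SimplifiedLesk`, the sense ids, the two glosses, and the tiny stop-word list are all assumptions made up for illustration. They are not taken from WordNet, from OpenNLP, or from the patches attached to this issue.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration class; not part of OpenNLP.
public class SimplifiedLesk {

    // Tiny illustrative stop-word list (an assumption); a real system would use a fuller one.
    private static final Set<String> STOP_WORDS = new HashSet<>(Arrays.asList(
            "a", "an", "the", "of", "in", "on", "at", "to", "and", "or", "that", "is", "i", "my"));

    /** Lower-cases the text, splits on non-letter characters, and drops stop words. */
    public static Set<String> contentWords(String text) {
        Set<String> words = new HashSet<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z]+")) {
            if (!token.isEmpty() && !STOP_WORDS.contains(token)) {
                words.add(token);
            }
        }
        return words;
    }

    /**
     * Returns the id of the sense whose gloss shares the most content words
     * with the context. Ties go to the first sense in the map's iteration
     * order; returns null if the map is empty.
     */
    public static String disambiguate(String context, Map<String, String> glosses) {
        Set<String> contextWords = contentWords(context);
        String bestSense = null;
        int bestOverlap = -1;
        for (Map.Entry<String, String> sense : glosses.entrySet()) {
            Set<String> overlap = contentWords(sense.getValue());
            overlap.retainAll(contextWords); // keep only words also seen in the context
            if (overlap.size() > bestOverlap) {
                bestOverlap = overlap.size();
                bestSense = sense.getKey();
            }
        }
        return bestSense;
    }

    public static void main(String[] args) {
        // Hand-written glosses for two senses of "bank" (illustrative only).
        Map<String, String> glosses = new LinkedHashMap<>();
        glosses.put("bank#finance", "a financial institution that accepts deposits and lends money");
        glosses.put("bank#river", "sloping land beside a body of water such as a river");

        String context = "I sat on the bank of the river and watched the water";
        System.out.println(disambiguate(context, glosses)); // prints bank#river
    }
}
```

No sense-tagged training data is involved: the only resources are a sense inventory with definitions and the context itself, which is what makes this family of methods attractive for an unsupervised component. A practical version would likely add stemming or lemmatization and draw glosses from a lexical resource rather than a hard-coded map.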

Attachments

- lesk_params_source.patch, 05/Aug/15 16:30, 6 kB, Anthony Beylerian
- updates_and_fix_new_datareaders.patch, 03/Aug/15 18:41, 82 kB, Anthony Beylerian
- cleanup.patch, 02/Jul/15 09:27, 79 kB, Anthony Beylerian
- lesk_parameters.patch, 19/Jun/15 08:52, 68 kB, Anthony Beylerian
- opennlp-tools-disambiguator.patch, 10/Jun/15 12:29, 54 kB, Anthony Beylerian

People

Assignee: Anthony Beylerian

Reporter: Mondher Bouazizi

Votes: 0

Watchers: 6

Dates

Created: 12/Feb/15 09:57

Updated: 11/Aug/15 17:15