Uploaded image for project: 'OpenNLP'
  1. OpenNLP
  2. OPENNLP-715

Clark clusters NameFinder features

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 1.6.0
    • 1.6.0
    • Name Finder
    • None

    Description

      Add token based features from Clark clusters (Clark 2003). This feature is actually the same as the one implemented in the WordClusterFeatureGenerator, but we should somehow make them separate (perhaps implementing a dynamic prefix id for each one, as in the dictionary features) as it has been shown that the combination of these clustering-based features improve results.

      Clark clusters can be generated using this tool:

      https://github.com/ninjin/clark_pos_induction

      Attachments

        Activity

          People

            ragerri Rodrigo Agerri
            ragerri Rodrigo Agerri
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: