Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2209

Improved Tokenization for Similarity Scoring plugin

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Done
    • None
    • None
    • scoring

    Description

      This patch would add Lucene based tokenization to the cosine similarity plugin and clean up the code currently present.

      Attachments

        Activity

          People

            sujenshah Sujen Shah
            sujenshah Sujen Shah
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: