Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-2263

Sentiment Analysis - Standardize the interface and integrate OpenNLP, CoreNLP with pluggable models

    XMLWordPrintableJSON

Details

    Description

      Background

      As per Wikipedia: Sentiment analysis (also known as opinion mining) refers to the use of natural language processing, text analysis and computational linguistics to identify and extract subjective information in source materials. Sentiment analysis is widely applied to reviews and social media for a variety of applications, ranging from marketing to customer service.

      Goals

      • Create an interface for SentimentAnalyser
      • Create a parser for sentiment analysis with pluggable implementations of SentimentAnalyser and its model files
      • Provide an implementation from Apache OpenNLP and its default models for english. Make this as default
      • Provide an implementation from Stanford CoreNLP and make this as optional (due to its license incompatibility wit AL2.0). Make it attachable at runtime similar to already existing CoreNLP Named Entity Recogniser.
        • Leave room for plugging other/custom implementations at the runtime

      References

      Attachments

        Activity

          People

            Unassigned Unassigned
            thammegowda Thamme Gowda
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: