Tika / TIKA-2263

Sentiment Analysis - Standardize the interface and integrate OpenNLP, CoreNLP with pluggable models

    Details

      Description

      Background

      As per Wikipedia: Sentiment analysis (also known as opinion mining) refers to the use of natural language processing, text analysis and computational linguistics to identify and extract subjective information in source materials. Sentiment analysis is widely applied to reviews and social media for a variety of applications, ranging from marketing to customer service.

      Goals

      • Create an interface for SentimentAnalyser.
      • Create a parser for sentiment analysis with pluggable implementations of SentimentAnalyser and their model files.
      • Provide an implementation based on Apache OpenNLP and its default models for English. Make this the default.
      • Provide an implementation based on Stanford CoreNLP and make it optional (its license is incompatible with the Apache License 2.0). Make it attachable at runtime, similar to the existing CoreNLP Named Entity Recogniser.
        • Leave room for plugging in other or custom implementations at runtime.
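
      The goals above could be sketched roughly as follows. This is only an illustration, not the proposed Tika API: the analyse method signature, the SentimentSketch wrapper class, the load helper and the keyword-based KeywordAnalyser are all hypothetical names made up for this example. KeywordAnalyser is a toy stand-in for a model-backed implementation such as one built on OpenNLP or CoreNLP. The point it shows is that the caller depends only on the interface, and the concrete class is chosen by name at runtime.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

public class SentimentSketch {

    /** Interface name taken from the issue; the method shape is an assumption. */
    public interface SentimentAnalyser {
        /** Returns a label: "positive", "negative" or "neutral". */
        String analyse(String text);
    }

    /** Toy stand-in for a real, model-backed implementation (hypothetical). */
    public static class KeywordAnalyser implements SentimentAnalyser {
        private static final Set<String> POSITIVE =
                new HashSet<>(Arrays.asList("good", "great", "love"));
        private static final Set<String> NEGATIVE =
                new HashSet<>(Arrays.asList("bad", "poor", "hate"));

        @Override
        public String analyse(String text) {
            int score = 0;
            // Count positive and negative keywords, ignoring case and punctuation.
            for (String token : text.toLowerCase(Locale.ROOT).split("\\W+")) {
                if (POSITIVE.contains(token)) score++;
                if (NEGATIVE.contains(token)) score--;
            }
            return score > 0 ? "positive" : score < 0 ? "negative" : "neutral";
        }
    }

    /**
     * Instantiates an implementation by class name, so an optional
     * implementation only needs to be on the classpath at runtime.
     */
    public static SentimentAnalyser load(String className) {
        try {
            Class<?> cls = Class.forName(className);
            return (SentimentAnalyser) cls.getDeclaredConstructor().newInstance();
        } catch (ReflectiveOperationException | ClassCastException e) {
            throw new IllegalArgumentException("Cannot load analyser: " + className, e);
        }
    }

    public static void main(String[] args) {
        // Fall back to the bundled toy implementation when no class name is given.
        String impl = args.length > 0 ? args[0] : KeywordAnalyser.class.getName();
        SentimentAnalyser analyser = load(impl);
        System.out.println(analyser.analyse("I love this great library"));
    }
}
```

      Loading by class name is one way to keep an implementation with an incompatible license out of the default build; a configuration file or a service-loader mechanism would serve the same purpose.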

      References

            People

            • Assignee: Unassigned
            • Reporter: Thamme Gowda (thammegowda)