[YARN-11242] New Map Reduce Example - Simple Sentiment Analysis - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Minor
Resolution: Duplicate
Affects Version/s: 3.4.0
Fix Version/s: None
Component/s: None
Labels: None
Target Version/s: 3.4.0
Flags: Patch
Language: Java

Description

I would like to add a new MapReduce example: sentiment analysis. The sentiment analysis job determines a sentiment score for each user. It assigns a sentiment score to every tweet made by a user and then aggregates the scores of that user's tweets, for every user in the dataset.

The example takes a Twitter dataset, which contains users and the tweets they made, and outputs <username, sentiment score> pairs. Each user's sentiment score is calculated over all the tweets made by that user.
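
The per-user aggregation described above can be sketched in plain Java. This is only an illustration and is not taken from the attached patch: the class name UserScoreAggregator, the method aggregate, and the use of (username, score) pairs as the intermediate form are assumptions, and summing is assumed as the aggregation rule because the issue does not state one.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration; names and the summing rule are assumptions,
// not code from YARN-11242.patch.
class UserScoreAggregator {

    // Sums per-tweet scores for each user, mirroring a reduce step keyed by username.
    // Each entry is one (username, per-tweet score) pair, as a map stage might emit it.
    static Map<String, Integer> aggregate(List<Map.Entry<String, Integer>> scored) {
        Map<String, Integer> totals = new TreeMap<>(); // sorted by username for stable output
        for (Map.Entry<String, Integer> pair : scored) {
            totals.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return totals;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Integer>> scored = List.of(
                Map.entry("alice", 2),
                Map.entry("bob", -1),
                Map.entry("alice", -1));
        System.out.println(aggregate(scored)); // prints {alice=1, bob=-1}
    }
}
```

In a real job the grouping by username would be done by the MapReduce shuffle, so the reducer would only see the scores for one user at a time.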

This MapReduce example takes two input files: the input Twitter dataset and a file containing a list of words.
The word list file contains positive, negative, and negation words, which are used to assign a sentiment score to the words in the tweets.
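
One way the three word categories could be combined into a score for a single tweet is sketched below. The issue does not describe the actual scoring rule or the word list file format, so everything here is an assumption: the class TweetScorer, the +1/-1 weights, and the rule that a negation word flips the sign of the next sentiment-bearing word.

```java
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration; the weights and the negation rule are assumptions,
// not the logic of YARN-11242.patch.
class TweetScorer {

    // Returns the sum of word scores: +1 for a positive word, -1 for a negative word.
    // A negation word inverts the sign of the next positive or negative word.
    static int score(String tweet, Set<String> positive, Set<String> negative, Set<String> negation) {
        int total = 0;
        boolean negateNext = false;
        // Lowercase, then split on anything that is not a letter or an apostrophe.
        for (String token : tweet.toLowerCase(Locale.ROOT).split("[^a-z']+")) {
            if (token.isEmpty()) {
                continue; // a leading separator yields an empty first token
            }
            if (negation.contains(token)) {
                negateNext = true;
                continue;
            }
            int wordScore = positive.contains(token) ? 1 : negative.contains(token) ? -1 : 0;
            if (wordScore != 0) {
                total += negateNext ? -wordScore : wordScore;
                negateNext = false;
            }
        }
        return total;
    }

    public static void main(String[] args) {
        Set<String> positive = Set.of("good", "great");
        Set<String> negative = Set.of("bad");
        Set<String> negation = Set.of("not");
        System.out.println(score("Not bad, really great!", positive, negative, negation)); // prints 2
    }
}
```

A mapper built on this idea would emit the username as the key and the returned score as the value for each tweet.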

You can run the example with the following command:
bin/hadoop jar /HADOOP_PATH/share/hadoop/mapreduce/mapreduce-examples.jar sentimentanalysis <input file/dir path> <output dir path> <word list file path/dir path>

For example, using the attached sample files, the command becomes:
bin/hadoop jar /HADOOP_PATH/share/hadoop/mapreduce/mapreduce-examples.jar sentimentanalysis sample_data.txt <output dir path> sample_words.txt

Attachments

sample_data.txt, 34 kB, 04/Aug/22 18:11, Meetu Patel
sample_words.txt, 2 kB, 04/Aug/22 18:12, Meetu Patel
YARN-11242.patch, 11 kB, 04/Aug/22 18:52, Meetu Patel

Issue Links

duplicates: MAPREDUCE-7400 New Map Reduce Example - Simple Sentiment Analysis (Patch Available)

People

Assignee: Unassigned
Reporter: Meetu Patel
Votes: 0
Watchers: 1

Dates

Created: 04/Aug/22 18:36
Updated: 04/Aug/22 18:54
Resolved: 04/Aug/22 18:54