approaches to sentiment analysis mse 2400 ealicara spring 2015 dr. tom way based in part on notes...

12
Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Upload: candace-fox

Post on 17-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Approaches toSentiment Analysis

MSE 2400 EaLiCaRA

Spring 2015 Dr. Tom Way

Based in part on notes from Aditya Joshi

Page 2: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

What is Sentiment Analysis?

• Identify the orientation of opinion in a piece of text

• Can be generalized to a wider set of emotions

The movie was fabulous!

The movie stars Mr. X

The movie was horrible!

MSE 2400 Evolution & Learning 2

Page 3: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Motivation

• Knowing sentiment is a very natural ability of a human being.

Can a machine be trained to do it?

• SA aims at getting sentiment-related knowledge especially from the huge amount of information on the internet

• Can be generally used to understand opinion in a set of documents

MSE 2400 Evolution & Learning 3

Page 4: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Tripod of Sentiment AnalysisCognitive Science

Natural Language Processing

Machine Learning

Sentiment Analysis

Natural Language Processing

Machine Learning

MSE 2400 Evolution & Learning 4

Page 5: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Challenges

• Contrasts with standard text-based categorization

• Domain dependent

• Sarcasm

• Thwarted expressions

Mere presence of words is Indicative of the category

in case of text categorization.

Not the case with sentiment analysis

• Contrasts with standard text-based categorization

• Domain dependent

• Sarcasm

• Thwarted expressions

Sentiment of a word is w.r.t. the

domain.

Example: ‘unpredictable’

For steering of a car,

For movie review,

• Contrasts with standard text-based categorization

• Domain dependent

• Sarcasm

• Thwarted expressions

Sarcasm uses words ofa polarity to represent

another polarity.

Example: The perfume is soamazing that I suggest you wear it

with your windows shut

• Contrasts with standard text-based categorization

• Domain dependent

• Sarcasm

• Thwarted expressions

the sentences/words that contradict the overall sentiment

of the set are in majority

Example: The actors are good, the music is brilliant and appealing.

Yet, the movie fails to strike a chord.

MSE 2400 Evolution & Learning 5

Page 6: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Quantifying sentiment

Objective Polarity

Subjective Polarity

Positive Negative

Term sense position

Each term has a Positive, Negative and Objective score. The scores sum to one.

MSE 2400 Evolution & Learning 6

Page 7: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Approach 1: Using adjectives

• Many adjectives have high sentiment value– A ‘beautiful’ bag– A ‘wooden’ bench– An ‘embarrassing’ performance

• Focusing on adjectives might be beneficial

MSE 2400 Evolution & Learning 7

Page 8: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Approach 2: Using Adverb-Adjective Combinations (AACs)

• Calculate sentiment value based on the effect of adverbs on adjectives

• Linguistic ideas:• Adverbs of affirmation: certainly• Adverbs of doubt: possibly• Strong intensifying adverbs: extremely• Weak intensifying adverbs: scarcely• Negation and Minimizers: never

MSE 2400 Evolution & Learning 8

Page 9: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Approach 3: Subject-based SA

• Examples:

The horse bolted.

The movie lacks a good story.

MSE 2400 Evolution & Learning 9

Page 10: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Lexical Analysis

subj. bolt

b VB bolt subj

subj. lack obj.

b VB lack obj ~subj

Argument that sends the sentiment (subj./obj.)

Argument that receives the sentiment (subj./obj.)

Argument that receives the sentiment (subj./obj.)

MSE 2400 Evolution & Learning 10

Page 11: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Example

The movie lacks a good story.

G JJ good obj.

The movie lacks \S+.

B VB lack obj ~subj.

Lexicon : Steps :

1) Consider a context window of upto five words

2) Shallow parse the sentence

3) Step-by-step calculate the sentiment value based on lexicon and by adding ‘\S+’ characters at each step

MSE 2400 Evolution & Learning 11

Page 12: Approaches to Sentiment Analysis MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way Based in part on notes from Aditya Joshi

Applications

• Review-related analysis

• Developing ‘hate mail filters’ analogous to ‘spam mail filters’

• Question-answering (Opinion-oriented questions may involve different treatment)

MSE 2400 Evolution & Learning 12