criticism mining: text mining experiments on book, movie and music reviews

39
Criticism Mining: Text Mining Experiments on Book, Movie and Music Reviews Xiao Hu, J. Stephen Downie, M. Cameron Jones The International Music Information Retrieval Systems Evaluation Lab (IMIRSEL) University of Illinois at Urbana-Champaign THE ANDREW W. MELLON FOUNDATION

Upload: seda

Post on 20-Jan-2016

42 views

Category:

Documents


1 download

DESCRIPTION

THE ANDREW W. MELLON FOUNDATION. Criticism Mining: Text Mining Experiments on Book, Movie and Music Reviews. Xiao Hu, J. Stephen Downie, M. Cameron Jones The International Music Information Retrieval Systems Evaluation Lab (IMIRSEL) University of Illinois at Urbana-Champaign. Agenda. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Criticism Mining: Text Mining Experiments on Book, Movie and Music Reviews

Xiao Hu, J. Stephen Downie, M. Cameron Jones

The International Music Information Retrieval Systems Evaluation Lab (IMIRSEL)

University of Illinois at Urbana-Champaign

THE ANDREW W. MELLON FOUNDATION

Page 2: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Agenda

MotivationCustomer reviews in epinions.comExperimental SetupData setResultsConclusions & Future Work

Page 3: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

MotivationCritical consumer-generated reviews of humanities

materials a rich resource of reviewers’ opinions, and

background / contextual informationself-organized: pave ways to automatic

processing

Text mining: mature and ready to useCriticism mining: provides a tool to assist

humanities scholarsLocatingOrganizingAnalyzing critical review content

Page 4: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Customer Reviews

Published on www.epinions.com Focused on the book, movie and music Each review associated with:

a genre label a numerical quality rating

Page 5: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

numerical rating associated

used in our experiments

Page 6: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Music Genres

Jazz, Rock, Country, Classical, Blues, Gospel, Punk, .…

Renaissance, Medieval, Baroque, Romantic, …

28 Major Genre Categories

Page 7: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Experimental Setup

to build and evaluate a prototype criticism mining system that could automatically : predict the genre of the work being reviewed predict the quality rating assigned to the

reviewed item differentiate book reviews and movie reviews,

especially for items in the same genredifferentiate fiction and non-fiction book

reviews

Page 8: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Data set

Reviews on

Book Movie Music

#. Of reviews

1800 1650 1800

#. Of genres

9 11 12

Mean of review length

1,095 words 1,514 words 1,547 words

Std. Dev. of review length

446 words 672 words 784 words

Term list size

41,060 47,015 47,864

Page 9: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Genre Taxonomy

Book MovieAction / Thriller1 Action /Adventure1

Juvenile Fiction 2 Children 2

Humor 3 Comedies 3

Horror 4 Horror/Suspense 4

Music & Performing Arts 5

Musical & Performing Arts 5

Science Fiction & Fantasy 6

Science-Fiction / Fantasy 6

Biography & Autobiography

Documentary

Mystery & Crime Dramas

Romance Education/General Interest

Japanimation (Anime)

War

Page 10: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Genre Taxonomy : Book

Fiction Non-fictionAction / Thriller1 Humor 3

Juvenile Fiction 2 Music & Performing Arts 5

Horror 4 Biography & Autobiography

Science Fiction & Fantasy 6

Mystery & Crime

Romance

Page 11: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Genre Taxonomy : Music

Blues Heavy Metal

Classical International

Country Jazz Instrument

Electronic Pop Vocal

Gospel R&B

Hardcore/Punk

Rock & Pop

The genre labels and the rating information provided the ground truth for experiments

Page 12: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Data Preprocessing

HTML tags were stripped out; Stop words were NOT stripped out;Punctuation was NOT stripped out;

They may contain stylistic informationTokens were stemmed

Page 13: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Categorization Model & Implementation

Naïve Bayesian (NB) ClassifierComputationally efficientEmpirically effective

Text-to-Knowledge (T2K) ToolkitA text mining frameworkReady-to-use modules and itinerariesNatural Language Processing tools

integratedSupporting fast prototyping of text mining

Page 14: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

NB itinerary in T2K

Data Preprocessing

NB Classifier

Page 15: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Results & Discussions

Page 16: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Genre Classification

Reviews on Book Movie MusicNumber of genres 9 11 12Reviews in each genre 200 150 150Term list size (terms) 41,060 47,015 47,864Mean of review length (words)

1,095 1,514 1,547

Std Dev of review length (words)

446 672 784

Mean of precision 72.18%

67.70%

78.89%

Std Dev of precision 1.89% 3.51% 4.11%5 fold random cross validation for book and movie reviews3 fold random cross validation for music reviews

Page 17: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Confusion : Book Reviews

Classified As

Action

Bio. Hor.

Hum.

Juv. Mus.

Mys.

Rom.

Sci.

Action 0.61 0.01 0.06 0.01 0.02 0.03 0.20 0.05 0.02

Bio. 0.04 0.70

0.01 0.05 0.03 0.13 0.01 0.03 0

Horror 0.09 0 0.66

0 0.05 0 0.12 0.02 0.06

Humor 0.01 0.10 0 0.74 0.03 0.08 0.01 0.01 0.03

Juvenile 0.01 0.01 0 0.07 0.86

0.02 0 0.02 0

Music 0 0.09 0 0 0.01 0.89

0 0 0.01

Mystery 0.20 0 0.01 0 0.01 0 0.70

0.05 0.04

Romance 0.06 0.01 0.01 0 0.04 0 0.08 0.78 0.03

Science 0.03 0 0.02 0.01 0.11 0.03 0.01 0.13 0.66

Page 18: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Confusion : MovieClassified As

Act. Ani. Chi. Com.

Doc.

Dra.

Edu.

Hor.

Mus.

Sci. War

Action 0.77

0 0 0.01 0 0.01 0.02 0 0 0.10 0.09

Anime 0 0.89

0.03 0.03 0 0 0 0 0 0.05 0

Children

0.02 0.01 0.95

0 0.01 0.01 0.01 0 0 0 0

Comedy

0.09 0.01 0.06 0.52 0.03 0.17 0.06 0.01 0.03 0.01 0.02

Docu. 0.02 0 0 0.04 0.63

0.01 0.19 0 0.09 0 0.02

Drama 0.16 0 0 0.12 0.10 0.45

0.05 0.03 0.03 0.01 0.04

Edu. 0 0 0.02 0.02 0.31 0.03 0.57

0 0 0.01 0.03

Horror 0.15 0.02 0.02 0.02 0.03 0.02 0.05 0.69

0 0.10 0.02

Music 0 0 0 0.01 0.18 0 0 0 0.81

0 0

Science 0.04 0.01 0.02 0 0.06 0.01 0.02 0.03 0 0.76 0.05War 0.11 0 0.01 0.01 0.08 0.08 0.05 0.03 0.02 0.02 0.59

Page 19: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Confusion : MusicClassified

AsBlu. Cla. Cou Ele. Gos. Pun. Met. Int’l Jazz Pop. RB Roc.

Blues 0.61 0 0.10 0 0 0 0 0 0 0 0 0.29

Classical

0 0.94 0 0.03 0 0 0 0 0 0 0 0.03

Country 0 0 0.92 0 0.03 0 0 0 0 0 0 0.06

Electr. 0 0 0 0.92 0 0 0.06 0 0 0 0 0.03

Gospel 0 0 0.05 0 0.80 0 0 0 0 0 0.05 0.10

Punk 0 0 0 0.05 0 0.71 0.05 0 0 0 0 0.19

Metal 0 0 0 0 0 0 0.89 0 0 0 0 0.11

Int’l 0 0.04 0.00 0.04 0 0 0 0.81 0 0 0 0.04

Jazz 0 0 0 0.04 0 0 0 0 0.89 0.04 0 0.04

Pop Vo. 0 0 0.04 0.07 0 0 0 0.04 0.07 0.68 0 0.11

R&B 0 0 0 0 0 0 0 0 0 0.06 0.88 0.06

Rock 0.03 0 0.03 0 0 0 0.03 0 0 0.03 0 0.89

Page 20: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Rating Classification

Five-class classification1 star vs. 2 stars vs. 3 stars vs. 4 stars vs 5

starsBinary Group classification

1 star + 2 stars vs. 4 stars + 5 starsad extremis classification

1 star vs. 5 stars5 fold random cross validation for all experiments

Page 21: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Rating : Book Reviews

Experiments 5 classes

Binary Group

Ad extremis

Number of classes 5 2 2Reviews in each class 200 400 300Term list size (terms) 34,123 28,339 23,131Mean of review length (words)

1,240 1,228 1,079

Std Dev of review length (words)

549 557 612

Mean of precision 36.70% 80.13% 80.67%Std Dev of precision 1.15% 4.01% 2.16%

Page 22: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Rating : Movie Reviews

Experiments 5 classes

Binary Group

Ad extremis

Number of classes 5 2 2Reviews in each class 220 440 400Term list size (terms) 40,235 36,620 31,277Mean of review length (words)

1,640 1,645 1,409

Std Dev of review length (words)

788 770 724

Mean of precision 44.82%

82.27%

85.75%

Std Dev of precision 2.27% 2.02% 1.20%

Page 23: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Rating : Music Reviews

Experiments 5 classes

Binary Group

Ad extremis

Number of classes 5 2 2Reviews in each class 200 400 400Term list size (terms) 35,600 33,084 32,563Mean of review length (words)

1,875 2,032 1,842

Std Dev of review length (words)

913 912 956

Mean of precision 44.25%

79.75%

85.94%

Std Dev of precision 2.63% 3.59% 3.58%

Page 24: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Confusion : Book Reviews

Classified As

1 star

2 stars

3 stars

4 stars

5 stars

1 star 0.45 0.21 0.15 0.09 0.10

2 stars 0.24 0.36 0.19 0.12 0.09

3 stars 0.11 0.17 0.28 0.22 0.21

4 stars 0.05 0.06 0.17 0.41 0.31

5 stars 0.04 0.07 0.17 0.26 0.46

Page 25: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Confusion : Movie Reviews

Classified As

1 star

2 stars

3 stars

4 stars

5 stars

1 star 0.49 0.19 0.17 0.08 0.072 stars 0.15 0.45 0.23 0.11 0.063 stars 0.04 0.24 0.28 0.27 0.174 stars 0.05 0.13 0.13 0.41 0.275 stars 0.07 0.03 0.16 0.20 0.54

Page 26: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Confusion : Music Reviews

Classified As

1 star

2 stars

3 stars

4 stars

5 stars

1 star 0.61 0.24 0.07 0.05 0.022 stars 0.24 0.15 0.36 0.15 0.093 stars 0.11 0.13 0.41 0.20 0.154 stars 0.03 0.06 0.10 0.32 0.485 stars 0 0 0.09 0.11 0.80

Page 27: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Classification of Book and Movie Reviews 1Reviews on all available genres

Books : 9 genres; Movies : 11 genres

Reviews on individual, comparable genresBook Movie

Action / Thriller1 Action /Adventure1

Juvenile Fiction 2 Children 2

Humor 3 Comedies 3

Horror 4 Horror/Suspense 4

Music & Performing Arts 5

Musical & Performing Arts 5

Science Fiction & Fantasy 6

Science-Fiction / Fantasy 6

Page 28: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Classification of Book and Movie Reviews 2

Eliminated words that can directly suggest the categories: "book", "movie", "fiction", "film", "novel", "actor",

"actress", "read", "watch", "scene" Frequently occurred in each category, but not bothTo make things harder / avoid oversimplifying

Results suggest stylistic difference in users’ criticisms on books and movies

5 fold random cross validation for all experiments

Page 29: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Book vs. Movie Reviews 1

Genre All Genres

Action Horror

Number of classes 2 2 2

Reviews in each class 800 400 400

Term list size (terms) 49,263 24,552 25,509

Mean of review length (words)

1,608 933 1,779

Std Dev of review length (words)

697 478 546

Mean of precision 94.28%

95.63%

98.12%

Std Dev of precision 1.18% 0.99% 1.40%

Page 30: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Book vs. Movie Reviews 2

Genre Humor/Comedy

Juvenile /Children

Number of classes 2 2

Reviews in each class 400 400

Term list size (terms) 26,713 21,326

Mean of review length (words)

1,091 849

Std Dev of review length (words)

625 333

Mean of precision 99.13%

97.87%

Std Dev of precision 1.05% 0.71%

Page 31: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Book vs. Movie Reviews 3

Genre Music & Performing Arts

ScienceFiction & Fantasy

Number of classes 2 2

Reviews in each class 400 400

Term list size (terms) 23,217 25,088

Mean of review length (words)

791 1,011

Std Dev of review length (words)

531 544

Mean of precision 97.02% 97.25%

Std Dev of precision 1.49% 1.91%

Page 32: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Classification of Fiction and Non-fiction Book Reviews 1

Fiction Non-fiction

Action / Thriller1 Humor 3

Juvenile Fiction 2 Music & Performing Arts 5

Horror 4 Biography & Autobiography

Science Fiction & Fantasy 6

Mystery & Crime

Romance

Page 33: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Classification of Fiction and Non-fiction Book Reviews 2

Eliminated words that can directly suggest the categories: "fiction", "non", "novel", "character", "plot", and

"story" Frequently occurred in each category, but not bothTo make things harder / avoid oversimplifying

Results suggest stylistic difference in users’ criticisms on fiction books and non-fiction ones

5 fold random cross validation for all experiments

Page 34: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Fiction vs. Non-fiction Book Reviews

Experiment Fiction vs. Non-fiction

Number of classes 2

Reviews in each class 600

Term list size (terms) 35,210

Mean of review length (words)

1,220

Std Dev of review length (words)

493

Mean of precision 94.67%

Std Dev of precision 1.16%

Page 35: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Confusion : Fiction vs. Non-fiction Book Reviews

Classified As

Fiction

Non-fiction

Fiction 0.98 0.02

Non-Fiction

0.09 0.91

Page 36: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

ConclusionsCustomer reviews are an excellent

resource for studying humanities materialsSuccessful experiments:

High classification precisions:

Genres; Ratings; Book vs. movie reviews

Fiction vs. non-fiction book reviewsReasonable confusions

Text mining techniques can help find important information about the materials being reviewed

Criticism Mining : make the ever-growing consumer-generated review resources useful to humanities scholars.

Page 37: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Future work

More text mining techniquesdecision trees, frequent pattern mining

Other critical textblogs, wikis, etc

Other facets of reviews “usage” in music reviews

Feature studies answer the “why” questions

Page 38: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

References Argamon, S., and Levitan, S. (2005). Measuring the Usefulness of

Function Words for Authorship Attribution. Proceedings of the 17th Joined International Conference of ACH/ALLC.

Downie, J. S., Unsworth, J., Yu, B., Tcheng, D., Rockwell, G., and Ramsay, S. J. (2005). A Revolutionary Approach to Humanities Computing?: Tools Development and the D2K Data-Mining Framework. Proceedings of the 17th Joined International Conference of ACH/ALLC.

Hu, X., Downie, J. S., West, K., and Ehmann, A. (2005). Mining Music Reviews: Promising Preliminary Results. Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR).

Sebastiani, F. (2002). Machine Learning in Automated Text Categorization. ACM Computing Surveys, 34, 1.

Stamatatos, E., Fakotakis, N., and Kokkinakis, G. (2000). Text Genre Detection Using Common Word Frequencies. Proceedings of 18th International Conference on Computational Linguistics.

Page 39: Criticism Mining:  Text Mining Experiments on Book, Movie and Music Reviews

Questions?

IMIRSEL

Thank you!

THE ANDREW W. MELLON FOUNDATION