early detection of twitter trends

22
EARLY DETECTION OF TWITTER TRENDS MILAN STANOJEVIC [email protected] UNIVERSITY OF BELGRADE SCHOOL OF ELECTRICAL ENGINEERING

Upload: azure

Post on 25-Feb-2016

30 views

Category:

Documents


0 download

DESCRIPTION

Early detection of Twitter trends. milan stanojevi c sm 123317 [email protected]. university of Belgrade school of electrical engineering. contents. Introduction Trending topics Parametric model Data-Driven approach Experiment results Conclusion. Milan Stanojevic. introduction. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Early detection of Twitter  trends

EARLY DETECTION OF TWITTER TRENDS

MILAN [email protected]

UNIVERSITY OF BELGRADESCHOOL OF ELECTRICAL ENGINEERING

Page 2: Early detection of Twitter  trends

CONTENTS

Introduction Trending topics Parametric model Data-Driven approach Experiment results Conclusion

2/22

Milan Stanojevic

Page 3: Early detection of Twitter  trends

INTRODUCTION

Events occur in large datasets We need:

detection classification prediction

Parametric models are popular but overly simplistic

Nonparametric approach is proposed for time series inference

Observed signal is compared to two sets of reference signals – positive and negative examples Is there enough information for earlier

prediction?(spoiler alert: YES)

3/22

Milan Stanojevic

Page 4: Early detection of Twitter  trends

TRENDING TOPICS

Twitter: a global communication network Tweet: a short, public message Topic: a phrase in a tweet Trending topic (trend): a topic that

becomes popular

4/22

Milan Stanojevic

Page 5: Early detection of Twitter  trends

PARAMETRIC MODEL

Expect certain type of pattern usually constant + jumps

Fit parameter in data e.g. size of a jump

5/22

Milan Stanojevic

Page 6: Early detection of Twitter  trends

DATA-DRIVEN APPROACH

All the information needed is in the data Assumptions:

tweets are written by people people are simple:

in how they spread information in how they connect to each other

there is a small number of distinct ways in which a topic becomes trending

6/22

Milan Stanojevic

Page 7: Early detection of Twitter  trends

DATA-DRIVEN APPROACH

7/22

Milan Stanojevic

Page 8: Early detection of Twitter  trends

DATA DRIVEN APPROACH

8/22

Milan Stanojevic

Page 9: Early detection of Twitter  trends

CLASSIFICATION BY EXPERTS

9/22

Milan Stanojevic

Page 10: Early detection of Twitter  trends

CLASSIFICATION BY EXPERTS

10/22

Milan Stanojevic

Page 11: Early detection of Twitter  trends

CLASSIFICATION BY EXPERTS

11/22

Milan Stanojevic

Page 12: Early detection of Twitter  trends

CLASSIFICATION BY EXPERTS

12/22

Milan Stanojevic

Page 13: Early detection of Twitter  trends

CLASSIFICATION BY EXPERTS

13/22

Milan Stanojevic

Page 14: Early detection of Twitter  trends

CLASSIFICATION BY EXPERTS

14/22

Milan Stanojevic

Page 15: Early detection of Twitter  trends

CLASSIFICATION BY EXPERTS

15/22

Milan Stanojevic

Page 16: Early detection of Twitter  trends

CLASSIFICATION BY EXPERTS

16/22

Milan Stanojevic

Page 17: Early detection of Twitter  trends

CLASSIFICATION BY EXPERTS

17/22

Milan Stanojevic

Page 18: Early detection of Twitter  trends

CLASSIFICATION BY EXPERTS

Properties simple: computation of

distances scalable: computation is

easily parallelized nonparametric: model

“parameters” scale along with the data

18/22

Milan Stanojevic

Page 19: Early detection of Twitter  trends

EXPERIMENT

SETUP Dataset:

500 trends 500 non-trends

Do trend detection of 50% holdout set of topics

Online signal classification

RESULTS Early detection

79% rate of early detection, 1.43hrs average Low rate of error

95% true positive rate, 4% false positive rate

19/22

Milan Stanojevic

Page 20: Early detection of Twitter  trends

EXPERIMENT

FPR / TPR Tradeoff

Early / Late Tradeoff

20/22

Milan Stanojevic

Page 21: Early detection of Twitter  trends

CONCLUSION

New approach to detecting Twitter trends Generalized time series analysis method:

Classification Prediction Anomaly detection

Possible applications: Movie ticket sales Stock prices etc.

21/22

Milan Stanojevic

Page 22: Early detection of Twitter  trends

BIBLIOGRAPHY

Trend or No Trend: A Novel Nonparametric Method for Classifying Time SeriesStanislav Nikolov

Master thesisMassachusetts Institute of Technology (2011)

22/22Milan Stanojevic

[email protected]