twitter rank—finding topic- sensitive influential twitters singapore management university jianshu...

27
Twitter rank—finding topic-sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International Conference on Web Search and Data Mining (WSDM 2010)

Upload: beverly-mills

Post on 19-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Twitter rank—finding topic-sensitive influential twitters

Singapore Management University

Jianshu WENG

Ee Peng LIM

Jing JIANG

Qi He

ACM International Conference on Web Search and Data Mining (WSDM 2010)

Page 2: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Outline

• Problem

• Dataset

• Twitter rank

• Results

• Recommended paper

Page 3: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Problem

• Identifying influential users of micro-blogging services

How?

Page 4: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Frequently used algorithmsName Algorithm Measures the influence with Problems

InD In-degreenumber of followers does not accurately capture the

notion of influence

PR PageRank

link structure of the network ignores the interests of twitterers, which affects the way twitterers influence one another

TSPR Topic-sensitive pagerank

calculating PageRank vector for each topic.

propagates a twitterer’s influence in one topic to her friends in different topics with equal probabilities

TR Twitterrank Link structure + Topic similarity between users

Page 5: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Problem

• Identifying influential users of micro-blogging services

Topic similarity between users Link structure Twitter rank

Homophily

Page 6: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Why the problem important?

Page 7: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Why the problem important?

• Brings order to the real-time web in that it allows the search results to be sorted by the authority/influence of the contributing twitterers giving a timely update of the thoughts of influential twitterers

• A marketing platform. Targeting those influential users will increase the e ciency of the ffimarketing campaign . For example, a hand phone manufacturer can engage those twitterers influential in topics about IT gadgets to potentially influence more people.

• There are also applications that utilize Twitter to gather opinions and information on particular topics. Identifying influential twitterers for interesting topics can improve the quality of the opinions gathered.

……

Page 8: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Context _ twitter

• Friend tweet

• Follower following

friend follower Tweet(<140words)

following

Page 9: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Context _ dataset

Top-1000 Singapore

Page 10: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Context _ dataset

Resource Size

Top-1000 Core 996

Extended twitterers

All the followers and friends of each individual twitterer

6,748

Tweets All the tweets they had published so far 1,021,039

Page 11: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Context _ dataset

• Power-law distribution

5686 >1 tweetsMean=179.57

Among the 6745 twitterers, 957 have no friends, while 1782 have no followers

Page 12: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Context _ dataset• Reciprocity

• Too casual Strong indicator of the similarity among users

72.4% 80%

following

80.5% 80%

following

Homophily? How to prove?

Page 13: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Twitterrank

Homophily

Page 14: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Topic distillation

• LDA(Latent Dirichlet Allocation)

p( word | tweet ) = p( word | topic )*p( topic | tweet )

tweet topic tweet

word word topic

DT: the probability that twitterer si is interested in topic tj .

Automatically identify the topics that twitterers are interested in based on the tweets they published.

WT

Page 15: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Homophily

Page 16: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Homophily

• Hypothesis testing

Case1: >30 friendsCase2: <30 friends

Most of the twitterers (3785/4050) have less than 30 friends, which is not statistically significant. Therefore, two cases are considered. Why?

11505 pairs of twitterers with reciprocal “following” relationship.

Page 17: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Twitterrank

Graph D(V,E)

• Vertex: twitterers

• Edge: “following” relationship

ransition matrix for topic t

Page 18: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Twitterrank

• Jump why jump?

It is possible that some twitterers would “follow” one another in a looping manner without “following” other twitterers outside the loop. Such loop will accumulate high influence without distribute their influence. To tackle this, a teleportation vector E t is also introduced, which basically captures the probability that the random surfer would “jump”to some twitterers instead of following the edges of the graph D

Page 19: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Twitterrank

1. probabilities of different topics’ presence WT2. probabilities that a particular twitterer is interested in different topics DT

Page 20: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Results

• Influential twitterers identified in the Twitter dataset

Page 21: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Compared algorithmsName Algorithm Measures the influence with Problems

InD In-degreenumber of followers does not accurately capture the

notion of influence

PR PageRank

link structure of the network ignores the interests of twitterers, which affects the way twitterers influence one another

TSPR Topic-sensitive pagerank

calculating PageRank vector for each topic.

propagates a twitterer’s influence in one topic to her friends in different topics with equal probabilities

TR Twitterrank Link structure + Topic similarity between users

Page 22: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Results

Page 23: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

Application _ recommendation task

• Randomly choose | L | existing “following” relationship formed among twitterers

A

B

A

B 9

8

2

3

1

5

6

7

104

Page 24: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International

A recommended paper

• Java A, Song X, Finin T, et al. Why we twitter: understanding microblogging usage and communities[C]//Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis. ACM, 2007: 56-65.

• 引用: 2126次

Main work:• People talk about their daily activities and to seek or share information.

• Analyze the user intentions associated at a community level and show how users with similar intentions connect with each other.

Page 25: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International
Page 26: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International
Page 27: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International