modeling dynamic multi-topic discussions in online forums

16
Modeling Dynamic Multi- topic Discussions in Online Forums Hao Wu, Jiajun Bu, Chun Chen, Can Wang, Guang Qiu, Lijun Zhang and Jianfeng Shen* Zhejiang University, China *Zhejiang Health Information Center, China July 13, AAAI’2010 Atlanta, GA, USA

Upload: arden-willis

Post on 31-Dec-2015

29 views

Category:

Documents


0 download

DESCRIPTION

Modeling Dynamic Multi-topic Discussions in Online Forums. Hao Wu , Jiajun Bu, Chun Chen, Can Wang, Guang Qiu, Lijun Zhang and Jianfeng Shen * Zhejiang University, China *Zhejiang Health Information Center, China. July 13, AAAI’2010 Atlanta, GA, USA. Social Media. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Modeling Dynamic Multi-topic Discussions in Online Forums

Modeling Dynamic Multi-topic Discussions in Online Forums

Hao Wu, Jiajun Bu, Chun Chen, Can Wang, Guang Qiu, Lijun Zhang and Jianfeng Shen*

Zhejiang University, China*Zhejiang Health Information Center, China

July 13, AAAI’2010

Atlanta, GA, USA

Page 2: Modeling Dynamic Multi-topic Discussions in Online Forums

Social Media• Web 2.0 applications socialize users online

• Online Forums– Distinct platform for knowledge sharing and information exchange

2

Reveal how information propagates on Internet.Modeling the process of topic discussions and predicting user activity is an interesting problem!

Page 3: Modeling Dynamic Multi-topic Discussions in Online Forums

Benefits of Modeling

• Understand online human interactions and group forming

• Improve applications e.g., recommender• Track new ideas and technology• Mine opinions about products

3

Social network analysis

User review

Page 4: Modeling Dynamic Multi-topic Discussions in Online Forums

Environment of Online Forums

• Great complexity

• Randomness– Usually no well-defined

friendships or co-authorships– Free to posting– Topic drifts in a single thread

4

?

What are the mechanisms underlying user’s participation

What are the mechanisms underlying user’s participation

From which perspective to view the process of topic discussion

From which perspective to view the process of topic discussion

How to make use of the property of topics and temporal feature for modeling

How to make use of the property of topics and temporal feature for modeling

How to measure the importance of a user in discussions

How to measure the importance of a user in discussions

Modeling Dynamic Multi-topic Discussions is challenging !

433,839 threads13,599,245 posts

433,839 threads13,599,245 posts

Page 5: Modeling Dynamic Multi-topic Discussions in Online Forums

5

Outline

• Motivation and Intuitions

• Topic Flow Models

• Experimental Results

• Summary

Page 6: Modeling Dynamic Multi-topic Discussions in Online Forums

Topic Flow Model (TFM)

6

Reply LinkTopic Flow

The new comer reads some of the previous comments before posting.

The information (topic) flows from early participant to late participant .

Topic diffuses through the underlying social networks

Page 7: Modeling Dynamic Multi-topic Discussions in Online Forums

Basic Topic Flow Model (B-TFM)

7

Thread Document: Frequency of :Frequency of :

d D

Topic Flow

Peer-influence

dijRdiC

Self-preference

Normalization

ParticipationRank: measures the susceptibility of a user to a ‘infective’ topic

Social Network

Thread Documents

D

Random Walk With Restart

i ji

dij ijd Dw R

di id Dy C

1 (1 ) /T n S D W 11/iyq y

( 1) (1 )Tt t p S p q* 1(1 )( )T p I S q

j

Page 8: Modeling Dynamic Multi-topic Discussions in Online Forums

Topic-specific TFM (T-TFM)

8

FIFA World Cup

iPhone

Using Latent Dirichlet Allocation [Blei 2003]( | )z d

ij ijd Dw P z d R

( | )z di id Dy P z d C

Different interaction patterns according to different topics

Page 9: Modeling Dynamic Multi-topic Discussions in Online Forums

Time-sensitive T-TFM (TT-TFM)

• Forgetting Mechanism

9

past now

Time lapses

nowTime Lapse Factor

exp( ) ( | )z dij d ijd Dw t P z d R

exp( ) ( | )z di d id Dy t P z d C

Page 10: Modeling Dynamic Multi-topic Discussions in Online Forums

Evaluation: Prediction

• ParticipationRank (indicator)– The willingness of a user in participation to

discussion of a topic

10Synthesize For T-TFM and TT-TFM

?

Train Predict

Ranking

p

* *( | )F

zz Z d DP z d

p p Whether a user

joins in discussion? (post at least once )

Page 11: Modeling Dynamic Multi-topic Discussions in Online Forums

11

Outline

• Motivation and Intuitions

• Topic Flow Models

• Experimental Results

• Summary

Page 12: Modeling Dynamic Multi-topic Discussions in Online Forums

Experiments

• Dataset (www.honda-tech.com)

– Two communities: Drag Racing and Honda/Acura– Across one year, from 09/01/2008 to 08/31/2009.

12

posted more than the average number of posts per user.

Page 13: Modeling Dynamic Multi-topic Discussions in Online Forums

Results

13

• Evaluations– Divide the data into 12 continuous time windows– Generate ranking for each one month data, and

predict user posting activity in the following one week

Page 14: Modeling Dynamic Multi-topic Discussions in Online Forums

Model Selection

• = 0.3 and 0.1

• T = 30 and 40

• = 0.01

14

Page 15: Modeling Dynamic Multi-topic Discussions in Online Forums

Summary

• An intuitive model of discussions in online forums• Topic Flow Models (TFM)

– Consider both peer-influence and self-preference

– Property of latent topics

– Temporal feature: forgetting mechanism

• Evaluation on prediction of user activity • Future work:

– Utilize the web structure of online forum

– More data sets e.g.,

– Build recommendation system15

Page 16: Modeling Dynamic Multi-topic Discussions in Online Forums

Thanks!

Any Question?

16