community interaction and conflict on the websrijan/pubs/reddit-conflict-slides.pdf · •little is...

25
Community Interaction and Conflict on the Web Srijan Kumar @srijankr William Hamilton @williamleif Jure Leskovec @jure Dan Jurafsky @jurafsky 1

Upload: others

Post on 25-Jun-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Community Interaction and Conflict on the Web

Srijan Kumar@srijankr

William Hamilton@williamleif

Jure Leskovec@jure

Dan Jurafsky@jurafsky 1

Page 2: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Why study inter-community interactions?

•Users form communities•Communities interact with one another• Little is known about how community interaction

occurs• So, we study inter-community interactions between

20,000+ communities on Reddit

2Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 3: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

3

“Come look at all the brainwashed idiots in Documentaries….”

Members go and post negative/hateful

comments

• Can disrupt communities• Can decrease long-term

engagement

Conspiracy Documentaries

Conflict across communities

Understanding how communities fight and how to prevent conflicts is important to foster a healthy online environment.

Page 4: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Reddit Dataset We use public Reddit data for this study• 40 months (2014—2017)• 1.8+ billion comments• 100+ million users• 20,000+ communities

4

But…There are no labels of community interactions and conflicts.How to define these?

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 5: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

“Come look at all the brainwashed idiots in Documentaries….”

Defining inter-community interactions

5

Members of sourcemay be mobilized to comment in the linked target post

Source community links to a post in target community

Inter-community interaction happens if a hyperlink mobilizes usersfrom the source to the target community

Attackers

Defenders

Page 6: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Defining conflicts using crowdsourcing

6

How does the left (source) post refer to the right (target) post?A.With neutral or no opinionB.With a negative opinion

Source post Target post

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 7: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Defining conflicts using crowdsourcing

7

Conflicts = Interactions initiated by negative-sentiment source post

• Turkers labeled 1000 pairs of source-to-target posts• We developed text classifier (0.80 AUC) to label remaining pairs• We define conflicts as interactions that are initiated with negative

sentiment.• Identified 1800 conflicts

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 8: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Our model: Three phases of conflict

8Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 9: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Which communities engage in conflicts?

9

Question: Are all communities prone to conflict, or is it restricted to a few bad apples?

Community nodesUser nodes

Posting edges

Our solution:• Create who-posts-where network• Generate embedding vector for

each user and community, similar to word2vec

• Vectors learned to maximize probability of a user posting in a community

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 10: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

10

Dot = community

Blue dot = community that initiates fewer conflicts

Red dot = community that initiates more conflict

1% of communities start 74% of all conflicts

Conflicts are concentrated in some areas

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 11: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Who do communities attack?

11

Question: Do communities attack other random communities, or is there a relation between the source and target community?

Highly similar communities attack each other

Our solution:• TF-IDF similarity between communities:• Create word vector for each community from its posts• Calculate cosine similarity between source and target community

• TF-IDF similarity is 1.5x expected value

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 12: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Phases of conflict

12

• Initiated by handful of communities

• Attack similar, but opposing, communities

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 13: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Attacker-Defender Interactions

13

Hypothesis 2: attackers and defenders primarily reply to users of the same type

Hypothesis 1: attackers and defenders reply significantly to one another

Attacker node Defender nodeLegend:Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 14: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Attacker-PageRank and Defender-PageRank

14

A-PageRank: Run PageRank but restrict the teleport set to just attackers.• Quantifies node centrality with respect to all attackers.

Attacker node

Defender node

Legend:

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 15: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

15

Echo-chambers form during conflicts

Attackers

0.20

0.15

0.10

0.05

0.00

Average A-PageRank

Score

Defenders

Attackers have higher average A-PageRank scores than defenders.

So, attackers are closer to other attackers.

Defenders have higher average D-PageRank scores than attackers.

So, defenders are closer to other defenders.

Attackers

0.20

0.15

0.10

0.05

0.00

Average D-PageRank

Score

Defenders

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 16: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Echo-chambers form during conflicts

16

Hypothesis 2: attackers and defenders primarily reply to other users of the same type

Hypothesis 1: attackers and defenders reply significantly to one another

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 17: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

17

Ganging-up effect during conflicts• Some defenders are very close to attackers:

10x average A-PageRank score• Most defenders are unreachable: zero A-

PageRank score

• Linguistic analysis shows attackers swear more in replies to defenders

Attackers “gang-up” on some defenders during conflicts

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 18: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Phases of conflict

18

• Initiated by handful of communities

• Attack similar, but opposing, communities

• Conflicts create echo-chambers

• Attackers gang-up on defenders

Page 19: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

19

What prevents colonization?

Do conflicts change future engagement?If activity increases, then conflicts make users more loyal and active

ORIf activity decreases, then conflicts drive users away

Attackers “colonize” the target community and defenders leave.

Future activity - previous activityin target community

1%

0.5%

0%

-0.5%

Moreactive

Lessactive

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 20: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

20

How to defend against attacks?

(Defenders become less active) (Defenders become more active)Successful-attack

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 21: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

21

Successful vs unsuccessful defense propertiesWhen defense is successful:• Defenders reply directly more to attackers• Attackers and defenders are closer to each other in the reply network• Defenders tend to use more `anger’ words

Direct and angry replies to attackers (“fighting-back”) marks a successful defense.

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 22: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Phases of conflict

22

• Initiated by handful of communities

• Attack similar, but opposing, communities

• Conflicts create echo-chambers

• Attackers gang-up on defenders

• Conflicts lead to colonization

• Successful defense: direct heated engagement with attackers

Page 23: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Can we predict conflicts before they happen?

23

Mobilization of attackers

Task: Given a post from source to target community, will it lead to a conflict?

No mobilization

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 24: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Predicting conflicts• We create a “socially-primed” LSTM

structure.• Takes user, community, and word

embeddings as input for the prediction.

• A strong feature baseline gets 0.67 AUC• Socially-primed LSTM gets 0.72 AUC• Combination of both gets 0.76 AUC

24

target

Community Interaction and Conflict on the Web. Kumar et al. WWW 2018.

Page 25: Community Interaction and Conflict on the Websrijan/pubs/reddit-conflict-slides.pdf · •Little is known about how community interaction occurs •So, we study inter-community interactions

Community Interaction and Conflict on the Web

25

• Initiated by handful of communities

• Attack similar, but opposing, communities

• Conflicts create echo-chambers

• Attackers gang-up on defenders

• Conflicts lead to colonization

• Successful defense by direct heated engagement with attackers

• Conflicts predicted with 0.76 AUC• More results on positive inter-community interactions in the paper

Data and code: snap.stanford.edu/conflictCommunity Interaction and Conflict on the Web. Kumar et al. WWW 2018.