understanding the patterns of health information ...raywang/pub/amia17-zika_slides.pdf · train...

24
Xinning Gui 1 and Yue Wang 2 1. University of California, Irvine 2. University of Michigan Twitter: #AMIA2017 Understanding the Patterns of Health Information Dissemination on Social Media during the Zika Outbreak Improving Population Health Using Novel Data Sources S47

Upload: others

Post on 15-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Xinning Gui1 and Yue Wang2

1. University of California, Irvine2. University of MichiganTwitter: #AMIA2017

Understanding the Patterns of Health Information Dissemination on Social Media during the Zika OutbreakImproving Population Health Using Novel Data SourcesS47

Page 2: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

DisclosureWe and our spouses/partners have no relevant relationships with commercial interests to disclose.

2AMIA 2017 | amia.org

Page 3: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Learning ObjectivesAfter participating in this session the learner should be better able to:

• Utilize a mixed methods approach, in addition to machine learning, to monitor and assess the dissemination of health information (e.g. that related to Zika) on social media

• Formulate effective strategies for communicating public health information on social media• Identify opportunities and challenges that social media present to risk communication

3AMIA 2017 | amia.org

Page 4: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Motivation

4AMIA 2017 | amia.org Source: https://npin.cdc.gov/training/know-social-media-public-health

Page 5: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

• To analyze the risk communication on Twitter during 2016 Zika outbreak

• To understand the information created bygeneral public and public health authorities,and different information dissemination patterns

• To provide implications for effective riskcommunication strategies on social media

5AMIA 2017 | amia.org

Source: http://www.newsweek.com/zika-testing-fda-blood-donated-united-states-microcephaly-florida-brazil-493985

Research Goals

Page 6: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Collecting Zika-related tweets

6AMIA 2017 | amia.org

in 2016“zika”

1,495,480 tweets Top languagesEnglish 54%

Spanish 27%

Portuguese 12%

10% of

Page 7: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Zika tweets & Google trend

7AMIA 2017 | amia.org

Cases reported in 20 countries in the Americas

Brazil 2016 Summer Olympics

Page 8: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Worldwide Zika discussion on Twitter

8AMIA 2017 | amia.org

mid January, 2016 mid February, 2016

WHO declared emergency on Feb 1st, 2016

Page 9: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Content analysis roadmap

9AMIA 2017 | amia.org

1,495,480 tweets containing “zika”

54% inEnglish

popular tweets(3,581)

authoritative tweets (1,227)

Machine learning-assisted content analysis

4,808 tweets

Page 10: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Machine Learning-Assisted Content Analysis

10AMIA 2017 | amia.org

CodebookOpen coding;Axial coding

Train machine learning model(multiclass text classification)

Use learned model to assign codes to large data set

4,808 tweets (3,581 popular + 1,227 authoritative)

285 sampled tweets(200 popular + 85 authoritative)

Coded tweets

Page 11: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

11AMIA 2017 | amia.org

Machine Learning-Assisted Content Analysis

CodebookOpen coding;Axial coding

Train machine learning model(multiclass text classification)

4,808 tweets (3,581 popular + 1,227 authoritative)

285 sampled tweets(200 popular + 85 authoritative)

Coded tweets

Cross-validation accuracy ~80%

Use learned model to assign codes to large data set

Scale up analysis to big data with modest amount of manual effort

Page 12: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

285 coded tweets: 8 categories

12AMIA 2017 | amia.org

ScientificKnowledge

Pharmaceuticalprogress

Page 13: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Typology of Zika-related Tweet Content

13AMIA 2017 | amia.org

Category Retweet CountAverage (std. dev.)

Like CountAverage (std. dev.)

Percentage (%)

Joke 1362 (3706) 1425 (3040) 1.6

Sports Events 336 (452) 317.0 (311) 0.4

Descriptive Statistics of Top Retweeted Tweets (# of retweets >=100)

Category Retweet CountAverage (std. dev.)

Like CountAverage (std. dev.)

Percentage (%)

Joke 0 0 0

Sports Events 0 0 0

Descriptive Statistics of Authoritative Tweets

Page 14: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

14AMIA 2017 | amia.org

Screentshots taken on Oct 31, 2017

Page 15: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

15AMIA 2017 | amia.org

Page 16: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

16AMIA 2017 | amia.org

Page 17: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Implications• Conduct more interactive and engaging communication strategies

• E.g., CDC's viral Zombie Apocalypse campaign

• Consider including more information in tweets and restating scientific messages in plain language

17AMIA 2017 | amia.org

Page 18: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Typology of Zika-related Tweet Content

18AMIA 2017 | amia.org

Category Retweet Count Average(std. dev.)

Like Count Average (std.dev.)

Percentage (%)

Pharmaceutical Progress 1192 (1509) 607 (787) 0.4

Consequence 704 (922) 450 (680) 1.5

Research Progress 419 (672) 301.4 (529) 9.5

Descriptive Statistics of Top Retweeted Tweets (# of retweets >=100)

Category Retweet Count Average(std. dev.)

Like Count Average (std.dev.)

Percentage (%)

Scientific Knowledge 43 (177) 22 (65) 47.6

Infection Update 25 (45) 87 (56) 24.8

Policy 26 (38) 19 (30) 17.0

Descriptive Statistics of Authoritative Tweets

Page 19: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Implications• Consider monitoring information dissemination trends on social media to

• gain familiarity with major conversations and debates that take place among the generalpublic

• evaluate the effectiveness of social media efforts

19AMIA 2017 | amia.org

Page 20: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Temporal Development between Popularand Authoritative Tweets

20AMIA 2017 | amia.org

0

20

40

60

80

100

120

Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

Research Progress

popular authoritative

0

50

100

150

200

250

300

Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

Infection Update

popular authoritative

Page 21: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Implication• Consider publishing more timely content related to influential news and major

events

21AMIA 2017 | amia.org

Page 22: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Conclusion• providing more engaging and straightforward health message contents that

attend to people’s information needs

• adopting more interactive communication strategies

• delivering messages timely after related news and major events

• monitoring information dissemination trends on social media and evaluatingthe effectiveness of social media efforts

22AMIA 2017 | amia.org

Page 23: Understanding the Patterns of Health Information ...raywang/pub/AMIA17-Zika_slides.pdf · Train machine learning model (multiclass text classification) 4,808 tweets (3,581 popular

Acknowledgement

23AMIA 2017 | amia.org

TheCo-authors

Yubo KouPhD, Purdue University

Tera Leigh ReynoldsMPH, MA, University of California, Irvine

Yunan ChenPhD, University of California, Irvine

Qiaozhu MeiPhD, University of Michigan

Kai ZhengPhD, University of California, Irvine

And Anonymous Reviewers