citation metrics and the stories they tell

53
International Symposium on the Science of Science Library of Congress March 21 st 2016 Citation networks and the stories they tell Carl T. Bergstrom University of Washington

Upload: carlbergstrom

Post on 11-Jan-2017

1.283 views

Category:

Science


0 download

TRANSCRIPT

Page 1: Citation metrics and the stories they tell

International Symposium on the Science of Science���Library of Congress���

March 21st 2016

Citation networks and the stories they tell

Carl T. BergstromUniversity of Washington

Page 2: Citation metrics and the stories they tell

Jevin West Martin Rosvall

SciSIP

Jennifer Jacquet Jacob Foster Shelley CorrellMolly King Daril Vilhena Ted BergstromJames Evans Ben Althouse Moritz StefanerDaniel Edler Ian Wesley-Smith Rodney GarrettMichael Jensen Morton Bech Ralph DandreaGregg Gordon

Page 3: Citation metrics and the stories they tell

Eigenfactor.org/projects/well-­‐formed/  

Citation is a core institution of ���academic science, tracing the flow of

ideas over time.

Page 4: Citation metrics and the stories they tell

The sum of all citations create a vast network of more than a billion citations among more than 100 million papers

Eigenfactor.org/projects/well-­‐formed/  

Page 5: Citation metrics and the stories they tell

Every one of those citations represents a careful decision by domain experts.

Eigenfactor.org/projects/well-­‐formed/  

Page 6: Citation metrics and the stories they tell

The citation network of science holds a wealth of information about how science

works, and how it can work better.

Eigenfactor.org/projects/well-­‐formed/  

Page 7: Citation metrics and the stories they tell

How can we extract ���this information?

Eigenfactor.org/projects/well-­‐formed/  

Page 8: Citation metrics and the stories they tell
Page 9: Citation metrics and the stories they tell

The first step is to assemble the data. ������

We have compiled citation networks ���from many sources:

Page 10: Citation metrics and the stories they tell

���How important is any particular paper, or any particular journal,���

in the network?

Mapequation.org  

Page 11: Citation metrics and the stories they tell

Count incoming links���(Impact Factor)

Mapequation.org  

Page 12: Citation metrics and the stories they tell

Count incoming links���(Impact Factor)

Use the whole network(Eigenfactor)

Mapequation.org  

Page 13: Citation metrics and the stories they tell

Important websites���are linked to by���

important websites.

Page 14: Citation metrics and the stories they tell

Important papers���are cited by ���important papers

Important journals���are cited by ���

important journals

Page 15: Citation metrics and the stories they tell

Eigenfactor algorithmP = α H + (1 − α ) a.eT

Matrix representing therandom walk over citations Probability of

not teleportingCross-citation Matrixdictating the structureof the citation network

Probability of teleportingto completely new journalweighted by the numberof articles in that journal

EF =100 Hπ[Hπ ]ii∑

Leading eigenvectorof the random walkmatrix P.

Normalization

Bergstrom (2007); West et al (2010)

Page 16: Citation metrics and the stories they tell

Applet coding: Daniel EdlerMapequation.org  

Page 17: Citation metrics and the stories they tell

The Eigenfactor Algorithm

Page 18: Citation metrics and the stories they tell
Page 19: Citation metrics and the stories they tell
Page 20: Citation metrics and the stories they tell
Page 21: Citation metrics and the stories they tell

Study, and publicize, the cost-effectiveness of journal subscriptions

Eigenfactor.org  Bergstrom and Bergstrom 2004 PNAS

Page 22: Citation metrics and the stories they tell

Study, and publicize, the cost-effectiveness of open access publishing

Eigenfactor.org  

Page 23: Citation metrics and the stories they tell

Ranking authors

“Author-level Eigenfactor performs best in identifying high-impact authors”���

- Dunaiski et al. ��� J. Informetrics May 2016

West et al 2013 JASIST

Page 24: Citation metrics and the stories they tell

Ranking articles: ���The Article-Level Eigenfactor (ALEF) Algorithm

Time

Olderpapers

Newer papers

Wesley-Smith et al 2016; West et al in prep.

Page 25: Citation metrics and the stories they tell
Page 26: Citation metrics and the stories they tell

Image Courtesy of Mark Newman

Small networks reveal structuredirectly.

Dating network in a Michigan high school

Page 27: Citation metrics and the stories they tell

Large networks could use some assistance.

Yeast protein interaction network

Ho et al. (2002) Nature

Page 28: Citation metrics and the stories they tell

good maps simplify ���and highlight��� relevant structures

Boston MTAGoogle maps

Page 29: Citation metrics and the stories they tell

Network community detection ������

We want a modular description of a weighted, directed network: ���

���Most flow on the network occurs within, ���

not between, local modules.

Page 30: Citation metrics and the stories they tell

DataCompressing Finding patterns

If we can find a good code for describing flow on a network, we will have solved the dual problem of finding the important structures with respect to that flow.

Page 31: Citation metrics and the stories they tell

The map equation tells us the description length for a particular modular structure

The map equation

Page 32: Citation metrics and the stories they tell

We conclude that the infomap method by Rosvall and Bergstrom is the best performing… ���

Among other things, the method can be applied to weighted and directed graphs as well, with excellent performances, so it has a large spectrum of potential applications.”

- Lancichinetti and Fortunato (2009)

Page 33: Citation metrics and the stories they tell

Rosvall and Bergstrom (2008) PNAS

Page 34: Citation metrics and the stories they tell

Althouse et al (2009) JASIST

Page 35: Citation metrics and the stories they tell

“coverage” Impact factor

Althouse et al (2009) JASIST

Page 36: Citation metrics and the stories they tell

1995 2004

Page 37: Citation metrics and the stories they tell

1. Determine which structures are statistically significant.���

2. Visualize changes in those structures.

Page 38: Citation metrics and the stories they tell

Rosvall and Bergstrom (2010) ���PLoS One

The emergence of neuroscience

Page 39: Citation metrics and the stories they tell

The map equation tells us the description length for a particular hierarchical structure

The hierarchical map equation

Page 40: Citation metrics and the stories they tell

Rosvall and Bergstrom (2011) ���PLoS One

Page 41: Citation metrics and the stories they tell

Rosvall and Bergstrom (2011) PLoS One

Revealing hierarchical structure

Page 42: Citation metrics and the stories they tell

Rosvall and Bergstrom (2011) PLoS One

Revealing hierarchical structure

Page 43: Citation metrics and the stories they tell

Rosvall and Bergstrom (2011) PLoS One

Revealing hierarchical structure

Page 44: Citation metrics and the stories they tell

Rosvall and Bergstrom (2011) PLoS One

Revealing hierarchical structure

Page 45: Citation metrics and the stories they tell

Rosvall and Bergstrom (2011) ���PLoS One

Revealing hierarchical structure

Page 46: Citation metrics and the stories they tell

Using hierarchical structure for scholarly recommendation

West et al (2016) In press

Page 47: Citation metrics and the stories they tell

http://babel.eigenfactor.org  

Using hierarchical structure for scholarly recommendation

Page 48: Citation metrics and the stories they tell

1920 1940 1960 1980 2000

0.10

0.15

0.20

0.25

0.30

perc

enta

ge o

f wom

en

What gender disparities still exist across academia?

first author

West et al. 2013 PLoS One

Page 49: Citation metrics and the stories they tell

1920 1940 1960 1980 2000

0.10

0.15

0.20

0.25

0.30

perc

enta

ge o

f wom

en

What gender disparities remain ���in scholarly publishing?

last author

first author

West et al. 2013 PLoS One

Page 50: Citation metrics and the stories they tell

Eigenfactor.org  

Page 51: Citation metrics and the stories they tell

Self-citation rate by gender

●●●

●●

●●●

●●

●●

●●●

●●●●

●●●

●●●●

●●●●●●●●●

●●●

●●●●●●●●●●●●●●●

●●

●●

●●●

●●●

●●●●●●

●●●●●

●●●●●

●●●●●●

●●●●●●●

●●●●●●●●●●●●●

●●●●●●

●●●

●●

Women's and men's rates of self-citation

���� ���� ���� ���� ���� ��������

���

���

���

���

���

���

����-����� / ����������

Based on > 3 million papers from JSTOR King et al. in prep.

Page 52: Citation metrics and the stories they tell

Rates of rates of self-citation do make a difference to impact metrics, particularly the h-index.

– Cameron et al 2016 Bioscience

Self-citations per authorship

”“

King et al. in prep.

Page 53: Citation metrics and the stories they tell