using wikipedia to make the digitized newspapers …...using wikipedia to make the digitized...
TRANSCRIPT
![Page 1: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/1.jpg)
Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable
Preliminary Findings Donald Taylor
Wikipedian-in-Residence
Maryland Historic Newspapers Project
University of Maryland Libraries
Chronicling America Wikipedia Edit-a-Thon, 18 August 2014 @donaldtaylorii
![Page 2: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/2.jpg)
Not your usual Wikipedian-in-Residence
![Page 3: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/3.jpg)
Who is using Chronicling America?
![Page 4: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/4.jpg)
![Page 5: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/5.jpg)
Citation Distribution
Number of citations Number of articles
1 1248
2 263
3-5 167
6-10 57
11-20 21
21-50 17
51-76 4
Total 1,777
![Page 6: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/6.jpg)
How many citations do editors make
Number of citations Editors
1 344
2 104
3-5 75
6-10 32
11-20 16
21-50 10
51-100 5
100-130 3
Total 589
![Page 7: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/7.jpg)
For the last year the rate of Chronicling America citation on Wikipedia has been increasing by about 36 per quarter. Over the last five quarters the growth rate of Chronicling America citation has increased by 150 percent.
![Page 8: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/8.jpg)
What are they using Chronicling America for?
![Page 9: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/9.jpg)
![Page 10: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/10.jpg)
WP:NOR
![Page 11: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/11.jpg)
In inclusionism vs. deletionism
newspapers = notability
![Page 12: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/12.jpg)
USS Kearsarge (BB-5) (keel laid: 1896, scrapped: 1955) https://en.wikipedia.org/wiki/USS_Kearsarge_(BB-5) Inkbug 19 Chronicling America citations of 62 total
![Page 13: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/13.jpg)
Belle Gunness (November 11, 1859 – April 28, 1908) https://en.wikipedia.org/wiki/Belle_Gunness 74.83.126.88 27 Chronicling America citations of 44 total
![Page 14: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/14.jpg)
McCants Stewart (July 11, 1877 – April 14, 1919) 16 Chronicling America citations of 25 total
Adele Ritchie (December 21, 1874 – April 24, 1930) 12 Chronicling America citations of 31 total
![Page 15: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/15.jpg)
Congress Mine – 48 Chronicling America citations from seven different newspapers
![Page 16: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/16.jpg)
What is to be done?
![Page 17: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/17.jpg)
The Human Factor
![Page 18: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/18.jpg)
50 percent of NDNP awardee respondents have not used
Wikipedia because it simply did not occur to them.
![Page 19: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/19.jpg)
Other reasons:
* Concerns about the reputation, integrity, or reliability of Wikipedia
* Concerns over conflict of interest in editing Wikipedia
* Our institution does not have the project resources
* Lack of relevant expertise
![Page 20: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/20.jpg)
Need to generate institutional buy-in, capability, enthusiasm
![Page 21: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/21.jpg)
Wikipedia needs to be in your strategic plan
![Page 22: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/22.jpg)
Tools
![Page 23: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/23.jpg)
Chronicling America is Big Data * 8 million pages
* If you read War and Peace in a month, it would talk you 600 years to read Chronicling America
* Viral Texts output can’t even be opened on a PC
* Tools are the only means
* Viral Texts found thousands of previously unknown articles
![Page 24: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/24.jpg)
Viral Texts
Ryan Cordell, Elizabeth Maddock Dillon and David Smith
Northeastern University's NULab for Texts, Maps, and Networks
![Page 25: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/25.jpg)
![Page 26: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/26.jpg)
Recurrence is a signal
![Page 27: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/27.jpg)
Named Entity Recognition
![Page 28: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/28.jpg)
![Page 29: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/29.jpg)
SuggestBot
![Page 30: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/30.jpg)
![Page 31: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/31.jpg)
You’re not going to build one thing and have it work –
it’s iterative.
![Page 32: Using Wikipedia to Make the Digitized Newspapers …...Using Wikipedia to Make the Digitized Newspapers of the National Digital Newspaper Program More Discoverable Preliminary Findings](https://reader030.vdocuments.us/reader030/viewer/2022041014/5ec57fc49e23b9589076e2c5/html5/thumbnails/32.jpg)
1. What doesn’t work about Wikipedia for you?
2. What doesn’t work about Chronicling America for you?
3. What ideas do you have that could make them better?
@donaldtaylorii [email protected]