partnering with libraries to support digital scholarship · aberdeen journal (aberdeen, scotland),...
TRANSCRIPT
Partnering with Libraries to Support Digital Scholarship
Peter Foster | Gale
“Windrush”
The Times (London, England), February 16, 1934, Issue 46680, p.2.
Aberdeen Journal (Aberdeen, Scotland), Tuesday, July 24, 1934, p .9
The Daily Telegraph (London, England), Tuesday, July 24, 1934, Issue 24701, p.11
Daily Mail (London, England), Saturday, October 21, 1939, Issue 13570, p.[1]
Aberdeen Journal (Aberdeen, Scotland), Tuesday, August 10, 1943, Issue 27647, p.1
Dundee Courier (Dundee, Scotland), Saturday, April 1, 1944, Issue 28339, p.3
Daily Mail (London, England), Friday, November 2, 1945, Issue 15442, p.[1]
The Times (London, England), February 16, 1934, Issue 46680, p.2.
Gloucester Citizen (Gloucester, England), Thursday, June 10, 1948, Vol. 74, Issue 34, p.1
Daily Mail (London, England), Tuesday, June 22, 1948, Issue 16257, p.3.
Derby Daily Telegraph (Derby, England), Thursday, June 10, 1948, Vol. CXXI, Issue 20839, p.8
The Daily Telegraph (London, England), Wednesday, June 23, 1948, Issue 29015, p.5
Picture Post (London, England), Saturday, July 2, 1949, Vol. 44, Issue 1, p.23
The Times Literary Supplement (London, England), Friday, October 2, 1959, Issue 3005, p.555The Listener (London, England), Thursday, October 30, 1969, Vol. 82, Issue 2118, p.[585]The Daily Telegraph (London, England), Friday, June 17, 1988, Issue 41361, p.21
The Economist (London, England), Saturday, June 20, 1998, Vol. 347, Issue 8073, p.42 Daily Mail (London, England), Thursday, August 24, 2000, Issue 32411, p.12
9
Gale Primary Sources
More than 200 million
pages of documents
across 500 years+ of
history, from the early
modern period to
present-day, sourced
from over 200 libraries
and archives.
Users can download the
OCR text of any document,
which can be analyzed
with open source tools like
Voyant, etc.
11
BUT creation of content sets this
way is time consuming and often
doesn’t meet the need for
analyzing large amounts of text or
data…which is a common need in
Digital Humanities.
www.gale.com/tdm
“Gale will make available content from its Gale Digital Collections to academic researchers for data mining and text mining purposes.
Gale will be the first publisher to make content sets such as these available in this format to academics.”
Our Announcement in 2014
Projects Using Gale Data
American Fiction and the Distribution of Words
https://litlab.stanford.edu/distributions-of-words-27k-novels/
David McClureStanford Literary Lab
SPO & Tudor Networks of Power
Prof Ruth AhnertQueen Mary University of London
Sebastian AhnertUniversity of Cambridge
Prof Dallas LiddleAugsburg College
The Times and Technological Change
Traditional histories of 19th century newspapers either argue:
1. That they experienced a change in form in the 1820s, but thereafter remained consistent (Stanley Morison)
2. They did not change much from their 18th century predecessors (Jeremy Black)
Professor John O’BrienUniversity of Virginia
Mercurius Elencticus (1647) (London, England), July 19, 1648 - July 26, 1648; Issue 35. p8
Athenian Gazette or CasuisticalMercury (London, England), Saturday, October 1, 1692; Issue 10. p1
Burney and poetry hunting
ECCO and “Commonplaces”
http://dh2016.adho.org/abstracts/343
Dr Glen Roe (and team)Australian National University
http://commonplacecultures.uchicago.edu/
Dr Hazel Wilkinson,University of Cambridge
ECCO & Printing in the 18th Century
‘Knowing who printed a book can give insight into the networks of “who knew who” during the 18th century, how information was exchanged, and an understanding of the literary culture of the time.’
The 90-9-1 rule for participation in an online community http://www.nngroup.com/articles/participation-inequality/
Slide courtesy of the COMHIS Collective:
https://comhis.github.io/
http://j.mp/comhis-bsecs
Challenge #1: Access to clean data
The 90-9-1 rule for participation in an online community http://www.nngroup.com/articles/participation-inequality/
Challenge #2: Most researchers lack the technical skills
Challenge #3: Multiple DH environments
Introducing Gale’s Digital Scholar Lab
Cloud-based Digital Humanities platform
for experts and “the 90%”.
Contains datasets extracted from Gale
Primary Sources collections, all optimized
for analysis
Will grow to include 3rd party content, not
just Gale.
Developed in association with DH scholars and technical experts
Create Custom Content Sets
…and build unique research projects
There will be a growing suite of
analysis tools that users can run
on customized content sets they
have created. Each tool can be
customized by the user.
…and later in 2018 users will be
able to create their own tools
using various coding languages
(e.g. Python, R)
Gale’s DS Lab is for “the 90%” too
Thank You!