piloting linked data to connect library and archive resources to the new world of data, and staff to...
DESCRIPTION
Presentation for the CNI (Coalition for Networked Information) Fall Forum, December 2012. Describes Emory University Library’s first-hand experience in interlinking Civil War-related materials and other online resources by leveraging open linked data principles. The library has been actively evaluating linked data’s potential to replace current library processes and services (bibliographic services, finding aids, cataloging, and metadata work) as a more efficient and sustainable means, and one that could bring greater benefit to end users for research and learning. The Library’s initial focus was on workforce education and hands-on learning through real-time experiments: the Connections project was begun to prepare staff to work with linked data, a process that has culminated in a 3-month hands-on pilot to build and convert some data. The pilot introduced the concept to a wide range of staff, including subject liaisons, archivists, metadata librarians, and programmers. Emory’s “silos” of data were interlinked with other open data sources as a way to enhance user discovery and use of library materials on a very limited scale.
TRANSCRIPT
![Page 1: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/1.jpg)
Connections: Piloting linked data to connect library and archive resources to the new world of data, and staff to new skills
Laura Akerman
Metadata Librarian
Robert W. Woodruff Library
Emory University
Zheng (John) Wang
AUL, Digital Access, Resources, and IT
Hesburgh Library
Notre Dame University
CNI Fall Meeting, December 11, 2012
![Page 2: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/2.jpg)
Who has presented most frequently at CNI?
![Page 3: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/3.jpg)
Current Model: Search and Discover
![Page 4: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/4.jpg)
![Page 5: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/5.jpg)
Metadata Published as Documents
![Page 6: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/6.jpg)
Require Human to Decipher
![Page 7: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/7.jpg)
Linked Data Model: Find
![Page 8: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/8.jpg)
Semantic Graph Model
![Page 9: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/9.jpg)
Machine Understands Semantics
![Page 10: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/10.jpg)
RDF Triple
Subject → Predicate → Object
![Page 11: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/11.jpg)
RDF Triple
Laura → Lecture → Connection
![Page 12: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/12.jpg)
RDF Triples
Laura → Lecture → Connection
Other nodes: CNI, John, 2012
Other edge labels: Place, Know, Year
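A minimal sketch of the triple idea from these slides, using plain Python tuples rather than any RDF library. Only the Laura / Lecture / Connection triple is read directly off the slide; how the remaining nodes (CNI, John, 2012) attach to the Place, Know, and Year edges is an assumption made for illustration.

```python
# Each triple is a (subject, predicate, object) tuple.
# Only the first triple comes straight from the slide; the pairings in
# the other three are assumptions made for illustration.
triples = [
    ("Laura", "Lecture", "Connection"),
    ("Connection", "Place", "CNI"),    # assumed pairing
    ("Connection", "Year", "2012"),    # assumed pairing
    ("Laura", "Know", "John"),         # assumed pairing
]


def match(graph, subject=None, predicate=None, obj=None):
    """Return every triple that agrees with the given parts; None is a wildcard."""
    return [
        (s, p, o)
        for (s, p, o) in graph
        if subject in (None, s) and predicate in (None, p) and obj in (None, o)
    ]


# Everything the graph states about Laura.
about_laura = match(triples, subject="Laura")
for s, p, o in about_laura:
    print(s, p, o)
```

Because every statement has the same three-part shape, one small matching function can answer questions about any node or edge without knowing the data's layout in advance.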
![Page 13: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/13.jpg)
Reuse, Authority Control, Knowledge Linking...
Relevant to What We Do
![Page 14: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/14.jpg)
Connections Pilot
To Interlink EAD, Catalog, and Other External Resources
![Page 15: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/15.jpg)
Connections: Context
Little Time to Learn Additional New Things
![Page 16: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/16.jpg)
Hands-on learning
![Page 17: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/17.jpg)
Ingredients
• Leader/teacher/evangelist
• Learning group – open to all
  o 2 "classes" a month, 5 months.
• Pilot: 3 months
  o Brainstorming a pilot project
  o Start small
  o Team: programmer, subject liaison, metadata specialists, archivist, digital curator, fellow.
  o 1-3 hrs/week for all but leader
  o A sandbox running Linux
![Page 18: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/18.jpg)
![Page 19: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/19.jpg)
Maps
Our Own Triplestore
RDF from EAD
RDF from TEI
(and MARC)
RDF from MARCXML
Data from other archives
CW150
Other data
Timelines
User interface Navigation
DBPedia
id.loc.gov
Integrate linked data into discovery layer (catalog)?
SPARQL
Civil War
Redesign metadata creation as RDF
Faculty project
National Park Service Data
Rosters
Crowdsourcing
![Page 20: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/20.jpg)
3 months later...
![Page 21: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/21.jpg)
Sampling little bites of the meal:
Visualization – Simile Welkin
EAD (starting from ArchivesHub stylesheet)
Sesame triplestore
MARCXML (starting from LC DC stylesheet)
id.loc.gov URIs for LC subjects and names (scripted)
DBPedia/subjects (by hand)
Make some RDF metadata
![Page 22: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/22.jpg)
HTTP:OurResourceURL → HasSubject → "Mobley, Thomas"
A few of the connections...
![Page 23: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/23.jpg)
HTTP:OurResourceURL
HasSubject rdf:resource HTTP://OurPersonMobleyT1
rdfs:label "Mobley, Thomas"
![Page 24: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/24.jpg)
hasSubject
HTTP:OurPersonMobleyT1
memberOf
Confederate States of America. Army. Georgia Infantry Regiment, 48th
![Page 25: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/25.jpg)
hasSubject
HTTP:Our Mobley Tom1
memberOf
48th Georgia Infantry http://id.loc.gov/authorities/names/n99264720
hasSubject
sameAs
DBPedia:http://dbpedia.org/page/48th_Georgia_Volunteer_Infantry
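The chain of links built up over the last few slides can be sketched as a tiny graph that is walked from our resource outward. This is an illustration only, not the pilot's code: it uses plain Python instead of a triplestore, keeps the placeholder identifiers from the slides as they are written (with the person spelled as on the earlier slide), and the `reachable` helper is a hypothetical name.

```python
HAS_SUBJECT, MEMBER_OF, SAME_AS = "hasSubject", "memberOf", "sameAs"

# Placeholder identifiers as written on the slides.
RESOURCE = "HTTP:OurResourceURL"
PERSON = "HTTP:OurPersonMobleyT1"
# Real URIs quoted on the slide for the 48th Georgia Infantry.
REGIMENT = "http://id.loc.gov/authorities/names/n99264720"
DBPEDIA = "http://dbpedia.org/page/48th_Georgia_Volunteer_Infantry"

graph = {
    (RESOURCE, HAS_SUBJECT, PERSON),
    (PERSON, MEMBER_OF, REGIMENT),
    (REGIMENT, SAME_AS, DBPEDIA),
}


def reachable(graph, start):
    """Return the nodes reachable from start by following triples forward."""
    seen = []
    frontier = [start]
    while frontier:
        node = frontier.pop()
        for s, _p, o in sorted(graph):
            if s == node and o not in seen:
                seen.append(o)
                frontier.append(o)
    return seen


linked = reachable(graph, RESOURCE)
for node in linked:
    print(node)
```

Replacing the text string with a URI is what makes the walk possible: a literal such as "Mobley, Thomas" is a dead end, while a URI can itself be the subject of further statements.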
![Page 26: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/26.jpg)
Confederate miscellany collection, 1860-1865
isPartOf
heldBy
![Page 27: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/27.jpg)
We learned:
Selecting material that will “link up”, without SPARQL, is too hard!
Even when items are in a unified “discovery layer”, the types of search are limited.
Get it into triples, then find out!
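To make "get it into triples, then find out" concrete, here is a small sketch of the kind of question that becomes easy once items from different silos are expressed as triples: which items share a subject? Plain Python stands in for a SPARQL query against the triplestore. The item identifiers (`ead:item1` and so on) and the `example.org` URI are hypothetical; the regiment URI is the id.loc.gov one quoted earlier in the deck.

```python
from collections import defaultdict

REGIMENT = "http://id.loc.gov/authorities/names/n99264720"

# Hypothetical items from two silos (finding aids and catalog records).
triples = [
    ("ead:item1", "hasSubject", REGIMENT),
    ("marc:item2", "hasSubject", REGIMENT),
    ("marc:item3", "hasSubject", "http://example.org/subject/other"),
]


def items_by_shared_subject(graph):
    """Group items by subject URI and keep subjects shared by 2+ items.

    Roughly the job of a SPARQL query such as:
      SELECT ?subject ?a ?b
      WHERE { ?a :hasSubject ?subject . ?b :hasSubject ?subject .
              FILTER(?a != ?b) }
    """
    by_subject = defaultdict(list)
    for s, p, o in graph:
        if p == "hasSubject":
            by_subject[o].append(s)
    return {subj: items for subj, items in by_subject.items() if len(items) > 1}


linked_up = items_by_shared_subject(triples)
for subject, items in linked_up.items():
    print(subject, "->", ", ".join(items))
```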
![Page 28: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/28.jpg)
We learned:
• There are many ways of modeling data. (No one model to follow has emerged. We have to think about this ourselves.)
![Page 29: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/29.jpg)
ArchivesHub handles subjects:
<associatedWith>
  <!--About the Concept (Person)-->
  <skos:Concept xmlns:skos="http://www.w3.org/2004/02/skos/core#"
      rdf:about="http://duchamp.library.emory.edu/resource/id/concept/person/lcnaf/gearyjohnwhite1819-1873">
    <rdfs:label xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xml:lang="en">Geary, John White, 1819-1873.</rdfs:label>
    <skos:inScheme>
      <skos:ConceptScheme rdf:about="http://duchamp.library.emory.edu/resource/id/conceptscheme/lcnaf">
        <rdfs:label xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xml:lang="en">lcnaf</rdfs:label>
      </skos:ConceptScheme>
    </skos:inScheme>
    <foaf:focus xmlns:foaf="http://xmlns.com/foaf/0.1/">
      <!--About the Person-->
      <foaf:Person rdf:about="http://duchamp.library.emory.edu/resource/id/person/lcnaf/gearyjohnwhite1819-1873">
        <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Agent"/>
        <rdf:type rdf:resource="http://purl.org/dc/terms/Agent"/>
        <rdf:type rdf:resource="http://erlangen-crm.org/current/E21_Person"/>
        <rdfs:label xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xml:lang="en">Geary, John White, 1819-1873.</rdfs:label>
      </foaf:Person>
    </foaf:focus>
  </skos:Concept>
</associatedWith>
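The point of this model is that the concept (the heading in a vocabulary) and the person it is about get separate URIs, joined by `foaf:focus`. A minimal sketch, assuming Python's standard `xml.etree.ElementTree` and a trimmed copy of the fragment above, shows how both can be pulled out. The `rdf` namespace declaration is added here so that the fragment parses on its own; it is not part of the slide.

```python
import xml.etree.ElementTree as ET

NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "foaf": "http://xmlns.com/foaf/0.1/",
}
BASE = "http://duchamp.library.emory.edu/resource/id"

# Trimmed copy of the slide's fragment, with namespaces declared up front.
SAMPLE = f"""
<skos:Concept xmlns:skos="{NS['skos']}" xmlns:rdf="{NS['rdf']}"
    xmlns:rdfs="{NS['rdfs']}" xmlns:foaf="{NS['foaf']}"
    rdf:about="{BASE}/concept/person/lcnaf/gearyjohnwhite1819-1873">
  <rdfs:label xml:lang="en">Geary, John White, 1819-1873.</rdfs:label>
  <foaf:focus>
    <foaf:Person rdf:about="{BASE}/person/lcnaf/gearyjohnwhite1819-1873">
      <rdfs:label xml:lang="en">Geary, John White, 1819-1873.</rdfs:label>
    </foaf:Person>
  </foaf:focus>
</skos:Concept>
"""

# ElementTree spells a namespaced attribute as "{namespace}localname".
RDF_ABOUT = f"{{{NS['rdf']}}}about"

concept = ET.fromstring(SAMPLE)
person = concept.find("foaf:focus/foaf:Person", NS)

concept_uri = concept.get(RDF_ABOUT)       # URI of the heading as a concept
person_uri = person.get(RDF_ABOUT)         # URI of the person it is about
label = concept.find("rdfs:label", NS).text

print(label)
print("concept:", concept_uri)
print("person: ", person_uri)
```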
![Page 30: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/30.jpg)
LC's MARCXML to RDF/Dublin Core:
dc:subject "Geary, John White, 1819-1873."
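For contrast, the Dublin Core route ends in a plain text string rather than a URI. The sketch below is not the LC stylesheet; it is a hedged Python approximation of the same outcome, assuming a hypothetical one-field MARCXML record whose 600 field (personal name as subject) carries the name in subfield a and the dates in subfield d.

```python
import xml.etree.ElementTree as ET

MARC_NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Hypothetical record fragment containing a single 600 field.
RECORD = """
<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="600" ind1="1" ind2="0">
    <subfield code="a">Geary, John White,</subfield>
    <subfield code="d">1819-1873.</subfield>
  </datafield>
</record>
"""


def subject_literals(record_xml):
    """Join the subfields of each 600 field into one text string."""
    record = ET.fromstring(record_xml)
    literals = []
    for field in record.findall("marc:datafield[@tag='600']", MARC_NS):
        parts = [
            sf.text.strip()
            for sf in field.findall("marc:subfield", MARC_NS)
            if sf.text
        ]
        literals.append(" ".join(parts))
    return literals


subjects = subject_literals(RECORD)
for value in subjects:
    print(f'dc:subject "{value}"')
```

The result is easy to produce but is only a label: nothing else can be said about it, and it will match another record only if the strings are identical.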
![Page 31: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/31.jpg)
Simile MARC to MODS to RDF:
<modsrdf:subject rdf:resource="http://simile.mit.edu/2006/01/Entity#Geary_John_White_18191873"/>
<rdf:Description rdf:about="http://simile.mit.edu/2006/01/Entity#Geary_John_White_18191873">
  <rdf:type rdf:resource="http://simile.mit.edu/2006/01/ontologies/mods3#Person"/>
  <modsrdf:fullName>Geary, John White</modsrdf:fullName>
  <modsrdf:dates>1819-1873</modsrdf:dates>
</rdf:Description>
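Putting the three conversions side by side shows why "no one model" matters in practice. The sketch below simply compares the value each route produced for the same heading, copied from the three examples above. The dictionary keys and the `is_uri` helper are illustrative names, not part of any of the tools.

```python
# The value each conversion produced for the same subject heading.
subject_values = {
    "ArchivesHub-style EAD": (
        "http://duchamp.library.emory.edu/resource/id/"
        "concept/person/lcnaf/gearyjohnwhite1819-1873"
    ),
    "LC MARCXML to Dublin Core": "Geary, John White, 1819-1873.",
    "Simile MARC to MODS": (
        "http://simile.mit.edu/2006/01/Entity#Geary_John_White_18191873"
    ),
}


def is_uri(value):
    """Crude check: treat anything starting with http(s):// as a URI."""
    return value.startswith(("http://", "https://"))


uri_models = [name for name, value in subject_values.items() if is_uri(value)]
distinct_values = set(subject_values.values())

print("Routes that yield a URI:", uri_models)
print("Distinct values for one person:", len(distinct_values))
```

Two routes mint URIs and one yields a literal, and no two values are equal, so data converted by different routes will not link up for this person until someone maps the identifiers to each other.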
![Page 32: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/32.jpg)
We learned:
Linked data is HUGE
It’s coming at us FAST
It’s not “cooked” yet
![Page 33: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/33.jpg)
More learnings
• We learned more by doing than by "class".
• Making DBPedia mappings or links by hand is very time consuming! We need better tools.
• We need to spend a lot more time learning about OWL, and linked data modeling.
![Page 34: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/34.jpg)
Challenges
• Easily available tools are not ideal!
• Skills we needed more of: HTML5, CSS, Javascript
• Time!
• Visualization/killer app not there yet.
• Can't do things without the data! No timeline if no dates!
![Page 35: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/35.jpg)
What we got out of it
Test triplestore for training and more development
Better ideas on what to pilot next
Convinced some doubters
"Gut knowledge" about triples, SPARQL, scale
Beginning to realize how this can be so much more than a better way to provide "search"
![Page 36: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/36.jpg)
Outside our reach for now
Transform ILS system to use triple store instead of MARC
Create hub of all data our researchers might want
Make a bank of shared transformations for EAD, MARC, etc.
Shared vocabulary mappings
Social/networking aspect (e.g. Vivo, OpenSocial...) - need a culture shift?
![Page 37: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/37.jpg)
Next? Maybe...
Build user navigation?
More Civil War triples including other local institutions’ stuff?
Publishing plan?
Integrate ILS with DBPedia links?
Suite of “portal tools” for scholars?
Use linked data for crowdsourcing metadata?
More classes?
Connect with others at Emory around linked data
![Page 38: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/38.jpg)
Recommendation: Individual Institutions
• Focus on unique digital content
• Publish unique triples
• Reuse existing linked data
![Page 39: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/39.jpg)
Recommendation: Community
• Create standards or best practices
• Grow our skills
• Test and evaluate tools
• Develop tools
![Page 40: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/40.jpg)
Recommendation: Librarians’ Role?
• Interdisciplinary linking?
• Metadata librarians - Linking association and normalization
![Page 41: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/41.jpg)
Acknowledgements
Connections group sponsors: Lars Meyer, John Ellinger
Connections Pilot team: Laura Akerman (leader), Tim Bryson, Kim Durante, Kyle Fenton, Bernardo Gomez, Elizabeth Roke, John Wang
Fellows who joined us: Jong Hwan Lee, Bethany Nash
Our website: https://scholarblogs.emory.edu/connections/
Laura Akerman, [email protected]
John Wang, [email protected]
![Page 42: Piloting Linked Data to Connect Library and Archive Resources to the New World of Data, and Staff to New Skills](https://reader036.vdocuments.us/reader036/viewer/2022070316/555b32e2d8b42ae82e8b4fe7/html5/thumbnails/42.jpg)
Thanks
Q&A