semantic web technologies for digital libraries
DESCRIPTION
Semantic Web Technologies For Digital LibrariesTRANSCRIPT
SEMANTIC WEB TECHNOLOGIES FOR DIGITAL LIBRARIES
By Nikesh .N
International School of Information Management, Mysore
PRESENTATION OVERVIEW
Digital Library Library standards & Tools Semantic Digital Library :Expectations Semantic Web Technologies Bibliographic Ontology (MarcOnt) Case Study ( Jerome DL) Conclusions
DIGITAL LIBRARY
Information System which deals with collection, organization, storage and retrieval of digital documents
WHY LIBRARY STANDARDS ?WHAT ARE TOOLS & TECHNIQUES ?
Bibliographic Descriptions Standards Library Classification Scheme Subject Headings Subject Indexing Retrieval Techniques
BIBLIOGRAPHIC DESCRIPTIONS
A bibliographic description format is formal definition of syntax, meaning and rules of describing resources collected by library or other similar entity. Bibliographic description formats are descendants of sets of rules for describing library resources used back in the XIXth century and are still used today.
Eg: ISBD , AACR, Bibtex, MARC 21,Doubline Core
LIBRARY CLASSIFICATION SCHEMES
Library classification systems help categorize library resources according to domain of interest of their content.
Eg: Dewey’s Decimal Classification
Universal Decimal Classification
Colon classification
SUBJECT HEADINGSFOR VOCABULARY CONTROL
Library of Congress subject Headings Sears list of subject headings Mesh ( Medical Subject Headings)
SUBJECT INDEXING
Pre-coordinate Indexing (Coordination by Indexer) PRESIS, POPSI, KWIC, Chain Indexing
Post coordinate Indexing (Coordination by user) Keyword indexing
INFORMATION SEARCH AND RETRIEVAL
Bibliographic field based search Full text search Boolean Search Proximity Search Truncation Search etc.
SoorajCreation-Creator/Role
ISIM LibraryCurrent Location-Repository Name
irises, nature, soilSubject-Matter
2009Creation-Date
IrisesTitle
paintingsObject/Work type
PaintingsClassification • Full-text search– “Paintings” AND
“Sooraj” AND “flowers” no result
• Semantic query– if the knowledge that
“irises” are “flowers” is modeled in an ontology (e.g. subclass-hierarchy)
– we can query for all “Paintings” by “Sooraj” with subject “flowers” and retrieve also the picture with subject “irises”
WHAT LACKS- SEMANTICS ( EXAMPLE)
SEMANTIC DIGITAL LIBRARY- EXPECTATIONS
A digital library system which is capable of Integrating information based on different metadata, e.g.: resources, user profiles, bookmarks, taxonomies
Providing interoperability with other systems (not only digital libraries)
Delivering more robust, user friendly search and browsing interfaces empowered by semantics
SEMANTIC WEB TECHNOLOGIES
Semantic Web is becoming reality by applications that support it and are based on it Web Ontology Languages : OWL, RDFS Ontology editors : Protégé, Onto Edit etc. RDF, RDF Schema RDF Storages: Sesame, Jena, YARS Reasoners: KAON, Racer Annotation tools ( Annotea, Onto Annotizer) Topic Maps Thesauri & Controlled Vocabularies
( eg: Wordnet)
–
SEMANTIC WEB TECHNOLOGIES FOR DIGITAL LIBRARIES?
Metadata is the key concept Many digital libraries do have metadata in place Task is to make them available in a machine
understandable format How to uplift Legacy Metadata to Semantic Level RDF: Is a framework to model any kind of metadata It delivers certain level of technical
interoperability
BIBLIOGRAPHIC ONTOLOGIES Effort to Capture the Semantics of Metadata
MarcOnt Initiatives MarcOnt Initiative has grown from the experiences of
developing and evaluating the first semantic digital library, JeromeDL
Developed as a part of Master's thesis of Sebastian RyszardKruk at the Gdańsk University of Technology (GUT), Poland
MarcOnt Initiative goals: Create a framework for collaborative ontology improvement Incorporate existing metadata and Library classification
scheme Offer tools for data mediation between different data
formats
MARCONT ONTOLOGY AND PORTAL
MarcOnt Ontology: Central point of MarcOnt Initiative Translation and mediation format Continuos collaborative ontology improvement Knowledge from the domain experts
MarcOnt Portal (source of knowledge): Suggestions Annotations Versioning Ontology editor
CASE STUDY-JEROMEDL
Joint effort of DERI, National University of Ireland,
Galway and Gdansk University of Technology (GUT)
Distributed under BSD Open Source license
JEROMEDL - MOTIVATIONS
Support for different kinds of bibliographic medatata,
like: DublinCore, BibTeX and MARC21 at the same
time. Making use of existing rich sources of bibliographic
descriptions (like MARC21) created by human.
Supporting users and communities: users have control over their profile information;
community-aware profiles are integrated with bibliographic
descriptions
support for community generated knowledge
ONTOLOGIES IN JEROMEDL
Structure (system administrators): JeromeDL structure ontology
Bibliographic and legacy descriptions ( domain experts MarcOnt bibliographic ontology Extensible MarcOnt suggestions
Communities (normal users, expert users with restricted vocabulary FOAF and FOAFRealm identity management ontology
STRUCTURE ONTOLOGY IN JEROMEDL
BIBLIOGRAPHIC (MARCONT) ONTOLOGY IN JEROMEDL
COMMUNITY-AWARE (FOAFREALM) ONTOLOGY
MARCONT MEDIATION SERVICES FOR LEGACY METADATA
MarcOnt OntologyMarcOnt RDF
MARC21 RDF
MARC21 XML
MARC21
Dublin Core RDF
Dublin Core XML
Dublin Core
New format RDF
New format XML
New format
Format translation
RDF Translator
Format co-operation
MarcOnt Mediation Services
SEMANTIC INTEROPERABILITY IN GEROMDL
Providing semantic annotations during uploading process: open module (JOnto) for handling any
taxonomies keywords based on:
WordNet free tagging
defining structure of resources in the JeromeDL ontology
Lifting legacy metadata to MarcOnt ontology Community maintained annotations
social semantic collaborative filtering semantic descriptions based on the FOAF
metadata
FOAF - DESCRIBING SOCIAL
NETWORKS
FOAF - Stands for Friend-of-a-Friend
Defines properties for a person
Does not only have to contain one person per file
Can build a network of people with foaf:knows links
FOAF can be easily extended to meet requirements,
as in the case of FOAFRealm for identity
management…
JEROMEDL – SEMANTIC INFORMATION IN
USE
Searching: Keyword-based search with semantic query expansion Semantic search:
Direct RDF querying Natural language templates
Faceted Navigation:creators, types, keywords, Topics etc.
Sharing: Social Semantic Collaborative Filtering Semantically Interlinked Online Communities
Heterogeneous communication: OAI-PMH
REFERENCES Semantic Web – W3C, http://www.w3.org/2001/sw/ The Semantic Web Community Portal, http://semanticweb.org Dublin Core Metadata Initiative (DCMI) – http://www.dublincore.org/ Jerome Digital Library Homepage – http://www.jeromedl.org MarcOnt Initiative Portal – http://www.marcont.org Marcin Synak: MarcOnt Ontology – Semantic MARC21 Description for
L2L & L2C Communication, Masters Thesis, Faculty of Electronics, telecommunication and Informatics, National University of Ireland
Sebastian R. Kruk and Marcin Synak: Semantic Digital Libraries: BANNF conference proceedings, 2007
THANK YOU
Questions Please ?