carl lagoze digital library service registry workshop services in a scholarly communication...
DESCRIPTION
Carl Lagoze Relevant Technology Trends Service-oriented architecture Web 2.0 Semantic Web SOA Web 2.0 RDF OWL-S OWLTRANSCRIPT
Carl Lagoze
Digital Library Service Registry WorkshopServices in a Scholarly Communication
Framework
Carl Lagoze
Joint work with….
• NSF-funded Pathways Projecto Jeroen Bekaerto Xiaoming Liuo Sandy Payetteo Herbert Van de Sompelo Simeon Warner
• Seed for upcoming Augmenting Interoperability across Scholarly Repositories Meeting sponsored by Andrew W. Mellon Foundation, the Coalition for Networked Information, the Digital Library Federation, JISC and Microsoft.
Carl Lagoze
Relevant Technology Trends
• Service-oriented architecture
• Web 2.0
• Semantic Web
SOA
Web 2.0
RDF
OWL-S
OWL
Carl Lagoze
Service-oriented architectures (SOA)
• Characteristics of serviceso Modular, atomico Well-defined interfaceso Loosely coupledo Like building blockso Standards for invoking
operations (e.g., SOAP/REST, XML)
• Benefits o Flexibilityo Enable creation of higher-level serviceso Enable customized end-user applicationso Re-use services in different contextso Evolution: create new services as neededo Orchestrate services to fulfill a process
monolithic application
Carl Lagoze
Implications of Web 2.0
• Key themeso Services (not packaged apps)o Architecture of participationo Remix/transform data sourceso Harness collective intelligence
• Emergent Behavioro Upcoming generations of scholars will have a completely
different paradigm and expectations regarding technologyo Collaborative classification (e.g., flickr)o Power of collective intelligence (amazon)o Alternative trust models (reputation – ebay; open-source)
Carl Lagoze
1. Creation and publication of new forms of “information units”
2. Services to better enable the processes of research and scholarship
3. Knowledge environments that captures semantic and factual relationships among information units
4. Promote information re-use and contextualization
5. Facilitate collaborative activity and capture information that is created as a byproduct of it
e-Scholarship in the New Context
Carl Lagoze
Services in e-Scholarship
• Decompose and distribute traditional steps in scholarly publishing value chain1
o Registration – claim precedence for a scholarly finding. o Certification - establish validity of scholarly claim o Awareness - discover and access claims and findingso Archiving - preserves the scholarly record over timeo Rewarding - based on metrics derived from that system
• Add new services to the mixo Workflow o Collaborative functions (e.g., annotation, re-use) o Data mining and analysiso Preservation monitoring and migration
1. Roosendaal and Geurts 1997
Carl Lagoze
Service pathways (decomposed and distributed)
Carl Lagoze
The repository model
• Repositories (institutional, learned society, etc.) are about facilitating the (re)use of materials in many contexts
• Repositories are the starting point of value chains
Carl Lagoze
Value chains starting in repositories
Example 1: Overlay journal
• Editor of overlay journal selects articles from 3 repositories (arXiv, DSpace, Fedora) for inclusion in the next journal issue
Carl Lagoze
Value chains starting in repositories
Example 2: Data-aware scholarship
• Researcher uses datasets from 2 repositories (Fedora, NVO), performs operations on those, creates a publication that contains the resulting new dataset and an accompanying paper, and deposits this publication in her institutional repository (DSpace).
Carl Lagoze
Value chains starting in repositories
Observations
• Need interoperable repository interfaces to support these kind of workflows
• Must tie the new object persistently to the objects in the origin repositories;
• Allow another party wants to add value via services to the new object (journal issue, publication)
Carl Lagoze
Value chains starting in repositories
Infrastructure to support such operations:• A data model supported across repositories• Core interfaces supported across repositories: Obtain,
Harvest, Put• Notion of persistent Identity and Lineage play a crucial
role in the data model and in the interfaces• Service Registry for discovery, matching, and application
of value-add operations to information units
Carl Lagoze
Data Model for Reusable Information
• Common framework for reuse among heterogeneous repositories
• Information sharing format - NOT a packaging format• Influenced by:
o FRBRo Kahn/Wilensky Digital Object Frameworko Compound Object Schema – METS, DIDL, FOXML
• Integrates information on:o Structureo Lineageo Access pointso Semantics
Carl Lagoze
Data Model and Service Matching
• Beyond simple (MIME) type mapping – e.g., PANIC• Provides framework for structural and semantic service
matching