interoperability fundamentals: sword 2
DESCRIPTION
A presentation for the SUETr Interoperability Workshop, London School of Economics Library, 9th December 2008TRANSCRIPT
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
UKOLN is supported by:
Interoperability Fundamentals: SWORD 2
8th December 2008SUETr Interoperability Workshop,
London School of Economics
Adrian StevensonSWORD 2 Project Manager
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
SWORD Quick Introduction
• Simple Web service Offering Repository Deposit
• JISC funded project 2007
• To provide a standard mechanism for ‘doing deposit’ into repositories
• Continuation funding for SWORD 2 from June 2008
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
What is it?
• A lightweight protocol for deposit• A profile of the Atom Publishing Protocol• Implementations of the SWORD deposit
interface in IntraLibrary, Fedora, DSpace and Eprints
• A number of deposit clients – web-based, command-line, desktop, Facebook
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
Background
• Before SWORD there was Deposit API• Discussions at the JISC-CETIS Conference
2005 focussed on the lack of a deposit ‘standard’
• Rachel Heery and Repositories Research Team at UKOLN facilitated a working group of repository developers
• to address this requirement for a standard interface for deposit
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
Motivations
• no standard interface for tagging, packaging or authoring tools to upload objects into a repository
• no standard interface for transferring digital objects between repositories
• no way to deposit into more than one repository with one ‘click’
• no way of initiating a deposit workflow from outside a repository system
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
The Project Partners
• SWORD partners:– UKOLN, University of Bath and University of York
(Project Management) – Adrian Stevenson & Julie Allinson
– University of Aberystwyth (DSpace, Fedora, & clients) –– Stuart Lewis, Neil Taylor, Glen Robson, Richard Jones– University of Southampton (EPrints) – Les Carr– Intrallect (IntraLibrary) –Sarah Currier
• Plus some friendly advisors– Jim Downing, Richard Green
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
The Acronym
• Simple – lightweight, agile and fit-for-purpose • Web service – independent of proprietary
software, supports standard interfaces • Offering• Repository – or any system which wants to put or
receive content • Deposit – or put, or post, or register, or add – a
little step in the ingest workflow
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
Use Cases
• Deposit from a Desktop/Online tool• Multiple deposit - e.g. deposit to institutional and
(mandated) funders’ repository with one action • Machine deposit - e.g. automated deposit from a
laboratory machine • Migration/transfer - e.g. to a preservation service • Mediated deposit - e.g. deposit by a nominated
representative, to additional repositories
Scenario 1 : Author deposits using a desktop authoring system to a mediated multiple deposit service
Librarian L completes the deposit through the repository interface
id
Librarian L invokes deposit of a surrogate into arxiv.org
Deposi
t
id
Author A deposits via an easy-deposit desktop application into the institutional repository's mediated deposit queue
A lightweight deposit web service can facilitate this transfer of object(s)
Scenario 3 : Deposit in multiple repositories
Deposi
tThe depositor can choose one or more repositories to deposit into
A lightweight deposit web service can facilitate this transfer of object(s)
A depositor is required to submit to a Research Council repository, but they also wish to deposit into their institutional repository and a relevant subject repository
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
SWORD AtomPub Profile
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
Standards
• WebDAV (http://www.webdav.org/) • JSR 170 (http://www.jcp.org/en/jsr/detail?id=170) • JSR 283 (http://www.jcp.org/en/jsr/detail?id=283) • SRW Update (http://www.loc.gov/standards/sru/) • Flickr Deposit API (http://www.flickr.com/services/api/) • Fedora Deposit API
(http://www.fedora.info/definitions/1/0/api/) • OKI OSID (http://www.okiproject.org/) • ECL (http://ecl.iat.sfu.ca/) • ATOM Publishing Protocol
(http://www.ietf.org/htmlcharters/atompub-charter.html)
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
“the Atom Publishing Protocol is an application-level protocol for publishing and editing Web resources” • benefits
– supports many of our parameters and requirements, in particular file deposit
– it already exists and has growing support – it is well-used in popular applications– it has an extension mechanism – good fit with the Web architecture
• drawbacks / risks – too much of a retrofit? – it is designed for a single package/file OR an atom document – this
means that we need to package up metadata and files
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
SWORD AtomPub Profile
• SWORD profile builds on AtomPub• Provides set of extensions, constraint
relaxations and enforcements when:– Clients post compound resources (zip,tar)– Mediated deposit required– Workflows involved
• SWORD compliance does not preclude AtomPub compliance
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
SWORD APP Package Support
• AtomPub uses MIME to describe resources• Inadequate for compound types e.g.
– Zip, tar– METS, SCORM, MPEG21, DIDL packages
• SWORD extends AtomPub:– sword:acceptPackaging element– Value taken from SWORD package types
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
SWORD APP Mediated Deposit
• SWORD deposit client user may not be owner of resource
• SWORD allows clients to set a HTTP header:– X-On-Behalf-Of
• Assumes trust between owner and mediating user
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
SWORD APP Developer Features
• No-Op (Dry Run)
• Verbose Output
• Client and Server Identity
• Auto-Discovery
• Error Documents
• Nested Service Desription
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
SWORD APP Error Documents
• SWORD adds new class of doc to AtomPub to allow better error description– ErrorContent– ErrorChecksumMismatch– ErrorBadRequest– TargetOwnerUnknown– MediationNotAllowed
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
SWORD Profile of AtomPub
• Part B follows AtomPub specification highlighting where SWORD profile diverges
• Part B covers:– Protocol Operations
• Retrieving Service Document• Listing Collections• Creating a Resource• Editing a Resource - Not currently implemented
– Category Documents – MUST NOT be required
– Service Documents• new elements: version, verbose, noOp, maxUploadSize
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
How it Works
• APP works by issuing HTTP requests (GET, POST) – GET Service Document (explain/discover)– POST ATOM document or file to collection URI
• HTTP response and ATOM document is returned
• HTTP basic authentication is required
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
Examples – Get (explain)
GET /sword-app/servicedocument HTTP/1.1Host: www.myrepository.ac.ukX-On-Behalf-Of: lcarr
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
Examples – Post (Deposit)
POST /burning-collection HTTP/1.1Host: www.myrepository.ac.uk/sword-appContent-Type: application/zipAuthorization: Basic ZGFmZnk6c2VjZJldA==Content-length: nnnContent-MD5: md5-digestContent-Disposition: filename=mydeposit.zipX-On-Behalf-Of: lcarrX-Format-Namespace: METS
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
SWORD 2 Profile Updates
• SWORD Profile Version 1.3 includes:• Revised deviations from AtomPub and Atom
– increasing requirement for persistent Atom Entry Documents
• Includes description of SWORD specific extensions
• Removed notion of levels of compliance• Added sword:userAgent, sword:error,
sword:service, sword:version and sword:maxUploadSize elements
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
SWORD In Use
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
Implementations• Repository implementations
– DSpace – EPrints – IntraLibrary – Fedora
• Client implementations – Java & PHP libraries– command-line, desktop and web clients– Facebook Client– Deposit from within MS Word & Powerpoint– Feedforward / FOREsite and others at:
http://www.swordapp.org/sword/implementations
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
Facebook Client
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
OfficeSWORD Add-on• http://www.codeplex.com/OfficeSWORD
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
SWORD in use
• In addition to the case study implementations: – Feedforward has already implemented – ICE project is looking at SWORD – DSpace and EPrints installations already exist – Microsoft eChemistry work – OAI-ORE - FOREsite work – more are planned
• NISO activity around deposit - hopefully this will recognise
• Collaberation with Nature Publishing Group
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
More Info and Contact
• SWORD Website:• http://www.swordapp.org• General queries:
– Adrian [email protected]
• Technical queries:– Sword sourceforge list
A centre of expertise in digital information management
www.ukoln.ac.uk www.bath.ac.uk
Questions
• SWORD Website• http://www.swordapp.org
• Adrian Stevenson, UKOLN• [email protected]