
Post on 25-Apr-2018


Repository Rodeo Redux

DSpace, EPrints, Fedora, Samvera, Islandora, Invenio

Panelists

DSpace: Tim Donohue, DuraSpace

EPrints: Will Fyson, University of Southampton

Fedora: David Wilcox, DuraSpace

Samvera (formerly Hydra): Tom Cramer, Stanford

Islandora: Melissa Anez, Islandora Foundation

Invenio: Tibor Simko, CERN

DSpace

From: a turnkey institutional repository application (est. 2002).

To: an out-of-the-box, open source solution that allows you to preserve, share, and make discoverable your digital content.

Out of the box

❏ Optimized for SEO
❏ Accepts any file format
❏ Customizable forms / workflows / embargo
❏ Customizable metadata (based on Qualified DC)
❏ Organizational hierarchy
❏ Access rights delegation
❏ Usage statistics (Google Analytics or internal)
❏ Batch import / export / editing
❏ OAI, SWORD, REST, RDF
❏ DOIs, ORCIDs, OpenAIRE
❏ AIP backup / restore

and many more...
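As an illustration of the OAI interface listed above, here is a minimal sketch of how a harvester might build an OAI-PMH `ListRecords` request. The `verb`, `metadataPrefix`, `set`, and `from` parameters come from the OAI-PMH protocol itself; the base URL and the helper name `list_records_url` are assumptions made for this example, since each repository publishes its own endpoint.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Hypothetical base URL; a real repository publishes its own OAI-PMH endpoint.
BASE_URL = "https://repository.example.edu/oai/request"


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None, from_date=None):
    """Build an OAI-PMH ListRecords request URL (hypothetical helper)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec is not None:
        params["set"] = set_spec      # restrict the harvest to one set
    if from_date is not None:
        params["from"] = from_date    # only records changed since this date
    return base_url + "?" + urlencode(params)


url = list_records_url(BASE_URL, from_date="2017-01-01")
print(url)
```

A harvester would then fetch this URL and page through the XML response. That part is omitted here because it depends on the HTTP client and on the repository's configuration.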

Some Statistics

★ >2,000 known sites
★ 120+ countries
★ 20+ languages
★ 16 Registered Service Providers (in 11 countries)
★ Annual software releases
★ 15 years young!


Open Source Adaptation

4.0 (2013): First REST API, DOIs
5.0 (2015): Easier upgrades, ORCID
6.0 (2016): Major Java API refactor, easier configuration
7.0 (2018): One UI, new REST API

6.1 coming soon!

DSpace 7 UI Working Group

Coordination: Tim Donohue, DuraSpace

Angular UI (SubTeam): Art Lowel, Atmire

REST API (SubTeam): Andrea Bollini, 4Science

GOAL +NEW

https://wiki.duraspace.org/display/DSPACE/DSpace+7+UI+Working+Group

Open Source Collaboration

More on DSpace 7 efforts

“Future of DSpace and core updates” Session at 1:30pm

DSpace is funded / developed / supported by its community

★ Give back code and/or hire a service provider to do so
★ Become a member and influence product roadmap, governance and future member benefits.
★ Membership also funds coordination

Funding & Support

Get involved

Want to stay up to date with the latest development? Join us in Slack: https://wiki.duraspace.org/display/DSPACE/Slack

Want general updates or have questions? See our Mailing Lists: https://wiki.duraspace.org/display/DSPACE/Mailing+Lists

Other support options are also available (IRC, StackOverflow, etc.): https://wiki.duraspace.org/display/DSPACE/Support

EPrints

EPrints Functionality

● Repository of Persistent Managed Objects

○ Supports open access to research publications, data, educational materials

○ CRIS-lite facilities for research management

● Built on a set of metadata, data and infrastructure services / abstraction layers

○ Any data formats, library-definable metadata and workflows

● Offered by a configurable set of software components

● Available through a 3rd party app store

EPrints Bazaar

● Allows independent third party contributions

● One-click installation
○ and uninstallation!

● Separates functionality from core

● It’s another EPrints repository
○ A repository of software objects

EPrints Community

● Software announced in Oct 1999, first version delivered June 2000

● Now 635 EPrints repositories recorded in ROAR

● Supported by EPrints Services* at the University of Southampton

○ co-ordinate community support

○ provide commercial hosting, training, bespoke development

○ Core EPrints development

● Strongest community presence in UK, Germany/Austria

○ Independent community groups

* other providers are available

EPrints Development Objectives

● Repositories are used for many functions - could be more

● Simplify and streamline where possible

○ Re-engineer for less complexity

○ Significant issue in v3.3 repository innovation is what to take out!

● Make it easier to deliver customer-specific innovations

EPrints 3.4

● EPrints “Flavours” system

○ Zero – Minimal working EPrints with plugins that provides archiving capability. Build your own EPrints-based archiving system from scratch.
○ Publication – EPrints repository for open access publication and reporting
○ Data – EPrints repository for research data management
○ Education – EPrints for educational resource sharing (aka EdShare)
○ Social media – EPrints for harvesting social media data (aka Tweepository)
○ Impact – EPrints for recording impact details, be it social, academic or economic impact

● Available now on request - contact us at info@services.eprints.org

ORCID Support

● Bazaar Plugin
● Brings together ORCID developments from across the community
● Adds validated ORCID field for users and contributors
● Provides reports on ORCID uptake
● Future plans to synchronise research outputs between repository and ORCID profile

IRStats 2.5

● 3.4 Development
● Overhauled the report form, making it more flexible and intuitive.
● Added improved robot/crawler handling with support for blacklists.
● Blacklists for user agents and IP ranges are maintained centrally, with input from IRUS-UK
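To make the blacklist idea concrete, here is a minimal sketch of filtering usage hits by user agent and IP range before counting them. It is not IRStats code. The blacklist entries, the hit records, and the function name `is_robot` are all hypothetical and chosen only for illustration.

```python
import ipaddress

# Hypothetical blacklist entries for illustration only; a real deployment
# would load centrally maintained lists instead.
BLOCKED_AGENT_SUBSTRINGS = ["examplebot", "crawler"]
BLOCKED_NETWORKS = [ipaddress.ip_network("192.0.2.0/24")]


def is_robot(user_agent, ip):
    """Return True if a hit matches the user-agent or IP-range blacklist."""
    agent = user_agent.lower()
    if any(token in agent for token in BLOCKED_AGENT_SUBSTRINGS):
        return True
    address = ipaddress.ip_address(ip)
    return any(address in network for network in BLOCKED_NETWORKS)


# Made-up download hits: one ordinary browser, one blacklisted agent,
# and one browser coming from a blacklisted IP range.
hits = [
    {"agent": "Mozilla/5.0", "ip": "198.51.100.7"},
    {"agent": "ExampleBot/1.0", "ip": "198.51.100.8"},
    {"agent": "Mozilla/5.0", "ip": "192.0.2.15"},
]
human_hits = [h for h in hits if not is_robot(h["agent"], h["ip"])]
print(len(human_hits))
```

Only the first hit survives the filter, so the reported download count reflects human traffic rather than crawler traffic.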

Other Recent Development Highlights

● Reporting Framework (Bazaar)
○ Flexible reporting
○ Colour coded results
○ Detailed validation reports
○ Integrates with multiple plugins (including ORCID Support)

● Faceted Search (3.4)
○ a.k.a. “search filters” or “guided search”
○ enter initial search terms, then refine with a number of contextual filters

● Lists (3.4)
○ Compile arbitrary lists of records
○ Add record to a list from search results or abstract page
○ Describe list with a rich text field

● Visualisations (3.4)
○ Visualise records, search results, browse views and lists.
○ Bubble charts based on fields
○ Timelines based on datestamps
○ Author collaboration graphs
○ Word clouds
○ Built using D3
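The faceted search flow described above (enter initial terms, then refine with contextual filters) can be sketched in a few lines. This is a generic illustration rather than EPrints code. The records, field names, and helper functions are invented for the example.

```python
from collections import Counter

# Made-up records; the field names are assumptions for this sketch.
records = [
    {"title": "Ocean temperatures", "type": "dataset", "year": 2016},
    {"title": "Ocean currents survey", "type": "article", "year": 2017},
    {"title": "Ocean acidity", "type": "dataset", "year": 2017},
    {"title": "Forest cover", "type": "dataset", "year": 2017},
]


def search(items, term):
    """Initial search: keep records whose title contains the term."""
    return [r for r in items if term.lower() in r["title"].lower()]


def facet_counts(results, field):
    """Count how many results fall under each value of a field."""
    return Counter(r[field] for r in results)


def refine(results, field, value):
    """Apply one contextual filter to an existing result set."""
    return [r for r in results if r[field] == value]


results = search(records, "ocean")
type_facets = facet_counts(results, "type")
datasets = refine(results, "type", "dataset")
print(type_facets, len(datasets))
```

The facet counts are computed from the current result set, not from the whole repository, which is what makes the filters contextual.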

Come and chat for more details and demonstrations!

Challenges

● App store model is successful

● Open Source model of software contribution has limitations

○ Community / customers want user-facing functionality

○ Focus of funding / contribution

● Challenge is filling the “gap”

○ Systematic bug fixing, documentation

○ Core development

● Platform Future

○ Exciting new software = change management challenge in business critical software

Fedora

Flexible Extensible Durable Object Repository Architecture

Concept

Implementation

Community

Fedora...

Stores, preserves, and provides access to digital objects

Supports flexible and complex content models for objects

Supports complex semantic relationships between objects inside and outside the repository using RDF

Supports millions of objects, both large and small

Interoperates with other applications and services
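As a rough illustration of RDF relationships between objects inside and outside a repository, the sketch below models a few subject, predicate, object triples in plain Python. It does not use the Fedora API. The repository and external URIs are hypothetical, while the predicates are taken from the published PCDM and Dublin Core Terms vocabularies.

```python
# Predicates from existing vocabularies (PCDM and Dublin Core Terms).
PCDM_HAS_MEMBER = "http://pcdm.org/models#hasMember"
DCTERMS_TITLE = "http://purl.org/dc/terms/title"
DCTERMS_REFERENCES = "http://purl.org/dc/terms/references"

# Hypothetical repository base URI, for illustration only.
REPO = "https://repo.example.org/rest/"

# Each relationship is a (subject, predicate, object) triple.
triples = [
    (REPO + "collection1", DCTERMS_TITLE, "Field Notebooks"),
    (REPO + "collection1", PCDM_HAS_MEMBER, REPO + "notebook1"),
    (REPO + "collection1", PCDM_HAS_MEMBER, REPO + "notebook2"),
    # A link that points outside the repository.
    (REPO + "notebook1", DCTERMS_REFERENCES, "https://other.example.net/specimen/42"),
]


def objects(graph, subject, predicate):
    """Return every object linked from a subject by a given predicate."""
    return [o for s, p, o in graph if s == subject and p == predicate]


def is_external(uri):
    """True when a URI points outside the (hypothetical) repository."""
    return not uri.startswith(REPO)


members = objects(triples, REPO + "collection1", PCDM_HAS_MEMBER)
external_links = [o for _, p, o in triples if p == DCTERMS_REFERENCES and is_external(o)]
print(members, external_links)
```

In practice a triple store or an RDF library would hold these statements. The point of the sketch is only that internal and external relationships share one uniform shape.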

Why use Fedora?

Fedora is flexible: it can handle both simple and complex use cases

Content in Fedora is durable: Fedora supports long-term preservation

Fedora powers successful digital repository and DAM applications

Fedora is standards-based

Fedora is backed by a thriving community

Fedora Front-Ends

Fedora is middleware

You can build a custom framework, or join a broader community:

Supporting Open Scholarship and Open Science

RDA Repository Interoperability working group

● Establishing standards for interoperability between different research data repository platforms

COAR Next Generation Repositories working group

● Identifying functionalities and architectures for the next generation repositories within the context of scholarly communication

OpenSesame

OSF

Fedora Facts

Managed by DuraSpace (not-for-profit)

Funded by the community

Collaboratively developed by the community

Supported by 2 full-time staff members (not developers)

The Fedora Community

400+ public sites

1061 mailing list members

25 active developers

74 member institutions

14+ workshops and 2 training camps annually

Join the community and learn more

Fedora website

http://fedorarepository.org

Communication channels

https://wiki.duraspace.org/display/FF/Mailing+Lists+etc

Contact me

dwilcox@duraspace.org

Samvera (aka Hydra)

The project formerly known as Hydra is now Samvera.

One Body, Many Heads

Technical Stack

Deep Blue Data, University of Michigan: Data Repository

Digital Commonwealth: Boston Public Library’s Statewide Aggregator

Digital Repository of Ireland: Humanities & Soc. Sciences Data

Duke University Digital Repository: Data, Pubs & Library Collections

Education & Research Archive, University of Alberta

Virgo GIS, University of Virginia: Maps & GIS Data

PLUM, Princeton University: Digitization Workflows

Hydra at Hull, University of Hull: Institutional Repository

Used By...

● Large Universities
● Small Universities
● Colleges
● Public Broadcasting
● Government Ministry
● National Libraries
● National Lab
● Small Research Labs
● National Digital Repository
● Statewide Digital Libraries
● Chemical Heritage Foundation
● Museum of Performing Arts
● A Shakespeare Festival

Used For...

● Self-deposit System
● Digital Collections System
● Sheet Music
● Architectural Resources
● Electronic Theses & Dissertations
● Digital Image System
● Media Management
● Media Preservation System
● Research Data Management
● Digitization Workflow System
● Digital Preservation System
● Digital Archives System
● And more!

Solutions and Solution Bundles

If You Want To Go Far, Go Together

Hydra Connect at Boston Public Library, October 2016

260 attendees from 90 institutions

Samvera Partners & Adopters

At the heart of the Samvera Community are the Samvera Partners, those who both use the software and contribute to the Samvera effort overall, through code, analysis, design, support, funding, or other resources.

Samvera Partners collectively advance the project and the community for the benefit of all participants.

Samvera Github Activity

http://jrvis.com/red-dwarf/?user=samvera&repo=sufia * https://www.openhub.net/p/projecthydra

Communication Channels

478 Slack team members

Samvera Interest & Working Groups

● Fedora 3 to 4 Migration Interest Group
● DSpace/Samvera Interest Group
● Display Sets Working Group
● Hyrax Analytics Working Group
● Samvera Agile PM Interest Group
● Samvera Archivists Interest Group
● Samvera Data Mapper Working Group
● Samvera DevOps Interest Group
● Samvera Digital Preservation Interest Group
● Samvera Documentation Working Group
● Samvera Geospatial Interest Group
● Samvera GIS Data Modeling Working Group
● Samvera Metadata Interest Group
● Samvera Applied Linked Data Interest Group
● Samvera URI Selection Working Group
● Samvera Metrics Interest Group
● Samvera Newspapers Interest Group
● Samvera Plugins Working Group
● Repository Management Interest Group
● Samvera Time-based Media Interest Group
● Samvera User Experience Interest Group

Technical Innovations: More Features, Convergence, Configuration, Polish

1. Deployment options: local, docker, AWS, HykuDirect
2. Custom workflow engine (Sipity)
3. APIs: IIIF (hyku only), Linked Data, ResourceSync
4. Consolidation of Sufia & CurationConcerns into Hyrax
5. Plugins (in particular ones that allow common PCDM-based data models: GeoWorks, BookConcerns, Avarax, etc.)
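The slides name IIIF only as an API family. Assuming the IIIF Image API is among the APIs meant, the sketch below shows how an image request URL is assembled from the spec's path template `{identifier}/{region}/{size}/{rotation}/{quality}.{format}`. The server address, the identifier, and the helper name `iiif_image_url` are hypothetical, and this is not Hyku or Hyrax code.

```python
from urllib.parse import quote


def iiif_image_url(base, identifier, region="full", size="max",
                   rotation="0", quality="default", fmt="jpg"):
    """Assemble a IIIF Image API request URL (hypothetical helper).

    The identifier is percent-encoded so that any slashes inside it
    are not mistaken for path separators.
    """
    ident = quote(identifier, safe="")
    return f"{base.rstrip('/')}/{ident}/{region}/{size}/{rotation}/{quality}.{fmt}"


# Hypothetical image server and identifier.
full_image = iiif_image_url("https://images.example.org/iiif", "book1/page 3")
print(full_image)

# Request a 256x256 pixel region starting at the top-left corner instead.
detail = iiif_image_url("https://images.example.org/iiif", "book1/page 3",
                        region="0,0,256,256")
print(detail)
```

Because the whole request lives in the URL, any IIIF-aware viewer can ask the same server for regions and sizes without repository-specific code.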

Internationalization

Sufia + CurationConcerns

Community Innovations

annual report

8 registered commercial providers

Community-based technical roadmap

● from the Hydra-in-a-Box Project
● the best of Samvera in a turnkey solution
● cloud-ready, with multi-tenancy option
● no dev required: config over coding
● modern APIs: IIIF + ResourceSync
● colossal levels of anticipation
● give it a spin: demo.hydrinabox.org
● in Beta now

Islandora

Islandora is...

...an open-source software framework designed to help institutions and organizations and their audiences collaboratively manage and discover digital assets using a best-practices framework. Islandora was originally developed by the University of Prince Edward Island's Robertson Library, but is now implemented and contributed to by an ever-growing international community.


Islandora 7.x-1.x

● Drupal 7
● Solr
● Fedora 3
● Solution Packs

Islandora CLAW

● Drupal 8
● Solr
● Fedora 4
● Modules

Islandora CLAW MVP

● Minimum Viable Product

○ Bare minimum tools and functions needed for a first release.

○ Alpha MVP May 15; Beta MVP June 29

Community Development

● GitHub
○ Integrated testing (Travis)
○ Committers
○ “Core,” Labs, CLAW, Interest Groups, and Deprecated repositories.

● JIRA
○ Ticketing system
○ Bugs, Features, Improvements
○ Bi-weekly review

● Releases
○ All volunteer
○ April & October
○ Maintainers, Testers, Documenters, and Auditors.

Islandora Community

150+ public sites, 1395 listserv members, 144 GitHub members, 29 Committers, 26 Contributed Modules, 29 Foundation members, 11 Interest Groups, 7 Community-led software releases, 16 Camps, 2 conferences, 2 major versions.

And TWO full-time employees (we hired a Technical Lead!).

Islandora Foundation

A soliciting non-profit founded in July 2013 to steward the Islandora project and future development of both the software and the community.

Contact Us

Website: islandora.ca

Email: community@islandora.ca

Twitter: @islandora

Invenio

A Pythonic digital library framework

Build your own fully customised digital library, institutional repository, multimedia archive, or research data repository on the web.

Use cases

Showcase

Technology

[Architecture diagram: load balancer, web front-end, web back-end, worker, queue, cache, db, search, files]

Community

½ of 2017 on

★ 421 issues
★ 712 pull requests
★ 62 committers

View A: Framework

● Ecosystem of independent collaborative packages
● Strong focus on well-separated roles and REST APIs

Compose your own solution...

View B: Flavours

… or use predefined flavours

$ cat invenio-app-ils/setup.py
install_requires = [
    'invenio-access>=1.0.0a11,<1.1.0',
    'invenio-accounts>=1.0.0b3,<1.1.0',
    [...]
    'invenio-oauth2server>=1.0.0a15,<1.1.0',
    'invenio-oauthclient>=1.0.0a12,<1.1.0',
    'invenio-userprofiles>=1.0.0a9,<1.1.0',
    'invenio-indexer>=1.0.0a9,<1.1.0',
    'invenio-jsonschemas>=1.0.0a3,<1.1.0',
    'invenio-oaiserver>=1.0.0a12,<1.1.0',
    'invenio-pidstore>=1.0.0b1,<1.1.0',
    'invenio-records-rest>=1.0.0a18,<1.1.0',
    'invenio-records-ui>=1.0.0a9,<1.1.0',
    'invenio-records>=1.0.0b1,<1.1.0',
    'invenio-search-ui>=1.0.0a6,<1.1.0',
    'invenio-search>=1.0.0a9,<1.1.0',
    'invenio-marc21>=1.0.0a5,<1.1.0',
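Each entry in the listing above pairs a package name with a version range: a lower bound that is an alpha (`a`) or beta (`b`) pre-release and an upper bound below 1.1.0. The sketch below splits such an entry into its parts. It handles only the simple `name` plus comma-separated comparison form shown in the slide, and `parse_requirement` is a hypothetical helper, not part of Invenio or setuptools.

```python
import re

# Package name followed by an optional run of version clauses.
_REQUIREMENT = re.compile(r"^([A-Za-z0-9._-]+?)\s*([<>=!~].*)?$")
# Comparison operators; two-character forms are listed first.
_OPERATOR = re.compile(r"^(>=|<=|==|!=|~=|<|>)")


def parse_requirement(line):
    """Split 'name>=X,<Y' into (name, [(operator, version), ...])."""
    match = _REQUIREMENT.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised requirement: {line!r}")
    name, spec = match.group(1), match.group(2) or ""
    constraints = []
    for clause in (c.strip() for c in spec.split(",")):
        if not clause:
            continue
        op_match = _OPERATOR.match(clause)
        if op_match is None:
            raise ValueError(f"unrecognised version clause: {clause!r}")
        op = op_match.group(1)
        constraints.append((op, clause[len(op):].strip()))
    return name, constraints


name, constraints = parse_requirement("invenio-access>=1.0.0a11,<1.1.0")
print(name, constraints)
```

Read this way, every package in the flavour is pinned to the 1.0.x series while still accepting the pre-releases that were current at the time of the talk.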

Invenio 3.0 in production

Zenodo (and more)

CERN Analysis Preservation

Status

More than 100 packages:

● Base technology (17)
● Core features (22)
● Add-on features (24)
● Standalone utilities (23)

New CONTRIB (18) and ATTIC (26) organisations.

Roadmap

● Invenio 3.0 α → β → production package releases
○ Stable API → rich documentation → data migration
○ Base bundle ✔
○ Record bundle ✔
○ Access bundle ✔
○ Files bundle ✔
○ Deposit bundle ✔

● Invenio 3.0 flavours → ILS ✔ IR ✔ Multimedia ✔ RDM ✔

inveniosoftware.org

@inveniosoftware


Discussion
