chiara latronico,europeana cloud - ingestion clinic, the european library
DESCRIPTION
TRANSCRIPT
europeana cloud
Ingestion ClinicChiara LatronicoOperations Officer, The European Library
Marian Lefferts Executive Manager, CERL – WP4 Leader
Europeana Cloud Ingestion Clinic: 19-21 June, 3 July, 2013
Agenda Ingestion process step by step Ingestion plan broken down per provider Rights documentation (Europeana Pro) Other topics:
Thumbnails De-duplication Sets and subsets Catalogue records vs digital objects Collection descriptions
Providers experience and questions
The European Library Portal
The European Library: Ingestion WorkflowPreparation Work Content ingestion questionnaire Ingestion plan Sample records to ingest Datasets ready for harvesting Create case in CRM: case # to provider Step by Step Harvest metadata Enhance metadata Index in acceptance portal Communicate with data provider Live index = live portalDeliver to EuropeanaEnhance and publish in Europeana
The European Library: System Architecture
Harvesting: Repox
Harvesting: Repox
Ingestion: UIM Loading
Ingestion: UIM Validation
Ingestion: UIM Validation
Ingestion: UIM Validation
UIM Validation: Record in Portal
UIM Validation: Record in Portal
UIM Validation: Records in Portal
Validation: Acceptance Portal
XSLT to Internal Object Model
Ingestion: UIM OAI-Enrichment-Acceptance
Ingestion: UIM OAI-Enrichment-Acceptance
Validation: Acceptance Portal
Dataset in Acceptance Create an account onhttp://www.theeuropeanlibrary.org/
Use credentials to sign in to acceptancehttp://www.tel.ulcc.ac.uk/acceptance/
Validate data using tabs Default Dublin Core (Soon) EDM
Validation: Acceptance Portal
Acceptance Portal: Communication
When a dataset is in acceptance Communication with data provider Fixing dataset if needed More commination until provider gives approval to publish Data provider accepts dataset Dataset ready for The European Library
live index
Ingestion: UIM Index to Publish
Live Index: Live Portal
When a provider accepts dataset Dataset ready for live index
Dataset indexed into the live portal It takes from 1 day to 1 week for a
dataset to be searchable in The European Library live portal
(this is variable and changes due to circumstances)
Dataset Live in Europeana
When a provider accepts a dataset
Dataset delivered to Europeana Dataset searchable in Europeana by
following quarter
Dataset published live in Europeana E-mail to provider with link to dataset into
Europeana portal
SugarCRM: eCloud Ingestion Plan
eCloud Ingestion Plan Report
eCloud Ingestion Plan: Hangout # 119th June1. National Library of Technology (NTK), PragueThree datasets scheduled for Q2 2014 Delivery to The European Library: April 2014 In Europeana by Q3 2014
2. ULBFive datasets scheduled for Q4 2013Delivery to The European Library: October 2013 In Europeana by Q1 2014
3. DIALNETTwo datasets scheduled for Q1 2014Delivery to The European Library: January 2014 In Europeana by Q2 2014
eCloud Ingestion Plan: Hangout # 119th June
4. Tilburg UniversityOne dataset scheduled for Q1 2014Delivery to The European Library: January 2014In Europeana by Q2 2014
5. OAPENTwo datasets scheduled for Q2 2013Delivery to The European Library: May 2013In Europeana by Q3 2013
eCloud Ingestion Plan: Hangout # 2 (21st June)
1. University of EdinburghTen datasets scheduled for Q4 2013Delivery to The European Library: October 2013In Europeana by Q1 2014
2. DANSThree datasets scheduled for Q3 2013Delivery to The European Library: July 2013In Europeana by Q4 2014
3. UNIBI One dataset scheduled for Q3 2013Delivery to The European Library: July 2013In Europeana by Q4 2014
eCloud Ingestion Plan: Hangout # 2 (21st June)
4. VU UniversityNine datasets scheduled for Q3 2013Delivery to The European Library: July 2013In Europeana by Q4 2013
one dataset scheduled for Q1 2014Delivery to The European Library: January 2014In Europeana by Q2 2014
one dataset scheduled for Q3 2014Delivery to The European Library: July 2014In Europeana by Q4 2014
5. WalesOne dataset scheduled for Q1 2014Delivery to The European Library: January 2014In Europeana by Q2 2014
eCloud Ingestion Plan: Hangout # 3 (3rd July)
1. Bavarian State LibraryOne dataset scheduled for Q4 2013 Delivery to The European Library: October 2013In Europeana by Q4 2013
2. Debrecen University LibraryThree datasets scheduled for Q1 2015 Delivery to The European Library: January 2015 In Europeana by Q2 2015
eCloud Ingestion Plan: Hangout # 3 (3rd July)
3. HAZUTwenty-eight sub-sets scheduled for Q4 2013 Delivery to The European Library: October 2013 In Europeana by Q1 2014
One sub-set scheduled for Q2 2014 Delivery to The European Library: April 2014 In Europeana by Q3 2014
eCloud Ingestion Plan: Number of Records
Records promised = Records delivered
Number of records promised needs to be the same of the number of records delivered to The European Library
If a data provider cannot deliver the record promised The Collections Team needs to be informed soon
If a data provider has more records to deliver It’s good news and we will be happy to ingest more
Deliverable D4.1 (containing the ingestion schedule) is available on Basecamp and can be accessed by everyone
Europeana Pro Website
Europeana Pro is the Europeana Professional website http://pro.europeana.eu/
Here is possible to findInformation about projects NewsDiscussionsTechnical documentation
For data provider to make metadata Europeana rights information
Europeana Rights on Europeana ProEuropeana Rights... Define the rights to the digital objectA definition is mandatory for each recordCan be inserted into the metadata Can be sent via email (if the same statement is appliccable for each record)
There are 12 rights statements to choose from 2 Public Domain 6 Creative Commons Licenses 4 Europeana Rights Reserved Statements
Europeana Rights on Europeana Pro website http://pro.europeana.eu/web/guest/available-rights-statements
Other TopicsThumbnailsAre not mandatory but they enrich the collectionCan be inserted into the metadataA pattern to thumbnails can be sent via email
Other TopicsDe-duplication
If two or more of your datasets share the same records, the data provider needs to Inform the Collections TeamHelp us to identify a pattern to de-duplicate recordsOr give us a list of identifiers to work with
The European Library portal clusters similar recordsBut Europrana does not accept duplications
Other TopicsSub-sets
If a dataset is made up of several sub-sets, the data provider needs to
Inform the Collections Team
Because tables and Ingestion Plan might need to be updated
Other TopicsCatalogue records and digital objects
A catalogue record (bibliographic info) is recommended for each recordA link to a digital object is mandatory for each recordLink to digital objects need be inserted into the metadataEuropeana does not accept records without links to digital objects
Other Topics Example of record with no catalogue records or digital objects
Other TopicsCollection descriptions
A data provider could enrich a dataset by sending us a collection descriptionIt would appear on the collection level page in The European Library portalIt would improve retrieval of a dataset on Google searchIt supports data analysis for Content Ingestion Strategy
A few examplesPicture Archives and Graphics Collection, Austrian National LibraryAlba amicorum from the Koninklijke Bibliotheek, National Library of the NetherlandsDigital Periodicals and Newspapers, National Library of Spain
Other Topics
Providers experience
Comments about time table?Special issues regarding your own datasets?Assistance in preparing the data?Issues with number of records?Questions?
www.theeuropeanlibrary.org