Antoine Isaac, Europeana – VU University Amsterdam
![Page 1: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/1.jpg)
Antoine Isaac
Europeana – VU University Amsterdam
Dagstuhl Multilingual Semantic Web seminar
![Page 2: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/2.jpg)
Europeana
24 M objects (images, text, sound and video)
From over 2,200 libraries, museums, archives
From 33 countries
For everyone
“A digital library that is a single, direct and multilingual access point to the European cultural heritage.”
European Parliament
![Page 3: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/3.jpg)
Multilingual Access in Europeana
![Page 4: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/4.jpg)
Dimensions of multilingual access
Interface
Search (query translation or document translation)
Result presentation
Browsing
![Page 5: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/5.jpg)
Europeana's efforts
Interface translated into 26 languages
Query translation: only prototype
Query result filtering by country/language
Document translation (user enabled)
Semantic contextualization of objects
• Multilingual enrichment/annotation of metadata
![Page 6: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/6.jpg)
Making metadata work for multilingual access
![Page 7: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/7.jpg)
Current metadata in Europeana
Simple object records
Flat (text values)
Without language tags!
Only language-related info on metadata is at collection level
• Can be "mul"
Need to change!
a new Europeana Data Model (EDM)
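The difference between flat values and language-tagged values can be sketched in a few lines of Python. This is only an illustration, not the EDM itself: the `Value` class, the `pick_label` helper and the sample titles are assumptions made up for this example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Value:
    """A metadata value (hypothetical structure, not an EDM class)."""
    text: str
    lang: Optional[str] = None  # None models a flat value with no language tag


def pick_label(values: List[Value], preferred_lang: str) -> Optional[str]:
    """Return the first value tagged with preferred_lang, or None if there is none."""
    for value in values:
        if value.lang == preferred_lang:
            return value.text
    return None


# Flat record: the only language hint is a collection-level "mul",
# so no individual value can be selected by language.
flat_titles = [Value("Tafelklavier"), Value("Square pianoforte")]

# The same values with per-value language tags.
tagged_titles = [Value("Tafelklavier", "de"), Value("Square pianoforte", "en")]

print(pick_label(flat_titles, "en"))    # None
print(pick_label(tagged_titles, "en"))  # Square pianoforte
```

With untagged values, a portal cannot choose which title to show to an English-speaking user; with tags, the choice becomes a simple lookup.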
![Page 8: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/8.jpg)
"Semantic layer" of contextual resources (concepts, persons, places, events...)
Networked objects
Cultural artefact
Painting, Sculpture, Building
Exploiting semantic relations, e.g. “broader concept”, “place of birth”, “involved person”…
![Page 9: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/9.jpg)
Multilingual metadata
![Page 10: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/10.jpg)
Fetching already available linked data
http://www.w3.org/2005/Incubator/lld/XGR-lld-vocabdataset/
E.g., from libraries
![Page 11: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/11.jpg)
Interoperability
Encouraging the use of RDF + common and simple elements
![Page 12: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/12.jpg)
Interoperability
Encouraging the use of common and simple data elements
<skos:Concept rdf:about="http://www.mimo-db.eu/InstrumentsKeywords/2308">
  <skos:prefLabel xml:lang="fr">Piano carré</skos:prefLabel>
  <skos:prefLabel xml:lang="it">Pianoforte a tavolino</skos:prefLabel>
  <skos:prefLabel xml:lang="en">Square pianoforte</skos:prefLabel>
  <skos:prefLabel xml:lang="de">Tafelklavier</skos:prefLabel>
  <skos:prefLabel xml:lang="nl">Tafelpiano</skos:prefLabel>
  <skos:prefLabel xml:lang="sv">Taffel</skos:prefLabel>
  <skos:broader>
    <skos:Concept rdf:about="http://www.mimo-db.eu/InstrumentsKeywords/2273">
      <skos:prefLabel xml:lang="en">Pianofortes</skos:prefLabel>
    </skos:Concept>
  </skos:broader>
</skos:Concept>
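As a minimal sketch of how such a record could be consumed, the Python below reads the language-tagged labels with the standard library only. It assumes the fragment is wrapped in an `rdf:RDF` element with the usual RDF and SKOS namespace declarations, which the slide omits; the `pref_labels` helper is a name invented for this example.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
SKOS = "http://www.w3.org/2004/02/skos/core#"
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# The slide's fragment, wrapped in rdf:RDF so that it parses on its own
# (the wrapper and namespace declarations are an assumption).
SNIPPET = f"""
<rdf:RDF xmlns:rdf="{RDF}" xmlns:skos="{SKOS}">
  <skos:Concept rdf:about="http://www.mimo-db.eu/InstrumentsKeywords/2308">
    <skos:prefLabel xml:lang="fr">Piano carré</skos:prefLabel>
    <skos:prefLabel xml:lang="it">Pianoforte a tavolino</skos:prefLabel>
    <skos:prefLabel xml:lang="en">Square pianoforte</skos:prefLabel>
    <skos:prefLabel xml:lang="de">Tafelklavier</skos:prefLabel>
    <skos:prefLabel xml:lang="nl">Tafelpiano</skos:prefLabel>
    <skos:prefLabel xml:lang="sv">Taffel</skos:prefLabel>
    <skos:broader>
      <skos:Concept rdf:about="http://www.mimo-db.eu/InstrumentsKeywords/2273">
        <skos:prefLabel xml:lang="en">Pianofortes</skos:prefLabel>
      </skos:Concept>
    </skos:broader>
  </skos:Concept>
</rdf:RDF>
"""


def pref_labels(concept):
    """Map language tag -> preferred label, using only direct child prefLabels."""
    return {
        label.get(XML_LANG): label.text
        for label in concept.findall(f"{{{SKOS}}}prefLabel")
    }


root = ET.fromstring(SNIPPET)
concept = root.find(f"{{{SKOS}}}Concept")
labels = pref_labels(concept)

# The broader concept is nested inside skos:broader.
broader = concept.find(f"{{{SKOS}}}broader/{{{SKOS}}}Concept")
broader_labels = pref_labels(broader)

print(labels["de"])          # Tafelklavier
print(broader_labels["en"])  # Pianofortes
```

Because every label carries its own `xml:lang`, the same concept can serve display in the user's language and, in principle, simple term-level query translation.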
![Page 13: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/13.jpg)
Interoperability
mixed nature of eligible contextual resources: dictionaries, synonym/translation lists, thesauri, authority lists, gazetteers…
interplay: “semantic” data next to multilingual data
![Page 14: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/14.jpg)
Simultaneous approaches
Getting richer semantic/multilingual metadata from providers
Fetching third-party contextual data and linking it to “un-contextualized” objects
Linking contextual data from an institution to another more general / more commonly used contextual dataset
• DBpedia.org, VIAF.org…
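One simple way to propose links between an institution's concept and a more general dataset is to compare labels language by language. The sketch below is an assumption for illustration, not the method used by Europeana: the `example.org` URIs, the target labels and the function names are all hypothetical, and real linking would need more than exact label matches.

```python
def normalize(label):
    """Case-fold and collapse whitespace so trivially different labels compare equal."""
    return " ".join(label.casefold().split())


def shared_languages(a, b):
    """Languages for which both label maps carry the same (normalized) label."""
    return sorted(
        lang for lang in a.keys() & b.keys()
        if normalize(a[lang]) == normalize(b[lang])
    )


def candidate_links(local_labels, target_dataset, min_shared=1):
    """Return (uri, agreeing languages) pairs, best-supported candidates first."""
    links = []
    for uri, labels in target_dataset.items():
        langs = shared_languages(local_labels, labels)
        if len(langs) >= min_shared:
            links.append((uri, langs))
    return sorted(links, key=lambda link: -len(link[1]))


# Labels of the institution's concept (taken from the SKOS example).
local = {"fr": "Piano carré", "en": "Square pianoforte", "de": "Tafelklavier"}

# Hypothetical general-purpose dataset; URIs and labels are made up.
general = {
    "http://example.org/general/square-piano": {
        "en": "square pianoforte",
        "de": "Tafelklavier",
    },
    "http://example.org/general/piano": {"en": "Piano", "fr": "Piano"},
}

links = candidate_links(local, general)
print(links)
```

Requiring agreement in more than one language (a higher `min_shared`) trades recall for precision, which matters when labels are short and ambiguous.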
![Page 15: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/15.jpg)
Status and challenges
![Page 16: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/16.jpg)
Current status
All this is work in progress and will take time
R&D prototypes (EuropeanaConnect) showing the challenges of gathering appropriate multilingual tools and data
First tests of simple techniques in production portal: GeoNames (places) and GEMET (concepts)
Encouraging, but the tests illustrate the issues with overly naïve approaches (no NLP) and incomplete data
• Cheval
• Poison
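The slide does not spell out what goes wrong with these two words, so the following is only a guess at the kind of problem involved: plain string matching against a vocabulary ignores both language and the type of resource expected. The vocabulary entries, URIs and function names below are hypothetical.

```python
from typing import List, NamedTuple, Optional


class Entry(NamedTuple):
    uri: str
    kind: str             # "place" or "concept"
    label: str
    lang: Optional[str]   # None when the source gives no language


# Toy vocabulary (made up): a place name and a concept share one spelling.
VOCAB = [
    Entry("http://example.org/places/1", "place", "Cheval", None),
    Entry("http://example.org/concepts/horse", "concept", "cheval", "fr"),
    Entry("http://example.org/concepts/horse", "concept", "horse", "en"),
]


def naive_matches(value: str) -> List[Entry]:
    """Match on the case-folded string only, ignoring language and type."""
    key = value.casefold()
    return [entry for entry in VOCAB if entry.label.casefold() == key]


def filtered_matches(value: str, kind: str, lang: Optional[str] = None) -> List[Entry]:
    """Keep naive matches of the expected kind and, if known, a compatible language."""
    return [
        entry for entry in naive_matches(value)
        if entry.kind == kind and (lang is None or entry.lang in (None, lang))
    ]


print(len(naive_matches("Cheval")))               # 2
print(filtered_matches("Cheval", "concept", "fr"))
```

The naive matcher returns both the place and the concept for the same string; knowing the field's expected resource type and the value's language removes this particular ambiguity, though not ambiguity in general.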
http://www.europeana.eu
![Page 17: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/17.jpg)
Problems & requirements
For providers & Europeana
Continue work on metadata
Benchmarking (cf. CHiC lab @ CLEF)
Positioning as consumers and contributors of data (cf. Asun’s slides)
data.europeana.eu
For language-intensive tools and resources
Availability: open resources
Interoperability
Simplicity
• But not always! E.g., not only “first hit” translations
Scale: scalability of tools, number and scope of datasets
Many languages, some less well-resourced than English
![Page 18: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/18.jpg)
Another illustration: VOICES project
Something entirely different but not completely unrelated
Voice-based community-centric mobile services for social development
Easing communication on agricultural trade
Listing of products/prices via phone/radio
Pilot in Mali
Challenges
Data-centric project, but language technology plays a crucial role
Objects should be provided with textual and audio labels (text-to-speech system) in different languages
Local languages: e.g., Bambara
Lack of resources: need low-cost, easy-to-adapt solutions
Victor de Boer, VU Amsterdam
![Page 19: Antoine Isaac Europeana – VU University Amsterdam](https://reader036.vdocuments.us/reader036/viewer/2022062321/56812d14550346895d91f547/html5/thumbnails/19.jpg)
Thank you
http://www.few.vu.nl/~aisaac/
Some slides are based on those by Marlies Olensky and Juliane Stiller, Multilingual Web Workshop, June 11, 2012, Dublin