Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Querying Data about Research Results, Persons, Projects and Organizations

Sahar Vahdati, Christoph Lange (University of Bonn, Germany)
Giorgos Alexiou, George Papastefanatos (Athena Research Center)

Digital Infrastructure for Research (DI4R), 28-30 September 2016, Kraków, Poland

Upload: openaire

Post on 14-Apr-2017



Session outline

• Introduction to OpenAIRE
• Technical Concepts
• Hands on Session

Open Access Infrastructure for Research in Europe

Need for digital research infrastructures for all kinds of research outputs, across disciplines and countries.

OpenAIRE:
• comprises a database of all EC FP7 and H2020 funded research projects, publications and datasets
• manages scientific publications and associated scientific material
• aggregates Open Access publications and links them to research data and funding bodies
• supports the Open Access principles via national helpdesks and comprehensive guidelines

http://www.openaire.eu

OpenAIRE Services

OpenAIRE focuses on:
• Workflows and processes of scholarly communication rather than resources
• Research data and other research outputs rather than only publications
• The links between the considered entities
• Relationship of European OA infrastructures with other regions of the world

It enables search, discovery and monitoring of the publications and datasets resulting from research projects:
• >100k research projects
• >17m publications
• >23k datasets
• >5k repositories

OpenAIRE Data Model

[Figure: OpenAIRE data model diagram, distinguishing core entities from linking entities]

Example of data about Core Entities

An entity of type Result:

Entity type:           Result
openaireID:            od_______908::fac3db85bbcb1f52ae07c5868d8fb453
dateOfTransformation:  2015-02-06
dateOfCollection:      2015-02-06
title:                 A Patient from Argentina Infected with Rickettsia massiliae
Dateofacceptance:      01/04/2010
Publisher:             The American Society of Tropical Medicine and Hygiene
Pid:                   oai:europepmc.org:2077077, PMC2844561
Language:              English
Subject:               Articles
BestLicense:           Open Access

The OpenAIRE vision

• Interlink to other databases
• Support researchers by answering interesting queries
• Data about scientific events, emergence of scientific topics
• Data about people, affiliation, impact of certain research

Use cases

• Research managers use new indicators for measuring quality
• Policy makers get a quick overview of the findings and projects
• Researchers find comprehensive citation lists and research movement between communities/organizations
• Reviewers get a quick overview of the field covered by the paper or dataset under review

Challenges supported by LOD Services

Publishing the OpenAIRE data as Linked Open Data (LOD), using the RDF data model, and linking it to related datasets addresses:
• Diverse data formats
• Various means to access/query data
• Use of different identifiers
• Heterogeneity of metadata schemas

Expected values

Exposing the OpenAIRE Information Space as linked data is expected to:
• Open up a window to the Linked Open Data Web
• Increase the OpenAIRE technical interoperability
• Increase the reusability of the OpenAIRE research metadata
• Engage with additional user communities
• Explore synergies with and added value to related open content initiatives
• Provide links through LOD to similar infrastructures
• Offer new services for OA data monitoring activities
• Provide services to export the OpenAIRE objects as a LOD graph
• Facilitate integration with other LOD graphs relative to similar systems and infrastructures
• Find patterns to enrich the OpenAIRE information space

Towards OpenAIRE LOD Services

• Phase 1: LOD Production
• Phase 2: Interlinking OpenAIRE RDF Graph to LOD cloud

Phase 1 steps:
• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation

OA RDF graph (OpenAIRE data converted to OA RDF)

    …
    @prefix oad: <http://lod.openaire.eu/data/> .
    @prefix oav: <http://lod.openaire.eu/vocab/> .
    @prefix dbpedia-owl: <http://dbpedia.org/ontology/> .
    @prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#> .
    @prefix pext: <http://www.ontotext.com/proton-ontology#> .
    @prefix swrc: <http://swrc.ontoware.org/ontology#> .

    oad:07553d8e646b69b868a9791da39a1802 a foaf:Person ;
        foaf:firstName "P"^^xsd:string ;
        foaf:lastName "Jha"^^xsd:string ;
        foaf:name "Jha P"^^xsd:string ;
        oav:isAuthorOf oad:755469c995c2cb6cb55c3483634b026 .

    oad:755469c995c2cb6cb55c3483634b026 a foaf:Person ;
        oav:hasTarget result:doajarticles_6fcd7b3b47ebbd05ce73018731ff9095 ;
        oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string ;
        oav:ranking "6"^^xsd:integer .

    oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person ;
        foaf:firstName "T"^^xsd:string ;
        foaf:lastName "Bere"^^xsd:string ;
        foaf:name "Bere T"^^xsd:string .
    …

Phase 1: LOD Production

Specify vocabularies for core entities and linking entities.

OpenAIRE data as RDF graph:

Entity type      Count
Organizations    68526
Results          17414766
Persons          62958315
Datasources      19443
Projects         624417

(including duplicates connected with sameAs)

Total number of triples: 1013527855
Distinct entities: 98256

Phase 2: Interlinking OA-RDF Graph to LOD cloud

Steps:
• Identify datasets to interlink with
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in the OA workflow

[Figure: the OA RDF graph (OA LOD) linked into the Linked Open Data (LOD) cloud; legible dataset labels include DBLP, CiteSeer and CEUR, the remaining labels are truncated]

http://beta.lod.openaire.eu

RDF (Resource Description Framework)

• Resource: anything uniquely identifiable
• Description: description of a resource via its properties and relations
• Framework: web-based protocols and semantics
• RDF triples: a list of statements

Subject (URI)       Predicate (URI)    Object (URI or literal)
oad:publication1    oav:hasAuthor      "Juan Carlos García"
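The subject/predicate/object structure can be illustrated with a minimal Python sketch. It is only an illustration of the data model, not part of the OpenAIRE tooling; the `Triple` type and `describe` helper are hypothetical names, and the namespace URIs are assumed expansions of the `oad` and `oav` prefixes used on the slides.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str    # URI of the resource being described
    predicate: str  # URI of the property or relation
    obj: str        # here always a literal; real RDF also allows URIs


# Assumed expansions of the "oad" and "oav" prefixes.
OAD = "http://lod.openaire.eu/data/"
OAV = "http://lod.openaire.eu/vocab/"

triples = [
    Triple(OAD + "publication1", OAV + "hasAuthor", "Juan Carlos García"),
]


def describe(t: Triple) -> str:
    """Render one statement in an N-Triples-like form (object as a literal)."""
    return f'<{t.subject}> <{t.predicate}> "{t.obj}" .'


for t in triples:
    print(describe(t))
```

Each element of `triples` is one statement; a larger graph is simply a longer list of such statements.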

RDF version of example

    PREFIX dcterms: <http://purl.org/dc/terms/>
    …
    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    PREFIX prov: <http://www.w3.org/ns/prov#>

    od_______908… rdf:type cerif:ResultEntity ;
        dcterms:description "The first confirmed case" ;
        dcterms:publisher "The American Society of Tropical Medicine and Hygiene" ;
        …
        oav:resultSubject "Articles" ;
        oav:dateOfCollection "2015-02-06" .

Example of data about Linking entities

An entity of type Person_Result, whose ranking property can have the value 1 to indicate the first author.

Person:  od_______908f39…1c4a    (rdf:type foaf:Person)
Link:    PersonResult, oav:rank 1
Result:  od_______908fa3b453     (rdf:type cerif:ResultEntity)
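To show how a ranking stored on the linking entity can be used, here is a small Python sketch that orders the authors of one result and picks the first author. The identifiers and the tuple layout are hypothetical stand-ins, not the OpenAIRE representation.

```python
# Hypothetical authorship links: (person_id, result_id, rank).
links = [
    ("person-b", "result-1", 2),
    ("person-a", "result-1", 1),
    ("person-c", "result-1", 3),
]


def authors_in_order(links, result_id):
    """Return the person ids linked to one result, sorted by rank."""
    ranked = [(rank, person) for person, result, rank in links if result == result_id]
    return [person for rank, person in sorted(ranked)]


def first_author(links, result_id):
    """Return the person whose link has rank 1, or None if there is none."""
    for person, result, rank in links:
        if result == result_id and rank == 1:
            return person
    return None


print(authors_in_order(links, "result-1"))
print(first_author(links, "result-1"))
```

Keeping the rank on the link rather than on the person lets the same person be first author of one result and third author of another.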

How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)

• Query language for RDF-based data
• SPARQL endpoint: an RDF triple database on a server, available on the Web
• Pattern matching language
• Protocol layer
• Query interface
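As a rough illustration of the protocol layer, the following Python sketch builds the URL of a SPARQL Protocol GET request, in which the query text is sent in the `query` parameter. The endpoint address is assumed to be the hands-on endpoint of this tutorial, and `build_request_url` is a hypothetical helper; the sketch only constructs the URL and does not contact the server.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Assumed endpoint address (the hands-on endpoint of this tutorial).
ENDPOINT = "http://beta.lod.openaire.eu/sparql"


def build_request_url(endpoint: str, query: str) -> str:
    """Build a SPARQL Protocol GET URL with the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


query = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"
url = build_request_url(ENDPOINT, query)
print(url)

# Round trip: decoding the URL gives back the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]
print(decoded == query)
```

A client would send this URL with an `Accept` header such as `application/sparql-results+json` to ask for the result table in JSON.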

How to query

• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL via the SELECT statement

Example: SELECT ?title ?author

• Results are returned as a table

?title                                                         ?author
A Patient from Argentina Infected with Rickettsia massiliae    Juan Carlos García
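The idea of binding variables by pattern matching can be sketched in a few lines of Python. This is a toy matcher over an in-memory list, assuming simplified string terms (`pub1`, `title`, `author`) instead of full URIs; `match` and `select` are hypothetical helpers and not how a SPARQL engine is actually implemented.

```python
triples = [
    ("pub1", "title", "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("pub1", "author", "Juan Carlos García"),
]


def match(pattern, triple, binding):
    """Extend a binding so that the pattern equals the triple, or return None."""
    new = dict(binding)
    for p, t in zip(pattern, triple):
        if p.startswith("?"):
            # A variable must keep the value it was bound to earlier.
            if new.setdefault(p, t) != t:
                return None
        elif p != t:
            return None
    return new


def select(variables, patterns, triples):
    """Return one row per consistent way of matching all patterns."""
    bindings = [{}]
    for pattern in patterns:
        bindings = [
            b2
            for b in bindings
            for t in triples
            if (b2 := match(pattern, t, b)) is not None
        ]
    return [tuple(b[v] for v in variables) for b in bindings]


rows = select(
    ["?title", "?author"],
    [("?pub", "title", "?title"), ("?pub", "author", "?author")],
    triples,
)
for row in rows:
    print(row)
```

The shared variable `?pub` is what joins the two patterns, so title and author in a row always belong to the same publication.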

OpenAIRE as LOD

• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities

http://beta.lod.openaire.eu

Data conforming to LOD best practices (main entities and linking entities) was published in BETA in December 2015.

http://beta.lod.openaire.eu

Sample query

Number of projects per funding level:

    SELECT (COUNT(DISTINCT ?s) AS ?count) ?flevel
    FROM <test>
    FROM <relationsTest>
    WHERE {
      ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
         <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
    }
    GROUP BY ?flevel
    ORDER BY ?count
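What a GROUP BY query with COUNT(DISTINCT …) computes can be mimicked in plain Python. The sketch below assumes a handful of made-up (project, funding level) matches; the project identifiers and level names are hypothetical and `count_per_level` is an illustrative helper.

```python
from collections import defaultdict

# Hypothetical matches of the WHERE clause: (project, funding level).
matches = [
    ("project-1", "FP7"),
    ("project-2", "FP7"),
    ("project-2", "FP7"),  # duplicate match, counted only once
    ("project-3", "H2020"),
]


def count_per_level(matches):
    """Count distinct projects per funding level, ordered by ascending count."""
    projects = defaultdict(set)
    for project, level in matches:
        projects[level].add(project)
    counts = [(len(members), level) for level, members in projects.items()]
    return sorted(counts)


for count, level in count_per_level(matches):
    print(level, count)
```

Using a set per level is what makes the count distinct, in the same way DISTINCT does inside the aggregate.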

General architecture

[Figure: architecture diagram. Labels: OpenAIRE Metadata, OA Data Model, OA Vocabulary, RDFization, Interlinking, Deduplication & Inference, RDF Store, Apache Solr, LOD Client, HTML Browser; sites https://www.openaire.eu and http://beta.lod.openaire.eu; output formats HTML and RDF]


OA LOD interlinking workflow

Preprocessing:
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([properties]))
• Store in HDFS
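The pruning and key-value steps above might look roughly like the following Python sketch. It assumes records arrive as dictionaries with an `id` field; the set of kept properties and the sample records are hypothetical, and the real workflow runs on Hadoop rather than on in-memory lists.

```python
# Hypothetical set of properties worth keeping for link discovery.
KEEP = {"title", "author", "year"}


def to_key_value(record: dict) -> tuple:
    """Map one metadata record to (ID, [properties]), dropping pruned fields."""
    props = sorted((k, v) for k, v in record.items() if k in KEEP and v)
    return record["id"], props


records = [
    {"id": "r1", "title": "Some title", "author": "A. Author", "year": "1996", "checksum": "abc"},
    {"id": "r2", "title": "Another title", "author": "", "format": "pdf"},
]

pairs = [to_key_value(r) for r in records]
for key, value in pairs:
    print(key, value)
```

Empty values and fields outside `KEEP` are dropped, so each key carries only the properties that a later comparison step would use.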

Sample interlinking result

The result of interlinking is a set of links between URIs from the source and target datasets.

Note: the DBLP dump is not complete.

    <http://lod.openaire…bde783> owl:sameAs <http://dblp.l3s…BoissonnatN96>
    <http://lod.openaire…4f8964> owl:sameAs <http://dblp.l3s…Shrobe96>
    <http://lod.openaire…27fea2> owl:sameAs <http://dblp.l3s…X96c>
    <http://lod.openaire…f433b9> owl:sameAs <http://dblp.l3s…LiroyG96>
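A link set of this shape is easy to post-process. The Python sketch below parses `owl:sameAs` lines into (source, target) pairs with a regular expression; the `example.org` URIs are hypothetical, fully spelled-out stand-ins for the abbreviated URIs on the slide, and `parse_links` is an illustrative helper.

```python
import re

# Matches "<source> owl:sameAs <target>" and captures both URIs.
LINK = re.compile(r"<([^>]+)>\s+owl:sameAs\s+<([^>]+)>")

# Hypothetical URIs standing in for the abbreviated ones on the slide.
linkset_text = """
<http://example.org/openaire/bde783> owl:sameAs <http://example.org/dblp/BoissonnatN96> .
<http://example.org/openaire/4f8964> owl:sameAs <http://example.org/dblp/Shrobe96> .
"""


def parse_links(text):
    """Return a list of (source URI, target URI) pairs found in the text."""
    return LINK.findall(text)


links = parse_links(linkset_text)
targets_by_source = dict(links)

for source, target in links:
    print(source, "->", target)
```

Turning the pairs into a dictionary gives a quick lookup from an OpenAIRE URI to its counterpart in the target dataset.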


Ideas for LOD in Monitoring

Monitoring interlinking: when the target dataset grows from one version to the next, we can expect the linkset to grow as well.
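One simple way to monitor this expectation is to compare the link sets produced for two versions of the target dataset. The following Python sketch assumes each link set is available as a set of (source, target) pairs; the identifiers are made up and `linkset_growth` is a hypothetical helper, not an existing OpenAIRE service.

```python
def linkset_growth(old_links: set, new_links: set) -> dict:
    """Summarise how a link set changed between two versions."""
    return {
        "old_size": len(old_links),
        "new_size": len(new_links),
        "added": sorted(new_links - old_links),
        "removed": sorted(old_links - new_links),
        "grew": len(new_links) > len(old_links),
    }


# Hypothetical link sets for two versions of the target dataset.
version_1 = {("oa-1", "dblp-1"), ("oa-2", "dblp-2")}
version_2 = {("oa-1", "dblp-1"), ("oa-2", "dblp-2"), ("oa-3", "dblp-3")}

report = linkset_growth(version_1, version_2)
print(report)
```

A report in which the target dataset grew but `grew` is false, or in which `removed` is not empty, would be a signal worth inspecting.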

Scientific events

Bootstrapping datasets for scientific events:
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in the OA Data Model (Conference Object)

Measure the quality of events:
• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …

Hands on

http://beta.lod.openaire.eu/sparql

Example: What is the overall research output of a given project?

    # Note from the slide: oav:produces and UNION are not working
    # Assumption: resultType values are plain string literals
    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>

    SELECT ?x ?y
    WHERE {
      ?y a cerif:ResultEntity .
      { ?y oav:resultType "dataset" }
      UNION
      { ?y oav:resultType "publication" }
      ?x a cerif:Project .
      ?y cerif:linkToProject ?x .
    }
    LIMIT 10

Example: What organizations are more active than others w.r.t. projects?

    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>

    SELECT ?o
    WHERE {
      ?x oav:projectOrganization ?o .
      ?o a foaf:Organization .
      ?y oav:projectOrganization ?o2 .
      ?o2 a foaf:Organization .
      # Same organization, taking part in two different projects
      FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
    }
    LIMIT 10

Example: What datasets have been published by a specific person who is involved in a given project?

    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    PREFIX dcterms: <http://purl.org/dc/terms/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>

    SELECT ?y
    WHERE {
      ?p cerif:linksToPerson ?x .
      ?x a foaf:Person .
      ?x dcterms:creator ?y .
      ?y oav:resultType "dataset" .
    }
    LIMIT 10

Example: List the full names of all authors who have (co-)authored a publication in project P.

    # Query as shown on the slide (identical to the previous example)
    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    PREFIX dcterms: <http://purl.org/dc/terms/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>

    SELECT ?y
    WHERE {
      ?p cerif:linksToPerson ?x .
      ?x a foaf:Person .
      ?x dcterms:creator ?y .
      ?y oav:resultType "dataset" .
    }
    LIMIT 10
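For the last question, a query could be assembled along the following lines. This Python sketch only builds the query text; it is not the tutorial's solution. It assumes that `oav:isAuthorOf` links a person directly to a result and that `cerif:linkToProject` links a result to its project, which may not match the published data, and the project IRI and the function `authors_of_project_query` are hypothetical.

```python
PREFIXES = (
    "PREFIX oav: <http://lod.openaire.eu/vocab/>\n"
    "PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>\n"
    "PREFIX foaf: <http://xmlns.com/foaf/0.1/>\n"
)


def authors_of_project_query(project_iri: str, limit: int = 10) -> str:
    """Build a SPARQL query listing author names for one project (assumed schema)."""
    # Reject characters that cannot appear inside an IRI reference.
    if any(c in project_iri for c in '<>" {}|\\^`'):
        raise ValueError("not a usable IRI: " + project_iri)
    lines = [
        "SELECT DISTINCT ?name",
        "WHERE {",
        "  ?person a foaf:Person ;",
        "          foaf:name ?name ;",
        "          oav:isAuthorOf ?result .",  # assumption: direct person-to-result link
        f"  ?result cerif:linkToProject <{project_iri}> .",
        "}",
        f"LIMIT {int(limit)}",
    ]
    return PREFIXES + "\n".join(lines)


# Hypothetical project IRI standing in for "project P".
print(authors_of_project_query("http://example.org/project/P"))
```

The printed text can be pasted into the endpoint's query form; if the data models authorship through a linking entity instead, the `oav:isAuthorOf` pattern would need an extra hop.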

Page 3: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Open Access Infrastructure for Research in Europe

Need for digital research infrastructures for all kinds of research outputs across disciplines and countries

bull comprises a database of all EC FP7 and H2020 funded research projects publications datasets

bull manages scientific publications and associated scientific material

bull aggregates Open Access publications and links them to research data and funding bodiesbull supports the Open Access principles via national helpdesks and comprehensive guidelines

httpwwwopenaireeu

OpenAIRE Services

OpenAIRE focuses onbull Workflows and processes of scholarly communication rather than resources

bull Research data and other research outputs rather than only publications

bull The links between considered entities

bull Relationship of European OA infrastructures with other regions of the world

enables search discovery and monitoring of the publications and datasets resulting from gt100k research projects gt17m publications

gt23k datasetsgt5k repositories

Core entities

Linking entities

OpenAIRE Data Model

Example of data about Core Entities

Entity type Result

openaireID od_______908fac3db85bbcb1f52ae07c5868d8fb453

dateOfTransformation 2015-02-06dateOfCollection 2015-02-06

titleA Patient from Argentina Infected with Rickettsia massiliae

Dateofacceptance 01042010Publisher The American Society of Tropical Medicine and Hygiene

Pid oaieuropepmcorg2077077PMC2844561Language EnglishSubject Articles

BestLicense Open Acces

An entity of type Result

The OpenAIRE vision

Interlink to other databases
Support researchers by answering interesting queries

• Data about scientific events, emergence of scientific topics
• Data about people, affiliation, impact of certain research

Use cases

• Research managers use new indicators for measuring the quality
• Policy makers get a quick overview of the findings and projects
• Researchers find comprehensive citation lists and research movement between communities/organizations
• Reviewers get a quick overview of the field covered by the paper or dataset under review

Challenges supported by LOD Services

• Diverse data formats
• Various means to access/query data
• Use of different identifiers
• Heterogeneity of metadata schemas

Linked Open Data (LOD), RDF data model: publishing the OpenAIRE data as Linked Open Data and linking it to related datasets

Expected values

• Open up a window to the Linked Open Data Web
• Increase the OpenAIRE technical interoperability
• Increase the reusability of the OpenAIRE research metadata
• Engage with additional user communities
• Explore synergies with and added value to related open content initiatives
• Provide links through LOD to similar infrastructures
• Offer new services for OA data monitoring activities
• Provide services to export the OpenAIRE objects as a LOD graph
• Facilitate integration with other LOD graphs relative to similar systems and infrastructures
• Find patterns to enrich the OpenAIRE information space

Exposing the OpenAIRE Information Space as linked data

Towards OpenAIRE LOD Services

Phase 1: LOD Production
Phase 2: Interlinking OpenAIRE RDF Graph to LOD cloud

Steps
• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation
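The mapping steps can be pictured with a small sketch. It only illustrates the idea of turning a flat metadata record into RDF statements; the field-to-predicate table, the record identifier `result-1` and the helper `to_ntriples` are assumptions made for this example, not the project's actual RDFization code.

```python
# Namespaces as used on the slides; the trailing slashes are an assumption.
DATA_NS = "http://lod.openaire.eu/data/"
VOCAB_NS = "http://lod.openaire.eu/vocab/"
DCTERMS_NS = "http://purl.org/dc/terms/"

# Hypothetical field-to-predicate mapping; the real vocabulary may differ.
FIELD_TO_PREDICATE = {
    "publisher": DCTERMS_NS + "publisher",
    "subject": VOCAB_NS + "resultSubject",
    "dateOfCollection": VOCAB_NS + "dateOfCollection",
}


def escape_literal(value):
    """Escape backslashes and double quotes for an N-Triples literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"')


def to_ntriples(record_id, record):
    """Turn one flat record into a sorted list of N-Triples lines."""
    subject = "<" + DATA_NS + record_id + ">"
    lines = []
    for field, value in record.items():
        predicate = FIELD_TO_PREDICATE.get(field)
        if predicate is None:
            continue  # fields without a mapping are skipped
        lines.append('%s <%s> "%s" .' % (subject, predicate, escape_literal(value)))
    return sorted(lines)


record = {
    "publisher": "The American Society of Tropical Medicine and Hygiene",
    "subject": "Articles",
    "dateOfCollection": "2015-02-06",
    "language": "English",  # no mapping defined, so it is dropped
}
triples = to_ntriples("result-1", record)
for line in triples:
    print(line)
```

Keeping the mapping in a plain table mirrors the step "map the OA data model to an RDF data model": the table is the model mapping, and the loop is the data mapping that produces the dump.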

OA RDF graph (OpenAIRE data converted to OA RDF)

  …
  prefix oad: <http://lod.openaire.eu/data/>
  prefix oav: <http://lod.openaire.eu/vocab/>
  prefix dbpedia-owl: <http://dbpedia.org/ontology/>
  prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl>
  prefix pext: <http://www.ontotext.com/proton-ontology>
  prefix swrc: <http://swrc.ontoware.org/ontology>

  oad:07553d8e646b69b868a9791da39a1802 a foaf:Person
      foaf:firstName "P"^^xsd:string
      foaf:lastName "Jha"^^xsd:string
      foaf:name "Jha P"^^xsd:string
      oav:isAuthorOf
  oad:755469c995c2cb6cb55c3483634b026 a foaf:Person
      oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095
      oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string
      oav:ranking "6"^^xsd:integer
  oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person
      foaf:firstName "T"^^xsd:string
      foaf:lastName "Bere"^^xsd:string
      foaf:name "Bere T"^^xsd:string
  …

Phase 1: LOD Production

Specify vocabularies (core entities, linking entities)

OpenAIRE data as RDF Graph

Entity type      Number of entities
Organizations    68526
Results          17414766
Persons          62958315
Datasources      19443
Projects         624417
(including duplicates connected with sameAs)

Total Number of Triples: 1013527855
Distinct Entities: 98256

Phase 2: Interlinking OA-RDF Graph to LOD cloud

Steps
• Identify datasets to be interlinked to
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in OA workflow

[Figure: excerpt of the LOD cloud; legible labels include DBLP, CiteSeer and CEUR]

[Figure: the OA RDF graph excerpt shown above, published as OA LOD within the Linked Open Data (LOD) cloud]

http://beta.lod.openaire.eu

RDF (Resource Description Framework)

Resource: anything uniquely identifiable
Description: description of a resource via representing properties and relations
Framework: web-based protocols and semantics
RDF triples: list of statements

  Subject (URI)       Predicate (URI)    Object (URI or Literal)
  oad:publication1    oav:hasAuthor      "Juan Carlos García"
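To make the triple model concrete, here is a minimal sketch of a list of triples with wildcard matching, which is roughly what a triple pattern in a query does. The second publication and its author are invented sample data, and `match` is a hypothetical helper rather than part of any RDF library.

```python
# A triple is a (subject, predicate, object) tuple; None acts as a wildcard.
TRIPLES = [
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
    ("oad:publication1", "rdf:type", "cerif:ResultEntity"),
    ("oad:publication2", "oav:hasAuthor", "Jane Doe"),  # invented sample data
]


def match(triples, subject=None, predicate=None, obj=None):
    """Return all triples that agree with every non-None position."""
    pattern = (subject, predicate, obj)
    return [
        triple
        for triple in triples
        if all(want is None or want == got for want, got in zip(pattern, triple))
    ]


authors = [o for (_, _, o) in match(TRIPLES, predicate="oav:hasAuthor")]
print(authors)
```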

RDF version of example

  PREFIX dcterms: <http://purl.org/dc/terms/>
  …
  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
  PREFIX prov: <http://www.w3.org/ns/prov#>

  od_______908… rdf:type cerif:ResultEntity
      dcterms:description "The first confirmed case"
      dcterms:publisher "The American Society of Tropical Medicine and Hygiene"
      …
      oav:resultSubject "Articles"
      oav:dateOfCollection 2015-02-06

Example of data about Linking entities

An entity of type Person_Result whose ranking property can have the value 1 to indicate the first author:

  od_______908f39…1c4a    PersonResult    od_______908fa3b453
  rdf:type foaf:Person    oav:rank 1      rdf:type cerif:ResultEntity

How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)

• Query language of RDF-based data
• SPARQL endpoint: RDF-triple database on a server available on the Web
• Pattern matching language
• Protocol layer
• Query interface
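The protocol layer can be sketched with the Python standard library: a query is sent to the endpoint as the `query` parameter of an HTTP GET request. The endpoint URL is the one given in the hands-on part and may no longer be reachable; `build_sparql_request` is a hypothetical helper, and the request is only built here, not sent.

```python
from urllib.parse import parse_qs, urlencode, urlsplit
from urllib.request import Request

ENDPOINT = "http://beta.lod.openaire.eu/sparql"  # endpoint named in the hands-on part


def build_sparql_request(endpoint, query):
    """Build (but do not send) a SPARQL protocol GET request."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


query = "SELECT ?s WHERE { ?s ?p ?o } LIMIT 1"
request = build_sparql_request(ENDPOINT, query)
# Decode the URL again to check that the query survives the encoding.
sent_query = parse_qs(urlsplit(request.full_url).query)["query"][0]
print(request.get_method(), request.full_url)
# urllib.request.urlopen(request) would send it; omitted so the sketch runs offline.
```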

How to query

• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL via SELECT statement
  Example: SELECT ?title ?author
• Return as a table

  ?title                                                         ?author
  A Patient from Argentina Infected with Rickettsia massiliae    Juan Carlos García
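How variable bindings become a table can be shown with a short sketch. It assumes the endpoint answers in the W3C SPARQL Query Results JSON format; the sample response is hand-written from the row on the slide, and `to_rows` is a hypothetical helper.

```python
import json

# Hand-written sample in the SPARQL Query Results JSON layout.
RESPONSE = """
{
  "head": {"vars": ["title", "author"]},
  "results": {"bindings": [
    {"title": {"type": "literal",
               "value": "A Patient from Argentina Infected with Rickettsia massiliae"},
     "author": {"type": "literal", "value": "Juan Carlos García"}}
  ]}
}
"""


def to_rows(response_text):
    """Return (column names, list of row tuples); unbound variables become None."""
    document = json.loads(response_text)
    columns = document["head"]["vars"]
    rows = [
        tuple(binding.get(name, {}).get("value") for name in columns)
        for binding in document["results"]["bindings"]
    ]
    return columns, rows


columns, rows = to_rows(RESPONSE)
print(" | ".join(columns))
for row in rows:
    print(" | ".join(str(cell) for cell in row))
```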

OpenAIRE as LOD

• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities

http://beta.lod.openaire.eu

Data conforming to LOD best practices published in BETA, December 2015

Main entities, linking entities

http://beta.lod.openaire.eu

Sample query: number of projects at each funding level

  select (count(distinct ?s) as ?count) ?flevel
  from <test>
  from <relationsTest>
  where {
    ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
       <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
  }
  group by ?flevel
  order by ?count

General architecture

[Figure: architecture diagram. Labels: OpenAIRE Metadata, OA Data Model, OA Vocabulary, RDFization, Interlinking, Deduplication & Inference, RDF Store, Apache Solr, LOD Client, Browser; output formats HTML and RDF; sites https://www.openaire.eu and http://beta.lod.openaire.eu]

Interlinking OpenAIRE RDF Graph to LOD cloud

Steps
• Identify datasets to be interlinked to
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in OA workflow

[Figure: the OA RDF graph published as OA LOD and linked into the Linked Open Data (LOD) cloud; legible labels include DBLP, CiteSeer and CEUR]

http://beta.lod.openaire.eu

OA LOD interlinking workflow

Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([Properties]))
• Store in HDFS
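The pruning and key-value steps might look roughly like the sketch below. It runs in plain Python rather than on Hadoop, and the set of kept properties, the sample records and the helper `to_key_value` are all assumptions made for illustration.

```python
# Hypothetical set of properties worth keeping for link discovery.
KEEP = {"title", "year", "authors"}


def to_key_value(record, id_field="id"):
    """Prune a metadata record and return it as (key, [(property, value), ...])."""
    key = record[id_field]
    properties = sorted(
        (name, value)
        for name, value in record.items()
        if name in KEEP and value not in (None, "", [])
    )
    return key, properties


# Invented sample records standing in for entries of a dataset dump.
records = [
    {"id": "r1", "title": "Paper A", "year": 1996, "layout": "two-column", "authors": ["X"]},
    {"id": "r2", "title": "Paper B", "year": None, "authors": []},
]
pairs = [to_key_value(record) for record in records]
for key, properties in pairs:
    print(key, properties)
```

Each resulting pair has the shape key (ID), value ([Properties]), so it could be written out as one line per entity before being stored.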

Sample interlinking result

Result of interlinking is a set of links between URIs from source and target dataset.

DBLP dump is not complete.

  <http://lod.openaire/bde783> owl:sameAs <http://dblp.l3s/BoissonnatN96>
  <http://lod.openaire/4f8964> owl:sameAs <http://dblp.l3s/Shrobe96>
  <http://lod.openaire/27fea2> owl:sameAs <http://dblp.l3s/X96c>
  <http://lod.openaire/f433b9> owl:sameAs <http://dblp.l3s/LiroyG96>
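A link set of this shape is easy to load for further checks. The sketch below assumes one link per line in the form `<source> owl:sameAs <target>`; the URIs under example.org are placeholders, not entries of the real link set, and `parse_links` is a hypothetical helper.

```python
import re

# One link per line: <source URI> owl:sameAs <target URI>
LINK_PATTERN = re.compile(r"<([^>]+)>\s+owl:sameAs\s+<([^>]+)>")


def parse_links(text):
    """Return the set of (source, target) URI pairs found in the text."""
    return set(LINK_PATTERN.findall(text))


# Placeholder URIs under example.org; not taken from the real link set.
SAMPLE = """
<http://example.org/oa/a1> owl:sameAs <http://example.org/dblp/X96>
<http://example.org/oa/b2> owl:sameAs <http://example.org/dblp/Y96>
"""
links = parse_links(SAMPLE)
print(len(links), "links")
```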

Ideas for LOD in Monitoring

Monitoring interlinking: when the target dataset grows from one version to another, we can expect the link set to grow as well.
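One simple way to monitor this expectation is to compare the link sets of two interlinking runs. The sketch below uses invented (source, target) pairs, and `compare_linksets` is a hypothetical helper, not part of the OpenAIRE tooling.

```python
def compare_linksets(old_links, new_links):
    """Summarise how a link set changed between two versions."""
    old_links, new_links = set(old_links), set(new_links)
    return {
        "added": new_links - old_links,
        "removed": old_links - new_links,
        "growth": len(new_links) - len(old_links),
    }


# Invented (source, target) pairs standing in for two interlinking runs.
version_1 = {("oa:1", "dblp:A"), ("oa:2", "dblp:B")}
version_2 = {("oa:1", "dblp:A"), ("oa:2", "dblp:B"), ("oa:3", "dblp:C")}
report = compare_linksets(version_1, version_2)
print(report["growth"], sorted(report["added"]))
```

A negative growth or a non-empty `removed` set after the target dataset has grown would be a signal worth inspecting.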

Scientific events

Bootstrapping datasets for scientific events
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in OA Data Model (Conference Object)

Measure the quality of events
• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …

Hands on

http://beta.lod.openaire.eu/sparql

Example: What is the overall research output of a given project?

Note: oav:produces and UNION are not working.

  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX oav: <http://lod.openaire.eu/vocab/>
  PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>

  SELECT ?x ?y WHERE {
    ?y a cerif:ResultEntity .
    { ?y oav:resultType "dataset" }
    UNION
    { ?y oav:resultType "publication" }
    ?x a cerif:Project .
    ?y cerif:linkToProject ?x .
  }
  LIMIT 10

Example: Which organizations are more active than others with respect to projects?

  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX oav: <http://lod.openaire.eu/vocab/>
  PREFIX foaf: <http://xmlns.com/foaf/0.1/>

  SELECT ?o
  WHERE {
    ?x oav:projectOrganization ?o .
    ?o a foaf:Organization .
    ?y oav:projectOrganization ?o2 .
    ?o2 a foaf:Organization .
    FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
  }
  LIMIT 10
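Ranking organizations by activity amounts to counting distinct projects per organization. As a minimal sketch, assuming the endpoint has returned (project, organization) pairs for `oav:projectOrganization`, the counting can be done on the client side; the pairs below are invented and `rank_organizations` is a hypothetical helper.

```python
from collections import Counter

# Invented (project, organization) pairs, as a query on oav:projectOrganization might return.
PAIRS = [
    ("project1", "orgA"),
    ("project2", "orgA"),
    ("project2", "orgB"),
    ("project3", "orgA"),
    ("project1", "orgA"),  # duplicate row, counted once
]


def rank_organizations(pairs):
    """Count distinct projects per organization, most active first."""
    counts = Counter(org for _, org in set(pairs))
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


ranking = rank_organizations(PAIRS)
print(ranking)
```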

Example: What datasets have been published by a specific person who is involved in a given project?

  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX oav: <http://lod.openaire.eu/vocab/>
  PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
  PREFIX dcterms: <http://purl.org/dc/terms/>
  PREFIX foaf: <http://xmlns.com/foaf/0.1/>

  SELECT ?y
  WHERE {
    ?p cerif:linksToPerson ?x .
    ?x a foaf:Person .
    ?x dcterms:creator ?y .
    ?y oav:resultType "dataset" .
  }
  LIMIT 10

Example: List the full names of all authors who have (co-)authored a publication in project P

  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX oav: <http://lod.openaire.eu/vocab/>
  PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
  PREFIX dcterms: <http://purl.org/dc/terms/>
  PREFIX foaf: <http://xmlns.com/foaf/0.1/>

  SELECT ?y
  WHERE {
    ?p cerif:linksToPerson ?x .
    ?x a foaf:Person .
    ?x dcterms:creator ?y .
    ?y oav:resultType "dataset" .
  }
  LIMIT 10

Page 5: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Core entities

Linking entities

OpenAIRE Data Model

Example of data about Core Entities

Entity type Result

openaireID od_______908fac3db85bbcb1f52ae07c5868d8fb453

dateOfTransformation 2015-02-06dateOfCollection 2015-02-06

titleA Patient from Argentina Infected with Rickettsia massiliae

Dateofacceptance 01042010Publisher The American Society of Tropical Medicine and Hygiene

Pid oaieuropepmcorg2077077PMC2844561Language EnglishSubject Articles

BestLicense Open Acces

An entity of type Result

Interlink to other databasesSupport researchers by answering interesting queries

The OpenAIRE vision

bull Data about scientific events emergence of scientific topics

bull Data about people affiliation impact of certain research

Use cases

bull Research managers use new indicators for measuring the quality bull Policy makers get a quick overview of the findings and projectsbull Researchers find comprehensive citations list research movement between

communitiesorganizationsbull Reviewers get a quick overview of the field covered by the paper or dataset under review

Challenges supported by LOD Services

Linked Open Data(LOD)

RDF data model

Publishing the OpenAIRE data as Linked Open Data and linking it to related datasets

bull Diverse data formatsbull Various means to accessquery databull Use of different identifiersbull Heterogeneity of metadata schemas

Expected valuesbull Open up a window to the Linked Open Data Webbull Increase the OpenAIRE technical interoperability

bull Increase the reusability of the OpenAIRE research metadatabull Engage with additional user communities

bull Explore synergies with and added value to related open content initiatives

bull Provide links through LOD to similar infrastructuresbull Offer new services for OA data monitoring activitiesbull Provide services to export the OpenAIRE objects as a LOD graphbull Facilitate integration with other LOD graphs relative to similar systems and

infrastructuresbull Find patterns to enrich the OpenAIRE information space

Exposing the OpenAIRE Information Space as linked data

Towards OpenAIRE LOD Services

Phase 1 LOD Production

Phase 1 Interlinking OpenAIRE RDF Graph to LOD cloud

Steps

bullSpecify an RDF vocabulary bullSpecify terms and namespacesbullMap the OA data model to an RDF data modelbullMap the OA data to an statistic RDF dumpbullSpecify strategies to automate the RDF generation

OA RDF graph

hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson

foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson

oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip

OpenAIRE data

OA RDF

Phase 1 LOD Production

Core entitiesLinking entities

Specify vocabularies

Organizations Results Persons Datasources Projects

68526 17414766 62958315 19443 624417

including duplicates connected with sameAs

Total Number of Triples 1013527855 Distinct Entities 98256

OpenAIRE data as RDF Graph

StepsbullIdentify datasets to be interlinked to bullSelect interlinking tools LIMES SilkbullTest interlinking OA with DBLP and DBpediabullEvaluate resulting link setsbullSpecify strategy for interlinking in OA workflow

DBLP

CiteSeer

CEUR Ope

Pu

lAK A

Phase2 Interlinking OA-RDF Graph to LOD cloud

hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyoad07553d8e646b69b868a9791da39a1802 a foafPerson

foaffirstName P^^xsdstring foaflastName Jha^^xsdstringfoafname Jha P^^xsdstring oavisAuthorOf

oad755469c995c2cb6cb55c3483634b026 a foafPersonoavhasTarget

resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 oavhasLabel personResult_authorship_isAuthorOf^^xsdstring oavranking 6^^xsdinteger

OA LOD

Linked Open Data(LOD)

httpbetalodopenaireeu

RDF (Resource Description Framework)

Resource anything uniquely identifiable Description description of resource via representing properties and relations Framework web-based protocols and semanticsRDF triples List of statements

Subject (URI)Predicate (URI)

Object (URI or Literal)

oadpublication1

ldquoJuan Carlos Garciacutealdquo

oavhasAuthor

RDF version of example

PREFIX dcterms: <http://purl.org/dc/terms/>
…
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX prov: <http://www.w3.org/ns/prov#>

od_______908… rdf:type cerif:ResultEntity ;
    dcterms:description "The first confirmed case" ;
    dcterms:publisher "The American Society of Tropical Medicine and Hygiene" ;
    …
    oav:resultSubject "Articles" ;
    oav:dateOfCollection 2015-02-06

Example of data about Linking entities

An entity of type Person_Result, whose ranking property can have the value 1 to indicate the first author:

od_______908f39…1c4a    PersonResult    od_______908fa3b453

rdf:type foaf:Person, oav:rank 1
rdf:type cerif:ResultEntity
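To illustrate why the ranking sits on the linking entity rather than on the person, here is a small sketch that recovers the author order of a result from a handful of Person_Result links. The `PersonResult` class and the identifiers `personA`, `personB`, `result1` and `result2` are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PersonResult:
    """A linking entity: one person's authorship of one result."""
    person: str
    result: str
    ranking: int  # 1 marks the first author


def authors_in_order(links, result):
    """Return the persons linked to `result`, sorted by their ranking."""
    relevant = [link for link in links if link.result == result]
    return [link.person for link in sorted(relevant, key=lambda link: link.ranking)]


links = [
    PersonResult("personB", "result1", 2),
    PersonResult("personA", "result1", 1),
    PersonResult("personA", "result2", 3),
]
ordered = authors_in_order(links, "result1")
```

The same person can hold a different ranking in each result, which is why the value cannot be stored on the person itself.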

How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)

• Query language for RDF-based data
• SPARQL endpoint: an RDF triple database on a server, available on the Web
• Pattern matching language
• Protocol layer
• Query interface
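The protocol layer can be sketched without contacting any server: under the SPARQL Protocol, a query may be sent as an HTTP GET request with the query text in the `query` parameter. The snippet below only builds that URL. The endpoint address is the one given in the hands-on part of this session; the function name and the generic query are illustrative.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Endpoint address taken from the hands-on part of this session.
ENDPOINT = "http://beta.lod.openaire.eu/sparql"


def build_query_url(endpoint: str, query: str) -> str:
    """Build a SPARQL Protocol GET URL: the query text travels in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


query = "SELECT ?s WHERE { ?s ?p ?o } LIMIT 10"
url = build_query_url(ENDPOINT, query)

# Decoding the URL again yields the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]
```

A client would typically also send an `Accept` header such as `application/sparql-results+json` to choose the result format; whether this endpoint supports that format is not stated on the slides.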

How to query

• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL: results are requested with a SELECT statement

Example: SELECT ?title ?author

• Results are returned as a table

?title                                                         ?author
A Patient from Argentina Infected with Rickettsia massiliae    Juan Carlos García
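How variables get bound to RDF terms can be sketched with a tiny in-memory matcher. This is not how a real SPARQL engine is implemented; it only shows the idea of matching triple patterns and collecting the bindings as table rows. The `dcterms:title` predicate is an assumption made for this example, and the two triples are written with prefixed names for brevity.

```python
TRIPLES = [
    ("oad:publication1", "dcterms:title",
     "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
]


def match(pattern, triple, binding):
    """Extend `binding` so that `pattern` matches `triple`; return None if impossible."""
    result = dict(binding)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            # A variable: bind it, or check it against an earlier binding.
            if result.setdefault(term, value) != value:
                return None
        elif term != value:
            # A constant that does not match.
            return None
    return result


def select(variables, patterns, triples):
    """Evaluate a list of triple patterns and return one row per solution."""
    bindings = [{}]
    for pattern in patterns:
        bindings = [extended
                    for binding in bindings
                    for triple in triples
                    if (extended := match(pattern, triple, binding)) is not None]
    return [tuple(binding[v] for v in variables) for binding in bindings]


rows = select(
    ["?title", "?author"],
    [("?pub", "dcterms:title", "?title"), ("?pub", "oav:hasAuthor", "?author")],
    TRIPLES,
)
```

Because both patterns share the variable `?pub`, a title and an author end up in the same row only when they belong to the same publication.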

OpenAIRE as LOD

• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities

http://beta.lod.openaire.eu
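A dereferenceable URI can be looked up over HTTP, and a client usually states the representation it wants through the `Accept` header. The sketch below only prepares such a request and does not send it. The entity URI is a hypothetical placeholder, and support for `text/turtle` by the OpenAIRE service is an assumption.

```python
from urllib.request import Request


def rdf_request(entity_uri: str, media_type: str = "text/turtle") -> Request:
    """Prepare (but do not send) a GET request asking for an RDF representation."""
    return Request(entity_uri, headers={"Accept": media_type})


# Hypothetical entity URI under the oad: namespace; not a real identifier.
req = rdf_request("http://lod.openaire.eu/data/result/EXAMPLE_ID")
accept = req.get_header("Accept")
method = req.get_method()
```

Passing the prepared request to `urllib.request.urlopen` would perform the lookup. A browser sending `Accept: text/html` to the same URI would be expected to receive the HTML view used for interactive browsing.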

Steps

• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation

Data conforming to LOD best practices was published in BETA in December 2015.
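The mapping step can be sketched as a lookup from record fields to RDF properties, followed by serialization to N-Triples. The field-to-property table, the `result/` path under the data namespace, and the function names are assumptions made for illustration; the actual OpenAIRE mapping may differ. The record values are taken from the Result example in this session.

```python
# Hypothetical mapping from OpenAIRE record fields to RDF properties.
FIELD_TO_PROPERTY = {
    "title": "http://purl.org/dc/terms/title",
    "publisher": "http://purl.org/dc/terms/publisher",
    "dateOfCollection": "http://lod.openaire.eu/vocab/dateOfCollection",
}
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RESULT_CLASS = "http://www.eurocris.org/ontologies/cerif/1.3#ResultEntity"
DATA_NS = "http://lod.openaire.eu/data/result/"  # assumed URI layout


def escape(value: str) -> str:
    """Escape backslashes and double quotes for an N-Triples literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"')


def rdfize(record: dict) -> list:
    """Turn one Result record into N-Triples lines; unmapped fields are skipped."""
    subject = f"<{DATA_NS}{record['openaireID']}>"
    lines = [f"{subject} <{RDF_TYPE}> <{RESULT_CLASS}> ."]
    for field, prop in FIELD_TO_PROPERTY.items():
        if field in record:
            lines.append(f'{subject} <{prop}> "{escape(record[field])}" .')
    return lines


record = {
    "openaireID": "od_______908fac3db85bbcb1f52ae07c5868d8fb453",
    "title": "A Patient from Argentina Infected with Rickettsia massiliae",
    "publisher": "The American Society of Tropical Medicine and Hygiene",
    "dateOfCollection": "2015-02-06",
    "language": "English",  # no mapping defined above, so it is left out
}
lines = rdfize(record)
```

Keeping the mapping in a table rather than in code is one way to make the RDF generation easier to automate when the vocabulary changes.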

Main entities
Linking entities

http://beta.lod.openaire.eu

OA RDF graph

…
prefix oad: <http://lod.openaire.eu/data/>
prefix oav: <http://lod.openaire.eu/vocab/>
prefix dbpedia-owl: <http://dbpedia.org/ontology/>
prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#>
prefix pext: <http://www.ontotext.com/proton-ontology#>
prefix swrc: <http://swrc.ontoware.org/ontology#>

oad:07553d8e646b69b868a9791da39a1802 a foaf:Person
    foaf:firstName "P"^^xsd:string
    foaf:lastName "Jha"^^xsd:string
    foaf:name "Jha P"^^xsd:string
    oav:isAuthorOf
oad:755469c995c2cb6cb55c3483634b026 a foaf:Person
    oav:hasTarget result:doajarticles_6fcd7b3b47ebbd05ce73018731ff9095
    oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string
    oav:ranking "6"^^xsd:integer
oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person
    foaf:firstName "T"^^xsd:string
    foaf:lastName "Bere"^^xsd:string
    foaf:name "Bere T"^^xsd:string
…

[Figure labels: OpenAIRE data, OA RDF]

Sample query

select (count(distinct ?s) as ?count) ?flevel
from <test>
from <relationsTest>
where {
  ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
     <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
}
GROUP BY ?flevel
order by ?count

Number of projects for each funding level
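What the aggregation does can be sketched over a few in-memory triples: keep the subjects typed as projects, pair each with its funding level, count distinct subjects per level, and sort by the count. The sample data, including the level names `FP7` and `H2020` as values of `fundingLevel0`, is made up for illustration.

```python
from collections import Counter

# Made-up sample data using prefixed names for brevity.
TRIPLES = [
    ("project1", "rdf:type", "cerif:Project"),
    ("project1", "oav:fundingLevel0", "FP7"),
    ("project2", "rdf:type", "cerif:Project"),
    ("project2", "oav:fundingLevel0", "FP7"),
    ("project3", "rdf:type", "cerif:Project"),
    ("project3", "oav:fundingLevel0", "H2020"),
    ("result1", "rdf:type", "cerif:ResultEntity"),
    ("result1", "oav:fundingLevel0", "FP7"),  # not a project, so it is ignored
]


def projects_per_funding_level(triples):
    """Count distinct projects per funding level, ordered by ascending count."""
    projects = {s for s, p, o in triples
                if p == "rdf:type" and o == "cerif:Project"}
    # A set of (project, level) pairs makes the count distinct per subject.
    pairs = {(s, o) for s, p, o in triples
             if p == "oav:fundingLevel0" and s in projects}
    counts = Counter(level for _, level in pairs)
    return sorted(counts.items(), key=lambda item: item[1])


table = projects_per_funding_level(TRIPLES)
```

The type check plays the role of the `?s a ...Project` pattern, and the final sort corresponds to `order by ?count`.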

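General architecture

[Figure: architecture diagram with the components OpenAIRE Metadata, RDFization, Interlinking, RDF Store, Deduplication & Inference, Apache Solr, LOD Client, OA Vocabulary, OA Data Model and Browser; the two sites shown are https://www.openaire.eu and http://beta.lod.openaire.eu; the connections are labeled HTML and RDF]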


OA LOD interlinking workflow

Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([properties]))
• Store in HDFS
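The pruning and key-value steps can be sketched for a single record. The set of fields to keep, the field names, and the sample record are hypothetical. In the workflow described above the resulting pairs would be written to HDFS for a Hadoop job, which is outside the scope of this sketch.

```python
# Hypothetical choice of fields that are useful for interlinking.
KEEP_FIELDS = {"title", "year", "abstract"}


def to_key_value(record: dict, id_field: str = "id"):
    """Prune a metadata record and return it as (ID, properties)."""
    properties = {
        # Lowercase and collapse whitespace so later comparisons are more robust.
        name: " ".join(str(value).lower().split())
        for name, value in record.items()
        if name in KEEP_FIELDS and value not in (None, "")
    }
    return record[id_field], properties


record = {
    "id": "record-1",
    "title": "  Some   Example Title ",
    "year": 1996,
    "abstract": "",             # empty, so it is pruned
    "crawl_timestamp": "n/a",   # not useful for interlinking, so it is pruned
}
key, value = to_key_value(record)
```

Keying every record by its ID lets a distributed job group and compare candidates without reloading the full dumps.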

Sample interlinking result

The result of interlinking is a set of links between URIs from the source and target datasets.

Note: the DBLP dump is not complete.

<http://lod.openaire.../bde783> owl:sameAs <http://dblp.l3s.../BoissonnatN96>
<http://lod.openaire.../4f8964> owl:sameAs <http://dblp.l3s.../Shrobe96>
<http://lod.openaire.../27fea2> owl:sameAs <http://dblp.l3s.../X96c>
<http://lod.openaire.../f433b9> owl:sameAs <http://dblp.l3s.../LiroyG96>
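How such a link set might be produced can be sketched with a simple title comparison. This is not the algorithm used by LIMES or Silk; it swaps in Python's `difflib` string similarity and a made-up threshold to illustrate the idea. All URIs and the second title in each dataset are invented for the example.

```python
from difflib import SequenceMatcher

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def normalize(title: str) -> str:
    """Lowercase and collapse whitespace before comparing titles."""
    return " ".join(title.lower().split())


def discover_links(source: dict, target: dict, threshold: float = 0.9) -> list:
    """Compare every source title with every target title; emit sameAs links."""
    links = []
    for s_uri, s_title in source.items():
        for t_uri, t_title in target.items():
            score = SequenceMatcher(None, normalize(s_title), normalize(t_title)).ratio()
            if score >= threshold:
                links.append(f"<{s_uri}> <{OWL_SAME_AS}> <{t_uri}> .")
    return links


source = {
    "http://example.org/oa/1": "A Patient from Argentina Infected with Rickettsia massiliae",
    "http://example.org/oa/2": "Linked Data for Libraries",
}
target = {
    "http://example.org/dblp/a": "A patient from Argentina infected with  Rickettsia massiliae.",
    "http://example.org/dblp/b": "Quantum Computing Basics",
}
links = discover_links(source, target)
```

Comparing all pairs is quadratic in the dataset sizes, which is one reason dedicated link discovery tools and a preprocessing step are used at this scale.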


Ideas for LOD in Monitoring

Monitoring interlinking: when the target dataset grows from one version to the next, we can expect the link set to grow as well.
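One possible way to monitor this, sketched under the assumption that each link set is available as a collection of (source URI, target URI) pairs, is to compare the link sets of two versions. The short identifiers below are placeholders.

```python
def compare_linksets(old_links, new_links) -> dict:
    """Report which links were added or removed between two versions."""
    old, new = set(old_links), set(new_links)
    return {
        "added": new - old,
        "removed": old - new,
        "growth": len(new) - len(old),
    }


# Placeholder identifiers standing in for full URIs.
version_1 = [("oa:1", "dblp:a"), ("oa:2", "dblp:b")]
version_2 = [("oa:1", "dblp:a"), ("oa:2", "dblp:b"), ("oa:3", "dblp:c")]
report = compare_linksets(version_1, version_2)
```

A negative growth or a non-empty `removed` set after the target dataset has grown would be a signal worth investigating.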

Scientific events

Bootstrapping datasets for scientific events
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in the OA Data Model (Conference Object)

Measure the quality of events
• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …

Hands on

http://beta.lod.openaire.eu/sparql

Example: What is the overall research output of a given project?

Note: oav:produces and UNION are not working.

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>

SELECT ?x ?y WHERE {
  ?y a cerif:ResultEntity .
  { ?y oav:resultType "dataset" }
  UNION
  { ?y oav:resultType "publication" }
  ?x a cerif:Project .
  ?y cerif:linkToProject ?x .
}
LIMIT 10

Example: What organizations are more active than others w.r.t. projects?

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?o WHERE {
  ?x oav:projectOrganization ?o .
  ?o a foaf:Organization .
  ?y oav:projectOrganization ?o2 .
  ?o2 a foaf:Organization .
  FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
}
LIMIT 10

Example: What datasets have been published by a specific person who is involved in a given project?

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?y WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10

Example: List the full names of all authors who have (co-)authored a publication in project P

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?y WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10


Example of data about Core Entities

An entity of type Result

Entity type: Result
openaireID: od_______908fac3db85bbcb1f52ae07c5868d8fb453
dateOfTransformation: 2015-02-06
dateOfCollection: 2015-02-06
title: A Patient from Argentina Infected with Rickettsia massiliae
Dateofacceptance: 01/04/2010
Publisher: The American Society of Tropical Medicine and Hygiene
Pid: oai:europepmc.org:2077077, PMC2844561
Language: English
Subject: Articles
BestLicense: Open Access

The OpenAIRE vision

• Interlink to other databases
• Support researchers by answering interesting queries
• Data about scientific events: emergence of scientific topics
• Data about people: affiliation, impact of certain research

Use cases

• Research managers: use new indicators for measuring quality
• Policy makers: get a quick overview of the findings and projects
• Researchers: find comprehensive citation lists and research movement between communities/organizations
• Reviewers: get a quick overview of the field covered by the paper or dataset under review

Challenges supported by LOD Services

• Diverse data formats
• Various means to access/query data
• Use of different identifiers
• Heterogeneity of metadata schemas

Linked Open Data (LOD), RDF data model: publishing the OpenAIRE data as Linked Open Data and linking it to related datasets

Expected values

• Open up a window to the Linked Open Data Web
• Increase the OpenAIRE technical interoperability
• Increase the reusability of the OpenAIRE research metadata
• Engage with additional user communities
• Explore synergies with and added value to related open content initiatives
• Provide links through LOD to similar infrastructures
• Offer new services for OA data monitoring activities
• Provide services to export the OpenAIRE objects as a LOD graph
• Facilitate integration with other LOD graphs relative to similar systems and infrastructures
• Find patterns to enrich the OpenAIRE information space

Exposing the OpenAIRE Information Space as linked data

Towards OpenAIRE LOD Services

• Phase 1: LOD Production
• Phase 2: Interlinking the OpenAIRE RDF Graph to the LOD cloud

Page 7: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Interlink to other databasesSupport researchers by answering interesting queries

The OpenAIRE vision

bull Data about scientific events emergence of scientific topics

bull Data about people affiliation impact of certain research

Use cases

bull Research managers use new indicators for measuring the quality bull Policy makers get a quick overview of the findings and projectsbull Researchers find comprehensive citations list research movement between

communitiesorganizationsbull Reviewers get a quick overview of the field covered by the paper or dataset under review

Challenges supported by LOD Services

Linked Open Data(LOD)

RDF data model

Publishing the OpenAIRE data as Linked Open Data and linking it to related datasets

bull Diverse data formatsbull Various means to accessquery databull Use of different identifiersbull Heterogeneity of metadata schemas

Expected valuesbull Open up a window to the Linked Open Data Webbull Increase the OpenAIRE technical interoperability

bull Increase the reusability of the OpenAIRE research metadatabull Engage with additional user communities

bull Explore synergies with and added value to related open content initiatives

bull Provide links through LOD to similar infrastructuresbull Offer new services for OA data monitoring activitiesbull Provide services to export the OpenAIRE objects as a LOD graphbull Facilitate integration with other LOD graphs relative to similar systems and

infrastructuresbull Find patterns to enrich the OpenAIRE information space

Exposing the OpenAIRE Information Space as linked data

Towards OpenAIRE LOD Services

Phase 1 LOD Production

Phase 1 Interlinking OpenAIRE RDF Graph to LOD cloud

Steps

bullSpecify an RDF vocabulary bullSpecify terms and namespacesbullMap the OA data model to an RDF data modelbullMap the OA data to an statistic RDF dumpbullSpecify strategies to automate the RDF generation

OA RDF graph

hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson

foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson

oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip

OpenAIRE data

OA RDF

Phase 1 LOD Production

Core entitiesLinking entities

Specify vocabularies

Organizations Results Persons Datasources Projects

68526 17414766 62958315 19443 624417

including duplicates connected with sameAs

Total Number of Triples 1013527855 Distinct Entities 98256

OpenAIRE data as RDF Graph

StepsbullIdentify datasets to be interlinked to bullSelect interlinking tools LIMES SilkbullTest interlinking OA with DBLP and DBpediabullEvaluate resulting link setsbullSpecify strategy for interlinking in OA workflow

DBLP

CiteSeer

CEUR Ope

Pu

lAK A

Phase2 Interlinking OA-RDF Graph to LOD cloud

hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyoad07553d8e646b69b868a9791da39a1802 a foafPerson

foaffirstName P^^xsdstring foaflastName Jha^^xsdstringfoafname Jha P^^xsdstring oavisAuthorOf

oad755469c995c2cb6cb55c3483634b026 a foafPersonoavhasTarget

resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 oavhasLabel personResult_authorship_isAuthorOf^^xsdstring oavranking 6^^xsdinteger

OA LOD

Linked Open Data(LOD)

httpbetalodopenaireeu

RDF (Resource Description Framework)

Resource anything uniquely identifiable Description description of resource via representing properties and relations Framework web-based protocols and semanticsRDF triples List of statements

Subject (URI)Predicate (URI)

Object (URI or Literal)

oadpublication1

ldquoJuan Carlos Garciacutealdquo

oavhasAuthor

RDF version of example

PREFIX dcterms lthttppurlorgdctermsgthellipPREFIX rdf lthttpwwww3org19990222-rdf-syntax-nsgtPREFIX cerif lthttpwwweurocrisorgontologiescerif13gtPREFIX prov lthttpwwww3orgnsprov

od_______908hellip rdftype cerifResultEntitydctermsdescription ldquo The first confirmed case ldquodctermspublisher ldquoThe American Society of

Tropical Medicine and Hygienerdquo hellipoavresultSubject ldquoArticlesldquooavdateOfCollection 2015-02-06

Example of data about Linking entitiesAn entity of type Person_Result whose ranking property can have the value 1 to indicate the first author

od_______908f39hellip1c4a PersonResult od_______908fa3b453

RdftypefoafPersonoavrank 1

RdftypecerifResultEntity

How to query RDF SPARQL (Protocol and RDF Query Language)

bullQuery language of RDF-based databullSPARQL endpoint RDF-triple database on a server available on the WebbullPattern matching languagebullProtocol layerbullQuery interface

How to query

bullSPARQL variables are bound to RDF terms eg title authorbullInspired by SQL via SELECT statement

Example SELECT title author

bullReturn as a table

title authorA Patient from Argentina Infected with Rickettsia

massiliae Juan Carlos Garciacutea

OpenAIRE as LOD

bull OA LOD in BETA versionbull Triples per entitybull Online data SPARQL endpointbull Offline data RDF dumpbull Entities and URIs (interactive

browsing)bull Dereferenceable URIs for all

entities

httpwww betalodopenaireeu

Steps

bullSpecify an RDF vocabulary bullSpecify terms and namespacesbullMap the OA data model to an RDF data modelbullMap the OA data to an statistic RDF dumpbullSpecify strategies to automate the RDF generation

Data conforming to LOD best practices published in BETA

December 2015

Main entitiesLinking entities

httpbetalodopenaireeu

OA RDF graph

hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson

foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson

oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip

OpenAIRE data

OA RDF

Sample queryselect (count (distinct s) as count) flevel from lttestgt from ltrelationsTestgt where s a lthttpwwweurocrisorgontologiescerif13Projectgt lthttplodopenaireeuvocabfundingLevel0gt flevel GROUP BY flevel order by count

Number of publications with their corresponding funding level

General architecture

OpenAIRE Metadata

RDFization

Interlinking

RDF Store

Deduplication amp Inference

Apache Solr

httpswwwopenaireeu

LOD Client

httpbetalodopenaireeu

OA Vocabulary

OA Data Model

HTML BrowserHTML HTML RDF

StepsbullIdentify datasets to be interlinked to bullSelect interlinking tools LIMES SilkbullTest interlinking OA with DBLP and DBpediabullEvaluate resulting link setsbullSpecify strategy for interlinking in OA workflow

DBLP

CiteSeer

CEUR Ope

Pu

lAK A

Interlinking OpenAIRE RDF Graph to LOD cloud



Use cases

• Research managers use new indicators for measuring quality
• Policy makers get a quick overview of the findings and projects
• Researchers find comprehensive citation lists and research movement between communities/organizations
• Reviewers get a quick overview of the field covered by the paper or dataset under review

Challenges supported by LOD Services

Publishing the OpenAIRE data as Linked Open Data (LOD), using the RDF data model, and linking it to related datasets

• Diverse data formats
• Various means to access/query data
• Use of different identifiers
• Heterogeneity of metadata schemas

Expected values

Exposing the OpenAIRE Information Space as linked data

• Open up a window to the Linked Open Data Web
• Increase the OpenAIRE technical interoperability
• Increase the reusability of the OpenAIRE research metadata
• Engage with additional user communities
• Explore synergies with and added value to related open content initiatives
• Provide links through LOD to similar infrastructures
• Offer new services for OA data monitoring activities
• Provide services to export the OpenAIRE objects as a LOD graph
• Facilitate integration with other LOD graphs relative to similar systems and infrastructures
• Find patterns to enrich the OpenAIRE information space

Towards OpenAIRE LOD Services

Phase 1: LOD Production

Phase 2: Interlinking OpenAIRE RDF Graph to LOD cloud

Steps

• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation

OA RDF graph

[Figure: OpenAIRE data and the resulting OA RDF]

…
prefix oad: <http://lod.openaire.eu/data/>
prefix oav: <http://lod.openaire.eu/vocab/>
prefix dbpedia-owl: <http://dbpedia.org/ontology/>
prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#>
prefix pext: <http://www.ontotext.com/proton-ontology/>
prefix swrc: <http://swrc.ontoware.org/ontology#>

oad:07553d8e646b69b868a9791da39a1802 a foaf:Person
    foaf:firstName "P"^^xsd:string
    foaf:lastName "Jha"^^xsd:string
    foaf:name "Jha P"^^xsd:string
    oav:isAuthorOf
oad:755469c995c2cb6cb55c3483634b026 a foaf:Person
    oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095
    oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string
    oav:ranking "6"^^xsd:integer
oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person
    foaf:firstName "T"^^xsd:string
    foaf:lastName "Bere"^^xsd:string
    foaf:name "Bere T"^^xsd:string
…

Phase 1: LOD Production

Core entities, Linking entities

Specify vocabularies

OpenAIRE data as RDF Graph

Entity type      Count
Organizations    68526
Results          17414766
Persons          62958315
Datasources      19443
Projects         624417
(including duplicates connected with sameAs)

Total Number of Triples: 1013527855
Distinct Entities: 98256

Phase 2: Interlinking OA-RDF Graph to LOD cloud

Steps

• Identify datasets to be interlinked with
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in the OA workflow

[Figure: the OA RDF graph (OA LOD) linked to Linked Open Data (LOD) cloud datasets such as DBLP, CiteSeer and CEUR; the remaining dataset labels are truncated in the source]

http://beta.lod.openaire.eu

RDF (Resource Description Framework)

Resource: anything uniquely identifiable
Description: description of a resource by representing its properties and relations
Framework: web-based protocols and semantics
RDF triples: list of statements

Subject (URI)       Predicate (URI)    Object (URI or Literal)
oad:publication1    oav:hasAuthor      "Juan Carlos García"
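To make the triple structure concrete, here is a minimal Python sketch that stores the statement from this slide as a subject, predicate, object record. The `Triple` type, the `is_literal` flag and the `describe` helper are illustrative assumptions, not part of any OpenAIRE tool, and prefixed names are kept as plain strings rather than expanded to full URIs.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One RDF statement; the field names are illustrative only."""
    subject: str      # a URI, written here as a prefixed name
    predicate: str    # a URI, written here as a prefixed name
    obj: str          # a URI or a literal value
    is_literal: bool  # True when obj is a literal rather than a URI


# The statement shown on the slide.
statement = Triple("oad:publication1", "oav:hasAuthor", "Juan Carlos García", True)


def describe(t: Triple) -> str:
    """Render a triple roughly the way Turtle would write it."""
    obj = f'"{t.obj}"' if t.is_literal else t.obj
    return f"{t.subject} {t.predicate} {obj} ."


print(describe(statement))
```

The only distinction the sketch tracks is the one the slide makes: subject and predicate are always URIs, while the object may be a URI or a literal.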

RDF version of example

PREFIX dcterms: <http://purl.org/dc/terms/>
…
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX prov: <http://www.w3.org/ns/prov#>

od_______908… rdf:type cerif:ResultEntity ;
    dcterms:description "The first confirmed case" ;
    dcterms:publisher "The American Society of Tropical Medicine and Hygiene" ;
    …
    oav:resultSubject "Articles" ;
    oav:dateOfCollection "2015-02-06" .

Example of data about Linking entities

An entity of type Person_Result whose ranking property can have the value 1 to indicate the first author

od_______908f39…1c4a (rdf:type foaf:Person)
    PersonResult (oav:rank 1)
od_______908fa3b453 (rdf:type cerif:ResultEntity)
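The role of the rank on a Person_Result link can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the identifiers are made up, and the links are modelled as plain (person, result, rank) tuples rather than as RDF.

```python
# (person id, result id, rank) rows standing in for Person_Result linking entities.
# All identifiers are invented for illustration.
authorships = [
    ("person:b", "result:1", 2),
    ("person:a", "result:1", 1),
    ("person:c", "result:1", 3),
]


def authors_in_order(result_id, links):
    """Return the persons linked to result_id, sorted by their rank."""
    ranked = [(rank, person) for person, result, rank in links if result == result_id]
    return [person for rank, person in sorted(ranked)]


def first_author(result_id, links):
    """Return the person whose rank is 1, or None if no such link exists."""
    for person, result, rank in links:
        if result == result_id and rank == 1:
            return person
    return None


print(authors_in_order("result:1", authorships))
print(first_author("result:1", authorships))
```

Keeping the rank on the link rather than on the person is what lets the same person be first author of one result and third author of another.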

How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)

• Query language for RDF-based data
• SPARQL endpoint: an RDF triple database on a server, available on the Web
• Pattern matching language
• Protocol layer
• Query interface
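As a rough illustration of the protocol layer, the sketch below builds an HTTP GET request that carries a query in the `query` parameter, as the SPARQL Protocol defines. It uses only the Python standard library and does not send anything. The endpoint URL is the one given in the hands-on part of this tutorial and may no longer be reachable; `build_request` is a hypothetical helper name.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Endpoint named in the hands-on part of the tutorial (assumed, may have moved).
ENDPOINT = "http://beta.lod.openaire.eu/sparql"


def build_request(query: str, endpoint: str = ENDPOINT) -> Request:
    """Build, but do not send, a SPARQL Protocol query over HTTP GET."""
    url = endpoint + "?" + urlencode({"query": query})
    # Ask for the standard SPARQL JSON results format.
    return Request(url, headers={"Accept": "application/sparql-results+json"})


request = build_request("SELECT ?s WHERE { ?s ?p ?o } LIMIT 5")
print(request.full_url)
# Sending it would be: urllib.request.urlopen(request).read()
```

Building the request separately from sending it keeps the example runnable offline and makes the URL encoding of the query visible.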

How to query

• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL via the SELECT statement

Example: SELECT ?title ?author

• Results are returned as a table

title                                                          author
A Patient from Argentina Infected with Rickettsia massiliae    Juan Carlos García
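How variables get bound during pattern matching can be sketched with a tiny in-memory graph. This is a toy evaluator written for illustration, not how a SPARQL engine is implemented; the predicate `oav:hasTitle` and the helper names `match` and `select` are assumptions made for the example.

```python
# A tiny in-memory graph; oav:hasTitle is a made-up predicate name.
GRAPH = [
    ("oad:publication1", "oav:hasTitle",
     "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
]


def match(pattern, triple, bindings):
    """Extend bindings so that pattern equals triple; return None on mismatch."""
    result = dict(bindings)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):              # a variable
            if result.setdefault(term, value) != value:
                return None                   # already bound to something else
        elif term != value:                   # a constant that does not match
            return None
    return result


def select(variables, patterns, graph):
    """Evaluate a list of triple patterns and project the requested variables."""
    solutions = [{}]
    for pattern in patterns:
        solutions = [b2 for b in solutions for t in graph
                     if (b2 := match(pattern, t, b)) is not None]
    return [tuple(b[v] for v in variables) for b in solutions]


rows = select(
    ["?title", "?author"],
    [("?pub", "oav:hasTitle", "?title"), ("?pub", "oav:hasAuthor", "?author")],
    GRAPH,
)
for row in rows:
    print(row)
```

Because both patterns share `?pub`, only title and author values that belong to the same publication end up in one result row, which is the join behaviour the SELECT table relies on.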

OpenAIRE as LOD

• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities

http://beta.lod.openaire.eu

Data conforming to LOD best practices published in BETA, December 2015

[Figure: OpenAIRE data (main entities, linking entities) and the resulting OA RDF graph]

http://beta.lod.openaire.eu

Sample query

SELECT (COUNT(DISTINCT ?s) AS ?count) ?flevel
FROM <test>
FROM <relationsTest>
WHERE {
  ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
     <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
}
GROUP BY ?flevel
ORDER BY ?count

Number of projects at each funding level (the query counts resources of type cerif Project)
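What the grouping and counting in the sample query computes can be mirrored in plain Python. The sketch below is an analogy only: the (project, funding level) pairs are invented, and `count_distinct_by_level` is a hypothetical helper that imitates COUNT(DISTINCT ...) with GROUP BY and an ascending ORDER BY.

```python
from collections import defaultdict

# (project, funding level) pairs; the values are invented for illustration.
rows = [
    ("project:1", "FP7"),
    ("project:2", "FP7"),
    ("project:2", "FP7"),   # a repeated row must not be counted twice
    ("project:3", "H2020"),
]


def count_distinct_by_level(pairs):
    """Count distinct projects per funding level, smallest count first."""
    groups = defaultdict(set)
    for project, level in pairs:
        groups[level].add(project)          # a set gives DISTINCT for free
    counts = [(level, len(projects)) for level, projects in groups.items()]
    return sorted(counts, key=lambda item: item[1])   # like ORDER BY ?count


print(count_distinct_by_level(rows))
```

Using a set per group is the piece that corresponds to DISTINCT: without it, the repeated row for `project:2` would inflate the count.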

General architecture

[Figure: architecture diagram. Labels shown: OpenAIRE Metadata, Deduplication & Inference, Apache Solr (https://www.openaire.eu); OA Data Model, OA Vocabulary; RDFization, Interlinking, RDF Store; LOD Client (http://beta.lod.openaire.eu); Browser; HTML, RDF]

Interlinking OpenAIRE RDF Graph to LOD cloud

OA LOD interlinking workflow

Preprocessing

• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([Properties]))
• Store in HDFS
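The pruning and key-value transformation in the preprocessing step can be sketched in plain Python. This stands in for the Hadoop job only conceptually: the record layout, the `KEEP` list of properties and the `to_key_value` helper are all assumptions made for the example, and nothing is written to HDFS.

```python
# Properties assumed to be useful for matching; this list is hypothetical.
KEEP = ("title", "author", "year")


def to_key_value(record):
    """Turn one metadata record into (ID, [properties]), dropping unused fields."""
    properties = [(name, record[name]) for name in KEEP if record.get(name)]
    return record["id"], properties


# Invented sample records from a candidate dataset dump.
records = [
    {"id": "oa:1", "title": "Some title", "author": "A. Author",
     "year": "1996", "format": "pdf"},
    {"id": "oa:2", "title": "Other title", "author": "", "language": "en"},
]

pairs = [to_key_value(r) for r in records]
for key, value in pairs:
    print(key, value)
```

Fields outside `KEEP` and empty values are dropped, so each record shrinks to the identifier plus the properties an interlinking tool would actually compare.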

Sample interlinking result

Result of interlinking is a set of links between URIs from source and target dataset

<http://lod.openaire…bde783> owl:sameAs <http://dblp.l3s…BoissonnatN96>
<http://lod.openaire…4f8964> owl:sameAs <http://dblp.l3s…Shrobe96>
<http://lod.openaire…27fea2> owl:sameAs <http://dblp.l3s…X96c>
<http://lod.openaire…f433b9> owl:sameAs <http://dblp.l3s…LiroyG96>

DBLP dump is not complete
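A link set of this shape is easy to load for further checks. The following sketch parses lines of the form `<source> owl:sameAs <target>` into pairs using a regular expression; the example URIs are placeholders under example.org, not real OpenAIRE or DBLP identifiers, and `parse_links` is a hypothetical helper.

```python
import re

# Matches lines of the form <source-uri> owl:sameAs <target-uri>
LINK = re.compile(r"<([^>]+)>\s+owl:sameAs\s+<([^>]+)>")


def parse_links(text):
    """Return the set of (source, target) pairs found in text."""
    return {(m.group(1), m.group(2)) for m in LINK.finditer(text)}


# Placeholder URIs, invented for illustration.
sample = """
<http://example.org/openaire/bde783> owl:sameAs <http://example.org/dblp/BoissonnatN96>
<http://example.org/openaire/4f8964> owl:sameAs <http://example.org/dblp/Shrobe96>
"""

links = parse_links(sample)
for source, target in sorted(links):
    print(source, "->", target)
```

Storing the links as a set of pairs makes later comparisons between link set versions a matter of simple set operations.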


Ideas for LOD in Monitoring

Monitoring interlinking: when the target dataset grows from one version to another, we can expect the linkset to grow as well
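One simple way to monitor this expectation is to compare the link sets produced for two versions of the target dataset. The sketch below is an assumption about how such a check could look, not a description of an existing OpenAIRE service; the identifiers and the helper names `linkset_delta` and `grew_as_expected` are invented.

```python
def linkset_delta(old_links, new_links):
    """Compare two link sets and report what was added, removed and kept."""
    old_links, new_links = set(old_links), set(new_links)
    return {
        "added": new_links - old_links,
        "removed": old_links - new_links,
        "kept": old_links & new_links,
    }


def grew_as_expected(old_links, new_links):
    """The expectation from the slide: a larger target should not shrink the link set."""
    return len(set(new_links)) >= len(set(old_links))


# Invented link sets for two versions of the target dataset.
v1 = {("oa:1", "dblp:A"), ("oa:2", "dblp:B")}
v2 = {("oa:1", "dblp:A"), ("oa:2", "dblp:B"), ("oa:3", "dblp:C")}

delta = linkset_delta(v1, v2)
print(len(delta["added"]), "added,", len(delta["removed"]), "removed")
print(grew_as_expected(v1, v2))
```

A shrinking link set, or many removed links despite a larger target, would be a signal to re-check the interlinking configuration.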

Scientific events

Bootstrapping datasets for scientific events

• CEUR-WS.org dataset
• OpenResearch.org
• Include events in OA Data Model (Conference Object)

Measure the quality of events

• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …

Hands on

http://beta.lod.openaire.eu/sparql

Example: What is the overall research output of a given project?

Note: oav:produces and UNION are not working

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>

SELECT ?x ?y WHERE {
  ?y a cerif:ResultEntity .
  { ?y oav:resultType "dataset" }
  UNION
  { ?y oav:resultType "publication" }
  ?x a cerif:Project .
  ?y cerif:linkToProject ?x .
}
LIMIT 10

Example: What organizations are more active than others with respect to projects?

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?o
WHERE {
  ?x oav:projectOrganization ?o .
  ?o a foaf:Organization .
  ?y oav:projectOrganization ?o2 .
  ?o2 a foaf:Organization .
  FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
}
LIMIT 10

Example: What datasets have been published by a specific person who is involved in a given project?

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10

Example: List the full names of all authors who have (co-)authored a publication in project P

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10




Expected valuesbull Open up a window to the Linked Open Data Webbull Increase the OpenAIRE technical interoperability

bull Increase the reusability of the OpenAIRE research metadatabull Engage with additional user communities

bull Explore synergies with and added value to related open content initiatives

bull Provide links through LOD to similar infrastructuresbull Offer new services for OA data monitoring activitiesbull Provide services to export the OpenAIRE objects as a LOD graphbull Facilitate integration with other LOD graphs relative to similar systems and

infrastructuresbull Find patterns to enrich the OpenAIRE information space

Exposing the OpenAIRE Information Space as linked data

Towards OpenAIRE LOD Services

Phase 1 LOD Production

Phase 1 Interlinking OpenAIRE RDF Graph to LOD cloud

Steps

bullSpecify an RDF vocabulary bullSpecify terms and namespacesbullMap the OA data model to an RDF data modelbullMap the OA data to an statistic RDF dumpbullSpecify strategies to automate the RDF generation

OA RDF graph

hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson

foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson

oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip

OpenAIRE data

OA RDF

Phase 1 LOD Production

Core entitiesLinking entities

Specify vocabularies

Organizations Results Persons Datasources Projects

68526 17414766 62958315 19443 624417

including duplicates connected with sameAs

Total Number of Triples 1013527855 Distinct Entities 98256

OpenAIRE data as RDF Graph

StepsbullIdentify datasets to be interlinked to bullSelect interlinking tools LIMES SilkbullTest interlinking OA with DBLP and DBpediabullEvaluate resulting link setsbullSpecify strategy for interlinking in OA workflow

DBLP

CiteSeer

CEUR Ope

Pu

lAK A

Phase2 Interlinking OA-RDF Graph to LOD cloud

hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyoad07553d8e646b69b868a9791da39a1802 a foafPerson

foaffirstName P^^xsdstring foaflastName Jha^^xsdstringfoafname Jha P^^xsdstring oavisAuthorOf

oad755469c995c2cb6cb55c3483634b026 a foafPersonoavhasTarget

resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 oavhasLabel personResult_authorship_isAuthorOf^^xsdstring oavranking 6^^xsdinteger

OA LOD

Linked Open Data(LOD)

httpbetalodopenaireeu

RDF (Resource Description Framework)

Resource anything uniquely identifiable Description description of resource via representing properties and relations Framework web-based protocols and semanticsRDF triples List of statements

Subject (URI)Predicate (URI)

Object (URI or Literal)

oadpublication1

ldquoJuan Carlos Garciacutealdquo

oavhasAuthor

RDF version of example

    PREFIX dcterms: <http://purl.org/dc/terms/>
    …
    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    PREFIX prov: <http://www.w3.org/ns/prov#>

    od_______908… rdf:type cerif:ResultEntity ;
        dcterms:description "The first confirmed case" ;
        dcterms:publisher "The American Society of Tropical Medicine and Hygiene" ;
        …
        oav:resultSubject "Articles" ;
        oav:dateOfCollection "2015-02-06" .

Example of data about Linking entities

An entity of type Person_Result, whose ranking property can take the value 1 to indicate the first author:

    Person:         od_______908f39…1c4a    (rdf:type foaf:Person)
    Person_Result:  oav:rank 1
    Result:         od_______908fa3b453     (rdf:type cerif:ResultEntity)
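How a linking entity carries the author order can be sketched as follows. The `PersonResult` class, the `first_author` function and the identifiers `person-a`, `result-1` and so on are hypothetical; only the idea that rank 1 marks the first author comes from the slide.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class PersonResult:
    person: str  # identifier of the foaf:Person
    result: str  # identifier of the cerif:ResultEntity
    rank: int    # 1 indicates the first author


def first_author(links: Iterable[PersonResult], result: str) -> Optional[str]:
    """Return the person linked to `result` with rank 1, or None if there is none."""
    for link in links:
        if link.result == result and link.rank == 1:
            return link.person
    return None


links = [
    PersonResult("person-b", "result-1", 2),
    PersonResult("person-a", "result-1", 1),
    PersonResult("person-c", "result-2", 1),
]

print(first_author(links, "result-1"))
```

Keeping the rank on the link rather than on the person is what allows the same person to be first author of one result and co-author of another.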

How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)

• Query language for RDF-based data
• SPARQL endpoint: an RDF triple database on a server, available on the Web
• Pattern matching language
• Protocol layer
• Query interface
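The protocol layer can be sketched in Python as follows: in the SPARQL protocol, a query sent with HTTP GET is passed URL-encoded in the `query` parameter. The endpoint address is the one given in the hands-on part of this tutorial; whether it is still reachable is not assumed, so the sketch only builds the request URL and does not send it. `build_query_url` is a hypothetical helper name.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Endpoint named in the hands-on part of the tutorial (availability not assumed).
ENDPOINT = "http://beta.lod.openaire.eu/sparql"


def build_query_url(endpoint: str, query: str) -> str:
    """Encode a SPARQL query as an HTTP GET URL using the `query` parameter."""
    return endpoint + "?" + urlencode({"query": query})


query = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"
url = build_query_url(ENDPOINT, query)

# Decoding the URL again gives back the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]

print(url)
print(decoded == query)
```

Sending the request (for example with `urllib.request.urlopen`) and choosing a result format would be the next step once a live endpoint is available.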

How to query

• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL via the SELECT statement

Example: SELECT ?title ?author

• Results are returned as a table

    ?title                                                         ?author
    A Patient from Argentina Infected with Rickettsia massiliae    Juan Carlos García
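To make the idea of variable binding concrete, here is a small in-memory pattern matcher, written as a sketch rather than as a faithful SPARQL implementation. Terms starting with `?` are treated as variables; the property name `dcterms:title` and the helper names `match` and `select` are assumptions made for this illustration.

```python
def match(pattern, triple, binding):
    """Extend `binding` so that `pattern` matches `triple`; return None on mismatch."""
    new = dict(binding)
    for p_term, t_term in zip(pattern, triple):
        if p_term.startswith("?"):                 # a variable
            if new.setdefault(p_term, t_term) != t_term:
                return None                        # already bound to another term
        elif p_term != t_term:                     # a constant that does not match
            return None
    return new


def select(variables, patterns, graph):
    """Evaluate a list of triple patterns and project the requested variables."""
    bindings = [{}]
    for pattern in patterns:
        bindings = [
            b2
            for b in bindings
            for t in graph
            if (b2 := match(pattern, t, b)) is not None
        ]
    return [tuple(b[v] for v in variables) for b in bindings]


graph = [
    ("oad:publication1", "dcterms:title",
     "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
]

rows = select(
    ["?title", "?author"],
    [("?pub", "dcterms:title", "?title"), ("?pub", "oav:hasAuthor", "?author")],
    graph,
)
for row in rows:
    print(row)
```

The shared variable `?pub` is what joins the two patterns: both must be satisfied by the same publication for a row to appear in the result table.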

OpenAIRE as LOD

• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities

http://beta.lod.openaire.eu
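A dereferenceable URI can be requested in a machine-readable form through HTTP content negotiation. The sketch below only constructs such a request and does not send it. The entity URI is a placeholder on `example.org`, and the assumption that a server would answer `text/turtle` is not confirmed by the slides; `rdf_request` is a hypothetical helper name.

```python
from urllib.request import Request


def rdf_request(entity_uri: str, media_type: str = "text/turtle") -> Request:
    """Build a GET request that asks for an RDF serialization of the entity."""
    return Request(entity_uri, headers={"Accept": media_type})


# Placeholder URI; a real entity URI from the LOD service would go here.
req = rdf_request("http://example.org/entity/123")

print(req.full_url)
print(req.get_header("Accept"))
```

A browser sending `Accept: text/html` to the same URI would typically receive a human-readable page instead, which is the basis of interactive browsing.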

Steps

• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation
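The mapping step can be sketched as a function that turns one person record into Turtle. The record layout and the field names `firstname`, `lastname` and `fullname` are hypothetical; the target properties `foaf:firstName`, `foaf:lastName` and `foaf:name` are the ones used in the OA RDF graph example of this tutorial.

```python
def literal(value: str) -> str:
    """Quote a string as a Turtle literal, escaping backslashes and double quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


# Hypothetical mapping from fields of an OA person record to RDF properties.
FIELD_TO_PROPERTY = {
    "firstname": "foaf:firstName",
    "lastname": "foaf:lastName",
    "fullname": "foaf:name",
}


def person_to_turtle(record: dict) -> str:
    """Serialize one person record as a Turtle subject block."""
    lines = [f"oad:{record['id']} a foaf:Person"]
    for field, prop in FIELD_TO_PROPERTY.items():
        if field in record:
            lines.append(f"    {prop} {literal(record[field])}")
    return " ;\n".join(lines) + " ."


record = {
    "id": "07553d8e646b69b868a9791da39a1802",
    "firstname": "P",
    "lastname": "Jha",
    "fullname": "Jha P",
}
turtle = person_to_turtle(record)
print(turtle)
```

Keeping the field-to-property table separate from the serialization code is one way to automate the generation: a new entity type needs a new table, not new code.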

Data conforming to LOD best practices published in BETA, December 2015

Main entities, Linking entities

http://beta.lod.openaire.eu

OA RDF graph

    …
    @prefix oad: <http://lod.openaire.eu/data/> .
    @prefix oav: <http://lod.openaire.eu/vocab/> .
    @prefix dbpedia-owl: <http://dbpedia.org/ontology/> .
    @prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#> .
    @prefix pext: <http://www.ontotext.com/proton-ontology#> .
    @prefix swrc: <http://swrc.ontoware.org/ontology#> .

    oad:07553d8e646b69b868a9791da39a1802 a foaf:Person ;
        foaf:firstName "P"^^xsd:string ;
        foaf:lastName "Jha"^^xsd:string ;
        foaf:name "Jha P"^^xsd:string ;
        oav:isAuthorOf oad:755469c995c2cb6cb55c3483634b026 .

    oad:755469c995c2cb6cb55c3483634b026 a foaf:Person ;
        oav:hasTarget result:doajarticles_6fcd7b3b47ebbd05ce73018731ff9095 ;
        oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string ;
        oav:ranking "6"^^xsd:integer .

    oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person ;
        foaf:firstName "T"^^xsd:string ;
        foaf:lastName "Bere"^^xsd:string ;
        foaf:name "Bere T"^^xsd:string .
    …

[Figure: OpenAIRE data to OA RDF]

Sample query

    SELECT (COUNT(DISTINCT ?s) AS ?count) ?flevel
    FROM <test>
    FROM <relationsTest>
    WHERE {
      ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
         <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
    }
    GROUP BY ?flevel
    ORDER BY ?count

Number of publications with their corresponding funding level
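What the aggregation in the sample query computes can be mirrored in plain Python. The input pairs below are invented stand-ins for the matches of the query pattern, and the funding level values `FP7` and `H2020` are used only as illustrative labels; `count_per_level` is a hypothetical helper name.

```python
from collections import Counter

# Hypothetical (entity, funding level) matches of the query pattern.
matches = [
    ("project-1", "FP7"),
    ("project-2", "H2020"),
    ("project-3", "FP7"),
    ("project-1", "FP7"),  # repeated row: DISTINCT must not count it twice
]


def count_per_level(pairs):
    """Count distinct entities per funding level, sorted by ascending count."""
    distinct = set(pairs)                                # COUNT(DISTINCT ?s) ...
    counts = Counter(level for _, level in distinct)     # ... GROUP BY ?flevel
    return sorted(counts.items(), key=lambda kv: kv[1])  # ORDER BY ?count


result = count_per_level(matches)
print(result)
```

As in SPARQL, ordering is ascending by default; a descending ranking would need `DESC(?count)` in the query or `reverse=True` in the sketch.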

General architecture

[Figure: architecture diagram. Legible labels: OpenAIRE Metadata, RDFization, Interlinking, RDF Store, Deduplication & Inference, Apache Solr, https://www.openaire.eu, LOD Client, http://beta.lod.openaire.eu, OA Vocabulary, OA Data Model, HTML Browser; output formats HTML and RDF]


OA LOD interlinking workflow

Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([properties]))
• Store in HDFS
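The pruning and key-value transformation can be sketched in memory as below. This is plain Python rather than a Hadoop job, nothing is written to HDFS, and the set of properties kept in `KEEP` as well as the sample rows are assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical set of properties considered useful for interlinking.
KEEP = {"title", "author", "year"}


def to_key_value(rows):
    """Group (id, property, value) rows by id, dropping properties not in KEEP."""
    grouped = defaultdict(list)
    for entity_id, prop, value in rows:
        if prop in KEEP:  # prune metadata that is not needed for linking
            grouped[entity_id].append((prop, value))
    return dict(grouped)


rows = [
    ("pub1", "title", "Some title"),
    ("pub1", "internal_flag", "x"),
    ("pub1", "year", "1996"),
    ("pub2", "title", "Another title"),
]
pairs = to_key_value(rows)
for key, properties in pairs.items():
    print(key, properties)
```

Each resulting entry has the shape described on the slide: the entity ID as key and its list of retained properties as value.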

Sample interlinking result

The result of interlinking is a set of links between URIs from the source and target datasets.

Note: the DBLP dump is not complete.

    <http://lod.openaire.../bde783> owl:sameAs <http://dblp.l3s.../BoissonnatN96>
    <http://lod.openaire.../4f8964> owl:sameAs <http://dblp.l3s.../Shrobe96>
    <http://lod.openaire.../27fea2> owl:sameAs <http://dblp.l3s.../X96c>
    <http://lod.openaire.../f433b9> owl:sameAs <http://dblp.l3s.../LiroyG96>
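A link set in this form is easy to load for further evaluation. The sketch below parses lines of the shape `<source> owl:sameAs <target>` into pairs; the URIs in `sample` are placeholders on `example.org`, not actual OpenAIRE or DBLP identifiers, and `parse_links` is a hypothetical helper name.

```python
import re

# One link per line: <source URI> owl:sameAs <target URI>
LINK = re.compile(r"<([^>]+)>\s+owl:sameAs\s+<([^>]+)>")


def parse_links(text: str):
    """Return the (source, target) pairs found in a link set dump."""
    return LINK.findall(text)


sample = """
<http://example.org/source/a1> owl:sameAs <http://example.org/target/b1>
<http://example.org/source/a2> owl:sameAs <http://example.org/target/b2>
"""
links = parse_links(sample)
for source, target in links:
    print(source, "->", target)
```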


Ideas for LOD in Monitoring

Monitoring interlinking: when the target dataset grows from one version to the next, we can expect the link set to grow as well.
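This expectation could be checked by comparing the link sets produced for two versions of the target dataset. The function name `linkset_report` and the sample links are hypothetical; the sketch only shows the comparison itself.

```python
def linkset_report(old_links, new_links):
    """Compare two link sets, given as iterables of (source, target) pairs."""
    old, new = set(old_links), set(new_links)
    return {
        "added": new - old,            # links that appear only in the new version
        "removed": old - new,          # links that disappeared
        "grew": len(new) > len(old),   # the expectation stated above
    }


version_1 = [("oa:1", "dblp:A"), ("oa:2", "dblp:B")]
version_2 = [("oa:1", "dblp:A"), ("oa:2", "dblp:B"), ("oa:3", "dblp:C")]

report = linkset_report(version_1, version_2)
print(report)
```

A non-empty `removed` set, or `grew` being false after the target dataset has grown, would be a signal worth investigating.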

Scientific events

Bootstrapping datasets for scientific events
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in the OA Data Model (Conference Object)

Measure the quality of events
• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …

Hands on

http://beta.lod.openaire.eu/sparql

Example: What is the overall research output of a given project?

Note: oav:produces and UNION are not working.

    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>

    SELECT ?x ?y WHERE {
      ?y a cerif:ResultEntity .
      { ?y oav:resultType "dataset" }
      UNION
      { ?y oav:resultType "publication" }
      ?x a cerif:Project .
      ?y cerif:linkToProject ?x .
    }
    LIMIT 10

Example: Which organizations are more active than others with respect to projects?

    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>

    SELECT ?o WHERE {
      ?x oav:projectOrganization ?o .
      ?o a foaf:Organization .
      ?y oav:projectOrganization ?o2 .
      ?o2 a foaf:Organization .
      FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
    }
    LIMIT 10
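One way to rank organizations by activity, once (project, organization) pairs have been retrieved, is to count the distinct projects per organization. The sketch below works on invented sample pairs; `projects_per_organization` is a hypothetical helper and not part of any OpenAIRE tooling.

```python
from collections import defaultdict


def projects_per_organization(pairs):
    """Rank organizations by the number of distinct projects they take part in."""
    projects = defaultdict(set)
    for project, organization in pairs:
        projects[organization].add(project)
    # Most active first; ties are broken alphabetically for a stable order.
    return sorted(
        ((org, len(ps)) for org, ps in projects.items()),
        key=lambda item: (-item[1], item[0]),
    )


pairs = [
    ("project-1", "org-A"),
    ("project-2", "org-A"),
    ("project-1", "org-B"),
    ("project-2", "org-A"),  # repeated pair, counted once
]
ranking = projects_per_organization(pairs)
print(ranking)
```

The same ranking could be computed by the endpoint itself with COUNT and GROUP BY, as in the sample query shown earlier in this tutorial.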

Example: Which datasets have been published by a specific person who is involved in a given project?

    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    PREFIX dcterms: <http://purl.org/dc/terms/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>

    SELECT ?y WHERE {
      ?p cerif:linksToPerson ?x .
      ?x a foaf:Person .
      ?x dcterms:creator ?y .
      ?y oav:resultType "dataset" .
    }
    LIMIT 10

Example: List the full names of all authors who have (co-)authored a publication in project P

    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    PREFIX dcterms: <http://purl.org/dc/terms/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>

    SELECT ?y WHERE {
      ?p cerif:linksToPerson ?x .
      ?x a foaf:Person .
      ?x dcterms:creator ?y .
      ?y oav:resultType "dataset" .
    }
    LIMIT 10
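The join behind this question (project to results, results to persons, persons to names) can be sketched over in-memory relations. All identifiers and names below are invented for illustration, and `authors_of_project` is a hypothetical helper; it shows the logic of the question, not the vocabulary of the endpoint.

```python
def authors_of_project(project, result_project, person_result, names):
    """Return the sorted full names of persons who authored a result of `project`.

    result_project: iterable of (result, project) pairs
    person_result:  iterable of (person, result) pairs
    names:          dict mapping person -> full name
    """
    results = {r for r, p in result_project if p == project}
    persons = {person for person, r in person_result if r in results}
    return sorted(names[p] for p in persons if p in names)


result_project = [("pub-1", "P"), ("pub-2", "P"), ("pub-3", "Q")]
person_result = [
    ("person-a", "pub-1"),
    ("person-b", "pub-1"),
    ("person-a", "pub-2"),
    ("person-c", "pub-3"),
]
names = {"person-a": "Ada Example", "person-b": "Bob Example", "person-c": "Cy Example"}

authors = authors_of_project("P", result_project, person_result, names)
print(authors)
```

Using a set for `persons` ensures that an author of several publications in the same project is listed only once, which corresponds to SELECT DISTINCT in SPARQL.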

Page 13: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Specify vocabularies

Organizations Results Persons Datasources Projects

68526 17414766 62958315 19443 624417

including duplicates connected with sameAs

Total Number of Triples 1013527855 Distinct Entities 98256

OpenAIRE data as RDF Graph

Phase 2: Interlinking OA-RDF Graph to LOD cloud

Steps
• Identify datasets to be interlinked to
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in OA workflow

[Figure: the OA LOD graph linked to datasets of the Linked Open Data (LOD) cloud, including DBLP, CiteSeer and CEUR; the remaining labels are truncated in the source]

…
@prefix oad: <http://lod.openaire.eu/data/> .
@prefix oav: <http://lod.openaire.eu/vocab/> .
@prefix dbpedia-owl: <http://dbpedia.org/ontology/> .

oad:07553d8e646b69b868a9791da39a1802 a foaf:Person ;
    foaf:firstName "P"^^xsd:string ;
    foaf:lastName "Jha"^^xsd:string ;
    foaf:name "Jha P"^^xsd:string ;
    oav:isAuthorOf oad:755469c995c2cb6cb55c3483634b026 .

oad:755469c995c2cb6cb55c3483634b026 a foaf:Person ;
    oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 ;
    oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string ;
    oav:ranking "6"^^xsd:integer .

http://beta.lod.openaire.eu

RDF (Resource Description Framework)

Resource: anything uniquely identifiable
Description: description of a resource by representing its properties and relations
Framework: web-based protocols and semantics

RDF triples: list of statements

Subject (URI)       Predicate (URI)    Object (URI or literal)
oad:publication1    oav:hasAuthor      "Juan Carlos García"
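As a minimal sketch of the triple model above, one statement can be held as a (subject, predicate, object) tuple in Python. The prefix URIs are assumptions reconstructed from the oad and oav prefixes used in these slides, and `expand` is a hypothetical helper, not part of any OpenAIRE tooling.

```python
# Prefix URIs are assumed from the slides' oad/oav declarations.
PREFIXES = {
    "oad": "http://lod.openaire.eu/data/",
    "oav": "http://lod.openaire.eu/vocab/",
}

def expand(curie):
    """Expand a prefixed name such as 'oav:hasAuthor' to a full URI."""
    prefix, local = curie.split(":", 1)
    return PREFIXES[prefix] + local

# One RDF statement: subject and predicate are URIs, the object is a literal.
triple = (
    expand("oad:publication1"),
    expand("oav:hasAuthor"),
    "Juan Carlos García",
)
print(triple)
```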

RDF version of example

PREFIX dcterms: <http://purl.org/dc/terms/>
…
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX prov: <http://www.w3.org/ns/prov#>

od_______908… rdf:type cerif:ResultEntity ;
    dcterms:description "The first confirmed case" ;
    dcterms:publisher "The American Society of Tropical Medicine and Hygiene" ;
    …
    oav:resultSubject "Articles" ;
    oav:dateOfCollection "2015-02-06" .

Example of data about Linking entities

An entity of type Person_Result whose ranking property can have the value 1 to indicate the first author.

Person: od_______908f39…1c4a (rdf:type foaf:Person)
Link:   PersonResult (oav:rank 1)
Result: od_______908fa3b453 (rdf:type cerif:ResultEntity)

How to query RDF: SPARQL (SPARQL Protocol and RDF Query Language)

• Query language for RDF-based data
• SPARQL endpoint: an RDF triple database on a server, available on the Web
• Pattern-matching language
• Protocol layer
• Query interface

How to query

• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL via the SELECT statement

Example: SELECT ?title ?author

• Results are returned as a table

?title                                                         ?author
A Patient from Argentina Infected with Rickettsia massiliae    Juan Carlos García
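To make the pattern-matching idea concrete, here is a small Python sketch of how a single triple pattern with variables could be matched against a list of triples, yielding one row of bindings per match. It is a toy illustration rather than how a SPARQL engine is implemented; the `match` function is hypothetical, and the `dcterms:title` predicate is an assumption, since the slide shows only the title column.

```python
def match(pattern, triples):
    """Return one binding dict per triple that fits the pattern.

    Pattern terms starting with '?' are variables; all others must match exactly.
    """
    rows = []
    for triple in triples:
        binding = {}
        for term, value in zip(pattern, triple):
            if term.startswith("?"):
                # A variable used twice must bind to the same value.
                if binding.setdefault(term, value) != value:
                    break
            elif term != value:
                break
        else:
            rows.append(binding)
    return rows

triples = [
    ("oad:publication1", "dcterms:title",
     "A Patient from Argentina Infected with Rickettsia massiliae"),
    ("oad:publication1", "oav:hasAuthor", "Juan Carlos García"),
]

authors = match(("?pub", "oav:hasAuthor", "?author"), triples)
print(authors)
```

A full query joins several such patterns on their shared variables; this sketch covers only the single-pattern case.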

OpenAIRE as LOD

• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities

http://beta.lod.openaire.eu

Steps

• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation

Data conforming to LOD best practices published in BETA, December 2015

Main entities, Linking entities

http://beta.lod.openaire.eu

OA RDF graph

…
@prefix oad: <http://lod.openaire.eu/data/> .
@prefix oav: <http://lod.openaire.eu/vocab/> .
@prefix dbpedia-owl: <http://dbpedia.org/ontology/> .
@prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#> .
@prefix pext: <http://www.ontotext.com/proton-ontology#> .
@prefix swrc: <http://swrc.ontoware.org/ontology#> .

oad:07553d8e646b69b868a9791da39a1802 a foaf:Person ;
    foaf:firstName "P"^^xsd:string ;
    foaf:lastName "Jha"^^xsd:string ;
    foaf:name "Jha P"^^xsd:string ;
    oav:isAuthorOf oad:755469c995c2cb6cb55c3483634b026 .

oad:755469c995c2cb6cb55c3483634b026 a foaf:Person ;
    oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 ;
    oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string ;
    oav:ranking "6"^^xsd:integer .

oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person ;
    foaf:firstName "T"^^xsd:string ;
    foaf:lastName "Bere"^^xsd:string ;
    foaf:name "Bere T"^^xsd:string .
…

[Figure: OpenAIRE data converted to OA RDF]

Sample query

select (count(distinct ?s) as ?count) ?flevel
from <test>
from <relationsTest>
where {
  ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
     <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
}
GROUP BY ?flevel
order by ?count

Number of publications with their corresponding funding level
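The aggregation in the sample query (count distinct subjects, grouped by funding level, ordered by count) can be sketched in plain Python over toy data. The project identifiers and funding-level values below are made up for illustration and do not come from the OpenAIRE data.

```python
from collections import Counter

# Toy (subject, funding level) rows; values are invented for illustration.
rows = [
    ("project1", "FP7"),
    ("project2", "FP7"),
    ("project2", "FP7"),   # duplicate row, counted once (DISTINCT)
    ("project3", "H2020"),
]

# set() removes duplicate rows, Counter groups by funding level.
counts = Counter(level for _, level in set(rows))

# Equivalent of ORDER BY ?count (ascending).
for level, n in sorted(counts.items(), key=lambda item: item[1]):
    print(level, n)
```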

General architecture

[Figure: general architecture. Components shown: OpenAIRE Metadata, OA Data Model, OA Vocabulary, RDFization, Interlinking, Deduplication & Inference, RDF Store, Apache Solr, LOD Client (http://beta.lod.openaire.eu), https://www.openaire.eu, and a browser receiving HTML and RDF]


Interlinking OpenAIRE RDF Graph to LOD cloud


OA LOD interlinking workflow

Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key (ID), value ([properties]))
• Store in HDFS
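The pruning and key-value steps above might look roughly like the following Python sketch. The record layout, the field names and the `to_key_value` helper are hypothetical; the slides do not show the actual Hadoop job.

```python
def to_key_value(record, keep):
    """Turn one metadata record into an (ID, [properties]) pair.

    Only the fields named in `keep` survive; everything else is pruned.
    """
    props = [(name, record[name]) for name in keep if record.get(name)]
    return record["id"], props

# Hypothetical input record with one field that is not needed for linking.
record = {
    "id": "pub1",
    "title": "Some title",
    "year": "1996",
    "internal_note": "x",
}

key, value = to_key_value(record, keep=("title", "year"))
print(key, value)
```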

Sample interlinking result

The result of interlinking is a set of links between URIs from the source and target datasets.

Note: the DBLP dump is not complete.

<http://lod.openaire...bde783> owl:sameAs <http://dblp.l3s...BoissonnatN96>
<http://lod.openaire...4f8964> owl:sameAs <http://dblp.l3s...Shrobe96>
<http://lod.openaire...27fea2> owl:sameAs <http://dblp.l3s...X96c>
<http://lod.openaire...f433b9> owl:sameAs <http://dblp.l3s...LiroyG96>


Ideas for LOD in monitoring

Monitoring interlinking: when the target dataset grows from one version to the next, we can expect the link set to grow as well.
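One simple way to monitor this, sketched here under the assumption that each link set version is available as a set of (source URI, target URI) pairs, is to compare two versions directly. The `linkset_growth` function and the short identifiers are illustrative only.

```python
def linkset_growth(old_links, new_links):
    """Compare two versions of a link set given as sets of (source, target) pairs."""
    return {
        "added": len(new_links - old_links),
        "removed": len(old_links - new_links),
        "size_change": len(new_links) - len(old_links),
    }

# Illustrative identifiers, not real OpenAIRE or DBLP URIs.
v1 = {("oa:1", "dblp:a")}
v2 = {("oa:1", "dblp:a"), ("oa:2", "dblp:b")}

report = linkset_growth(v1, v2)
print(report)
```

A link set that shrinks or stays flat while the target dataset grows would be a signal to re-check the interlinking configuration.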

Scientific events

Bootstrapping datasets for scientific events
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in the OA Data Model (Conference Object)

Measure the quality of events
• Relation to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …

Hands on

http://beta.lod.openaire.eu/sparql
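For readers who prefer scripting over the web form, the following sketch builds a SPARQL-protocol GET URL for the endpoint named above, passing the query text in the standard `query` parameter. It only constructs the URL and sends nothing; whether the BETA endpoint is still reachable is not something this sketch can confirm, and `build_request_url` is a hypothetical helper.

```python
from urllib.parse import urlencode

# Endpoint URL as given on the slide; availability is not guaranteed.
ENDPOINT = "http://beta.lod.openaire.eu/sparql"

def build_request_url(query, endpoint=ENDPOINT):
    """Build a SPARQL-protocol GET URL carrying the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})

url = build_request_url("SELECT * WHERE { ?s ?p ?o } LIMIT 10")
print(url)
```

The resulting URL could then be fetched with any HTTP client, typically with an Accept header asking for a SPARQL results format.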

Example: What is the overall research output of a given project?

Note: oav:produces and UNION are not working.

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>

SELECT ?x ?y WHERE {
  ?y a cerif:ResultEntity .
  { ?y oav:resultType "dataset" }
  UNION
  { ?y oav:resultType "publication" }
  ?x a cerif:Project .
  ?y cerif:linkToProject ?x .
}
LIMIT 10

Example: Which organizations are more active than others with respect to projects?

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?o
WHERE {
  ?x oav:projectOrganization ?o .
  ?o a foaf:Organization .
  ?y oav:projectOrganization ?o2 .
  ?o2 a foaf:Organization .
  FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
}
LIMIT 10

Example: Which datasets have been published by a specific person who is involved in a given project?

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10

Example: List the full names of all authors who have (co-)authored a publication in project P

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10






RDF (Resource Description Framework)

Resource anything uniquely identifiable Description description of resource via representing properties and relations Framework web-based protocols and semanticsRDF triples List of statements

Subject (URI)Predicate (URI)

Object (URI or Literal)

oadpublication1

ldquoJuan Carlos Garciacutealdquo

oavhasAuthor

RDF version of example

PREFIX dcterms lthttppurlorgdctermsgthellipPREFIX rdf lthttpwwww3org19990222-rdf-syntax-nsgtPREFIX cerif lthttpwwweurocrisorgontologiescerif13gtPREFIX prov lthttpwwww3orgnsprov

od_______908hellip rdftype cerifResultEntitydctermsdescription ldquo The first confirmed case ldquodctermspublisher ldquoThe American Society of

Tropical Medicine and Hygienerdquo hellipoavresultSubject ldquoArticlesldquooavdateOfCollection 2015-02-06

Example of data about Linking entitiesAn entity of type Person_Result whose ranking property can have the value 1 to indicate the first author

od_______908f39hellip1c4a PersonResult od_______908fa3b453

RdftypefoafPersonoavrank 1

RdftypecerifResultEntity

How to query RDF SPARQL (Protocol and RDF Query Language)

bullQuery language of RDF-based databullSPARQL endpoint RDF-triple database on a server available on the WebbullPattern matching languagebullProtocol layerbullQuery interface

How to query

bullSPARQL variables are bound to RDF terms eg title authorbullInspired by SQL via SELECT statement

Example SELECT title author

bullReturn as a table

title authorA Patient from Argentina Infected with Rickettsia

massiliae Juan Carlos Garciacutea

OpenAIRE as LOD

  • OA LOD in BETA version
  • Triples per entity
  • Online data: SPARQL endpoint
  • Offline data: RDF dump
  • Entities and URIs (interactive browsing)
  • Dereferenceable URIs for all entities

http://beta.lod.openaire.eu

Steps

  • Specify an RDF vocabulary
  • Specify terms and namespaces
  • Map the OA data model to an RDF data model
  • Map the OA data to a static RDF dump
  • Specify strategies to automate the RDF generation

Data conforming to LOD best practices, published in BETA in December 2015.

[Diagram labels: Main entities, Linking entities]

http://beta.lod.openaire.eu

OA RDF graph

    …
    prefix oad: <http://lod.openaire.eu/data/>
    prefix oav: <http://lod.openaire.eu/vocab/>
    prefix dbpedia-owl: <http://dbpedia.org/ontology/>
    prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#>
    prefix pext: <http://www.ontotext.com/proton-ontology>
    prefix swrc: <http://swrc.ontoware.org/ontology#>

    oad:07553d8e646b69b868a9791da39a1802 a foaf:Person
        foaf:firstName "P"^^xsd:string
        foaf:lastName "Jha"^^xsd:string
        foaf:name "Jha P"^^xsd:string
        oav:isAuthorOf oad:755469c995c2cb6cb55c3483634b026 a foaf:Person
        oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095
        oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string
        oav:ranking "6"^^xsd:integer
    oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person
        foaf:firstName "T"^^xsd:string
        foaf:lastName "Bere"^^xsd:string
        foaf:name "Bere T"^^xsd:string
    …

[Diagram labels: OpenAIRE data, OA RDF]

Sample query

    select (count(distinct ?s) as ?count) ?flevel
    from <test>
    from <relationsTest>
    where {
      ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
         <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
    }
    GROUP BY ?flevel
    order by ?count

Number of projects per funding level (the query counts entities of type cerif:Project)
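The aggregation in the sample query (count distinct, group by, order by) can be mirrored on a handful of rows. The following is a minimal sketch with invented project identifiers and funding-level values; `count_by_level` is a hypothetical helper and not part of any OpenAIRE service.

```python
from collections import Counter

# (project, funding level) pairs; all values are invented for illustration.
PROJECTS = [("p1", "FP7"), ("p2", "FP7"), ("p3", "H2020"), ("p1", "FP7")]


def count_by_level(rows):
    """Count distinct projects per funding level, ordered by ascending count."""
    distinct = set(rows)  # mirrors count(distinct ?s): repeated rows count once
    counts = Counter(level for _, level in distinct)
    return sorted(counts.items(), key=lambda item: item[1])


result = count_by_level(PROJECTS)
```

The duplicate ("p1", "FP7") row is collapsed before counting, which is the effect of the distinct keyword in the query.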

General architecture

[Architecture diagram. Components shown: OpenAIRE Metadata, RDFization, Interlinking, RDF Store, Deduplication & Inference, Apache Solr, LOD Client, OA Vocabulary, OA Data Model, Browser (HTML, RDF). URLs shown: https://www.openaire.eu, http://beta.lod.openaire.eu]

Interlinking OpenAIRE RDF Graph to LOD cloud

Steps
  • Identify datasets to interlink with
  • Select interlinking tools: LIMES, Silk
  • Test interlinking OA with DBLP and DBpedia
  • Evaluate resulting link sets
  • Specify strategy for interlinking in the OA workflow

[Figure: OA LOD connected to the Linked Open Data (LOD) cloud; legible dataset labels include DBLP, CiteSeer and CEUR]

http://beta.lod.openaire.eu

OA LOD interlinking workflow

Preprocessing
  • Process all the dumps from candidate datasets
  • Prune useless metadata
  • Transform the metadata to key-value pairs (Hadoop: key (ID), value ([properties]))
  • Store in HDFS
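The pruning and key-value steps above can be sketched on a single record. This is a minimal illustration under stated assumptions: the `KEEP` whitelist, the field names, and the tab-separated line layout are hypothetical choices, not the format used in the actual OpenAIRE workflow, and nothing is written to HDFS here.

```python
# Hypothetical whitelist of properties considered useful for link discovery.
KEEP = {"title", "author", "year"}


def to_key_value(record: dict) -> tuple:
    """Prune a metadata record and return (ID, [(property, value), ...])."""
    props = sorted((k, v) for k, v in record.items() if k in KEEP and v)
    return record["id"], props


def to_line(pair: tuple) -> str:
    """Serialize a pair as one tab-separated text line (key, then properties)."""
    key, props = pair
    return key + "\t" + ";".join(f"{k}={v}" for k, v in props)


# Invented record: 'harvest_log' is pruned as useless, 'abstract' because it is empty.
record = {"id": "oad:bde783", "title": "Some title", "author": "N. N.",
          "year": "1996", "harvest_log": "irrelevant", "abstract": ""}
pair = to_key_value(record)
line = to_line(pair)
```

Keeping the entity ID as the key means that all properties of one entity travel together, so a later comparison step can work on one line per entity.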

Sample interlinking result

The result of interlinking is a set of links between URIs from the source and target datasets.

Note: the DBLP dump is not complete.

    <http://lod.openaire…/bde783> owl:sameAs <http://dblp.l3s…/BoissonnatN96>
    <http://lod.openaire…/4f8964> owl:sameAs <http://dblp.l3s…/Shrobe96>
    <http://lod.openaire…/27fea2> owl:sameAs <http://dblp.l3s…/X96c>
    <http://lod.openaire…/f433b9> owl:sameAs <http://dblp.l3s…/LiroyG96>
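A linkset like the one above is easy to load for further checks when it is stored as N-Triples. The following is a minimal sketch, assuming one triple per line with full URIs in angle brackets; the example.org URIs are placeholders because the slide abbreviates the real ones, and `parse_linkset` is a hypothetical helper rather than a full N-Triples parser.

```python
import re

SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
# Matches lines of the form: <subject> <predicate> <object> .
LINE = re.compile(r"<([^>]+)>\s+<([^>]+)>\s+<([^>]+)>\s*\.")


def parse_linkset(text: str) -> set:
    """Collect (source, target) pairs from lines whose predicate is owl:sameAs."""
    links = set()
    for raw in text.splitlines():
        m = LINE.match(raw.strip())
        if m and m.group(2) == SAME_AS:
            links.add((m.group(1), m.group(3)))
    return links


# Placeholder data; the third line is ignored because it is not an owl:sameAs link.
SAMPLE = """
<http://example.org/oa/bde783> <http://www.w3.org/2002/07/owl#sameAs> <http://example.org/dblp/BoissonnatN96> .
<http://example.org/oa/4f8964> <http://www.w3.org/2002/07/owl#sameAs> <http://example.org/dblp/Shrobe96> .
<http://example.org/oa/4f8964> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <http://example.org/other> .
"""
links = parse_linkset(SAMPLE)
```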

Ideas for LOD in Monitoring

Monitoring interlinking: when the target dataset grows from one version to the next, we can expect the linkset to grow as well.
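The monitoring idea can be expressed as a comparison of two linkset versions. The following is a minimal sketch with invented identifiers; `linkset_delta` is a hypothetical helper, and what counts as healthy growth is left open because the slides do not specify it.

```python
def linkset_delta(old: set, new: set) -> dict:
    """Summarize how a linkset changed between two versions of the target dataset."""
    return {
        "added": new - old,                # links found only in the newer version
        "removed": old - new,              # links that disappeared
        "growth": len(new) - len(old),     # net change in linkset size
    }


# Invented (source, target) pairs standing in for owl:sameAs links.
v1 = {("oa:bde783", "dblp:BoissonnatN96")}
v2 = v1 | {("oa:4f8964", "dblp:Shrobe96"), ("oa:27fea2", "dblp:X96c")}
report = linkset_delta(v1, v2)
```

A non-positive growth value, or a non-empty "removed" set, after the target dataset has grown would be a signal worth investigating.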

Scientific events

Bootstrapping datasets for scientific events
  • CEUR-WS.org dataset
  • OpenResearch.org
  • Include events in the OA Data Model (Conference Object)

Measure the quality of events
  • Related to funding and sponsoring
  • Continuity
  • Accepted project publications
  • Reputation of people
  • Location
  • Citation
  • …

Hands on

http://beta.lod.openaire.eu/sparql

Example: What is the overall research output of a given project?

Note: oav:produces and UNION are not working.

    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>

    SELECT ?x ?y WHERE {
      ?y a cerif:ResultEntity .
      { ?y oav:resultType "dataset" . }
      UNION
      { ?y oav:resultType "publication" . }
      ?x a cerif:Project .
      ?y cerif:linkToProject ?x .
    }
    LIMIT 10

Example: Which organizations are more active than others with respect to projects?

    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>

    SELECT ?o
    WHERE {
      ?x oav:projectOrganization ?o .
      ?o a foaf:Organization .
      ?y oav:projectOrganization ?o2 .
      ?o2 a foaf:Organization .
      FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
    }
    LIMIT 10

Example: Which datasets have been published by a specific person who is involved in a given project?

    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    PREFIX dcterms: <http://purl.org/dc/terms/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>

    SELECT ?y
    WHERE {
      ?p cerif:linksToPerson ?x .
      ?x a foaf:Person .
      ?x dcterms:creator ?y .
      ?y oav:resultType "dataset" .
    }
    LIMIT 10

Example: List the full names of all authors who have (co-)authored a publication in project P

    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX oav: <http://lod.openaire.eu/vocab/>
    PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
    PREFIX dcterms: <http://purl.org/dc/terms/>
    PREFIX foaf: <http://xmlns.com/foaf/0.1/>

    SELECT ?y
    WHERE {
      ?p cerif:linksToPerson ?x .
      ?x a foaf:Person .
      ?x dcterms:creator ?y .
      ?y oav:resultType "dataset" .
    }
    LIMIT 10

Page 20: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

How to query

• SPARQL variables are bound to RDF terms, e.g. ?title, ?author
• Inspired by SQL via the SELECT statement

Example: SELECT ?title ?author

• Results are returned as a table

title | author
A Patient from Argentina Infected with Rickettsia massiliae | Juan Carlos García
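To make the "returned as a table" step concrete, here is a small Python sketch that turns a response in the W3C SPARQL 1.1 Query Results JSON Format into column names and rows. The function name `bindings_to_table` is an assumption made for illustration, and the sample response is hand-built from the title and author shown above rather than fetched from the endpoint.

```python
import json


def bindings_to_table(result_json):
    """Convert a SPARQL JSON result document into (columns, rows)."""
    doc = json.loads(result_json)
    columns = doc["head"]["vars"]
    rows = []
    for binding in doc["results"]["bindings"]:
        # A variable may be unbound in a given solution; use None for it.
        rows.append(tuple(
            binding[var]["value"] if var in binding else None
            for var in columns
        ))
    return columns, rows


# Hand-built sample response for: SELECT ?title ?author
sample_response = json.dumps({
    "head": {"vars": ["title", "author"]},
    "results": {"bindings": [{
        "title": {
            "type": "literal",
            "value": "A Patient from Argentina Infected with Rickettsia massiliae",
        },
        "author": {"type": "literal", "value": "Juan Carlos García"},
    }]},
})

columns, rows = bindings_to_table(sample_response)
print(columns)
for row in rows:
    print(row)
```

Each row corresponds to one solution of the query, and each column to one variable named in the SELECT clause.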

Page 21: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

OpenAIRE as LOD

• OA LOD in BETA version
• Triples per entity
• Online data: SPARQL endpoint
• Offline data: RDF dump
• Entities and URIs (interactive browsing)
• Dereferenceable URIs for all entities

http://www.beta.lod.openaire.eu
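As a rough illustration of how the online SPARQL endpoint could be addressed, the Python sketch below builds a GET URL in the style of the SPARQL 1.1 Protocol, which passes the query text in a `query` parameter. The endpoint address is the one given on the hands-on slide of this deck; whether the BETA service is still reachable is not assumed here, so the sketch only constructs the URL and does not send a request. The helper name `build_query_url` is hypothetical.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Endpoint address as given on the hands-on slide of this tutorial.
ENDPOINT = "http://beta.lod.openaire.eu/sparql"


def build_query_url(query, endpoint=ENDPOINT):
    """Return a GET URL carrying the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


query = "SELECT ?s WHERE { ?s a <http://xmlns.com/foaf/0.1/Person> } LIMIT 10"
url = build_query_url(query)
print(url)

# Round trip: decoding the URL gives back the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]
print(decoded == query)
```

A client would typically send this URL with an `Accept` header such as `application/sparql-results+json` to ask for JSON results.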

Page 22: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Steps

• Specify an RDF vocabulary
• Specify terms and namespaces
• Map the OA data model to an RDF data model
• Map the OA data to a static RDF dump
• Specify strategies to automate the RDF generation

Data conforming to LOD best practices published in BETA, December 2015

Main entities, Linking entities

http://beta.lod.openaire.eu

OA RDF graph

…
prefix oad: <http://lod.openaire.eu/data/>
prefix oav: <http://lod.openaire.eu/vocab/>
prefix dbpedia-owl: <http://dbpedia.org/ontology/>
prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl>
prefix pext: <http://www.ontotext.com/proton-ontology>
prefix swrc: <http://swrc.ontoware.org/ontology>
oad:07553d8e646b69b868a9791da39a1802 a foaf:Person
foaf:firstName "P"^^xsd:string
foaf:lastName "Jha"^^xsd:string
foaf:name "Jha P"^^xsd:string
oav:isAuthorOf oad:755469c995c2cb6cb55c3483634b026 a foaf:Person
oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095
oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string
oav:ranking "6"^^xsd:integer
oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person
foaf:firstName "T"^^xsd:string
foaf:lastName "Bere"^^xsd:string
foaf:name "Bere T"^^xsd:string
…

OpenAIRE data

OA RDF

Page 23: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Sample query

SELECT (COUNT(DISTINCT ?s) AS ?count) ?flevel
FROM <test>
FROM <relationsTest>
WHERE {
  ?s a <http://www.eurocris.org/ontologies/cerif/1.3#Project> ;
     <http://lod.openaire.eu/vocab/fundingLevel0> ?flevel .
}
GROUP BY ?flevel
ORDER BY ?count

Number of projects for each funding level

Page 24: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

General architecture

[Architecture diagram. Labels: OpenAIRE Metadata; RDFization; Interlinking; RDF Store; Deduplication & Inference; Apache Solr; LOD Client; OA Vocabulary; OA Data Model; HTML; Browser; RDF. URLs: https://www.openaire.eu, http://beta.lod.openaire.eu]

Interlinking OpenAIRE RDF Graph to LOD cloud

Steps
• Identify datasets to be interlinked to
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate resulting link sets
• Specify strategy for interlinking in OA workflow

[Diagram: the OA LOD graph (illustrated with the RDF sample from the "OA RDF graph" slide) linked to datasets of the Linked Open Data (LOD) cloud, including DBLP, CiteSeer and CEUR; the remaining dataset labels are truncated in the extraction.]

http://beta.lod.openaire.eu

OA LOD interlinking workflow

PreprocessingProcess all the dumps from candidate datasetsPrune useless metadata Transform the metadata to key-value pairs(hadoop key(ID)-value([Properties]))Store in HDFS

Sample interlinking resultResult of interlinking is a set of links between URIs from source and

target dataset

DBLP dump is not complete

lthttplodopenairebde783gt owlsameAs lthttpdblpl3sBoissonnatN96gtlthttplodopenaire4f8964gt owlsameAs lthttpdblpl3sShrobe96gtlthttplodopenaire27fea2gt owlsameAs lthttpdblpl3sX96cgtlthttplodopenairef433b9gt owlsameAs lthttpdblpl3sLiroyG96gt

DBLP

CiteSeer

CEUR Ope

Pu

lAK A

hellipprefix oad lthttplodopenaireeudatagt prefix oav lthttplodopenaireeuvocabgt prefix dbpedia-owl httpdbpediaorgontologyprefix vivo lthttpvivoweborgfilesvivo-isf-public-16owlgt prefix pext lthttpwwwontotextcomproton-ontologygt prefix swrclthttpswrcontowareorgontologygt oad07553d8e646b69b868a9791da39a1802 a foafPerson

foaffirstName P^^xsdstring foaflastName Jha^^xsdstring foafname Jha P^^xsdstringoavisAuthorOf oad755469c995c2cb6cb55c3483634b026 a foafPerson

oavhasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095oavhasLabel personResult_authorship_isAuthorOf^^xsdstringoavranking 6^^xsdintegeroad075558cd104f737d82a34cb7e9fecd7d a foafPersonfoaffirstName T^^xsdstring foaflastName Bere^^xsdstring foafname Bere T^^xsdstringhellip

OA LOD

Linked Open Data(LOD)

Ideas for LOD in Monitoringmonitoring interlinking

when the target dataset grows from one version to another one

we can expect the linkset to grow as well

Scientific eventsBootstrapping datasets for scientific events

CEUR-WSorg datasetOpenResearchorgInclude events in OA Data Model (Conference Object)

Measure the quality of eventsbull Related to funding and sponsoringbull Continualitybull Accepted project publicationsbull Reputation of peoplebull Locationbull Citationbull hellip

Hands on

httpbetalodopenaireeusparql

Example What is the overall research output of a given project

oavproduces and UNION are not workingPREFIX rdf lthttpwwww3org19990222-rdf-syntax-nsgt

PREFIX oav lthttplodopenaireeuvocabgtPREFIX cerif httpwwweurocrisorgontologiescerif13

SELECT x y WHERE

y a cerifResultEntity

y oavresultType dataset

UNION y oavresultType publication

x a cerifProjecty ceriflinkToProject y

LIMIT 10

PREFIX rdf lthttpwwww3org19990222-rdf-syntax-nsgtPREFIX oav lthttplodopenaireeuvocabgt

PREFIX foaf lthttpxmlnscomfoaf01gtSELECT o

WHERE

x oavprojectOrganization oo a foafOrganization

y oavprojectOrganization o2o2 a foafOrganization

FILTER (sameTerm(o o2) ampamp sameTerm(x y)) LIMIT 10

Example What organizations are more active than others wrt projects

PREFIX rdf lthttpwwww3org19990222-rdf-syntax-nsgtPREFIX oav lthttplodopenaireeuvocabgt

PREFIX cerif lthttpwwweurocrisorgontologiescerif13gtPREFIX dcterms lthttppurlorgdctermsgt

PREFIX foaf lthttpxmlnscomfoaf01gtSELECT y

WHERE

p ceriflinksToPerson xx a foafPerson

x dctermscreator yy oavresultType dataset

LIMIT 10

Example What datasets has published by a specific person who involved in a given project

PREFIX rdf lthttpwwww3org19990222-rdf-syntax-nsgtPREFIX oav lthttplodopenaireeuvocabgt

PREFIX cerif lthttpwwweurocrisorgontologiescerif13gtPREFIX dcterms lthttppurlorgdctermsgt

PREFIX foaf lthttpxmlnscomfoaf01gtSELECT y

WHERE

p ceriflinksToPerson xx a foafPerson

x dctermscreator yy oavresultType dataset

LIMIT 10

Example List the full names of all authors who have (co-)authored a publication in project P

  • Making Use of the Linked Open Data Services for OpenAIRE Quer
  • Session outline
  • Open Access Infrastructure for Research in Europe
  • OpenAIRE Services
  • OpenAIRE Data Model
  • Example of data about Core Entities
  • The OpenAIRE vision
  • Use cases
  • Challenges supported by LOD Services
  • Expected values
  • Towards OpenAIRE LOD Services
  • Phase 1 LOD Production
  • Specify vocabularies
  • Slide 14
  • Phase2 Interlinking OA-RDF Graph to LOD cloud
  • RDF (Resource Description Framework)
  • RDF version of example
  • Example of data about Linking entities
  • How to query RDF
  • How to query
  • OpenAIRE as LOD
  • Slide 22
  • Sample query
  • General architecture
  • Interlinking OpenAIRE RDF Graph to LOD cloud
  • OA LOD interlinking workflow
  • Sample interlinking result
  • Slide 28
  • Scientific events
  • Slide 30
  • Slide 31
  • Example What is the overall research output of a given project
  • Example What organizations are more active than others wrt
  • Example What datasets has published by a specific person who i
  • Example List the full names of all authors who have (co-)autho
Page 25: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Interlinking OpenAIRE RDF Graph to LOD cloud

Steps
• Identify the datasets to interlink with
• Select interlinking tools: LIMES, Silk
• Test interlinking OA with DBLP and DBpedia
• Evaluate the resulting link sets
• Specify a strategy for interlinking in the OA workflow

[Diagram: excerpt of the LOD cloud. Legible target datasets: DBLP, CiteSeer, CEUR; the remaining labels are truncated.]

…
@prefix oad: <http://lod.openaire.eu/data/> .
@prefix oav: <http://lod.openaire.eu/vocab/> .
@prefix dbpedia-owl: <http://dbpedia.org/ontology/> .
@prefix vivo: <http://vivoweb.org/files/vivo-isf-public-1.6.owl#> .
@prefix pext: <http://www.ontotext.com/proton-ontology#> .
@prefix swrc: <http://swrc.ontoware.org/ontology#> .

oad:07553d8e646b69b868a9791da39a1802 a foaf:Person ;
    foaf:firstName "P"^^xsd:string ;
    foaf:lastName "Jha"^^xsd:string ;
    foaf:name "Jha P"^^xsd:string ;
    oav:isAuthorOf oad:755469c995c2cb6cb55c3483634b026 a foaf:Person
    oav:hasTarget resultdoajarticles_6fcd7b3b47ebbd05ce73018731ff9095 ;
    oav:hasLabel "personResult_authorship_isAuthorOf"^^xsd:string ;
    oav:ranking "6"^^xsd:integer .
oad:075558cd104f737d82a34cb7e9fecd7d a foaf:Person ;
    foaf:firstName "T"^^xsd:string ;
    foaf:lastName "Bere"^^xsd:string ;
    foaf:name "Bere T"^^xsd:string
…

[Diagram labels: OA LOD, Linked Open Data (LOD), http://beta.lod.openaire.eu]
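The step "Evaluate resulting link sets" can be illustrated with a small sketch. It assumes that a manually checked reference link set is available, which the slides do not state, and it uses precision, recall and F1 as the quality measures; the function name and the identifiers are hypothetical.

```python
def evaluate_linkset(found_links, reference_links):
    """Return (precision, recall, f1) of found links against a reference link set."""
    found = set(found_links)
    reference = set(reference_links)
    true_positives = len(found & reference)
    precision = true_positives / len(found) if found else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    if precision + recall == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical (source, target) pairs, for illustration only.
found = {("oa:1", "dblp:A"), ("oa:2", "dblp:B"), ("oa:3", "dblp:X")}
reference = {("oa:1", "dblp:A"), ("oa:2", "dblp:B"), ("oa:4", "dblp:D")}

precision, recall, f1 = evaluate_linkset(found, reference)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Tools such as LIMES and Silk report their own evaluation figures; this sketch only shows what such figures measure.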

Page 26: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

OA LOD interlinking workflow

Preprocessing
• Process all the dumps from candidate datasets
• Prune useless metadata
• Transform the metadata to key-value pairs (Hadoop: key(ID)-value([Properties]))
• Store in HDFS
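The preprocessing steps above can be sketched in plain Python, without Hadoop. The record layout, the `KEEP` set of retained properties and the tab-separated output line are assumptions made for illustration; the slides only state that each record becomes a key (ID) with a list of properties as its value.

```python
import json

# Hypothetical set of properties considered useful for interlinking.
KEEP = {"title", "authors", "year"}


def to_key_value(record, keep=KEEP):
    """Turn one metadata record into an (ID, properties) pair, pruning other fields."""
    props = {
        name: value
        for name, value in record.items()
        if name in keep and value not in (None, "", [])
    }
    return record["id"], props


def serialize(pair):
    """Render a pair as one tab-separated text line (key, then JSON-encoded value)."""
    key, props = pair
    return f"{key}\t{json.dumps(props, sort_keys=True)}"


# Hypothetical input records.
records = [
    {"id": "oa:1", "title": "A Paper", "authors": ["Doe J"], "year": 2015, "internal_flag": True},
    {"id": "oa:2", "title": "Another Paper", "authors": [], "year": None},
]

lines = [serialize(to_key_value(record)) for record in records]
for line in lines:
    print(line)
```

In a real pipeline the resulting lines would be written to HDFS rather than printed.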

Page 27: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Sample interlinking result

The result of interlinking is a set of links between URIs from the source and target datasets.

Note: the DBLP dump is not complete.

<http://lod.openaire…/bde783> owl:sameAs <http://dblp.l3s…/BoissonnatN96>
<http://lod.openaire…/4f8964> owl:sameAs <http://dblp.l3s…/Shrobe96>
<http://lod.openaire…/27fea2> owl:sameAs <http://dblp.l3s…/X96c>
<http://lod.openaire…/f433b9> owl:sameAs <http://dblp.l3s…/LiroyG96>
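A link set of this shape is easy to load for further processing. The following sketch parses `owl:sameAs` statements into (source, target) pairs with a regular expression; the URIs are hypothetical placeholders under example.org, since the ones on the slide are abbreviated, and a full RDF parser would be the safer choice for real dumps.

```python
import re

# Matches "<source> owl:sameAs <target>", tolerating an optional trailing dot.
LINK_PATTERN = re.compile(r"<([^>]+)>\s+owl:sameAs\s+<([^>]+)>\s*\.?")


def parse_links(text):
    """Return a list of (source URI, target URI) pairs found in the text."""
    return [(match.group(1), match.group(2)) for match in LINK_PATTERN.finditer(text)]


# Hypothetical link set, for illustration only.
sample = """
<http://example.org/source/a1> owl:sameAs <http://example.org/target/x9> .
<http://example.org/source/b2> owl:sameAs <http://example.org/target/y8> .
"""

links = parse_links(sample)
targets_by_source = dict(links)
print(len(links), "links parsed")
```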

Page 28: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Ideas for LOD in Monitoring

Monitoring interlinking: when the target dataset grows from one version to the next, we can expect the link set to grow as well.

[Background figure repeated from the interlinking slide: LOD cloud excerpt (DBLP, CiteSeer, CEUR, other labels truncated) and the sample OA RDF data.]
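The monitoring idea can be sketched as a comparison of two link set versions. The function names and the rule "flag when the target grew but the link set did not" are illustrative assumptions, not something the slides specify.

```python
def linkset_delta(old_links, new_links):
    """Compare two versions of a link set given as iterables of (source, target) pairs."""
    old, new = set(old_links), set(new_links)
    return {
        "added": new - old,
        "removed": old - new,
        "growth": len(new) - len(old),
    }


def needs_review(delta, target_grew):
    """Flag the unexpected case: the target dataset grew but the link set did not."""
    return target_grew and delta["growth"] <= 0


# Hypothetical link sets for two consecutive versions of a target dataset.
links_v1 = {("oa:1", "t:a"), ("oa:2", "t:b")}
links_v2 = {("oa:1", "t:a"), ("oa:2", "t:b"), ("oa:3", "t:c")}

delta = linkset_delta(links_v1, links_v2)
print(delta["growth"], needs_review(delta, target_grew=True))
```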

Page 29: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Scientific events

Bootstrapping datasets for scientific events
• CEUR-WS.org dataset
• OpenResearch.org
• Include events in OA Data Model (Conference Object)

Measure the quality of events
• Related to funding and sponsoring
• Continuity
• Accepted project publications
• Reputation of people
• Location
• Citation
• …

Page 30: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Hands on

Page 31: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

http://beta.lod.openaire.eu/sparql
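Besides the web form, a SPARQL endpoint can usually be queried over HTTP. The sketch below builds such a request with the Python standard library but does not send it. It assumes the endpoint follows the standard SPARQL protocol (a `query` parameter on a GET request) and that the address from the slide is still reachable, neither of which is confirmed here; the query itself is a generic illustration.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Endpoint address as given on the slide; availability is an assumption.
ENDPOINT = "http://beta.lod.openaire.eu/sparql"

# Generic illustrative query, not taken from the slides.
QUERY = """
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?person WHERE { ?person a foaf:Person } LIMIT 10
"""


def build_request(endpoint, query):
    """Build a GET request that asks the endpoint for JSON results."""
    url = f"{endpoint}?{urlencode({'query': query})}"
    return Request(url, headers={"Accept": "application/sparql-results+json"})


request = build_request(ENDPOINT, QUERY)
print(request.full_url)
```

To run the query, pass `request` to `urllib.request.urlopen` and decode the response body with `json.load`.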

Page 32: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Example: What is the overall research output of a given project?

Note: oav:produces and UNION are not working.

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>

SELECT ?x ?y WHERE {
  ?y a cerif:ResultEntity .
  { ?y oav:resultType "dataset" }
  UNION
  { ?y oav:resultType "publication" }
  ?x a cerif:Project .
  ?y cerif:linkToProject ?x .
}
LIMIT 10
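A query of this shape returns one row per (project, result) pair. As a rough sketch of how a client might turn such a response into a per-project list of outputs, the code below parses a document in the standard SPARQL 1.1 Query Results JSON format. The sample response and its `example.org` URIs are made up for illustration and are not real OpenAIRE data. The function names are likewise hypothetical.

```python
import json

# Hypothetical response in the SPARQL 1.1 Query Results JSON format.
# The URIs are invented for illustration only.
SAMPLE_RESPONSE = """
{
  "head": {"vars": ["x", "y"]},
  "results": {"bindings": [
    {"x": {"type": "uri", "value": "http://example.org/project/1"},
     "y": {"type": "uri", "value": "http://example.org/result/a"}},
    {"x": {"type": "uri", "value": "http://example.org/project/1"},
     "y": {"type": "uri", "value": "http://example.org/result/b"}},
    {"x": {"type": "uri", "value": "http://example.org/project/2"},
     "y": {"type": "uri", "value": "http://example.org/result/c"}}
  ]}
}
"""


def rows_from_sparql_json(text):
    """Return result rows as tuples, ordered like the SELECT variables."""
    doc = json.loads(text)
    variables = doc["head"]["vars"]
    rows = []
    for binding in doc["results"]["bindings"]:
        # Unbound variables are simply absent from a binding.
        rows.append(tuple(
            binding[v]["value"] if v in binding else None for v in variables
        ))
    return rows


def outputs_by_project(rows):
    """Group (project, result) rows into {project: [results...]}."""
    grouped = {}
    for project, result in rows:
        grouped.setdefault(project, []).append(result)
    return grouped


rows = rows_from_sparql_json(SAMPLE_RESPONSE)
for project, results in outputs_by_project(rows).items():
    print(project, len(results))
```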

Page 33: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Example: Which organizations are more active than others with respect to projects?

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?o
WHERE {
  ?x oav:projectOrganization ?o .
  ?o a foaf:Organization .
  ?y oav:projectOrganization ?o2 .
  ?o2 a foaf:Organization .
  FILTER (sameTerm(?o, ?o2) && !sameTerm(?x, ?y))
}
LIMIT 10
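The pairwise comparison in this query only shows that an organization takes part in more than one project. To rank organizations by activity, one option is to fetch the plain (project, organization) pairs and count them on the client. The sketch below does that with made-up identifiers. The function name `rank_organizations` and the sample data are hypothetical, and this is a substitute technique, not the approach shown on the slide.

```python
from collections import Counter


def rank_organizations(pairs):
    """Count distinct projects per organization, most active first.

    `pairs` is an iterable of (project, organization) tuples, e.g. the
    rows of a query over oav:projectOrganization.
    """
    distinct = set(pairs)  # drop repeated rows
    counts = Counter(org for _project, org in distinct)
    # Sort by descending count, then by name for a stable order.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical rows, invented for illustration.
sample_pairs = [
    ("project:1", "org:A"),
    ("project:2", "org:A"),
    ("project:1", "org:B"),
    ("project:1", "org:A"),  # duplicate row, counted once
]

for org, n_projects in rank_organizations(sample_pairs):
    print(org, n_projects)
```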

Page 34: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Example: Which datasets have been published by a specific person who is involved in a given project?

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10

Page 35: Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial session)

Example: List the full names of all authors who have (co-)authored a publication in project P

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX oav: <http://lod.openaire.eu/vocab/>
PREFIX cerif: <http://www.eurocris.org/ontologies/cerif/1.3#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?y
WHERE {
  ?p cerif:linksToPerson ?x .
  ?x a foaf:Person .
  ?x dcterms:creator ?y .
  ?y oav:resultType "dataset" .
}
LIMIT 10
