link discovery tutorial part v: hands-on

22
Link Discovery Tutorial Part V: Hands-On Axel-Cyrille Ngonga Ngomo (1) , Irini Fundulaki (2) , Mohamed Ahmed Sherif (1) (1) Institute for Applied Informatics, Germany (2) FORTH, Greece October 18th, 2016 Kobe, Japan Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 1 / 19

Upload: holistic-benchmarking-of-big-linked-data

Post on 14-Jan-2017

429 views

Category:

Science


0 download

TRANSCRIPT

Page 1: Link Discovery Tutorial Part V: Hands-On

Link Discovery TutorialPart V: Hands-On

Axel-Cyrille Ngonga Ngomo(1), Irini Fundulaki(2), Mohamed Ahmed Sherif(1)

(1) Institute for Applied Informatics, Germany(2) FORTH, Greece

October 18th, 2016Kobe, Japan

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 1 / 19

Page 2: Link Discovery Tutorial Part V: Hands-On

Table of Contents

1 Test Dataset

2 Task I: Execute given Limes Configuration

3 Task II: Create your first Limes Configuration

4 Task III: Use Limes GUI

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 2 / 19

Page 3: Link Discovery Tutorial Part V: Hands-On

Table of Contents

1 Test Dataset

2 Task I: Execute given Limes Configuration

3 Task II: Create your first Limes Configuration

4 Task III: Use Limes GUI

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 3 / 19

Page 4: Link Discovery Tutorial Part V: Hands-On

Test Dataset

Semantic Web Dog Food CorpusData exposed: Metadata (papers, presentations, people) for several semantic webrelated conferences and workshops, including the most recent ISWC, ESWC andWWW events.

Dumps: http://data.semanticweb.org/dumpsEndpoint: http://data.semanticweb.org/sparqlDataHub: https://datahub.io/dataset/semantic-web-dog-foodDownload:http://iswc2016ldtutorial.aksw.org/tutorial-material/

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 4 / 19

Page 5: Link Discovery Tutorial Part V: Hands-On

Table of Contents

1 Test Dataset

2 Task I: Execute given Limes Configuration

3 Task II: Create your first Limes Configuration

4 Task III: Use Limes GUI

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 5 / 19

Page 6: Link Discovery Tutorial Part V: Hands-On

Task I: DeduplicationExecute the given Configuration File

Decentralized nature of LODData contain duplicatesHow to efficiently detect similar resources?

Task IFind duplicate authors in Semantic Web Dog Food Dataset

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 6 / 19

Page 7: Link Discovery Tutorial Part V: Hands-On

Task I: DeduplicationExecute the given Configuration File

Decentralized nature of LODData contain duplicatesHow to efficiently detect similar resources?

Task IFind duplicate authors in Semantic Web Dog Food Dataset

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 6 / 19

Page 8: Link Discovery Tutorial Part V: Hands-On

Limes Configuration FilePREFIXes

<PREFIX ><NAMESPACE >http :// www.w3.org /2000/01/ rdf - schema #</ NAMESPACE ><LABEL >rdfs </ LABEL >

</ PREFIX >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 7 / 19

Page 9: Link Discovery Tutorial Part V: Hands-On

Limes Configuration FileSOURCE and TARGET datasets

<SOURCE ><ID >SDF1 </ID ><ENDPOINT >semanticDogFood .nt </ ENDPOINT ><VAR >?x</VAR ><PAGESIZE > -1</ PAGESIZE ><RESTRICTION >?x a foaf: Person </ RESTRICTION ><PROPERTY >rdfs: label </ PROPERTY ><TYPE >NT </TYPE >

</ SOURCE >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 8 / 19

Page 10: Link Discovery Tutorial Part V: Hands-On

Limes Configuration FileMETRIC

<METRIC >Levenshtein (x.rdfs: label , y.rdfs: label )</ METRIC >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 9 / 19

Page 11: Link Discovery Tutorial Part V: Hands-On

Limes Configuration FileACCEPTANCE and REVIEW

<ACCEPTANCE ><THRESHOLD >0.9 </ THRESHOLD ><FILE >similarAuthor .nt </FILE ><RELATION >ov: similarTo </ RELATION >

</ ACCEPTANCE >

<REVIEW ><THRESHOLD >0.5 </ THRESHOLD ><FILE >similarAuthor_review .nt </FILE ><RELATION >owl: sameAs </ RELATION >

</ REVIEW >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 10 / 19

Page 12: Link Discovery Tutorial Part V: Hands-On

Limes Configuration FileOUTPUT format

<OUTPUT >TTL </ OUTPUT >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 11 / 19

Page 13: Link Discovery Tutorial Part V: Hands-On

Task IRun Limes

Run Limesjava -jar limes-core-1.0.0.jar task1.xml

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 12 / 19

Page 14: Link Discovery Tutorial Part V: Hands-On

Task IUsing Machine Learning

<MLALGORITHM ><NAME >wombat simple </NAME ><TYPE >unsupervised </TYPE >

</ MLALGORITHM >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 13 / 19

Page 15: Link Discovery Tutorial Part V: Hands-On

Table of Contents

1 Test Dataset

2 Task I: Execute given Limes Configuration

3 Task II: Create your first Limes Configuration

4 Task III: Use Limes GUI

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 14 / 19

Page 16: Link Discovery Tutorial Part V: Hands-On

Task IIFind similar publications

1 Find publications2 with similar keyword, but do not link any publication to itself

1. Find publications?x a swrc:InProceedings

?y a swrc:InProceedings

2. with similar keyword, but do not link any publication to itselfMINUS(jaccard(x.swrc:listKeyword , y.swrc:listKeyword)|0.3,ExactMatch(x.swrc:listKeyword , y.swrc:listKeyword)|1.0)

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 15 / 19

Page 17: Link Discovery Tutorial Part V: Hands-On

Task IIFind similar publications

1 Find publications2 with similar keyword, but do not link any publication to itself

1. Find publications?x a swrc:InProceedings

?y a swrc:InProceedings

2. with similar keyword, but do not link any publication to itselfMINUS(jaccard(x.swrc:listKeyword , y.swrc:listKeyword)|0.3,ExactMatch(x.swrc:listKeyword , y.swrc:listKeyword)|1.0)

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 15 / 19

Page 18: Link Discovery Tutorial Part V: Hands-On

Task IIFind similar publications

1 Find publications2 with similar keyword, but do not link any publication to itself

1. Find publications?x a swrc:InProceedings

?y a swrc:InProceedings

2. with similar keyword, but do not link any publication to itselfMINUS(jaccard(x.swrc:listKeyword , y.swrc:listKeyword)|0.3,ExactMatch(x.swrc:listKeyword , y.swrc:listKeyword)|1.0)

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 15 / 19

Page 19: Link Discovery Tutorial Part V: Hands-On

Table of Contents

1 Test Dataset

2 Task I: Execute given Limes Configuration

3 Task II: Create your first Limes Configuration

4 Task III: Use Limes GUI

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 16 / 19

Page 20: Link Discovery Tutorial Part V: Hands-On

Task IIIUse Limes GUI

Run Limes GUIjava -jar limes-core-1.0.0.jar -g

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 17 / 19

Page 21: Link Discovery Tutorial Part V: Hands-On

Acknowledgment

This work was supported by grants from the EU H2020 Framework Programmeprovided for the project HOBBIT (GA no. 688227).

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 18 / 19

Page 22: Link Discovery Tutorial Part V: Hands-On

References I

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 19 / 19