yago-sumo: integrating yago into the suggested upper merged ontology

65
Introduction Approach Conclusion Integrating YAGO into the Suggested Upper Merged Ontology G. de Melo 1 , F. Suchanek 1 , A. Pease 2 1: Max Planck Institute for Informatics, Germany 2: Articulate Software, USA 2008-11-03 G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Upload: gerard-de-melo

Post on 13-Jun-2015

422 views

Category:

Technology


4 download

DESCRIPTION

The YAGO-SUMO integration incorporates millions of entities from YAGO, which is based on Wikipedia and WordNet, into the Suggested Upper Merged Ontology (SUMO), a highly axiomatized formal upper ontology. With the combined force of the two ontologies, an enormous, unprecedented corpus of formalized world knowledge is available for automated processing and reasoning, providing information about millions of entities such as people, cities, organizations, and companies. Compared to the original YAGO, more advanced reasoning is possible due to the axiomatic knowledge delivered by SUMO. A reasoner can conclude e.g. that a child of a human must also be a human and cannot be born before its parents, or that two people sharing the same parents must be siblings.

TRANSCRIPT

Page 1: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Integrating YAGO into theSuggested Upper Merged Ontology

G. de Melo1, F. Suchanek1, A. Pease2

1: Max Planck Institute for Informatics, Germany2: Articulate Software, USA

2008-11-03

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 2: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Outline

1 IntroductionOntologies and KBsSUMOExtending OntologiesYAGO

2 ApproachIncorporationClass InformationStatements

3 ConclusionOngoing WorkSummary

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 3: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

Ontologies/KBs: providebackground knowledge forintelligent applications

Schism:formal ontologies vs. large KBs

Goal: Large-scale formal ontology

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 4: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

Ontologies/KBs: providebackground knowledge forintelligent applications

Schism:formal ontologies vs. large KBs

Goal: Large-scale formal ontology

formal ontologies: complex axioms(e.g. in FOL), but quite small

large-scale KBs (e.g. based onWikipedia): only simple facts

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 5: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

Ontologies/KBs: providebackground knowledge forintelligent applications

Schism:formal ontologies vs. large KBs

Goal: Large-scale formal ontology

combine the best of both worlds!

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 6: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

SUMO

Suggested Upper Merged Ontology

open source

based on KIF rather than e.g. OWL

large formal ontology (20,000 terms, 70,000 axioms)axiomatization of general and domain-specific conceptsfor applications requiring basic “common sense”

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 7: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

SUMO

Suggested Upper Merged Ontology

open source

based on KIF rather than e.g. OWL

origins: IEEE standard upper ontology groupcore owned by IEEE (basically Public Domain), portions GPLe.g.: OpenCyc doesn’t include axioms of commercial Cyc

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 8: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

SUMO

Suggested Upper Merged Ontology

open source

based on KIF rather than e.g. OWL

peer review, community of experts and usersformal verification with ATP systems

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 9: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

SUMO

Suggested Upper Merged Ontology

open source

based on KIF rather than e.g. OWL

OWL without additional rules is not very expressiveKIF variant standardized as ISO/IEC IS 24707:2007(Common Logic)

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 10: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction: Why Axiomatic Ontologies?

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 11: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction: Why Axiomatic Ontologies?

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 12: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

SUMO Example

(=>

(and

(parent ?CHILD ?PARENT)

(subclass ?CLASS Organism)

(instance ?PARENT ?CLASS))

(instance ?CHILD ?CLASS))

This implies, for example, that a child of a Human is also a Human.

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 13: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

Structure of SUMO

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 14: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

SUMO

additional domain ontologies

however, SUMO is mainly an upper ontology

not enough instances and ground facts

e.g. for geography, finance, transportation

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 15: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

SUMO

additional domain ontologies

however, SUMO is mainly an upper ontology

not enough instances and ground facts

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 16: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

SUMO

additional domain ontologies

however, SUMO is mainly an upper ontology

not enough instances and ground facts

e.g. people, cities, books

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 17: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

Extending Ontologies: Possible Approaches

Manual work

Information extraction from corpora / the Web

Import from existing databases

slow process, low coverageSemantic Wikis not yet accepted enough

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 18: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

Extending Ontologies: Possible Approaches

Manual work

Information extraction from corpora / the Web

Import from existing databases

low accuracynot canonical / in line with upper ontology

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 19: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

Extending Ontologies: Possible Approaches

Manual work

Information extraction from corpora / the Web

Import from existing databases

feasible, but not universal enough

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 20: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

YAGO

combine entities and facts from Wikipedia with an upperontology

original YAGO: WordNet for the upper level

New goal: integrate with SUMO

excellent coverage: around 2 million entitiesmillions of facts about themhigh quality: e.g. birth dates of people, location of cities

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 21: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

YAGO

combine entities and facts from Wikipedia with an upperontology

original YAGO: WordNet for the upper level

New goal: integrate with SUMO

mainly a lexical knowledge basee.g. hyponymic relationships do not strictly imply subsumptionslack of formal axioms

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 22: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ontologies and KBsSUMOExtending OntologiesYAGO

Introduction

YAGO

combine entities and facts from Wikipedia with an upperontology

original YAGO: WordNet for the upper level

New goal: integrate with SUMO

so the class information actually is meaningful

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 23: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Outline

1 IntroductionOntologies and KBsSUMOExtending OntologiesYAGO

2 ApproachIncorporationClass InformationStatements

3 ConclusionOngoing WorkSummary

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 24: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Incorporation

Idea: most Wikipedia articles become new entities

Semi-automatic matching: although SUMO contains onlyfew instances, some degree of overlap exists

use weighted string similarity measureadditional manual validation

−→ equivalence table

Entity Generation: produce a new unique term name forWikipedia article not listed in equivalence table, subject to thefollowing desiderata:

prevent clashes with SUMO or other entitiesconcisenessabide to KIF syntax (Wikipedia uses Unicode)must be a proper entity (not: “List of ...”)

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 25: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Incorporation

Idea: most Wikipedia articles become new entities

Semi-automatic matching: although SUMO contains onlyfew instances, some degree of overlap exists

use weighted string similarity measureadditional manual validation

−→ equivalence table

Entity Generation: produce a new unique term name forWikipedia article not listed in equivalence table, subject to thefollowing desiderata:

prevent clashes with SUMO or other entitiesconcisenessabide to KIF syntax (Wikipedia uses Unicode)must be a proper entity (not: “List of ...”)

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 26: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Incorporation

Idea: most Wikipedia articles become new entities

Semi-automatic matching: although SUMO contains onlyfew instances, some degree of overlap exists

use weighted string similarity measureadditional manual validation

−→ equivalence table

Entity Generation: produce a new unique term name forWikipedia article not listed in equivalence table, subject to thefollowing desiderata:

prevent clashes with SUMO or other entitiesconcisenessabide to KIF syntax (Wikipedia uses Unicode)must be a proper entity (not: “List of ...”)

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 27: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

YAGO: From Wikipedia to WordNet

goal: each entity should have class membership information

use Wikipedia category system, however cannot use it directly

first link categories to WordNet, then map to SUMO

requirement: distinguish thematic categories from categoriesencoding class membership

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 28: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

YAGO: From Wikipedia to WordNet

goal: each entity should have class membership information

use Wikipedia category system, however cannot use it directly

first link categories to WordNet, then map to SUMO

requirement: distinguish thematic categories from categoriesencoding class membership

categorization not transitivemembers of subcategories often unrelated to parent category

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 29: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

YAGO: From Wikipedia to WordNet

goal: each entity should have class membership information

use Wikipedia category system, however cannot use it directly

first link categories to WordNet, then map to SUMO

requirement: distinguish thematic categories from categoriesencoding class membership

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 30: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

YAGO: From Wikipedia to WordNet

goal: each entity should have class membership information

use Wikipedia category system, however cannot use it directly

first link categories to WordNet, then map to SUMO

requirement: distinguish thematic categories from categoriesencoding class membership

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 31: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

YAGO

shallowparsing: noungroup parser toidentifyheadword

heuristic:ignorecategories withheadword insingular form

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 32: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

YAGO

shallowparsing: noungroup parser toidentifyheadword

heuristic:ignorecategories withheadword insingular form

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 33: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

YAGO: From Wikipedia to WordNet

check WordNet for premodifier + headword or headword only

disambiguate using frequency information

result: relationship to WordNet-derived class

e.g. “American singer” or “singer”

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 34: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

YAGO: From Wikipedia to WordNet

check WordNet for premodifier + headword or headword only

disambiguate using frequency information

result: relationship to WordNet-derived class

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 35: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

YAGO: From Wikipedia to WordNet

check WordNet for premodifier + headword or headword only

disambiguate using frequency information

result: relationship to WordNet-derived class

American singers of German origin

becomes linked as a subclass to theWordNet-derived class Person

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 36: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

Voting Procedure

problem:

regular polysemy, Wikipedia articles simultaneously coverseveral metonymically related sensese.g. Brown University is both a College and aGroupOfPeople

will cause inconsistencies when the axioms are added

solution:

look at top-level branches for each proposed class (locations,artifacts, abstract entities, etc.)voting procedure to determine most salient branch (ties brokenarbitrarily)

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 37: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

Voting Procedure

problem:

regular polysemy, Wikipedia articles simultaneously coverseveral metonymically related sensese.g. Brown University is both a College and aGroupOfPeople

will cause inconsistencies when the axioms are added

solution:

look at top-level branches for each proposed class (locations,artifacts, abstract entities, etc.)voting procedure to determine most salient branch (ties brokenarbitrarily)

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 38: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

From WordNet toSUMO

good news:

existing manuallyestablished WordNet-SUMO-mappings

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 39: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

From WordNet toSUMO

in some cases, thesemappings provide anequivalent SUMOclass

−→ directly use theSUMO class instead ofthe WordNet oneE.g. Human instead ofWordNet’s “person”

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 40: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

From WordNet toSUMO

in many cases, themappings provide asuper-class

−→ create newWordNet-based class,make it a subclass ofSUMO class

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 41: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

From WordNet to SUMO

in further cases, the mappings yield a property or relation−→ create new WordNet-based class, add axioms of theform(=>

(instance ?ENTITY Guitarist)

(property ?ENTITY Musician))

Then recursively move up WordNet’s class hierarchy addingparent classes, until until a genuine parent class in SUMO isavailable.

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 42: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

Evaluation

lots of heuristics, multiple steps

yet: accuracy of 92.67%± 2.98% (evaluation of most specificgenuine SUMO parents for new instances)

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 43: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Class Information

Evaluation

lots of heuristics, multiple steps

yet: accuracy of 92.67%± 2.98% (evaluation of most specificgenuine SUMO parents for new instances)

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 44: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

Information Extraction

YAGO uses manual rules and heuristics to extract informationabout entities from Wikipedia pages

mainly based on categories and infoboxes, not on article text,e.g. geographical location, spouse, etc.

manual rewriting rules to express facts using SUMO’s terms

sample evaluation: for each relation, at least 95% of thestatements are accurate

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 45: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

Information Extraction

YAGO uses manual rules and heuristics to extract informationabout entities from Wikipedia pages

mainly based on categories and infoboxes, not on article text,e.g. geographical location, spouse, etc.

manual rewriting rules to express facts using SUMO’s terms

sample evaluation: for each relation, at least 95% of thestatements are accurate

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 46: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

Information Extraction

YAGO uses manual rules and heuristics to extract informationabout entities from Wikipedia pages

mainly based on categories and infoboxes, not on article text,e.g. geographical location, spouse, etc.

manual rewriting rules to express facts using SUMO’s terms

sample evaluation: for each relation, at least 95% of thestatements are accurate

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 47: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

Information Extraction

YAGO uses manual rules and heuristics to extract informationabout entities from Wikipedia pages

mainly based on categories and infoboxes, not on article text,e.g. geographical location, spouse, etc.

manual rewriting rules to express facts using SUMO’s terms

sample evaluation: for each relation, at least 95% of thestatements are accurate

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 48: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

SUMO Integration

mapping rules

new relations added to SUMO when necessary

incl. additional rules for reasoning

extracted fact:X hasCapital Y

becomes:(capitalCity Y X)

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 49: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

SUMO Integration

mapping rules

new relations added to SUMO when necessary

incl. additional rules for reasoning

(instance establishedOnDate BinaryRelation)

(domain 1 establishedOnDate Agent)

(domain 2 establishedOnDate TimeInterval)

(=> (establishedOnDate ?OBJ ?TIME)

(exists (?FOUNDING)

(and (instance ?FOUNDING Founding)

(result ?FOUNDING ?OBJ)

(overlapsTemporally (WhenFn ?FOUNDING) TIME))))

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 50: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

SUMO Integration

mapping rules

new relations added to SUMO when necessary

incl. additional rules for reasoning

(instance establishedOnDate BinaryRelation)

(domain 1 establishedOnDate Agent)

(domain 2 establishedOnDate TimeInterval)

(=> (establishedOnDate ?OBJ ?TIME)

(exists (?FOUNDING)

(and (instance ?FOUNDING Founding)

(result ?FOUNDING ?OBJ)

(overlapsTemporally (WhenFn ?FOUNDING) TIME))))

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 51: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

Statements with Literals

proper encoding of literals with units:e.g. (MeasureFn 3.0 SquareMeter)

date ranges are recast(exists ?YEARNO ?MONTHNO ?YEARNO

(and

(birthdate HerveyDeStanton

(DayFn ?DAYNO

(MonthFn ?MONTHNO

(YearFn ?YEARNO))))

(greaterThanOrEqualTo ?YEARNO 1270)

(lessThanOrEqualTo ?YEARNO 1279)))

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 52: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

Statements with Literals

proper encoding of literals with units:e.g. (MeasureFn 3.0 SquareMeter)

date ranges are recast(exists ?YEARNO ?MONTHNO ?YEARNO

(and

(birthdate HerveyDeStanton

(DayFn ?DAYNO

(MonthFn ?MONTHNO

(YearFn ?YEARNO))))

(greaterThanOrEqualTo ?YEARNO 1270)

(lessThanOrEqualTo ?YEARNO 1279)))

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 53: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

Additional Grounding

statements of the form(representsInLanguage

"Immanuel Kant" ImmanuelKant EnglishLanguage)

produce a greater level of formal grounding of the semanticsof term names

when names are ambiguous, providing such symbolic stringsfor multiple languages can further reduce the range of possibleinterpretations

classes are better-specified due to their extensionalcharacterization

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 54: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

Additional Grounding

statements of the form(representsInLanguage

"Immanuel Kant" ImmanuelKant EnglishLanguage)

produce a greater level of formal grounding of the semanticsof term names

when names are ambiguous, providing such symbolic stringsfor multiple languages can further reduce the range of possibleinterpretations

classes are better-specified due to their extensionalcharacterization

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 55: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

IncorporationClass InformationStatements

Statements

Additional Grounding

statements of the form(representsInLanguage

"Immanuel Kant" ImmanuelKant EnglishLanguage)

produce a greater level of formal grounding of the semanticsof term names

when names are ambiguous, providing such symbolic stringsfor multiple languages can further reduce the range of possibleinterpretations

classes are better-specified due to their extensionalcharacterization

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 56: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ongoing WorkSummary

Outline

1 IntroductionOntologies and KBsSUMOExtending OntologiesYAGO

2 ApproachIncorporationClass InformationStatements

3 ConclusionOngoing WorkSummary

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 57: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ongoing WorkSummary

Ongoing Work

Ongoing Work

TPTP transformation for reasoning

SUMO problems in CADE competitions

ATP systems for large-scale reasoning

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 58: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ongoing WorkSummary

Ongoing Work

Ongoing Work

TPTP transformation for reasoning

SUMO problems in CADE competitions

ATP systems for large-scale reasoning

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 59: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ongoing WorkSummary

Ongoing Work

Ongoing Work

TPTP transformation for reasoning

SUMO problems in CADE competitions

ATP systems for large-scale reasoning

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 60: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ongoing WorkSummary

Summary

Summary

SUMO: axiomatic representation of common sense knowledgebut lack of simple encyclopedic facts

YAGO methodology: add entities and statements about themfrom Wikipedia

semi-automatic techniques, basic amount of manual work

−→ formal ontology with around two million entities andseveral million statements and axioms

SUMO is catapulted from an upper level ontology to afull-fledged all-purpose KB

Open source, available online:http://www.demelo.org/yagosumo/

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 61: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ongoing WorkSummary

Summary

Summary

SUMO: axiomatic representation of common sense knowledgebut lack of simple encyclopedic facts

YAGO methodology: add entities and statements about themfrom Wikipedia

semi-automatic techniques, basic amount of manual work

−→ formal ontology with around two million entities andseveral million statements and axioms

SUMO is catapulted from an upper level ontology to afull-fledged all-purpose KB

Open source, available online:http://www.demelo.org/yagosumo/

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 62: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ongoing WorkSummary

Summary

Summary

SUMO: axiomatic representation of common sense knowledgebut lack of simple encyclopedic facts

YAGO methodology: add entities and statements about themfrom Wikipedia

semi-automatic techniques, basic amount of manual work

−→ formal ontology with around two million entities andseveral million statements and axioms

SUMO is catapulted from an upper level ontology to afull-fledged all-purpose KB

Open source, available online:http://www.demelo.org/yagosumo/

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 63: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ongoing WorkSummary

Summary

Summary

SUMO: axiomatic representation of common sense knowledgebut lack of simple encyclopedic facts

YAGO methodology: add entities and statements about themfrom Wikipedia

semi-automatic techniques, basic amount of manual work

−→ formal ontology with around two million entities andseveral million statements and axioms

SUMO is catapulted from an upper level ontology to afull-fledged all-purpose KB

Open source, available online:http://www.demelo.org/yagosumo/

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 64: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ongoing WorkSummary

Summary

Summary

SUMO: axiomatic representation of common sense knowledgebut lack of simple encyclopedic facts

YAGO methodology: add entities and statements about themfrom Wikipedia

semi-automatic techniques, basic amount of manual work

−→ formal ontology with around two million entities andseveral million statements and axioms

SUMO is catapulted from an upper level ontology to afull-fledged all-purpose KB

Open source, available online:http://www.demelo.org/yagosumo/

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology

Page 65: YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology

IntroductionApproach

Conclusion

Ongoing WorkSummary

Summary

Summary

SUMO: axiomatic representation of common sense knowledgebut lack of simple encyclopedic facts

YAGO methodology: add entities and statements about themfrom Wikipedia

semi-automatic techniques, basic amount of manual work

−→ formal ontology with around two million entities andseveral million statements and axioms

SUMO is catapulted from an upper level ontology to afull-fledged all-purpose KB

Open source, available online:http://www.demelo.org/yagosumo/

G. de Melo, F. Suchanek, A. Pease Integrating YAGO into theSuggested Upper Merged Ontology