yago: yet another great ontologyturing.cs.washington.edu/announce/docs-2009/suchanek.pdf · fabian...

63
YAGO - Yet Another Great Ontology 1 Fabian M. Suchanek YAGO: Yet Another Great Ontology Fabian M. Suchanek (joint work with Gjergji Kasneci, Mauro Sozio and Gerhard Weikum) (Max-Planck-Institute for Informatics, Saarbrücken/Germany)

Upload: others

Post on 07-Jul-2020

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 1Fabian M. Suchanek

YAGO:Yet Another Great Ontology

Fabian M. Suchanek (joint work with Gjergji Kasneci, Mauro Sozio and Gerhard Weikum)

(Max-Planck-Institute for Informatics, Saarbrücken/Germany)

Page 2: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 2Fabian M. Suchanek

Thank you,

Prof. Etzioni,Prof. Weld,Mr. Allen

for inviting me so expeditiously!

Before we start the talk...

Page 3: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 3Fabian M. Suchanek

Fabian M. Suchanek2003 BSc. in Cognitive Science from U Osnabrück/Germany2005 MSc. in Computer Science from U Saarbrücken/Germany2008 PhD in Computer Science at the Max-Planck Institute for Informatics in Saarbrücken/Germany, working on Ontologies and Information Extraction

About myself

Page 4: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 4Fabian M. Suchanek

The Search for Excellent Scientists

Max-Planck Institute University of Washington

Page 5: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 5Fabian M. Suchanek

scientist musician prize

Invisible Gorilla steals the Nobel Prize...The gorilla, plus dropped food and country music, were honored...

newscientist.org/article/invisibleGorilla.htm Cached Similar pages

The Search for Excellent Scientists

Page 6: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 6Fabian M. Suchanek

scientist who are musicians and won a prize

Invisible Gorilla steals the Nobel Prize...The gorilla, plus dropped food and country music, were honored...

newscientist.org/article/invisibleGorilla.htm Cached Similar pages

The Search for Excellent Scientists

Page 7: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 7Fabian M. Suchanek

Please give me IMMEDIATELY the scientists who are...

Invisible Gorilla steals the Nobel Prize...The gorilla, plus dropped food and country music, were honored...

newscientist.org/article/invisibleGorilla.htm Cached Similar pages

The Search for Excellent Scientists

Page 8: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 8Fabian M. Suchanek

1980born

scientist

personsubclass

"Sam Smart" "Dr. Smart"

means means

Words

is a

Individuals

Classes

Relations

Solution: An Ontology

musician

is a

subclass

Page 9: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 9Fabian M. Suchanek

Vision

The world, I‘d like to say, even though some may contradict, is not as it seems. It rather seems as if the world seems not what it seems

Gathering the knowledge of this world in a structured ontology.

Semantic Search ر

Question answering ر

Machine Translation ر

Document classification ر

… ر

Page 10: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 10Fabian M. Suchanek

Where do we get the ontology from?

Assemble the ontology manually ر

(WordNet, SUMO, Cyc, GeneOntology)

Question: Can we increase coverage? (MPI is in none of these)

Extract information from corpora (e.g. the Web)ر

(Espresso, Snowball, LEILA, TextRunner, KnowItAll)

Question: Can we increase accuracy? Can we achieve canonicity?

Use community work (Semantic Wikipedia, Freebase)ر

Question: Will it take off?

Extract the ontology from a semi-structured corpus (Wikipedia)ر

(KYLIN/KOG, DBpedia)

Question: Can we go beyond Wikipedia?

Page 11: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 11Fabian M. Suchanek

Our Approach (= this talk)

Extract information from corpora (e.g. the Web)ر

to extend the core ontology: SOFIE

Extract the ontology from a semi-structured corpus (Wikipedia)ر

as a core ontology: YAGO

Page 12: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 12Fabian M. Suchanek

YAGO Construction: Infoboxes

blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter

Name: Sam Smart

Born in: Berlin

...

BerlinbornIn

Exploit infoboxes

Smart, S

Page 13: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 13Fabian M. Suchanek

YAGO Construction: Categories

blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter

Categories:

1980_births

Exploit relational categories

bornIn

Exploit infoboxes

1980born Berlin

Smart, S

Page 14: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 14Fabian M. Suchanek

YAGO Construction: Categories

blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter

Categories:

German_scientists

Exploit relational categories

bornIn

Exploit infoboxes

1980born

Exploit conceptual categories

is aGermanScientist

Berlin

Smart, S

Page 15: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 15Fabian M. Suchanek

YAGO Construction: Categories

blah blah blub Elvis (don't read this! Better listen to the talk!) laber fasel suelz. Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Blub, aber blah! Insbesondere, blub, texte zu, und so weiter blah blah blub Elvis laber fasel suelz. Insbesondere, blub, texte zu, und so weiter

Categories:

Physics

Exploit relational categories

bornIn

Exploit infoboxes

1980born

Exploit conceptual categoriesAvoid thematic categories

is aPhysics

Berlin

is aGermanScientist

Smart, S

Page 16: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 16Fabian M. Suchanek

YAGO Construction: Thematic vs Conceptual

American scientists of German origin

Premodifier Head PostmodifierShallow linguistic noun phrase parsing:

Heuristics: If the head is a plural word, the category is conceptual

Biomedical science in Germany

conceptual:

thematic:

Page 17: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 17Fabian M. Suchanek

YAGO Construction: Upper Model

1980born

German Scientist

is a

person

entity

?

Page 18: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 18Fabian M. Suchanek

YAGO Construction: Upper Model

1980born

German Scientist

is a

People_by_occupation

Business

?Social_group

Page 19: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 19Fabian M. Suchanek

YAGO Construction: Upper Model

1980born

German Scientist

is a

Scientist 1

Person

subclass WordNet

Wikipedia

Scientist 42....

Page 20: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 20Fabian M. Suchanek

YAGO Construction: Upper Model

1980born

German Scientist

is a

Scientist

Person

subclass

subclass

"scientist"means

"S. Smart"means

WordNet

Wikipedia

[WWW 2007]

Page 21: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 21Fabian M. Suchanek

YAGO: Relations

is afamilyNamegivenNamebornOnDatediedOnDatebornIndiedInlocatedIn

establishedOnDateisMarriedTohasPopulationhasHeighthasWeighthasInflationactedIn...

90 relations

Manual evaluation:

95% correct

Page 22: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 22Fabian M. Suchanek

YAGO: Size*

KnowItAll SUMO WordNet OpenCyc Cyc

30,000 60,000 200,000 300,000

3,000,000

19,000,000

Yago

* Publicly available ontologies with a quality guarantee. Size is not correlated with usefulness.

Page 23: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 23Fabian M. Suchanek

YAGO Model: Why binary is not enough

is a

1998

Wikipedia

since

source

#1 (Sam, is_a, scientist)

#2 (#1, since, 1998)

#3 (#1, source, Wikipedia)

scientist

Page 24: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 24Fabian M. Suchanek

YAGO Model: Formal view

A YAGO ontology over

a set of relations R رa set of common entities C رa set of fact identifiers I رis a function

I (RCI) R (RIC)

We can talk aboutfacts (#1, source, Wikipedia) رadditional arguments (#1, since, 1998) رrelations (time, hasRange, time_interval) ر

#1 (Sam, is_a, scientist)

#2 (#1, since, 1998)

#3 (#1, source, Wikipedia)

Still: Decideable

Consistency

[WWW 2007]

Page 25: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 25Fabian M. Suchanek

A Hitchhiker's Guide to Ontology

DBpedia(HU Berlin)

Cyc(commercial)

Freebase(community)

UMBEL(commercial)

Linking Open Data(HU Berlin, U Leipzig, OLS Inc.)

YAGO

SUMO(research project)

YAGO forms taxonomic backbone

YAGO is part of the project by its Web service

YAGO contributes the entities

Planned

YAGO will be included

YAGO and SUMO have been merged

[Elsevier 2008]

KOG(U Washington)

YAGO helps learn the taxonomy

Page 26: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 26Fabian M. Suchanek

This talk

Extract information from corpora (e.g. the Web)ر

to extend the core ontology: SOFIE

Extract the ontology from a semi-structured corpus (Wikipedia)ر

as a core ontology: YAGO

Page 27: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 27Fabian M. Suchanek

Extending the Ontology

Dr. Smart was born in 1980.

Dr. Smart 1980was born in

Our first approach:

LEILA - Combining Linguistic and Statistical Analysis [SIGKDD 2006]

Worked well, but was slow.

Page 28: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 28Fabian M. Suchanek

Extending the Ontology

Dr. Smart was born in 1980.

Dr. Smart 1980was born in

bornInYear(Person, Year)

Page 29: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 29Fabian M. Suchanek

Extending the Ontology

Dr. Smart was born in 1980.

Dr. Smart 1980bornInYear

1. Mapping patterns to relations

Page 30: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 30Fabian M. Suchanek

Extending the Ontology

Dr. Smart was born in 1980.

1980bornInYear

1. Mapping patterns to relations

2. Disambiguating entity names

diedInYear 1776

Page 31: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 31Fabian M. Suchanek

Extending the Ontology

Dr. Smart was born in 1980.

1980bornInYear

1. Mapping patterns to relations

2. Disambiguating entity names

3. Performing logical reasoning

Page 32: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 32Fabian M. Suchanek

SOFIE: A Unifying Framework

1. Mapping patterns to relations

2. Disambiguating entity names

3. Performing logical reasoning„Elvis was born in 1937.“

1937bornInYear

+

„X was born in Y“

is a good pattern forbornInYear

Page 33: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 33Fabian M. Suchanek

SOFIE: A Unifying Framework

1. Mapping patterns to relations

2. Disambiguating entity names

3. Performing logical reasoning„Dr. Smart was born in 1980.“

1980bornInYear

+

„X was born in Y“ is a good pattern for

bornInYear

Page 34: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 34Fabian M. Suchanek

„...The world as such, I would like to say – even though some will contradict – is not as it seems. As Dr. Smart pointed out, in his ground-breaking paper „The world according to Smart“, the world rather seems not what it seems...“

SOFIE: A Unifying Framework

1. Mapping patterns to relations

2. Disambiguating entity names

3. Performing logical reasoning

r(x,y) /\ occurs(p,x,y) => isGoodPattern(p,r)

isGoodPattern(p,r) /\ occurs(p,x',y') => r(x',y')

disambiguate(„Dr. Smart“, Sam_Smart)[0.8]

0.8

0.2

disambiguate(„Dr. Smart“, Lisa_Smart)[0.2]

Page 35: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 35Fabian M. Suchanek

SOFIE: A Unifying Framework

1. Mapping patterns to relations

2. Disambiguating entity names

3. Performing logical reasoning

New !

r(x,y) /\ occurs(p,x,y) => isGoodPattern(p,r)

isGoodPattern(p,r) /\ occurs(p,x',y') => r(x',y')

bornInYear(x,b) /\ diedInYear(x,d) => b<d

disambiguate(„Dr. Smart“, Sam_Smart)[0.8]

It's all just logical formulae with weights

Find truth values for the literals so that a maximal number of formulae is happy! [WWW 2009]

Page 36: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 36Fabian M. Suchanek

SOFIE: A Unifying Framework

r(x,y) /\ occurs(p,x,y) => isGoodPattern(p,r)

isGoodPattern(p,r) /\ occurs(p,x',y') => r(x',y')

bornInYear(x,b) /\ diedInYear(x,d) => b<d

disambiguate(„Dr. Smart“, Sam_Smart)[0.8]

It's all just logical formulae with weights

Find truth values for the literals so that a maximal number of formulae is happy!

bornInYear(Elvis,1937)

YAGO

Page 37: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 37Fabian M. Suchanek

SOFIE: Information Extraction as MAX SAT

r(x,y) /\ s(x,z) => t(x,z) [w]...

Problem:

The Weighted MAX SAT Problem is NP-hardر

Our instance contains YAGO (19 million facts)ر and textual facts (e.g. some dozens of thousands)

The best-known approximation algorithmر cannot deal well with our specific instance

We have a Weighted MAX SAT Problem

Page 38: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 38Fabian M. Suchanek

SOFIE: A Unifying Framework

/\ r(a,b) => s(x,y)

Sam_Smart 1980

AlgorithmFunctional MAX SAT

FOR i=1 TO 42...NEXT i

Polynomial time

Approximation

Guarantee

Weighted MAX SAT Problem

Page 39: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 39Fabian M. Suchanek

SOFIE: Relation to Markov Logic

P

bornIn(SamSmart, 1980)

false true

P(X) ~ Π e sat(i,X) wi

Number of satisfied instances of the ith formula

Weight of the ith formula

r(x,y) /\ s(x,z) => t(x,z) [w]...

max X Π e sat(i,X) wi

max X log( Π e sat(i,X) wi )

max X Σ sat(i,X) wi

Page 40: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 40Fabian M. Suchanek

SOFIE: Relation to Markov Logic

P

bornIn(Sam, 1980)

false true

P(X) ~ Π e sat(i,X) wi

Number of satisfied instances of the ith formula

Weight of the ith formula

r(x,y) /\ s(x,z) => t(x,z) [w]...

max X Π e sat(i,X) wi

max X log( Π e sat(i,X) wi )

max X Σ sat(i,X) wi

Weighted MAX SAT problem

Page 41: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 41Fabian M. Suchanek

SOFIE: Experiments

91%24min1150unstructuredNews article toy corpus

90%15h53440unstructuredBiographies from Web

94%15h152000semi-structured

Wikipedia subcorpus

100%8min3100structuredWikipedia toy corpus

PrecisionTimeRelations# DocsTypeCorpus

Page 42: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 42Fabian M. Suchanek

The Excellent Scientist

Which scientist was also a musician and has won a prize?

hasWonPrize

is amusician

scientist is a

Back to the original question:

Page 43: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 43Fabian M. Suchanek

X isa scientist

X isa musician

X hasWonPrize Y

The Excellent Scientist

Which scientist was also a musician and has won a prize?

Back to the original question:

Page 44: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 44Fabian M. Suchanek

Page 45: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 45Fabian M. Suchanek

YAGO – A Core of Semantic Knowledge - Mozilla

Page 46: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 46Fabian M. Suchanek

Page 47: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 47Fabian M. Suchanek

Page 48: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 48Fabian M. Suchanek

I am a Screenshot

Brian May – Musician and Astrophysicist

Page 49: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 49Fabian M. Suchanek

Conclusion

They do existر

http://mpii.de/yago

We made a step towards gatheringر the knowledge of this world in a structured ontology

YAGOSOFIEThe world, I‘d

like to say, even though some may contradict, is not as it seems. It rather seems as if the world seems not what it seems

Page 50: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 50Fabian M. Suchanek

References

[SIGKDD 2006] Fabian M. Suchanek, Georgiana Ifrim and Gerhard Weikum "Combining Linguistic and Statistical Analysis to Extract Relations from Web Documents" International Conference on Knowledge Discovery and Data Mining (SIGKDD 2006)

[WWW 2007] Fabian M. Suchanek, Gjergji Kasneci and Gerhard Weikum "YAGO - A Core of Semantic Knowledge" International World Wide Web conference (WWW 2007)

[Elsevier 2008] Fabian M. Suchanek, Gjergji Kasneci and Gerhard Weikum "YAGO - A Large Ontology from Wikipedia and WordNet" Elsevier Journal of Web Semantics 2008

[WWW 2009] Fabian M. Suchanek, Mauro Sozio, Gerhard Weikum „SOFIE – A Self-Organizing Framework for Information Extraction“ International World Wide Web conference (WWW 2009)

Page 51: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 51Fabian M. Suchanek

Future Work at the Max-Planck Institute

personalize YAGO (Shady Elbassuoni, Maya Ramanath) رuse social networks to extend YAGO (Maya, Sharat, Ashwin) رmake YAGO multilingual (Gerard de Melo) رadd Web services (Nicoleta Preda) رmake querying efficient (Gjergji Kasneci) رstore YAGO efficiently (Thomas Neumann) ر

make reasoning efficient (Mauro Sozio,Martin Theobald)رprovide good visualization (Shady Elbassuoni) رadd biomedical knowledge (Alessandro Fiori) رadd multimodal support (Martin Schreiber) رadd natural language support (with Alexander Koller) ر

Page 52: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 52Fabian M. Suchanek

Future Work at UW?

learn also relations and classes (KOG, KYLIN) رinvestigate multilingual support (PanImages) رinvestigate reasoning/inferencing (Holmes, MLN) رfeed information back into Wikipedia (Intelligence in Wikipedia) رextract information on a larger scale (KnowItAll/TextRunner) رguide the information extraction process (Alice/LLL) ر

Instead of competing with these projects,

join forces with them!

Page 53: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 53Fabian M. Suchanek

YAGO: Knowledge Representation

OWL Full

OWL DL

RDFS

YAGO

subClassOf

Reification

ADTs

Transitivity

Acyclicity

Datatypes

Property Restrictions

Page 54: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 54Fabian M. Suchanek

YAGO Model: Logical aspects

Axioms:(x, is_a, y)

(y, subclass, z)

=> (x, is_a, z)

...

scientist

person

subclass

is a

is a

Page 55: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - Yet Another Great Ontology 55Fabian M. Suchanek

YAGO Model: Logical aspects

Axioms:(x, is_a, y)

(y, subclass, z)

=> (x, is_a, z)

...f1, f2, f3, f4, f5

f1, f2, f3

f1, f2, f3, f4, f5, f6, f7, f8, f9, f10

derive facts

Eliminate facts

finite, unique

finite, unique[WWW 2007]

Page 56: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 56Fabian M. Suchanek

SOFIE rules!

occurs(P,WX,WY)/\ refersTo(WX.X)/\ refersTo(WY,Y)/\ R(X,Y) => expresses(P,R)

occurs(P,WX,WY)/\ expressed(P,R)/\ refersTo(WX.X)/\ refersTo(WY,Y)/\ range(R,D1)/\ domain(R,D2)/\ type(X,D1)/\ type(Y,D2)=> R(X,Y) R(X,Y)

R(X,Y)/\ R(X,Z)/\ type(R,functionalRelation)=> Y = Z

disambiguationPrior(W,X) => refersTo(W,X)

bornInYear(X,B) /\ diedInYear(X,D) => B<D

Page 57: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 57Fabian M. Suchanek

SOFIE: Clause transformation

r(X,Y) /\ s(X,Y) => t(X,X)u(a)

Entities {a,b}

r(a,a) /\ s(a,a) => t(a,a)r(a,b) /\ s(a,b) => t(a,a)r(b,a) /\ s(b,a) => t(b,b)r(b,b) /\ s(b,b) => t(b,b)u(a)

Rules

Grounded Rules r(a,a) \/ s(a,a) \/ t(a,a)

r(a,b) \/ s(a,b) \/ t(a,a)

r(b,a) \/ s(b,a) \/ t(b,b)

r(b,b) \/ s(b,b) \/ t(b,b)

u(a)

Clauses

Page 58: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 58Fabian M. Suchanek

SOFIE: Clause transformation

r(a,a) \/ s(a,a) \/ t(a,a)

r(a,b) \/ s(a,b) \/ t(a,a)

r(b,a) \/ s(b,a) \/ t(b,b)

r(b,b) \/ s(b,b) \/ t(b,b)

u(a)

Clauses

s(a,a) YAGO

1r(a,a) [w1]r(a,b) [w2]r(b,a) [w3]r(b,b) [w4]

Textual Facts

Page 59: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 59Fabian M. Suchanek

SOFIE: Clause weighting

1 \/ 1 \/ t(a,a) [w1]

1 \/ s(a,b) \/ t(a,a) [w2]

1 \/ s(b,a) \/ t(b,b) [w3]

1 \/ s(b,b) \/ t(b,b) [w4]

u(a) [W]

Clausesr(a,a) [w1]r(a,b) [w2]r(b,a) [w3]r(b,b) [w4]

Textual Facts

s(a,a) YAGO

Page 60: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 60Fabian M. Suchanek

SOFIE: Hypothesis generation

r(a,b) [w1]Textual Facts

r(X,Y) /\ s(X,Y) => t(X,X)Rules

t(a,a)t(b,b)

Hypotheses

Page 61: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 61Fabian M. Suchanek

SOFIE: Hypothesis generation

r(a,a) /\ s(a,a) => t(a,a)r(a,b) /\ s(a,b) => t(a,a)

Grounded Rules

r(X,Y) /\ s(X,Y) => t(X,X)Rules

t(a,a)Hypotheses

Page 62: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 62Fabian M. Suchanek

SOFIE: Functional MAX SAT Algorithm

X \/ Z [w1]

X \/ Y [w1]

Y \/ Z [w1]

Z [w1]

ClausesXYZ

Variables

The functional MAX SAT Algorithm considers only unit clauses.

=1=0

=0

Page 63: YAGO: Yet Another Great Ontologyturing.cs.washington.edu/announce/docs-2009/Suchanek.pdf · Fabian M. Suchanek YAGO - Yet Another Great Ontology 24 YAGO Model: Formal view A YAGO

YAGO - A Core of Semantic Knowledge 63Fabian M. Suchanek

SOFIE: Experiments

74465

35, 46

505

165

165

# Facts

57%100%8min3100semi-structured 50% infoboxes removed

Wikipedia toy corpus

31%56% Snowball90%

91%

94%

100%

Precision

?

24%, 31%

?

98%

Recall

24min1150unstructuredNews article toy corpus

15h53440unstructuredBiographies from Web

15h152000semi-structuredWikipedia subcorpus

8min3100structuredWikipedia toy corpus

Time# Rel# DocsTypeCorpus