1 ontacwg: coordinating knowledge classifications patrick cassidy mitre corporation* presented at...

47
1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government Conference February 9, 2006 MITRE–McLean, Virginia * NOTE: The author’s affiliation with The MITRE Corporation is provided for identification purposes only, and is not intended to convey or imply MITRE’s concurrence with, or support for, the positions, opinions or viewpoints expressed by the author.

Upload: olivia-nelson

Post on 17-Jan-2016

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

1

ONTACWG: Coordinating Knowledge Classifications

Patrick CassidyMITRE Corporation*

Presented at theFourth Semantic Interoperability for

E-Government Conference

February 9, 2006 MITRE–McLean, Virginia

* NOTE: The author’s affiliation with The MITRE Corporation is provided for identification purposes only, and is not intended to convey or imply MITRE’s concurrence with, or support for, the positions, opinions or viewpoints expressed by the author.

Page 2: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

ONTACWGOntology and Taxonomy Coordinating Working Group

A working group of the Semantic Interoperability Community of Practice (SICoP)

To assist in the development and cross-referencing of Knowledge Classification Systems (Ontologies, taxonomies, thesauri, graphical knowledge representations) by:

(1) maintaining on-line resources where such efforts can share: data; utilities to help create such resources; and pilot programs to

demonstrate how to use such knowledge classifications for practical purposes

(2) To adopt and extend, as a community, a Common Semantic Model that can serve as the “defining conceptual vocabulary” adequate to specify the meanings of the terms used within all of the participating communities, and relate the community terms to each other precisely.

Page 3: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

3

ONTACWG Projects

• Forum – e-mail discussion

• Web site with resources– http://colab.cim3.net/cgi-bin/wiki.pl?OntologyTaxonomyCoordinatingWG

• Repository Working Group

• COSMO Working Group

Page 4: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

4

Where Are We?

• Many Taxonomies and Ontologies

• Few Mappings of One to the Other

• No Agreed Standard of Meaning

Page 5: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

5

Where Do We Want To Go?

• Powerful Search

• Semantic Interoperability

• Automatic Knowledge Extraction

Page 6: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

6

How Do We get There?

• Create Agreed Standard of Meaning: a Common Semantic Model (COSMO)

• Use Existing Upper Ontology or adapt one for our own use

• Define (map) terms in Existing Taxonomies and Ontologies by use of Common Defining Concepts

Page 7: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

7

Why Is a Top-Level Ontology Needed?

• To support semantic interoperability by serving as a Common Semantic Model, functioning as a common defining vocabulary, allowing systems developed in different locations to share their definitions and reason with each other’s data

• To provide a well-tested inventory of basic concepts that can be combined to specify the meaning of domain-specific concepts in a form suitable for reasoning

Page 8: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

8

What Does it Mean to “Specify the meaning of a term”?

• “The biological mother of a person is a woman who has given birth to that person”

• {{?Mother isTheBiologicalMotherOf ?Child} impliesThat (ThereExists {((exactly one) ?Event) and ((exactly one) ?Date) and ((exactly one) ?Location)}

suchThat {{?Event isa BirthEvent} and {?Event occurredOn ?Date} and {?Event occurredAt ?Location} and {?Mother is (The Mother in ?Event)} and {?Child is (The Baby in ?Event)} and {(The BirthDate of ?Child) is ?Date} and {(The BirthPlace of ?Child) is ?Location}})}

Page 9: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

9

The Integrating Function of the Common Semantic Model

Obligation Duty

GenericObligation

SameAs

SameAs

Page 10: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

10

The Integrating Function of the Common Semantic Model –

via Domain-level Mapping

Obligation Duty

GenericObligation

SameAs

SameAs

Page 11: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

11

Taxonomy Mapping for Search

• When a category in one taxonomy can be identified with a category in another taxonomy, the documents associated with each node are relevant to the other

• When documents indexed by another taxonomy are not of interest to a local community, they can nevertheless be used to train an associative document classifier, which can find the documents in the community document collection that are relevant to that topic

Page 12: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

12

ONTACWG for Search

• ONTACWG might maintain, for each topic within a community KCS:– a set of sample documents that can be used

to classify a local document collection by associative document-matching techniques

– one or more sample queries that are known to find pages on the www relevant to the topic (possibly different for each search engine)

– a list of www pages relevant to the topic

Page 13: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

13

Taxonomy Mapping For Interoperability

• Communities build and maintain their own terminologies and KCSs, using them in any way they wish for their own community purposes

• When community members want their semantic information to interoperate with other domain knowledge, where logical inference is needed, they can use the mappings to the Common Semantic Model

Page 14: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

14

Taxonomy Mapping for Natural Language Understanding

• Language understanding requires recognition of the context in which linguistic statements are made

• Maintaining a large public set of documents or document fragments illustrating particular topics can help natural language programs to recognize known textual contexts

Page 15: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

15

The Long-Term Goal

Semantic Interoperability:The ability of computers to accurately communicate conceptual information; to correctly interpret the meanings of communicated information and make appropriate decisions

By adopting or building a common conceptual language for computers, which can be used to specify and relate the meanings of terms in any community terminology.

Page 16: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

16

What A Common Semantic Model Is

A means to allow computers to accurately communicate conceptual information – in effect, a common language for computers –

Fo use when the users want to communicate

Page 17: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

17

What A Common Semantic Model Isn’t

≠ A controlled vocabularyEach community can choose its own words to refer to concepts

≠ A mandated standardUsers can use any common ontology or none, as their own needs dictate

Page 18: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

18

Communities and Controlled Vocabularies

• Whenever a community of interest or community of practice is sufficiently homogeneous to agree on a controlled vocabulary, that vocabulary can serve as a linguistic signature of a particular context, which will be helpful in machine interpretation of text documents.

• i.e., multiple controlled vocabularies are good things. The Common Semantic Model can specify the relations between terms in community vocabularies.

Page 19: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

19

Concepts vs. WordsMathematical Theory / | \ / \ \ / | \ / \ | \ \ / | \ /

Axioms:(Every Cat has (( 4) Legs))

(Every House has ((atLeast 1) Door))

HouseCat

Siamese

Ontological Theory

Terminology

“Siamese Cat”

“Residential House”

“Haus”

“chat siamois”“Siamesische Katze”

“House”

“maison”

“Siamese feline”“Siamese”

“дом”

シャム猫

Page 20: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

20

•Categorical Ambiguity can be represented as a union of categories

– Metaphor– Poetry– Double entendre– Rhetoric

• “Jack went fishing last weekend and caught three trout and a cold.”

•Intentionally Ambiguous Word Use Not at issue in formal classification

Page 21: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

21

Who Needs a Common Semantic Model?

• Any computer system that needs to accurately communicate conceptual information needs a language in common with the receiving system

"Money is being spent on labs and hiring smart people who make products do unnatural acts together.”

Alan Shockley, manager of Enterprise Information Technology at EDS

Estimated costs of lack of data interoperability nationwide is over 100B/yr

Page 22: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

22

Will Any Upper Ontology Serve?

Lenat’s Dictum (Building Large Knowledge-Based Systems, 1990, p. 20):

• Do the top layers of the global ontology correctly

• Relate all the rest of human knowledge to those top layers

Page 23: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

23

Will Any Upper Ontology Serve?

Publicly Available Upper Ontologies:

OpenCyc SUMO DOLCE Omega (SENSUS)

OCHRE BFO WordNet (?) MSO ISO 15926

European Initiative: WonderWebNew American Initiative: NCORDTO project: IKRIS

Comparison of Upper Ontologies: http://www.mitre.org/work/tech_papers/tech_papers_04/04_0603/04_1175.pdf

Page 24: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

24

A Merged Upper Ontology –One Possible Method

• Merge the compatible elements of the Cyc, Omega, SUMO, MidLevel, DOLCE, BFO, and ISO 15926, and add Other concepts as desired by participants, and map this to Wordnet:

• => COSMO• COmmon Semantic MOdel• or Cyc, Omega, Sumo, Midlevel, Other

Page 25: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

25

COSMOThe Common Semantic Model

• We need an inventory of logically defined higher-level concepts adequate to specify the meanings of the terms and concepts in all domain Knowledge Classification Systems used by participants.

• Structured as a set of precisely interrelated ontologies without duplicated concepts and with a set of logically consistent default core concepts

Page 26: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

26

How Many Defining Concepts?

Clues:

• LDOCE uses a controlled defining vocabulary of ~ 2000 words, to define over 65,000 words

• Japanese students learn ~1850 kanji

• AMESLAN dictionary has ~5000 signs

Page 27: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

27

When Do We Need a New Primitive Defining Concept?

• If any of the content words in the natural-language definition have no corresponding concepts in the existing COSMO

• If it is necessary to use a “disjoint” relation to distinguish a new concept from others in the ontology

Page 28: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

28

Requirements

• Tools to make the COSMO easy to understand and easy to use

• Tools to view and extract only those concepts of interest for a particular application

• Pilot and Demonstration applications that illustrate the benefits of using the COSMO

Page 29: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

29

TOOLS

• KCS Building and Maintenance Tools– Protege http://protege.stanford.edu/– UML http://www.uml.org/– Concept Maps http://cmap.ihmc.us/

• Representation Formalisms– KIF/SKIF/ESKIF/CL/IKL/Conceptual Graphs– OWL– OWL extensions (SWRL, RuleML, OWL-

Flight, ?)

Page 30: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

30

CONTROLLED ENGLISH• ClearTalk (Skuce, 1996)

http://www.csi.uottawa.ca/~kavanagh/Ikarus/Cleartalk.html

• Effective NL Paraphrasing of Ontologies on the Semantic Webhttp://www.mindswap.org/papers/nlpowl.pdf

• Sowa’s “Common Logic Controlled English”http://www.jfsowa.com/clce/specs.htm

• ESKIF (developmental)

Page 31: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

Example of Problem without a COSMOClass: Wine

<owl:Class rdf:ID="Wine"> <rdfs:subClassOf rdf:resource="&food;PotableLiquid" /> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#locatedIn"/> <owl:someValuesFrom rdf:resource="&vin;Region"/> </owl:Restriction> </rdfs:subClassOf> <rdfs:label xml:lang="en">wine</rdfs:label> </owl:Class>

Page 32: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

ObjectProperty: locatedInFrom wine.rdf

http://www.w3.org/2001/sw/WebOnt/guide-src/wine.rdf

<owl:ObjectProperty rdf:ID="locatedIn"> <rdf:type rdf:resource="&owl;TransitiveProperty" /> <rdfs:domain rdf:resource="http://www.w3.org/2002/07/owl#Thing" /> <rdfs:range rdf:resource="#Region" /> </owl:ObjectProperty>

<Region rdf:ID="MedocRegion"> <locatedIn rdf:resource="#BordeauxRegion" /> </Region>

<Region rdf:ID="BordeauxRegion"> <locatedIn rdf:resource="#FrenchRegion" /> </Region>

Page 33: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

33

Medoc (Wine)• <owl:Class rdf:ID="Medoc">• <rdfs:subClassOf>• <owl:Restriction>• <owl:onProperty rdf:resource="#hasColor" />• <owl:hasValue rdf:resource="#Red" />• </owl:Restriction>• </rdfs:subClassOf>• <rdfs:subClassOf>• <owl:Restriction>• <owl:onProperty rdf:resource="#hasSugar" />• <owl:hasValue rdf:resource="#Dry" />• </owl:Restriction>• </rdfs:subClassOf>• <owl:intersectionOf rdf:parseType="Collection">• <owl:Class rdf:about="#Bordeaux" />• <owl:Restriction>• <owl:onProperty rdf:resource="#locatedIn" />• <owl:hasValue rdf:resource="#MedocRegion" />• </owl:Restriction>• </owl:intersectionOf>• </owl:Class>

Page 34: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

34

• <owl:Class rdf:ID="Medoc">• <owl:equivalentClass>• <owl:Class>• <owl:intersectionOf rdf:parseType="Collection">• <owl:Class rdf:about="#Bordeaux" />• <owl:Restriction>• <owl:onProperty rdf:resource="#locatedIn" />• <owl:hasValue rdf:resource="#MedocRegion" />• </owl:Restriction>• </owl:intersectionOf>• </owl:Class>• </owl:equivalentClass>• <rdfs:subClassOf>• <owl:Restriction>• <owl:onProperty rdf:resource="#hasColor" />• <owl:hasValue rdf:resource="#Red" />• </owl:Restriction>• </rdfs:subClassOf>• <rdfs:subClassOf>• <owl:Restriction>• <owl:onProperty rdf:resource="#hasSugar" />• <owl:hasValue rdf:resource="#Dry" />• </owl:Restriction>• </rdfs:subClassOf>• </owl:Class>

Fig. 1. OWL Class ‘Medoc’ in the Wine Ontology Serialized in RDF/XML

‘Medoc is a sweet, red color wine located in the Medoc region.’

Page 35: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

35

ESKIF Version• {{Medoc isaTypeOf Wine} and (Every

Medoc is {Dry and RedColored and (ProducedIn (the MedocRegion))})}

SKIF:• (isaSubclassOf Medoc Wine)• (necessarily Medoc hasAttribute Dry)• (necessarily Medoc hasAttribute

RedColored)• (necessarily Medoc hasAttribute

(ProducedIn MedocRegion)

Page 36: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

36

ESKIF

• Like SKIF, but statements in braces have first two arguments inverted

• {ColonelMustard killed MissScarlet}

≡ (killed ColonelMustard MissScarlet)

{{ColonelMustard killed MissScarlet},

(in (the Conservatory)) (with (A Knife))}

{(The Person named “Albert Einstein”) proposed (The Theory called “The Theory of Relativity”)}

Page 37: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

37

Basic Components of An Ontology

Hierarchy of TypesSemantic Relations (slots/associations)

InstancesFunctionsAxioms

ConstraintsProcedural Methods

Page 38: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

38

Handling Different Perspectives• It is widely recognized that different communities

are interested in different aspects of the same entities

• These can be represented in a logically consistent manner by allowing dynamic creation of classes with only some of the known attributes and relations of the physically realistic class

• This corresponds to the use of anonymous classes in an OWL restriction

• Many different contexts may need to be distinguished

Page 39: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

39

Different Interests

How big is the diamond? How much does it cost?

Page 40: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

40

Flexible View Creation

Entity

SelectiveView DetailedEntity

PricedObject

DiamondRing

isaSubViewOf

Page 41: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

41

Knowledge: Search and Deploy

BrowsingSearch

KeywordSearch

CommunityTaxonomy or

Thesaurus

ComprehensiveFormalizedKnowledge / | \ / \ \ / | \ / \ | \ \ / | \ /

TS TS

AutomatedReasoning

GraphicSearch

CommunityGraphical View

CommunityKnowledge

Needs

dictate

Retrieved Knowledge

provides

ActionenablesCommunity Goals

Community Goals

HumanFriendly

MachineFriendly

providesDocumentCollection

Page 42: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

42

Registries for KOSs

• A registry will provide information to allow the public to determine whether a KOS is suitable for their purposes – metadata about the KOS.

• A registry that can describe the relations between KOS systems (dependency, similarity) requires special types of metadata.

Page 43: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

43

Special KOS registry requirement

• In order to be reusable outside the originating community, a KOS should have information specifying whether the meanings of its terms depend on any other KOS, or are related to terms in any other KOS.

• In the event that an upper ontology is used to specify meanings in a KOS, that needs to be explicitly represented.

• If an ontology is intended to be independent and self-describing, that needs to be specified.

Page 44: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

44

For the Skeptical

Help the process: It will be useful to have a set of use cases or scenarios that would provide a practical challenge for the developers of integrating technologies such as the Common Semantic Model. What would satisfy the variable ?this:

“If you can do ?this, I will be convinced that a Common Semantic Model is valuable.”

Page 45: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

45

ONTACWG

Page 46: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

46

SWSL Logic Inventory

Page 47: 1 ONTACWG: Coordinating Knowledge Classifications Patrick Cassidy MITRE Corporation* Presented at the Fourth Semantic Interoperability for E-Government

47