is opencyc doomed to be the new esperanto, or is oor doomed to be the new electronic data...

14
Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our content What we’d want a good host to provide Given the other, funded, open ontology repository projects going on in the world (e.g. OKKAM), does it need one more?

Upload: isaac-cantrell

Post on 27-Mar-2015

221 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both!

Doug Lenat

Cycorp

Our content

What we’d want a good host to provide

Given the other, funded, open ontology repository projects going on in the world (e.g. OKKAM), does it need one more?

Page 2: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

2

Our Content

OpenCyc (www.opencyc.org)The Cyc Ontology made 100% freely available (yes, 100% free even for commercial purposes)Available for download on SourceForgeOver 30,000 “users”

ResearchCyc (researchcyc.cyc.com)OpenCyc + millions of hand-engineered assertionsFree for R&D purposes Current users: 300 research groups (1/2 academic)

Page 3: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

3

What are people doing with it?

• USAF 45th Space Wing: Decision Support• USNavy: Threat Scenario Detection• US Forest Service: Regulatory Compliance• LarKC: Large Knowledge Collider• Medical Research Center: Clinical Trial Cohort Selection (doctors can now directly formulate complex FOPC queries via interactive clarification dialogue; DBs)• Glaxo: semi-automatic ontology alignment across multiple large domain-specific info sources

Page 4: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

4

What’s in OpenCyc

(#$isa 596215)

(#$genls 99198)

(#$disjointWith 6114)

(#$resultIsa 4277)

(#$resultGenl 1206)

(#$argIsa 35617

(#$argGenl 5398)

(#$arg1Isa 16748)

(#$arg1Genl 2354)

(#$arg2Isa 14114

(#$arg2Genl 2283)

(#$arg3Isa 3486)

(#$argFormat 5493)

(#$arg2Format 3320)

(#$functionalInArgs 1427)

(#$arity 16416)

(#$arityMin 958)

(#$comment 57305)

(#$genlPreds 7440)

(#$negationInverse 990)

(#$genlMt 26078)

(#$denotationInEnglish 409745)

(#$synonymousExternalConcept 13916)

Explicitly: 300k terms; 14k predicates; 57k classes; 2 million assertions; infin. more nonatomic terms and inferred assertions

Page 5: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

5

Systems and Processes

‘lifetime’ of system

energy source

boundary

resource conveyer

resource synthesizer

providerOfMotiveForce

doneBy

transporter

eventOccursAt

Page 6: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

6

FunctionalSystem

Specializations

AutocatalyticProcess

Ecosystem

EcologicalProcess

Organization

Organism

Culture-Practice

Metabolism

componentInSystem

agentInEcosystem

hasMembers

anatomicalParts

Page 7: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

7

Ecosystem Classes

Ecosystem

BiomeAquatic

LifeZone

DesertEcosystem

TropicalRainforestEcosystem

ChaparralEcosystem

TundraEcosystem

TaigaEcosystem

GrasslandEcosystem

genlsgenls

genls

Page 8: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

8

ChaparralEcosystem

MediterraneanClimateCycleclimateOfEcosystemType

MediterraneanScrub

terrainClimateType

GeographicalRegion

Eco-system

genls

genls

Territory Of Santa

Barbara, CA

hasClimateType

Page 9: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

9

What We’d Want a Good Host to Provide

A commitment to use – to have contributors all provide content under – some Creative Commons license, as opposed to e.g. a GNU license

Retention of the provenance/lineage of contributed ontological content

Agreement on some of the most fundamental ontological relations

Agreement on a small set of inter-ontology alignment relations

Page 10: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

10

Given the other, funded, open ontology repository projects going on in the world (e.g. OKKAM), does it need one more?

OKKAM is already a funded UE FP7 project (~$10M, 3-years) that started 2 months ago. Ontologizing individuals (including organizations such as the USArmy and IBM as individuals), providing a unique identifier and agreed-on set of properties for each individual

DBpedia extracted the content of fact boxes from Wikipedia + 35 open-source ontologies; KBpedia EU STREP ($3M) follow-on and will include true ontology-merging

Lots of other projects which other speakers in this panel will no doubt mention

Page 11: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

11

Page 12: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

12

FP7 IP - LarKC ConsortiumOrganisation Country

Universität Innsbruk Austria

AstraZenica AB, R&D Sweden

CEFRIEL S.c.r.l. Italy

Cycorp, Raziskovanje in Eksperimentalni Razvoj, d.o.o. Slovenia

Universität Stuttgart, HPCC Germany

Max Plank Gesellshaft Germany

Sirma Group, Ontotext Lab Bulgaria

Saltlux Korea

Siemens Aktiengesellshaft Germany

University of Sheffield United Kingdom

Vrije Universiteit Amsterdam Netherlands

Beijing University of Technology PRC

WHO: International Agency for Research on Cancer France

Page 13: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

13

Page 14: Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both! Doug Lenat Cycorp Our

14