safe and economic re-use of ontologies: a logic-based ...schneidt/talks/0805_dl.pdf · our approach...
TRANSCRIPT
Why re-use? Our methodology Tool support Perspectives
Safe and Economic Re-Use of Ontologies:A Logic-Based Methodology and Tool Support
Ernesto Jimenez-Ruiz1 Bernardo Cuenca Grau2
Ulrike Sattler3 Thomas Schneider3 Rafael Berlanga1
1Computer Languages and Systems, Universitat Jaume I, Spain
2Computing Laboratory, University of Oxford, UK
3Computer Science, University of Manchester, UK
DL, 13–16 May 2008
Why re-use? Our methodology Tool support Perspectives
Our approach in a nutshell
Logic-based methodology for the re-use of ontologies
Safe use of imported symbols
1 Don’t change their meaning! X
Economic import of the external ontologies
2 Import only the relevant parts . . .
3 . . . without loss of information! X
L
E
Tool support — Protege plugin
Work in progress!
Why re-use? Our methodology Tool support Perspectives
Our approach in a nutshell
Logic-based methodology for the re-use of ontologies
Safe use of imported symbols
1 Don’t change their meaning! X
Economic import of the external ontologies
2 Import only the relevant parts . . .
3 . . . without loss of information! X
L
E
1
Tool support — Protege plugin
Work in progress!
Why re-use? Our methodology Tool support Perspectives
Our approach in a nutshell
Logic-based methodology for the re-use of ontologies
Safe use of imported symbols
1 Don’t change their meaning! X
Economic import of the external ontologies
2 Import only the relevant parts . . .
3 . . . without loss of information! X
L
E
1
3
2
Tool support — Protege plugin
Work in progress!
Why re-use? Our methodology Tool support Perspectives
Our approach in a nutshell
Logic-based methodology for the re-use of ontologies
Safe use of imported symbols
1 Don’t change their meaning! X
Economic import of the external ontologies
2 Import only the relevant parts . . .
3 . . . without loss of information! X
L
E
1
3
2
Tool support — Protege plugin
Work in progress!
Why re-use? Our methodology Tool support Perspectives
And now . . .
1 Why ontology re-use?
2 A safe and economic methodology
3 Tool support
4 Perspectives
Why re-use? Our methodology Tool support Perspectives
A re-use scenario: the Health-e-Child project
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Why re-use? Our methodology Tool support Perspectives
A re-use scenario: the Health-e-Child project
NCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
Why re-use? Our methodology Tool support Perspectives
A case for safe and economic re-useNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
Reasons for re-use
Saves time for re-writing
Provides access to well-established knowledge
Doesn’t require expertise in drugs, proteins, anatomy etc.
Guarantees to provide
[safe] Importing terms doesn’t change their meaning.
[eco] Import all relevant parts of external ontologies.
[aux] The order of imports doesn’t matter.
Why re-use? Our methodology Tool support Perspectives
And now . . .NCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
1 Why ontology re-use?
2 A safe and economic methodology
3 Tool support
4 Perspectives
Why re-use? Our methodology Tool support Perspectives
A working cycle: the offline phaseNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
OFFLINE
ONLINE
Load local ontology LRepeat at user’s discretion
Choose set Si of external symbolsplus associated ontology Ei
Make sure the Si are pairwise disjoint
For each Si do
Select refinement view
or generalisation view
Why re-use? Our methodology Tool support Perspectives
A working cycle: the offline phaseNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
OFFLINE
ONLINE
Load local ontology LRepeat at user’s discretion
Choose set Si of external symbolsplus associated ontology Ei
Make sure the Si are pairwise disjoint
For each Si do
Select refinement view
or generalisation view
S1 ={JRA} E1 =NCI
Why re-use? Our methodology Tool support Perspectives
A working cycle: the offline phaseNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
OFFLINE
ONLINE
Load local ontology LRepeat at user’s discretion
Choose set Si of external symbolsplus associated ontology Ei
Make sure the Si are pairwise disjoint
For each Si do
Select refinement view
or generalisation view
S1 ={JRA} E1 =NCI
S2 ={KneeJoint, Fever}E2 =Galen
Why re-use? Our methodology Tool support Perspectives
A working cycle: the offline phaseNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
OFFLINE
ONLINE
Load local ontology LRepeat at user’s discretion
Choose set Si of external symbolsplus associated ontology Ei
Make sure the Si are pairwise disjoint
For each Si do
Select refinement view
or generalisation view
S1 ={JRA} E1 =NCI
S2 ={KneeJoint, Fever}E2 =Galen
Refine S1
Refine + reference S2
Why re-use? Our methodology Tool support Perspectives
A working cycle: the offline phaseNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
OFFLINE
ONLINE
Load local ontology LRepeat at user’s discretion
Choose set Si of external symbolsplus associated ontology Ei
Make sure the Si are pairwise disjoint
For each Si do
Select refinement view
or generalisation view
S1 ={JRA} E1 =NCI
S2 ={KneeJoint, Fever}E2 =Galen
Refine S1
Refine + reference S2
Safety
Why re-use? Our methodology Tool support Perspectives
Formalising the Safety GuaranteeNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
Safety
Importing terms doesn’t change their meaning.
Example
JRAO ∪ NCI |= JRA v GeneticDisorderiff NCI |= JRA v GeneticDisorder.
Definition (Safety)
L guarantees safety if for every i = 1, . . . , n:
For every E ′i with Sig(L) ∩ Sig(E ′
i ) ⊆ Si ,L ∪ E ′
i is a conservative extension of E ′i .
Why re-use? Our methodology Tool support Perspectives
Formalising the Safety GuaranteeNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
Safety
Importing terms doesn’t change their meaning.
Example
JRAO ∪ NCI |= JRA v GeneticDisorderiff NCI |= JRA v GeneticDisorder.
Definition (Safety)
L guarantees safety if for every i = 1, . . . , n:
For every E ′i with Sig(L) ∩ Sig(E ′
i ) ⊆ Si ,L ∪ E ′
i is a conservative extension of E ′i .
Why re-use? Our methodology Tool support Perspectives
Approximating conservativityNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
L ∪ E ′i is a deductive conservative extension of E ′
i
∀α with Sig(α) ⊆ Sig(E ′i ), L ∪ E ′
i |= α iff E ′i |= α.
⇑
L ∪ E ′i is a model-theoretic conservative extension of E ′
i
∀I for E ′i ∃J for L ∪ E ′
i : I|Sig(E′i )
= J |Sig(E′i )
.
⇑
L is semantically ⊥-local w.r.t. L ∪ E ′i
∀I for E ′i ∃J for L ∪ E ′
i : I|Sig(E′i )
= J |Sig(E′i )
such that XJ = ∅ for each X ∈ Sig(L) \ Si .
Why re-use? Our methodology Tool support Perspectives
Approximating conservativityNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
L ∪ E ′i is a deductive conservative extension of E ′
i
∀α with Sig(α) ⊆ Sig(E ′i ), L ∪ E ′
i |= α iff E ′i |= α.
⇑
L ∪ E ′i is a model-theoretic conservative extension of E ′
i
∀I for E ′i ∃J for L ∪ E ′
i : I|Sig(E′i )
= J |Sig(E′i )
.
⇑
L is semantically ⊥-local w.r.t. L ∪ E ′i
∀I for E ′i ∃J for L ∪ E ′
i : I|Sig(E′i )
= J |Sig(E′i )
such that XJ = ∅ for each X ∈ Sig(L) \ Si .
Why re-use? Our methodology Tool support Perspectives
Approximating conservativityNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
L ∪ E ′i is a deductive conservative extension of E ′
i
∀α with Sig(α) ⊆ Sig(E ′i ), L ∪ E ′
i |= α iff E ′i |= α.
⇑
L ∪ E ′i is a model-theoretic conservative extension of E ′
i
∀I for E ′i ∃J for L ∪ E ′
i : I|Sig(E′i )
= J |Sig(E′i )
.
⇑
L is semantically ⊥-local w.r.t. L ∪ E ′i
∀I for E ′i ∃J for L ∪ E ′
i : I|Sig(E′i )
= J |Sig(E′i )
such that XJ = ∅ for each X ∈ Sig(L) \ Si .
Why re-use? Our methodology Tool support Perspectives
Approximating conservativityNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
L is semantically ⊥-local w.r.t. L ∪ E ′i
∀I for E ′i ∃J for L ∪ E ′
i : I|Sig(E′i )
= J |Sig(E′i )
such that XJ = ∅ for each X ∈ Sig(L) \ Si . [
⇑
L is syntactically ⊥-local w.r.t. Si
all GCIs in L are of the form C⊥ v C or C v C> whereCI
⊥ = ∅ and CI> = ∆I follow from [
similar conditions for RIs and Trans(R) statements
Why re-use? Our methodology Tool support Perspectives
Approximating conservativityNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
L is semantically ⊥-local w.r.t. L ∪ E ′i
∀I for E ′i ∃J for L ∪ E ′
i : I|Sig(E′i )
= J |Sig(E′i )
such that XJ = ∅ for each X ∈ Sig(L) \ Si . [
⇑
L is syntactically ⊥-local w.r.t. Si
all GCIs in L are of the form C⊥ v C or C v C> whereCI
⊥ = ∅ and CI> = ∆I follow from [
similar conditions for RIs and Trans(R) statements
Why re-use? Our methodology Tool support Perspectives
Providing safetyNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
Examples
C7 v JRA 3 ⊥-local
GeneticDisorder v C7 3 >-local
JRA v GeneticDisorder 8 non-local
Theorem [Cuenca Grau, Horrocks, Kazakov, Sattler 2007]
If L is syntactically local w.r.t. each Si , then L guarantees safety.
Why re-use? Our methodology Tool support Perspectives
The online phaseNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
OFFLINE
ONLINE
Repeat at user’s discretion
Select an Si
Load external ontology Ei for Si
Customise scope of module
Extract module EMi from Ei
Import EMi into L
Why re-use? Our methodology Tool support Perspectives
The online phaseNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
OFFLINE
ONLINE
Repeat at user’s discretion
Select an Si
Load external ontology Ei for Si
Customise scope of module
Extract module EMi from Ei
Import EMi into L
KneeJoint
}1 level}3 levels
Why re-use? Our methodology Tool support Perspectives
The online phaseNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
OFFLINE
ONLINE
Repeat at user’s discretion
Select an Si
Load external ontology Ei for Si
Customise scope of module
Extract module EMi from Ei
Import EMi into L
KneeJoint
}1 level}3 levels
Module Coverage
Why re-use? Our methodology Tool support Perspectives
Formalising the Module Coverage GuaranteeNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
Module coverage
Import all relevant parts of external ontologies.
Example
JRAO ∪ NCI |= JRA v GeneticDisorderiff JRAO ∪ NCI-module |= JRA v GeneticDisorder.
Definition (Module coverage)
Let EMi ⊆ Ei with Si ⊆ Sig(EM
i ). EMi guarantees coverage of Si if:
For every L′ with Sig(L′) ∩ Sig(Ei ) ⊆ Si ,L′ ∪ Ei is a conservative extension of L′ ∪ EM
i .
Why re-use? Our methodology Tool support Perspectives
Formalising the Module Coverage GuaranteeNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
Module coverage
Import all relevant parts of external ontologies.
Example
JRAO ∪ NCI |= JRA v GeneticDisorderiff JRAO ∪ NCI-module |= JRA v GeneticDisorder.
Definition (Module coverage)
Let EMi ⊆ Ei with Si ⊆ Sig(EM
i ). EMi guarantees coverage of Si if:
For every L′ with Sig(L′) ∩ Sig(Ei ) ⊆ Si ,L′ ∪ Ei is a conservative extension of L′ ∪ EM
i .
Why re-use? Our methodology Tool support Perspectives
Providing coverageNCI
JRAO Galen
Arthropathy
ArthritisAutoimmune
DiseaseRheumatologic
Disorder
Atrophic Arthritis Polyarthritis Rheumatoid Arthritis
Juvenile Chronic Polyarthritis Juvenile Rheumatoid Arthritis
Arthritis diseases
C1
. . .C7
Joints
Drugs
affects
isTreatedBy
Coverage is again provided using locality.
Locality-based modules =syntactic approximations of conservativity-based modules
in general not minimalefficiently computable
Why re-use? Our methodology Tool support Perspectives
And now . . .
1 Why ontology re-use?
2 A safe and economic methodology
3 Tool support
4 Perspectives
Why re-use? Our methodology Tool support Perspectives
A Protege Plugin for the reuse of ontologies: Safe and Economique
Why re-use? Our methodology Tool support Perspectives
And now . . .
1 Why ontology re-use?
2 A safe and economic methodology
3 Tool support
4 Perspectives
Why re-use? Our methodology Tool support Perspectives
“Shopping for symbols”
Extend module scope customisation:
Browse external ontology and pick symbols
At each stage, view resulting module
“Check out” module
; Treemaps?
Why re-use? Our methodology Tool support Perspectives
Other plans
Optimise module extraction
Import “by reference” as opposed to “by value”
Multi-user scenario
Module extraction service at owl.cs.manchester.ac.uk
Modularity tool tutorial at ISWC 2008
Perform user study and improve interface
Why re-use? Our methodology Tool support Perspectives
Other plans
Optimise module extraction
Import “by reference” as opposed to “by value”
Multi-user scenario
Module extraction service at owl.cs.manchester.ac.uk
Modularity tool tutorial at ISWC 2008
Perform user study and improve interface
Why re-use? Our methodology Tool support Perspectives
Other plans
Optimise module extraction
Import “by reference” as opposed to “by value”
Multi-user scenario
Module extraction service at owl.cs.manchester.ac.uk
Modularity tool tutorial at ISWC 2008
Perform user study and improve interface
Why re-use? Our methodology Tool support Perspectives
Invitation
We want you. . .
. . . to work with us on incorporating our servicesinto your workflows!
. . . r favourite ontologies and real-life signatures!
Contact
Thank you!
Why re-use? Our methodology Tool support Perspectives
Invitation
We want you. . .
. . . to work with us on incorporating our servicesinto your workflows!
. . . r favourite ontologies and real-life signatures!
Contact
Thank you!
“Synthetic” experiments
Setting
Randomly generated signatures of size 1 . . . 330
Computed Lower of Upper Module (LUM)for each such signature
Results
99 % of Galen LUMs contain < 5 % of Galen’s axioms
similar findings for NCI
Statistics
“Real-life” experiments
Setting
LUMs for manually selected signatures from Galen and NCI(Health-e-Child context: JRA + Cardiomyopathies)
Results
Ext. Ont. # Sig. # axioms
Galen 11 105 (2.5%)
Galen 72 620 (14.9%)
Galen 76 736 (17.6%)
NCI 18 488 (0.1%)
NCI 124 4751 (1.2%)
NCI 144 5057 (1.3%)
Comparing experiments
Setting
SNOMED (health care; restricted language; 350,000 axioms)
Initial signatures: terms from intensive care unit
Compared UM, LUM to MEX (conservativity-based modules)
and SRS (Seidenberg/Rector segments)
Results
# axioms in %# Sig. MEX SRS (L)UM
4,000 2 2 4
16,000 7 7 10
24,000 10 10 15 ;
time 4–5 s 1 s 4–7 s
MEX SRS
LUM UM
R vS
C vD
C ≡D
More links
Protege and ProSE
protege.stanford.edu
krono.act.uji.es/people/Ernesto/safety-ontology-reuse
Health-e-Child
www.health-e-child.org
NCI and Galen
nciterms.nci.nih.gov/NCIBrowser/Dictionary.do
ftp1.nci.nih.gov/pub/cacore/EVS/NCIThesaurus
www.co-ode.org/galen