Download - Medical Ontologies: An Overview
Medical Ontologies: An Overview
Barry Smithhttp://ifomis.de
http:// ifomis.de 2
Three levels of ontology1) formal (top-level) ontology dealing with
categories employed in every domain: object, event, whole, part, instance, class
2) domain ontology, applies top-level system to a particular domaincell, gene, drug, disease, therapy
3) terminology-based ontologylarge, lower-level systemDupuytren’s disease of palm, nodules with no contracture
http:// ifomis.de 3
Three levels of ontology1) formal (top-level) ontology dealing with
categories employed in every domain: object, event, whole, part, instance, class
2) domain ontology, applies top-level system to a particular domaincell, gene, drug, disease, therapy
3) terminology-based ontologylarge, lower-level systemDupuytren’s disease of palm, nodules with no contracture
http:// ifomis.de 4
Three levels of ontology1) formal (top-level) ontology dealing with
categories employed in every domain: object, event, whole, part, instance, class
2) domain ontology, applies top-level system to a particular domaincell, gene, drug, disease, therapy
3) terminology-based ontologylarge, lower-level systemDupuytren’s disease of palm, nodules with no contracture
http:// ifomis.de 5
Three levels of ontology1) formal (top-level) ontology dealing with
categories employed in every domain: object, event, whole, part, instance, class
2) domain ontology, applies top-level system to a particular domaincell, gene, drug, disease, therapy
3) terminology-based ontologylarge, lower-level systemDupuytren’s disease of palm, nodules with no contracture
http:// ifomis.de 6
IFOMISInstitute for Formal Ontology and Medical
Information ScienceLeipzig
http://ifomis.de
philosophers and medical informaticians attempting to build and test a Basic Formal Ontology for applications in biomedical and
related domains
http:// ifomis.de 7
IFOMIS
use basic principles of philosophical ontologyfor quality assurance and alignment of biomedical ontologies
http:// ifomis.de 8
Compare:
1) pure mathematics (theories of structures such as order, set, function, mapping) employed in every domain
2) applied mathematics, applications of these theories = re-using the same definitions, theorems, proofs in new application domains
3) physical chemistry, biophysics, etc. = adding detail
http:// ifomis.de 9
Three levels of ontology1) formal (top-level) ontology =
medical ontology has nothing like the technology of definitions, theorems and proofs provided by pure mathematics
2) domain ontology = UMLS Semantic Network, GALEN CORE
3) terminology-based ontology = UMLS, SNOMED-CT, GALEN, FMA
?????
http:// ifomis.de 10
Strategy
Part 1: Provide an overview of medical ontologies and of the top-level ontologies which they implicitly define
Part 2: Show how principles of classification and definition derived from top-level ontology can help in quality assurance of terminology-based ontologies and in ontology alignment
Parts 3 and 4: IFOMIS Collaboration with L&C
http:// ifomis.de 11
http:// ifomis.de 12part of the UMLS Semantic Network
http:// ifomis.de 13
UMLS Semantic Network
entity event
physical conceptual object entity
http:// ifomis.de 14
UMLS Semantic Network
entity event
physical conceptual object entity
http:// ifomis.de 15
conceptual entity
Organism AttributeFinding
Idea or ConceptOccupation or Discipline
OrganizationGroup
Group AttributeIntellectual Product
Language
http:// ifomis.de 16
Conceptual EntityIdea or Concept
Functional ConceptQualitative ConceptQuantitative ConceptSpatial Concept
Body Location or RegionBody Space or JunctionGeographic AreaMolecular Sequence
Amino Acid SequenceCarbohydrate SequenceNucleotide Sequence
http:// ifomis.de 17
Fairfax Countyis an Idea or Concept
http:// ifomis.de 18
Why is Fairfax County a Conceptual Entity for UMLS-SN?UMLS-SN Spatial Concepts share the following characteristics: a) they are extended in space b) their boundaries are determined not by any underlying physical discontinuities but rather by human fiat. The referent of ‘Fairfax County’ satisfies these conditions, but so also does hand, which is not classified by UMLS as a conceptual entity.
http:// ifomis.de 19
http:// ifomis.de 20
http:// ifomis.de 21
gene part_of cell component
body system conceptual_part_of fully formed anatomical structure
http:// ifomis.de 22
conceptual entity
idea or concept
functional concept
body system
http:// ifomis.de 23
But:
Gene or Genome is defined as: “A specific sequence … of nucleotides along a molecule of DNA or RNA …”
and nucleotide sequence is_a conceptual entity
http:// ifomis.de 24
entity
physical conceptual object entity
idea or concept
functional concept
body system
confusion of entity and concept
http:// ifomis.de 25
Functional Concept:
Body system is_a Functional Concept.
but:
Concepts do not perform functions or have physical parts.
http:// ifomis.de 26
This:
is not a concept
http:// ifomis.de 27
Problem: Confusion of Is_A and Has_Role
Physical Entity
Chemical Entity
Chemical ChemicalViewed Viewed
Structurally Functionally
http:// ifomis.de 28
Chemical Viewed Structurally vs. Chemical Viewed Functionally
reflects a distinction between types of classification – not between types of entity
compare a classificationof people into:tall people, people who play tennis, people who look like flies from a distanceetc.
http:// ifomis.de 29
Confusion of Is_A and Has_Role
Physical Object
Substance
Food Chemical Body Substance
http:// ifomis.de 30
Roles
A box used for storage is not (ipso facto) a special kind of box
An animal belonging to the emperor is not a special kind of animal
http:// ifomis.de 31
The Hydraulic Equation
BP = CO*PVR
arterial blood pressure is directly proportional to the product of blood flow (cardiac output, CO) and peripheral vascular resistance (PVR)
http:// ifomis.de 32
Confusion of Ontology and Epistemology
blood pressure is an Organism Function,cardiac output is a Laboratory or Test Result
or Diagnostic Procedure
BP = CO*PVR thus asserts that blood pressure is proportional either to a laboratory or test result or to a diagnostic procedure
http:// ifomis.de 33
Disease History
is classified by UMLS under Health Care Activity
This runs togetherthe history or course of a disease on the side of the patient (ontology)
with the act of eliciting that history (epistemology).
http:// ifomis.de 34
Object vs. Process =Continuant vs. Occurrent
Continuant entities = endure through time organisms, cells, molecules exist in full in every instant at which you exist at all
Occurrent entities (processes, events, activities, changes, histories) unfold themselves in time; never exist in full in any single instant
http:// ifomis.de 35
Dependent vs. Independent Entities
Dependent entities require support from other entities in order to exist:there is no mass or shape without some body Independent entities are themselves the substrates for qualities, dispositions, motions, functions and other dependent entities
http:// ifomis.de 36
entities
independent dependent occurrents continuants continuants (always
dependent)
ORGANISMS ROLES PROCESSES CELLS FUNCTIONS HISTORIES
MOLECULES CONDITIONS LIVES (diseases) (courses of
diseases)
http:// ifomis.de 37
entities
independent dependent occurrents continuants continuants (always
dependent)
ORGANISMS ROLES PROCESSES CELLS FUNCTIONS HISTORIES
MOLECULES CONDITIONS LIVES (diseases) (courses of
diseases)
classes
instances
http:// ifomis.de 38
A three-category ontology along these lines accepted by
DOLCE = first module of Semantic Web Wonderweb Foundational Ontologies Library
BFO = IFOMIS Basic Formal OntologyUMLS-SN, GO
http:// ifomis.de 39
GALEN
independent dependent occurrents continuants continuants (always
dependent)
GENERALISED MODIFIER GENERALISEDSTRUCTURES + CONCEPT PROCESSGENERALISED (features, SUBSTANCES states, roles)
http:// ifomis.de 40
GALEN
independent dependent occurrents continuants continuants (always
dependent)
GENERALISED MODIFIER GENERALISEDSTRUCTURES + CONCEPT PROCESSGENERALISED (features, SUBSTANCES states,cell + sputum roles)
http:// ifomis.de 41
GALEN
independent dependent occurrents continuants continuants (always
dependent)
GENERALISED MODIFIER GENERALISEDSTRUCTURES + CONCEPT PROCESSGENERALISED (features, SUBSTANCES states, roles)
http:// ifomis.de 42
GALEN
independent dependent occurrents continuants continuants (always
dependent)
GENERALISED ASPECT GENERALISEDSTRUCTURES + PROCESSGENERALISED (features, SUBSTANCES states, roles)
http:// ifomis.de 43
http:// ifomis.de 44
http:// ifomis.de 45
http:// ifomis.de 46
http:// ifomis.de 47
http:// ifomis.de 48
immune system is_a logical structure
http:// ifomis.de 49
GALEN CORE (1996)
Phenomenon = those categories which can be observed
TopThing
DomainCategory DomainAttribute
ModifierConcept ValueType Phenomenon
Feature Collection Selector
GeneralisedProcess
GeneralisedStructure
GeneralisedSubstance State
Modality
Role
Unit
GeneralLevelOfSpecification
Status
Aspect
http:// ifomis.de 50
GALEN CORE (1996)
Phenomenon = categories whose instances can be observed
TopThing
DomainCategory DomainAttribute
ModifierConcept ValueType Phenomenon
Feature Collection Selector
GeneralisedProcess
GeneralisedStructure
GeneralisedSubstance State
Modality
Role
Unit
GeneralLevelOfSpecification
Status
Aspect
http:// ifomis.de 51
SNOMED-CT ConceptSubstanceBody StructureSpecimenContext-Dependent CategoriesAttributeFindingStaging and ScalesOrganismPhysical ObjectEventsEnvironments and Geographic LocationsQualifier ValueSpecial ConceptPharmaceutical / Biological ProductSocial ContextDiseaseProcedurePhysical Force
http:// ifomis.de 52
SNOMED-CT ConceptSubstanceBody StructureSpecimenContext-Dependent CategoriesAttributeFindingStaging and ScalesOrganismPhysical ObjectEventsEnvironments and Geographic LocationsQualifier ValueSpecial ConceptPharmaceutical / Biological ProductSocial ContextDiseaseProcedurePhysical Force
http:// ifomis.de 53
SNOMED-CT ConceptSubstanceBody StructureSpecimenContext-Dependent CategoriesAttributeFindingStaging and ScalesOrganismPhysical ObjectEventsEnvironments and Geographic LocationsQualifier ValueSpecial ConceptPharmaceutical / Biological ProductSocial ContextDiseaseProcedurePhysical Force
http:// ifomis.de 54
An unintuitive top-level
with unintuitive rules for classificationleads to coding errorsdifficulties in training of curatorsobstacles to alignment with other ontology
and terminology systemsobstacles to harvesting content in automatic
reasoning systems
http:// ifomis.de 55
Principles for Building Medical Ontologies
Barry Smithhttp://ifomis.de
http:// ifomis.de 57
Examples
Don’t confuse entities with concepts Don’t confuse domain entities with logical
structuresDon’t confuse ontology with epistemologyDon’t confuse is_a with has_role
http:// ifomis.de 58
Further Principles
univocity: terms should have the same meanings (and thus point to the same referents) on every occasion of use
UMLS-SN: ‘organization’ = body plan‘organization’ = social organization
http:// ifomis.de 59
univocity
Gene Ontology:‘part_of’ = ‘can be part of’ (flagellum part_of
cell)‘part_of’ = ‘is sometimes part of’ (replication
fork part_of the nucleoplasm)‘part_of’ = ‘is included as a sublist in’
http:// ifomis.de 60
don’t forget instances
part_of as a relation between classesvs. part as a relation between instances
A part_of B1. every instance of A is part of some
instance of B 2. every instance of B has some instance of
A as part
http:// ifomis.de 61
Part_of as a relation between classes is more problematic than is
standardly supposed
testis part_of human being ?
heart part_of human being ?
http:// ifomis.de 62
UMLS-SN Semantic Relations
preventsDefinition: Stops, hinders or eliminates an action or condition.Inverse: prevented_by
contraception prevents pregnancypregnancy prevented by contraception
http:// ifomis.de 63
UMLS-SN Semantic Relation
producesDefinition: Brings forth, generates or creates. Inverse: produced_by
artificial insemination produces pregnancypregnancy produced by artificial insemination
http:// ifomis.de 64
positivity
complements of classes are not themselves classes. (Terms such as ‘non-mammal’ or ‘non-membrane’ do not designate natural kinds.)
http:// ifomis.de 65
objectivity
which classes exist is not a function of our biological knowledge. (Terms such as ‘unknown’ or ‘unclassified’ or ‘unlocalized’ do not designate biological natural kinds.)
http:// ifomis.de 66
rules governing levels
the terms in a classificatory hierarchy should be divided into predetermined levels (analogous to the levels of kingdom, phylum, class, order, etc., in traditional biology).
the terms in a partonomic hierarchy should be divided into predetermined granularity levels (organism, organ, cell, molecule, etc.)
http:// ifomis.de 67
JEPD (jointly exhaustive and pairwise disjoint)
single inheritance: no class in a classifi-catory hierarchy should have more than one parent on the immediate higher level
exhaustiveness: the classes on any given level should exhaust the domain of the classificatory hierarchy (difficult to satisfy in biomedicine but is accepted as a goal by every scientist)
http:// ifomis.de 68
Shortfalls from single inheritance
are often clues as to bad coding, since they mark deviations from is_a relations which also block ontology alignment
http:// ifomis.de 69
Is_a Overloading
The success of ontology alignment depends crucially on the degree to which basic ontological relations such as is a and part of can be relied on as having the same meanings in the different ontologies to be aligned.
http:// ifomis.de 70
Use of multiple inheritance
involves the assignment to the is_a relation of a plurality of different meanings within a single ontology.
The resultant mélange makes coherent integration across ontologies achievable (at best) only under the guidance of human beings with relevant biological knowledge
http:// ifomis.de 71
rules for definitions
intelligibility: the terms used in a definition should be simpler (more intelligible) than the term to be defined
otherwise the definition provides no assistance to the understanding (for humans)or is unprocessable (for machines)
http:// ifomis.de 72
substitutabilityin all so-called extensional contexts a defined term should be substitutable by its definition in such a way that the result is both grammatically correct and has the same truth-value as the sentence with which we begin
GO:0015070: toxin activity Definition: Acts as to cause injury to other living
organisms.
http:// ifomis.de 73
substitutability
There is toxin activity here
There is acts as to cause injury to other living organisms here
http:// ifomis.de 74
modularity: you can’t define everything
isolate primitive terms (= level 0)define terms on level n + 1, for each n 0 using only:
• terms taken from levels n and belowplus• logical and ontological constants such as
‘and’, ‘all’, ‘is_a’ and ‘part_of’
http:// ifomis.de 75
univocity and modularity
if these rules are not satisfied then error checking and ontology alignment can be achieved, at best, only with human intervention
http:// ifomis.de 76
The Foundational Model of Anatomy
follows formal rules for definitions laid down by Aristotle. A definition is the specification of the essence (nature, invariant structure) shared by all the members of a class or natural kind.
http:// ifomis.de 77
The Foundational Model of Anatomy
Topmost node are the undefinable primitives. The definition of a class lower down in the hierarchy is provided by specifying the parent of the class together with the relevant differentia, which tells us what marks out instances of the defined class within the wider parent class, as in: human = rational animal.
http:// ifomis.de 78
FMA Examples
Cell is an anatomical structure that consists of cytoplasm surrounded by a plasma membrane with or without a cell nucleus
Plasma membrane is a cell part that surrounds the cytoplasm,
http:// ifomis.de 79
The FMA regimentationbrings the advantage that each definition reflects the position in the hierarchy to which a defined term belongs. The position of a term within the hierarchy enriches its own definition by incorporating automatically the definitions of all the terms above it.The entire information content of the FMA’s term hierarchy can be translated very cleanly into a computer representation
http:// ifomis.de 80
These Rules are Rules of Thumb
The world of biomedical research is a world of difficult trade-offsthe benefits of formal (logical and ontological) rigor need to be balanced
1. against the constraints of computer tractability, 2. against the needs of biomedical practitioners.
But automatic alignment biomedical information resources will be achieved only to the degree that such resources conform to the standard principles of classification and definition
http:// ifomis.de 81
Axioms Every class has at least one instance
Distinct classes on the same level never share instances
Distinct leaf classes within a classification never share instances
http:// ifomis.de 82
Axioms
Every genus has an instantiated species
Each species has a smaller class of instances than its genus
http:// ifomis.de 83
Axioms
Every genus has at least two children
http:// ifomis.de 84
Theorems
Every instance is also an instance of some leaf class
Classes which share a child in common are either identical or one is subordinated to the other
http:// ifomis.de 85
Mathematical Structure
Each class hierarchy constitutes a supremum-semilattice with respect to is_a
http:// ifomis.de 86
Classes vs. SetsBoth classes and sets are marked by
granularity – but sets are timelessA class endures through time and
survives the turnover in its instances
A set is determined by its members A class is not determined by its instances
(as a state is not determined by its citizens and as an organism is not determined by its molecules)
http:// ifomis.de 87
Classes vs. SumsClasses are marked by granularity: they divide up the corresponding domain into whole units or membersThe class of human beings is instantiated only by human beings as single, whole units.
The sum of human beings includes also all cells and molecules existing inside human beings as parts
http:// ifomis.de 88
Classes sets
A set is an abstract structure, existing outside time and space. The set of human beings existing at t is (timelessly) a different entity from the set of human beings existing at t because of births and deaths.
http:// ifomis.de 89
Classes vs. Sets
A set with n members has in every case exactly 2n subsetsThe subclasses of a class are limited in number(which classes are subsumed by a larger class is a matter for empirical science to determine)
http:// ifomis.de 90
Conclusion
The analogue of pure mathematics for biomedical informatics
(the theory of biomedical classification) must look very different from standard mathematical set theory (and from its progeny, including Description Logic)
The formal theory of biological classification is still in its infancy
http:// ifomis.de 91
Biological classes are marked always by an opposition between standard or prototypical instances and a surrounding penumbra of non-standard instances (not all instances of the class human being are marked by the presence of amputation stumps or pituitary tumors). To do justice to these matters FMA introduces the factor of idealization, which means (in first approximation) that the classes of the FMA’s Anatomy Taxonomy AT include only those instances to which canonical anatomy applies.
This means that we need to revise definitions D1–D4 by restricting the range of variables x, y, ... to the realm of individuals which satisfy the generalizations of canonical anatomy, so that the same abstraction of anatomy (structure) will be represented in all the instances of any given AT-class. This device of specifying different ranges of variables gives us the means also to represent the generalizations belonging to the different branches of canonical anatomy, for example to canonical anatomy for male vs. female human beings, for human beings at various developmental stages, and for organisms in other species. It can allow us also to represent the generalizations governing the anatomical variants yielded by the presence of, for example, coronary arteries or bronchopulmonary segments, which deviate from canonical anatomical patterns of organization.
http:// ifomis.de 92
Classes vs. Sets: Granularity and TimeSets in the mathematical sense, too, are marked by the factor of granularity, which means that each
set comprehends its members as single, whole units. A class or set is laid across reality like a grid consisting (1) of a number of slots or pigeonholes each (2) occupied by some member. (This informal talk of grids and slots is formalized in [[14]] in terms of the theory of granular partitions.) Classes are distinguished from sets, however, by the fact that a set is determined by its members. This means that it is (1) associated with a specific number of slots, each of which (2) must be occupied by some specific member. A set is thus specified in a double sense. A class, in contrast, survives the turnover in its instances, and so it is specified in neither of these senses, since both (1) the number of associated slots and (2) the individuals occupying these slots may vary with time.
Sets are distinguished from classes also in this: a set with n members has in every case exactly 2n subsets, constituted by all the combinations of these members. The subclasses of a class, on the other hand, are limited in number, and which classes are subsumed by a larger class is a matter for empirical science to determine. Leaves (lowest nodes) in the taxonomy are (changing) collections of instances. As we move up the taxonomy we encounter in succession collections of such collections of instances, collections of collections of such collections, etc., organized in a nested hierarchy reaching up to the maximal class or ‘root’. We can visualize the classes at different levels as being analogous to geopolitical entities (towns, counties, states) as represented on a map. Instances correspond in this analogy to the corresponding populations: a class is not determined by its instances as a state is not determined by its citizens.
http:// ifomis.de 93
Classes are distinguished from sets also by their relation to time. A set is an abstract structure, existing outside time and space, and this is so even when its members are parts of concrete reality. Since each set is determined by its members, the set of human beings existing at t is (timelessly) a different entity from the set of human beings existing at t because of births and deaths.
Matters are different with regard to classes. The class human being can survive the change in the stock of its instances which occurs when John and Jane die, because classes exist in time. John and Jane themselves can similarly survive changes in the stock of cells or molecules by which they are constituted.
To do justice to the fact that classes in the biological domain endure even when their extensions change, a full definition of the is_a relation must involve a temporally indexed reading of inst (with variables t, t, etc., ranging over times):
D1*A is_a B =def t x ( inst(x, A, t) inst(x, B, t) ),so that A is_a B means: at all times t, if x is an instance of A at t then x is an instance of B
at t. D1* will also take care of false positives such as adult is_a child, which an untensed reading of D1 would otherwise allow. In general, all statements of inst and part relations involving objects in biomedical ontologies, like all the data of instantiated anatomy, are indexed by times.
http:// ifomis.de 94
Taxonomy and PartonomyA taxonomy such as AT is formally speaking a tree in the mathematical sense.
It satisfies axioms to the effect that (1) it has a root or unique maximal genus (here: anatomical entity) and (2) all other classes are connected to this root via finite chains of is_a relations satisfying a principle of single inheritance. A partonomy, in contrast, is a partial order in the mathematical sense, with top (here: organism – the class instantiated by mereologically maximal entities), to which all other classes are connected via chains of part_of relations.
We can then define the concepts of root and leaf of a taxonomy and top and bottom of a partonomy as follows.
D5 root(A) =def B (B is_a A) D6 leaf(A) =def B (B is_a A A = B)D7 top(A) =def
B (A = B or B part_of A) & not-B (A part_of B)D8 bottom(A) =def not-B (B part_of A).
http:// ifomis.de 95
We can then postulate axioms to the effect that every class includes some leaf as subclass, and that every instance of every class instantiates some leaf:
AB ( leaf(B) & B is_a A ) Ax ( inst(x, A) B (leaf(B) & inst(x, B) ) )
The taxonomical union AÈB of classes A and B is defined as the minimal class satisfying the condition that it contains both A and B as subclasses. Such a class always exists, since A and B are in any case subclasses of the root. The taxonomic union of femur and liver, for example, is organ. The partonomic union of two classes A+B is the class, if it exists, whose instances are sums x+y of instances of classes A and B respectively. While every pair of classes has a taxonomic union, only some classes have a partonomic union, since entities of the form x+y are instances of classes only in some highly restricted cases, for example: left lung = upper-lobe-of-left-lung + lower-lobe-of-left-lung. Such examples characteristically involve the phenomenon of fiat boundaries. [[15],[16]]
http:// ifomis.de 96
As concerns taxonomic intersection, a class is never immediately subordinated to more than one higher class within a tree. This means that if two classes overlap in sharing some common sub-class, then this is because one is a subclass of the other. AB, the taxonomic intersection of A and B, if it exists, is then simply the smaller of these two classes. We can add further an axiom to the effect that, if two classes are such as to overlap in sharing some common instances, then this, too, is because one is a subclass of the other:
x (inst(x, A) Ù inst(x, B)) A is_a B or B is_a A.Classes can overlap partonomically, on the other hand, in such a way that there is a
class which stands in the part_of relation to both, though neither stands in this relation to the other:
D9 A1 partonomic_overlap A2 = def A (A part_of A1 & A part_of A2).
For example: pelvis and vertebral column overlap in the sacrum and coccyx. Most classes in the biomedical domain do not overlap partonomically in this sense, yet it is this difference in behavior between taxonomic and partonomic overlap which captures the essential difference between the tree structure of taxonomies and the partial order structure of partonomies.
http:// ifomis.de 97
ConclusionPractitioners in the biomedical sciences move easily between the realm of classes and the realm of
instances existing in time and space. For historical reasons, however, work on biomedical ontologies and terminologies – which grew out of work on medical dictionaries and nomenclatures – has focused almost exclusively on classes (or ‘concepts’) atemporally conceived. This class-orientation is common in knowledge representation, and its predominance has led to the entrenchment of an assumption according to which all that need be said about classes can be said without appeal to formal features of instantiation of the sorts described above. This, however, has fostered an impoverished regime ofof definitions in which the use of identical terms in different systems has been allowed to mask underlying incompatibilities. Matters have not been helped by the fact that description logic, the prevalent framework for terminology-based reasoning systems, has with some recent exceptions (e.g. [[i]]) been oriented primarily around reasoning with classes.
Certainly if we are to produce information systems with the requisite computational properties, then this entails recourse to a logical framework like that of description logic. At the same time, however, we must ensure that the data that serves as input to such systems is organized formally in a way that sustains rather than hinders successful alignment with other systems. The way forward is to recognize, as does the FMA, that these are two distinct tasks, both of which are equally important to the construction of biomedical ontologies and terminologies.
http:// ifomis.de 98
The problem of ontology alignmentGOSCOPSWISS-PROTSNOMEDMeSHFMA
…all remain at the level of TERMINOLOGY (two reasons:
legacy of dictionaries + DL)What we need is a REFERENCE ONTOLOGY = a
formal theory of the foundational relations which hold TERMINOLOGY ONTOLOGIES and APPLICATION ONTOLOGIES together
http:// ifomis.de 99
Analogous distinctions required for nearly all foundational relations of ontologies and semantic
networks:A causes BA is associated with BA is located in Betc.
Reference to instances is necessary in defining mereotopological relations such as spatial occupation and spatial adjacency
http:// ifomis.de 100
Instances are elite individuals
Which classes (and thus which instances) exist in a given domain is a matter for empirical research.
Cf. Lewis/Armstrong “sparse theory of universals”
http:// ifomis.de 101
D extension(A) = {x | inst(x, A)}
D9 differentia(A) =def BC nearestspecies(B, C) & A B & A C & extension(C) = extension(B) extension(C)
http:// ifomis.de 102
The genus together with the differentia of a species constitutes the essence of the species.
differentia (A) not-class(A)
http:// ifomis.de 103
Axioms (Berg)
A1 lowestspecies(A) x inst(x, A) A2 lowestspecies(A) & lowestspecies(B) & A
B (not-x inst(x, A) & inst(x, B))A3 nearestspecies(A, B) & nearestspecies (A, C)
B = CA4 genus(A) & inst(x, A)
B nearestspecies(B, A) & inst(x, B) A5 nearestspecies(A, B) the extension of A
is a subset of the extension of B
http:// ifomis.de 104
Axioms (Berg)
genus(A) & inst(x, A) B nearestspecies(B, A) & inst(x, B)
EVERY GENUS HAS AN INSTANTIATED SPECIES
nearestspecies(A, B) the extension of A is a subset of the extension of B
EACH SPECIES HAS A SMALLER CLASS OF INSTANCES THAN ITS GENUS
http:// ifomis.de 105
Axioms (Berg)nearestspecies(B, A)
C (nearestspecies(C, A) & B CEVERY GENUS HAS AT LEAST TWO CHILDREN
nearestspecies(B, A) & nearestspecies(C, A) & B C) not-x (inst(x, B) & inst(x, C))
SPECIES OF A COMMON A8 There is no infinite sequence <A1, A2, …>
such that nearestspecies(Ai, Ai+1) for all i 1A9 There is no infinite sequence <A1, A2, …>
such that nearestspecies(Ai+1, Ai) for all i 1
http:// ifomis.de 106
Theorems (Berg)
T1 nearestspecies(A, B) the extension of A is a proper subset of the extension of B
T2 A x inst(x, A)T3 nearestspecies(A, B) not-C
(nearestspecies(A, C) & nearestspecies(C, B))T4 lowestspecies(A1) & lowestspecies(A2) &
nearestspecies(A1, B) not-C(nearestspecies (B, C) &
nearestspecies (C, A2)
http:// ifomis.de 107
Theorems (Berg)T5 (genus(A) & inst(x, A)) B
(lowestspecies(B) & B is_a A & inst(x, B))T6 (genus(A) & lowestspecies(B) & x (inst(x,
A) & inst(x, B)) B is_a A T7 A is_a B & A is_a C
(B = C or B is_a C or C is_a BT8 (genus(A) & genus(B) & x(inst(x, A) &
inst(x, B))) C(C is_a A & C is_a B) T9 class(A) & class(B) (A = B or A is_a B
or B is_a A or not-x(inst(x, A) & inst(x, B)))
http:// ifomis.de 108
WordNetNOT: wheel PART OF car
WordNet represents part-of quite sparingly It normally gives trivial holonymic relations which are just true by definition).
wheel PART OF wheeled vehicle steering wheel PART OF steering system
http:// ifomis.de 109
WordNetWith has_part relations it is more generous:
car, auto, automobile, machine, motorcar -- HAS PART: air bag HAS PART: glove compartment etc.
http:// ifomis.de 110
Circular definitions
and associated problems in general endemic in biomedical terminology systemsConfusion of use and mention
Confusion of concepts and objectsConfusion of concepts and classes Confusion of terms and objectsConfusion knowledge with what is knownConfusion of object-level with machine-levelSimple stupidity… all of which lead to poor coding
http:// ifomis.de 111
UMLS-SN
http:// ifomis.de 112
UMLS-SN Semantic Relations
Semantic Relation: affectsTUI: T151Definition: Produces a direct effect on. Implied here is the altering or influencing of an existing condition, state, situation, or entity. This includes has a role in, alters, influences, predisposes, catalyzes, stimulates, regulates, depresses, impedes, enhances, contributes to, leads to, and modifies.Inverse: affected_by
http:// ifomis.de 113
UMLS-SN Semantic Relations
Semantic Relation: carries_outTUI: T141Definition: Executes a function or performs a procedure or activity. This includes transacts, operates on, handles, and executes.Inverse: carried_out_by
http:// ifomis.de 114
UMLS-SN Semantic Relations
Semantic Relation: causesTUI: T147Definition: Brings about a condition or an effect. Implied here is that an agent, such as for example, a pharmacologic substance or an organism, has brought about the effect. This includes induces, effects, evokes, and etiology.Inverse: caused_by
http:// ifomis.de 115
UMLS-SN Semantic Relations
Semantic Relation: consists_ofTUI: T172Definition: Is structurally made up of in whole or in part of some material or matter. This includes composed of, made of, and formed of.Inverse: constitutes
http:// ifomis.de 116
UMLS-SN Semantic Relations
Semantic Relation: containsTUI: T134Definition: Holds or is the receptacle for fluids or other substances. This includes is filled with, holds, and is occupied by.Inverse: contained_in
http:// ifomis.de 117
UMLS-SN Semantic Relations
Semantic Relation: derivative_ofTUI: T178Definition: In chemistry, a substance structurally related to another or that can be made from the other substance. This is used only for structural relationships. This does not include functional relationships such as metabolite of, by product of, nor analog of.Inverse: has_derivative
http:// ifomis.de 118
UMLS-SN Semantic Relations
Semantic Relation: developmental_form_ofTUI: T179Definition: An earlier stage in the individual maturation of.Inverse: has_developmental_form
http:// ifomis.de 119
UMLS-SN Semantic Relations
Semantic Relation: evaluation_ofTUI: T161Definition: Judgment of the value or degree of some attribute or process.Inverse: has_evaluation
http:// ifomis.de 120
UMLS-SN Semantic Relations
Semantic Relation: exhibitsTUI: T145Definition: Shows or demonstrates.Inverse: exhibited_by
http:// ifomis.de 121
UMLS-SN Semantic Relations
Semantic Relation: functionally_related_toTUI: T139Definition: Related by the carrying out of some function or activity.Inverse: functionally_related_to
http:// ifomis.de 122
UMLS-SN Semantic Relations
Semantic Relation: indicatesTUI: T156Definition: Gives evidence for the presence at some time of an entity or process.Inverse: indicated_by
http:// ifomis.de 123
UMLS-SN Semantic Relations
Semantic Relation: ingredient_ofTUI: T202Definition: Is a component of, as in a constituent of a preparation.Inverse: has_ingredient
http:// ifomis.de 124
UMLS-SN Semantic Relations
Semantic Relation: issue_inTUI: T165Definition: Is an issue in or a point of discussion, study, debate, or dispute.Inverse: has_issue
http:// ifomis.de 125
UMLS-SN Semantic Relations
Semantic Relation: manifestation_ofTUI: T150Definition: That part of a phenomenon which is directly observable or concretely or visibly expressed, or which gives evidence to the underlying process. This includes expression of, display of, and exhibition of.Inverse: has_manifestation
http:// ifomis.de 126
UMLS-SN Semantic Relations
Semantic Relation: property_ofTUI: T159Definition: Characteristic of, or quality of.Inverse: has_property
http:// ifomis.de 127
UMLS-SN Semantic Relations
Semantic Relation: result_ofTUI: T157Definition: The condition, product, or state occurring as a consequence, effect, or conclusion of an activity or process. This includes product of, effect of, sequel of, outcome of, culmination of, and completion of.Inverse: has_result
http:// ifomis.de 128
UMLS-SN Semantic Relations
Semantic Relation: surroundsTUI: T176Definition: Establishes the boundaries for, or defines the limits of another physical structure. This includes limits, bounds, confines, encloses, and circumscribes.Inverse: surrounded_by
http:// ifomis.de 129
UMLS-SN Semantic Relations
Semantic Relation: traversesTUI: T177Definition: Crosses or extends across another physical structure or area. This includes crosses over and crosses through.Inverse: traversed_by
http:// ifomis.de 130
UMLS-SN Semantic Relations
Semantic Relation: performsTUI: T188Definition: Executes, accomplishes, or achieves an activity.Inverse: performed_by
http:// ifomis.de 131
UMLS-SN Semantic Relations
Semantic Relation: physically_related_toTUI: T132Definition: Related by virtue of some physical attribute or characteristic.Inverse: physically_related_to
http:// ifomis.de 132
UMLS-SN Semantic Relations
Semantic Relation: conceptually_related_toDefinition: Related by some abstract concept, thought, or idea.Inverse: conceptually_related_to
http:// ifomis.de 133
Prototypicality
Biological classes are marked always by an opposition between standard or prototypical instances and a surrounding penumbra of non-standard instances How solve this problem: restrict range of instance variables x, y, to standard instances?Recognize degrees of instancehood? (Impose topology/theory of vagueness on classes?)
http:// ifomis.de 134
Example: joint anatomyjoint HAS-HOLE joint spacejoint capsule IS-OUTER-LAYER-OF jointmeniscus
– IS-INCOMPLETE-FILLER-OF joint space– IS-TOPO-INSIDE joint capsule– IS-NON-TANGENTIAL-MATERIAL-PART-OF
jointjoint
– IS-CONNECTOR-OF bone X– IS-CONNECTOR-OF bone Y
synovia– IS-INCOMPLETE-FILLER-OF joint space
synovial membrane IS-BONAFIDE-BOUNDARY-OF joint space
http:// ifomis.de 135
SNOMED RT (2000)
already has description logic definitionsbut it also has some bad coding, which
derives from failure to pay attention to ontological principles:
e.g.both testes is_a testis
http:// ifomis.de 136
entity
physical conceptual object entity
organism anatomical structure
fully formed anatomical structure
body part, organ or organ component
http:// ifomis.de 137
Body System
Circulatory SystemNervous SystemImmune SystemMusculo-Skeletal Systemetc.
http:// ifomis.de 138
UMLS Semantic Network
entity event
physical conceptual object entity
http:// ifomis.de 139
P1. entities in different highest-level categories (independent continuant, dependent continuant, occurrent) should not be combined within a single class;
P2. objects should not be combined within a single class with the roles they play or with the functions they exercise;
P3. entities in reality should not be combined within a single class with our knowledge about or with our concepts of such entities;
P4. what is concrete (what exists in space and time and enters into causal relations) should not be combined within a single class with what is abstract (for example with abstract spatial regions, measures, and the like);
P5. classifications should respect the factor of time; for example classes should be assigned in a way that is consistent with the fact that continuant entities endure through time.
http:// ifomis.de 140
Anatomical Structure – (Embryonic Structure; Fully Formed Anatomical Structure; Anatomical Abnormality): Anatomical Structure is defined as: “A normal or pathological part of the anatomy or structural organization of an organism.” Note that in the phrase ‘structural organization’, the term ‘organization’ is not used in conformity with SN’s own definition (see below) as meaning ‘social organization’. Rather it is used to mean an entity’s Bauplan. The latter, however, would be, not a concrete three-dimensionally extended independent thing but rather some dependent abstract feature which gives shape and functionality to an entity of this sort. Then, however, it should not subsume liver or leukocyte.
http:// ifomis.de 141
The UMLS Semantic Network
is ‘an upper-level ontology … in which all concepts are given a consistent and semantically coherent representation’.
Alexa McCray, “An upper level ontology for the biomedical domain”. Comp Functional Genomics 2003; 4: 80-84.
http:// ifomis.de 142
Concepts
CEN/TC251 ENV 12264 : – This ENV is applicable to the description of the categorial structure of
systems of concepts supporting computer-based terminological systems, including coding systems, for health-care.
– concept : “unit of thought constituted through abstraction on the basis of properties common to a set of one or more referents”
BUT THEY NEVER IN FACT LOOK AT THE REFERENTS AT ALL!
ISO/TC215/N142: Health informatics —Vocabulary of terminology – The purpose of this International Standard is to define a set of basic
concepts required to describe and discuss formal representation of concepts and characteristics, for use especially in formal computer based concept representation systems.
– concept: “unit of knowledge created by a unique combination of characteristics”
THEY ARE ALREADY TWO LEVELS REMOVED FROM THE REFERENT!
http:// ifomis.de 143
Chemical
Chemical ChemicalViewed Viewed
Structurally Functionally
Inorganic Organic Enzyme Biomedical or Chemical Chemical Dental Material
http:// ifomis.de 144
Genbank
gene =df
DNA region of biological interest with a name and that carries a genetic trait or phenotype
http:// ifomis.de 145
Why is this a problem?
All biomedical ontologies and terminology systems must make themselves conform to the UMLS Semantic Network
Foundational Model of Anatomy divides body parts into physical entities and conceptual entities ‘with some regret’ …
http:// ifomis.de 146
Entities
http:// ifomis.de 147
Entities
universals (classes, types, roles …)
particulars (individuals, tokens, instances …)
Axiom: Nothing is both a universal and a particular
http:// ifomis.de 148
Two Kinds of Elite Entities
classes, within the realm of universals
instances within the realm of particulars
http:// ifomis.de 149
Entities
classes
http:// ifomis.de 150
Entities
classes*
*natural, biological
http:// ifomis.de 151
Entities
classes of objects
different axioms for classes of functions, processes, etc.
http:// ifomis.de 152
Entities
classes
instances
http:// ifomis.de 153
Classes are natural kinds
Instances are natural exemplars of natural kinds(problem of non-standard instances must be dealt with also)
http:// ifomis.de 154
Entities
classes
instancesinstances
penumbra of borderline cases
http:// ifomis.de 155
Entities
classes
instancesjunkjunk
junk
example of junk: beachball desk
http:// ifomis.de 156
Primitive opposition between universals and particulars
variables A, B, … range over universalsvariables x, y, … range over particulars
http:// ifomis.de 157
Primitive relations: inst and part
inst(Jane, human being)part(Jane’s heart, Jane’s body)
A class is anything that is instantiatedAn instance as anything (any individual) that
instantiates some class
http:// ifomis.de 158
Entities
human
Jane
inst
http:// ifomis.de 159
Entities
human
Jane’s heart part Jane
http:// ifomis.de 160
Axioms for part
Axioms governing part (= ‘proper part’) (1) it is irreflexive (2) it is asymmetric (3) it is transitive (+ usual mereological axioms)
part is the usual mereological relation among individuals
http:// ifomis.de 161
Definitions
class(A) =def x inst(x, A)
instance(x) = defA inst(x, A)
Theorem: Nothing can be both an instance and a class
http:// ifomis.de 162
Axiom of Extensionality
Classes which share identical instances are identical(need to take care of the factor of time)
http:// ifomis.de 163
Entities
classes
x, y, …
differentiae (roles, qualities…)
http:// ifomis.de 164
Differentiae
Aristotelian Definitions An A is a B which exemplifies C
C is a differentiaNo differentia is a classexemp(individual, differentia)exemp(Jane, rationality)objects exemplify roles
http:// ifomis.de 165
role
http:// ifomis.de 166
A is_a B genus(A)
species(A)
classes
instances
http:// ifomis.de 167
A is_a B =def x (inst(x, A) inst(x, B))
genus(A)=def B (B is_a A & B A) species(A)=def B (A is_a B & B A)
classes
instances
http:// ifomis.de 168
nearest species
nearestspecies(A, B)=def A is_a B &
C ((A is_a C & C is_a B) (C = A or C = B)
B
A
http:// ifomis.de 169
Definitions
highest genus
lowest species
http:// ifomis.de 170
lowest species and highest genus
lowestspecies(A)=def
species(A) & not-genus(A)
highestgenus(A)=def
genus(A) & not-species(A)
Theorem:class(A) genus(A) or lowestspecies(A)