Ontology Learning and Population from Text: Algorithms, Evaluation and Applications
Chapters 1 - 5. Ontology Learning and Population from Text: Algorithms, Evaluation and Applications. Presented by Sole.
![Page 1: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/1.jpg)
ONTOLOGY LEARNING AND POPULATION FROM TEXT: ALGORITHMS, EVALUATION AND APPLICATIONS
Presented by Sole
Chapters 1 - 5
![Page 2: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/2.jpg)
Introduction
- Artificial intelligence
  - Build systems that incorporate knowledge about a domain, reason on the basis of this knowledge, and solve problems not encountered before
  - Include an explicit and symbolic representation of knowledge about a domain
  - Symbolic representation and procedural aspects are separated so that the knowledge can be reused across systems
  - Which symbols to use, and what do they stand for?
![Page 3: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/3.jpg)
Introduction
- Ontology
  - Defines what is important in a domain and how concepts are related
  - Knowledge-based system: determines which symbols are needed and how they are interpreted
  - Logical level: interpretation can be constrained according to the ontology by axiomatizing symbols
- Issues
  - Costly to construct
  - Time-consuming
  - Significant coverage of the domain is needed
  - Meaning and consistent generalization are required
- Knowledge Acquisition Bottleneck
![Page 4: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/4.jpg)
Introduction
- Solution
  - Automatically learn ontologies from data
  - Goal: bridging the gap between
    - World of symbols (words used in natural language)
    - World of concepts (abstractions of human thought)
- Challenge
  - Correctness and consistency of the model cannot be guaranteed
  - Human post-processing is definitely necessary
  - Automatically learned ontologies need to be inspected, validated, and modified by humans before they can be used in applications relying on logical reasoning
![Page 5: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/5.jpg)
Ontologies
- Definition
  - Philosophical discipline: science of existence, or the study of being
  - Computer science: formal specifications of a conceptualization
  - Resources representing the conceptual model underlying a certain domain, describing it in a declarative fashion and thus cleanly separating it from procedural aspects
![Page 6: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/6.jpg)
Ontologies Example
![Page 7: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/7.jpg)
Learning from Text
- Ontology learning
  - Acquire a domain model from data
  - Lifting: XML-DTDs, UML diagrams, databases
  - Semi-structured sources: HTML, XML
  - Unstructured sources: ontology learning from text
![Page 8: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/8.jpg)
Learning from Text
- Meaning triangle
  - Every language has symbols that evoke a concept that refers to a concrete individual in the world
![Page 9: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/9.jpg)
Learning from Text
- Ontology population
  - Learning instances of concepts and relations
  - Knowledge markup or annotation: select text fragments and assign them to an ontological concept
- Applications
  - Several methods have been developed in recent years
- Challenge
  - No consensus within the ontology learning community on concrete tasks for ontology learning
  - Comparison between approaches is difficult
![Page 10: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/10.jpg)
Learning from Text
Ontology learning tasks (layer cake)
![Page 11: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/11.jpg)
Learning from Text
- Terms
  - Task: find a set of relevant concepts and relations
  - E.g., words, multi-word compounds
- State-of-the-art
  - IR methods
  - NLP methods: POS tagger, statistical approaches
![Page 12: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/12.jpg)
Learning from Text
- Synonyms
  - Task: find words which denote the same concept
  - E.g., synsets in WordNet
- State-of-the-art
  - Semantically similar words
  - Sense disambiguation and synonym discovery
  - Latent Semantic Indexing (LSI)
  - Statistical information measures defined over the Web to detect synonyms
![Page 13: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/13.jpg)
Learning from Text
- Concepts
  - Task: find intensional definitions of concepts, their extension, and the lexical signs used to refer to them
- State-of-the-art
  - Clusters of related terms
  - LSI-based techniques
  - Discovery of hierarchies of named entities
  - KnowItAll system
  - OntoLearn system
![Page 14: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/14.jpg)
Learning from Text
- Hierarchies
  - Task: concept hierarchy induction, refinement and lexical extension
- State-of-the-art
  - Lexico-syntactic patterns
  - Clustering algorithms to automatically derive concept hierarchies
  - Analysis of term co-occurrence in the same sentence/document
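The lexico-syntactic pattern idea listed above can be sketched as follows. This is a minimal illustration, not the method of any particular system: the single "such as" pattern, the regular expression, and the function name `extract_isa_pairs` are assumptions made for the example, and noun phrases are approximated by single words.

```python
import re

# Hypothetical pattern: "<hypernym> such as <hyponym>, <hyponym> and <hyponym>".
# Noun phrases are approximated by single words, which is a strong simplification.
SUCH_AS = re.compile(r"(\w+) such as ((?:\w+, )*\w+(?: (?:and|or) \w+)?)")


def extract_isa_pairs(text):
    """Return (hyponym, hypernym) pairs matched by the 'such as' pattern."""
    pairs = []
    for match in SUCH_AS.finditer(text.lower()):
        hypernym = match.group(1)
        # Split the enumeration into its individual members.
        hyponyms = re.split(r", | and | or ", match.group(2))
        pairs.extend((hyponym, hypernym) for hyponym in hyponyms if hyponym)
    return pairs


pairs = extract_isa_pairs(
    "The shop sells instruments such as violins, cellos and flutes."
)
print(pairs)
```

Each extracted pair is a candidate is-a link for a concept hierarchy. A realistic extractor would match chunked noun phrases rather than single words and would use several patterns.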
![Page 15: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/15.jpg)
Learning from Text
- Relations
  - Task: learn relation identifiers or labels as well as their appropriate domain and range
- State-of-the-art
  - Association rules
  - Syntactic dependencies
  - Very few approaches address the issue of learning ontology relations from text
![Page 16: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/16.jpg)
Learning from Text
- Axiom schemata instantiations
  - Task: learn which concepts, relations, or pairs of concepts the axioms in a given system apply to
- General axioms
  - Task: derive more complex relationships and connections between concepts and relations
  - Logical interpretations constraining the interpretation of concepts and relations
![Page 17: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/17.jpg)
Learning from Text
- Population
  - Task: learn instances of concepts and relations
- State-of-the-art
  - Associated with well-known tasks for which a variety of approaches have been developed
    - Information extraction
    - Named entity recognition
![Page 18: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/18.jpg)
Basics
Natural Language Processing
![Page 19: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/19.jpg)
Basics
- Pre-processing steps
  - Chunking
  - Syntactic analysis: parsing
NLP
![Page 20: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/20.jpg)
Basics
- Pre-processing
  - Contextual features
  - Syntactic dependencies
- Bank: River, FinancialInstitution
- Example text: The museum houses an impressive collection of medieval and modern art. The building combines geometric abstraction with classical references that allude to the Roman influence on the region.
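One simple way to obtain contextual features is to count the words that occur within a fixed window around a target word. The sketch below assumes whitespace tokenization and a symmetric window of two tokens. Both choices, and the function name `context_features`, are illustrative assumptions and are not taken from the slides.

```python
from collections import Counter


def context_features(tokens, target, window=2):
    """Count words occurring within `window` positions of each `target` occurrence."""
    features = Counter()
    for i, token in enumerate(tokens):
        if token != target:
            continue
        left = tokens[max(0, i - window):i]
        right = tokens[i + 1:i + 1 + window]
        features.update(left + right)
    return features


sentence = "The museum houses an impressive collection of medieval and modern art"
tokens = sentence.lower().split()
features = context_features(tokens, "collection")
print(features)
```

The resulting counts form a feature vector for the target word. Features based on syntactic dependencies would replace the window with relations produced by a parser.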
![Page 21: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/21.jpg)
Basics
Similarity measures
![Page 22: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/22.jpg)
Basics
- Similarity measures
  - Binary similarity measures
  - Geometric similarity measures
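The formulas on this slide did not survive extraction, so the sketch below picks common representatives as an assumption: the Jaccard and Dice coefficients as binary measures over feature sets, and the cosine as a geometric measure over weighted feature vectors. The two example vectors are invented for illustration.

```python
import math


def jaccard(a, b):
    """Binary measure: shared features divided by all features of either set."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def dice(a, b):
    """Binary measure: twice the shared features divided by the summed set sizes."""
    total = len(a) + len(b)
    return 2 * len(a & b) / total if total else 0.0


def cosine(u, v):
    """Geometric measure: cosine of the angle between two sparse vectors (dicts)."""
    dot = sum(weight * v.get(feature, 0.0) for feature, weight in u.items())
    norm_u = math.sqrt(sum(weight * weight for weight in u.values()))
    norm_v = math.sqrt(sum(weight * weight for weight in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical context vectors for two words.
u = {"water": 3.0, "shore": 1.0}
v = {"water": 1.0, "money": 2.0}

print(jaccard(set(u), set(v)), dice(set(u), set(v)), cosine(u, v))
```

Binary measures only look at which features are present, while the cosine also takes the feature weights into account.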
![Page 23: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/23.jpg)
Basics
- Similarity measures
  - Measures based on probability distributions
  - Hypothesis testing
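The slide does not name specific measures, so the following sketch uses well-known examples as an assumption: the Kullback-Leibler and Jensen-Shannon divergences for comparing probability distributions, and Pearson's chi-square statistic on a 2x2 contingency table as a hypothesis test for association between two words. All function names are illustrative.

```python
import math


def kl_divergence(p, q):
    """D(p || q) in bits; assumes q[x] > 0 wherever p[x] > 0."""
    return sum(px * math.log2(px / q[x]) for x, px in p.items() if px > 0)


def jensen_shannon(p, q):
    """Symmetric, always finite divergence built from KL against the mean distribution."""
    keys = set(p) | set(q)
    mean = {x: 0.5 * (p.get(x, 0.0) + q.get(x, 0.0)) for x in keys}
    return 0.5 * kl_divergence(p, mean) + 0.5 * kl_divergence(q, mean)


def chi_square_2x2(o11, o12, o21, o22):
    """Pearson's chi-square statistic for a 2x2 table of observed counts."""
    n = o11 + o12 + o21 + o22
    denominator = (o11 + o12) * (o11 + o21) * (o12 + o22) * (o21 + o22)
    if denominator == 0:
        return 0.0
    return n * (o11 * o22 - o12 * o21) ** 2 / denominator


# Hypothetical distributions over context words.
p = {"water": 0.5, "shore": 0.5}
q = {"water": 0.5, "money": 0.5}
print(jensen_shannon(p, q))
print(chi_square_2x2(10, 0, 0, 10))
```

A divergence of zero means the two distributions are identical. A large chi-square value suggests that the two events in the table do not occur independently.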
![Page 24: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/24.jpg)
Basics
- Term relevance
  - Weight the importance of a term in a document
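The weighting formula from this slide is not preserved in the transcript. A common choice for weighting a term in a document is tf-idf, which is assumed in the sketch below. The three toy documents and the unsmoothed `log(N / df)` variant are illustrative choices.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: weight} dict per document, with weight = tf * log(N / df)."""
    n_docs = len(documents)
    document_frequency = Counter()
    for tokens in documents:
        # Count each term once per document.
        document_frequency.update(set(tokens))
    weights = []
    for tokens in documents:
        term_frequency = Counter(tokens)
        weights.append({
            term: count * math.log(n_docs / document_frequency[term])
            for term, count in term_frequency.items()
        })
    return weights


docs = [
    "the bank approved the loan".split(),
    "the river bank flooded".split(),
    "the museum houses modern art".split(),
]
weights = tf_idf(docs)
print(weights[0])
```

A term that appears in every document, such as "the" here, receives weight zero, while a term confined to one document receives the highest weight per occurrence.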
![Page 25: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/25.jpg)
Basics
- WordNet
  - Lexical database for the English language
![Page 26: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/26.jpg)
Basics
- Formal concept analysis
  - Formal objects: concepts
  - Formal attributes: characteristics describing objects
  - Incidence relation: information about which attributes hold for each object
  - Formal objects + formal attributes + incidence relation = formal context
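The definitions above can be made concrete with a small sketch. The formal context below (objects mapped to their attributes) is a hypothetical example and is not necessarily the one shown on the following slides. The brute-force enumeration is only meant for tiny contexts, since it visits every subset of objects.

```python
from itertools import combinations


def common_attributes(objects, context):
    """Attributes shared by all given objects (all attributes for the empty set)."""
    shared = set().union(*context.values())
    for obj in objects:
        shared &= context[obj]
    return shared


def common_objects(attributes, context):
    """Objects that have every one of the given attributes."""
    return {obj for obj, attrs in context.items() if attributes <= attrs}


def formal_concepts(context):
    """Enumerate all (extent, intent) pairs by closing every subset of objects."""
    concepts = set()
    objects = list(context)
    for size in range(len(objects) + 1):
        for subset in combinations(objects, size):
            intent = common_attributes(subset, context)
            extent = common_objects(intent, context)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts


# Hypothetical formal context: the incidence relation as object -> attributes.
context = {
    "hotel": {"bookable"},
    "apartment": {"bookable", "rentable"},
    "car": {"bookable", "rentable", "driveable"},
    "bike": {"bookable", "rentable", "driveable", "rideable"},
}
concepts = formal_concepts(context)
for extent, intent in sorted(concepts, key=lambda c: -len(c[0])):
    print(sorted(extent), sorted(intent))
```

Each formal concept pairs a set of objects (extent) with exactly the attributes they share (intent). Ordering the concepts by extent inclusion yields the concept lattice.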
![Page 27: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/27.jpg)
Basics
Example
FCA
![Page 28: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/28.jpg)
Basics
Example
![Page 29: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/29.jpg)
Basics
- Machine learning
  - Automatic recognition/detection of patterns and regularities within sample data
  - Patterns can be used to understand/describe the data or to make predictions
- Learning process
  - Supervised
    - Predicts the appropriate category for an example from a set of categories represented by a set of labels
  - Unsupervised
    - Searches for common and frequent structures within the data (data exploration)
![Page 30: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/30.jpg)
Basics
- Supervised learning
  - Regression
    - Numeric prediction (labels are continuous values)
  - Classification
    - Assign the proper category to a given example
ML
- Figure labels: target value, feature vector
![Page 31: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/31.jpg)
Basics
- Classifiers
  - Bayesian classifiers
  - Decision trees
  - Instance-based learning
  - Support vector machines
  - Artificial neural networks
- Tools
  - WEKA
  - RapidMiner
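As an illustration of instance-based learning from the list above, here is a minimal k-nearest-neighbour classifier in plain Python. The training data, the labels "A" and "B", and the function name `knn_predict` are hypothetical. Toolkits such as WEKA provide full implementations.

```python
import math
from collections import Counter


def knn_predict(training_data, query, k=3):
    """Predict a label by majority vote among the k nearest training examples.

    training_data is a list of (feature_vector, label) pairs.
    """
    neighbours = sorted(
        training_data,
        key=lambda example: math.dist(example[0], query),
    )[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


# Hypothetical labeled examples: (feature vector, target value).
training_data = [
    ((1.0, 1.0), "A"),
    ((1.2, 0.8), "A"),
    ((0.9, 1.1), "A"),
    ((5.0, 5.0), "B"),
    ((5.2, 4.9), "B"),
    ((4.8, 5.1), "B"),
]
print(knn_predict(training_data, (1.1, 1.0)))
print(knn_predict(training_data, (5.0, 5.1)))
```

No model is built in advance: the training examples themselves are stored and consulted at prediction time, which is what distinguishes instance-based learning from the other classifier families.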
![Page 32: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/32.jpg)
Basics
Examples
![Page 33: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/33.jpg)
Basics
- Unsupervised learning
  - Clustering: find groups of similar objects in data
  - There is no labeled data to train from
- Classification of clustering algorithms
  - Hierarchical vs. non-hierarchical
    - Non-hierarchical algorithms produce a set of groups
    - Hierarchical algorithms order groups in a tree structure
  - Hard vs. soft
    - Hard: elements are assigned to distinct clusters
    - Soft: elements are assigned to clusters with a certain degree of membership
![Page 34: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/34.jpg)
Basics
- Algorithms
  - K-means
  - Hierarchical clustering
    - Hierarchical agglomerative (bottom-up) clustering
    - Divisive (top-down) clustering
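K-means, the first algorithm listed above, is a hard, non-hierarchical clustering method. The sketch below follows the usual assign-then-update loop. Using the first k points as initial centroids and running a fixed number of iterations are simplifying assumptions for the example, and the data points are invented.

```python
import math


def k_means(points, k, iterations=10):
    """Cluster points into k groups; the first k points serve as initial centroids."""
    centroids = [tuple(point) for point in points[:k]]
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in range(k)]
        for point in points:
            nearest = min(range(k), key=lambda i: math.dist(point, centroids[i]))
            clusters[nearest].append(point)
        # Update step: each centroid moves to the mean of its cluster.
        centroids = [
            tuple(sum(coords) / len(cluster) for coords in zip(*cluster))
            if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
    return centroids, clusters


points = [(1.0, 1.0), (5.0, 5.0), (1.2, 0.8), (5.2, 4.9), (0.9, 1.1), (4.8, 5.1)]
centroids, clusters = k_means(points, k=2)
print(centroids)
print(clusters)
```

Each point ends up in exactly one cluster, which makes this a hard clustering. Agglomerative and divisive algorithms would instead return a tree of nested clusters.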
![Page 35: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications](https://reader033.vdocuments.us/reader033/viewer/2022051118/5681610c550346895dd05c75/html5/thumbnails/35.jpg)
Datasets
Corpus description