introduction to semantic web, metadata, knowledge representation, ontologies

125
Introduction to the Semantic Web

Upload: fulvio-corno

Post on 01-Nov-2014

1.321 views

Category:

Education


2 download

DESCRIPTION

Lecture slides for the course "Semantic Web: Technologies, Tools, Applications" by Fulvio Corno and Laura Farinetti at Politecnico di Torino, year 2012.Further information and material: http://elite.polito.it/teaching-mainmenu-69/master-a-phd-mainmenu-94/56-01lhviu-semweb

TRANSCRIPT

Page 1: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

Introduction to

the Semantic Web

Page 2: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 2

Semantic Web

Web second generation

Web 3.0

“Conceptual structuring of the Web in an explicit

machine-readable way”

(Tim Berners-Lee)

In other words…

…let the machine do most of the work!!!

http://www.w3.org/2001/sw/

Page 3: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 3

“Official” introduction The Semantic Web is a web of data. There is

lots of data we all use every day, and its not part

of the web. I can see my bank statements on the

web, and my photographs, and I can see my

appointments in a calendar. But can I see my

photos in a calendar to see what I was doing

when I took them? Can I see bank statement

lines in a calendar?

Why not? Because we don’t have a web of data.

Because data is controlled by applications, and

each application keeps it to itself

Page 4: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 4

Example

Page 5: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 5

“Official” introduction

The Semantic Web is about two things

It is about common formats for integration and combination of data drawn from diverse sources, where on the original Web mainly concentrated on the interchange of documents.

It is also about language for recording how the data relates to real world objects. That allows a person, or a machine, to start off in one database, and then move through an unending set of databases which are connected not by wires but by being about the same thing

Page 6: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 6

An example …

How can a machine distinguish the

meanings … ?

“I am a professor of computer science.”

“I am a professor of computer science,

you may think. Well,…”

Page 7: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 7

Key principles

The Semantic Web is the Web

Same base technologies, evolutionary

Decentralized (incomplete, inconsistent)

Provide explicit statements regarding web resources

Authors, original information providers

Intermediaries (humans and/or machines)

Information consumers determine consequences of the statements

Distributed ‘reasoning’

Page 8: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 8

1989:

WWW

original

proposal

Page 9: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 9

Technology stack (old: pre-2008)

Page 10: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 10

Technology stack (current: 2008)

Page 11: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 11

The real world

Not

yet...!

Page 12: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 12

The real world

Not

yet...! Not always

necessary...

Page 13: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 13

The real world

Not

yet...! Not always

necessary...

Page 14: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

Current “hot” topics

F. Corno, L. Farinetti - Politecnico di Torino 14

Page 15: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

Metadata and

Metadata Standards

Page 16: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 16

Goal of the semantic Web

The Semantic Web will enable

machines to COMPREHEND semantic

documents and data, NOT human

speech and writing

Then, how???

Semantic Web foundation: metadata

Page 17: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 17

Resource and description

Resource

Content, format, …

Access method dependent on format (I can read it if I “know” its language)

Resource description

Independent of the format (I can read “people’s comments” about the resource… provided that I know the language in which the comment is written)

Page 18: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 18

Resource and description

description

resource

this resource

was created on

April 14th, 2009

the title of this

resource is

“Introduction to

the Semantic

Web”

the author of

this resource

is L. Farinetti

this resource is

related to

computer

science,

knowledge

representation

and metadata

the quality of

this resource

is high,

according to F.

Corno

this resource is suitable

for PhD students

Page 19: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 19

Resource and description

Resource Content, format, …

Access method dependent on format (I can read it if I “know” its language)

Standardization (i.e. common language for applications) ??? Practically impossible …

Huge amount of existing information

Hundreds of human languages

Hundreds of computer languages (other word for formats)

Page 20: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 20

Resource and description

Resource description

Independent of the format (I can read “people’s

comments” about the resource… provided that I know

the language in which the comment is written)

Standardization (i.e. common language for

applications) ???

Feasible

Smaller amount of information, possibly new

Solution: define a standard language for writing

comments (“metadata” in semantic web terminology)

Page 21: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 21

Resource and description

this resource

was created on

April 14th, 2009

the title of this

resource is

“Introduction to

the Semantic

Web”

the author of

this resource

is L. Farinetti

this resource is

related to

computer

science,

knowledge

representation

and metadata

the quality of

this resource

is high,

according to F.

Corno

this resource is suitable

for PhD students

Metadata

Field name = field value

Page 22: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 22

Resource and description

description

resource

Date =

2009-04-14

Title =

“Introduction to

the Semantic

Web”

Author =

L. Farinetti

Topic =

{computer

science,

knowledge

representation,

metadata}

Quality = high

Level = PhD students

Rated by F. Corno

Page 23: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 23

Semantic Web main tasks

Metadata annotation

Description of resources using standard

languages

Search

Retrieve relevant information according to

user’s query / interest / intention

Use metadata (and possibly content) in a

“smart” way (i.e. “reasoning” about the

meaning of annotations)

Page 24: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 24

Meaningful metadata annotations

Common language for describing resources

Resource description standards

Common language for description field names

Metadata standards

Common language for description field values

Metadata standards + controlled vocabularies

Semantically rich descriptions to support search

Knowledge representation techniques, ontologies

Page 25: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 25

Common language for describing

resources

Page 26: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 26

Common language for describing

resources

Resource Description Framework (RDF)

Resource = URI (retrievable, or not)

RDF is structured in statements

A statement is a triple

Subject – predicate – object

Subject: a resource

Predicate: a verb / property / relationship

Object: a resource, or a literal string

Page 27: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 27

Common language for describing

resources

Diagram:

Simple RDF assertion (triple):

triple (hasAuthor, URI, L.Farinetti)

URI L.Farinetti hasAuthor

Author =

L. Farinetti

Page 28: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 28

Common language for describing

resources

RDF in XML syntax:

Author =

L. Farinetti

<RDF xmlns=“http://www.w3.org/TR/ … ” >

<Description about=“http://www.polito.it/semweb/intro”>

<Author>L.Farinetti</Author>

</Description>

</RDF>

Page 29: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 29

Common language for field names

Title = ...

Problem

Author = …

Creator, Maker,

Contributor …

Synonymy

Topic = …

Topics, Subject, Subjects,

Argument, Arguments

Singular / plural

Level = …

Difficult to clearly

define concept in a

few words

Educational level,

destination, suitability, …

Date = …

Date of creation, date of

last modification, date of

revision, …

Different concepts:

need for more details

Page 30: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 30

Common language for field names

Solution: metadata standards

Many standardization bodies are involved

Standards may be general

e.g. Dublin Core (DC)

or may depend on goal, context, domain, …

e. g. educational resources (IEEE LOM), multimedia

resources (MPEG-7), images (VRA), people (FOAF,

IEEE PAPI), geospatial resources (GSDGM),

bibliographical resources (MARC, OAI), cultural

heritage resources (CIDOC CRM)

Page 31: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 31

Metadata standards examples

Page 32: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 32

Dublin Core

Dublin Core Metadata Element Set

(DCMES)

Building blocks to define metadata for the

Semantic Web

15 elements, or categories, general enough to

describe most of the published resources

Extra elements and element refinements

Page 33: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 33

DC metadata element set

Page 34: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 34

Example of description using

Dublin Core (in RDF)

A paper in the

“Ariadne” journal

Page 35: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 35

Common language for field values

Problems

Value type

Title =

“Introduction to

the Semantic

Web”

type = string

Date =

2009-04-14

type = date

Author =

L. Farinetti

type = string

“standard” format?

Laura Farinetti, Farinetti

Laura, Farinetti L., …

Page 36: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 36

Common language for field values

Problems

Value type

Value restrictions?

freedom vs shared understanding

Quality = high

High, medium, low?

1 to 5?

any value?

Level = PhD students

any value?

list of possible values?

Topic =

{computer

science,

knowledge

representation,

metadata}

any value?

any number of values?

Page 37: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 37

Common language for field values

Solution: metadata standards + controlled

vocabularies

Metadata standards

Only some, and partially

Controlled vocabularies

Explicit list of possible values

Page 38: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 38

Examples from IEEE LOM

1484.12.1 - 2002 Learning Object

Metadata (LOM) Standard

Developed by the IEEE Learning Technology

Standards Committee (LTSC)

Standard to describe the “Learning

Objects” in order to guarantee their

interoperability

Page 39: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 39

Examples from IEEE LOM

Page 40: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 40

Examples from IEEE LOM

Page 41: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 41

Examples from IEEE LOM

Page 42: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 42

… + controlled vocabularies

A closed list of named subjects, which can

be used for classification

Metadata field values are

restricted to a list of terms

(selected by experts)

Topic =

{computer

science,

informatics,

knowledge

representation,

metadata}

Page 43: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 43

Semantically rich descriptions to

support search

Page 44: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 44

Semantically rich descriptions to

support search

http://dictybase.org/db/html/help/GO.html

Topic =

{metabolism, …}

Page 45: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

Knowledge

Representation

Page 46: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 46

Need for knowledge representation

Semantically rich descriptions need

“understanding” the meaning of a resource

and the domain related to the resource

Disambiguation of terms

Shared agreement on meanings

Description of the domain, with concepts and

relations among concepts

Page 47: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 47

Example: Dublin Core metadata

Metadata of a single paper

Page 48: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 48

Problems

Title usually offers good clues, but it does not necessarily mention all names of all

subjects the user is interested in

it may presuppose knowledge the user does not actually possess

Subject is meant to convey precisely what the document is about, but much depends on how extensive the set of keywords

is, whether all related subjects are mentioned, and whether too many subjects are listed

Metadata does not say much about “how related” a resource is to a given subject

Page 49: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 49

Search results for “topic maps”

Page 50: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 50

Problems

Authors were free to define their own

subject keywords

Results are not “about” topic maps, but

“related to” topic maps

If an author forgets to list “topic maps”, his

paper will never be found

Page 51: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 51

Subject-based classification

Any form of content classification that groups objects by their subjects e.g the use of keywords to classify papers

Metadata fields describe what the objects are about by listing discrete subjects inside a subject-based classification

Important: difference between describing the objects being classified and describing the subjects used to classify them Metadata describe objects

Subject-based classification is the approach to describe subject

Page 52: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 52

Subject-based classification ... “On those remote pages it is written that animals are divided into:

a. those that belong to the Emperor b. embalmed ones c. those that are trained d. suckling pigs e. mermaids f. fabulous ones g. stray dogs h. those that are included in this classification i. those that tremble as if they were mad j. innumerable ones k. those drawn with a very fine camel's hair brush l. others m. those that have just broken a flower vase n. those that resemble flies from a distance"

From The Celestial Emporium of Benevolent Knowledge, Borges

Page 53: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 53

Subject-based classification

techniques

Controlled vocabularies

Taxonomies

Thesauri

Faceted classification

Ontologies

Folksonomies

Others

… Most come from library science

Page 54: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 54

Controlled vocabulary

A closed list of named subjects, which can be

used for classification

Composed of terms: particular name for a

particular concept

similar to keywords

Terms are not concepts

A single term may be the name of one or more

concepts

A single concept may have multiple names

Ambiguity avoided by forbidding duplicate terms

Page 55: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 55

Controlled vocabulary

Goal

Prevent authors from defining terms that are meaningless, too broad or too narrow

Prevent authors from misspelling

Prevent different authors from choosing slightly different forms of the same term

The simplest form of controlled vocabulary is a list of terms (or “pick list”)

Topic =

{computer

science,

knowledge

representation,

mtadata, RDF,

topic navigation

maps} topic maps

Page 56: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 56

Controlled vocabulary

Reduce ambiguity inherent in normal human languages

Solve the problems of homographs, homonyms, synonyms and polysemes by ensuring

That each concept is described using only one authorized term

That each authorized term in the controlled vocabulary describes only one concept

Page 57: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 57

Problems solved

Synonym

different words with identical or very similar meanings

Page 58: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 58

Problems solved

Synonym

different words with identical or very similar meanings

close “Will you please close that door!”

“The tiger was now so close that I could smell it...”

pupil student

opening in the iris of the eye

axes

('æk.səz) plural of axe

('æk.siz) plural of axis

Page 59: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 59

Problems solved

Synonym

different words with identical or very similar meanings

student and pupil (noun)

buy and purchase (verb)

sick and ill (adjective)

to get

take (I'll get the drinks)

become (she got scared)

wood

understand (I get it)

a piece of a tree

a geographical area with many trees

Page 60: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 60

Controlled vocabulary examples

Practically no “real” examples

With very little extra effort: taxonomies and

thesauri!

Circuit theory

Electronic circuits

Microwave technology

Electron tubes

Semiconductor materials and devices

Dielectric materials and devices

Magnetic materials and devices

Superconducting materials and devices

Blood

Cord blood

Erythrocyte

Leukocyte

Basophil

Eosynophil

Lymphoblast

Lymphocyte

Monocyte

Neutrophil

Page 61: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 61

Taxonomy

Subject-based classification that arranges the terms in the controlled vocabulary into a hierarchy

Dates back to Carl Linnæus’s work on zoological and botanical classification (18th century)

Page 62: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 62

Taxonomy

Allow related terms to be grouped together

It is clear that “topic

maps” and “XTM” are

related

Easier to classify

documents

Easier to choose

search keywords

Page 63: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 63

Taxonomies and metadata

Metadata are stored as usual with the resource

The “subject” will contain only controlled terms

Controlled terms belong to a hierarchy, shared by all papers

Page 64: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 64

Taxonomy example: INSPEC

http://www.theiet.org/publishing/inspec/index.cfm

Page 65: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 65

Taxonomy example: INSPEC

Page 66: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 66

INSPEC

journal

article

database

Page 67: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 67

Taxonomy example: anatomy terms

http://www.cbil.upenn.edu/anatomy.php3

Page 68: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 68

Taxonomy example

Page 69: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 69

Taxonomy example

http://www.acm.org/class/1998/ccs98.html

Page 70: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 70

Taxonomy limits

Only two kinds of relationships between terms

Parent = broader term

Child = narrower term

topic navigation maps synonym

no more in use

difference?

synonym XML topic map

difference?

Page 71: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 71

Thesaurus

Extends taxonomies

subjects are arranged in a hierarchy

Other statements can be made about the

subjects

Two ISO standards

ISO2788 for monolingual thesauri

ISO5964 for multilingual thesauri

Page 72: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 72

Thesaurus relationships

BT – broader term Refers to a term with wider or less specific meaning

Some systems allow multiple BTs for one term, while others do not

Inverse property: NT - narrower term

A taxonomy only uses BT and NT

SN – scope note String explaining its meaning within the thesaurus

Useful when the precise meaning of the term is not obvious from context

Page 73: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 73

Thesaurus relationships USE

Another term that is to be preferred instead of this term

Implies that the terms are synonymous

Inverse property: UF

TT – top term The topmost ancestor of this term

The BT of the BT of the BT...

RT – related term A term that is related to this term, without being a

synonym of it or a broader/narrower term

Page 74: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 74

Thesaurus example

http://www.ukat.org.uk/thesaurus/

Page 75: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 75

Thesaurus example

http://www.swinburne.edu.au/corporate/registrar/rms/keywords.htm

Page 76: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 76

Thesaurus example

Library of Congress

Subject Heading

http://www.loc.gov/cds/lcsh.html

Page 77: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

W3C

standard:

SKOS

F. Corno, L. Farinetti - Politecnico di Torino 77

UK Archival Thesaurus

Page 78: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 78

Faceted classification

Proposed by

S.R. Ranganathan in the ‘30s

Facets are the different axes along which

documents can be classified

Each facet contains a number of terms

Usually with a thesaurus organization

Usually a term belongs to one facet only

A document is classified by selecting one term

from each facet

Page 79: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 79

Faceted classification example

http://flamenco.berkeley.edu/

Page 80: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 80

Advantages

Multi-

dimensionality

Persistence

Scalability

Flexibility

http://freeable.polito.it/

Page 81: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 81

Ontology

Model for describing the world that

consists of a set of types, properties, and

relationships

Extends the other subject-based

classification approaches

Has open vocabularies

Has open relationship types (not just BT/NT,

RT and USE/UF)

Page 82: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 82

Ontology structure

Concepts

Relationships

Is-a

Other

Instances

Page 83: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 83

Folksonomy

Internet-mediated social environments

Tags compiled through social tagging

Social tagging

Decentralized practice where individuals and groups create, manage and share tags to annotate digital resources in an online social environment

Generally characterized by non-standard tagging

Page 84: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 84

Other subject-based techniques

Synonym rings

Connect together a set of terms as being

equivalent for search purpose

Similar to UF/USE relationship of thesauri,

but no preferred term

Page 85: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 85

Other subject-based techniques

Authority file

Similar to a synonym ring, but consists of UF/USE

relationships instead of synonym relationships

One term in each synonym ring is indicated as the

preferred term for that subject

e.g. Library of

Congress Name

Authority File

Page 86: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 86

Subject-based classification

summary

Terminology is rarely used

in a consistent way

Controlled vocabularies

are thesauri, thesauri are

ontologies, …

http://www.iesr.ac.uk/profile/vocabs/index.html/#CtrldVocabsList

Page 87: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 87

Subject-based classification

summary

Page 88: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

Ontologies

Page 89: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 89

Semantically rich descriptions to

support search

http://dictybase.org/db/html/help/GO.html

Topic =

{metabolism, …}

Page 90: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 90

Ontologies

An ontology is an explicit description of

a domain

concepts

properties and attributes of concepts

constraints on properties and attributes

individuals (often, but not always)

An ontology defines

a common vocabulary

a shared understanding

Page 91: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 91

“Ontology engineering”

Defining terms in the domain and relations

among them

defining concepts in the domain (classes)

arranging the concepts in a hierarchy

(subclass-superclass hierarchy)

defining which attributes and properties (slots)

classes can have and constraints on their

values

defining individuals and filling in slot values

Page 92: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 92

Why develop an ontology?

To share common understanding of the

structure of information

among people

among software agents

To enable reuse of domain knowledge

to avoid “re-inventing the wheel”

to introduce standards to allow interoperability

Page 93: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 93

An ontology

HNC

HND

Certificate

Diploma

Award

2 years

1 year

Is_a

Is_a

Is_a

Is_a

takes

takes

takes

takes

Is_equivalent_to

Page 94: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 94

A more complex ontology [base.Entity] Person Worker Faculty Professor AssistantProfessor AssociateProfessor FullProfessor VisitingProfessor Lecturer PostDoc Assistant ResearchAssistant TeachingAssistant AdministrativeStaff Director Chair {Professor} Dean {Professor} ClericalStaff SystemsStaff Student UndergraduateStudent GraduateStudent

Page 95: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 95

A more complex ontology Organization Department School University Program ResearchGroup Institute Publication Article TechnicalReport JournalArticle ConferencePaper UnofficialPublication Book Software Manual Specification Work Course Research Schedule

Page 96: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 96

A more complex ontology

Relation Argument 1 Argument 2 ====================================================== publicationAuthor Publication Person publicationDate Publication .DATE publicationResearch Publication Research softwareVersion Software .STRING softwareDocumentation Software Publication teacherOf Faculty Course teachingAssistantOf TeachingAssistant Course takesCourse Student Course age Person .NUMBER emailAddress Person .STRING head Organization Person undergraduateDegreeFrom Person University mastersDegreeFrom Person University doctoralDegreeFrom Person University advisor Student Professor subOrganization Organization Organization ………..

Page 97: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 97

Example of ontology engineering

chair

Page 98: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 98

Example of ontology engineering

1.A piece of furniture consisting of a seat, legs, back, and often

arms, designed to accommodate one person.

2.A seat of office, authority, or dignity, such as that of a bishop.

a.An office or position of authority, such as a professorship.

b.A person who holds an office or a position of authority,

such as one who presides over a meeting or administers a

department of instruction at a college; a chairperson.

3.The position of a player in an orchestra.

4.Slang. The electric chair.

5.A seat carried about on poles; a sedan chair.

6.Any of several devices that serve to support or secure, such as

a metal block that supports and holds railroad track in position.

chair

Page 99: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 99

Example of ontology engineering

A piece of furniture consisting of a seat, legs, back,

and often arms, designed to accommodate one

person.

chair

Page 100: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 100

Example of ontology engineering

chair seat stool bench

Page 101: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 101

Example of ontology engineering

Something I can sit on

chair seat stool bench

Something I can sit on

???

Page 102: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 102

chair seat stool bench

Something I can sit on

“sittable”

Example of ontology engineering

Page 103: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 103

chair seat stool bench

table

Example of ontology engineering

Something I can sit on

“sittable”

Page 104: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 104

Example of ontology engineering

Something I can sit on

chair seat stool bench

“for_sitting”

table

“sittable”

Something designed for sitting

Page 105: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 105

Ontology structure

chair seat stool bench

“for_sitting”

table

“sittable”

Page 106: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 106

Concepts

Some piece of furniture that can

be used to sit on, either by

design or by its shape.

Furniture to sit on

Shorthand name

Synthetic title

Definition “sittable”

Page 107: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 107

Internationalization

Some piece of furniture that can

be used to sit on, either by

design or by its shape.

Furniture to sit on

Shorthand name

Synthetic title

Definition

Furniture to sit on Furniture to sit on Furniture to sit on Furniture to sit on Furniture to sit on Furniture to sit on

Some piece of furniture that can

be used to sit on, either by

design or by its shape.

Some piece of furniture that can

be used to sit on, either by

design or by its shape.

Some piece of furniture that can

be used to sit on, either by

design or by its shape.

Some piece of furniture that can

be used to sit on, either by

design or by its shape.

Some piece of furniture that can

be used to sit on, either by

design or by its shape.

Some piece of furniture that can

be used to sit on, either by

design or by its shape.

“sittable”

Page 108: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 108

Relationships

chair seat stool bench

“for_sitting”

table

“sittable”

is_a is_a is_a

is_a

is_a

is_a

room material

wood

is_a

classroom

dining room

is_a is_a

Page 109: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 109

Relationships

chair seat stool bench

“for_sitting”

table

“sittable”

is_a is_a is_a

is_a

is_a

is_a

room material

wood

is_a

classroom

dining room

is_a is_a

made_of

made_of

Page 110: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 110

Ontology building blocks

Ontologies generally describe:

Individuals

the basic or “ground level” objects

Classes

sets, collections, or types of objects

Attributes

properties, features, characteristics, or parameters

that objects can have and share

Relationships

ways that objects can be related to one another

Page 111: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 111

Individuals

Also known as “instances”

can be concrete objects

animals

molecules

trees

or abstract objects

numbers

words

Page 112: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 112

Concepts

Also known as “Classes”

abstract groups, sets, or collections of objects

They may contain

individuals

other classes

a combination of both

Examples

Person: the class of all people

Vehicle: the class of all vehicles

Page 113: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 113

Concepts Can be defined extensionally …

By defining every object that falls under the definition

of the concept

A class C is extensionally defined if and only if for

every class C', if C' has exactly the same members of

C, C and C' are identical

E.g.: DayOfWeek = {Monday, Tuesday, Wednesday,

Thursday, Friday, Saturday, Sunday}

… or intensionally

By defining the necessary and sufficient conditions for

belonging to the concept

E.g.: “bachelor” is an “unmarried man”

Page 114: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 114

Concepts

Defined by

Name: any identifier, usually carefully chosen

Definition: describes the well agreed meaning

of the concept, in a human readable form

Terms (Lexicon): list of terms (synonyms, etc.)

usually adopted to identify the concept

Page 115: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 115

Subsumption

A concept (class) can subsume / be

subsumed by any other class

Subsumption is used to establish class

hierarchies

Page 116: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 116

Class partition

A set of related classes and associated

rules that allow objects to be placed into

the appropriate class

GEOMETRIC

FIGURE

GEOMETRIC POINT

TWO DIMENSIONAL

FIGURE ONE

DIMENSIONAL FIGURE

Page 117: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 117

Class partition

Disjoint partition

A disjoint partition rule guarantees that a

single instance of a class cannot be in more

than one sub-classes

E.g. one specific truck

cannot be in both

4-axle and

6-axle classes

VEHICLE

CAR TRUCK

6-AXLE 4-AXLE

Page 118: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 118

Class partition

Exhaustive partition

every concrete object in the super-class is an

instance of at least one of the partition

classes

Page 119: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 119

Attributes

Describe specific features

Can be complex (e.g.: list of values)

Defined for a class/concept (e.g. car)

Examples:

number-of-doors: 4

number-of-wheels: 4

engine: {3.0L,4.0L}

Page 120: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 120

Relationships

Attributes that relate two or more concepts

two concepts → binary relationship

three concepts → ternary relationship

Domain

the concept(s) from which the relationship departs

Range

the concept(s) to which the relationship applies

Page 121: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 121

Relationships

Examples

Car(MiniMinor) → individual definition

Car(Mini) → individual definition

Successor(Mini,MiniMinor) → relationship

domain range

Page 122: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 122

Commonly used relationships

Subsumption

the most important

is-superclass-of

usually denoted by its inverse is-a

(is-subclass-of)

Meronymy

is-part-of

describes how object are combined together

to form composite objects

Page 123: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 123

Example

http://www.yeastgenome.org/help/GO.html

Page 124: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 124

http://www.webology.ir/2006/v3n3/a28.html

Ontology alignment

Page 125: Introduction to Semantic Web, Metadata, Knowledge Representation, Ontologies

F. Corno, L. Farinetti - Politecnico di Torino 125

License

This work is licensed under the Creative

Commons Attribution-Noncommercial-

Share Alike 3.0 Unported License.

To view a copy of this license, visit

http://creativecommons.org/licenses/by-

nc-sa/3.0/ or send a letter to Creative

Commons, 171 Second Street, Suite 300,

San Francisco, California, 94105, USA.