foundations of the semantic web rocky dunlap college of computing georgia tech advisors: spencer...

39
Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Upload: gregory-adams

Post on 29-Dec-2015

217 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Foundations of the Semantic Web

Rocky DunlapCollege of ComputingGeorgia Tech

Advisors:Spencer RugaberLeo Mark

Page 2: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Things we can talk about

Earth System CuratorIncluding a short demo

Semantic Web......versus today’s Web...theoretical foundations...application areas

Implications for LogicBloxHow could SW technologies be useful

for LogicBlox?

Page 3: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

The short story on Curator

Problem statement Model-generated datasets in the climate community

lack metadata that is required for understanding and analyzing the dataset

Premise The descriptors used for comprehensively specifying

a model configuration are also needed for a scientifically useful description of the model output data

Solution Develop a standardized metadata formalism for

describing climate datasets that is based on the model configuration used to generate the dataset

Page 4: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

The Google Way

Keyword search...• Good for expert users who are familiar with the datasets

But...• Many details of what went into the dataset left out• Not machine-readable (e.g., by analysis tools)• Limited standardization• Not semantic or conceptual in nature

10 lines of metadataRun script: 1500+ lines plus input files

Page 5: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

NASA GEOS-5 Model

GEOS-5

surface fvcore gravi ty_wave_drag

history agcm

dynamics physics

chemistry moist_processes radiation turbulence

infrared solar lake land_ice data_ocean land

vegetation catchment

coupler

coupler coupler

coupler

coupler

coupler

coupler

http://gmao.gsfc.nasa.gov/systems/geos5/

Page 6: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Demo

http://cdp.ucar.edu:28080/query/queryESC.htm

Page 7: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Demo

Page 8: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Beyond Curator

Curator is a representative projectE-science communities are emerging

in many areas of science Prediction

Scientific data management -- an emerging area that will continue to grow

Page 9: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Today’s Web

Primarily intended for human consumption

Lots of content, but very little about what the content means

One relationship: the hyperlinkNo semantics really

HTML is almost completely about presentation

Page 10: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

HTML<html> <body> <h1>Rocky’s Pizzeria</h1> <h2>Where you make your own pizza pie!</h2>

<b>Crust options:</b> <ul> <li>Deep Pan</li> <li>Thin and Crispy</li> </ul>

<br/>

<b>Toppings:</b> <ul> <li>Mozzarella</li> <li>Anchovies</li> ....

Page 11: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

(Non-semantic) Hyperlinks

<html>

</html>

<html>

</html> <html>

</html>

<html>

</html>

<html>

</html>

<html>

</html>

<html>

</html>

<html>

</html> <html>

</html>

<a href=“http://www.logicblox.com/”>Home Page</a>

??

Page 12: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

The Semantic Web

Not a new webA layer on top of the existing web

Web-based content should be annotated with explicit semantics

Intended primarily for machine-consumption, i.e., “intelligent agents”

Markup for describing meaning of content, not just presentation

Page 13: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Giving meaning to content<restaurants> <restaurant> <name>Rocky’s Pizzeria</name> <description>Where you make your own pizza pie!</description> <phone>404-123-4567</phone> <address>123 Peachtree Street</address>

<menu choice=“crust”> <option>Deep Pan</option> <option>Thin and Crispy</option> </menu>

<menu choice=“toppings”> <option>Mozzarella</option> <option>Anchovies</option> </menu> <restaurant></restaurants>

Page 14: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Semantic Links

<html>

</html>

hasReview

hasPhotoAlbum

hasNewVersion

Page 15: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

SW Application Areas

Semantic interoperability and data integration health care, drug industry, e-science

Semantic search Can we outsmart Google?

Specific areas: Geographic Information Systems B2B Mediation Legal Automobile diagnostics and repair

http://www.w3.org/2001/sw/sweo/public/UseCases/

Page 16: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

SW Technology Stack

HTTP, URIs

Internet

XML, XML Schema

RDF(S)/OWL

Rules

Reliable transport

Hypertext transfer

Structured data transfer

Conceptual descriptions

Technology What it provides

Advanced reasoning

Page 17: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

SW Ontology Languages

Resource Description Framework (RDF)1

Provides the ability to make statements (propositions) about resources on the web

Web Ontology Language (OWL)2

More expressive features than RDFStrong theoretical basis on

Description Logics

Page 18: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

RDF Statements

“The Curator meeting is at GFDL.”

Curatormeeting GFDL

hasLocation

subject predicate object

Page 19: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

RDF Statements

“The Curator meeting is Oct 18-19.”

Curatormeeting GFDL

“18 Oct 2007”

“19 Oct 2007”

hasLocation

starts

ends

resource

literal

Page 20: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

RDF Statements

“Balaji works at GFDL.”

Curatormeeting GFDL

“18 Oct 2007”

“19 Oct 2007”

Balaji

hasLocationworksAt

starts

ends

Page 21: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

RDF XML Representation

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:esc="http://www.earthsystemcurator.org">

<rdf:Description rdf:about=“http://....#OctCuratorMeeting"><esc:hasLocation rdf:resource=“http://....#GFDL”/><esc:starts>18 Oct 2007</esc:starts><esc:ends>19 Oct 2007</esc:ends>

</rdf:Description>

<rdf:Description rdf:about=“http://....#Balaji"><esc:worksAt rdf:resource=“http://....#GFDL”/>

</rdf:Description>

</rdf:RDF>

Page 22: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

RDF Schema

Define a domain specific data model for RDFIncludes classes and properties (along

with subclasses and subproperties)Properties are first class (they are not

defined as part of a particular class)

Page 23: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

RDF Schema

ClassesProperties

Event

MeetingFlight

Person

hasLocation domain: Event range: Place

starts domain: Event range: date

Place

ends domain: Event range: date

worksAt domain: Person range: Place

Page 24: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

OWL (Web Ontology Language)

Motivations Expressive ontology language Precise semantics

• Understanding of formal properties such as decidability and complexity of inferencing

Ease of use for a wide audience Influences

DAML+OIL (existing language) Description Logics Frames paradigm

Page 25: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Description Logics

“A formal language for representing knowledge and reasoning about it.”3

“Description”Primary facilities are concept and role

descriptions “Logics”

Equipped with a FOL-based formal semantics

Inference procedures reveal implied knowledge

Page 26: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

DL Knowledge Base

TBoxContains the terminology, i.e., the

vocabulary of an application domain, i.e., concepts and roles

Think database schema ABox

Contains the assertions about named individuals

Think database data

Page 27: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

In the TBox

Two symbols atomic concepts (A, B) atomic roles (R)

Complex descriptions through concept constructors (C, D) role constructors

Concept and role constructors determine: The expressiveness of the DL Decidability/complexity of reasoning

Page 28: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

C A | (atomic concept)

| (universal concept)

| (bottom concept)

A | (atomic negation)

C D | (intersection)

R.C | (value restriction)

R. (limited existential quant.)

DL Language ALA

E

Grammar for concept construction

Page 29: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

DL Language AL

Atomic concepts: Food, Pizza, CheeseToppingAtomic role: hasTopping

“Non-pizza foods”Food Pizza

“Pizza with only cheese toppings”Pizza hasTopping.CheeseTopping

“Pizza with some topping”Pizza hasTopping.

A

E

Page 30: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Name Syntax Symbol

Union C D U

Existential quant. R.C E

Unqual. cardinality restriction ≥ n R, ≤ n R, = n R N

Qual. cardinality restriction ≥ n R.C, ≤ n R.C, = n R.C Q

Role hierarchy R S H

Inverse properties R- I

Functional properties F

Nominals (enumeration) O

DL Concept/Role Constructors

E

Page 31: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Pizza(x) y.hasTopping(x,y) CheeseTopping(y)

Pizza(x) y.hasTopping(x,y)

Pizza(x) y.hasTopping(x,y) CheeseTopping(y)

Pizza hasTopping.CheeseTopping

Food(x) Pizza(x)

Food Pizza

FOL Representations

A

A

Pizza hasTopping.

E

E

Page 32: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

In the ABox

Assertions about individuals

Pizza(p1)Pizza(p2)CheeseTopping(mozzarella)CheeseTopping(goatCheese)hasTopping(p1, mozzerella)hasTopping(p1, goatCheese)hasTopping(p2, goatCheese)

Page 33: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Reasoning on the KB

TBox Concept satisfiability

• Are there possible instances?

Concept subsumption• Is concept C almost more general than D?

Concept equivalence ABox

Instance checking• Is individual X an instance of concept C?

Consistency checking• Is individual X a possible model of the TBox?

Page 34: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

From DL to OWL

OWL has three “species” or dialects OWL-DL equivalent to DL SHOIN(D) OWL-Lite equivalent to DL SHIF(D) OWL-Full -- messy...

Must adhere to constraints of the Web Standard XML syntax URIs for identifying concepts, roles, and

individuals Concrete datatypes based on XML Schema

Protégé Demo4

Page 35: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Adding Rules to OWL

Semantic Web Rule Language (SWRL)5

hasParent(?x,?y) hasBrother(?y,?z) hasUncle(?x,?z)

Artist(?x) artistStyle(?x, ?y) Style(?y) creator(?z,?x) style/period(?z, ?y)

Page 36: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Semantic Web Apps

DBPedia Convert Wikipedia articles into RDF

Freebase Wikipedia + semantic links

Twine Social networking + semantic web

GeoNames Ontology of geospatial semantic information

WordNet Ontology RDF/OWL representation of WordNet

Page 37: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

Discussion

Implications for LogicBlox?

Page 38: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

References

[1] Manola, et. al. RDF Primer. http://www.w3.org/TR/rdf-primer/

[2] McGuiness, et al. OWL Web Ontology Language Overview. http://www.w3.org/TR/owl-features/

[3] Baader, et. al. The Description Logic Handbook, 2007.

[4] Horridge, at. al. A Practical Guide To Building OWL Ontologies Using the Protégé-OWL Plugin and CO-ODE Tools, Edition 1.0., 2004. http://www.co-ode.org/resources/tutorials/ProtegeOWLTutorial.pdf

[5] Horrocks, et. al. SWRL: A Semantic Web Rule Language Combining OWL and RuleML. http://www.w3.org/Submission/SWRL/

Page 39: Foundations of the Semantic Web Rocky Dunlap College of Computing Georgia Tech Advisors: Spencer Rugaber Leo Mark

AE