A Comprehensive Framework for Semantic Annotation of Web Content

KEOD 2013 – 20th September 2013

Manuel Fiorelli (1), Maria Teresa Pazienza (2), Armando Stellato (2)
(1) Department of Civil Engineering and Computer Science, (2) Department of Enterprise Engineering
University of Rome, Tor Vergata, Italy
{fiorelli,pazienza,stellato}@info.uniroma2.it



Outline
• Annotation in action
• The problem
• Our proposal
• Our choices
  • Combined management of ontologies and Web content
  • Event-based behavioural model
  • Convention over configuration
• End-user customizability
• Conclusions

Annotation in action
Annotation is a pervasive activity depending on different assumptions about granularity, level of formality and the use, if any, of ontologies.

• Collaborative authoring of documents: free-text notes by co-authors
• NLP tools: structured annotations with weak semantics
• Semantic Web: annotations with respect to formalized knowledge to make explicit the meaning of Web resources (even Web services)

The problem

The ultimate annotation system is unlikely to appear:

• Even comprehensive annotation models (e.g. the draft produced by the Open Annotation W3C Community Group) may not cover all usage scenarios

• Divergent methodologies: automatic vs manual annotation

• Different applications: NLP, Web Annotation, …

The problem

[Figure: curve plotted against two axes, "Design Choices" and "Supported applications", with the labels "General Purpose Programming Language" and "Full system Specification".]

Don’t take the convexity too strictly :-D

Our proposal

[Figure: the "Design Choices" / "Supported applications" diagram again, with the labels "General Purpose Programming Language" and "Full system Specification" and an added "framework" label.]

Low upfront commitment to specific design choices

Progressive levels of customization/implementation

Our choices

• Narrowed Scope: applications dealing with combined management of ontologies and Web content.
• Behavioural Model: assignment of handlers to predetermined events. We call an assignment of semantically coherent handlers an annotation family.
• Convention over Configuration: to ease the development of common interaction patterns.

Combined management of ontologies and Web content

• Interaction with the domain model
• Interaction with the Web content
• Cross-realm interactions

http://semanticturkey.uniroma2.it/

Semantic Turkey

Semantic Turkey Architecture
Three-layered extensible architecture

• Presentation Layer
  • An extension to one of the Mozilla applications
  • User interface based on XUL, XBL and JavaScript technologies
  • Not-only-presentation
  • Extensible via the Mozilla extension mechanism
• Services Layer
  • Enables communication between the client (Firefox browser extension) and the ontology persistence layer
  • HTTP-based services accessed through the Ajax paradigm
  • OSGi extensible servicing system
• Persistence Layer
  • Access to ontological knowledge
  • Based on a dedicated ontology API, which can be implemented through the use of different technologies

Event-based behavioural model
From UI gestures (e.g. drag&drop) to events.

Creation of annotations
• selectionOverResource: fired when a selection from a Web page is dropped onto an ontological resource
• resourceOverContent: fired upon gestures for the association of Web content with an ontological resource regardless of their occurrence in the text

Presentation of annotations
• contentLoaded: triggered when Web content is loaded, in order to execute presentation-related activities, e.g. highlighting the annotated fragments
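
The step from gestures to events can be sketched as a small dispatcher. This is only an illustration in Python, not the framework's code (the Semantic Turkey client is a JavaScript browser extension): the three event names come from the slide, while `EventBus`, `register`, `fire` and the payload fields are hypothetical.

```python
from collections import defaultdict
from typing import Callable, Dict, List

# The three event names are taken from the slide; everything else is hypothetical.
EVENTS = ("selectionOverResource", "resourceOverContent", "contentLoaded")

Handler = Callable[[dict], str]


class EventBus:
    """Routes framework events to the handlers registered for them."""

    def __init__(self) -> None:
        self._handlers: Dict[str, List[Handler]] = defaultdict(list)

    def register(self, event: str, handler: Handler) -> None:
        if event not in EVENTS:
            raise ValueError(f"unknown event: {event}")
        self._handlers[event].append(handler)

    def fire(self, event: str, payload: dict) -> List[str]:
        """Call every handler registered for `event` and collect the results."""
        return [handler(payload) for handler in self._handlers[event]]


bus = EventBus()
# Creation of annotations: a text selection dropped onto an ontological resource.
bus.register(
    "selectionOverResource",
    lambda p: f"annotate {p['resource']} with '{p['selection']}'",
)
# Presentation of annotations: decorate the page once it is loaded.
bus.register("contentLoaded", lambda p: f"highlight annotations in {p['url']}")

results = bus.fire(
    "selectionOverResource",
    {"resource": "ex:Rome", "selection": "Rome", "url": "http://example.org/page"},
)
print(results)
```

The UI layer would translate a drag&drop gesture into a `fire` call; the handlers never need to know which gesture produced the event.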

Event-based behavioural model

Each family assigns handlers to the events:

Families                          Events
bookmarking                       selectionOverResource, resourceOverContent, contentLoaded
open annotation (coarse grain)    selectionOverResource, resourceOverContent, contentLoaded
open annotation (fine grain)      selectionOverResource, resourceOverContent, contentLoaded

Multiple handlers per event offer different choices to the user.

Events are agnostic about the resource type (e.g. class vs individual), but handlers have pre-conditions that may filter incoming events on the basis of the target resource type.
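
A minimal sketch of how pre-conditions might filter a type-agnostic event, assuming a hypothetical `AnnotationFamily` class and a `resource_type` field in the event payload (neither is part of the framework's documented API). The handler labels are borrowed from the predetermined handlers listed in this presentation; their grouping into one family is made up for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Handler:
    label: str                            # shown to the user when several handlers apply
    precondition: Callable[[dict], bool]  # decides whether the handler accepts an event


@dataclass
class AnnotationFamily:
    name: str
    handlers: Dict[str, List[Handler]] = field(default_factory=dict)

    def choices(self, event: str, payload: dict) -> List[str]:
        """Labels of the handlers whose pre-condition accepts this event."""
        return [
            h.label
            for h in self.handlers.get(event, [])
            if h.precondition(payload)
        ]


def targets(resource_type: str) -> Callable[[dict], bool]:
    """Pre-condition accepting only events aimed at the given resource type."""
    return lambda payload: payload["resource_type"] == resource_type


family = AnnotationFamily(
    name="example family",  # hypothetical family, for illustration only
    handlers={
        "selectionOverResource": [
            Handler("Add an annotation to a resource", lambda payload: True),
            Handler("Create a new instance", targets("class")),
            Handler("Set a property value", targets("individual")),
        ]
    },
)

on_class = family.choices(
    "selectionOverResource", {"resource": "ex:City", "resource_type": "class"}
)
on_individual = family.choices(
    "selectionOverResource", {"resource": "ex:Rome", "resource_type": "individual"}
)
print(on_class)
print(on_individual)
```

The same event is fired in both cases; only the set of handlers offered to the user changes with the target resource type.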

Convention over configuration
The event-based model makes almost no prior assumption about the application or the interaction patterns.

• Therefore, it is comprehensive

However, to help the developer, we provide a collection of predetermined handlers:

• Add an annotation to a resource
• Create a new instance
• Create a (sub-)concept
• Set a property value

These handlers are defined up to a few functions that the developer must implement.
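
One way to read "defined up to a few functions" is the template-method pattern: the convention fixes the flow and the developer fills in the family-specific steps. The sketch below assumes that reading; `AddAnnotationHandler`, `build_annotation`, `store` and `PageLevelHandler` are hypothetical names, not the framework's.

```python
from abc import ABC, abstractmethod
from typing import List


class AddAnnotationHandler(ABC):
    """Conventional 'add an annotation to a resource' handler.

    The overall flow is fixed here; subclasses supply only the
    family-specific steps.
    """

    def handle(self, resource: str, selection: str, url: str) -> dict:
        annotation = self.build_annotation(resource, selection, url)
        self.store(annotation)
        return annotation

    @abstractmethod
    def build_annotation(self, resource: str, selection: str, url: str) -> dict:
        """Decide what the annotation records (developer-provided)."""

    @abstractmethod
    def store(self, annotation: dict) -> None:
        """Persist the annotation (developer-provided)."""


class PageLevelHandler(AddAnnotationHandler):
    """Hypothetical family that records only the page, not the selected text."""

    def __init__(self) -> None:
        self.saved: List[dict] = []

    def build_annotation(self, resource: str, selection: str, url: str) -> dict:
        return {"resource": resource, "page": url}

    def store(self, annotation: dict) -> None:
        self.saved.append(annotation)


handler = PageLevelHandler()
annotation = handler.handle("ex:Rome", "Rome", "http://example.org/rome")
print(annotation)
```

A finer-grained family would override the same two functions, for example to keep the selected text and its position, without touching `handle`.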

Convention over configuration

[Sequence diagram. Participants: User, Browser Window, Web Document, Annotation Family. Messages and steps: contentLoaded, checkAnnotationsForContent, enableContentDecoration, click on annotation icon, decorateContent, getAnnotations, getDOMRange. Notes on decorateContent: DOM manipulations; inject references to operations on annotations, such as deletion, visualization, etc. Legend: Extension point.]
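
The interaction named in the diagram might be sketched as follows. The function names mirror the diagram's labels in Python naming style; the order of calls, the signatures, and the `InMemoryFamily` class are assumptions, and character offsets with bracket markers stand in for real DOM ranges and DOM manipulations.

```python
from typing import Dict, List, Tuple


class InMemoryFamily:
    """Hypothetical annotation family keeping annotations in a dict keyed by URL."""

    def __init__(self, annotations: Dict[str, List[dict]]) -> None:
        self._annotations = annotations

    def check_annotations_for_content(self, url: str) -> bool:
        return bool(self._annotations.get(url))

    def get_annotations(self, url: str) -> List[dict]:
        return self._annotations.get(url, [])

    def get_dom_range(self, annotation: dict, text: str) -> Tuple[int, int]:
        """Stand-in for a DOM range: character offsets of the annotated text."""
        start = text.index(annotation["text"])
        return start, start + len(annotation["text"])


def content_loaded(family: InMemoryFamily, url: str) -> bool:
    """Return True when the browser window should enable content decoration."""
    return family.check_annotations_for_content(url)


def decorate_content(family: InMemoryFamily, url: str, text: str) -> str:
    """Mark each annotated fragment; the brackets stand in for DOM manipulations."""
    ranges = sorted(
        (family.get_dom_range(a, text) for a in family.get_annotations(url)),
        reverse=True,
    )
    for start, end in ranges:  # right to left, so earlier offsets stay valid
        text = text[:start] + "[" + text[start:end] + "]" + text[end:]
    return text


url = "http://example.org/rome"
family = InMemoryFamily(
    {url: [{"text": "Rome", "resource": "ex:Rome"},
           {"text": "Italy", "resource": "ex:Italy"}]}
)
page = "Rome is the capital of Italy."

decoration_enabled = content_loaded(family, url)
decorated = decorate_content(family, url, page) if decoration_enabled else page
print(decorated)
```

Under this reading, the family supplies `get_annotations` and `get_dom_range`, while the decoration flow itself is shared across families.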

Convention over configuration

[Sequence diagram. Participants: User, Web Document, Annotation Family, Popup. Messages and steps: click on annotation, openPopup, choose alternative, then deleteAnnotation OR showAnnotation OR showAttachedResource. Legend: Extension point.]

End-user customizability
Most usage scenarios are concerned with only a subset of the possible interactions that a given family may offer.

Select a subset of the handlers

Strengthen the preconditions
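
Both customizations can be expressed as a transformation of a family's handler list. The sketch below is an assumption about how this could look, not the framework's mechanism: `customize`, `enabled` and `extra` are hypothetical names, and strengthening is modelled as a conjunction, so a customized handler never accepts an event the original would reject.

```python
from dataclasses import dataclass, replace
from typing import Callable, Dict, List, Set

Predicate = Callable[[dict], bool]


@dataclass(frozen=True)
class Handler:
    label: str
    precondition: Predicate


def _both(first: Predicate, second: Predicate) -> Predicate:
    """Conjunction of two pre-conditions."""
    return lambda payload: first(payload) and second(payload)


def customize(
    handlers: List[Handler], enabled: Set[str], extra: Dict[str, Predicate]
) -> List[Handler]:
    """Keep only the enabled handlers and strengthen their pre-conditions."""
    result = []
    for handler in handlers:
        if handler.label not in enabled:
            continue  # the user switched this handler off
        stricter = extra.get(handler.label)
        if stricter is not None:
            handler = replace(
                handler, precondition=_both(handler.precondition, stricter)
            )
        result.append(handler)
    return result


def choices(handlers: List[Handler], payload: dict) -> List[str]:
    return [h.label for h in handlers if h.precondition(payload)]


family_handlers = [
    Handler("Add an annotation to a resource", lambda p: True),
    Handler("Create a new instance", lambda p: p["resource_type"] == "class"),
    Handler("Set a property value", lambda p: p["resource_type"] == "individual"),
]

custom = customize(
    family_handlers,
    enabled={"Add an annotation to a resource", "Create a new instance"},
    # Hypothetical extra rule: only classes in the "ex:" namespace get instances.
    extra={"Create a new instance": lambda p: p["resource"].startswith("ex:")},
)

in_namespace = choices(custom, {"resource": "ex:City", "resource_type": "class"})
out_of_namespace = choices(custom, {"resource": "other:City", "resource_type": "class"})
print(in_namespace)
print(out_of_namespace)
```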

Future Developments
• Extend the set of available conventions
• Evaluate our framework with respect to the coverage of the features of state-of-the-art annotation systems
  • Is it possible to implement the identified features in our framework?
  • How much effort does it require?