project report: semantic portal business and economics

22
Semantic Portal Business and Economics – Project Report NKOS Workshop September 19 th 2008 Aarhus, Denmark Project Report: Semantic Portal Business and Economics Kai Eckert Computer Science Institute University of Mannheim Germany Magnus Pfeffer University Library University of Mannheim Germany

Upload: tristram-davies

Post on 31-Dec-2015

24 views

Category:

Documents


0 download

DESCRIPTION

NKOS Workshop September 19 th 2008 Aarhus, Denmark. Project Report: Semantic Portal Business and Economics. Kai Eckert Computer Science Institute University of Mannheim Germany. Magnus Pfeffer University Library University of Mannheim Germany. Project Goal. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Project Report: Semantic Portal Business and Economics

Semantic Portal Business and Economics – Project Report

NKOS WorkshopSeptember 19th 2008

Aarhus, Denmark

Project Report:Semantic Portal Business and Economics

Kai EckertComputer Science Institute

University of MannheimGermany

Magnus PfefferUniversity Library

University of MannheimGermany

Page 2: Project Report: Semantic Portal Business and Economics

2/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Project Goal

Creating a OPAC+ Library Search Enginge

Content Library media All licenced fulltext documents Focus on economics

Modern user interface Thesaurus-based search and retrieval Drill-down using facets Support multiple thesauri

Page 3: Project Report: Semantic Portal Business and Economics

3/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Research Topics

Automatic indexing in the field of economics Thesaurus-based user search interfaces Multi-thesaurus indexing and search

Page 4: Project Report: Semantic Portal Business and Economics

4/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Current Status

Prototype indexing system

Elsevier journal articles STW Thesaurus Collexis Search Engine

Datasets

Automatic indexing results Manually indexed articles as gold standard

Page 5: Project Report: Semantic Portal Business and Economics

5/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Automatic Indexing Assessment

Precision and recall comparison

Meaningless numbers on the macro level Tedious on the micro level

Visual analysis using Semtinel

Per concept IC-Diff analysis Treemap for navigation Easy identification of critical concepts

Page 6: Project Report: Semantic Portal Business and Economics

6/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

IC Diff Analysis with Semtinel

Page 7: Project Report: Semantic Portal Business and Economics

7/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Automatic Indexing Assessment cont.

Editing of example critical thesaurus concepts

Lack of sysnonyms Insufficient disamgibuation Overly broad concepts

Reindexing

Improved Precision and recall

Page 8: Project Report: Semantic Portal Business and Economics

8/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Further Steps

Analysis and Semtinel Tool

Improve framework (SKOS loader) Document based analysis methods

Multi-Thesaurus Retrieval

Multiple indexes Merging multiple thesauri UI Design

Page 9: Project Report: Semantic Portal Business and Economics

9/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Further Steps cont.

Prototype retrieval system

Collexis engine and user interface User study

Integration into library systems

Representation using RDF and DC Evaluation of Ex Libris “Primo” product

Page 10: Project Report: Semantic Portal Business and Economics

10/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Open Questions

How can one judge indexing results? Is our approach reasonable?

More ideas or use-cases for Semtinel? Feature-Requests? (e.g. Ontology-Editor, ...)

Page 11: Project Report: Semantic Portal Business and Economics

11/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Thank you for your attention.

[email protected]

[email protected]

Page 12: Project Report: Semantic Portal Business and Economics

12/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Additional Slides

Page 13: Project Report: Semantic Portal Business and Economics

13/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

IC Diff Analysis

D IC c = IC c − IIC c

IC c=−log P c IIC c =−log hypoc1max

Information Content:Proposed by ResnikDepends on Frequency in Document Base

Intrinsic Information Content:Proposed by Seco, Veale und HayesBased on the Number of Subconcepts

Intuitive: A value between -1 and 1 that says, if a concept has a suspicious frequency regarding its position in the thesaurus.

Page 14: Project Report: Semantic Portal Business and Economics

14/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Semtinel Workbench

Page 15: Project Report: Semantic Portal Business and Economics

15/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Semtinel API

STWSKOS

CSV CDSKEA

Pubmed

Access

ConnectorFramework

ThesaurusViewer

CoreOverview

IC Diff

Collexis

RVKMeSH

SemtinelCore

I/OFramework

AnalysisFramework IC, IIC

Children

Frequency

GUIFramework

TreemapVisualizer

Page 16: Project Report: Semantic Portal Business and Economics

16/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Intrinsic Information Content

Page 17: Project Report: Semantic Portal Business and Economics

17/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Information Content

Page 18: Project Report: Semantic Portal Business and Economics

18/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

IC Diff

Page 19: Project Report: Semantic Portal Business and Economics

19/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Bioscience

Page 20: Project Report: Semantic Portal Business and Economics

20/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Organisms

Page 21: Project Report: Semantic Portal Business and Economics

21/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Animals

Page 22: Project Report: Semantic Portal Business and Economics

22/11Kai Eckert and Magnus Pfeffer

Semantic Portal Business and Economics – Project Report

Persons