Intelligent Database Systems Lab

Presenter: CHANG, SHIH-JIE

Authors: Kevin Meijer, Flavius Frasincar, Frederik Hogenboom

2014.DSS.

A semantic approach for extracting domain taxonomies from text

Outline

• Motivation

• Objectives

• Methodology

• Experiments

• Conclusions

• Comments

Motivation

• Manually creating a taxonomy is a difficult and time-consuming process, and the result may not be of high quality.

Objectives

• This paper presents a framework that uses a semantic approach to build a domain taxonomy automatically, called Automatic Taxonomy Construction from Text (ATCT).

Methodology

Methodology – term filtering

lexical cohesion

domain pertinence

domain consensus

domain score of term t in domain corpus Di
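
The slide names the three filters, but their formulas did not survive in this transcript. The sketch below shows one way such a domain score could be computed. It assumes definitions common in the term-extraction literature (frequency relative to the peak domain for pertinence, entropy over documents for consensus, a frequency-weighted ratio for cohesion) and an equal-weight linear combination. The function names, the weights `alpha`, `beta`, `gamma`, and the corpus layout (each document is a list of candidate-term strings) are all hypothetical and are not taken from the slides.

```python
import math

def term_freq(term, docs):
    """Count occurrences of a candidate term across a list of documents."""
    return sum(doc.count(term) for doc in docs)

def domain_pertinence(term, corpora, target):
    """Assumed definition: frequency in the target domain divided by the
    highest frequency of the term in any domain."""
    peak = max(term_freq(term, docs) for docs in corpora.values())
    return term_freq(term, corpora[target]) / peak if peak else 0.0

def domain_consensus(term, docs):
    """Assumed definition: entropy of the term's distribution over the
    documents of one domain (higher means more evenly used)."""
    counts = [doc.count(term) for doc in docs]
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log(c / total) for c in counts if c)

def lexical_cohesion(term, docs):
    """Assumed definition: n * f * log(f) / sum of word frequencies,
    where n is the number of words in the term and f its frequency."""
    words = term.split()
    f = term_freq(term, docs)
    word_total = sum(t.split().count(w) for doc in docs for t in doc for w in words)
    if f == 0 or word_total == 0:
        return 0.0
    return len(words) * f * math.log(f) / word_total

def domain_score(term, corpora, target, alpha=1/3, beta=1/3, gamma=1/3):
    """Hypothetical linear combination of the three filters."""
    docs = corpora[target]
    return (alpha * domain_pertinence(term, corpora, target)
            + beta * domain_consensus(term, docs)
            + gamma * lexical_cohesion(term, docs))

# Toy corpora, invented for illustration only.
corpora = {
    "economics": [["price", "price", "market"], ["price", "market"]],
    "sports": [["price", "goal"]],
}
print(domain_score("price", corpora, "economics"))
```

Terms whose score falls below a chosen threshold would then be filtered out; the threshold itself is another parameter the transcript does not give.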

Word sense disambiguation (WSD) on text corpora

WSD on existing taxonomies

Methodology – Concept hierarchy creation

score(pricing | 'pricing behavior') = 0.6 + 1/2 * 0.4 + 1/3 * 0.3 = 0.9

score(trading | 'pricing behavior') = 0.7 + 1/2 * 0.3 = 0.85
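
The arithmetic on this slide follows a simple pattern: the k-th value in each sum is weighted by 1/k. A minimal sketch of that pattern is given below. Reading the values as probabilities for a candidate parent and its successive ancestors is an assumption, since the transcript does not say what each number stands for, and the helper name `weighted_score` is hypothetical.

```python
def weighted_score(values):
    """Sum the values, weighting the k-th value (1-based) by 1/k."""
    return sum(v / k for k, v in enumerate(values, start=1))

# Values copied from the slide's two examples.
pricing = weighted_score([0.6, 0.4, 0.3])
trading = weighted_score([0.7, 0.3])

# The candidate with the higher score would be preferred as the parent.
best = "pricing" if pricing > trading else "trading"
print(round(pricing, 2), round(trading, 2), best)
```

Up to floating-point rounding this reproduces the slide's 0.9 and 0.85, so 'pricing' scores higher than 'trading' for 'pricing behavior'.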

Implementation – WSD result

Implementation – hierarchy creation result

Experiments

semantic precision

semantic recall
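
The formulas for these two measures are missing from the transcript. The sketch below assumes the usual set-overlap reading: precision is the share of concepts in the built (core) taxonomy that also appear in the reference taxonomy, and recall is the share of reference concepts that were found. The function names and the toy concept sets are hypothetical.

```python
def semantic_precision(core, reference):
    """Assumed definition: |core ∩ reference| / |core|."""
    core, reference = set(core), set(reference)
    return len(core & reference) / len(core) if core else 0.0

def semantic_recall(core, reference):
    """Assumed definition: |core ∩ reference| / |reference|."""
    core, reference = set(core), set(reference)
    return len(core & reference) / len(reference) if reference else 0.0

# Toy concept sets, invented for illustration only.
core_concepts = {"price", "market", "trade", "goal"}
reference_concepts = {"price", "market", "inflation"}

print(semantic_precision(core_concepts, reference_concepts))
print(semantic_recall(core_concepts, reference_concepts))
```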

Experiments – core taxonomy vs. reference taxonomy

Experiments – two measures

taxonomic precision

taxonomic recall

global taxonomic precision

global taxonomic recall

taxonomic F-measure
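
The definitions behind these measures are not preserved in the transcript. As a rough sketch, the code below assumes a cotopy-based formulation: for each concept shared by both taxonomies, compare its set of ancestors and descendants (plus the concept itself) in the core taxonomy with the same set in the reference taxonomy, then average over the shared concepts. Representing a taxonomy as a child-to-parent dictionary, including the concept in its own cotopy, and all function names are assumptions made for illustration.

```python
def concepts(parent):
    """All concepts in a taxonomy given as a child -> parent mapping."""
    return set(parent) | set(parent.values())

def ancestors(c, parent):
    """Follow parent links upward (assumes the mapping has no cycles)."""
    out = set()
    while c in parent:
        c = parent[c]
        out.add(c)
    return out

def cotopy(c, parent, shared):
    """Concept, its ancestors and descendants, limited to shared concepts."""
    desc = {d for d in concepts(parent) if c in ancestors(d, parent)}
    return ({c} | ancestors(c, parent) | desc) & shared

def taxonomic_precision(core, reference):
    """Global precision: mean local overlap over the shared concepts."""
    shared = concepts(core) & concepts(reference)
    if not shared:
        return 0.0
    total = 0.0
    for c in shared:
        ce_core = cotopy(c, core, shared)
        ce_ref = cotopy(c, reference, shared)
        total += len(ce_core & ce_ref) / len(ce_core)
    return total / len(shared)

def taxonomic_recall(core, reference):
    """Recall is precision with the roles of the taxonomies swapped."""
    return taxonomic_precision(reference, core)

def taxonomic_f_measure(core, reference):
    """Harmonic mean of global taxonomic precision and recall."""
    p, r = taxonomic_precision(core, reference), taxonomic_recall(core, reference)
    return 2 * p * r / (p + r) if p + r else 0.0

# Toy taxonomies, invented for illustration only.
core_tax = {"pricing": "economics", "trading": "economics"}
ref_tax = {"pricing": "economics", "trading": "pricing"}
print(taxonomic_precision(core_tax, ref_tax),
      taxonomic_recall(core_tax, ref_tax),
      taxonomic_f_measure(core_tax, ref_tax))
```

In this toy case the core taxonomy is flatter than the reference, so every relation it asserts is also in the reference (high precision) while some reference relations are missed (lower recall).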

Conclusions

– The ATCT framework can be successfully applied to domains other than economics and management.

– The approach works well in capturing the broader–narrower relation between concepts.

Comments

• Advantages

– Defines concepts well.

• Applications

– Taxonomy building, term extraction and filtering.