Timo Honkela: An introduction to text mining



Page 1: Timo Honkela: An introduction to text mining

Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016

Timo Honkela

Modeling Meaning and Knowledge
25 Apr 2016

[email protected]

An introduction to text mining

Page 2: Timo Honkela: An introduction to text mining

Data mining

Page 3: Timo Honkela: An introduction to text mining

Data mining tasks (Hand, Mannila & Smyth 2001)

● Exploratory data analysis
● Descriptive modeling
● Predictive modeling: classification and regression
● Discovering patterns and rules
● Retrieval by content

Page 4: Timo Honkela: An introduction to text mining

Text mining

http://www.intechopen.com/books/theory-and-applications-for-advanced-text-mining

Page 5: Timo Honkela: An introduction to text mining

Text mining

● Finding structures and relations at different levels of abstraction
● Study of distributions, trends and correlations
● Text classification and clustering
● Entity extraction
● Authorship analysis
● Sentiment analysis
● etc.
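Several of the tasks listed above (studying word distributions, text classification and clustering) start from the same step: turning each text into a vector of word counts and comparing the vectors. The following is a minimal sketch of that idea, not a method taken from the slides. The toy documents, the function names, and the simple regular-expression tokenizer are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def term_frequencies(text):
    """Count how often each token occurs in one document."""
    return Counter(tokenize(text))


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical toy documents, invented for illustration only.
docs = {
    "news_1": "The parliament debated the new tax law",
    "news_2": "A new law on tax was debated in parliament",
    "poem_1": "Shall I compare thee to a summer's day",
}

vectors = {name: term_frequencies(text) for name, text in docs.items()}

# Compare one pair on the same topic and one pair on different topics.
for x, y in [("news_1", "news_2"), ("news_1", "poem_1")]:
    print(x, y, round(cosine_similarity(vectors[x], vectors[y]), 3))
```

The two news items share most of their content words and so score higher than the news item and the poem, which share none. A clustering or classification method would typically build on pairwise similarities of this kind, usually with better tokenization and term weighting than this sketch uses.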

Page 6: Timo Honkela: An introduction to text mining

Application areas of text mining

● Digital humanities
– Sociology
– History
– Literature
– Law
– Archeology
– Linguistics
– Religion
– Philosophy

● Knowledge management
● Customer relationship management (CRM)
● Competence management

● Remember also
– Medicine
– Psychology
– Geology
– etc.

Page 7: Timo Honkela: An introduction to text mining

Examples using the SOM

● Art museum visitors
Pockets full of memories: an interactive museum installation. G Legrady, T Honkela. Visual Communication 1 (2), 163-169

● Poetry
In search for volta: Statistical analysis of word patterns in Shakespeare's sonnets. O Kohonen, S Katajamäki, T Honkela. Proceedings of AMKLC'05, International Symposium on Adaptive Models of Knowledge, Language and Cognition, pages 44–47, Finland

● Religious cognition
Counterintuitiveness as the hallmark of religiosity. I Pyysiäinen, M Lindeman, T Honkela. Religion 33 (4), 341-355

● Competence
Document maps for competence management. T Honkela, R Nordfors, R Tuuli. Proceedings of the Symposium on Professional Practice in AI, 31-39

Dimensionality reduction, visualization, abstraction
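To make the role of the SOM (self-organizing map) as a dimensionality-reduction and visualization tool more concrete, here is a minimal sketch of the basic training loop in plain Python. It is not the implementation used in the studies cited above. The grid size, learning-rate schedule, neighbourhood function, and the toy three-dimensional vectors are assumptions chosen for illustration. In a text mining setting the input vectors would typically be document or word feature vectors.

```python
import math
import random


def best_matching_unit(weights, x):
    """Return the grid cell whose weight vector is closest to x."""
    return min(
        weights,
        key=lambda cell: sum((w - v) ** 2 for w, v in zip(weights[cell], x)),
    )


def train_som(data, rows, cols, epochs=200, lr0=0.5, seed=0):
    """Train a small rectangular SOM; returns {(row, col): weight_vector}."""
    rng = random.Random(seed)
    dim = len(data[0])
    # One weight vector per grid cell, initialised randomly in [0, 1).
    weights = {
        (r, c): [rng.random() for _ in range(dim)]
        for r in range(rows)
        for c in range(cols)
    }
    radius0 = max(rows, cols) / 2.0
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                    # learning rate decays linearly
        radius = max(radius0 * (1.0 - frac), 0.5)  # neighbourhood shrinks over time
        for x in rng.sample(data, len(data)):      # visit samples in random order
            bmu = best_matching_unit(weights, x)
            for cell, w in weights.items():
                grid_d2 = (cell[0] - bmu[0]) ** 2 + (cell[1] - bmu[1]) ** 2
                h = math.exp(-grid_d2 / (2.0 * radius ** 2))  # Gaussian neighbourhood
                for i in range(dim):
                    w[i] += lr * h * (x[i] - w[i])
    return weights


# Hypothetical toy data: two groups of 3-dimensional vectors.
group_a = [[0.9, 0.1, 0.0], [1.0, 0.0, 0.1]]
group_b = [[0.0, 0.1, 0.9], [0.1, 0.0, 1.0]]
data = group_a + group_b

weights = train_som(data, rows=3, cols=3)

# Each input is reduced to a 2-D grid position, which is what gets visualized.
for x in data:
    print(x, "->", best_matching_unit(weights, x))
```

Each input vector ends up represented by a cell on the two-dimensional grid. Similar inputs tend to land on the same or neighbouring cells, which is what makes the map usable as a visual overview of a collection.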

Page 8: Timo Honkela: An introduction to text mining

New projects

● Digital Mindscapes: Mining social media (Jussi Pakkasvirta, Krista Lagus, Mika Pantzar, Minna Ruckenstein, etc.)

http://www.aka.fi/globalassets/32akatemiaohjelmat/digihum/citizen-mindscapes-digihum-starts_3-vain-luku.pdf

● Computational History 1640–1910: Mining newspapers (Mikko Tolonen, Kimmo Kettunen, Hannu Salmi, Tapio Salakoski, etc.)

http://www.aka.fi/globalassets/32akatemiaohjelmat/digihum/comhis-presentation-logomo-22-march-2016.pdf

In many cases a supporting infrastructure is FIN-CLARIN