timo honkela: an introduction to text mining
TRANSCRIPT
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Timo Honkela
Modeling Meaning and Knowledge25 Apr 2016
An introduction totext mining
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Data mining
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Data mining tasks(Hand, Mannila & Smyth 2001)
● Exploratory data analysis● Descriptive modeling● Prescriptive modeling:
classification and regression● Discovering patterns and rules● Retrieval by content
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Text mining
http://www.intechopen.com/books/theory-and-applications-for-advanced-text-mining
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Text mining
● Finding structures and relations at different levels of abstraction
● Study of distributions, trends and correlations● Text classification and clustering● Entity extraction● Authorship analysis● Sentiment analysis● etc. etc.
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Application areas of text mining
● Digital humanities– Sociology
– History
– Literature
– Law
● Knowledge management● Customer relationship management (CRM)● Competence management
– Archeology
– Linguistics
– Religion
– Philosophy
● Remember also– Medicine
– Psychology
– Geology
– etc.
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Examples using the SOM
● Art museum visitorsPockets full of memories: an interactive museum installationG Legrady, T HonkelaVisual Communication 1 (2), 163-169
● PoetryIn search for volta: Statistical analysis of word patterns in Shakespeare's sonnetsO Kohonen, S Katajamäki, T Honkela.Proceedings of AMKLC'05, International Symposium on Adaptive Models of Knowledge, Language and Cognition, pages 44–47, Finland
● Religious cognitionCounterintuitiveness as the hallmark of religiosityI Pyysiäinen, M Lindeman, T HonkelaReligion 33 (4), 341-355
● CompetenceDocument maps for competence managementT Honkela, R Nordfors, R TuuliProceedings of the Symposium on Professional Practice in AI, 31-39
Dimensionality reductionVisualizationAbstraction
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
New projects
● Digital Mindscapes: Mining social media(Jussi Pakkasvirta, Krista Lagus, Mika Pantzar, Minna Ruckenstein, etc.)
http://www.aka.fi/globalassets/32akatemiaohjelmat/digihum/citizen-mindscapes-digihum-starts_3-vain-luku.pdf
● Computational History 1640–1910: Mining newspapers(Mikko Tolonen, Kimmo Kettunen, Hannu Salmi, Tapio Salakoski, etc.)
http://www.aka.fi/globalassets/32akatemiaohjelmat/digihum/comhis-presentation-logomo-22-march-2016.pdf
In many casesa supportinginfrastructureis FIN-CLARIN