text analysis and visualisation: an overview of tools · summit 1 jul 2014 more analyze data...

18
Text Analysis and Visualisation: An Overview of Tools

Upload: others

Post on 01-Oct-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome

Text Analysis and Visualisation:

An Overview of Tools

Page 2: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome
Page 3: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome
Page 4: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome
Page 5: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome
Page 6: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome
Page 7: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome
Page 8: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome

□ Segmentation or tokenisation □ Often based on the fact that there are generally

spaces in between words □ Types are the unique words in a document;

tokens are the total number of words

He cried in a whisper at some image, at some vision,--he cried out twice, a cry that was no more than a breath-- 'The horror! The horror!‘

28 tokens and 21 types

Studies based on vocabulary

Page 9: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome

Frequency lists

Frequency list produced using TaporWare

Page 10: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome

Stopword filtering

Frequency list produced using TaporWare

Page 11: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome

Type-token ratio

Graph produced using R

Page 12: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome

Concordances

Concordance produced using AntConc

Page 14: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome

Collocation

List produced using TaporWare

List produced using AntConc

Page 15: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome

Co-occurrence

List produced using TaporWare

Page 17: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome

Information extraction

Page 18: Text Analysis and Visualisation: An Overview of Tools · Summit 1 Jul 2014 more Analyze data Analyze texts Author an interactive work ... Dutch encroacnee ane Inn . About Welcome

Conclusions

□ Text analysis tools may produce new views on the text

□ There are caveats; Tools are based on assumptions on how texts ought to be analysed

□ Customisations of existing tools are generally needed for more specific research question

□ Identification of appropriate tools via library

support