a sprinkling of key words
DESCRIPTION
A Sprinkling of Key Words. Mike Scott Aston University June 30, 2010. Issues: Key words (KWs). Keyness Aboutness Distribution patterns of KWs. complex pattern. or simple. fractal?. Fractal. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/1.jpg)
A Sprinkling of Key Words
Mike ScottAston UniversityJune 30, 2010
![Page 2: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/2.jpg)
![Page 3: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/3.jpg)
![Page 4: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/4.jpg)
Issues: Key words (KWs)
Keyness AboutnessDistribution patterns of KWs
![Page 5: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/5.jpg)
complex pattern
![Page 6: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/6.jpg)
or simple
![Page 7: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/7.jpg)
fractal?
![Page 8: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/8.jpg)
Fractal
• A fractal is "a rough or fragmented geometric shape that can be split into parts, each of which is (at least approximately) a reduced-size copy of the whole,"[1] a property called self-similarity – (Wikipedia)– [1] Mandelbrot, B.B. (1982). The Fractal
Geometry of Nature. W.H. Freeman and Company.
![Page 9: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/9.jpg)
![Page 10: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/10.jpg)
Keyness
• aboutness
• importance
• a textual category
![Page 11: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/11.jpg)
![Page 12: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/12.jpg)
aboutness
• what the text is about
• what the message is
• what it all means
• picture from mindreadersdictionary.com
![Page 13: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/13.jpg)
importance
centrality
![Page 14: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/14.jpg)
Context
• Claimp by Maya Goldblum
• New Designers 07
![Page 15: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/15.jpg)
Impoverished context
• Dandelion Light by Sunghwa Jang
• New Designers 07
![Page 16: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/16.jpg)
Levels of Context
P hysica l env ironm ent
![Page 17: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/17.jpg)
Identification of KWs: criteria
simple verbatim repetitionno allowance for anaphora, synonymy, antonymy etc.thresholdone word, or more than one?
![Page 18: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/18.jpg)
Corpus-bound or corpus-driven?
• Machine-identified keyness is ideal for corpus-driven research
• The researcher lets the PC suggest areas needing further chasing up
• See recent work by McEnery, Baker, etc. and Nelia Scott 1998
![Page 19: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/19.jpg)
Research Questions
How are the KWs of Bleak House distributed? Are the KWs of different kinds (nouns/verbs … character/place/style words) distributed differently?Do the KWs of the chapters reflect the pattern of the whole text but on a smaller scale?
![Page 20: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/20.jpg)
Bleak House
published 1852-3(20 monthly instalments)
350,000 wordsPreface + 66 Chapters
![Page 21: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/21.jpg)
reference corpus
9 million words52 novels,
29 other 19th Century authors23 Dickens
![Page 22: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/22.jpg)
Procedures
• download Bleak House (Gutenberg Project)
• separate each chapter as a separate file
• create a wordlist of the reference corpus
• create a wordlist of the whole of Bleak House
• create a batch of wordlists, one of each chapter of Bleak House
ref. corpus
BH
![Page 23: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/23.jpg)
KW Procedures
• Compute KW list of the whole novel
• Compute batch of KW lists, one for each chapter
![Page 24: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/24.jpg)
Overall Results
Over 300 positive KWs for the whole novelAbout 70 negative KWs including God (half as frequent as in 19th C literature overall)
![Page 25: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/25.jpg)
Excel• spreadsheet constructed at
the same time as the batch of KW files
http:\\www.lexically.net\downloads\corpus_linguistics\Bleak_House.xls
fewer characters in first chapters
pronouns are sprinkled
![Page 26: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/26.jpg)
Chapter by Chapter
Average of 23 KWs per chapter – same settings, same reference corpus (19th C Lit.)Per chapter: minimum 5, maximum 38.
![Page 27: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/27.jpg)
Chapter by chapter variationKW Categories
0%
20%
40%
60%
80%
100%
120%
1 4 7 10 13 16 19 22 25 28 31 34 37 40 43 46 49 52 55 58 61 64 67
Other
Characters
Places
Pronouns
Titles
![Page 28: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/28.jpg)
Global KWs
![Page 29: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/29.jpg)
Local KWs
![Page 30: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/30.jpg)
middling burstiness
• verbsappears
begins
puts
observes
replies
continues
says
considers
etc.
![Page 31: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/31.jpg)
Preliminary findings
• All chapters have KWs• Individual chapters differ considerably in their
KWs• because KWs are not all global• Character KWs enter the novel gradually• Pronouns and verbs present in many sections
but absent in many too– not much to do with aboutness– middling level of burstiness
• KWs of different kinds are distributed differently
![Page 32: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/32.jpg)
Preliminary conclusion
• KWs of the chapters do not simply reflect the pattern of the whole text but on a smaller scale
• Keyness is not fractal
![Page 33: A Sprinkling of Key Words](https://reader036.vdocuments.us/reader036/viewer/2022062518/568140c0550346895dac877b/html5/thumbnails/33.jpg)
References• Baker, P., Gabrielatos C., Khosravinik, M., Krzyzanowski, M.,
McEnery, T. & Wodak, R., 2008. A useful methodological synergy? Combining critical discourse analysis and corpus linguistics to examine discourses of refugees and asylum seekers in the UK press. Discourse & Society 19(3), 273-305.
• McEnery, Tony, 2009. "Keywords and Moral Panics: Mary Whitehouse and Media Censorship". in Dawn Archer (ed.) What's in a Word-list? Investigating word frequency and keyword extraction. Farnham: Ashgate, 93-124.
• Scott, M. Nelia, 1998, Normalisation and Readers' Expectations: A Study of Literary Translation with Reference to Lispector's A Hora da Estrela. Liverpool: Unpublished PhD thesis, University of Liverpool.