glocall 2015 globalization and localization in computer-assisted language learning the future of...

78
GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal www.LEXTUTOR.ca

Upload: noel-dickerson

Post on 18-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

GLOCALL 2015

Globalization and Localization in Computer-Assisted Language Learning

The future of Vocabprofiling

Tom Cobb

Université du Québec à Montréal

www.LEXTUTOR.ca

Page 2: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

GLOCALL 2015

Globalization and Localization in Computer-Assisted Language Learning

The future & meaning of Vocabprofiling

Tom Cobb

Université du Québec à Montréal

www.LEXTUTOR.ca

Page 3: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

3

Backgrounder

• Vocabprofiling is an activity performed on my website Lextutor

• www.lextutor.ca

• Which is probably the thing I am best known for + the reason you have asked me here

• So the framing question of my presentation is this:

Page 4: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

4

Is Lextutor a bunch of software, or a coherent theory of language acquisition?

Page 5: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

5

A clue.

Page 6: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

6

Take 2 on a framing issue with a bit of “localization”

• Korean learners are interested in vocabulary lists?

• Don’t try to stop this interest

• Instead give them the right lists and a good way to use them

Page 7: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

7

First some scene-setting stats

So A LOT of people use this

Page 8: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

8

Korea always in daily Top 10 user countries

Page 9: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

9

And elsewhere in GloCall land...

Page 10: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

10

Page 11: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

11

At a rate of…

Page 12: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

12

What do these folks look at ?Basically every routine gets significant use

Page 13: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

13

Page 14: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

14

What do these folks look at ?But a few routines get the lion’s share

VOCABPROFILEgroup

VOCAB TESTS

CONCORDANCEgroup

Page 15: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

15

What do these folks look at ?But a few routines get the lion’s share

VOCABPROFILEgroup

VOCAB TESTS

CONCORDANCEgroup

Page 16: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

16

What do these folks look at ?But a few routines get the lion’s share

VOCABPROFILEgroup

VOCAB TESTS

CONCORDANCEgroup

Page 17: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

17

So what is Vocabprofiling?

(A.k.a. VP-ing)

=> a procedure for matching learner to text through word frequency

Page 18: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

18

What is word frequency?What is concordancing?

Page 19: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

19

Page 20: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

20

Word frequency in a general corpus

=Best predictor of word

knowledge

Page 21: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

21

Which eventually gives us…

Etc.

Page 22: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

22

…From which we can determine

The frequency of the words the learners know

vs.

the frequency of the words in a given text

Page 23: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

23

1. The frequency of the words the learners know

Page 24: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

24

2. The frequency of the words in a given textThere are strict time limits on the detention of persons without charge. An arrested person may not be detained without charge for more than 24 hours, unless a serious arrestable offence has been committed. If a serious arrestable offence has been committed a superintendent can extend the period to 36 hours to secure or preserve evidence by continued questioning. When a serious arrestable offence has been committed and the suspect needs to be held in custody beyond the 36 hour period, the police must bring the suspect before a magistrate to extend the time limit to a maximum of 60 hours.

Page 25: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

25

So when we know the profile of the learner,

+ the profile of the text,

+ that 95% of words must be known for minimal comprehension

…we can make some fairly reliable and very useful predictions

Page 26: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

26

Using this simple multiplication formula

Learner profile• K1 - 70% x• K2 - 50% x• K3 - 40% x• K4 - 20% x

Text profile• K1 - 80% =• K2 - 10% =• K3 - 5% =• K4 - 5% = --------------

100%

Predicted % of words comprehended

• K1 - 56%• K2 - 5%• K3 - 2%• K4 - 1% ------------

64%

With <95% of words known, even just to level of passive meaning recognition, comprehension will be minimal

Page 27: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

27

Have you ever seen something like this in you learners reading

materials?

Page 28: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

28

Case 1: Can this learner read this text ?

CHANGE THIS: START WITH IMPOSSIBLE CASE

There are strict time limits on the detention of persons without charge. An arrested person may not be detained without charge for more than 24 hours, unless a serious arrestable offence has been committed. If a serious arrestable offence has been committed a superintendent can extend the period to 36 hours to secure or preserve evidence by continued questioning. When a serious arrestable offence has been committed and the suspect needs to be held in custody beyond the 36 hour period, the police must bring the suspect before a magistrate to extend the time limit to a maximum of 60 hours.

Page 29: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

29

Yes

The learner knows all the words sampled at 1k-2k-3k

The text is 95% 1k-3k words, so he probably knows all those

The learner knows > ½ the words 4k-5kThe text is 5% of 4k-5k words

The learner will know > half of these = 3%

The learner knows 95+3=98% of words in the text

Page 30: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

30

Case 2: Can this learner read this text ?

There are strict time limits on the detention of persons without charge. An arrested person may not be detained without charge for more than 24 hours, unless a serious arrestable offence has been committed. If a serious arrestable offence has been committed a superintendent can extend the period to 36 hours to secure or preserve evidence by continued questioning. When a serious arrestable offence has been committed and the suspect needs to be held in custody beyond the 36 hour period, the police must bring the suspect before a magistrate to extend the time limit to a maximum of 60 hours.

Page 31: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

31

This learner will know most of the k1-k2 words (90%), but half or less k3 and up

But 10% of words in this text are k3 and up!

This learner will know about 93% of the words

Page 32: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

32

Here is this text with 93% of words known

lextutor.ca/cloze/vp/

Page 33: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

33

So what can this learner do with this text?

First let us nuance the 95% empirical finding

Page 34: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

34

From empirical research, we know that…

With <90% of words known, comprehension is difficult/impossible

With 90-95% known, comprehension is possible with resources Dictionary, discussion, content lecturer, seminar discussion…

With >95% known, (1) independent reading is feasible

(2) between 95-98%, fluency will improve

Page 35: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

35

So, again, can this learner read this text ?There are strict time limits on the detention of persons without charge. An arrested person may not be detained without charge for more than 24 hours, unless a serious arrestable offence has been committed. If a serious arrestable offence has been committed a superintendent can extend the period to 36 hours to secure or preserve evidence by continued questioning. When a serious arrestable offence has been committed and the suspect needs to be held in custody beyond the 36 hour period, the police must bring the suspect before a magistrate to extend the time limit to a maximum of 60 hours.

YES for a with-resources intensive reading task

NO for a reading examination

NO if the goal is fluency development

Page 36: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

36

Case 3: Can this learner read this text ?

There are strict time limits on the detention of persons without charge. An arrested person may not be detained without charge for more than 24 hours, unless a serious arrestable offence has been committed. If a serious arrestable offence has been committed a superintendent can extend the period to 36 hours to secure or preserve evidence by continued questioning. When a serious arrestable offence has been committed and the suspect needs to be held in custody beyond the 36 hour period, the police must bring the suspect before a magistrate to extend the time limit to a maximum of 60 hours.

Page 37: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

37

The learner knows 70% of K1 wordsThe text is 74% k1 words

He will know about .7 x .7= 50% of k1 words

He knows a handful after k1The text is 27% after k1

At most he has 20% of theseSo he will know .27x.2= 5.4% of 2k-5k words

So he will know < 60% of words in the text

Page 38: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

38

Here is this text with 68% of words known…

lextutor.ca/cloze/vp/

Page 39: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

39

What happens if you give this text to this learner?

There are strict time limits on the detention of persons without charge. An arrested person may not be detained without charge for more than 24 hours, unless a serious arrestable offence has been committed. If a serious arrestable offence has been committed a superintendent can extend the period to 36 hours to secure or preserve evidence by continued questioning. When a serious arrestable offence has been committed and the suspect needs to be held in custody beyond the 36 hour period, the police must bring the suspect before a magistrate to extend the time limit to a maximum of 60 hours.

Page 40: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

40

Have you ever seen something like this in you learners readers?

This is an, um… undesigned reading experience

With what objective?

Page 41: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

41

Don’t believe this works?

Test this idea with your learners

Here are 7 handy pre-profiled texts, with 95% cut-offs at different k-

levels

(pic is link)

http://www.lextutor.ca/vp/comp/samples.html

Page 42: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

42

So with Testing+VP we can find level-appropriate and task-appropriate texts

Only find?

Where will we find interesting texts for beginnners?

With VP we can also create such texts

Page 43: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

43

Page 44: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

44

Page 45: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

45

Original Edited-to-a-profile

Page 46: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

46

Plus this neat little feature

Page 47: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

47

Ongoing VP work

• BNC lists supplemented by BNC-Coca– Brit and US English

• Frequency joined by Greco-Latin / Anglo-Saxon indicators– A.k.a. Multi- and single syllable words

• Incorporation of Multi-word units– “a lot” should be one k-1 unit, – not one k-1 + one k-3

• Differentiation of homoforms– River bank and money bank should be two words at two k-levels

• AND FINALLY Move to Mobile

Page 48: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

48

Page 49: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

49

The final vocab lists we need

1,000 List 3,000 List ?something course… …… something_of_a

bank_1 …… …

of_course bank_2

Page 50: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

50

General move to phrases in Lextutor

Page 51: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

51

So the Future of Vocabprofiling?

Ever more sophisticated analysis, without leaving teacher behind

Cut some words in two (bank)And some words into phrases (a lot)

Allow teachers to build collective library of texts on Lextutor, by topic x and 95% k-level y

Adapt VP as Web search engine to find texts of keywords x, y, z and VP k1=x, k2=y, k3=z

with Google API

Page 52: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

52

Need more?<<new

yesterday>> – Horst &

Cobb chapter on Vping in…

Page 53: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

53

So we have seen some useful VP tips and tricks from Lextutor

• But is it more?• Thesis:

Text computing + language acquisition are tightly connected in 2015

ALL LEXTUTOR ROUTINES ARE INSTANCES OF ‘DATA DRIVEN LANGUAGE LEARNING’ (DD-LL)

Page 54: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

54

DD-LL in the broader scheme

Page 55: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

55

DD-LL in the broader scheme

Page 56: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

56

DD-LL in the broader scheme

Page 57: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

57

Input as data

Large supplies of ‘Comprehensible input’ in real life are not simple to come by

While waiting, ‘Comprehensible computer input’ can do some of the job + create readiness

We saw VP predict text readability

Computing can also make language comprehensible in several other ways

Page 58: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

58

Concordance1. For word meaning

If you can’t infer a meaning in one context, maybe in another

http://www.lextutor.ca/hyp/1/

Page 59: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

59

Concordance2. To expose collocations

Page 60: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

60

Concordance3. For what does not exist in a language

Page 61: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

61

Concordance4. As data-linked writing tool

Page 62: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

62

Page 63: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

63

TTS To (1) slow speech down and (2) allow repetition

Generated straight from text without pre-recording

Page 64: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

64

Code-link texts to external resource Many generated by algos straight from text without recording

http://www.lextutor.ca/hyp/2/

Page 65: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

65

So if ‘comprehensible input’ is a theory of language acquisition

•Then Data-Driven Language-Learning is a theory about how to assure a reliable supply of this– Esp where NSs are in short supply

• And our best guess yet as to a principled use of the computer in language learning

Page 66: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

66

But wait! Does DD-LL ‘work’?• Ongoing meta-analysis of a wide range of

DDLL application with the great Alex Boulton

Page 67: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

67

205 recent DDL studies investigated56 studies selected for analysis

(selected for pre-post or exptl-control quantitative design)

Page 68: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

68

149 studies thrown out for…

• Small n-size• Unreported or uncalculable Std Deviations• Plagiarism of other studies/ students’ PhDs• Math errors

Leaving these in the final line-up

Page 69: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

69

Page 70: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

Many of the 56 From GloCALL zone

Nam2010; Yoon & Jo, 2014

70

Page 71: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

71

A wide range of RQ’sDoes DDL help with grammar in academic writing?Does DDL help with synonym use in academic writing?Does DDL help with preposition choice in academic writing?Does DDL help with writing in academic writing?Does DDL help with word collocations in academic writing?Does DDL help with word connotations in academic writing?Does DDL help with phraseology in academic writing?Does DDL help with spelling in academic writing?Does "corpus-based collocation instruction" help with collocations?Does DDL help with lexical knowledge (definitions and translations)?Does DDL help with interpreting?Does DDL help with lexical knowledge (definitions and translations)?Does DDL help with interpreting?Does DDL help with lexical knowledge (definitions and translations)?Does DDL help with interpreting?

etc…

Page 72: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

72

Boiled down to a common measure: e.s.(= comparing means in light of overall

standard deviation)

Page 73: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

73

So, over 56 good studies,pre vs. post means,

or control vs. experimental means are

an average 1.46 Std Devs apart

Page 74: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

So e.s.=1.46 would be something like this(about a 15% difference with typical SDs)

Group A (Control or Pre)• 69• 69• 68• 64• 69• 87

M = 71SD = 8

Group B (Experimental or Post)• 83• 84• 85• 86• 87• 70

M = 82.5SD = 6

74

e.s. = 11.5 / 7 = 1.5

Page 75: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

75

So DD-LL ‘works’ for many learning objectives

The only problem is, usually in some amazingly adapted version of concordancing

Which can be found where?Not exactly the delight of commercial app-developers

Obviously I see Lextutor as providing accessible DDL softwareThrough a sustained program of continuous development

A particular example

Page 76: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

76

Page 78: GLOCALL 2015 Globalization and Localization in Computer-Assisted Language Learning The future of Vocabprofiling Tom Cobb Université du Québec à Montréal

78

The EndThank You

Questions now, or via<[email protected]>

• Tom Cobb

• Université du Québec à Montréal

• www.LEXTUTOR.ca