nick saville bridging the gap between theory and practice ealta krakow may 2006 investigating the...

36
Nick Saville Bridging the gap between theory and practice EALTA Krakow May 2006 Investigating the impact of language assessment systems within a state educational context

Upload: daniel-harvey

Post on 01-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Nick Saville

Bridging the gap between theory and practice

EALTA KrakowMay 2006

Investigating the impact of language assessment systems within a state educational context

Investigating the impact of language assessment systems within a state educational context

Nick Saville

Bridging the gap between theory and practice

EALTA KrakowMay 2006

• Background - a personal perspective• 1980’s

• Bachman – early 1990s

• The literature on washback/impact• early work and recent progress

• gaps? where next?

• Analysis of three case studies – what can be learnt?

• Towards a comprehensive model of impact

• Applying the model in a state educational context• the Asset Languages Project

Outline

The 1980s – a personal perspective

• assessment in Italian universities• entrance exams in Japan• the influence of TOEIC/TOEFL e.g. in Japan/Korea• developing Cambridge exams

• Tests affect individuals and society!• How can this be managed better?• What is needed to “do a better job”?

Background

V

R Practicality?

Test

Background – 1987-1990 : Japan

Considerations in developing fair tests

The art of the possible

Practicality

V

R P

Test

“Practicality in Language Testing: an educational management model”

Main argument: test development is a form of educational

innovation - and needs to be managed as such

“... achieving a balance between the purpose of the test, its validity for the purpose, the required reliability for the purpose and the constraints

imposed by the context is essentially the task facing the test designer ….”

Saville (1990), University of Reading - based on test development project Japan, 1987-9

Practicality

V

R P

Test

Aspects of Practicality within a

context and educational setting:

•Acceptability•Applicability•Availability•Difficulty•Economy•Interpretability•Relevance•Replicability

“… a principled approach to Practicality should provide the test designer with the means of approaching test development so that a suitable balance can be achieved without overlooking factors which cause possible solutions to fall down in practice”.

Putting the test into context

V

R P

Test

The aim … is not only to encourage good testing practice, but to prevent bad tests being produced ....

... a bad test is not only one with low reliability and dubious validity but also one which has a damaging backwash on the curriculum.

Saville 1990:11-13

A logical consequence …. is that ethicality will be achieved as a result ..

……. this is because any test which is produced should be appropriate to the educational context in which it is to be used and the effect on learners and institutions will be a major consideration.

Putting the test into context

V

R P

Test

Impact Ripples

V

R P

Test

Impact Ripples

V

R P

Test

I

Local Impact

“micro” level

Impact Ripples

V

R P

Test

II

Wider Impact

(“macro”

level) II

U = V + R + I + P

Bachman - Cambridge 1990/91

Usefulness as overall validity

U = V + R + I + P

Bachman and Palmer, 1996 : U = Cv + A + I + R + I + P

Developing “useful tests”, fit for purpose

Balancing the test qualities

Usefulness as overall validity

Starting to develop a model

1993 – 1995

Using VRIP to develop and revise exams

e.g. IELTS 1995 • The IELTS impact project

The literature on washback/impact

Readings in the language testing literature:

• Hamp-Lyons (1989)• Wall and Alderson (1993) Does washback exist? Etc..• Bailey (1996)• Hamp-Lyons (1997)• Watanabe (1997)• Cheng and Watanabe (eds) (2004)

• Recent PhD studies and subsequent books based on research conducted in the 1990s:

• Cheng (SILT 21 - 2005)• Wall (SILT 23 - 2005)• Green (2004 – SILT forthcoming 2007)

• Hawkey – SILT 24 (forthcoming - 2006)

• Current work in Lancaster, ETS, UCLA, Cambridge etc.

The literature on washback/impact

So• Impact is relatively new in the field of language assessment - an

extension on the notion of washback and related to ethicality• It is now considered to be of growing importance• It is part of a validity argument and evidence needs to be provided

Broadly speaking there is consensus • impact deals with wider influences and includes the “macro contexts”

- tests and examinations in society• washback is an aspect of impact related to the “micro contexts” of

the classroom and the school BUT

The dynamics between the micro and macro contexts mean that this is a complex rather than a simple relationship

- a “complex dynamic system”

The literature on washback/impact

And currently:• there is no comprehensive model of test or

examination impact within educational contexts

• impact has not yet been fully integrated into an approach to test development and validation in a systematic way

Three case studies – 1995 to 2004

Case 1 - the world-wide survey of the impact of IELTS• a starting point for the work and the original model for what has followed• a conceptualisation of impact and design/validation of suitable

instruments to investigate it

Case 2 - the Italian PL2000 project• an application of the model within a macro educational context• an initial attempt at the applying the approach on a limited basis within a

state educational context

Case 3 - the Florence Learning Gains Project• an extension and re-application of the model within in a single school

context • at the micro level focusing on individual stakeholders within a single

language teaching institution

Learning from the case studies

What can be learned using these specific impact projects as meta-data?

Learning from the case studies

Three key factors of contemporary educational systems need to be accounted for:

1. the nature of complex dynamic systems

2. the roles that stakeholders play within such systems

3. the need to see assessment projects as educational innovations within the systems and to manage change effectively

Group

Country Culture Politics L1 Role of L2 Model of L2

RegionUrban/rural

Wealthy/poor

School Sector

Public/private

Cycle Primary Middle Upper

MACROCONTEXT

Learnerand

Teacher

Class

CommunityDemographic make up

Context inEducation:A complex,dynamic system

Individudal Differences: Demographic Socio-Psychological Strategic Prior Knowledge/learning

1. the nature of complex dynamic systems

Context

Stakeholders in the Testing Community

Government

agencies

Professional

bodies

Learners

Teachers, Heads

School owners

Test writers

Consultants

Examiners

Test centre

administrators

Materials writers

Publisher

inter alia

Learners

Parents/carers

Teachers, Heads

School owners

Receiving institutions

Government agencies

Professional bodies

Employers

Academic researchers

inter alia)

Input to test design Context of test use -

provided by stakeholders where decisions are made by

stakeholders using test scores

Cambridge ESOL

Test construct

Test format Test conditions Test assessment criteria

Test scores

Testing system

2. the roles that stakeholders play

See Wall (2005)

a case study using insights from testing and innovation theory

E.g. Henrichsen (1989)

3. the need to see assessment projects as educational innovations and to manage change effectively

Hybrid Model of the Diffusion / Implementation Process

Antecedents Process Consequences

Learning from the case studies

When applied to language assessment – two key factors also need to be accounted for :

1. the nature of language itself as a socio-cognitive phenomenon (the latest views on validity)

2. the nature of the test development and validation process• from conception to routine data collection and analysis

Impact research, therefore is no different from any other kind of validation activity........

Theory Test Taking Context

TT CONTEXT TLU Learning context Context of score use

1. A SOCIO-COGNITIVE FRAMEWORK

MessickBachmanKaneMislevyWeiretc.

Theory Test Taking Context

TT CONTEXT TLU Learning context Context of score use

A SOCIO-COGNITIVE FRAMEWORK

The testing system

Construct

Theory Test Taking Context

TT CONTEXT TLU Learning context Context of score use

The contexts

Learning contexts

Testingcontexts

Use of resultscontexts

Theory Test Taking Context

TT CONTEXT TLU Learning context Context of score use

Impact

2. Model of the Test Development Process

Identifying stakeholders and their needs

Linking these needs to the requirements of test usefulness - including predicted impact

- theoretical

- practical

Long term, Iterative Processes -a key feature of validation

Model of the Test Development Process

Involvement of the stakeholder constituency

E.g. during test design and development

presentation and consultation to do with specifications and detailed syllabus designs

professional support programmes for institutions and individual teachers/students etc. who plan to use the examinations

training and employment of suitable personnel within the field to work on all aspects of the examination cycle – to be question/item writers, to act as examiners, etc.

After an examination becomes operational

Procedures also need to be in place to routinely collect data which allows impact to be estimated:

e.g.

• who is taking the examination • (i.e. a profile of the candidates)

• who is using the examination results and for what purpose

• who is teaching towards the examination and under what circumstances

• what kinds of courses and materials are being designed and used to prepare candidates

• what effect the examination has on public perceptions generally

• (e.g. regarding educational standards)

• how the examination is viewed by those directly involved in educational processes

• (e.g. by students, examination takers, teachers, parents, etc.)

• how the examination is viewed by members of society outside education

• (e.g. by politicians, business people, etc.)

Towards a comprehensive model

How can these considerations be combined to produce a comprehensive, integrated model?

Next phase: applying the model

Asset Languages within the UK educational context

Welcome to Asset Languages

Asset Languages is a new way of motivating languages learners and rewarding their language skills.

The Asset Languages assessment scheme is for language learners of all ages and abilities: from primary school through to further, higher and adult education. Asset Languages is being developed by Cambridge Assessment through OCR and Cambridge ESOL, as part of the DfES' National Languages Strategy.

Contacts:

[email protected]

www.cambridgeesol.org/rs_notes

www.assetlanguages.org.uk