caveon webinar series - unlocking the mystery of the validity triangle 11-2014

27
Caveon Webinar Series 1 Unlocking the Mystery of the Validity Triangle Is Test Security Required for Test Score Validity? Dr. Phil Dickison, Ph.D. Chief Officer of Exams NCSBN Dennis Maynes Chief Scientist Caveon Test Security November 19, 2014

Upload: caveon-test-security

Post on 05-Jul-2015

170 views

Category:

Education


2 download

DESCRIPTION

Is Test Security Required for Test Score Validity? We're pleased to announce that Dr. Philip Dickison, the Director of Examinations at the National Council of State Boards of Nursing (NCSBN), will join us as a special guest for this month's session of the Caveon Webinar Series. Eleven years ago (when Caveon was founded), many measurement professionals viewed test security as an intriguing anachronism. At that time, we were determined to "catch" cheaters by analyzing test data with statistics. Since then, many measurement professionals have agreed that test security is foundational to test score validity. You cannot trust the test scores when the security of the exam is suspect. At Caveon, we have learned that using statistics to catch cheaters was fraught with scientific, operational, and perhaps most daunting, legal challenges. Many measurement professionals now use statistics to assess the validity of test scores. They focus on test takers' scores, rather than cheating behaviors. This evolution was crystalized recently when Dr. Dickison presented his "Validity Triangle" model in a Plenary Session during last month's Conference on Test Security. Please join Dr. Dickison and Caveon Chief Scientist Dennis Maynes for an important exploration on the use of statistics in supporting test score validity. This session will: • Provide a timeline of important milestones in the use of Data Forensics in supporting Test Security • Share invaluable "lessons learned" (often, the hard way) in implementing a Data Forensics program • Discuss the opportunities and challenges Data Forensics can present to program directors • Present an important evolution in gauging test score validity

TRANSCRIPT

Page 1: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Caveon Webinar Series

1

Unlocking the Mystery of the

Validity Triangle

Is Test Security Required for

Test Score Validity?

Dr. Phil Dickison, Ph.D.Chief Officer of Exams

NCSBN

Dennis MaynesChief Scientist

Caveon Test Security

November 19, 2014

Page 2: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Agenda for Today

• Psychometrician’s Dilemma

• Cheater’s Philosophy

• What can Statistics Show?

• “The Validity Triangle”

• Old Paradigm

• New Paradigm

• Important Conclusions

2

Page 3: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

What do you think?

Can statistics be used to

address security concerns?

3

Page 4: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Psychometrician’s Dilemma

4

Page 5: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

1977

It would never be feasible to prove that

cheating occurred based only on [a

statistic].

• Frary, Tideman, Watts

• Journal of Educational and Behavioral

Statistics 1977 2(235)

5

Page 6: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Offering a Retest – Frary, et. al.

• When individuals produce test results

that are not believed to be reliable

indicators of their abilities, they should

retake the test.

• The modest, undeserved burden that

would occasionally fall on entirely

innocent persons is a small price to

pay for the reduction in ill-gotten

gains.

6

Page 7: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

2012

The proposed [statistical analysis]

should never be used blindly or relied

on as the only evidence of cheating.

Although it has the full rigor of a

statistical [analysis], it does make Type

I and II errors.

• -van der Linden & Jeon

• Journal of Educational and Behavioral

Statistics 2012 37(1)

7

Page 8: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Use of Statistics—van der Linden &

Jeon

A statistical test of cheating should always:

• be used as part of a larger suite of

procedures

• use independent sources of information, for

instance:

– the responses on the test items,

– the response times,

– external evidence,

– as well as the source addressed in this

research.

8

Page 9: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

2014

We are trying to estimate

the probability of being

a cheater given that the

detection method singles

us out as one.

• Wainer, Test Fraud, eds.

Kingston & Clark

9

Page 10: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

2013

“In my opinion, most conversations about cheating lack clarity regarding the kind of cheating we are talking about…”

“As a professional testing specialist, it is simply not incumbent upon me to prove that cheating has occurred for me to express my psychometric concerns or cautions about score validity.”

• Greg Cizek, NCSA Presentation

10

Page 11: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Cizek, con’t

“Whereas I used to think that it was nearly always important to have more than one source of evidence, I now lean more toward accepting a single strong piece of evidence as reasonable for pursuing further investigation of suspected testing impropriety. “

“Sometimes, a highly improbable finding simply demands further scrutiny.”

Re-published with permission here:

http://www.caveon.com/tilsa-test-security-guidebook-next-steps/

11

Page 12: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Summary

• You can’t prove cheating using statistics

• Statistical decisions are not perfect– Some cheaters will go unpunished

– Some innocents will be wrongly accuse

• Statistical propositions have not been explicitly stated– This has led to talking “around” test security

issues instead of directly “addressing” them

12

Page 13: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

13

Cheater’s Philosophy

Page 14: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

January 2011: Dr. Phil Show

Episode: “Whatever it Takes”Jolie Fitch - Steinmetz Academic Decathlon Scandal

•"Is there any question that you cheated?“ Dr. Phil asks Jolie.

• "No," she replies.

•"You knew you were cheating when you cheated.“

• “Yes," Jolie answers. "We got a copy of the test

questions."

14

Page 15: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Jolie justifies herself

• Dr. Phil: "You got caught. … Are you sorry about

this?“

• Jolie: "No, not really.

Other than getting caught.“

• Dr. Phil: "What do you tell

your kids about cheating?“

• Jolie:

– "I tell them that life is a gray area.”

– “I would do it again.”

– “There is always someone who can give you the

answers.”

– “It wasn’t unfair because others are doing it.”15

Page 16: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Cheater’s Philosophy

• I do not accept your moral code, therefore your argument is without merit.– Amoral vs Immoral

• If you think I cheated, prove it!

• If you can’t prove it, you can’t call me a cheater.

• It works to the advantage of the cheater when you use the label “Cheater.”

16

Page 17: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

What Can Statistics Show?

17

Page 18: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Statistical Propositions #1

• The tests were not taken independently

• The test was completed inhumanly fast

• The test was completed using stolen

answer keys

• The test performance was inconsistent

– With respect to scores

– With respect to response time

18

Page 19: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Statistical Propositions #2

• The wrong-to-right answer changes were

atypical and inexplicable.

• The observed pattern of test taking through

time was inconsistent with normal test

taking.

19

Page 20: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Inferences about Cheating

• Must demonstrate means, motive &

opportunity

• Must demonstrate intent

• Must demonstrate culpability or

responsibility

• Must discard other viable explanations

20

Page 21: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Proper Inferences

• Explicitly list propositions

• Use relevant data & statistics

– To make inferences about scores use statistics

that provide information about scores

– To make inferences about behavior use

statistics that provide information about

behavior

• Findings are conditioned upon the

propositions

21

Page 22: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Dr. Phil DickisonNCSBN

22

Page 23: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Security - The Validity Paradigm

The Traditional Paradigm

Content

Psychometrics

Secu

rity

Act

ivit

y

Secu

rity

Act

ivit

y

Secu

rity

Act

ivit

y

Secu

rity

Act

ivit

y

Secu

rity

Act

ivit

y

23

Page 24: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Security - The Validity Paradigm

The New Paradigm

Security

24

Page 25: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Challenges of the New Paradigm

• Awareness of securities role in the validity argument

• Switching the focus from catching cheaters to releasing valid results

• Meaningful and visible standards

• Better use of Data Forensics

• Lack of a robust specialized professional field related to test security

25

Page 26: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Conclusions

“I would urge us to reframe our concerns

about test data integrity not as cheating

concerns, but as a validity issue.”

“It is important to treat each similarly situated

case the same way, and a coherent,

comprehensive set of policies and

procedures, uniformly applied, is essential.”

– Greg Cizek, NCSA, 2013

http://www.caveon.com/tilsa-test-security-guidebook-next-steps/

26

Page 27: Caveon Webinar Series - Unlocking the Mystery of the Validity Triangle 11-2014

Thank you!

- Follow Caveon on twitter @caveon

- Check out our blog…www.caveon.com/blog

- LinkedIn Group – “Caveon Test Security”

Phil Dickison, [email protected]

Dennis Maynes, Caveon Test

[email protected]

27