reliability chapter 3. classical test theory every observed score is a combination of true score...

16
Reliability Chapter 3

Upload: cleopatra-simon

Post on 31-Dec-2015

213 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Reliability

Chapter 3

Page 2: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Classical Test Theory

Every observed score is a combination of true score plus error.

Obs. = T + E

yreliabilitss

ss

O

T

O

E 2

2

2

2

1

Page 3: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Reliability Systematic versus unsystematic

error

Reliability only measures unsystematic error

Page 4: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Correlation

Correlation is a statistical technique that is often used in estimating reliability

Correlation coefficient: a numerical indicator of the relationship between two sets of data.

Page 5: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Positive Correlation

0

10

20

30

40

50

60

70

80

90

100

0 10 20 30 40 50 60 70 80 90 100

Series1

Page 6: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Negative Correlation

0

10

20

30

40

50

60

70

80

90

100

0 10 20 30 40 50 60 70 80 90 100

Series1

Page 7: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Pearson-Product MomentCorrelation

r

1z 2zN

Page 8: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Types of Reliability

Test-Retest

Alternate or Parallel Forms

Internal Consistency Measures

Page 9: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Internal Consistency Measures

Split-half reliability Spearman-Brown formula

Kuder-Richardson formulas KR 20 KR 21

Coefficient Alpha

Page 10: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Non-typical Situations

Speed tests

Criterion-referenced tests

Page 11: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Evaluating Reliability Coefficients

Examine purpose for using instrument

Have knowledge about the reliability coefficients of other instruments in area

Examine characteristics of particular clients against reliability coefficients

SES age culture/ethnicity

Page 12: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Standard Error of Measurement

rsSEM 1

Provides an estimation of the range of scores if someone were to take an instrument over and over again.

Based on the premise that when individuals take a test multiple times, the scores fall into a normal distribution.

Page 13: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Example of SEM

Sam’s SAT Verbal = 550 r = .91; s = 100

SEM =

68% of the time, Sam’s true score would fall between 520 and 580 95% of the time, Sam’s true score would fall between 490 and 610 99.5% of the time, Sam’s true score would fall between 460 and

640

100 1 .91

100 .09

100 .3 30

Page 14: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Using SEM to evaluate a score

Page 15: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Standard Error of Difference

A measure used by a counselor to examine the difference between two scores and determine if there is a significant difference.

Page 16: Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E

Alternative Theoretical Model

Generalizability or Domain Sampling Theory

Focus is on estimating the extent to which specific sources of variation under defined conditions are contributing to the score on the instrument.