diagnostics – part ii using statistical tests to check to see if the assumptions we made about the...

Diagnostics – Part II

Using statistical tests to check to see if the assumptions we made about the

model are realistic

Diagnostic methods

• Some simple (but subjective) plots. (Then)

• Some formal statistical tests. (Now)

Simple linear regression model

• Error terms have mean 0, i.e., E(i) = 0.

i and j are uncorrelated (independent).

• Error terms have same variance, i.e., Var(i) = 2.

• Error terms i are normally distributed.

The response Yi is a function of a systematic linear component and a random error component:

iii XY 10

with assumptions that:

Why should we keep NAGGING ourselves about the model?

• All of the estimates, confidence intervals, prediction intervals, hypothesis tests, etc. have been developed assuming that the model is correct.

• If the model is incorrect, then the formulas and methods we use are at risk of being incorrect. (Some are more forgiving than others.)

Summary of the tests we’ll learn …

• Durbin-Watson test for detecting correlated (adjacent) error terms.

• Modified Levene test for constant error variance.

• (Ryan-Joiner) correlation test for normality of error terms.

The Durbin-Watson test for uncorrelated (adjacent) error terms

n

tt

n

ttt

e

eeD

1

2

2

21

Durbin-Watson test statistic

Compare D to Durbin-Watson test bounds in Table B.7:

• If D > upper bound (dU), conclude no correlation.

• If D < lower bound (dL), conclude positive correlation.

• If D is between the two bounds, the test is inconclusive.

Example: Blaisdell Company

170160150140130

29

28

27

26

25

24

23

22

21

Industry Sales

Co

mpa

ny S

ales

($

mill

ions

)

S = 0.0860563 R-Sq = 99.9 % R-Sq(adj) = 99.9 %

Company = -1.45475 + 0.176283 Industry

Regression Plot

($ millions)

Seasonally adjusted quarterly data, 1988 to 1992.

Reasonable fit, but are the error terms positively auto-correlated?

2018161412108642

0.2

0.1

0.0

-0.1

Observation Order

Re

sid

ual

Residuals Versus the Order of the Data(response is Company)

Blaisdell Company Example: Durbin-Watson test

• Stat >> Regression >> Regression. Under Options…, select Durbin-Watson statistic.

• Durbin-Watson statistic = 0.73

• Table B.7 with level of significance α=0.01, (p-1)=1 predictor variable, and n=20 (5 years, 4 quarters each) gives dL= 0.95 and dU=1.15.

• Since D=0.73 < dL=0.95, conclude error terms are positively auto-correlated.

For completeness’ sake … one more thing about Durbin-Watson test

• If test for negative auto-correlation is desired, use D*=4-D instead. If D* < dL, then conclude error terms are negatively auto-correlated.

• If two-sided test is desired (both positive and negative auto-correlation possible), conduct both one-sided tests, D and D*, separately. Level of significance is then 2α.

Modified Levene Test for nonconstant error variance

• Divide the data set into two roughly equal-sized groups, based on the level of X.

• If the error variance is either increasing or decreasing with X, the absolute deviations of the residuals around their group median will be larger for one of the two groups.

• Two-sample t* to test whether mean of absolute deviations for one group differs significantly from mean of absolute deviations for second group.

Modified Levene Test in Minitab

• Use Manip >> Code >> Numeric to numeric … to create a GROUP variable based on the values of X.

• Stat >> Regression >> Regression. Under Storage …, select residuals.

• Stat >> Basic statistics >> 2 Variances … Specify Samples (RESI1) and Subscripts (GROUP). Select OK. Look in session window for Levene P-value.

Example: How is plutonium activity related to alpha particle counts?

2010 0

0.15

0.10

0.05

0.00

plutonium

alph

a

S = 0.0125713 R-Sq = 91.6 % R-Sq(adj) = 91.2 %

alpha = 0.0070331 + 0.0055370 plutonium

Regression Plot

A residual versus fits plot suggesting non-constant error variance

0.120.100.080.060.040.020.00

0.03

0.02

0.01

0.00

-0.01

-0.02

-0.03

-0.04

Fitted Value

Re

sid

ual

Residuals Versus the Fitted Values(response is alpha)

Plutonium Alpha Example: Modified Levene’s Test

Levene's Test (any continuous distribution) Test Statistic: 9.452P-Value : 0.006

It is highly unlikely (P=0.006) that we’d get such an extreme Levene statistic (L=9.452) if the variances of the two groups were equal.

Reject the null hypothesis at the 0.01 level, and conclude that the error variances are not constant.

(Ryan-Joiner) Correlation test for normality of error terms in Minitab

• H0: Error terms are normally distributed vs. HA: Error terms are not normally distributed

• Stat >> Regression >> Regression. Under storage…, select residuals.

• Stat >> Basic statistics >> Normality Test. Select residuals (RESI1) and request Ryan-Joiner test. Select OK.

100 chi-square (1 df) data values

1050

40

30

20

10

0

chi

Per

cent

Normal probability plot and test for 100 chi-square (1 df) data values

P-Value (approx): < 0.0100R: 0.8301W-test for Normality

N: 100StDev: 1.56173Average: 1.10426

9876543210

.999

.99

.95

.80

.50

.20

.05

.01

.001

Pro

babi

lity

chi

Normal Probability Plot

100 normal(0,1) data values

2.52.01.51.00.5-0.0-0.5-1.0-1.5-2.0-2.5

20

10

0

normal

Per

cent

Normal probability plot and test for 100 normal(0,1) data values

P-Value (approx): > 0.1000R: 0.9956W-test for Normality

N: 100StDev: 1.02403Average: 0.137793

20-2

.999

.99

.95

.80

.50

.20

.05

.01

.001

Pro

babi

lity

normal


Normal probability plot for Tree diameter (X) and C-dating Age (Y)

6004002000-200-400

2

1

0

-1

-2

No

rmal

Sco

re

Residual

Normal Probability Plot of the Residuals(response is Age)

Tree diameter and Age Example: Ryan-Joiner Correlation Test

P-Value (approx): < 0.0100R: 0.9225W-test for Normality

N: 20StDev: 280.985Average: -0.0000000

6004002000-200

.999

.99

.95

.80

.50

.20

.05

.01

.001

Pro

babi

lity

RESI1


Some closing comments

• Checking of assumptions is important, but be aware of the “robustness” of your methods, so you don’t get too hung up.

• Model checking is an art as well as a science.

• Do not think that there is some definitive correct answer “in the back of the book.”

• Use your knowledge of the subject matter.

diagnostics – part ii using statistical tests to check to see if the assumptions we made about the...

Documents

error terms i

d lower bound d

durbinwatson test bounds

normality of error terms

twosided test

d upper bound d u

realistic slide

modified levene test