statistics for managers using microsoft excel

58
12 - 1 © 1998 Prentice-Hall, Inc. Statistics for Managers Using Microsoft Excel, 1/e Statistics for Managers Using Microsoft Excel Multiple Regression Models Chapter 12 Learning Objectives Explain the linear multiple regression model Interpret linear multiple regression computer output Explain multicollinearity Describe the types of multiple regression models

Upload: chancellor-sexton

Post on 02-Jan-2016

67 views

Category:

Documents


0 download

DESCRIPTION

Statistics for Managers Using Microsoft Excel. Multiple Regression Models Chapter 12 Learning Objectives Explain the linear multiple regression model Interpret linear multiple regression computer output Explain multicollinearity Describe the types of multiple regression models. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Statistics for Managers Using Microsoft Excel

12 - 1

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Statistics for Managers Using Microsoft Excel

Multiple Regression Models Chapter 12

Learning Objectives Explain the linear multiple regression model

Interpret linear multiple regression computer output

Explain multicollinearity Describe the types of multiple regression

models

Page 2: Statistics for Managers Using Microsoft Excel

12 - 2

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Multiple Regression Models

MultipleRegression

Models

LinearDummyVariable

LinearNon-

Linear

Inter-action

Poly-Nomial

SquareRoot Log Reciprocal Exponential

Page 3: Statistics for Managers Using Microsoft Excel

12 - 3

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Linear Multiple Regression Model

Relationship between 1 dependent & 2 or more independent variables is a linear function

Y X X Xi i i P Pi i 0 1 1 2 2

Dependent (response) variable

Independent (explanatory) variables

Population slopes

Population Y-intercept

Random error

Page 4: Statistics for Managers Using Microsoft Excel

12 - 4

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Population Multiple Regression Model

X2

Y

X1YX = 0 + 1X 1 i + 2X 2 i

0

Y i = 0 + 1X 1 i + 2X 2 i + i

ResponsePlane

(X 1 i,X 2 i)

(Observed Y )

i

Bivariate modelBivariate model

Page 5: Statistics for Managers Using Microsoft Excel

12 - 5

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Sample Multiple Regression Model

X2

Y

X1

b0

Y i = b0 + b1X 1 i + b2X 2 i + e i

ResponsePlane

(X 1 i,X 2 i)

(Observed Y)

^

e i

Y i = b0 + b1X 1 i + b2X 2 i

Bivariate modelBivariate model

Page 6: Statistics for Managers Using Microsoft Excel

12 - 6

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Regression Modeling Steps

Define problem or question Specify model Collect data Do descriptive data analysis Estimate unknown parameters Evaluate model Use model for prediction

Page 7: Statistics for Managers Using Microsoft Excel

12 - 7

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Multiple Linear Regression Equations

Too complicated

by hand! Ouch!

Page 8: Statistics for Managers Using Microsoft Excel

12 - 8

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interpretation of Estimated Coefficients

Slope (bP) Estimated Y changes by bP for each 1 unit

increase in XP holding all other variables constant

Example: If b1 = 2, then Sales (Y) is expected to increase by 2 for each 1 unit increase in Advertising (X1) given the Number of Sales Rep’s (X2)

Y-Intercept (b0) Average value of Y when XP = 0

Page 9: Statistics for Managers Using Microsoft Excel

12 - 9

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Parameter Estimation Example

You work in advertising for the New York Times. You want to find the effect of ad size (sq. in.) & newspaper circulation (000) on the number of ad responses (00).

You’ve collected the following data:

Resp Size Circ

1 1 24 8 81 3 13 5 72 6 44 10 6

Page 10: Statistics for Managers Using Microsoft Excel

12 - 10

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Parameter Estimation Excel Output

bbPP

bb00

bb11bb22

Page 11: Statistics for Managers Using Microsoft Excel

12 - 11

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interpretation of Coefficients Solution

Slope (b1) # Responses to Ad is expected to increase

by .2049 (20.49) for each 1 sq. in. increase in Ad Size holding Circulation constant

Slope (b2) # Responses to Ad is expected to increase

by .2805 (28.05) for each 1 unit (1,000) increase in Circulation holding Ad Size constant

Page 12: Statistics for Managers Using Microsoft Excel

12 - 12

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Regression Modeling Steps

Define problem or question Specify model Collect data Do descriptive data analysis Estimate unknown parameters

Evaluate model Use model for prediction

Evaluating Multiple Regression Model Steps

Page 13: Statistics for Managers Using Microsoft Excel

12 - 13

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Evaluating Multiple Regression Model Steps

Examine variation measures Do residual analysis Test parameter significance

Overall model Portions of model Individual coefficients

Test for multicollinearity

Page 14: Statistics for Managers Using Microsoft Excel

12 - 14

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Evaluating Multiple Regression Model Steps

Examine variation measures Do residual analysis Test parameter significance

Overall model Portions of model Individual coefficients

Test for multicollinearity

New!Ne

w!New!

Expand

e

d!

Page 15: Statistics for Managers Using Microsoft Excel

12 - 15

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Coefficient of Multiple Determination

Proportion of variation in Y ‘explained’ by all X variables taken together

r2Y.12..P = Explained variation = SSR

Total variation SST Never decreases when new X variable

is added to model Only Y values determine SST Disadvantage when comparing models

Page 16: Statistics for Managers Using Microsoft Excel

12 - 16

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Proportion of variation in Y ‘explained’ by all X variables taken together

Reflects Sample size Number of independent variables

Smaller than r2Y.12..P

Used to compare models

Adjusted Coefficient of Multiple Determination

Page 17: Statistics for Managers Using Microsoft Excel

12 - 17

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Evaluating Multiple Regression Model Steps

Examine variation measures Do residual analysis

Test parameter significance Overall model Portions of model Individual coefficients

Test for multicollinearity

New!New!

New!

Expand

ed!

Page 18: Statistics for Managers Using Microsoft Excel

12 - 18

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Testing Overall Significance

Shows if there is a linear relationship between all X variables together & Y

Uses F test statistic Hypotheses

H0: 1 = 2 = ... = P = 0 No linear relationship

H1: At least one coefficient is not 0 At least one X variable affects Y

Page 19: Statistics for Managers Using Microsoft Excel

12 - 19

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Evaluating Multiple Regression Model Steps

Examine variation measures Do residual analysis Test parameter significance

Overall model Portions of model Individual coefficients

Test for multicollinearity

New!Ne

w!New!

Expand

e

d!

Page 20: Statistics for Managers Using Microsoft Excel

12 - 20

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Multicollinearity

High correlation between X variables Coefficients measure combined effect Leads to unstable coefficients

depending on X variables in model Always exists; matter of degree Example: Using both Sales & Profit as

explanatory variables in same model

Page 21: Statistics for Managers Using Microsoft Excel

12 - 21

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Detecting Multicollinearity

Examine correlation matrix Correlations between pairs of X variables

are more than with Y variable

Examine variance inflation factor (VIF) If VIFj > 5, multicollinearity exists

Few remedies Obtain new sample data Eliminate one correlated X variable

Page 22: Statistics for Managers Using Microsoft Excel

12 - 22

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Multiple Regression Models

MultipleRegression

Models

LinearDummyVariable

LinearNon-

Linear

Inter-action

Poly-Nomial

SquareRoot Log Reciprocal Exponential

Page 23: Statistics for Managers Using Microsoft Excel

12 - 23

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Polynomial (Curvilinear) Regression Model

Relationship between 1 dependent & 2 or more independent variables is a quadratic function

Useful 1st model if non-linear relationship suspected

Page 24: Statistics for Managers Using Microsoft Excel

12 - 24

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Polynomial (Curvilinear) Regression Model

Relationship between 1 dependent & 2 or more independent variables is a quadratic function

Useful 1st model if non-linear relationship suspected

Polynomial model

Y X Xi i i i 0 1 1 11 12

Linear effect

Curvilinear effect

Page 25: Statistics for Managers Using Microsoft Excel

12 - 25

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Y

X1

Polynomial (Curvilinear) Model Relationships

Y

X1

Y

X1

Y

X1

11 > 011 > 0

11 < 011 < 0

Page 26: Statistics for Managers Using Microsoft Excel

12 - 26

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Polynomial (Curvilinear) Model Worksheet

Case, i Yi X1i X1i2

1 1 1 1

2 4 8 64

3 1 3 9

4 3 5 25

: : : :Create X1

2 column. Run regression with Y, X1, X1

2

Page 27: Statistics for Managers Using Microsoft Excel

12 - 27

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Multiple Regression Models

MultipleRegression

Models

Linear DummyVariable

LinearNon-

Linear

Inter-action

Poly-Nomial

SquareRoot Log Reciprocal Exponential

Page 28: Statistics for Managers Using Microsoft Excel

12 - 28

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Dummy-Variable Regression Model

Involves categorical X variable with 2 levels e.g., male-female, college-no college etc.

Variable levels coded 0 & 1 Assumes only intercept is different

Slopes are constant across categories

Page 29: Statistics for Managers Using Microsoft Excel

12 - 29

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Dummy-Variable Regression Model

Involves categorical X variable with 2 levels e.g., male-female, college-no college etc.

Variable levels coded 0 & 1 Assumes only intercept is different

Slopes are constant across categories

Dummy-variable modelY X X Xi i i P Pi i 0 1 1 2 2

Page 30: Statistics for Managers Using Microsoft Excel

12 - 30

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Dummy-Variable Model Worksheet

Case, i Yi X1i X2i

1 1 1 1

2 4 8 0

3 1 3 1

4 3 5 1

: : : :X2 levels: 0 = Group 1; 1 = Group 2. Run regression with Y, X1, X2

Page 31: Statistics for Managers Using Microsoft Excel

12 - 31

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interpreting Dummy-Variable Model Equation

Given: Starting salary of college grad'sGPA

iif Female

f Male

Y b b X b XYX

X

i i i

0 1 1 2 2

1

201{

Page 32: Statistics for Managers Using Microsoft Excel

12 - 32

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interpreting Dummy-Variable Model Equation

Given: Starting salary of college grad'sGPA

iif Female

Males (

f Male

):

Y b b X b XYX

X

Y b b X b b b X

i i i

i i i

X

0 1 1 2 2

1

2

0 1 1 2 0 1 1

01

(0)2 0

{

Page 33: Statistics for Managers Using Microsoft Excel

12 - 33

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interpreting Dummy-Variable Model Equation

Given: Starting salary of college grad'sGPA

iif Female

Males (

f Male

):

Y b b X b XYX

X

Y b b X b b b X

i i i

i

X

0 1 1 2 2

1

2

0 1 1 2 0 1

01

(0)2 0

{

Females (X2 = 1):

Y b b 1 1 2 1 1(1)i 0 b X b

0 b X

1

2b

Same slopes

Page 34: Statistics for Managers Using Microsoft Excel

12 - 34

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Dummy-Variable Model Relationships

YY

XX1100

00

Same slopes b1

bb00

b0 + b2

Females

Males

Page 35: Statistics for Managers Using Microsoft Excel

12 - 35

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Dummy-Variable Model Example

Computer Output:

f Maleif Female

i

Y X X

X

i i i

3 5 7

01

1 2

2 {

Page 36: Statistics for Managers Using Microsoft Excel

12 - 36

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Dummy-Variable Model Example

Computer Output:

f Maleif Female

Males (

i

):

Y X X

X

Y X X

i i i

i i i

X

3 5 7

01

3 5 7(0) 3 5

1 2

2

1 1

2 0

{

Page 37: Statistics for Managers Using Microsoft Excel

12 - 37

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Dummy-Variable Model Example

Computer Output:

f Maleif Female

Males (

i

):

Y X X

X

Y X X

i i i

i i i

X

3 5 7

01

3 5 7(0) 3 5

1 2

2

1 1

2 0

{

Yi 3 5 (1) 51 1

X i 7 10 X i

Females ( ):X 2 1

Same slopes

Page 38: Statistics for Managers Using Microsoft Excel

12 - 38

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Multiple Regression Models

MultipleRegression

Models

LinearDummyVariable

LinearNon-

Linear

Inter-action

Poly-Nomial

SquareRoot Log Reciprocal Exponential

Page 39: Statistics for Managers Using Microsoft Excel

12 - 39

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interaction Regression Model

Hypothesizes interaction between pairs of X variables Response to one X variable varies at

different levels of another X variable

Page 40: Statistics for Managers Using Microsoft Excel

12 - 40

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interaction Regression Model

Hypothesizes interaction between pairs of X variables Response to one X variable varies at

different levels of another X variable

Contains two-way cross product terms Y X X X Xi i i i i i 0 1 1 2 2 3 1 2

Page 41: Statistics for Managers Using Microsoft Excel

12 - 41

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interaction Regression Model

Hypothesizes interaction between pairs of X variables Response to one X variable varies at

different levels of another X variable

Contains two-way cross product terms

Can be combined with other models e.g., dummy variable model

Y X X X Xi i i i i i 0 1 1 2 2 3 1 2

Page 42: Statistics for Managers Using Microsoft Excel

12 - 42

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Effect of Interaction

Given:

Y X X X Xi i i i i i 0 1 1 2 2 3 1 2

Page 43: Statistics for Managers Using Microsoft Excel

12 - 43

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Effect of Interaction

Given:

Without interaction term, effect of X1 on Y is measured by 1

Y X X X Xi i i i i i 0 1 1 2 2 3 1 2

Page 44: Statistics for Managers Using Microsoft Excel

12 - 44

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Effect of Interaction

Given:

Without interaction term, effect of X1 on Y is measured by 1

With interaction term, effect of X1 onY is measured by 1 + 3X2

Effect increases as X2i increases

Y X X X Xi i i i i i 0 1 1 2 2 3 1 2

Page 45: Statistics for Managers Using Microsoft Excel

12 - 45

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interaction Example

XX11

44

88

1212

0000 110.50.5 1.51.5

YY Y = 1 + 2X1 + 3X2 + 4X1X2

Page 46: Statistics for Managers Using Microsoft Excel

12 - 46

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interaction Example

XX11

44

88

1212

0000 110.50.5 1.51.5

YY Y = 1 + 2X1 + 3X2 + 4X1X2

YY = 1 + 2 = 1 + 2XX11 + 3( + 3(00) + 4) + 4XX11((00) = 1 + 2) = 1 + 2XX11

Page 47: Statistics for Managers Using Microsoft Excel

12 - 47

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interaction Example

YY

XX11

44

88

1212

0000 110.50.5 1.51.5

Y = 1 + 2X1 + 3X2 + 4X1X2

YY = 1 + 2 = 1 + 2XX11 + 3( + 3(11) + 4) + 4XX11((11) = 4 + 6) = 4 + 6XX11

YY = 1 + 2 = 1 + 2XX11 + 3( + 3(00) + 4) + 4XX11((00) = 1 + 2) = 1 + 2XX11

Page 48: Statistics for Managers Using Microsoft Excel

12 - 48

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interaction Example

Effect (slope) of X1 on Y does depend on X2 value

XX11

44

88

1212

0000 110.50.5 1.51.5

YY Y = 1 + 2X1 + 3X2 + 4X1X2

YY = 1 + 2 = 1 + 2XX11 + 3( + 3(11) + 4) + 4XX11((11) = 4 + ) = 4 + 66XX11

YY = 1 + 2 = 1 + 2XX11 + 3( + 3(00) + 4) + 4XX11((00) = 1 + ) = 1 + 22XX11

Page 49: Statistics for Managers Using Microsoft Excel

12 - 49

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Interaction Regression Model Worksheet

Case, i Yi X1i X2i X1i X2i

1 1 1 3 3

2 4 8 5 40

3 1 3 2 6

4 3 5 6 30

: : : : :Multiply X1 by X2 to get X1X2. Run regression with Y, X1, X2 , X1X2

Page 50: Statistics for Managers Using Microsoft Excel

12 - 50

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Multiple Regression Models

MultipleRegression

Models

Linear DummyVariable

LinearNon-

Linear

Inter-action

Poly-Nomial

SquareRoot

Log Reciprocal Exponential

Page 51: Statistics for Managers Using Microsoft Excel

12 - 51

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Inherently Linear Models

Non-linear models that can be expressed in linear form Can be estimated by LS in linear form

Require data transformation

Page 52: Statistics for Managers Using Microsoft Excel

12 - 52

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Inherently Linear Models

Non-linear models that can be expressed in linear form Can be estimated by LS in linear form

Require data transformation Multiplicative model example

iiii

iiii

XXY

XXY

lnlnlnlnln 22110

21021

Page 53: Statistics for Managers Using Microsoft Excel

12 - 53

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Square-Root Transformation

Y

X1

Y X Xi i i i 0 1 1 2 2

11 > 0 > 0

11 < 0 < 0

Page 54: Statistics for Managers Using Microsoft Excel

12 - 54

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Logarithmic Transformation

Y

X1

11 > 0 > 0

11 < 0 < 0

iiii XXY 22110 lnln

Page 55: Statistics for Managers Using Microsoft Excel

12 - 55

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Reciprocal Transformation

Y

X1

11 > 0 > 0

11 < 0 < 0

YX Xi

i ii 0 1

12

2

1 1

AsymptoteAsymptote

Page 56: Statistics for Managers Using Microsoft Excel

12 - 56

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Exponential Transformation

Y

X1

11 > 0 > 0

11 < 0 < 0

Y eiX X

ii i 0 1 1 2 2

Page 57: Statistics for Managers Using Microsoft Excel

12 - 57

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Inherently Linear Models Worksheet

Case, i Yi X1iX1i lnX1i 1/X1i

1 1 1 1.0 0.00 1.000

2 4 9 3.0 2.20 .1111

3 1 16 4.0 2.77 .0625

4 3 25 5.0 3.22 .0400

: : : : : :

Transform X1.Run regression with Y, & either X1, lnX1 , 1/X1

Page 58: Statistics for Managers Using Microsoft Excel

12 - 58

© 1998 Prentice-Hall, Inc.

Statistics for Managers Using Microsoft Excel, 1/e

Conclusion

Explained the linear multiple regression model

Interpreted linear multiple regression computer output

Explained multicollinearity Described the types of multiple

regression models