malhotra mr05 ppt 18

47
18-1 © 2007 Prentice Hall Chapter Eighteen 18-1 Discriminant and Logit Analysis

Upload: abhishek-chakraborty

Post on 15-Nov-2014

358 views

Category:

Documents


54 download

TRANSCRIPT

Page 1: Malhotra Mr05 Ppt 18

18-1© 2007 Prentice Hall

Chapter Eighteen

18-1

Discriminant and Logit Analysis

Page 2: Malhotra Mr05 Ppt 18

18-2© 2007 Prentice Hall

Chapter Outline

1) Overview

2) Basic Concept

3) Relation to Regression and ANOVA

4) Discriminant Analysis Model

5) Statistics Associated with Discriminant Analysis

6) Conducting Discriminant Analysis

i. Formulation

ii. Estimation

iii. Determination of Significance

iv. Interpretation

v. Validation

Page 3: Malhotra Mr05 Ppt 18

18-3© 2007 Prentice Hall

Chapter Outline

7) Multiple Discriminant Analysis

i. Formulation

ii. Estimation

iii. Determination of Significance

iv. Interpretation

v. Validation

8) Stepwise Discriminant Analysis

Page 4: Malhotra Mr05 Ppt 18

18-4© 2007 Prentice Hall

9) The Logit Model

i. Estimation

ii. Model Fit

iii. Significance Testing

iv. Interpretation of Coefficients

v. An Illustrative Application

10) Summary

Chapter Outline

Page 5: Malhotra Mr05 Ppt 18

18-5© 2007 Prentice Hall

Similarities and Differences between ANOVA, Regression, and Discriminant Analysis

Table 18.1

ANOVA REGRESSION DISCRIMINANT/LOGITSimilaritiesNumber of One One OnedependentvariablesNumber ofindependent Multiple Multiple Multiplevariables

DifferencesNature of thedependent Metric Metric CategoricalvariablesNature of theindependent Categorical Metric Metricvariables

Page 6: Malhotra Mr05 Ppt 18

18-6© 2007 Prentice Hall

Discriminant Analysis

Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor or independent variables are interval in nature.

The objectives of discriminant analysis are as follows: Development of discriminant functions, or linear

combinations of the predictor or independent variables, which will best discriminate between the categories of the criterion or dependent variable (groups).

Examination of whether significant differences exist among the groups, in terms of the predictor variables.

Determination of which predictor variables contribute to most of the intergroup differences.

Classification of cases to one of the groups based on the values of the predictor variables.

Evaluation of the accuracy of classification.

Page 7: Malhotra Mr05 Ppt 18

18-7© 2007 Prentice Hall

When the criterion variable has two categories, the technique is known as two-group discriminant analysis.

When three or more categories are involved, the technique is referred to as multiple discriminant analysis.

The main distinction is that, in the two-group case, it is possible to derive only one discriminant function. In multiple discriminant analysis, more than one function may be computed. In general, with G groups and k predictors, it is possible to estimate up to the smaller of G - 1, or k, discriminant functions.

The first function has the highest ratio of between-groups to within-groups sum of squares. The second function, uncorrelated with the first, has the second highest ratio, and so on. However, not all the functions may be statistically significant.

Discriminant Analysis

Page 8: Malhotra Mr05 Ppt 18

18-8© 2007 Prentice Hall

Geometric Interpretation

Fig. 18.1

X2

X1

G1

G2

D

G1 G2

2 2 2 2 2 2 2 2 2 2

2 1 1

1 1

2 2 2 2 1

1 1 1 1 1 1 1 1 1

Page 9: Malhotra Mr05 Ppt 18

18-9© 2007 Prentice Hall

Discriminant Analysis ModelThe discriminant analysis model involves linear combinations of

the following form:

D = b0 + b1X1 + b2X2 + b3X3 + . . . + bkXk

Where:

D = discriminant score

b 's = discriminant coefficient or weight

X 's = predictor or independent variable

The coefficients, or weights (b), are estimated so that the groups differ as much as possible on the values of the discriminant function.

This occurs when the ratio of between-group sum of squares to within-group sum of squares for the discriminant scores is at a maximum.

Page 10: Malhotra Mr05 Ppt 18

18-10© 2007 Prentice Hall

Canonical correlation. Canonical correlation measures the extent of association between the discriminant scores and the groups. It is a measure of association between the single discriminant function and the set of dummy variables that define the group membership.

Centroid. The centroid is the mean values for the discriminant scores for a particular group. There are as many centroids as there are groups, as there is one for each group. The means for a group on all the functions are the group centroids.

Classification matrix. Sometimes also called confusion or prediction matrix, the classification matrix contains the number of correctly classified and misclassified cases.

Statistics Associated with Discriminant Analysis

Page 11: Malhotra Mr05 Ppt 18

18-11© 2007 Prentice Hall

Discriminant function coefficients. The discriminant function coefficients (unstandardized) are the multipliers of variables, when the variables are in the original units of measurement.

Discriminant scores. The unstandardized coefficients are multiplied by the values of the variables. These products are summed and added to the constant term to obtain the discriminant scores.

Eigenvalue. For each discriminant function, the Eigenvalue is the ratio of between-group to within-group sums of squares. Large Eigenvalues imply superior functions.

Statistics Associated with Discriminant Analysis

Page 12: Malhotra Mr05 Ppt 18

18-12© 2007 Prentice Hall

F values and their significance. These are calculated from a one-way ANOVA, with the grouping variable serving as the categorical independent variable. Each predictor, in turn, serves as the metric dependent variable in the ANOVA.

Group means and group standard deviations. These are computed for each predictor for each group.

Pooled within-group correlation matrix. The pooled within-group correlation matrix is computed by averaging the separate covariance matrices for all the groups.

Statistics Associated with Discriminant Analysis

Page 13: Malhotra Mr05 Ppt 18

18-13© 2007 Prentice Hall

Standardized discriminant function coefficients. The standardized discriminant function coefficients are the discriminant function coefficients and are used as the multipliers when the variables have been standardized to a mean of 0 and a variance of 1.

Structure correlations. Also referred to as discriminant loadings, the structure correlations represent the simple correlations between the predictors and the discriminant function.

Total correlation matrix. If the cases are treated as if they were from a single sample and the correlations computed, a total correlation matrix is obtained.

Wilks' . Sometimes also called the U statistic, Wilks' for each predictor is the ratio of the within-group sum of squares to the total sum of squares. Its value varies between 0 and 1. Large values of (near 1) indicate that group means do not seem to be different. Small values of (near 0) indicate that the group means seem to be different.

Statistics Associated with Discriminant Analysis

Page 14: Malhotra Mr05 Ppt 18

18-14© 2007 Prentice Hall

Conducting Discriminant Analysis

Fig. 18.2

Assess Validity of Discriminant Analysis

Estimate the Discriminant Function Coefficients

Determine the Significance of the Discriminant Function

Formulate the Problem

Interpret the Results

Page 15: Malhotra Mr05 Ppt 18

18-15© 2007 Prentice Hall

Conducting Discriminant Analysis Formulate the Problem

Identify the objectives, the criterion variable, and the independent variables.

The criterion variable must consist of two or more mutually exclusive and collectively exhaustive categories.

The predictor variables should be selected based on a theoretical model or previous research, or the experience of the researcher.

One part of the sample, called the estimation or analysis sample, is used for estimation of the discriminant function.

The other part, called the holdout or validation sample, is reserved for validating the discriminant function.

Often the distribution of the number of cases in the analysis and validation samples follows the distribution in the total sample.

Page 16: Malhotra Mr05 Ppt 18

18-16© 2007 Prentice Hall

Information on Resort Visits: Analysis SampleTable 18.2

Annual Attitude Importance Household Age of Amount Resort Family Toward Attached Size

Head of Spent on No. Visit Income Travel to Family

Household Family ($000) Vacation

Vacation

1 1 50.2 5 8 3 43 M (2)2 1 70.3 6 7 4 61 H (3)3 1 62.9 7 5 6 52 H (3)4 1 48.5 7 5 5 36 L (1)5 1 52.7 6 6 4 55 H (3)6 1 75.0 8 7 5 68 H (3)7 1 46.2 5 3 3 62 M (2)8 1 57.0 2 4 6 51 M (2)9 1 64.1 7 5 4 57 H (3)10 1 68.1 7 6 5 45 H (3)11 1 73.4 6 7 5 44 H (3)12 1 71.9 5 8 4 64 H (3)13 1 56.2 1 8 6 54 M (2)14 1 49.3 4 2 3 56 H (3)15 1 62.0 5 6 2 58 H (3)

Page 17: Malhotra Mr05 Ppt 18

18-17© 2007 Prentice Hall

Information on Resort Visits: Analysis Sample

Table 18.2, cont.Annual Attitude Importance Household Age of Amount Resort Family Toward Attached Size

Head of Spent on No. Visit Income Travel to Family

Household Family ($000) Vacation

Vacation

16 2 32.1 5 4 3 58 L (1)17 2 36.2 4 3 2 55 L (1)18 2 43.2 2 5 2 57 M (2)19 2 50.4 5 2 4 37 M (2)20 2 44.1 6 6 3 42 M (2)21 2 38.3 6 6 2 45 L (1)22 2 55.0 1 2 2 57 M (2)23 2 46.1 3 5 3 51 L (1)24 2 35.0 6 4 5 64 L (1)25 2 37.3 2 7 4 54 L (1)26 2 41.8 5 1 3 56 M (2)27 2 57.0 8 3 2 36 M (2)28 2 33.4 6 8 2 50 L (1)29 2 37.5 3 2 3 48 L (1)30 2 41.3 3 3 2 42 L (1)

Page 18: Malhotra Mr05 Ppt 18

18-18© 2007 Prentice Hall

Information on Resort Visits: Holdout SampleTable 18.3 Annual Attitude Importance Household Age of Amount Resort Family Toward Attached Size

Head of Spent on No. Visit Income Travel to Family

Household Family ($000) Vacation

Vacation

1 1 50.8 4 7 3 45 M(2)2 1 63.6 7 4 7 55 H (3)3 1 54.0 6 7 4 58 M(2)4 1 45.0 5 4 3 60 M(2)5 1 68.0 6 6 6 46 H (3)6 1 62.1 5 6 3 56 H (3)7 2 35.0 4 3 4 54 L (1)8 2 49.6 5 3 5 39 L (1)9 2 39.4 6 5 3 44 H (3)10 2 37.0 2 6 5 51 L (1)11 2 54.5 7 3 3 37 M(2)12 2 38.2 2 2 3 49 L (1)

Page 19: Malhotra Mr05 Ppt 18

18-19© 2007 Prentice Hall

Conducting Discriminant Analysis Estimate the Discriminant Function Coefficients

The direct method involves estimating the discriminant function so that all the predictors are included simultaneously.

In stepwise discriminant analysis, the predictor variables are entered sequentially, based on their ability to discriminate among groups.

Page 20: Malhotra Mr05 Ppt 18

18-20© 2007 Prentice Hall

Results of Two-Group Discriminant Analysis

Table 18.4 GROUP MEANS

VISIT INCOME TRAVEL VACATION HSIZE AGE

1 60.52000 5.40000 5.80000 4.33333 53.733332 41.91333 4.33333 4.06667 2.80000 50.13333Total 51.21667 4.86667 4.9333 3.56667 51.93333

Group Standard Deviations

1 9.83065 1.91982 1.82052 1.23443 8.770622 7.55115 1.95180 2.05171 .94112 8.27101Total 12.79523 1.97804 2.09981 1.33089 8.57395

Pooled Within-Groups Correlation MatrixINCOME TRAVEL VACATION HSIZE AGE

INCOME 1.00000TRAVEL 0.19745 1.00000VACATION 0.09148 0.08434 1.00000HSIZE 0.08887 -0.01681 0.07046 1.00000AGE - 0.01431 -0.19709 0.01742 -0.04301 1.00000

Wilks' (U-statistic) and univariate F ratio with 1 and 28 degrees of freedom

Variable Wilks' F Significance

INCOME 0.45310 33.800 0.0000TRAVEL 0.92479 2.277 0.1425VACATION 0.82377 5.990 0.0209HSIZE 0.65672 14.640 0.0007AGE 0.95441 1.338 0.2572

Cont.

Page 21: Malhotra Mr05 Ppt 18

18-21© 2007 Prentice Hall

Results of Two-Group Discriminant Analysis

Table 18.4, cont. CANONICAL DISCRIMINANT FUNCTIONS

% of Cum Canonical After Wilks'Function Eigenvalue Variance % Correlation Function Chi-square df Significance

: 0 0 .3589 26.130 5 0.00011* 1.7862 100.00 100.00 0.8007 :

* marks the 1 canonical discriminant functions remaining in the analysis.

Standard Canonical Discriminant Function Coefficients

FUNC 1 INCOME 0.74301 TRAVEL 0.09611 VACATION 0.23329 HSIZE 0.46911 AGE 0.20922

Structure Matrix:Pooled within-groups correlations between discriminating variables & canonical discriminant functions (variables ordered by size of correlation within function)

FUNC 1 INCOME 0.82202 HSIZE 0.54096 VACATION 0.34607 TRAVEL 0.21337 AGE 0.16354

Cont.

Page 22: Malhotra Mr05 Ppt 18

18-22© 2007 Prentice Hall

Cont.

Results of Two-Group Discriminant Analysis

Table 18.4, cont. Unstandardized Canonical Discriminant Function Coefficients

FUNC 1INCOME 0.8476710E-01TRAVEL 0.4964455E-01VACATION 0.1202813HSIZE 0.4273893AGE 0.2454380E-01(constant) -7.975476Canonical discriminant functions evaluated at group means (group centroids)

Group FUNC 11 1.291182 -1.29118Classification results for cases selected for use in analysis

Predicted Group MembershipActual Group No. of Cases 1 2

Group 1 15 12 380.0% 20.0%

Group 2 15 0 150.0% 100.0%

Percent of grouped cases correctly classified: 90.00%

Page 23: Malhotra Mr05 Ppt 18

18-23© 2007 Prentice Hall

Results of Two-Group Discriminant Analysis

Table 18.4, cont.

Classification Results for cases not selected for use in the analysis (holdout sample)

Predicted Group MembershipActual Group No. of Cases 1 2

Group 1 6 4 266.7% 33.3%

Group 2 6 0 60.0% 100.0%

Percent of grouped cases correctly classified: 83.33%.

Page 24: Malhotra Mr05 Ppt 18

18-24© 2007 Prentice Hall

Conducting Discriminant Analysis Determine the Significance of Discriminant Function The null hypothesis that, in the population, the means

of all discriminant functions in all groups are equal can be statistically tested.

In SPSS this test is based on Wilks' . If several functions are tested simultaneously (as in the case of multiple discriminant analysis), the Wilks' statistic is the product of the univariate for each function. The significance level is estimated based on a chi-square transformation of the statistic.

If the null hypothesis is rejected, indicating significant discrimination, one can proceed to interpret the results.

Page 25: Malhotra Mr05 Ppt 18

18-25© 2007 Prentice Hall

Conducting Discriminant Analysis Interpret the Results The interpretation of the discriminant weights, or coefficients, is

similar to that in multiple regression analysis. Given the multicollinearity in the predictor variables, there is no

unambiguous measure of the relative importance of the predictors in discriminating between the groups.

With this caveat in mind, we can obtain some idea of the relative importance of the variables by examining the absolute magnitude of the standardized discriminant function coefficients.

Some idea of the relative importance of the predictors can also be obtained by examining the structure correlations, also called canonical loadings or discriminant loadings. These simple correlations between each predictor and the discriminant function represent the variance that the predictor shares with the function.

Another aid to interpreting discriminant analysis results is to develop a Characteristic profile for each group by describing each group in terms of the group means for the predictor variables.

Page 26: Malhotra Mr05 Ppt 18

18-26© 2007 Prentice Hall

Conducting Discriminant Analysis Assess Validity of Discriminant Analysis

Many computer programs, such as SPSS, offer a leave-one-out cross-validation option.

The discriminant weights, estimated by using the analysis sample, are multiplied by the values of the predictor variables in the holdout sample to generate discriminant scores for the cases in the holdout sample. The cases are then assigned to groups based on their discriminant scores and an appropriate decision rule. The hit ratio, or the percentage of cases correctly classified, can then be determined by summing the diagonal elements and dividing by the total number of cases.

It is helpful to compare the percentage of cases correctly classified by discriminant analysis to the percentage that would be obtained by chance. Classification accuracy achieved by discriminant analysis should be at least 25% greater than that obtained by chance.

Page 27: Malhotra Mr05 Ppt 18

18-27© 2007 Prentice Hall

Results of Three-Group Discriminant Analysis

Table 18.5

Group MeansAMOUNT INCOME TRAVEL VACATION HSIZE AGE

1 38.57000 4.50000 4.70000 3.10000 50.30000

2 50.11000 4.00000 4.20000 3.40000 49.50000

3 64.97000 6.10000 5.90000 4.20000 56.00000

Total 51.21667 4.86667 4.93333 3.56667 51.93333

Group Standard Deviations1 5.29718 1.71594 1.88856 1.19722 8.09732

2 6.00231 2.35702 2.48551 1.50555 9.25263

3 8.61434 1.19722 1.66333 1.13529 7.60117

Total 12.79523 1.97804 2.09981 1.33089 8.57395

Pooled Within-Groups Correlation MatrixINCOME TRAVEL VACATION HSIZE AGE

INCOME 1.00000

TRAVEL 0.05120 1.00000

VACATION 0.30681 0.03588 1.00000

HSIZE 0.38050 0.00474 0.22080 1.00000

AGE -0.20939 -0.34022 -0.01326 -0.02512 1.00000 Cont.

Page 28: Malhotra Mr05 Ppt 18

18-28© 2007 Prentice Hall

Results of Three-Group Discriminant Analysis

Table 18.5, cont.Wilks' (U-statistic) and univariate F ratio with 2 and 27 degrees of freedom.

Variable Wilks' Lambda F Significance

INCOME 0.26215 38.00 0.0000TRAVEL 0.78790 3.634 0.0400VACATION 0.88060 1.830 0.1797HSIZE 0.87411 1.944 0.1626AGE 0.88214 1.804 0.1840

CANONICAL DISCRIMINANT FUNCTIONS

% of Cum Canonical After Wilks'Function Eigenvalue Variance % Correlation Function Chi-square df Significance

: 0 0.1664 44.831 10 0.001* 3.8190 93.93 93.93 0.8902 : 1 0.8020 5.517 4 0.242* 0.2469 6.07 100.00 0.4450 :

* marks the two canonical discriminant functions remaining in the analysis.

Standardized Canonical Discriminant Function Coefficients

FUNC 1 FUNC 2INCOME 1.04740 -0.42076TRAVEL 0.33991 0.76851VACATION -0.14198 0.53354HSIZE -0.16317 0.12932AGE 0.49474 0.52447

Cont.

Page 29: Malhotra Mr05 Ppt 18

18-29© 2007 Prentice Hall

Results of Three-Group Discriminant AnalysisTable 18.5, cont.Structure Matrix:Pooled within-groups correlations between discriminating variables and canonical discriminant functions (variables ordered by size of correlation within function)

FUNC 1 FUNC 2INCOME 0.85556* -0.27833HSIZE 0.19319* 0.07749VACATION 0.21935 0.58829*TRAVEL 0.14899 0.45362*AGE 0.16576 0.34079*

Unstandardized canonical discriminant function coefficientsFUNC 1 FUNC 2

INCOME 0.1542658 -0.6197148E-01TRAVEL 0.1867977 0.4223430VACATION -0.6952264E-01 0.2612652HSIZE -0.1265334 0.1002796AGE 0.5928055E-01 0.6284206E-01(constant) -11.09442 -3.791600

Canonical discriminant functions evaluated at group means (group centroids)Group FUNC 1 FUNC 21 -2.04100 0.418472 -0.40479 -0.658673 2.44578 0.24020 Cont.

Page 30: Malhotra Mr05 Ppt 18

18-30© 2007 Prentice Hall

Results of Three-Group Discriminant Analysis

Table 18.5, cont.Classification Results:

PredictedGroup Membership Actual Group No. of Cases 1 2 3

Group 1 10 9 1 090.0% 10.0% 0.0%

Group 2 10 1 9 010.0% 90.0% 0.0%

Group 3 10 0 2 80.0% 20.0% 80.0%

Percent of grouped cases correctly classified: 86.67%

Classification results for cases not selected for use in the analysis

PredictedGroup MembershipActual Group No. of Cases 1 2 3

Group 1 4 3 1 075.0% 25.0% 0.0%

Group 2 4 0 3 10.0% 75.0% 25.0%

Group 3 4 1 0 325.0% 0.0% 75.0%

Percent of grouped cases correctly classified: 75.00%

Page 31: Malhotra Mr05 Ppt 18

18-31© 2007 Prentice Hall

All-Groups ScattergramFig. 18.3

-4.0

Across: Function 1 Down: Function 2

4.0

0.0

-6.0 4.0 0.0-2.0-4.0 2.0 6.0

1

1 1

1 1

1

1

1

1

2

1 2

2

2

2

2

3

3

3

3

3

3

3

2

3*

* *

* indicates a group centroid

Page 32: Malhotra Mr05 Ppt 18

18-32© 2007 Prentice Hall

Territorial MapFig. 18.4

-4.0

Across: Function 1 Down: Function 2

4.0

0.0

-6.0 4.0 0.0-2.0-4.0 2.0 6.0

1

1 3

*

-8.0

-8.0

8.0

8.0

1 3

1 3

1 3

1 3 1 3 1 3 1 3

1 1 2 3 1 1 2 2 3 3

1 1 2 2

1 1 1 2 2 2 2 3 3

1 1 1 2 2

1 1 2 2 1 1 2

2 1 1 1 2 2

1 1 2 2

1 1 2 2

1 1 1 2 2 1 1 1 2 2

1 1 2 2 2

2 2 32 3 3

2 2 3 3

2 2 3

2 2 3

2 2 3

2 2 3 3

2 3 3

2 3 3

2 3 3

* *

* Indicates a group centroid

Page 33: Malhotra Mr05 Ppt 18

18-33© 2007 Prentice Hall

Stepwise Discriminant Analysis Stepwise discriminant analysis is analogous to

stepwise multiple regression (see Chapter 17) in that the predictors are entered sequentially based on their ability to discriminate between the groups.

An F ratio is calculated for each predictor by conducting a univariate analysis of variance in which the groups are treated as the categorical variable and the predictor as the criterion variable.

The predictor with the highest F ratio is the first to be selected for inclusion in the discriminant function, if it meets certain significance and tolerance criteria.

A second predictor is added based on the highest adjusted or partial F ratio, taking into account the predictor already selected.

Page 34: Malhotra Mr05 Ppt 18

18-34© 2007 Prentice Hall

Each predictor selected is tested for retention based on its association with other predictors selected.

The process of selection and retention is continued until all predictors meeting the significance criteria for inclusion and retention have been entered in the discriminant function.

The selection of the stepwise procedure is based on the optimizing criterion adopted. The Mahalanobis procedure is based on maximizing a generalized measure of the distance between the two closest groups.

The order in which the variables were selected also indicates their importance in discriminating between the groups.

Stepwise Discriminant Analysis

Page 35: Malhotra Mr05 Ppt 18

18-35© 2007 Prentice Hall

The Logit Model

The dependent variable is binary and there are several independent variables that are metric

The binary logit model commonly deals with the issue of how likely is an observation to belong to each group

It estimates the probability of an observation belonging to a particular group

Page 36: Malhotra Mr05 Ppt 18

18-36© 2007 Prentice Hall

Binary Logit Model Formulation

The probability of success may be modeled using the logit model as:

Or

Or

i

n

ie XaP

P

1log i

i 0

XaXaXaa kke

PP

...

1log 22110

Page 37: Malhotra Mr05 Ppt 18

18-37© 2007 Prentice Hall

Model Formulation

)exp(1

)exp(

0

0

Xa

Xa

i

k

ii

i

k

ii

P

Where:

P = Probability of success

Xi = Independent variable i

ai = parameter to be estimated.

Page 38: Malhotra Mr05 Ppt 18

18-38© 2007 Prentice Hall

Properties of the Logit Model

Although Xi may vary from to , P is constrained to lie between 0 and 1.

When Xi approaches , P approaches 0.

When Xi approaches , P approaches 1.

When OLS regression is used, P is not constrained to lie between 0 and 1.

Page 39: Malhotra Mr05 Ppt 18

18-39© 2007 Prentice Hall

Estimation and Model Fit

The estimation procedure is called the maximum likelihood method.

Fit: Cox & Snell R Square and Nagelkerke R Square.

Both these measures are similar to R2 in multiple regression.

The Cox & Snell R Square can not equal 1.0, even if the fit is perfect

This limitation is overcome by the Nagelkerke R Square.

Compare predicted and actual values of Y to determine the percentage of correct predictions.

Page 40: Malhotra Mr05 Ppt 18

18-40© 2007 Prentice Hall

Significance Testing

Page 41: Malhotra Mr05 Ppt 18

18-41© 2007 Prentice Hall

Interpretation of Coefficients

If Xi is increased by one unit, the log odds will change by ai units, when the effect of other independent variables is held constant.

The sign of ai will determine whether the probability increases (if the sign is positive) or decreases (if the sign is negative) by this amount.

Page 42: Malhotra Mr05 Ppt 18

18-42© 2007 Prentice Hall

Explaining Brand LoyaltyTable 18.6

No. Loyalty Brand Product Shopping 1 1 4 3 5 2 1 6 4 4 3 1 5 2 4 4 1 7 5 5 5 1 6 3 4 6 1 3 4 5 7 1 5 5 5 8 1 5 4 2 9 1 7 5 4

10 1 7 6 4 11 1 6 7 2 12 1 5 6 4 13 1 7 3 3 14 1 5 1 4 15 1 7 5 5 16 0 3 1 3 17 0 4 6 2 18 0 2 5 2 19 0 5 2 4 20 0 4 1 3 21 0 3 3 4 22 0 3 4 5 23 0 3 6 3 24 0 4 4 2 25 0 6 3 6 26 0 3 6 3 27 0 4 3 2 28 0 3 5 2 29 0 5 5 3 30 0 1 3 2

Page 43: Malhotra Mr05 Ppt 18

18-43© 2007 Prentice Hall

Results of Logistic Regression

Table 18.7

Dependent Variable Encoding

Original Value Internal Value Not Loyal 0 Loyal 1

Model Summary

Step -2 Log

likelihood Cox & Snell R Square

Nagelkerke R Square

1 23.471(a) .453 .604

a Estimation terminated at iteration number 6 because parameter estimates changed by less than .001.

Page 44: Malhotra Mr05 Ppt 18

18-44© 2007 Prentice Hall

Results of Logistic Regression

Table 18.7, cont.

Variables in the Equationa

Variable(s) entered on step 1: Brand, Product, Shopping.a.

1.274 .479 7.075 1 .008 3.575

.186 .322 .335 1 .563 1.205

.590 .491 1.442 1 .230 1.804

-8.642 3.346 6.672 1 .010 .000

Brand

Product

Shopping

Constant

Step1

B S.E. Wald df Sig. Exp(B)

Classification Tablea

The cut value is .500a.

12 3 80.03 12 80.0

80.0

ObservedNot LoyalLoyal

Loyalty to theBrand

Overall Percentage

Step 1Not Loyal LoyalLoyalty to the Brand Percentage

Correct

Predicted

Page 45: Malhotra Mr05 Ppt 18

18-45© 2007 Prentice Hall

SPSS Windows

The DISCRIMINANT program performs both two-group and multiple discriminant analysis. To select this procedure using SPSS for Windows click:

Analyze>Classify>Discriminant …

The run logit analysis or logistic regression using SPSS for Windows, click:

Analyze > Regression>Binary Logistic

Page 46: Malhotra Mr05 Ppt 18

18-46© 2007 Prentice Hall

SPSS Windows: Two-group Discriminant

1. Select ANALYZE from the SPSS menu bar.2. Click CLASSIFY and then DISCRIMINANT.3. Move “visit” in to the GROUPING VARIABLE box.4. Click DEFINE RANGE. Enter 1 for MINIMUM and 2 for MAXIMUM.

Click CONTINUE.5. Move “income,” “travel,” “vacation,” “hsize,” and “age” in to the

INDEPENDENTS box.6. Select ENTER INDEPENDENTS TOGETHER (default option)7. Click on STATISTICS. In the pop-up window, in the DESCRIPTIVES

box check MEANS and UNIVARIATE ANOVAS. In the MATRICES box check WITHIN-GROUP CORRELATIONS. Click CONTINUE.

8. Click CLASSIFY.... In the pop-up window in the PRIOR PROBABILITIES box check ALL GROUPS EQUAL (default). In the DISPLAY box check SUMMARY TABLE and LEAVE-ONE-OUT CLASSIFICATION. In the USE COVARIANCE MATRIX box check WITHIN-GROUPS. Click CONTINUE.

9. Click OK.

Page 47: Malhotra Mr05 Ppt 18

18-47© 2007 Prentice Hall

SPSS Windows: Logit Analysis

1. Select ANALYZE from the SPSS menu bar.

2. Click REGRESSION and then BINARY LOGISTIC.

3. Move “Loyalty to the Brand [Loyalty]” in to the DEPENDENT VARIABLE box.

4. Move “Attitude toward the Brand [Brand},” “Attitude toward the Product category [Product},” and “Attitude toward Shopping [Shopping],” in to the COVARIATES(S box.)

5. Select ENTER for METHOD (default option)

6. Click OK.