introduction to multi-level modeling

86
Introduction to Multi-Level Modeling Juliet Aiken Design and Statistical Analysis Laboratory November 13, 2009

Upload: fathia

Post on 22-Jan-2016

40 views

Category:

Documents


0 download

DESCRIPTION

Introduction to Multi-Level Modeling. Juliet Aiken Design and Statistical Analysis Laboratory November 13, 2009. What to Expect. Slides will be posted on DaSAL website (http://blog.umd.edu/statconsulting/) after the presentation (but not immediately) - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Introduction to Multi-Level Modeling

Introduction to Multi-Level Modeling

Juliet Aiken

Design and Statistical Analysis Laboratory

November 13, 2009

Page 2: Introduction to Multi-Level Modeling

What to Expect Slides will be posted on DaSAL website

(http://blog.umd.edu/statconsulting/) after the presentation (but not immediately)

Not all questions will be answered right now Ask anyway! We’ll write them down and follow up with them before

posting the slides online Terminology:

Random Coefficient Modeling: A statistical technique Hierarchical Linear Modeling: A piece of software which can

be used to do random coefficient modeling It’s okay if some of this is information overload! I do not care if you get up to get more food in the

middle of this

Page 3: Introduction to Multi-Level Modeling

Goals

Understand basics of random coefficient multi-level modeling (RCM) When to use and when not to use RCM What is a random coefficient? Slopes vs. intercepts as outcomes

Provide a platform for further reading and investigation into proper use of RCM Materials and references for later use

Page 4: Introduction to Multi-Level Modeling

Overview

What is multi-level modeling? Multi-level modeling examples What is multi-level modeling?, revisited Why do we care? Multi-level statistical models

Justification for use Two types of models

Page 5: Introduction to Multi-Level Modeling

Overview MLM: Random Coefficient Modeling

Distinguishing Random Coefficients from Random Effects

Kinds of RC Modeling that can be conducted HLM Show and Tell with HLM

Page 6: Introduction to Multi-Level Modeling

Overview of Example 20 teams, 100 people (5 per team), 4

observations of performance per person Self efficacy is measured at each time point

per person Measures of team empowerment from each

individual with respect to their team Does self-efficacy predict performance?

Within individuals? Across individuals?

Does team empowerment predict performance?

Page 7: Introduction to Multi-Level Modeling

What Is Multi-Level Modeling? Family of statistical models for data analysis Theory, design, and measurement

Page 8: Introduction to Multi-Level Modeling

Multi-Level Theoretical Models:Modeling with Correlated Errors (V-C)

OrganizationalSupport

Work-FamilyConflict

OrganizationalSupport

Work-FamilyConflict

Partner 1

Partner 2

Page 9: Introduction to Multi-Level Modeling

Multi-Level Theoretical Models:N-Level Modeling (V-C)

Self-EfficacyIndividual

Performance

Team Empowerment

GroupPerformance

Level 2: Group-Level

Level 1: Individual-Level

Page 10: Introduction to Multi-Level Modeling

Multi-Level Theoretical Models:Multi-Level Modeling (V-C)

Self-EfficacyIndividual

Performance

TeamEmpowerment

GroupPerformance

Level 2: Group-Level

Level 1: Individual-Level

Page 11: Introduction to Multi-Level Modeling

Multi-Level Theoretical Models:Cross-Level Modeling (RCM)

Self-EfficacyIndividual

Performance

TeamEmpowerment

Level 2: Group-Level

Level 1: Individual-Level

Page 12: Introduction to Multi-Level Modeling

Multi-Level Theoretical Models:Frog-in-Pond Modeling (RCM)

Self-EfficacyRelative to

Group Mean

Individual Performance

TeamEmpowerment

Level 2: Group-Level

Level 1: Individual-Level

Page 13: Introduction to Multi-Level Modeling

Multi-Level Modeling: Levels

Organization (School)—Group (Class) Group (Class)—Individual Organization (School)—Individual Individual—Time

Two levels, three levels, four levels, more!

Page 14: Introduction to Multi-Level Modeling

What do they have in common? Hierarchical Non-randomly Nested

Implies sampling method that begins at the highest level

Random nesting (e.g. experimental conditions) does not violate independence of observations assumptions

Page 15: Introduction to Multi-Level Modeling

Why do we care?

Nesting violates independence of observations Results in heteroscedasticity Violates assumptions of “regular” regression Results in incorrectly estimated standard errors, and

consequently, “wrong” results Thus multi-level modeling more powerful and more

honest

Page 16: Introduction to Multi-Level Modeling

Multi-Level Statistical Models Must justify use of multi-level modeling

techniques Group-level variables (e.g. same supervisor,

same organization) Justification for aggregation

Theoretical Design and Measurement (referent) Statistical (ICC1, ICC2, rwg, AD, others)

Page 17: Introduction to Multi-Level Modeling

Multi-Level Statistical Models Regression-Based

Random Coefficient Modeling Variance-Covariance-Based

Latent Growth Analysis (in SEM)

Page 18: Introduction to Multi-Level Modeling

RCM vs. VC

RCM Path analysis harder Measured variables Easy to include

interactions Hard to include

correlated errors Copes with missing data Easy to add levels

VC Factor analysis Path analysis easy Latent variables Interactions harder Different errors easy to

add Harder to cope with

missing data Harder to add levels Goodness of fit

information Model suggestions to

improve fit

Page 19: Introduction to Multi-Level Modeling

Random Coefficient Modeling Random Effects

Experimental conditions (e.g. for medicine) Fixed: Can infer about treatments used in the experiment Random: For purposes of generalization

Random Variables Fixed: Variable with values that are known (e.g. gender) Random: Variable with values selected from a probability

distribution and are measured with error (e.g. IQ)

Page 20: Introduction to Multi-Level Modeling

Random Coefficient Modeling Random Coefficients

Fixed: Coefficients (e.g. slopes or intercepts) do not vary across people/teams/etc.

Random: Coefficients in which values estimated are assumed to be distributed as a probability function

Random coefficients do NOT correspond to random effects or variables Equations versus design/experimental

manipulations

Page 21: Introduction to Multi-Level Modeling

Random Coefficient Modeling Extension of the general/generalized linear

model Outcome of interest measured at the lowest

level “Intercepts as outcomes” “Slopes as outcomes”

Page 22: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Intercepts as OutcomesH1: Group team empowerment impacts average individual performance

Self-EfficacyIndividual

Performance

TeamEmpowerment

Level 2: Group-Level

Level 1: Individual-Level

Page 23: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Intercepts as OutcomesH1: Group team empowerment impacts average individual performance

Individual Self-Efficacy

Indi

vidu

al P

erfo

rman

ce

Page 24: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Intercepts as OutcomesH1: Group team empowerment impacts average individual performance

Individual Self-Efficacy

Indi

vidu

al P

erfo

rman

ce

Team 1 has high team empowerment, and thus, higher average individual performance than Team 5

Page 25: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Intercepts as OutcomesH1: Group team empowerment impacts average individual performance

Individual Self-Efficacy

Indi

vidu

al P

erfo

rman

ce

However, Team 1 and Team 5 exhibit the SAME relationship between individual self-efficacy and performance

Page 26: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Slopes as OutcomesH1: Group team empowerment moderates the relationship between self-

efficacy and individual performance

Self-EfficacyIndividual

Performance

TeamEmpowerment

Level 2: Group-Level

Level 1: Individual-Level

Page 27: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Intercepts as OutcomesH1: Group team empowerment moderates the relationship between self-efficacy

and individual performance

Individual Self-Efficacy

Indi

vidu

al P

erfo

rman

ce

Page 28: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Intercepts as OutcomesH1: Group team empowerment moderates the relationship between self-efficacy

and individual performance

Individual Self-Efficacy

Indi

vidu

al P

erfo

rman

ce

Team 1 has high team empowerment, BUT still the same average individual performance as Team 5

Page 29: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Intercepts as OutcomesH1: Group team empowerment moderates the relationship between self-efficacy

and individual performance

Individual Self-Efficacy

Indi

vidu

al P

erfo

rman

ce

However, for the high team empowerment team, Team 1, self-efficacy is positively related to performance, whereas for the low- team empowerment, Team 5, self-efficacy is negatively related to performance

Page 30: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Slopes and Intercepts as OutcomesH1: Group team empowerment moderates the relationship between self-

efficacy and individual performance AND impacts average individual performance directly

Self-EfficacyIndividual

Performance

TeamEmpowerment

Level 2: Group-Level

Level 1: Individual-Level

Page 31: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Slopes and Intercepts as OutcomesH1: Group team empowerment moderates the relationship between self-efficacy

and individual performance AND impacts average individual performance directly

Individual Self-Efficacy

Indi

vidu

al P

erfo

rman

ce

Page 32: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Slopes and Intercepts as OutcomesH1: Group team empowerment moderates the relationship between self-efficacy

and individual performance AND impacts average individual performance directly

Individual Self-Efficacy

Indi

vidu

al P

erfo

rman

ce

Team 1 has high team empowerment, and, consequently, a higher average individual performance than Team 5

Page 33: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Slopes and Intercepts as OutcomesH1: Group team empowerment moderates the relationship between self-efficacy

and individual performance AND impacts average individual performance directly

Individual Self-Efficacy

Indi

vidu

al P

erfo

rman

ce

Additionally, for the high team empowerment team, Team 1, self-efficacy is positively related to performance, whereas for the low- team empowerment team, Team 5, self-efficacy is negatively related to performance

Page 34: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Word of Warning Large models can be unstable

Small changes in the model may result in large changes in the result of the analysis

Might be due to multicollinearity in cross-level interactions/high correlations in parameter estimates

Mainly a problem when few observations on the highest level

Unbalanced samples may have too-small estimated standard errors Makes hypothesis tests too liberal

Except for fixed coefficients, your df is tied to the number of observations at the highest level of predictor variables

Page 35: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Software Available

Intercept? Equations Estimation Missing Data

Cross-Classification

HLM Yes One for each level

RML is default (ML)

Level 1 only (Pair-wise)

Yes

MLwiN X1 = 1 One for each level

MCMC is default (ML/QML)

Imputes with each iteration

Yes—easier (no level spec. requirement)

SAS Proc

Mixed

Optional, default is yes

One RML Pair-wise Yes

SPSS Mixed

Optional, default is no

Point-and-click or One

RML (ML) Pair-wise Yes

Page 36: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Software Available

# Datafiles # Levels Random Effects?

Centering GUI?

HLM Multiple Up to 3 Level 2 and above assumed fixed

Group and Grand

Yes

MLwiN 1 Unlimited (Default = 5)

Yes, can specify

Done Manually

Yes

SAS 1 Unlimited Yes, can specify

Done Manually

No

SPSS Mixed

1 Up to 3 Yes, can specify

Done Manually

Yes

Page 37: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Assumptions

Observations at highest level are independent Linear models Level 1—normal random errors Level 2—multivariate normal random errors Level 1(2) predictors are independent of Level 1(2)

residuals Variance of residual errors is the same at all levels Variances of residual errors is the same across units at

Level 1 Independent errors across and within levels

Page 38: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Options

RCM Multivariate RCM 2- or 3- levels Cross-classified models (2 levels only)

Page 39: Introduction to Multi-Level Modeling

HLM: Options

Page 40: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Data preparation

“Down” format Sort in ascending order

Names truncate to 8 letters ID variables on all levels No missing data on higher levels

Page 41: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Preparation

Page 42: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Preparation

Page 43: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Preparation

Page 44: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Preparation

Page 45: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Preparation

Page 46: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Preparation

Page 47: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Preparation

Page 48: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Preparation

Page 49: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Preparation

Page 50: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Everyone with HLM on their machine, please

open it now.

Page 51: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 52: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 53: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 54: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 55: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 56: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 57: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 58: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 59: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 60: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 61: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Centering

None Grand-mean:

Just re-centering Makes intercept more meaningful Helps reduce multicollinearity

Group-mean: For frog-in-pond effect studies Completely different model

Page 62: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 63: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 64: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Model Specification

Different equations for each level and coefficient Level 1: Yij = β0j + βjXij + rij

Level 2: β0j = γ00 + u0j

β1j = γ10 + uij

Other programs would write the same model in one statement: Yij = γ00 + γ10Xij + γ11Xij + u0j + u1jXij + rij

Page 65: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 66: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM Robust Standard Errors

If response variable does not have normal distribution

AND N >= 100

Page 67: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 68: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 69: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 70: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 71: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 72: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 73: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 74: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Analysis

Page 75: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 76: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 77: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 78: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 79: Introduction to Multi-Level Modeling

Random Coefficient Modeling: HLM

Page 80: Introduction to Multi-Level Modeling

Random Coefficient Modeling: Major Points Multi-level modeling always comes back to theory and

measurement For some variables, need to justify aggregation Hierarchical designs should employ hierarchical

statistics for maximum power and more accurate analyses

Random coefficients are NOT random effects Your df is only as high as the number of units at the

highest level of your predictors HLM software is not the only (or even the best!)

available

Page 81: Introduction to Multi-Level Modeling

ReferencesBooks

Bryk, A. S., & Raudenbush, S. W. (1992). Hierarchical linear models: Applications and data analysis methods. Newbury Park, CA: Sage.

Hox, J. (2002). Multilevel analyses: Techniques and applications. Mahwah, NJ: Erlbaum.

Kreft, I. & de Leeuw, J. (1998). Introducing Multilevel Modeling. London: Sage.

Page 82: Introduction to Multi-Level Modeling

ReferencesHow-To and General Reference Articles

Bliese, P.D. & Ployhart, R. E. (2002). Growth modeling using random coefficient models: Model building, testing, and illustrations. Organizational Research Methods, 5, 362-387.

Bliese, P. D., & Hanges, P. J. (2004). Too liberal and too conservative: The perils of treating grouped data as though they were independent. Organizational Research Methods, 7, 400-417.

Hofmann, D. A. (1997). An overview of the logic and rationale of hierarchical linear models. Journal of Management, 23, 723-744.

Kreft, I. G. G., de Leeuw, J., & Aiken, L. S. (1995). The effect of different forms of centering in hierarchical linear models. Multivariate Behavioral Research, 30, 1-21.

Page 83: Introduction to Multi-Level Modeling

ReferencesHow-To and General Reference Articles

Kreft, I. G. G., de Leeuw, J., & van der Leeden, R. (1994). Review of five multilevel analysis programs: BMDP-5V, GENMOD, HLM, ML3, and VARCL. The American Statistician, 48, 324-335.

Singer, J. D. (1998). Using SAS PROC MIXED to fit multi-level models, hierarchical models, and individual growth models. Journal of Educational and Behavioral Statistics, 23, 323-355.

Zhou, X., Perkins, A. J., & Hui, S. L. (1999). Comparisons of software packages for generalized linear multilevel models. The American Statistician, 53, 282-290.

Page 84: Introduction to Multi-Level Modeling

ReferencesTheoretical Issues

Chan, D. (1998). Functional relations among constructs in the same content domain at different levels of analysis: A typology of composition models. Journal of Applied Psychology, 83, 234-246.

Klein, K. J. & Kozlowski, S. W. (2000). From micro to meso: Critical steps in conceptualizing and conducting multilevel research. Organizational Research Methods, 3, 211-236.

Morgeson, F. P. & Hofmann, D. A. (1999). The structure and function of collective constructs: Implications for multilevel research and theory development. Academy of Management Review, 24, 249-265.

Ostroff, C. (1993). Comparing correlations based on individual-level and aggregated data. Journal of Applied Psychology, 78, 569-582.

Page 85: Introduction to Multi-Level Modeling

ReferencesEmpirical Examples

Atwater, L., Wang, M., Smither, J. W., & Fleenor, J. W. (2009). Are cultural characteristics associated with the relationship between self and others’ ratings of leadership? Journal of Applied Psychology, 94, 876-886. (Uses MPLUS)

Chen, G., Kirkman, B. L., Kanfer, R., Allen, D., & Rosen, B. (2007). A multilevel study of leadership, empowerment, and performance in teams. Journal of Applied Psychology, 92, 331-346. (Uses S-PLUS/R)

Klein, K. J., Lim, B., Saltz, J. L., & Mayer, D. M. (2004). How do they get there? An examination of the antecedents of centrality in team networks. Academy of Management Journal, 47, 952-963. (Uses SAS)

Liao, H., & Chuang, A. (2004). A multilevel investigation of factors influencing employee service performance and customer outcomes. Academy of Management Journal, 47, 41-58. (Uses HLM)

Page 86: Introduction to Multi-Level Modeling

Thank You!

Paul Hanges, Mo Wang, Kevin O’Grady, and DaSAL consultants (Marsha Sargeant, Laura Sherman, Brandi Stupica, and Tracy Tomlinson) for comments and suggestions

Songqi Liu for theoretical & empirical references

Everyone for coming out today!