experimental design

I

1

I

Experimental and Quasi-Experimental Designs for Generalized Causal Inference

I I

,

I

i

I

II

i

I

' I

I I

II

To Donald T. Campbell and Lee J. Cronbach, who helped us to understand science, and to our wives, Betty, Fay, and Barbara, who helped us to understand life.

Editor-in-Chief: Kathi Prancan Senior Sponsoring Editor: Kerry Baruth Associate Editor: Sara Wise Associate Project Editor: Jane Lee Editorial Assistant: Martha Rogers Production Design Coordinator: Lisa Jelly Senior Manufacturing Coordinator: Marie Barnes Executive Marketing Manager: Ros Kane Marketing Associate: Caroline Guy Copyright 2002 Houghton Mifflin Company. All rights reserved. No part of this work may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying and recording, or by any information storage or retrieval system without the prior written permission of the copyright owner unless such copying is expressly permitted by federal copyright law. With the exception of nonprofit transcription in Braille, Houghton Mifflin is not authorized to grant permission for further uses of copyrighted selections reprinted in this text without the permission of their owners. Permission must be obtained from the individual copyright owners as identified herein. Address requests for permission to make copies of Houghton Mifflin material to College Permissions, Houghton Mifflin Company, 222 Berkeley Street, Boston, MA 02116-3764. Printed in the U.S.A. Library of Congress Catalog Card Number: 2001131551 ISBN: 0-395-61556-9 9-MV-08

ContentsPrefaceXV

1. EXPERIMENTS AND GENERALIZED CAUSAL INFERENCEExperiments and CausationDefining Cause, Effect, and Causal Relationships Causation, Correlation, and Confounds Manipulable and Nonmanipulable Causes Causal Description and Causal Explanation

!

133

7 7 9

Modern Descriptions of ExperimentsRandomized Experiment Quasi-Experiment Natural Experiment Nonexperimental Designs

1213 13

17 18

Experiments and the Generalization of Causal ConnectionsMost Experiments Aie Highly Local But Have General Aspirations Construct Validity: Causal Generalization as Representation External Validity: Causal Generalization as Extrapolation Approaches to Making Causal Generalizations

1818 20 21 22

Experiments and Metascience ..The Kuhnian Critique Modern Social Psychological Critiques Science and Trust Implications for Experiments

26 27 28 28 29

A World Without Experiments or Causes?

31

v

vi

I

CONTENTS

2. STATISTICAL CONCLUSION VALIDITY AND INTERNAL VALIDITYValidityA Validity Typology Threats to Validity

33 3437 39

Statistical Conclusion ValidityReporting Results of Statistical Tests of Covariation Threats to Statistical Conclusion Validity The Problem of Accepting the Null Hypothesis

4242 45 52

Internal ValidityThreats to Internal Validity Estimating Internal Validity in Randomized Experiments and Quasi-Experiments

5354 61

The Relationship Between Internal Validity and Statistical Conclusion Validity

63

3. CONSTRUCT VALIDITY AND EXTERNAL VALIDITYConstruct ValidityWhy Construct Inferences Are a Problem Assessment of Sampling Particulars Threats to Construct Validity Construct Validity, Preexperimental Tailoring, and Postexperimental Specification

64 6466 69 72 81

External ValidityThreats to External Validity Constancy of Effect Size Versus Constancy of Causal Direction Random Sampling and External Validity Purposive Sampling and External Validity

8386 90 91 92

More on Relationships, Tradeoffs, and PrioritiesThe Relationship Between Construct Validity and External Validity The Relationship Between Internal Validity and Construct Validity Tradeoffs and Priorities

9393 95 96

Summary

102

CONTENTS

I vii

4. QUASI-EXPERIMENTAL DESIGNS THAT EITHER LACK A CONTROL GROUP OR LACK PRETEST OBSERVATIONS ON THE OUTCOME 103The Logic of Quasi-Experimentation in Brief Designs Without Control GroupsThe One-Group Posttest-Only Design The One-Group Pretest-Posttest Design The Removed-Treatment Design The Repeated-Treatment Design

104 106106 108 111 113

Designs That Use a Control Group But No PretestPosttest-Only Design With Nonequivalent Groups Improving Designs Without Control Groups by Constructing Contrasts Other Than With Independent Control Groups The Case-Control Design

115115 125 128

Conclusion

134

5. QUASI-EXPERIMENTAL DESIGNS THAT USE BOTH CONTROL GROUPS AND PRETESTSDesigns That Use Both Control Groups and :PretestsThe Untreated Control Group Design With Dependent Pretest and Posttest Samples Matching Through Cohort Controls

135 136136 148

Designs That Combine Many Design ElementsUntreated Matched Controls With Multiple Pretests and Posttests, Nonequivalent Dependent Variables, and Removed and Repeated Treatments Combining Switching Replications With a Nonequivalent Control Group Design An Untreated Control Group With a Double Pretest and Both Independent and Dependent Samples

153

153 154 154

The Elements of DesignAssignment Measurement Comparison Groups Treatment Design Elements and Ideal Quasi-Experimentation

15 6156 158 159 160 160

..I I I ' I

viii

J

CONTENTS

Conclusion Appendix5.1: important Developments in Analyzing Data From Designs With Nonequivalent GroupsPropensity Scores and Hidden Bias Selection Bias Modeling Latent Variable Structural Equation Modeling

161

161162 166 169

6. QUASI-EXPERIMENTS: INTERRUPTED TIME-SERIES DESIGNSWhat Is a Time Series?Describing Types of Effects Brief Comments on Analysis

171 172172 174

Simple Interrupted Time SeriesA Change in Intercept A Change in Slope Weak and Delayed Effects The Usual Threats to Validity

175175 176 178 179

Adding Other Design Features to the Basic Interrupted Time SeriesAdding a Nonequivalent No-Treatment Control Group Time Series Adding Nonequivalent Dependent Variables Removing the Treatment at a Known Time Adding Multiple Replications Adding Switching Replications

181182 184 188 190 192

Some Frequent Problems with Interrupted Time-Series DesignsGradual Rather Than Abrupt Interventions Delayed Causation Short Time Series Limitations of Much Archival Data

195196 197 198 203

A Comment on Concomitant Time Series Conclusion

205 206 207 208208

7. REGRESSION DISCONTINUITY DESIGNSThe Basics of Regression DiscontinuityThe Basic Structure

CONTENTS

I ix

Examples of Regression Discontinuity Designs Structural Requirements of the Design Variations on the Basic Design

212 216 219

Theory of the Regression Discontinuity DesignRegression Discontinuities as Treatment Effects in the Randomized Experiment Regression Discontinuity as a Complete Model of the Selection Process

220221 224

Adherence to the CutoffOverrides of the Cutoff Crossovers and Attrition Fuzzy Regression Discontinuity

22 7227 228 229

Threats to ValidityRegression Discontinuity and the Interrupted Time Series Statistical Conclusion Validity and Misspecification of Functional Form Internal Validity

229229 230 ?37

Combining Regression Disconfinuity and Randomized Experiments Combining Regression Discontinuity and I. Quasi-Experiments Regression Discontinuity-E:J'periment or Quasi-Experiment? Appendix 7.1: The Logic of Statistical Proofs alwut Regression Discontinuity

238 241 242 243

8. RANDOMIZED EXPERIMENTS: RATIONALE, DESIGNS, AND CONDITIONS CONDUCIVE TO DOING THEMThe Theory of Random AssignmentWhat Is Random Assignment? Why Randomization Works Random Assignment and Units of Randomization The Limited Reach of Random Assignment

246 24 7248 248 253 256

Some Designs Used with Random AssignmentThe Basic Design The Pretest-Posttest Control Group Design Alternative-Treatments Design with Pretest

257257 261 261

~.. I'

X

I CONTENTSMultiple Treatments and Controls with Pretest Factorial Designs Longitudinal Designs Crossover Designs 262 263 266 268

Conditions Most Conducive to Random AssignmentWhen Demand Ou!strips Supply When an Innovation Cannot Be Delivered to All Units at Once When Experimental Units Can Be Temporally Isolated: The Equivalent-Time-Samples Design When Experimental Units Are Spatially Sep;lrated or I11terunit Communication Is Low When Change Is Mandated and Solutions Are Acknowledged to Be Unknown When a Tie Can Be Broken or Ambiguity About Need Can Be Resolved When Some fersons Express No Preference Among Alternatives When You Can Create Your Own Organization When You Have Control over Experimental Units When Lotteries Are Expected

269269 270 270 271 272 273 273 274 274 2 75

When Random Assignment Is Not Feasible or Desirable Discussion

276 277

9. PRACTICAL PROBLEMS 1: ETHICS, PARTICIPANT RECRUITMENT, ~ND RANDOM ASSIGNMENTEthical and Legal Issues with ExperimentsThe Ethics of Experimentation Withholding a Potentially Effective Treatment The Ethics of Random Assignment Discontinuing Experiments for Ethical Reasons Legal Problems in Experiments

279 280281 283 286 289 290

Recruiting Participants to Be in t}le Experiment Improving the Random Assignmer.t ProcessMethods of Randomization What to Do If Pretest Means Differ Matching and Stratifying Matching and Analysis of Covariance The Human Side of Random As~ignment

292 294294 303 304 306 307

Conclusion

311

CONTENTS

I xi

Appendix 9.1: Random Assignment by ComputerSPSS and SAS World Wide Web Excel

311311 313 313

10. PRACTICAL PROBLEMS 2: TREATMENT IMPLEMENTATION AND ATTRITIONProblems Related to Treatment ImplementationInducing and Measuring Implementation Analyses Taking Implementation into Account

314 315315 320

Post-Assignment AttritionDefining the Attrition Problem Preventing Attrition Analyses of Attrition

323323 324 334

Discussion

340

11. GENERALIZED CAUSAL INFERENCE: A GROUNDED THEORYThe Received View of Generalized Causal Inference: Formal SamplingFormal Sampling of Causes and Effects Formal Sampling of Persons and Settings Summary

341 342344 346 348

A Grounded Theory of Generalized Causal. InferenceExemplars of How Scientists Make Generalizations Five Principles of Generalized Causal Inferences The Use of Purposive Sampling Strategies Applying the Five Principles to Construct and External Validity Shoulg Experimenters Apply These Principles to All Studies? Prospective and Retrospective Uses of These Principles

348349 353 354 356 371 372

Discussion

373

12. GENERALIZED CAUSALJNFERENCE: METHODS FOR SINGLE STUDIES 374Purposive Sampling and Generalized Causal InferencePurposive Sampling of Typical Instances Purposive Sampling of Heterogeneous Instances

374375 376

I I II' ,,

xii

I

CONTENTS

Purposive Sampling and the First Four Principles Statistical Methods for Generalizing from Purposive Samples

378 386

Methods for Studying Causal ExplanationQualitative Methods Statistical Models of Causal Explanation Experiments That Manipulate Explanations

389389 392 414

Conclusion

415

13. GENERALIZED CAUSAL INFERENCE: METHODS 417 FOR MULTIPLE STUDIESGeneralizing from Single Versus Multiple Studies Multistudy Programs of ResearchPhased Models of Increasingly Generalizable Studies Directed Programs of Experiments

418 418419 420

Narrative Reviews of Existing ResearchNarrative Reviews of Experiments Narrative Reviews Combining Experimental and Nonexperimental Research Problems with Narrative Reviews

421422 423 423

Quantitative Reviews of Existing ResearchThe Basics of Meta-Analysis Meta-Analysis and the Five Principles of Generalized Causal Inference Discussion of Meta-Analysis

425426 435 445

Appendix 13.1: Threats to the Validity of Meta-AnalysesThreats to Inferences About the Existence of a Relationship Between Treatment and Outcome Threats to Inferences About the Causal Relationship Between Treatment and Outcome Threats to Inferences About the Constructs Represented in Meta-Analyses Threats to Inferences About External Validity in Meta-Analyses

44644 7 450 453 454

14. A CRITICAL ASSESSMENT OF OUR ASSUMPTIONSCausation and ExperimentationCausal Arrows and Pretzels Epistemological Criticisms of Experiments Neglected Ancillary Questions

456 453453 459 461

CONTENTS

/ xiii

ValidityObjections to Internal Validity Objections Concerning the Discrimination Between Construct Validity and External Validity Objections About the Completeness of the Typology Objections Concerning the Nature of Validity

462462 466 473 475

Quasi-ExperimentationCriteria for Ruling Out Threats: The Centrality of Fuzzy Plausibility Pattern Matching as a Problematic Criterion The Excuse Not to Do a Randomized Experiment

484484 485 486

Randomized ExperimentsExperiments Cannot Be Successfully Implemented Experimentation Needs Strong Theory and Standardized Treatment Implementation Experiments Entail Tradeoffs Not Worth Making Experiments Assume an Invalid Model of Research Utilization The Conditions of Experimentation Differ from the Conditions of Policy Implementation Imposing Treatments Is Fundamentally Flawed Compared with Encouraging the Growth of Local Solutions to Problems

488488 489 490 493 495 497

Causal Generalization: An Overly Complicated Theory? Nonexperimental AlternativesIntensive Qualitative Case Studies Theory-Based Evaluations Weaker Quasi-Experiments Statistical Controls

498 499500 501 502 503

Conclusion Glossary References Name Index Subject Index

504 505 514 593 609

Preface

a book for those who have already decided that identifying a dependable relationship between a cause and its effects is a high priority ~nd who wish to consider experimental methods for doing so. Such causal relationships are of great importance in human affairs. The rewards associated with being correct in identifying causal relationships can be high, and the costs of misidentification can be tremendous. To know whether increased schooling pays,off in later life happiness or in increased lifetime earnings is a boon to individuals facing the decision about whether to spend more time in school, and it also helps policymakers determine how much financiaJ support to give educational ipstitutions. In health, from the earliest years of hmnan existence, causation has heiped to identify which strategies are effective in dealing with disease. In pharmaq:>logy, divinations and reflections on experience in the remote past sometimes led to 'the development of many useful treatments, but other judgments about effective plants and ways of placating gods were certainly more incorrect than correct and presumably contributed to many unnecessary deaths. The utility of finding such causal connections is so widely understood that much effort goes to locating them in both human affairs in general and in science in particular. However, history also teaches us that it is rare for those causes to be so universally tme that they hold under all conditions with all types of people and at all historical time periods. All causal statements are inevitably contingent. Thus, although threat from an out-group often causes in-group cohe~ion, this is not always the case. For instance, in 1492 the king of Granada had to watch as his Moorish subjects left the city to go to their ancestral homes in North Africa, being unwilling to fight against the numerically superior troops of the Catholic kings of Spain who were lodgeq in Santa Fe de la Frontera nearby. Here, the external threat from the out-group of Christian Spaniards led not to increased social cohesion among the Moslem Spaniards but rather to the latter's disintegration as a defensive force. Still, some causal hypotheses are more contingent than others. It is of obvious utility to learn as much as one can about those contingencies and to identify those relationships that hold more consistently. For instance, aspirin is such a wonderful drug because it reduces the symptoms associated with many different kinds of illness, including head colds, colon cancer, eliei th:tt design eleJucnts .'I:Ui.:h a.'ll coutror hrrou ps or pret{:!'.ts are mtdcsirahle or unnec:es.'llaly-c\'Cn when desc.ripthc causal infen:t1ce is the highest priority. We wanl' to ca~t duuht ut1

such beliefs by showing th~ co.rs ro iuteru:tl v:tlidit y that such dtsigns can incur so that re.earchers c:1n choose whether to incur these costs given their other resca rch. pri()ritics, Second, :'"'e usc these clcsih.,,s to illu.'lltrat(: how various ..,a lidity d1rcais operate ~:ith :tctuaJ c.xarnp1c.'ll, for it is rnon: jmportant to learn how LO tnink ~ritically :tbout these threats than it is to learn a list of desigr

experimental design

Documents

generalized

statistical

external validity

internal validity

control groups

causal generalization

narrative

construct