The Surrogate Index: Combining Short-Term Proxies to Estimate Long-Term Treatment Effects More Rapidly and Precisely

Susan Athey, Stanford
Raj Chetty, Harvard
Guido Imbens, Stanford
Hyunseung Kang, UW-Madison

November 2019



Page 2: Problem: Estimating Long-Term Impacts of Interventions

Estimating long-term impacts of treatments is central in many fields, from economics to marketing

[Diagram: treatment W → long-term outcome Y. Examples: class size → lifetime earnings; marketing → long-term revenue]

Two key challenges in estimating long-term treatment effects using conventional experimental/quasi-experimental methods

1. Long delays in observing impacts

2. Experimental estimates are often very imprecise

Page 3: Using Short-Term Outcomes as Proxies

One intuitive solution: use short-term proxies to predict long-term impacts

Estimate effect of treatment on an intermediate outcome S

Regress Y on S in observational data and multiply treatment effect on S by this regression coefficient to predict long-term impact

This is common in the social sciences…

[Diagram: W → S → Y. Example W: class size; example S: test scores, earnings in mid-20s; example Y: lifetime earnings]
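The two-step proxy calculation described on this slide can be sketched in Python. This is only an illustration on simulated data: the function name `proxy_long_term_effect`, the sample sizes, and every number in the simulation are hypothetical and do not come from the slides.

```python
import numpy as np

def proxy_long_term_effect(w_exp, s_exp, s_obs, y_obs):
    """Predict the long-term effect as (effect of W on S) x (slope of Y on S)."""
    # Step 1: experimental difference in means for the intermediate outcome S.
    effect_on_s = s_exp[w_exp == 1].mean() - s_exp[w_exp == 0].mean()
    # Step 2: slope from a simple regression of Y on S in observational data.
    slope, _intercept = np.polyfit(s_obs, y_obs, deg=1)
    return effect_on_s * slope

rng = np.random.default_rng(0)
n = 20_000

# Hypothetical experimental sample: treatment raises S by 5 units.
w_exp = rng.integers(0, 2, size=n)
s_exp = 50 + 5 * w_exp + rng.normal(0, 10, size=n)

# Hypothetical observational sample: Y rises by 100 per unit of S.
s_obs = rng.normal(50, 10, size=n)
y_obs = 15_000 + 100 * s_obs + rng.normal(0, 2_000, size=n)

estimate = proxy_long_term_effect(w_exp, s_exp, s_obs, y_obs)
print(f"predicted long-term effect: {estimate:.0f}")  # near 5 * 100 in this simulation
```

The prediction is only as good as the assumption that S carries the whole effect of W on Y, which is the issue the following slides take up.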

Page 4: Predicting Earnings from Early Childhood Test Scores

[Figure: Predicting Earnings from Early Childhood Test Scores. x-axis: Kindergarten Test Score Percentile (0 to 100); y-axis: Mean Wage Earnings from Age 25-27 ($); annotation: Slope = $154]

Page 5: Predicting Lifetime Earnings Impacts Using Treatment Effect Estimates on Earnings in Early Adulthood

[Figure: x-axis: Age (20 to 70); y-axis: Annual Earnings (20k to 60k); series: Mean Earnings by Age in Cross-Section, Prediction assuming constant % impact on earnings; annotation: Estimated Treatment Effect at Ages 26-27: $6k]

Page 6: Potential Solution: Surrogates

Prentice (1989) formalized this approach in biostatistics, labeling an intermediate outcome a surrogate if Y is independent of W conditional on S

[Diagram: W → S → Y. Examples of W: class size, neighborhoods; of S: test scores, earnings in mid-20s; of Y: lifetime earnings, life expectancy]

Problem: validity of this assumption is often unclear in applications

Do test scores fully capture impacts on earnings by themselves?

Do short-term impacts on earnings accurately reflect lifetime earnings impacts?
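The surrogacy condition attributed to Prentice (1989) on this slide can be written out as follows. The display is a sketch in generic notation (F denotes the conditional distribution of S given W, a symbol not used on the slides), and the linear special case is included only to connect the condition to the "multiply by the regression coefficient" recipe.

```latex
% Surrogacy: the long-term outcome is independent of treatment given the surrogate.
\[
  Y \perp\!\!\!\perp W \mid S
\]
% Consequence: the effect of W on Y runs entirely through the distribution of S.
\[
  \mathbb{E}[Y \mid W = 1] - \mathbb{E}[Y \mid W = 0]
  = \int \mathbb{E}[Y \mid S = s]\,
    \bigl( dF(s \mid W = 1) - dF(s \mid W = 0) \bigr)
\]
% Linear special case: if E[Y | S = s] = a + b s, the effect on Y equals
% b times the effect on S.
\[
  \mathbb{E}[Y \mid W = 1] - \mathbb{E}[Y \mid W = 0]
  = b \,\bigl( \mathbb{E}[S \mid W = 1] - \mathbb{E}[S \mid W = 0] \bigr)
\]
```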

Page 7: This Paper: Combining Multiple Short-Term Proxies

How can we estimate long-term treatment effects when we don’t necessarily have a valid surrogate?

We show how we can make progress on these issues in the era of big data, where we typically have many intermediate outcomes, not just one potential surrogate

Rather than debating whether any one variable is a valid statistical surrogate, combine many short-term proxies to create a “surrogate index”

Combining many variables makes it more likely that we span all the causal pathways from treatment to long-term outcome

Page 8: Combining Multiple Surrogates

[Diagram: W → S1, S2, S3 → Y]

Page 9: This Paper

Simple idea: form predicted value of long-term outcome using multiple surrogates (e.g., via linear regression) and estimate treatment effects on that predicted value

This can allow us to estimate long-term treatment effects more quickly and more precisely (smaller standard errors)

Approach is intuitive, but most work still uses a single variable as a candidate surrogate
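A minimal sketch of the "simple idea" on this slide, using ordinary least squares on simulated data. The helper names (`fit_surrogate_index`, `predict_index`, `surrogate_index_effect`) and the coefficients `beta` and `delta` are hypothetical choices made for the illustration, not part of the slides.

```python
import numpy as np

def fit_surrogate_index(s_obs, y_obs):
    """OLS of Y on the surrogates (with intercept) in the observational sample."""
    design = np.column_stack([np.ones(len(s_obs)), s_obs])
    coef, *_ = np.linalg.lstsq(design, y_obs, rcond=None)
    return coef

def predict_index(coef, s):
    """Predicted long-term outcome (the surrogate index) for surrogates s."""
    return coef[0] + s @ coef[1:]

def surrogate_index_effect(coef, w_exp, s_exp):
    """Difference in mean surrogate index between treatment and control."""
    index = predict_index(coef, s_exp)
    return index[w_exp == 1].mean() - index[w_exp == 0].mean()

rng = np.random.default_rng(1)
n = 50_000
beta = np.array([2.0, -1.0, 0.5])   # hypothetical effect of each surrogate on Y
delta = np.array([1.0, 0.5, 2.0])   # hypothetical effect of W on each surrogate

# Observational sample: surrogates and the long-term outcome are observed.
s_obs = rng.normal(size=(n, 3))
y_obs = 10 + s_obs @ beta + rng.normal(size=n)

# Experimental sample: treatment and surrogates only, no long-term outcome.
w_exp = rng.integers(0, 2, size=n)
s_exp = rng.normal(size=(n, 3)) + np.outer(w_exp, delta)

coef = fit_surrogate_index(s_obs, y_obs)
tau_hat = surrogate_index_effect(coef, w_exp, s_exp)
true_tau = float(delta @ beta)  # 1.0*2.0 + 0.5*(-1.0) + 2.0*0.5 = 2.5
print(f"surrogate index estimate: {tau_hat:.2f}, simulated truth: {true_tau:.2f}")
```

In this simulation W affects Y only through the three surrogates, so the estimate recovers the simulated effect without ever observing Y in the experimental sample.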

Page 10: This Paper

Contributions of this paper:

1. [Identification] Formalize assumptions required for identification using surrogate index

2. [Bias] Bound bias from violations of these assumptions and show how they can be validated

3. [Precision] Characterize gains in precision from using surrogate index instead of long-term outcome

4. [Application] Apply method to show practical value of combining proxies for problems we work on

Illustrate method and key results primarily focusing on empirical application here

Page 11: Setup

Assume researcher has two different datasets:

Experimental dataset (E): data on W (treatment) and S (intermediate outcome), with W randomly assigned

Example: Tennessee STAR experiment that varied class size randomly

Observational dataset (O): data on S and Y (long-term outcome), and possibly W, with W not randomly assigned

Example: standard school district dataset linked to long-term outcome data

Page 12: The Surrogate Index

Surrogate index is the conditional expectation of long-term outcome given the intermediate outcomes (and any pre-treatment covariates) in the observational dataset

In a linear model, can be estimated as the predicted value from a regression of the long-term outcome on the intermediate outcomes
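The verbal definition on this slide can be written as a formula. The notation below is assumed for illustration: X for pre-treatment covariates, P for the sample indicator (E or O, matching the dataset labels in the setup), and h_O for the index itself.

```latex
% Surrogate index: conditional mean of Y given surrogates and covariates,
% evaluated in the observational sample (P = O).
\[
  h_O(s, x) \;=\; \mathbb{E}\bigl[\, Y \mid S = s,\; X = x,\; P = O \,\bigr]
\]
% Linear special case: the fitted value from regressing Y on S and X in sample O.
\[
  h_O(s, x) \;=\; \alpha + \beta^{\top} s + \gamma^{\top} x
\]
```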

Page 13: Identification Using the Surrogate Index

Treatment effect on the surrogate index in the experimental sample is an unbiased estimate of treatment effect on the long-term outcome under three assumptions:

Assumption 1 (Unconfounded Treatment Assignment)

Assumption 2 (Surrogacy)

Assumption 3 (Comparability)
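The formal statements of the three assumptions did not survive in this transcript. One common way to write conditions with these names is sketched below; the notation (potential outcomes Y(w) and S(w), covariates X, sample indicator P, index h_O) is assumed here and may differ from the authors' own.

```latex
% Assumption 1 (unconfounded treatment assignment) in the experimental sample:
\[
  W \perp\!\!\!\perp \bigl( Y(0),\, Y(1),\, S(0),\, S(1) \bigr) \mid X,\; P = E
\]
% Assumption 2 (surrogacy): treatment carries no information about Y beyond S and X.
\[
  W \perp\!\!\!\perp Y \mid S,\; X,\; P = E
\]
% Assumption 3 (comparability): the conditional law of Y given (S, X) is the
% same in the observational and experimental samples.
\[
  Y \mid (S, X, P = O) \;\overset{d}{=}\; Y \mid (S, X, P = E)
\]
% Implication when W is completely randomized in sample E:
\[
  \mathbb{E}\bigl[ Y(1) - Y(0) \mid P = E \bigr]
  = \mathbb{E}\bigl[ h_O(S, X) \mid W = 1,\, P = E \bigr]
  - \mathbb{E}\bigl[ h_O(S, X) \mid W = 0,\, P = E \bigr]
\]
```

The last line follows by iterated expectations: surrogacy removes W from the inner conditional mean of Y given (S, X), and comparability lets that mean be taken from the observational sample.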

Page 14: Empirical Application: California GAIN Training Program

California Greater Avenues to Independence (GAIN) program: job assistance program implemented in the late 1980s to help welfare (AFDC) recipients find work

MDRC conducted a randomized trial of GAIN in four urban counties: Alameda (Oakland), Los Angeles, Riverside, and San Diego

Focus first on the Riverside program, which was widely heralded as the most successful, with the largest impacts on employment and earnings

Riverside emphasized a “jobs first” approach to re-entry into labor force (rather than human capital development/training to find ideal match)

Then return to other sites, which we hold out and use for out-of-sample validation

Page 15: Riverside GAIN Program: Experimental Analysis

Use data from Hotz, Imbens, and Klerman (2006), who conducted a nine-year follow-up using data from UI records

5,445 individuals participated in program in Riverside, randomly assigned to treatment and control

At baseline: 22% employed; mean quarterly earnings of $452

Page 16: Employment Rates in Treatment vs. Control Group, by Quarter

[Figure: x-axis: Quarters Since Random Assignment (1 to 36); y-axis: Employment Rate (%); series: Treatment, Control]

Page 17: Employment Rates in Treatment vs. Control Group, by Quarter

[Figure: x-axis: Quarters Since Random Assignment (1 to 36); y-axis: Employment Rate (%); series: Treatment, Treatment Mean Over 9 Years, Control, Control Mean Over 9 Years]

Question: could we have estimated mean impact over 9 years more quickly using short-term employment rates as surrogates?

Page 18: Construction of Surrogate Index

Construct surrogate index by regressing mean employment rate over 36 quarters on employment indicators from quarter 1 to quarter S

Then estimate treatment effect on surrogate index based on employment rates up to quarter S

Assess how quickly (at what value of S) we can estimate nine-year mean impact accurately
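The construction described on this slide can be illustrated with a small simulation. Everything in the sketch is hypothetical: the employment panel is a made-up two-state Markov chain in which treatment raises the job-finding rate for the first four quarters only, the observational panel is simulated without treatment, and the transition probabilities are arbitrary. It is not the GAIN data and not necessarily the authors' estimation code.

```python
import numpy as np

def simulate_panel(w, rng, n_quarters=36, boost_quarters=4):
    """Hypothetical quarterly employment panel (rows: people, columns: quarters)."""
    emp = np.zeros((len(w), n_quarters))
    state = rng.random(len(w)) < 0.2            # employed at baseline
    for q in range(n_quarters):
        boosted = (w == 1) & (q < boost_quarters)
        find = np.where(boosted, 0.25, 0.10)    # job-finding probability
        p_employed = np.where(state, 0.85, find)
        state = rng.random(len(w)) < p_employed
        emp[:, q] = state
    return emp

def effect(values, w):
    """Difference in means between treatment and control."""
    return values[w == 1].mean() - values[w == 0].mean()

def surrogate_index_effect(emp_obs, emp_exp, w_exp, s):
    """Regress the 36-quarter mean on indicators for quarters 1..s in the
    observational panel, then compare predicted values across arms."""
    x_obs = np.column_stack([np.ones(len(emp_obs)), emp_obs[:, :s]])
    coef, *_ = np.linalg.lstsq(x_obs, emp_obs.mean(axis=1), rcond=None)
    x_exp = np.column_stack([np.ones(len(emp_exp)), emp_exp[:, :s]])
    return effect(x_exp @ coef, w_exp)

rng = np.random.default_rng(2)
n = 40_000
w_exp = rng.integers(0, 2, size=n)
emp_exp = simulate_panel(w_exp, rng)
emp_obs = simulate_panel(np.zeros(n, dtype=int), rng)

actual = effect(emp_exp.mean(axis=1), w_exp)    # needs all 36 quarters
surrogate, naive = {}, {}
for s in (1, 2, 4, 6, 12):
    surrogate[s] = surrogate_index_effect(emp_obs, emp_exp, w_exp, s)
    naive[s] = effect(emp_exp[:, :s].mean(axis=1), w_exp)  # short-run mean only
    print(f"S={s:2d}  surrogate={surrogate[s]:.3f}  naive={naive[s]:.3f}  actual={actual:.3f}")
```

In this simulated design the surrogate index estimate approaches the 36-quarter experimental estimate once S reaches the number of boosted quarters, because later employment then depends on treatment only through earlier employment. The naive short-run mean overstates the long-run effect because the simulated impact fades over time.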

Page 19: Estimates of Treatment Effect on Mean Employment Rates Over Nine Years, Varying Quarters of Data Used to Construct Estimate

[Figure: x-axis: Quarters Since Random Assignment (1 to 36); y-axis: Estimated Treatment Effect on Mean Employment Rate Over 9 Years (%); series: Naive Short-Run Mean Over x Quarters, Surrogate Index Estimate, Actual Mean Treatment Effect Over 36 Quarters]

Page 20: Estimates of Treatment Effect on Mean Employment Rates Over Nine Years, Varying Quarters of Data Used to Construct Estimate

[Figure: x-axis: Quarters Since Random Assignment (1 to 36); y-axis: Estimated Treatment Effect on Mean Employment Rate Over 9 Years (%); series: Surrogate Estimate Using Emp. Rate in Quarter x Only, Actual Mean Treatment Effect Over 36 Quarters]

Page 21: Estimates of Treatment Effects on Cumulative Mean Employment Rates, Varying Outcome Horizon, Six-Quarter Surrogate Window

[Figure: x-axis: Quarters Since Random Assignment (6 to 36); y-axis: Treatment Effect on Mean Employment Rate to Quarter x (%); series: Six-Quarter Surrogate Index Estimate, Actual Experimental Estimate]

Page 22: Bounds on Mean Treatment Effect Based on Surrogate Index, Varying Number of Quarters Used to Estimate Surrogate Index

[Figure: x-axis: Quarters Since Random Assignment (1 to 36); y-axis: Estimated Treatment Effect on Mean Employment Rate Over 9 Years (%); series: Actual Mean Treat. Eff. Over 36 Quart., Surrogate Index Estimate, Bounds on Bias]

Page 23: Bounds on Mean Treatment Effect Based on Surrogate Index, Varying Number of Quarters Used to Estimate Surrogate Index

[Figure: x-axis: Quarters Since Random Assignment (1 to 36); y-axis: Estimated Treatment Effect on Mean Employment Rate Over 9 Years (%); series: Actual Mean Treat. Eff. Over 36 Quart., Surrogate Index Estimate, Bounds on Bias, 95% CI for Bounds]

Page 24: Bounds on Mean Treatment Effect Based on Surrogate Index, Varying Number of Quarters Used to Estimate Surrogate Index

[Figure: x-axis: Quarters Since Random Assignment (1 to 36); y-axis: Estimated Treatment Effect on Mean Employment Rate Over 9 Years (%); series: Actual Mean Treat. Eff. Over 36 Quart., Surrogate Index Estimate, two sets of Bounds on Bias]

Page 25: Gains in Precision from Using Surrogate Index

[Figure: two panels. Left axis: Effect on Mean Employment Over Nine Years (%); right axis: Effect on Mean Quarterly Earnings Over Nine Years ($). Bars: 95% CI for Experimental Estimate of Mean Nine-Year Effect, 95% CI for Six-Quarter Surrogate Index Estimate. Annotated standard errors: 1.06% and 0.69% (employment); $56.21 and $36.34 (earnings)]
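The standard errors annotated on this slide can be turned into a precision gain with simple arithmetic. The sketch below assumes that the smaller value in each pair belongs to the six-quarter surrogate index estimate, which is what the slide title implies but which the flattened figure does not state explicitly. The "equivalent sample" calculation relies on the usual rule that a standard error shrinks with the square root of the sample size; the function names are hypothetical.

```python
def se_reduction(se_experimental, se_surrogate):
    """Proportional reduction in the standard error."""
    return 1.0 - se_surrogate / se_experimental

def equivalent_sample_multiplier(se_experimental, se_surrogate):
    """Factor by which the sample would need to grow to match the smaller SE,
    assuming the SE is proportional to 1 / sqrt(n)."""
    return (se_experimental / se_surrogate) ** 2

# (assumed experimental SE, assumed surrogate index SE) as annotated on the slide
pairs = {
    "mean employment (pp)": (1.06, 0.69),
    "mean quarterly earnings ($)": (56.21, 36.34),
}

summary = {
    name: (se_reduction(a, b), equivalent_sample_multiplier(a, b))
    for name, (a, b) in pairs.items()
}
for name, (cut, mult) in summary.items():
    print(f"{name}: SE falls by {cut:.1%}, like a sample {mult:.2f}x as large")
```

Under that reading, both outcomes show a standard error about 35% smaller, roughly what a sample 2.4 times as large would deliver.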

Page 26: Predicting Cross-Site Heterogeneity

Now turn to data from the other three sites: Oakland, LA, San Diego

Use six-quarter surrogate index estimated in Riverside and ask how well it performs in predicting heterogeneity in treatment effects across sites

Joint test of surrogacy and comparability assumptions
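The out-of-sample check described on this slide can be sketched as: fit the index at one site, apply it unchanged at the others, and compare the predicted effect with each site's own long-run experimental estimate. The sites "A" to "D", their effect sizes, and the two surrogates below are simulated placeholders, not the GAIN counties.

```python
import numpy as np

rng = np.random.default_rng(3)
beta = np.array([1.5, 0.5])  # hypothetical link from surrogates to Y, shared by all sites

def simulate_site(n, delta):
    """Hypothetical site: treatment shifts the two surrogates by delta."""
    w = rng.integers(0, 2, size=n)
    s = rng.normal(size=(n, 2)) + np.outer(w, delta)
    y = s @ beta + rng.normal(size=n)
    return w, s, y

def diff_in_means(values, w):
    return values[w == 1].mean() - values[w == 0].mean()

site_effects = {"A": [1.0, 1.0], "B": [0.2, 0.0], "C": [0.5, -0.5], "D": [0.0, 0.4]}
data = {name: simulate_site(20_000, np.array(d)) for name, d in site_effects.items()}

# Fit the surrogate index at site A only.
_, s_a, y_a = data["A"]
design = np.column_stack([np.ones(len(s_a)), s_a])
coef, *_ = np.linalg.lstsq(design, y_a, rcond=None)

# Apply the same index at every site and compare with the experimental estimate.
results = {}
for name, (w, s, y) in data.items():
    index = coef[0] + s @ coef[1:]
    results[name] = (diff_in_means(index, w), diff_in_means(y, w))
    print(f"site {name}: surrogate index = {results[name][0]:.2f}, "
          f"experimental = {results[name][1]:.2f}")
```

Points that line up along the 45° line in a plot of these pairs are consistent with surrogacy and comparability holding jointly. In the simulation they hold by construction, so a real application is the informative test.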

Page 27: Surrogate Index Estimates vs. Actual Experimental Estimates, by Site: Mean Employment Rate over Nine Years

[Figure: scatter plot with 45° line. x-axis: Actual Treatment Effect on Mean Employment Rate (%) Over 36 Quarters (-2 to 8); y-axis: Six-Quarter Surrogate Index Estimate of Treatment Effect on Mean Employment Rate (%) (-2 to 8); points: Riverside, Los Angeles, San Diego, Alameda]

Note: Surrogate Index Estimates are based on a Six-Quarter Surrogate Index Estimated Using Data from Riverside

Page 28: Surrogate Index Estimates vs. Actual Experimental Estimates, by Site: Mean Quarterly Earnings over Nine Years

[Figure: scatter plot with 45° line. x-axis: Actual Treatment Effect on Mean Quarterly Earnings ($) Over 36 Quarters (-100 to 400); y-axis: Six-Quarter Surrogate Index Estimate of Treatment Effect on Mean Quarterly Earnings ($) (-100 to 400); points: Riverside, Los Angeles, San Diego, Alameda]

Note: Surrogate Index Estimates are based on a Six-Quarter Surrogate Index Estimated Using Data from Riverside

Page 29: Conclusion

Surrogate indices can be used to expedite and improve the precision of estimation of long-term treatment effects under empirically plausible assumptions

Potential applications range from impacts of economic programs on lifetime earnings to early childhood interventions on health to marketing impacts on downstream revenue

Page 30: Future Work: Building a Surrogate Library

Over time, we can develop guidance on which surrogates are adequate by analyzing other experiments, as we did across sites in the GAIN job training program

Ex: how many years of earnings, college attendance, other measures are needed to reliably predict lifetime income?

Identifying surrogates that match long-term outcomes in existing/ongoing empirical studies would help us build a “surrogate library”

These surrogate indices can then be used in future work to increase precision and speed of program evaluation

Page 31: Supplementary Results

Page 32: Earnings in Treatment vs. Control Group, by Quarter

[Figure: x-axis: Quarters Since Random Assignment (1 to 36); y-axis: Mean Quarterly Earnings ($); series: Treatment, Control]

Page 33: Estimates of Treatment Effect on Mean Quarterly Earnings Over Nine Years, Varying Quarters of Data Used to Construct Estimate

[Figure: x-axis: Quarters Since Random Assignment (1 to 36); y-axis: Estimated Treatment Effect on Mean Quarterly Earnings Over 9 Years ($); series: Surrogate Index Estimate, Naive Short-Run Estimate, Actual Mean Treatment Effect Over 36 Quarters]

Page 34: Estimates of Treatment Effect on Mean Quarterly Earnings Over Nine Years, Using Earnings in a Single Quarter as a Surrogate

[Figure: x-axis: Quarters Since Random Assignment (1 to 36); y-axis: Estimated Treatment Effect on Mean Quarterly Earnings Over 9 Years ($); series: Actual Mean Treatment Effect Over 36 Quarters, Surrogate Estimate Using Earnings in Quarter x Only]

Page 35: Estimates of Treatment Effects on Mean Quarterly Earnings, by Outcome Horizon

[Figure: Estimated Effects on Cumulative Mean Quarterly Earnings. x-axis: Quarters Since Random Assignment (6 to 36); y-axis: Treatment Effect on Mean Quarterly Earnings to Quarter x ($); series: Six-Quarter Surrogate Index Estimate, Actual Experimental Estimate]

Page 36: Bounds on Mean Treatment Effect on Earnings Based on Surrogate Index, Varying Number of Quarters Used to Estimate Surrogate Index

[Figure: x-axis: Quarters Since Random Assignment (1 to 36); y-axis: Estimated Treatment Effect on Mean Quarterly Earnings Over 9 Years ($); series: Actual Mean Treat. Eff. Over 36 Quart., Surrogate Index Estimate, two sets of Bounds on Bias]

Page 37: Estimates of Treatment Effects on Mean Employment Rates by Year, Actual Estimates by Year vs. Six-Quarter Surrogate Index Estimate

[Figure: x-axis: Years Since Random Assignment (3 to 9); y-axis: Treatment Effect on Mean Employment Rate at Year x (%); series: Six-Quarter Surrogate Index Estimate, Actual Experimental Estimate]

Page 38: Estimates of Treatment Effects on Mean Quarterly Earnings by Year, Actual Estimates by Year vs. Six-Quarter Surrogate Index Estimate

[Figure: x-axis: Years Since Random Assignment (3 to 9); y-axis: Treatment Effect on Mean Quarterly Earnings at Year x; series: Six-Quarter Surrogate Index Estimate, Actual Experimental Estimate]