cesis seminar series feb 2014 pardis nabavi

34
1 Estimation of multivalued treatment effects under conditional independence Matias D. Cattaneo, David M. Drukker, Ashley D. Holland CESIS Seminar series Feb 2014 Pardis Nabavi

Upload: hasana

Post on 09-Feb-2016

34 views

Category:

Documents


0 download

DESCRIPTION

Estimation of multivalued treatment effects under conditional independence Matias D. Cattaneo, David M. Drukker, Ashley D. Holland. CESIS Seminar series Feb 2014 Pardis Nabavi. Multivalued treatment effects. Estimating causal effects of multivalued treatments under ignorability - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: CESIS Seminar series Feb 2014 Pardis Nabavi

1

Estimation of multivalued treatment effectsunder conditional independence

Matias D. Cattaneo, David M. Drukker, Ashley D. Holland

CESIS Seminar seriesFeb 2014

Pardis Nabavi

Page 2: CESIS Seminar series Feb 2014 Pardis Nabavi

2

Multivalued treatment effects• Estimating causal effects of

multivalued treatments under ignorability

possible treatment levels j = 0, 1, ..., J yi is the observed outcome variable wi denotes the treatment level di(j) = 1(wi = j), xi is a kx × 1 vector of covariates

Page 3: CESIS Seminar series Feb 2014 Pardis Nabavi

3

Ignorability Assumption• (Selection-on-observables) y(j) ⊥⊥ d(j)|x.

• : implies that the distribution of each potential outcome y(j) is independent of the random treatment d(j), conditional on the covariates x.

• (No-empty-cell) 0 < pmin < pj(x) with pj(x) = P[w = j|x].

• : says that for every possible x in the population, there is a strictly positive probability that someone with that covariate pattern could be assigned to each treatment level.

Page 4: CESIS Seminar series Feb 2014 Pardis Nabavi

4

Command• Implements two semiparametric estimators for

multivalued treatment effects discussed in Cattaneo (2010)

• Poparms (short for potential-outcomes parameters)

• The estimators implemented in poparms use a multinomial-logit model (MLM) to predict the probability of each treatment level as function of covariates.

Page 5: CESIS Seminar series Feb 2014 Pardis Nabavi

5

Two semiparametric estimators• inverse-probability weighted (IPW) • efficient-influence-function (EIF)

• selection-on-observables assumption to identify and estimate the parameters of the potential-outcomes distributions

• after controlling for observed covariates, the potential-outcome distributions are independent of the treatment level administered, and therefore rules out that some unobservable factor correlated with treatment assignment affects the potential-outcome distributions

Page 6: CESIS Seminar series Feb 2014 Pardis Nabavi

6

propensity score

• The only unknown functions for the IPW estimators are the conditional probability functions pj(x), j = 0, 1, . . . , J, forming the generalized propensity score p(x), which can be estimated parametrically or nonparametrically.

Page 7: CESIS Seminar series Feb 2014 Pardis Nabavi

7

Some features of the implemented procedures• Under standard regularity conditions, the IPW and

EIF estimators are consistent, asymptotically normal and semiparametric efficient when nonparametric estimators are employed to approximate the unknown functions introduced above

• Because we construct the joint VCE, we can conduct joint inference on the mean and quantile of the potential-outcome distributions, and hence we are also able to conduct valid inference on many other treatment effect parameters of interest.

• Both the IPW and EIF estimators can be used to construct simple inference procedures for joint means and quantiles of the potential-outcome distributions, and combinations thereof.

Page 8: CESIS Seminar series Feb 2014 Pardis Nabavi

8

Some features of the implemented procedures• It’s recommended to employ the EIF estimator in

practice because it enjoys the so-called double-robust property when viewed from a (flexible) parametric implementation point of view

• While the IPW estimator may be preferred over the EIF because of its simplicity, it is important to note that the VCE for the IPW estimator requires implementing all the ingredients of the EIF estimator. Thus, the IPW estimator is simpler only as a point estimator

• In poparms, the EIF estimator is the default.

Page 9: CESIS Seminar series Feb 2014 Pardis Nabavi

9

Syntax• poparms (treatvar gpsvars) (depvar cvars) [If ][in]

[,quantiles(numlist) vce(vcemethod [, vceoptions]) ipw]

• treatvar is categorical variable indicating treatment

• gpsvars are the covariates in the equations for the generalized propensity score

• depvar is the outcome variable• cvars are the covariates on which conditional

moments are calculated in the efficient influence-function estimator.

• gpsvars and cvars may contain factor variables

Page 10: CESIS Seminar series Feb 2014 Pardis Nabavi

10

bfit• bfit subcmd sorts a set of fitted candidate

regression models by an information criterion, puts the best-fitting model in ereturn, and displays a table showing the ranking of the models.

• The Bayesian information criterion is the default and the Akaike information criterion may optionally be specified.

Page 11: CESIS Seminar series Feb 2014 Pardis Nabavi

11

A Numerical Example• estimate the population effects of giving harder

tests to students on student performance

a class with normal tests (w=0), a class whose tests included some hard questions

(w=1), or a class whose tests included even more hard

questions (w=2)

• Control variables Parental status = pindex Student’s enviroment = eindex

Page 12: CESIS Seminar series Feb 2014 Pardis Nabavi

12

Select a model for the predicted probabilities• From a theoretical perspective, it is assumed that

a sufficient number of terms in a polynomial of the covariates are included in the MLM.

• The practical implication of this approach is that we must find an MLM for treatment probabilities before using poparms. We use our bfit command to automate this procedure.

Page 13: CESIS Seminar series Feb 2014 Pardis Nabavi

13

subcommand with the bfit• we specify the logit subcommand with the bfit

command to cause bfit to search among candidate MLMs. The covariates pindex and eindex follow the outcome variable w. The option corder(3) specifies that bfit should fit a sequence of models that builds up to a fully interacted, third-order polynomial in the 2 covariates.

• We specify the option base(0) so that w=0 will be the base category for the the candidate MLMs. Finally, we specify option aic to select the model that minimizes the AIC instead of using the default Bayesian information criterion (BIC).

Page 14: CESIS Seminar series Feb 2014 Pardis Nabavi

14

Command• bfit logit w pindex eindex, corder(3) base(0)

sort(aic)• Default corder is 2.

Page 15: CESIS Seminar series Feb 2014 Pardis Nabavi

15

Page 16: CESIS Seminar series Feb 2014 Pardis Nabavi

16

Control predicted probabilities • Before we proceed, we need to check that

all of the predicted probabilities are sufficiently greater than 0 and less than 1.

• For each treatment level j, an overlap plot depicts the estimated density of the predicted probabilities for treatment level j conditional on each possible treatment level.

Page 17: CESIS Seminar series Feb 2014 Pardis Nabavi

17

Page 18: CESIS Seminar series Feb 2014 Pardis Nabavi

18

Do the small minimums for phat0 likely indicate a problem?

• conditional densities of the predicted probability

Page 19: CESIS Seminar series Feb 2014 Pardis Nabavi

19

No evidence that there is any mass of observations with predictedprobabilities close to either 0 or 1

Page 20: CESIS Seminar series Feb 2014 Pardis Nabavi

20

Similar for for treatment levels 1 and 2

Page 21: CESIS Seminar series Feb 2014 Pardis Nabavi

21

No mass too close to 0 or 1!

Page 22: CESIS Seminar series Feb 2014 Pardis Nabavi

22

Same here!

Page 23: CESIS Seminar series Feb 2014 Pardis Nabavi

23

Select a model for the conditional means

• specify the regress subcommand• bfit regress spmeasure pindex eindex, corder(3)

Page 24: CESIS Seminar series Feb 2014 Pardis Nabavi

24

Check the results

Page 25: CESIS Seminar series Feb 2014 Pardis Nabavi

25

Selecting the model• bfit selected a fully interacted second-order

polynomial. As the output presents no indication of any problems, we accept this model as our model for the conditional mean.

• Note that, for simplicity, we employ the same flexible specification for both regression functions.

Page 26: CESIS Seminar series Feb 2014 Pardis Nabavi

26

poparms• poparms takes 2 equations:

model for the predicted probabilities

model for the conditional expectations

Page 27: CESIS Seminar series Feb 2014 Pardis Nabavi

27

Output:

Page 28: CESIS Seminar series Feb 2014 Pardis Nabavi

28

See the parameter names

Page 29: CESIS Seminar series Feb 2014 Pardis Nabavi

29

Average treatment effects

Page 30: CESIS Seminar series Feb 2014 Pardis Nabavi

30

Margin

Page 31: CESIS Seminar series Feb 2014 Pardis Nabavi

31

Margin Plot

Page 32: CESIS Seminar series Feb 2014 Pardis Nabavi

32

Quantile effect

• Or joint mean and quantile effect

Page 33: CESIS Seminar series Feb 2014 Pardis Nabavi

33

Quantile effects

Page 34: CESIS Seminar series Feb 2014 Pardis Nabavi

34

Thanks for listening to me!

More information:Cattaneo, M.D., Drukker,D.M. & Holland, A.D. (2013), “Estimation of multivalued treatment effects under

conditional independence”, Stata Journal