multiple imputation

10
Multiple Imputation

Upload: hope-walker

Post on 03-Jan-2016

34 views

Category:

Documents


0 download

DESCRIPTION

Multiple Imputation. Multiple Imputation. Missing data method developed by Donald Rubin Simulate multiple samples of “complete” data, and compute estimates and standard errors from the complete data. Rubin distinguished multiple imputation from Different models Same model - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Multiple Imputation

Multiple Imputation

Page 2: Multiple Imputation

Multiple Imputation

Missing data method developed by Donald Rubin

Simulate multiple samples of “complete” data, and compute estimates and standard errors from the complete data.

Rubin distinguished multiple imputation from– Different models– Same model

We will focus on same-model multiple imputation

Page 3: Multiple Imputation

Missing Data mechanism

Missing data mechanisms– MCAR (Missing completely at random)—

missing data are a random subsample of complete data

– MAR (Missing at random)—missing data mechanism may depend on independent variables, but not the response

Page 4: Multiple Imputation

Missing Data mechanism

Ignorable nonresponse– MCAR– Parameter for missing process different

from data parameters Example for discussion

– Growth curve models for largemouth bass

Page 5: Multiple Imputation

Computer Example

5 Teachers, 3 methods, Y=relative improvement

Method Teacher 1

Teacher 2

Teacher 3

Teacher 4

Teacher 5

A 10, 7 6 11 6 6

B 4 . 8.5 4,5 3

C 9 13 16 8 6

Page 6: Multiple Imputation

Multiple Imputation simulation

Repeated draws i=1,…,M from the posterior predictive distribution of the missing data.

The complete data sets have the same set of fully observed responses.

In practice, there are numerous ways to generate complete data.

Introductory methods rely on monotone missingness, and classic results for conditional distributions of jointly multivariate normal random variables.

Page 7: Multiple Imputation

Multiple Imputation simulation

In a multivariate normal setting (some values of Y missing), we generate our draws from Y|X:

XYXXYXYYXXXYXY

XXXY

YXYY

X

Y

XMVNXY

MVNX

Y

11 ,~|then

,,~ If

Page 8: Multiple Imputation

Multiple Imputation Estimation

Combining results from imputation for parameters of interest is surprisingly straightforward. E.g., let q represent the PMM’s for Method. We can compute

ˆ i and s ˆ i, i 1,...M

Page 9: Multiple Imputation

Multiple Imputation Estimation

Our estimate and its standard error can be computed as:

1

ˆ and

1

,1

,ˆ1

2

1

2

1

MBs

MW

BM

MWs

M

MiM

M

iM

MM

M

iiM

i

M

Page 10: Multiple Imputation

Multiple Imputation Estimation

Combining estimates in SAS is non-standard.

Our example with LSMeans is atypical, and more straightforward than most.