crj 604 introduction - arizona state university linear regression (next week) 3. multiple regression...

22
CRJ 604 Introduction

Upload: voxuyen

Post on 10-May-2018

224 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

CRJ 604 Introduction

Page 2: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Do babies delay crawling

when it’s cold outside?

Here’s the raw data for 12 birth-month groups of babies:

Temperature in 6th month Average crawling month

66 6.86

73 7.02

72 6.84

63 7.32

52 6.58

39 7.23

33 7.73

30 7.55

33 7.78

37 7.69

48 7.69

57 7.53

Page 3: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Do babies delay crawling

when it’s cold outside?

Page 4: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Do babies delay crawling

when it’s cold outside?

6.5

77

.58

cra

wl

30 40 50 60 70temp6

“scatter crawl temp6” produces this graph

Page 5: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Do babies delay crawling

when it’s cold outside?

“twoway (scatter crawl temp6) (lfit crawl temp6)” adds a line of “best fit”

6.5

77

.58

30 40 50 60 70temp6

crawl Fitted values

Page 6: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Do babies delay crawling

when it’s cold outside?

-This line of best fit is what linear regression does. It represents our best estimate of

the relationship between our two variables.

- To obtain a quantification of this line in stata: “reg crawl temp6”

Page 7: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Do babies delay crawling

when it’s cold outside?

This regression produces the following relationship between

temperature and crawling month:

-Month start crawling = 8.21-.0177*(avg. temperature)

-So a 1 point decrease in temperature is associated with babies

delaying crawling .018 months (about half a day).

- With a 10 point increase in temp, baby should crawl 5 days

earlier.

-Phoenix babies must be super fast crawlers . . .

Page 8: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Goals

From the syllabus:

1. Students will understand the theoretical issues involved in the basic linear regression model in its simplest form (bivariate regression) and multivariate form (multiple regression).

2. Students will also acquire fluency with the computer application (using Stata) of bivariate and multivariate regressions and probit/logit models, including testing assumptions and applying fixes.

Page 9: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Plan for course

1. Overview of course (today)

2. Simple linear regression (next week)

3. Multiple regression

4. Violations of regression assumptions

5. Limited dependent variables

6. Propensity score matching

Page 10: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Textbook: Wooldridge’s

Introductory Econometrics

“Econometrics” is simply the

term economists use for

statistics.

- Crime examples used in

book. I/we will

supplement these with

more examples.

- Data sets / solutions to

odd-numbered exercises

are online

Page 11: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Online resources, this course

All course documents will be located at the following address:

http://www.public.asu.edu/~gasweete/crj604/

This includes the syllabus, lecture slides, datasets for the exercises, and solutions to odd-numbered exercises.

Page 12: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Statistical package

All analysis for this class should be

done using Stata!

- Stata is installed in this classroom

- Outside of the classroom, you will

have to access Stata via Saguaro

- The details of this access are

forthcoming.

Page 13: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

The best place to learn Stata:

http://www.ats.ucla.edu/stat/stata/

A quick introduction:

http://data.princeton.edu/stata/

What I use:

Online resources, Stata

Page 14: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Computer exercise C1.1

Page 15: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Research questions

You will come to understand statistical approaches

to answering questions like these:

Is a particular rehabilitation program effective in

reducing recidivism?

Does gang membership increase crime?

Does juvenile arrest affect high school dropout?

Does inequality increase crime rates?

Page 16: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Types of Data

Your approach to answering research questions is

constricted by the data to which you have

access.

Nonexperimental data: naturally occurring,

preferably collected in a systematic manner

Experimental data: random assignment of cases

to two or more conditions.

Page 17: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Theory

Barring data restrictions, the way you

approach research questions is guided

by criminological theory.

E.g. Social control, strain, differential

association, social disorganization

These theories point to constructs that

account for crime.

For statistical analysis, we create variables

that are supposed to represent theoretical

constructs.

Page 18: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Becker’s model of crime, in theory

Page 19: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Becker’s model of crime, in practice

Page 20: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Causality

Criminologists are often concerned with the causal effect of one variable on another.

Ceteris paribus, meaning “all else equal,” is essential for causal analysis. It’s also difficult to impossible to achieve.

Example: Law enforcement and crime

Page 21: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Causality

Recall: Month start crawling = 8.21-

.0177*(avg. temperature)

Is it reasonable to assume that we have captured a causal relationship between temperature and crawling age?

What else might be correlated with crawling age and temperature?

Page 22: CRJ 604 Introduction - Arizona State University linear regression (next week) 3. Multiple regression 4. Violations of regression assumptions 5. Limited dependent variables 6. Propensity

Next time:

Homework: Problems 1.2, C1.2, C1.4

Note: If I cannot provide access to Stata by

Friday, C1.2 and C1.4 will be done as an

in-class exercise next week.

Read: Wooldridge Chapters 1 & 2, and

appendices A through C if you haven’t

already.