13 regression analysis quant-tech-regression

32
Quantitative Methods Varsha Varde

Upload: pooja-sakhla

Post on 20-Jan-2015

326 views

Category:

Technology


1 download

DESCRIPTION

 

TRANSCRIPT

Page 1: 13 regression analysis quant-tech-regression

Quantitative Methods

Varsha Varde

Page 2: 13 regression analysis quant-tech-regression

Quantitative Methods

Models for Data Analysis & Interpretation: Regression Analysis

Page 3: 13 regression analysis quant-tech-regression

Varsha Varde 3

Cause and Effect

The Present Contains Nothing More Than The Past, and What Is Found In The Effect Was Already In The Cause.

- Henri Bergson (19th Century French Philosopher)

Page 4: 13 regression analysis quant-tech-regression

Varsha Varde 4

Regression Model

• A Statistical Model which Depicts the Influence of One Cardinal Variable (The Cause) on Another Cardinal Variable (The Effect).

• These Models Have a Wide Variety of Forms and Degrees of Complexity.

Page 5: 13 regression analysis quant-tech-regression

Varsha Varde 5

Regression

• The Step Logically Next To Correlation.

• Situation: Usually, Correlation Between Two Variables Is Not Mere Benign Association. But, It Is In Fact Causation.

• It Is a Cause and Effect Relationship, Where X Influences Y.

• X is the Cause Variable.

• Y is the Effect Variable.

Page 6: 13 regression analysis quant-tech-regression

Varsha Varde 6

Some Examples

Cause Effect

Movie Ticket Price Multiplex Occupancy

Machine Downtime Production

Rainfall at Night Absenteeism Next Day

R&D Expenditure Gross Profit

? ?

Page 7: 13 regression analysis quant-tech-regression

Varsha Varde 7

Regression

• Dictionary Says: The Act of Returning or Stepping Back to a Previous Stage.

• Query: Do Quantitative Methods Force Us to Regress instead of Progress?

• Or, Is It Back to the Future?• Answer: Statistics, Like Any Other Field,

Adopts Crazy Names Arising from Some Important Historical Events.

• Soap Opera.

Page 8: 13 regression analysis quant-tech-regression

Varsha Varde 8

Story of Regression

• Sir Francis Galton Studied the Heights of the Sons in Relation to the Heights of Their Fathers.

• His Conclusion: Sons of Tall Fathers Were Not So Tall and Sons of Short Fathers Were Not So Short as their Fathers.

• Path Breaking Finding: Human Heights Tend To Regress Back To Normalcy.

Page 9: 13 regression analysis quant-tech-regression

Varsha Varde 9

Evolution of the Term ‘Regression’

• Since Then (1880), Similar Studies on Nature and Extent of Influence of One or More Variables on Some Other Variable Acquired the Name ‘Regression Analysis’.

• In Quantitative Methods, Regression Means a ‘Cause and Effect Relationship’.

• Cause Variable = Independent Variable

• Effect Variable = Dependent Variable

Page 10: 13 regression analysis quant-tech-regression

Varsha Varde 10

Scatter PlotHorizontal Axis: Reasoning Scores

Vertical Axis: Creativity Scores

Page 11: 13 regression analysis quant-tech-regression

Varsha Varde 11

Scatter PlotHorizontal Axis: Cause Variable: Reasoning Scores

Vertical Axis: Effect Variable: Creativity Scores

Page 12: 13 regression analysis quant-tech-regression

Varsha Varde 12

Regression Curve Horizontal Axis: Cause Variable: Reasoning Scores

Vertical Axis: Effect Variable: Creativity Scores

Page 13: 13 regression analysis quant-tech-regression

Varsha Varde 13

Regression Analysis

• A Quantitative Method which Tries to Estimate the Value of a Cardinal Variable (the Effect) by Studying Its Relationship with Other Cardinal Variables (the Cause).

• This Relationship is Expressed by a Custom-Designed Statistical Formula Called Regression Equation.

Page 14: 13 regression analysis quant-tech-regression

Varsha Varde 14

Purpose of Regression Analysis

1. To Establish Exact Nature of Influence of Cause Variable on Effect Variable.

2. To Determine the Quantum of Influence.

3. To Estimate an Unknown Value of Effect Variable from Value of Cause Variable.

4. To Forecast Future Values of Effect Variable from Info about Cause Variable

Page 15: 13 regression analysis quant-tech-regression

Varsha Varde 15

Patterns of Regression Curves

• Pattern: Upward Sloping Straight Line

• Mathematical Model: Y = a + bX (b > 0)

Page 16: 13 regression analysis quant-tech-regression

Varsha Varde 16

Estimating Regression Parameters a & b

• Formula for Regression Coefficient b :

Mean of Products of Values – Product of the Two Means= -------------------------------------------------------------------------- Variance of Cause Variable

• Formula for Regression Constant a :

a = Mean of Effect Variable Minus b times Mean of Cause Variable

• Don’t Worry. This is the Job of SPSS.

Page 17: 13 regression analysis quant-tech-regression

Varsha Varde 17

Estimating Correlation Coefficient

• Recall the Formula for Correlation Coeff.

• Pearson’s Correlation Coefficient

• Formula:

Mean of Products of Values – Product of the Two Means= -------------------------------------------------------------------------- Product of the Two Standard Deviations

• Spot the Similarity and the Difference.

Page 18: 13 regression analysis quant-tech-regression

Varsha Varde 18

A Simple Example

Empl. No. Yrs in Co. Salary (‘000) Product

1 2 25 50

2 3 30 90

3 5 37 185

4 7 38 266

5 8 40 320

Total 25 170 911

Arith Mean 5 34

Std. Dev. 2.3 5.6

Page 19: 13 regression analysis quant-tech-regression

Varsha Varde 19

Regression Model

• Formula for Regression Coefficient b : Mean of Products of Values – Product of the Two Means= -------------------------------------------------------------------------- Variance of Cause Variable

(911 / 5) – (5 x 34) 182.2 – 170 12.2= ----------------------- = ------------- = ----------- = 2.30

2.3 x 2.3 5.3 5.3

• Formula for Regression Constant a : a = Mean of Effect Variable Minus b times Mean of Cause Variable = 34 – 2.3 x 5 = 22.5

• Regression Model: Y = 22.5 + 2.3 X

Page 20: 13 regression analysis quant-tech-regression

Varsha Varde 20

Check Goodness of the Model

Empl. No. Yrs in Co. Salary (‘000) Estimate

1 2 25

2 3 30

3 5 37

4 7 38

5 8 40

Total 25 170

Arith Mean 5 34

Std. Dev. 2.3 5.6

Page 21: 13 regression analysis quant-tech-regression

Varsha Varde 21

Check Goodness of the Model

Empl. No. Yrs in Co. Salary (‘000) Estimate

1 2 25 27.1

2 3 30 29.4

3 5 37 34.0

4 7 38 38.6

5 8 40 40.9

Total 25 170 170

Arith Mean 5 34

Std. Dev. 2.3 5.6

Page 22: 13 regression analysis quant-tech-regression

Varsha Varde 22

Concept: Error of Estimation

• Note the Difference Between the Actual Values of Effect Variable (Salary) and the Values Estimated by the Regression Model

• This is the Error of Estimation

• Less the Error, Better the Model. Ideally 0.

• Statistical Model: Y = a + b X + e

• If Correlation is Perfect (+1 or -1), e = 0.

Page 23: 13 regression analysis quant-tech-regression

Given below are five observations collected in a regression study on two variables, x (independent variable) and y (dependent variable).

x y2 43 44 35 26 1a. Develop the least squares estimated regression

equation.b. Estimate value of y for x=7. Varsha Varde 23

Q5.

Page 24: 13 regression analysis quant-tech-regression

Varsha Varde 24

Exercise: Fit a Regression Model to Reasoning & Creativity Scores

Apl No, RsnSc CrvSc Apl No, RsnSc CrvSc

01 15.2 11.9 11 8.1 6.8

02 9.9 13.1 12 15.2 13.0

03 7.1 8.9 13 10.9 13.9

04 17.9 17.4 14 17.2 19.1

05 5.1 6.9 15 8.2 10.1

06 10.0 8.8 16 10.8 15.9

07 7.2 14.0 17 12.0 12.1

08 17.1 15.8 18 13.1 16.0

09 15.2 9.7 19 17.9 19.2

10 9.2 12.1 20 7.1 11.9

Page 25: 13 regression analysis quant-tech-regression

Varsha Varde 25

Exercise

• Does Your Model Look Like What I Got?:Creativity Scores = 5.23 + 0.65 x Reasoning Scores + e

• Test the Goodness of Your Regression Model

• How Bad are the Errors?

Page 26: 13 regression analysis quant-tech-regression

Varsha Varde 26

Other Patterns of Regression Curves

• Pattern: Downward Sloping Straight Line

• Statistical Model: Y = a - bX + e (b > 0)

Page 27: 13 regression analysis quant-tech-regression

Varsha Varde 27

Other Patterns of Regression Curves

• Pattern: Simple ExponentialModel: Log Y = a + bX + e (b > 0)

• Pattern: Negative ExponentialModel: Log (1/Y) = a + bX + e (b > 0)

• Pattern: Upward CurvilinearModel: Y = a + b Log X + e (b > 0)

• Pattern: Downward Curvilinear• Pattern: Logistic or S Curve

Page 28: 13 regression analysis quant-tech-regression

Varsha Varde 28

Your Role as a Manager

• Grasp the Situation Thoroughly. (Qualitative)• Identify Related Cardinal Variables. (DIY)• Obtain Quantitative Data on Them.• Draw Scatter Plot. Your Asstt Will Do It For You• If It Shows a Pattern, Compute Correlation

Coefficient. (Use SPSS or YAWDIFY)• If It Is High (+ or -), Draw a Free Hand Curve

and Identify the Pattern of Regression Curve.• Compute Regression Parameters for the Pattern

and Fit Regression Model. (SPSS or YAWDIFY)

Page 29: 13 regression analysis quant-tech-regression

Varsha Varde 29

A Word of Caution

• Undertake Regression Analysis Only For Cardinal Variables.

• Select the Variables Only If You Logically Suspect Influence of One Over the Other.

• Carry Out Regression Analysis Only After Completing Correlation Analysis AND Only If The Selected Cause and Effect Variables Are Highly Correlated.

Page 30: 13 regression analysis quant-tech-regression

Varsha Varde 30

Simple and Multiple Regression

• Simple Regression: One Cause Variable Influences the Effect Variable.

• This is What We Focused On So Far.

• Regression Models Have a Wide Variety of Forms and Degrees of Complexity.

• Multiple Regression: Several Cause Variables Jointly Influence Effect Variable.

Page 31: 13 regression analysis quant-tech-regression

Varsha Varde 31

Multiple Regression

• Multiple Regression Analysis is a Method to Analyze the Effect of Joint Influence of Many Cause Variables on Effect Variable.

• Multiple Regression Model:

Y = a + b1X1 + b2X2 + - - - - +bnXn + e

• Caution: Cause Variables X1, X2, - - - -, Xn Should Not Be Inter-Correlated.

• Otherwise You Face Multicollinearity.

Page 32: 13 regression analysis quant-tech-regression

Varsha Varde 32

Exercise: Select Cause Variables

Cause # 1 X1

Cause # 2 X2

Cause # 3 X3

Effect Y

Machine Downtime

Labour Absenteeism

Power Outage

Monthly Production

EPS ? ? BASF Share Price

? ? ? MRP

? ? ? ManpowerRequiremt