so far (chapters 1 and 2)

39
1 Chapter 3: Feb. 5-10, 2004 So Far (Chapters 1 and 2) Plot Your Data: make a graph, histogram, or stem plot Look for Overall Patterns: shape, center, spread, outliers Calculate a Numerical Summary: to briefly describe center and spread NEW: With a large number of observations, the overall distribution pattern may be so regular that we can describe it with a smooth curve We now have both graphical and numerical tools for describing distributions of data:

Upload: thibault-jaycob

Post on 03-Jan-2016

29 views

Category:

Documents


0 download

DESCRIPTION

So Far (Chapters 1 and 2). We now have both graphical and numerical tools for describing distributions of data:. Plot Your Data: make a graph, histogram, or stem plot Look for Overall Patterns: shape, center, spread, outliers - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: So Far (Chapters 1 and 2)

1Chapter 3: Feb. 5-10, 2004

So Far (Chapters 1 and 2)

Plot Your Data: make a graph, histogram, or stem plot

Look for Overall Patterns: shape, center, spread, outliers

Calculate a Numerical Summary: to briefly describe center and spread

NEW: With a large number of observations, the overall distribution pattern may be so regular that we can describe it with a smooth curve

We now have both graphical and numerical tools for describing distributions of data:

Page 2: So Far (Chapters 1 and 2)

Chapter 3: Feb. 5-10, 2004 2

Chapter 3

The Normal Distributions

Page 3: So Far (Chapters 1 and 2)

3Chapter 3: Feb. 5-10, 2004

Density Curves

Example: here is a histogram of vocabulary scores of 947 seventh graders.

The smooth curve drawn over the histogram is a mathematical model for the distribution.

Page 4: So Far (Chapters 1 and 2)

4Chapter 3: Feb. 5-10, 2004

Density Curves

Example: the areas of the shaded bars in this histogram represent the proportion of scores in the observed data that are less than or equal to 6.0. This proportion is equal to 0.303.

Page 5: So Far (Chapters 1 and 2)

5Chapter 3: Feb. 5-10, 2004

Density Curves

Example: now the area under the smooth curve to the left of 6.0 is shaded. If the scale is adjusted so the total area under the curve is exactly 1, then this curve is called a density curve. The proportion of the area to the left of 6.0 is now equal to 0.293.

Page 6: So Far (Chapters 1 and 2)

6Chapter 3: Feb. 5-10, 2004

Density Curves

Always on or above the horizontal axis

Have area exactly 1 underneath curve

Area under the curve and above any range of values is the proportion of all observations that fall in that range…

AREA = PERCENTAGE

Page 7: So Far (Chapters 1 and 2)

7Chapter 3: Feb. 5-10, 2004

Density Curves The median of a density curve is the

equal-areas point, the point that divides the area under the curve in half

The mean of a density curve is the balance point, at which the curve would balance if made of solid material

Page 8: So Far (Chapters 1 and 2)

8Chapter 3: Feb. 5-10, 2004

Density Curves: Mean vs. Median:

Symmetric Distribution Skewed Distribution

Page 9: So Far (Chapters 1 and 2)

9Chapter 3: Feb. 5-10, 2004

Density Curves: Notation The mean and standard deviation

computed from actual observations (the data) are denoted by and s, respectively.

x

The mean and standard deviation of the actual distribution represented by the smooth density curve are denoted by µ (“mu”) and (“sigma”), respectively.

Page 10: So Far (Chapters 1 and 2)

10Chapter 3: Feb. 5-10, 2004

Uniform Distributions

A distribution where each data value (outcome) has an equally likely chance of being seen.

Think: random numbers…lottery!!

Page 11: So Far (Chapters 1 and 2)

11Chapter 3: Feb. 5-10, 2004

Distribution of 5 "White" Numbers

0

0.005

0.01

0.015

0.02

0.025

0.03

1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53

Number

Pro

bab

ilit

y

Page 12: So Far (Chapters 1 and 2)

12Chapter 3: Feb. 5-10, 2004

Probability of One "Red" Powerball

0

0.005

0.01

0.015

0.02

0.025

0.03

0.035

0.04

0.045

0.05

1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41

Number

Pro

ba

bili

ty

Page 13: So Far (Chapters 1 and 2)

13Chapter 3: Feb. 5-10, 2004

QuestionData sets consisting of physical measurements (heights, weights, lengths of bones, and so on) for adults of the same species and sex tend to follow a similar pattern. The pattern is that most individuals are clumped around the average, with numbers decreasing the farther values are from the average in either direction.

What shape would a histogram (or density curve) of such measurements have?

Page 14: So Far (Chapters 1 and 2)

14Chapter 3: Feb. 5-10, 2004

Density Curves: A Specific Example

(1) Symmetric Distribution

(2) Single-Peaked

(3) Bell-Shaped

Page 15: So Far (Chapters 1 and 2)

15Chapter 3: Feb. 5-10, 2004

The Normal Distribution

standard deviation

mean

Page 16: So Far (Chapters 1 and 2)

16Chapter 3: Feb. 5-10, 2004

The Normal DistributionKnowing the mean (µ) and standard deviation () allows us to make various conclusions about Normal distributions. Notation: N(µ,).

Examples: (1) different µ, same ; (2) same µ, different

Page 17: So Far (Chapters 1 and 2)

17Chapter 3: Feb. 5-10, 2004

68-95-99.7 Rule for Any Normal Curve

68% of the observations fall within one standard deviation of the mean

95% of the observations fall within two standard deviations of the mean

99.7% of the observations fall within three standard deviations of the mean

Page 18: So Far (Chapters 1 and 2)

18Chapter 3: Feb. 5-10, 2004

68-95-99.7 Rule for Normal Curves

68%

+- µ

+3-3

99.7%

µ

+2-2

95%

µ

Page 19: So Far (Chapters 1 and 2)

19Chapter 3: Feb. 5-10, 2004

68-95-99.7 Rule for Normal Curves

Page 20: So Far (Chapters 1 and 2)

20Chapter 3: Feb. 5-10, 2004

Health and Nutrition Examination Study of 1976-1980

Heights of adult men, aged 18-24mean: 70.0 inches

standard deviation: 2.8 inches

heights follow a normal distribution, so we

have that heights of men are N(70, 2.8).

Page 21: So Far (Chapters 1 and 2)

21Chapter 3: Feb. 5-10, 2004

Health and Nutrition Examination Study of 1976-1980

68-95-99.7 Rule for men’s heights68% are between 67.2 and 72.8 inches

[ µ = 70.0 2.8 ]

95% are between 64.4 and 75.6 inches[ µ 2 = 70.0 2(2.8) = 70.0 5.6 ]

99.7% are between 61.6 and 78.4 inches[ µ 3 = 70.0 3(2.8) = 70.0 8.4 ]

Page 22: So Far (Chapters 1 and 2)

22Chapter 3: Feb. 5-10, 2004

Health and Nutrition Examination Study of 1976-1980 What proportion of men are less than 72.8

inches tall?

?

70 72.8 (height values)

+1

? = 84%

68% (by 68-95-99.7 Rule)

16%

-1

Page 23: So Far (Chapters 1 and 2)

23Chapter 3: Feb. 5-10, 2004

Health and Nutrition Examination Study of 1976-1980

What proportion of men are less than 68 inches tall?

?

68 70 (height values)

How many standard deviations is 68 from 70?

Page 24: So Far (Chapters 1 and 2)

24Chapter 3: Feb. 5-10, 2004

Standard Normal Distribution The standard Normal distribution is the Normal

distribution with mean 0 and standard deviation 1. We can write: N(0,1).

If a variable x has any Normal distribution with any other mean µ and standard deviation , then the following standardized variable (standardized score) has the standard Normal distribution:

x

z

Page 25: So Far (Chapters 1 and 2)

25Chapter 3: Feb. 5-10, 2004

Standardized Scores How many standard deviations is 68 from

70? standardized score (the z-score) =

(observed value minus mean) ÷ (std dev)

[ = (68 70) / 2.8 = 0.71 ] The value 68 is 0.71 standard deviations

below the mean 70.

Page 26: So Far (Chapters 1 and 2)

26Chapter 3: Feb. 5-10, 2004

Determining Area/Probability Given

a Data Value

Using Normal Distributions: I

Page 27: So Far (Chapters 1 and 2)

27Chapter 3: Feb. 5-10, 2004

Health and Nutrition Examination Study of 1976-1980

What proportion of men are less than 68 inches tall?

-0.71 0 (standardized values)68 70 (height values)

?

Page 28: So Far (Chapters 1 and 2)

28Chapter 3: Feb. 5-10, 2004

Table A:Standard Normal Probabilities

See pages 652-653 in text for Table A.(the “Standard Normal Table”)

Look up the closest standardized score (z-score) in the table.

Find the probability (area) to the left of the standardized score.

Page 29: So Far (Chapters 1 and 2)

29Chapter 3: Feb. 5-10, 2004

Table A:Standard Normal Probabilities

Page 30: So Far (Chapters 1 and 2)

30Chapter 3: Feb. 5-10, 2004

z .00 .02

0.8 .2119 .2090 .2061

.2420 .2358

0.6 .2743 .2709 .2676

0.7

.01

.2389

Table A:Standard Normal Probabilities

Page 31: So Far (Chapters 1 and 2)

31Chapter 3: Feb. 5-10, 2004

Health and Nutrition Examination Study of 1976-1980

What proportion of men are less than 68 inches tall?

-0.71 0 (standardized values)68 70 (height values)

.2389

Answer: 23.89%

Page 32: So Far (Chapters 1 and 2)

32Chapter 3: Feb. 5-10, 2004

Health and Nutrition Examination Study of 1976-1980

What proportion of men are greater than 68 inches tall?

.2389

-0.71 0 (standardized values)68 70 (height values)

1.2389 = .7611

Page 33: So Far (Chapters 1 and 2)

33Chapter 3: Feb. 5-10, 2004

Determining a Data Value Given the Area/Probability

Using Normal Distributions: II

Page 34: So Far (Chapters 1 and 2)

34Chapter 3: Feb. 5-10, 2004

Health and Nutrition Examination Study of 1976-1980

How tall must a man be to place in the lower 10% for men aged 18 to 24?

.10 ? 70 (height values)

Page 35: So Far (Chapters 1 and 2)

35Chapter 3: Feb. 5-10, 2004

See pages 652-653 in text for Table A.

Look up the closest probability (to .10 here) in the table.

Find the corresponding standardized score.

The value you seek is that many standard deviations from the mean.

Table A:Standard Normal Probabilities

Page 36: So Far (Chapters 1 and 2)

36Chapter 3: Feb. 5-10, 2004

z .07 .09

1.3 .0853 .0838 .0823

.1020 .0985

1.1 .1210 .1190 .1170

1.2

.08

.1003

Table A:Standard Normal Probabilities

Page 37: So Far (Chapters 1 and 2)

37Chapter 3: Feb. 5-10, 2004

Health and Nutrition Examination Study of 1976-1980

How tall must a man be to place in the lower 10% for men aged 18 to 24?

-1.28 0 (standardized values)

.10 ? 70 (height values)

Page 38: So Far (Chapters 1 and 2)

38Chapter 3: Feb. 5-10, 2004

Observed Value for a Standardized Score Need to “unstandardize” the z-score to find

the observed value (x) :

x z

observed value =mean plus [(standardized score) (std dev)]

zx

solve for x

Page 39: So Far (Chapters 1 and 2)

39Chapter 3: Feb. 5-10, 2004

Observed Value for a Standardized Score

observed value =mean plus [(standardized score) (std dev)]

= 70 + [(1.28 ) (2.8)]

= 70 + (3.58) = 66.42

A man would have to be approximately 66.42 inches tall or less to place in the lower 10% of all men in the population.