2 5 standard deviation

Post on 18-May-2015

4.546 Views

Category:

Technology

5 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Do Now, Calculate these values

• Customer waiting times, in seconds, for two banks

• Mean =• Median =• Mode =• Midrange =

Jefferson Providence

6.5 4.2

6.6 5.4

6.7 5.8

6.8 6.2

7.1 6.7

7.3 7.7

7.4 7.7

7.7 8.5

7.7 9.3

7.7 10

Measures of Variation

• We need to measure the variation in such a way that– Data in which values that are close together should have a

small measurement

– Values that are spread apart should have a high measurement

• Range• Standard deviation• Variance

Range

• maximum - minimum• Jefferson:• Providence:

Standard Deviation

• “Measurement of variation of values about their mean”

• Standard deviation based on a sample:– Abbreviated s

2

1

1

n

ii

x xs

n

• Same units as the original values• Sample variance = s2

Population Standard Deviation

• Standard deviation of the entire population:– Abbreviated σ (lower-case sigma)

2

1

n

ii

x

N

• Same units as the original values• Population variance = σ2

Standard Deviation

• Interpretation– “Average distance from the mean”

– About 2/3 of the values should fall within 1 standard deviation of the mean

• Can be estimated with the Range rule of thumb:

4

ranges

Calculating a Standard Deviation

1. Find the mean

2. For each valuea. Subtract the mean from the value

b. Square the difference

3. Add the squares from step 2b.

4. Divide that sum buy one less than number of samples

5. Find the square root of the quotient

Live Example: Jefferson Valley

6.5

6.6

6.7

6.8

7.1

7.3

7.4

7.7

7.7

7.7

Your turn: Providence

4.2

5.4

5.8

6.2

6.7

7.7

7.7

8.5

9.3

10

Rounding

• Round to one more decimal place that the original data.– E.g., the mean of 14 15 9 16 and 12 would be 13.2

– E.g., the mean of 1.04, 2.17, and 3.79 would be 2.333

– Except for ‘well-know’ decimals

• 0.25, 0.75, 0.5

• Round the standard deviation to one more place that the mean

Homework

• Find the standard deviation of Super Bowl Points• Use the range rule of thumb to estimate the standard

deviation of motor vehicle deaths and Sunspot number

Uses Standard Deviation

• Interpretation– “Average distance from the mean”

– About 2/3 of the values should fall within 1 standard deviation of the mean

• Uses– Identify outliers and determining what is “unusual”

– Frequency Tables

– Curving

Estimating Frequencies

meanmean + s

mean + 2smean + 3smean – 3s

mean – 2smean – s

34% 34%17% 17%2.4% 2.4%0.1% 0.1%

68%

95%

99.7%

• In a large sample of normally distributed data a specific percent of values should fall into frequency classes whose boundaries are based on the mean and standard deviation

Distribution Spread of Bank Data

7.15

7.15

2 0.48

8.11

7.15

(3)0.48

8.59

0 (mean)

+1s +2s + 3s-3s -2s -1s

7.15

0.48

7.63

7.15

0.48

6.67

7.15

2 0.48

6.19

7.15

(3)0.48

5.71

Outliers and Unusual Values

• 5% is considered the boundary for being unusual• Values are considered outliers or unusual if they are

beyond 2 standard deviations from the mean.

mean mean + 2smean – 2s

2.5% 2.5%

95%

Outliers and Unusual Values

• Outliers and/or unusual values are usually investigated– Different category

– Bad data

– “Bad day”

• Outliers and/or unusual values might be trimmed from the list

Frequency Table

• We can use the mean and standard deviation as the boundary of our frequency classes

• If the data is normally distributed– 68% of our samples should be within one standard

deviation of the mean

– 95% should be within two standard deviations

– 99.7 should be within three standard deviations

Building a Frequency Table

1. Create a table of six classes2. Find the mean and standard deviation3. Use the mean as the lower class fourth class4. Add the standard deviation to the mean to get the

LCL for the fifth classes, add it again for the LCL for the sixth the mean.

5. Subtract the standard deviation from the mean to get the LCL for the third class, subtract it again to get the LCL for the second class. The first class has no LCL

6. Fill in the UCLs; the sixth class has no UCL7. Fill in the frequencies

IQ Scores

• The mean and standard deviation for IQ scores is 100 and 15, respectively.

• Create a frequency table for the following IQ scores: 100 111 122 95 83 95 109 93 102 110 86 98 120 108 117 94 101 105 104 72

IQ Frequency Table

LCB UCB Frequency

Using Mean and Std Dev to curve a tests

C+ B B+ AF D D+ C

• 25 55 59 59 63 71 71 74 80 80 80 83 84 84 87 88 95 95 100 100

• X-bar is 77, s is 18

• Use s divided by 3 for grade boundaries

A+

Your Turn

• Define distribution spread and a frequency table using the following values:50 140 104 75 55 22 75 39 90 45 110 62 90 35 200 90

• Are there any “unusual values?• What is the “theoretical” percent of values that fall

within 1 sigma?• What is the actual percent of values that fall within 1

sigma?

IQ Frequency Table

LCB UCB Frequency

Homework

• The mean adult male height is 69 inches; the standard deviation is 3.5

• The mean adult female height is 64 inched; the standard deviation is 3

• On the next slide use these values to create a frequency table of your opinion poll data.

• Identify outliers/unusual values

Frequency Table: Male

LCB UCB Frequency

Frequency Table: Female

LCB UCB Frequency

top related