chapter 6 · web view6.2 properties of a normal distribution. i. continuous probability...

25
CHAPTER 6 CHAPTER 6 The Normal Distribution The Normal Distribution Objectives Objectives Identify distributions as symmetrical or skewed. Identify distributions as symmetrical or skewed. Identify the properties of the normal distribution. Identify the properties of the normal distribution. Find the area under the standard normal distribution, given Find the area under the standard normal distribution, given various various z values. values. Find the probabilities for a normally distributed variable by Find the probabilities for a normally distributed variable by transforming it into a standard normal variable. transforming it into a standard normal variable. Find specific data values for given percentages using the Find specific data values for given percentages using the standard normal distribution. standard normal distribution. Use the central limit theorem to solve problems involving sample Use the central limit theorem to solve problems involving sample means for large samples. means for large samples. Use the normal approximation to compute probabilities for a Use the normal approximation to compute probabilities for a binomial variable. binomial variable. 6.1 Introduction 6.1 Introduction Many continuous variables have distributions that are bell-shaped Many continuous variables have distributions that are bell-shaped and are called and are called approximately normally distributed variables approximately normally distributed variables. A normal distribution is also known as the A normal distribution is also known as the bell curve bell curve or the or the Gaussian Gaussian distribution distribution. Normal and Skewed Distributions Normal and Skewed Distributions The The normal distribution normal distribution is a continuous, bell-shaped distribution of a is a continuous, bell-shaped distribution of a variable. variable. If the data values are evenly distributed about the mean, the If the data values are evenly distributed about the mean, the distribution is said to be distribution is said to be symmetrical symmetrical. If the majority of the data values fall to the left or right of If the majority of the data values fall to the left or right of the mean, the distribution is said to be the mean, the distribution is said to be skewed skewed. Left Skewed Distributions Left Skewed Distributions When the majority of the data values fall to the right of the When the majority of the data values fall to the right of the mean, the distribution is said to be mean, the distribution is said to be negatively negatively or left skewed or left skewed . The mean is . The mean is to the left of the median, and the mean and the median are to the left to the left of the median, and the mean and the median are to the left of the mode. of the mode. Right Skewed Distributions Right Skewed Distributions 1

Upload: others

Post on 23-Mar-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

CHAPTER 6CHAPTER 6The Normal DistributionThe Normal DistributionObjectivesObjectives

•• Identify distributions as symmetrical or skewed.Identify distributions as symmetrical or skewed.

•• Identify the properties of the normal distribution.Identify the properties of the normal distribution.

•• Find the area under the standard normal distribution, given various Find the area under the standard normal distribution, given various zz values. values.

•• Find the probabilities for a normally distributed variable by transforming it into a standardFind the probabilities for a normally distributed variable by transforming it into a standard normal variable.normal variable.

•• Find specific data values for given percentages using the standard normal distribution.Find specific data values for given percentages using the standard normal distribution.

•• Use the central limit theorem to solve problems involving sample means for large Use the central limit theorem to solve problems involving sample means for large samples.samples.

•• Use the normal approximation to compute probabilities for a binomial variable.Use the normal approximation to compute probabilities for a binomial variable.

6.1 Introduction6.1 Introduction•• Many continuous variables have distributions that are bell-shaped and are called Many continuous variables have distributions that are bell-shaped and are called approximately normally distributed variablesapproximately normally distributed variables..•• A normal distribution is also known as the A normal distribution is also known as the bell curvebell curve or the or the Gaussian distributionGaussian distribution..

Normal and Skewed DistributionsNormal and Skewed Distributions

•• The The normal distributionnormal distribution is a continuous, bell-shaped distribution of a variable. is a continuous, bell-shaped distribution of a variable.

•• If the data values are evenly distributed about the mean, the distribution is said to be If the data values are evenly distributed about the mean, the distribution is said to be symmetricalsymmetrical..•• If the majority of the data values fall to the left or right of the mean, the distribution is saidIf the majority of the data values fall to the left or right of the mean, the distribution is said to be to be skewedskewed..

Left Skewed DistributionsLeft Skewed Distributions

•• When the majority of the data values fall to the right of the mean, the distribution is said When the majority of the data values fall to the right of the mean, the distribution is said to be to be negativelynegatively or left skewedor left skewed. The mean is to the left of the median, and the mean and the . The mean is to the left of the median, and the mean and the median are to the left of the mode.median are to the left of the mode.

Right Skewed DistributionsRight Skewed Distributions

•• When the majority of the data values fall to the left of the mean, the distribution is said toWhen the majority of the data values fall to the left of the mean, the distribution is said to be be positively or right skewedpositively or right skewed. The mean falls to the right of the median and both the mean and . The mean falls to the right of the median and both the mean and the median fall to the right of the mode.the median fall to the right of the mode.

6.2 Properties of a Normal Distribution6.2 Properties of a Normal DistributionI. Continuous Probability Distributions

A continuous random variable is one that can theoretically take on any value on some line interval. We use f(x) to represent a probability density function. Unfortunately, f(x) does not give us the probability that the value x will be observed. To understand how a probability density function for a continuous random variable enables us to find probabilities, it is important to understand the relationship between probability and area. For the following given histogram, what is the probability that x is in between 2.5 to 5.5?

1

Page 2: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

Use the given frequency histogram to calculate P(2.5 < x < 5.5):P(2.5 < x < 5.5) = (4 + 5 + 4) / (1+ 2 + 3 + 4 + 5 + 4 + 3 + 2 + 1) = 13 / 25 = 52%

Use the corresponding relative frequency histogram to calculate P(2.5 < x < 5.5):P(2.5 < x < 5.5) = 16% + 20% + 16% = 52% which is the same as the area of the three middle bars of the relative frequency histogram. The width of each bar is one and the height is the given percentage.

For a continuous probability distribution, 1) > 0 for all values x of the random variable; 2) the total area under the graph of is 1; 3) P(a < x < b) can be approximated by the area under the graph of for a < x < b.

Note: P(x = a) = 0 for continuous random variables. This implies P(a x b) = P(a < x < b); P(x a) = P(x > a);

and P(x a) = P(x < a).

II. II. The Normal Distribution Continuous probability distributions can assume a variety of shapes. However, the most

important distribution of continuous random variables in statistics is the normal distribution that is approximately mound-shaped. Many naturally occurring random variables such as IQs, height of humans, weights, times, etc. have nearly normal distributions.

•• The mathematical equation for a normal distribution isThe mathematical equation for a normal distribution is

Where Where e e 2.718, 2.718, 3.14, 3.14, = population mean = population mean = population standard deviation

The mean is located at the center of distribution. The distribution is symmetric about its mean .

2

Page 3: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

There is a correspondence between area and probability. Since the total area under the normal probability distribution is equal to 1, the symmetry implies

that the area to the right of is 0.5 and the area to the left of is also 0.5.Large values of reduce the height of the curve and increase the spread.Small values of increase the height of the curve and reduce the spread. Almost all values of a normal random variable lie in the interval

III. Properties of the Normal DistributionIII. Properties of the Normal Distribution

•• The shape and position of the normal distribution curve depend on two parameters, the The shape and position of the normal distribution curve depend on two parameters, the meanmean and the and the standard deviationstandard deviation. .

•• Each normally distributed variable has its own normal distribution curve, which depends Each normally distributed variable has its own normal distribution curve, which depends on the values of the variable’s mean and standard deviation.on the values of the variable’s mean and standard deviation.

Normal Distribution PropertiesNormal Distribution Properties

•• The normal distribution curve is bell-shaped.The normal distribution curve is bell-shaped.

•• The mean, median, and mode are equal and located at the center of the distribution.The mean, median, and mode are equal and located at the center of the distribution.

•• The normal distribution curve is The normal distribution curve is unimodalunimodal (i.e., it has only one mode). (i.e., it has only one mode).

•• The curve is symmetrical about the mean, which is equivalent to saying that its shape is The curve is symmetrical about the mean, which is equivalent to saying that its shape is the same on both sides of a vertical line passing through the center.the same on both sides of a vertical line passing through the center.

•• The curve is continuous—i.e., there are no gaps or holes. For each value of The curve is continuous—i.e., there are no gaps or holes. For each value of XX, here is a , here is a corresponding value of corresponding value of YY..

•• The curve never touches the The curve never touches the xx axis. Theoretically, no matter how far in either direction axis. Theoretically, no matter how far in either direction the curve extends, it never meets the the curve extends, it never meets the xx axis—but it gets increasingly closer. axis—but it gets increasingly closer.

6.3 The Standard Normal Distribution6.3 The Standard Normal Distribution•• Since each normally distributed variable has its own mean and standard deviation, the Since each normally distributed variable has its own mean and standard deviation, the shape and location of these curves will vary. In practical applications, one would have to have a shape and location of these curves will vary. In practical applications, one would have to have a table of areas under the curve for each variable. To simplify this, statisticians use the standard table of areas under the curve for each variable. To simplify this, statisticians use the standard normal distribution.normal distribution.•• The The standard normal distributionstandard normal distribution is a normal distribution with a mean of 0 and a standard is a normal distribution with a mean of 0 and a standard deviation of 1. deviation of 1.

Recall: zRecall: z Values Values

•• The The zz value is the number of standard deviations that a particular value is the number of standard deviations that a particular XX value is away from value is away from the mean. The formula for finding the z value is:the mean. The formula for finding the z value is:

Area Between 0 and zArea Between 0 and z

3

Page 4: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

To find the area between 0 and any To find the area between 0 and any zz value: Look up the z value in the table. value: Look up the z value in the table.

Area in Any TailArea in Any Tail

•• Look up the Look up the zz value to get the area. value to get the area.

•• Subtract the area from 0.5000.Subtract the area from 0.5000.

Area Between Two Area Between Two zz Values Values

•• Look up both Look up both zz values to get the areas. values to get the areas.

•• Subtract the smaller area from the larger area.Subtract the smaller area from the larger area.

Area Between Area Between zz Values—Opposite Sides Values—Opposite Sides

•• Look up both Look up both zz values to get the areas. values to get the areas.

•• Add the areas.Add the areas.

Area To the Left of Any Area To the Left of Any zz Value Value

•• Look up the Look up the zz value to get the area. value to get the area.

•• Add 0.5000 to the area.Add 0.5000 to the area.

Area To the Right of Any Area To the Right of Any zz Value Value

•• Look up the Look up the zz value in the table to get the area. value in the table to get the area.

•• Add 0.5000 to the area.Add 0.5000 to the area.

Area Under the CurveArea Under the Curve•• The area under the curve is more important than the frequencies because the area The area under the curve is more important than the frequencies because the area corresponds to the probability!corresponds to the probability!

•• Note:Note: In a continuous distribution, the probability of any exact In a continuous distribution, the probability of any exact Z Z value is 0 since area value is 0 since area would be represented by a vertical line above the value. But vertical lines in theory have no would be represented by a vertical line above the value. But vertical lines in theory have no area. So area. So

Example 1:

(a) Find P(0 < z < 1.63)

4

Page 5: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

(b) Find P(-2.48 < z < 0)

(c) Find P(-2.02 < z < 1.74)

(d) Find P(1.02 < z < 1.84)

(e) Find the probability that z is between -.58 and -.10.

(f) Find the probability that z is larger than 1.76.

(g) Find the probability that z is less than 2.04.

(h) Find the probability that z is within two standard deviations of the mean.

5

Page 6: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

Example 2: Assume the standard normal distribution. Fill in the blanks.

(a) P( 0 < z < _______ ) = .4279

(b) P( 0 < z < _______ ) = .4997

(c) P( _______ < z < 0 ) = .4370

(d) P( z < _______ ) = .9846

(e) P( z < _______ ) = .1190

(f) Find the z value to the left of the mean so that 71.90% of the area under the distribution curve lies to the right of it.

6

Page 7: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

(g) Find two z values, one positive and one negative, so that the areas in the two tails total to 12%.

6.4 Applications of the Normal Distribution6.4 Applications of the Normal DistributionI. Calculating Probabilities for a Non-Standard Normal Distribution

Consider a normal variable x with mean and standard deviation .

1. Standardize from x to z

2. Use Table E to find the central area corresponding to z

3. Adjust the area to answer the questionExample 1:Let x be a normal random variable with mean 80 and standard deviation 12. What percentage of values are

(a) larger than 56?

(b) less than 62?

(c) between 85 and 98

(d) outside of 1.5 standard deviations of the mean?

7

Page 8: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

Example 2: (Ref: General Statistics by Chase/Bown, 4th Ed.)The length of times it takes for a ferry to reach a summer resort from the mainland is approximately normally distributed with mean 2 hours and standard deviation of 12 minutes. Over many past trips, what proportion of times has the ferry reached the island in(a) less than 1 hour, 45 minutes?

(b) more than 2 hours, 5 minutes?

(c) between 1 hour, 50 minutes and 2 hours, 20 minutes?

II. Calculating a Cutoff Value Backward steps for calculating probabilities of a non-standard normal distribution.

1. Adjust to the corresponding central area.2. Use Table E to find the corresponding z cutoff value.

8

Page 9: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

3. Non-standardize from z to x:

Example 1:Employees of a company are given a test that is distributed normally with mean 100 and variance 25. The top 5% will be awarded top positions with the company. What score is necessary to get one of the top positions?

Example 2:Quiz scores were normally distributed with = 14 and = 2.8, the lower 20% should receive tutorial service. Find the cutoff score.

Section 6 – 5 The Central Limit TheoremSection 6 – 5 The Central Limit Theorem

• I. Sampling Distribution of Sample MeanExample 1: Population Distribution TableExample 1: Population Distribution Table

(a) Find the population mean and population standard deviation of the population

distribution table.

9

Page 10: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

(b) Construct a probability histogram for x

Example 2: From the population distribution of example 1, 2 random variables are randomly selected.

(a) List out all possible combinations (sample space) and for each combination.

10

Page 11: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

(b) Construct a probability distribution table for .

(c) Construct a probability histogram for .

11

Page 12: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

(d) Find the mean of the sampling distribution of .

(e) Find the standard deviation of the sampling distribution of .

(f) Compare with .

(g) Compare with .

12

Page 13: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

Population parameter Sample statistics

Mean

Standard deviation

Population Distribution Sampling Distribution

II. Central Limit TheoremII. Central Limit TheoremIf the population distribution is normally distributed, the sampling distribution of will be normally distributed, for any sample size n.

If the population distribution is not normally distributed, the sampling distribution of will be normally distributed for any size of n 30

Example 1: Population distribution

Given: = 50, = 10

13

)(xP

x

41

2 4 6 8 1 2 3 4 5 6 7 8

10/1

)(xP

x

10/2

10/3

10/4

Page 14: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

(a) Find and for n = 4

(b) Is the sampling distribution normally distributed?

(c) If n is changed from 4 to 36, is the sampling distribution normally distributed?

Example 2: (Ref: General Statistics by Chase/Bown, 4th Ed.)A population has mean 325 and variance 144. Suppose the distribution ofsample means is generated by random samples of size 36.(a) Find and

(b) Find

(c) Find

14

Page 15: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

Example 3: The average number of days spent in a North Carolina hospital for a coronary bypass in 1992 was 9 days and the standard deviation was 4 days (North Carolina Medical Database Commission, Consumer’s Guide to Hospitalization Charges in North Carolina Hospitals, August 1994). What is the probability that a random sample of 30 patients will have an average stay longer than 9.5 days?

Example 4: Suppose the test scores for an exam are normally distributed with = 75, = 8(a) What percent of the students has a score greater than 85?

(b) What is the probability that 4 randomly selected students will have a mean score higher than 85?

15

Page 16: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

Section 6 - 6 Normal Approximation to the Binomial DistributionSection 6 - 6 Normal Approximation to the Binomial DistributionI. When to use a Normal distribution to approximate a Binomial distribution?Recall that a binomial distribution is determined by n and p. When p is approximately 0.5, and

as n increases, the shape of the binomial distribution becomes similar to the normal distribution. In order to use a normal distribution to approximate a binomial distribution, n must be sufficiently large. It is known n will be sufficiently large if It is known n will be sufficiently large if npnp ≥ 5 and ≥ 5 and nqnq ≥ 5 ≥ 5.

When using a normal distribution to approximate a binomial distribution, the mean and standard deviation of the normal distribution is the same as the binomial distribution. Now recall the formulas for finding the mean and standard deviation.

II. Continuity Correction• In addition to the condition np ≥ 5 and nq ≥ 5, a correction for continuity is used in employing a continuous distribution (Normal distribution) to approximate a discrete distribution (Binomial distribution).

Warning : The continuity correction should be used only when approximating the Binomial probability with a normal probability. Don’t use the continuity correction with other Don’t use the continuity correction with other normal probability problems.normal probability problems.

Continuity correction x 0.5

Example 1:Use the continuity correction to rewrite each expression:(a) Bi Dist.N Dist. (d) Bi Dist. N Dist.

P( x > 6) P( 1 < x < 7)

(b) Bi Dist.N Dist. (e) Bi Dist. N Dist.P( x 3) P ( 5 x 10)

(c) Bi Dist.N Dist. (f) Bi Dist. N Dist.P( x 9) P (4 < x 6)

III. Using a Normal Distribution to approximate a Binomial Distribution

16

Page 17: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

Step 1: Check whether the normal distribution can be used. ( np 5 and nq 5 )

Step 2: Find the mean and standard deviation .

Step 3: Write the problem in probability notation, using x.Step 4: Rewrite the problem by using the continuity correction factor.

Continuity correction x 0.5Step 5: Find the corresponding z value(s).

Step 6: Use the z table to find the center area and adjust the center area to answer the question.

Example 1: (Ref: General Statistics by Chase/Bown, 4th Ed.)Assume that the experiment is a binomial experiment.Find the probability of 10 or more successes, where n = 13 and p = .4.

(a) Use the Binomial table

(b) Using the normal approximation to the binomial.

17

Page 18: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

Example 2: A dealer states that 90% of all automobiles sold have air conditioning. If the dealer sells 250 cars, find the probabilitythat fewer than 5 of them will not have air conditioning.

Example 3: In a corporation, 30% of the people elect to enroll in thefinancial investment program offered by the company.Find the probability that of 800 randomly selected people,between 260 and 300 inclusive have enrolled in the program.

18

Page 19: CHAPTER 6 · Web view6.2 Properties of a Normal Distribution. I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value

Summary Summary

•• The normal distribution can be used to describe a variety of variables, such as heights, The normal distribution can be used to describe a variety of variables, such as heights, weights, and temperatures.weights, and temperatures.

•• The normal distribution is bell-shaped, unimodal, symmetric, and continuous; its mean, The normal distribution is bell-shaped, unimodal, symmetric, and continuous; its mean, median, and mode are equal.median, and mode are equal.

•• Mathematicians use the standard normal distribution which has a mean of 0 and a Mathematicians use the standard normal distribution which has a mean of 0 and a standard deviation of 1.standard deviation of 1.

•• The normal distribution can be used to describe a sampling distribution of sample The normal distribution can be used to describe a sampling distribution of sample means.means.

•• These samples must be of the same size and randomly selected with replacement from These samples must be of the same size and randomly selected with replacement from the population.the population.

•• The central limit theorem states that as the size of the samples increases, the The central limit theorem states that as the size of the samples increases, the distribution of sample means will be approximately normal.distribution of sample means will be approximately normal.

•• The normal distribution can be used to approximate other distributions, such as the The normal distribution can be used to approximate other distributions, such as the binomial distribution.binomial distribution.

•• For the normal distribution to be used as an approximation to the binomial distribution, For the normal distribution to be used as an approximation to the binomial distribution, the conditions the conditions npnp 5 and 5 and nqnq 5 must be met. 5 must be met.

•• A correction for continuity may be used for more accurate results.A correction for continuity may be used for more accurate results.

ConclusionsConclusions

•• The normal distribution can be used to approximate other distributions to simplify the The normal distribution can be used to approximate other distributions to simplify the data analysis for a variety of applications.data analysis for a variety of applications.

19