week_4_unit_i
TRANSCRIPT
-
7/27/2019 week_4_UNIT_I
1/25
1
2. The Poisson Probability Distribution
The Poisson distribution was developed by the French mathematician Simeon
Denis Poisson in 1837.
The Poisson random variablesatisfies the following conditions:
1. The number of successes in two disjoint time intervals is independent.2. The probability of a success during a small time interval is proportional to
the entire length of the time interval.
Apart from disjoint time intervals, the Poisson random variable also applies todisjoint regions of space.
Applications
the number of deaths by horse kicking in the Prussian army (firstapplication)
birth defects and genetic mutations rare diseases (like Leukemia, but not AIDS because it is infectious and so not
independent) - especially in legal cases
car accidents traffic flow and ideal gap distance number of typing errors on a page hairs found in McDonald's hamburgers spread of an endangered animal in Africa failure of a machine in one month
The probability distribution of a Poisson random variableX representing the
number of successes occurring in a given time interval or a specified region of
space is given by the formula:
-
7/27/2019 week_4_UNIT_I
2/25
2
where
x= 0, 1, 2, 3...
e= 2.71828 (but use your calculator's ebutton)
= mean number of successes in the given time interval or region of space
Mean and Variance of Poisson Distribution
If is the average number of successes occurring in a given time interval or region
in the Poisson distribution, then the mean and the variance of the Poisson
distribution are both equal to .
E(X) =
and
V(X) = 2=
Note: In a Poisson distribution, only oneparameter, is needed to determine the
probability of an event.
EXAMPLE 1
A life insurance salesman sells on the average 3 life insurance policies per week.
Use Poisson's law to calculate the probability that in a given week he will sell
(a) some policies
(b) 2 or more policies but less than 5 policies.
-
7/27/2019 week_4_UNIT_I
3/25
3
(c) Assuming that there are 5 working days per week, what is the probability that
in a given day he will sell one policy?
Answer
Here, = 3
(a) "Some policies" means "1 or more policies". We can work this out by finding 1
minus the "zero policies" probability:
P(X> 0) = 1 P(x0)
Now So o
b)
c) Average number of policies sold per day:
So on a given day,
-
7/27/2019 week_4_UNIT_I
4/25
4
EXAMPLE 2
Twenty sheets of aluminum alloy were examined for surface flaws. The frequency
of the number of sheets with a given number of flaws per sheet was as follows:
Number of flaws Frequency
0 4
1 3
2 5
3 2
4 4
5 1
6 1
What is the probability of finding a sheet chosen at random which contains 3 or
more surface flaws?
Answer
The total number of flaws is given by:
(0 4) + (1 3) + (2 5) + (3 2) + (4 4) + (5 1) + (6 1) = 46
So the average number of flaws for the 20 sheets is given by:
-
7/27/2019 week_4_UNIT_I
5/25
5
The required probability is:
Histogram of Probabilities
We can see the predicted probabilities for each of "No flaws", "1 flaw", "2 flaws",etc on this histogram.
Histogram of Probabilities
[The histogram was obtained by graphing the following function for integer values
ofxonly.
Then the horizontal axis was modified appropriately.]
-
7/27/2019 week_4_UNIT_I
6/25
6
EXAMPLE 3
If electricity power failures occur according to a Poisson distribution with an
average of 3 failures every twenty weeks, calculate the probability that there will
not be more than one failure during a particular week.
Answer
The average number of failures per week is:
"Not more than one failure" means we need to include the probabilities for "0
failures" plus "1 failure".
EXAMPLE 4
Vehicles pass through a junction on a busy road at an average rate of 300 per hour.
1. Find the probability that none passes in a given minute.2. What is the expected number passing in two minutes?3. Find the probability that this expected number actually pass through in a
given two-minute period.
Answer
The average number of cars per minute is:
(a)
http://www.intmath.com/counting-probability/ans-13.php?a=3http://www.intmath.com/counting-probability/ans-13.php?a=3http://www.intmath.com/counting-probability/ans-13.php?a=3 -
7/27/2019 week_4_UNIT_I
7/25
7
(b) E(X) = 5 2 = 10
(c) Now, with = 10, we have:
Histogram of Probabilities
Based on the function
we can plot a histogram of the probabilities for the number of cars per minute:
EXAMPLE 5
A company makes electric motors. The probability an electric motor is defective is
0.01. What is the probability that a sample of 300 electric motors will contain
exactly 5 defective motors?
Answer
The average number of defectives in 300 motors is = 0.01 300 = 3
The probability of getting 5 defectives is:
http://www.intmath.com/counting-probability/ans-13.php?a=4http://www.intmath.com/counting-probability/ans-13.php?a=4http://www.intmath.com/counting-probability/ans-13.php?a=4 -
7/27/2019 week_4_UNIT_I
8/25
8
NOTE:This problem looks similar to abinomial distributionproblem, that we met
in the last section.
If we do it using binomial, with n= 300,x= 5,p= 0.01 andq= 0.99, we get:
P(X= 5) = C(300,5)(0.01)5(0.99)
295= 0.10099
We see that the result is very similar. We can use binomial distribution to
approximate Poisson distribution (and vice-versa) under certain circumstances.
Histogram of Probabilities
3.The Normal Distribution
A random variableXwhose distribution has the shape of a normal curveis called a
normal random variable.
http://www.intmath.com/counting-probability/12-binomial-probability-distributions.phphttp://www.intmath.com/counting-probability/12-binomial-probability-distributions.phphttp://www.intmath.com/counting-probability/12-binomial-probability-distributions.phphttp://www.intmath.com/counting-probability/12-binomial-probability-distributions.php -
7/27/2019 week_4_UNIT_I
9/25
9
Normal Curve
This random variableXis said to be normally distributed with mean and standard
deviation if its probability distribution is given by
Properties of a Normal Distribution
1. The normal curve is symmetrical about the mean ;2. The mean is at the middle and divides the area into halves;3. The total area under the curve is equal to 1;4. It is completely determined by its mean and standard deviation (or
variance 2)
Note:
In a normal distribution, only 2parameters are needed, namely and 2.
Area Under the Normal Curve using Integration
The probability of a continuous normal variableXfound in a particular interval [a,
b] is the area under the curve bounded byx = aandx = band is given by
-
7/27/2019 week_4_UNIT_I
10/25
10
and the area depends upon the values of and .
The Standard Normal Distribution
It makes life a lot easier for us if we standardizeour normal curve, with a mean of
zero and a standard deviation of 1 unit.
If we have the standardized situationof = 0and = 1, then we have:
Standard Normal Curve = 0, = 1
We can transform all the observations of any normal random variableXwith mean
and variance to a new set of observations of another normal random variableZ
with mean 0 and variance 1 using the following transformation:
We can see this in the following example.
Example
-
7/27/2019 week_4_UNIT_I
11/25
11
Say = 2 and = 1/3in a normal distribution.
The graph of the normal distribution is as follows:
= 2, = 1/3
The following graph represents the same information, but it has been
standardizedso that = 0 and = 1:
= 0, = 1
The two graphs have different and , but have the same shape (if we tweak the
axes).
The new distribution of the normal random variableZwith mean 0 and variance 1
(or standard deviation 1) is called a standard normal distribution. Standardizing
the distribution like this makes it much easier to calculate probabilities.
-
7/27/2019 week_4_UNIT_I
12/25
12
If we have mean and standard deviation , then
Since all the values of X falling between x1 and x2 have corresponding Z values
between z1and z2, it means:
The area under theXcurve betweenX = x1andX = x2equals:
The area under theZcurve betweenZ = z1andZ = z2.
Hence, we have the following equivalent probabilities:
P(x1
-
7/27/2019 week_4_UNIT_I
13/25
13
The area above is exactly the same as the area
z1= 0.5 to z2= 2
in the standard normal curve:
= 0, = 1
Percentages of the Area Under the Standard Normal Curve
A graph of this standardized (mean 0 and variance 1) normal curve is shown.
-
7/27/2019 week_4_UNIT_I
14/25
14
In this graph, we have indicated the areas between the regions as follows:
-1 Z 168.27%
-2 Z 2 95.45%
-3 Z 399.73%
This means that 68.27% of the scores lie within 1 standard deviation of the mean.
This comes from:
Also, 95.45% of the scores lie within 2 standard deviations of the mean.
This comes from:
Finally, 99.73% of the scores lie within 3 standard deviations of the mean.
This comes from:
The total area from - < z< is 1.
The z-Table
The areas under the curve bounded by the ordinates z= 0 and any positive value
of zare found in the z-Table. From this table the area under the standard normal
curve between any two ordinates can be found by using the symmetry of the curve
about z= 0.
-
7/27/2019 week_4_UNIT_I
15/25
15
EXAMPLE 1
Find the area under the standard normal curve for the following, using the z-table.
Sketch each one.
(a) between z= 0 and z= 0.78
(b) between z= -0.56 and z= 0
(c) between z= -0.43 and z= 0.78
(d) between z= 0.44 and z= 1.50
(e) to the right of z= -1.33.
Answer
From the z-table:
(a) 0.2823
(b) 0.2123
-
7/27/2019 week_4_UNIT_I
16/25
16
(c) 0.1664 + 0.2823 = 0.4487
(d) 0.4332 - 0.1700 = 0.2632
(e) 0.4082 + 0.5 = 0.9082
-
7/27/2019 week_4_UNIT_I
17/25
17
EXAMPLE 2
Find the following probabilities:
(a) P(Z> 1.06)
(b) P(Z< -2.15)
(c) P(1.06
-
7/27/2019 week_4_UNIT_I
18/25
18
This time, we need to take the area of the whole left side (0.5) and subtract the
area from z= 0 to z= 2.15 (which is actually on the right side, but the z-table is
assuming it is the right hand side.)
(c) This is the same as asking "What is the area between 1.06 and 4.00 under the
standard normal curve?"
(d) This is the same as asking "What is the area between -1.06 and 4.00 under the
standard normal curve?"
We find the area on the left side from z= -1.06 to z= 0 (which is the same as z= 0
to z= 1.06), then add the area between z= 0 to z= 1.04 (on the right side):
EXAMPLE 3
-
7/27/2019 week_4_UNIT_I
19/25
19
It was found that the mean length of 100 parts produced by a lathe was 20.05 mm
with a standard deviation of 0.02 mm. Find the probability that a part selected at
random would have a length
(a) between 20.03 mm and 20.08 mm
(b) between 20.06 mm and 20.07 mm
(c) less than 20.01 mm
(d) greater than 20.09 mm.
Answer
X= length of part
(a) 20.03 is 1 standard deviation below the mean;
20.08 is standard deviations above the mean
So the probability is 0.7745.
(b) 20.06 is 0.5 standard deviations above the mean;
20.07 is 1 standard deviation above the mean
-
7/27/2019 week_4_UNIT_I
20/25
20
So the probability is 0.1498.
(c) 20.01 is 2 s.d. below the mean.
So the probability is 0.0228.
(d) 20.09 is 2 s.d. above the mean, so the answer will be the same as (c),
P(X> 20.09) = 0.0228.
EXAMPLE 4
A company pays its employees an average wage of $3.25 an hour with a standard
deviation of 60 cents. If the wages are approximately normally distributed,
determine
a. the proportion of the workers getting wages between $2.75 and $3.69 anhour;
b. the minimum wage of the highest 5%.Answer
-
7/27/2019 week_4_UNIT_I
21/25
21
X= wage
(a)
So about 56.6% of the workers have wages between $2.75 and $3.69 an hour.
(b) W= minimum wage of highest 5%
x= 1.645 (from table)
Solving gives:x= 4.237
So the minimum wage of the top 5% of salaries is $4.24.
EXAMPLE 5
The average life of a certain type of motor is 10 years, with a standard deviation of
2 years. If the manufacturer is willing to replace only 3% of the motors that fail,
how long a guarantee should he offer? Assume that the lives of the motors follow
a normal distribution.
Answer
X= life of motor
-
7/27/2019 week_4_UNIT_I
22/25
22
x= guarantee period
Normal Curve: = 10, = 2
We need to find the value (in years) that will give us the bottom 3% of thedistribution. These are the motors that we are willing to replace under the
guarantee.
P(X < x) = 0.03
The area that we can find from the z-table is
0.5 - 0.03 = 0.47
The corresponding z-score is z= -1.88.
Since , we can write:
Solving this givesx= 6.24.
So the guarantee period should be 6.24 years.
Example 6
-
7/27/2019 week_4_UNIT_I
23/25
23
In the standard normal curve to the right, the mean is 0 and the standard
deviation is 1.
The green shaded area in the diagram represents the area that is within 1.45
standard deviations from the mean. The area of this shaded portion is 0.4265 (or
42.65% of the total area under the curve).
To get this area of 0.4265, we read down the left side of the table for the standard
deviation's first 2 digits (the whole number and the first number after the decimal
point, in this case 1.4), then we read across the table for the "0.05" part (the toprow represents the 2nd decimal place of the standard deviation that we are
interested in.)
z 0.00 0.01 0.02 0.03 0.04 0.05 0.06
1.4 0.4192 0.4207 0.4222 0.4236 0.4251 0.4265 0.4279
We have:
(left column) 1.4 + (top row) 0.05 = 1.45 standard deviations
The area represented by 1.45 standard deviations to the right of the mean is
shaded in green in the standard normal curve above.
-
7/27/2019 week_4_UNIT_I
24/25
24
You can see how to find the value of 0.4265 in the full z-table below. Follow the
"1.4" row across and the "0.05" column down until they meet at 0.4265.
Exercises
1. In a Binomial distribution containing of 5 independent trials the probability of 1
and
2 successes are 0.49 and 0.2048 respectively. Find parameter p of the
distribution.
2.Ten coins are thrown simultaneously. Find the probability of getting 7 heads.
3. The probability of a man hitting a target is 1/3.
(i) If he fires 5 times, what is the probability of his hitting the target at least
twice.
(ii) How many times must he fires so that the probability of his hitting the target
at
least once is more than 90%.
4. Find mean and variance of Poisson distribution.
5.The probability of a Poisson variate taking the values 1 and 2 are equal.
Find (i) (ii) P (X1) (iii) P (1
-
7/27/2019 week_4_UNIT_I
25/25
(ii) how many students get more than 40
(iii) how many students get below 20
(iv) how many students get more than 50
****