+ probability. + the basic law of probability all outcomes being likely, an event’s probability =...
TRANSCRIPT
+
Probability
+THE BASIC LAW OF PROBABILITY
ALL OUTCOMES BEING LIKELY, AN EVENT’S PROBABILITY =
THE WAYS OF THE EVENT YOU’RE INTERESTED IN OCCURING/TOTAL POSSIBLE NUMBER OF OUTCOMES (n)
Written as a formula, this would be:
P(A)=number of events in A / total number of trials n
+Some basic probabilities
What is the probability that you roll a three on a six sided die?
What is the probability of drawing the 10 of hearts from a deck of cards?
What is the probability that you flip a heads?
These are easy, what about when we have to deal with events that occur in several stages?
+We make trees
+Example
You flip a coin 3 times, what is the probability of 3 heads? Heads, Tails, Heads At least one Tails 2 or more heads?
+Make your tree
Each new section represents a trial
+A handy formula for finding n
TO FIND n (THE TOTAL POSSIBLE OUTCOMES)
HANDY FORMULA: Number of possible outcomes per object ^ number of
objects 4 sides of each COIN ^ 3 individual COINS This is your n=2 ^ 3 = 8
+EXAMPLE
YOU ROLL A PAIR OF 4-SIDED DICE. EACH OUTCOME HAS A PROBABILITY OF ______________
+HOW TO SOLVE IT:
FIRST, FIND N (THE TOTAL POSSIBLE OUTCOMES)
HANDY FORMULA: Number of possible outcomes per object ^ number of
objects 4 sides of each die ^ 2 individual dice This is your n
THE PROBABILITY FOR EACH OUTCOME IS 1/N OR 1/16
THIS COULD ALSO BE SHOWN BY A TREE
+Probability tree
+EXAMPLE
YOU ROLL A PAIR OF 4-SIDED DICE. WHAT IS THE PROBABILITY THAT THE SUM IS EVEN?
+HOW TO SOLVE IT: We already made the tree:
Just go through it, find the even sums, and divide by 16
+EXAMPLE
YOU ROLL A PAIR OF 4-SIDED DICE. WHAT IS THE PROBABILITY THAT THE FIRST ROLL IS BIGGER THAN YOUR SECOND ROLL?
+HOW TO SOLVE IT: USE THE TREE
+
Rules of Probability
Unions, Intersections
+A trick to remember the difference
Intersection: MIT DUSP is located at the intersection of Mass Ave AND Vasser. An intersection contains the elements in A AND B Example: You have two sets
A={2,4,6,8,10} B={1,2,3,4,5} What is A B?
Intersections:
+
Unions Union: Think of a union as a marriage between two
sets: When people get married they bring their belongings into one house. Items which either he OR she owned are now in the new house. A Union contains elements in A OR in B
Example: A={2,4,6,8,10} B={1,2,3,4,5}
What it A B? The number is in A or B
A trick to remember the difference
+The General Rule for addition
Used for unions of probabilities
What is the probability that either A or B happens?
The formula is P(A U B) = P(A) + P(B) – P(A∩B)
+The probability of 3 events occurring (A and B and C)
P( A U B U C)= P(A) + P(B) + P(C) – P(A B) –P(A C)-P(B C)∩ ∩ ∩
+Example You qualify to be in the English civil service if you have
a degree, you are a member of Pi Alpha, or you pass an exam. What is the probability a person is on the list because they passed an exam, had a degree, or was a member of Pi Alpha?
Exam result
Degree
No Degree
Subtotal
Passed 26 14 40
Failed 4 16 20
Subtotal
30 30
Total 60
Pi alpha member
Exam Result
Degree No degree
subtotal
Passed 64 16 80
Failed 36 74 110
Subtotal
100 90 190
Total 190
Non Member
Total
Pass 120
Fail 130
Total 250
+Add the probabilities up… This question is asking you to calculate an EITHER
probability, so you use the addition rule
It is asking for three events, so you need to add all three, subtract shared, and then re-add the overlap of all three
Find probability of Passing: 120/250
Find the probability of Having a degree: 130/250
Being a member: 60/250
Exam result
Degree
No Degree
Subtotal
Passed 26 14 40
Failed 4 16 20
Subtotal
30 30
Total 60
Exam Result
Degree No degree
subtotal
Passed 64 16 80
Failed 36 74 110
Subtotal
100 90 190
Total 190
Total
Pass 120
Fail 130
Total 250
Pi alpha member Non Member
+Last steps:
The probability of passing, having a degree, or being a member is: 0.48+0.52+0.24 The answer is 1.24 (which we know can’t be right)
We now need to subtract P(A and B) P(A and C) and P(B and C)
Go back to the tables to get these numbers
This is P(member and Pass) =(40/250)=0.16 P(member and degree) =(30/250)=0.12 P(Pass and degree)= (90/250)=0.36
+Last steps:
Then re-add P(A and B and C) The table says it is 26/250=0.104
So: 1.24-(0.16+0.36+0.12) + 0.104=
0.704 is the probability a person is on the list because they passed an exam, had a degree, or was a member of Pi Alpha?
+General multiplication rule
Used when you want to find the joint probability of two events
This is an “and” probability
The probability of A and B, or P(A B) Equals P(A) * P(B | A)
If A and B are independent, you can just take P(A) * P(B)
This is super simple and is best illustrated with an example
∩
+Independence
When the probability of an event is not influenced by the event before it
+Independence
A jar has 3 red marbles, 3 blue marbles, and 2 yellow marbles.
You pick out one marble. What is the probability it is red?
It was red. What is the probability that your next one is red, if you don’t put the red back in the jar?
What if you put it back in and then pick?
+Example : conditional probability and independence
You have 4 females and 2 males in a group. You need to select two people to be on a committee. You want to choose at random, and you can’t choose the same person twice. What is the probability of….
(F F)
M F)
∩
∩
+Make a tree
+The formula for what you just did
P(M F) = P(M) * P(F | M)
P(F F)=P(F) * P(F | F) This is where the conditional is important, if you choose a
female first, the total number of females that can be selected from decreased
∩
∩
+Are these events independent?
Use this formula for independence: If A and B are independent then P(B)= P(B|A) and P(A)=P(A|B)
In this example, that means that the probability of choosing a male is equal to the probability that you choose a male given you chose a female first.
It also means the probability of choosing a female equals the probability that you chose a female given you chose a male first
+Example General Multiplication Rule You are taking pizza orders. A customer can order a
small, medium, or large. They can choose thin or thick crust. They can choose up to two toppings, peperoni or mushrooms. Are these likelihoods independent?
This is a real life example of conditional probability (several conditions across several stages)
+Sample Problem: General Rule of multiplication:
What is P(Small Thin No peperoni mushrooms)?
What is the probability of small ∩ thick crust?
What is the probability of small ∩ thick crust ∩ peperoni?
∩ ∩ ∩
+
+
+Examples where you’d use both Multiplication and Addition rules
Often, in probability you don’t use one rule on its own
Trees help you determine when and where to use each rule
When you make a tree and move left to right, it is the multiplication rule
When you are going up and down, it is the addition rule.
Let’s show this with an example.
+Sample problem 3
There are three restaurants in town. They get 50%, 30%, and 20% of the business. You know that 70% of the customers that leave Restaurant 1 are satisfied. 60% at Restaurant 2 are satisfied and 50% that leave restaurant 3 are satisfied. What is the probability that someone eating in this town leaves satisfied?
+Make your tree
+Multiply through each branch to get the conditional probabilities
+Add down the line to get total p for satisfaction
You’re first multiplying across to get the conditionals
Then you’re adding up and down, to get the probability of being satisfied at 1, 2, OR 3
.35+.18+.1=.63
You have a 63% change of being satisfied when eating out in town
The formula for what you just did: What is the P[(eat at R1 satisfies) (eat at R2
satisfied) eat at R 3 satisfied]∩ ∩
∩∪ ∪
+One last example
You’re playing in a chess tournament. Your probability of winning against half of the players (type 1) is 0.3. Your probability of winning against a quarter of the players (type 2) is 0.4, and it is .5 when playing against the other quarter of the players (type 3). What’s the probability of winning when a random opponent
is chosen?
+Make your table
+Multiply out your conditionals P(win|type 1), P(win|type 2), and P(win|type3)
They are independent so you can just multiply through L to R
+Sum according to addition rule
0.15+0.1+0.125=0.375
Your chance of winning given a randomly chosen opponent is 0.375 or just over 37%
The formula shows why you’d conclude with the addition rule: P(win|type 1) (win|type2) (win|type 3)∪ ∪
+Conditional Probability
“Given X, what is the probability of Y”
Example: You’re picking one person at random from the class. Given the person in the class is a female, what the the probability he or she is blonde?
What statisticians would write: P(Blonde | Female) Tips: your total (n, or the number you divide by is only the
girls! Not the whole class) (# of blondes/#of girls)
+From your pset!
Given your cloth is from a hand loom, what is the probability that the quality is poor?• Locate Handloom cloth• How many total pieces are there made by a hand loom?• How many of those are of poor quality?
+Hints for solving Probability word problems
When there is already a table, diagramming a tree is unnecessary
Be careful to take the right total (n, denominator) Especially in conditional probabilities!
The simplest example of a conditional probability is the blonde | woman example we did above, store that in your head for easy reference, and so you’re not intimidated by the “ | “
+The Normal Distribution
+Probability Distributions
A probability distribution is a table or an equation that links each outcome of a statistical experiment with its probability of occurrence.
+Real life normal distribution
+Distributions fit with different types of variables:Discrete variables: takes on a countable number of values -the number of job classifications in an agency -the number of employees in a department -the number of training sessions
Continuous variables: takes a countless (or super big) range of numerical values -temperature -pressure -height, weight, time -Dollars: budgets, income. (not strictly continuous) but they can take so many values that are so close that you may as well treat them that way
+The Normal Distribution Characteristics -continuous variables only
-The bell curve shape is familiar
-most values cluster around the mean mu
-As values fall at a greater distance from the mean, their likelihood of occurring shrinks
-Its shape is completely determined by its mean and its standard deviation -The height of the curve is the greatest at the mean (where probability of occurrence is highest)
+ -68.26% of values fall within one standard deviation of the mean in either direction-95.44% of values fall within 2 standard deviations of the mean in either direction-99.72% of values fall within 3 standard deviations of the mean in either direction
+z scores
• The number of standard deviations a score of interests lies away from the mean in a normal distribution
• It is used to convert raw data into their associated probability of occurrence with reference to the mean
• The score we are interested in is X. To find the z score of X, subtract the mean mu from it then divide by the amount of standard deviations (sigma) to determine how many SD’s the score is from the mean
+z scores
• The z score itself equals the number of SD’s (sigma) that a score of interest (X) is from the mean (mu) in a normal distribution
• A data value X one standard deviation above the mean has a z score of 1
• A data value X 2 SD’s above the mean has a z score of 2• The probability associated with a z score of one is 0.3413; see
below in the blue oval: (68.26/2)=34.13% of the data values lie between the mean and 1 SD above it
+z scores
• The z score for 1 SD below the mean will be the same in magnitude (0.3413) but with a negative z score of -1.0
• Thus, the z score of -1.0 contains 34.13% of the data i.e. just over one third of the data fall between mu and 1 SD below it
+Example:
What is the likelihood that a value has a z score of 2.0?
+Example:
What is the likelihood that a value has a z score of 2.0?
It is equal to 95.55/2=47.72%
(Meaning, just over 47% of the data fall between mu and 2 SD above it)
+The normal distribution table
• Displays the percentage of data values falling between the mean mu and each z score
• the first 2 digits are in the far left column
• the third digit is on the top row • The associated probability is
where they meet
+Locating z score for 1.0
+example
What percent of the data lies between mu and 1.33 SD away?
+Locate 1.33 on the z table
The answer is that 40.824% of the data fall between the mean and 1.33 SD from the mean
+Application Example:
The police chief is reviewing the academy’s exam scores. The police department’s entrance exam has a normal distribution with a mean of 100 and SD of 10. Someone scored 119.2 on the exam. Is this a good score?
+Solution
-another way of asking this is: what is the probability that any random applicant takes the test and scores a 119.2?-If the probability is high, then it is an average or mediocre score, if the probability is low, then it is an exceptional score
-Step 1: convert the test score to a z score using the formula:
(119.2-100)/10=1.92
+Solution
-Step 2: Use the z score of 1.92 (how many standard deviations the score is above the mean, since it is a positive z score) Look it up in the z table.
+Solution
-Step 3: The value here is .4726-But you’re not done. Here is what you just found:
-We also need to add in the part of the curve shaded in green, or all ofThe scores under the mean.
(0.5+0.4726=) 97.26 is the percentile, or in other words, 97.26% of the scores fall below this score-The probability that a randomly selected individual will get this score or better is 1-97.26=.0274
.4726.50
+Tips
Always draw a picture, it helps you reason through your answer
The z curve is symmetric, so if a your score was a -1.92, it would still contain ~47.2% of the data.
+
The Binomial Distribution
The last section, I promise.
+A gem from the reading
+Probability Distributions
A probability distribution is a table or an equation that links each outcome of a statistical experiment with its probability of occurrence.
+Binomial Distribution Definition
• The probability an event will occur a specified number of times within a specified number of trials
• Examples: mail will be delivered before a certain time every day this weekequipment in a factory remains operational in a 10 day period • This is a DISCRETE distribution that
deals with the likelihood of observing a certain number of events in a set number of repeated trials
+A Bernoulli process
• The Binomial distribution can be used when the process is Bernoulli
• Bernoulli characteristics:• The outcome of a trial is either a success or a
failure • The outcomes are mutually exclusive• The probability of a success is constant from
trial to trial • One trial’s probability of success is not
affected by the trial before it (INDEPENDENCE)• Examples of independent events could
be multiple coin tosses , A fire occurring in a community isn’t affected by if one happened the night before
+When looking at Bernoulli Events
You can calculate their probability with the binomial distribution
• Examples of Bernoulli events: • coin flip is either heads, or
not heads• A crime is either solved or
not solved
+To calculate a probability using the Binomial Distribution you need• n=number of trials• r=number of successes• p=probability that the
event will be a success• q=(1-p)
+Breaking down the formula
Is a combination, it is read, “a combination of n THINGS taken r at a time”
The formula is:
+Example
We flip a coin three times, and we want to know the probability of getting three heads
+Step 1
Define N, P, R, and Q
n (number of trials) =3r (successes)=3 [number of heads]p (probability of getting a heads on a flip)= 0.5q (1-p)=0.5
Now fill in the formula
+Important when solving
• 0!=1• Any number raised to the power of 0
= 1