quiz 3 review - myweb.uiowa.eduย ยท quiz 3 review 171:161 spring 2014 . distribution summaries...

13
Quiz 3 Review 171:161 Spring 2014

Upload: others

Post on 16-Jun-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

Quiz 3 Review 171:161 Spring 2014

Page 2: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

Distribution Summaries

โ€ข Binomial: ๐‘›!

๐‘˜! ๐‘›โˆ’๐‘˜ !๐‘๐‘˜ 1 โˆ’ ๐‘ ๐‘›โˆ’๐‘˜

โ€ข Used to model โ€œkโ€ successes out of โ€œnโ€ binary trials

โ€ข Has a mean of ๐‘› โˆ— ๐‘ (number of trials times probability of success)

โ€ข Standard Deviation of ๐‘› โˆ— ๐‘ โˆ— (1 โˆ’ ๐‘)

โ€ข Note that when using the normal approximation the following formula is often used:

๐‘†๐ธ =๐‘0 โˆ— (1 โˆ’ ๐‘0)

๐‘›

Page 3: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

Normal Distribution and Central Limit Theorem

โ€ข Central limit theorem tells us that the distribution of sample means will be Normal with a mean equal to the population mean and a standard error equal to the population standard deviation divided by the square root of n

โ€ข In practice we do not know the population standard deviation, so t-tests are more appropriate for inference

โ€ข Because of this, the Normal Distribution is most often used in estimating probabilities/approximating the binomial and not for hypothesis testing.

Page 4: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

Studentโ€™s t-distribution

โ€ข Bell shaped, symmetric and centered around zero.

โ€ข Heavier tails than the normal distribution.

โ€ข Has 1 parameter (degrees of freedom)

โ€ข As degrees of freedom increase the tails become thinner and the distribution looks more normal.

โ€ข When sample size is small, the t-distribution performs much better than the normal distribution.

Page 5: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

How do I know which distribution to use??

Binomial problems will typically ask for the probability of k successes out of n trials or an expected number of successes. For example:

โ€ข โ€œWhat is the probability of exactly 4 out of 10 individuals having a positive test?โ€

โ€ข โ€œIf 8 cards are randomly drawn (with replacement) how many do you expect to be spades?โ€

โ€ข โ€œWhat are the chances that at least one confidence interval out of 20 does not contain the true mean?โ€

Normal distribution problems will focus on a single probability. For example: โ€ข โ€œWhat is the probability an infant weighs less than 3000 grams?โ€

โ€ข โ€œWhat is the probability that a woman is over 5โ€™6โ€™โ€™?โ€

โ€ข โ€œWhat probability that a sample of 10 students has an average IQ over 110?โ€

Page 6: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

Example #1:

Height for US males is normally distributed with a mean of 70 inches and standard deviation of 3 inches.

A) What is the probability that randomly selected male is over 6โ€™6โ€™โ€™?

B) What is the probability that at least 1 male out of a group of 5 is over 6โ€™6โ€™โ€™?

C) What is the probability that a sample of 5 males has an average height of over 6โ€™6โ€™โ€™?

Page 7: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

A) Z = (76 โ€“ 70)/3 = 2 P( Z > 2) = 0.023

B) Binomial โ€ฆ P = 1 โ€“ Prob(Zero males > 6โ€™6โ€™โ€™) = 1 โ€“ (1 - 0.023 )^5 = 0.11

C) Z = (76 โ€“ 70)/(3/sqrt(5)) = 4.47 P( Z > 4.47 ) = .000004

Page 8: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

Example #2:

Pair 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

IQ of PKU case 89 98 116 67 128 81 96 116 110 90 76 71 100 108 74

IQ of sibling 77 110 94 91 122 94 121 114 88 91 99 93 104 102 82

PKU Sibling

Mean 94.66667 98.80000

Standard Deviation 18.54980 13.36413

Dobson et al. [1976], 36 patients with a confirmed diagnosis of phenylketonuria (PKU) were identified and

placed on dietary therapy before reaching 121 days of age. The children were tested for IQ (Stanford-Binet

test) between the ages of 4 and 6; subsequently, their normal siblings of closest age were also tested with the

Stanford-Binet. The following are the first 15 pairs listed in the paper.

Part A) Construct a 90% confidence interval for the mean IQ of PKU siblings. Part B) The average IQ is 100, conduct a hypothesis test to determine whether PKU cases have an IQ different from normal.

Page 9: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

Part A) 90% CI: ๐‘ฅ ยฑ ๐‘ก.95 โˆ— ๐‘†๐ธ

94.667 ยฑ 1.761 โˆ— (18.550

15)

(82.2, 103.1)

Part B) ๐‘‡๐‘œ๐‘๐‘  =94.667 โˆ’100

18

15

= โˆ’1.15

For alpha = 0.05 this t-statistic does not indicate significance, thus we conclude there is no evidence of a difference in IQ from normal.

Page 10: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

Example #3

The population of sonar operators has a mean identification rate of 82 targets out of 100. Recently a psychologist developed a new sonar training system that tested on 15 trainees. The group of trainees had a mean id rate of 85.7 and a standard deviation of 5.1. Conduct a hypothesis test to evaluate the effectiveness of the new training scheme.

Page 11: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

๐ป0: ๐ผ๐ท ๐‘…๐‘Ž๐‘ก๐‘’ = 82 ๐ป๐ด: ๐ผ๐ท ๐‘…๐‘Ž๐‘ก๐‘’ โ‰  82

๐‘ก๐‘œ๐‘๐‘  =85.7 โˆ’ 82

5.1

15

= 2.8

With 14 degrees of freedom a t-statistic of 2.8 will have a two-sided p-value less than 0.02

Thus we can conclude that the trainee group scored significantly higher than average and the training was effective at improving identification rates.

Page 12: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

Example #4: Conceptual Questions:

โ€ข How does the width of a 90% CI compare to the width of a 95% CI?

โ€ข When a population is skewed why can we still use the Normal Distribution to make inferences about the sample mean?

โ€ข How does the t-distribution with 10 degrees of freedom compare to the t-distribution with 20 degrees of freedom? How to each compare to the Normal Distribution?

โ€ข Why does the t-distribution used instead of the Normal Distribution in most circumstances?

Page 13: Quiz 3 Review - myweb.uiowa.eduย ยท Quiz 3 Review 171:161 Spring 2014 . Distribution Summaries โ€ขBinomial: ! ๐‘˜! โˆ’๐‘˜! ๐‘๐‘˜1โˆ’๐‘ โˆ’๐‘˜ โ€ขUsed to model โ€œkโ€ successes

โ€ข The 95% CI is wider

โ€ข The mean is calculated as a sum of many observations, central limit theorem thus says it will be normally distributed if there are sufficient observations included.

โ€ข The t-distribution with fewer degrees of freedom will have thicker tails, both of these t-distributions will have thicker tails than the normal distribution.

โ€ข We are adding variability to our inference by using the sample standard deviation as an estimate of the population standard deviation.