Lecture 2: Thu, Jan 16
• Hypothesis Testing – Introduction (Ch 11)
• Concepts of testing
• Tests of Hypothesis (Sigma known)
– Rejection Region method
– P-value method
– Two-tail test example
• Relationship between Tests and C.I.
Introduction
• The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief about a parameter.
• Examples
– Does the statistical evidence in a random sample of potential customers support the hypothesis that more than 10% of the potential customers will purchase a new product?
– Is a new drug effective in curing a certain disease? A sample of patients is randomly selected. Half of them are given the drug while the other half are given a placebo. The improvement in the patients’ condition is then measured and compared.
Hypothesis Testing in the Courtroom
• Null hypothesis: The defendant is innocent
• Alternative (research) hypothesis: The defendant is guilty
• The goal of the procedure is to determine whether there is enough evidence to conclude that the alternative hypothesis is true. The burden of proof is on the alternative hypothesis.
• Two types of errors:
– Type I error: Reject null hypothesis when null hypothesis is true (convict an innocent defendant)
– Type II error: Do not reject null hypothesis when null is false (fail to convict a guilty defendant)
Concepts of Hypothesis Testing
• The critical concepts of hypothesis testing.
– Example 11.1
• The manager of a department store is thinking about establishing a new billing system for the store’s credit customers.
• The new system will be cost effective only if the mean monthly account (μ) is more than $170.
– There are two hypotheses about a population mean:
• H0: The null hypothesis, μ = 170
• H1: The alternative hypothesis, μ > 170 (what you want to prove)
• Assume the null hypothesis is true (μ = 170).
– Sample from the customer population, and build a statistic related to the parameter hypothesized (the sample mean).
– Pose the question: How probable is it to obtain a sample mean at least as extreme as the one observed from the sample, if H0 is correct?
• Common sense suggests the following.
– When the sample mean is close to 170, it is not implausible that the mean μ is 170. Do not reject the null hypothesis.
– Suppose x̄ is much larger than 170; then the mean μ is likely to be greater than 170. Reject the null hypothesis.
Types of Errors
• Two types of errors may occur when deciding whether to reject H0 based on the statistic value.
– Type I error: Reject H0 when it is true.
– Type II error: Do not reject H0 when it is false.
• Example continued
– Type I error: Reject H0 (μ = 170) in favor of H1 (μ > 170) when the real value of μ is 170.
– Type II error: Believe that H0 is correct (μ = 170) when the real value of μ is greater than 170.
Testing the Population Mean When the Population Standard Deviation is Known
• Example 11.1
– A new billing system for a department store will be cost-effective only if the mean monthly account is more than $170.
– A sample of 400 accounts has a mean of $178.
– If accounts are approximately normally distributed with σ = $65, can we conclude that the new system will be cost-effective?
Testing the Population Mean (σ is Known)
• Example 11.1 – Solution
– The population of interest is the credit accounts at the store.
– We want to know whether the mean account for all customers is greater than $170.
H1: μ > 170
– The null hypothesis must specify a single value of the parameter μ.
H0: μ = 170
Approaches to Testing
• There are two approaches to testing whether the sample mean supports the alternative hypothesis (H1):
– The rejection region method is mandatory for manual testing (but can also be used when testing is supported by statistical software).
– The p-value method is mostly used when statistical software is available.
The Rejection Region Method
The rejection region is a range of values such that if the test statistic falls into that range, the null hypothesis is rejected in favor of the alternative hypothesis.
The Rejection Region Method for a Right-Tail Test
Example 11.1 – solution continued
• Recall: H0: μ = 170, H1: μ > 170.
• Therefore, it seems reasonable to reject the null hypothesis and believe that μ > 170 if the sample mean is sufficiently large.
[Figure: the region labeled "Reject H0 here" lies beyond the critical value of the sample mean.]
The Rejection Region Method for a Right-Tail Test
Example 11.1 – solution continued
• Define a critical value for x̄, denoted x̄_L, that is just large enough to reject the null hypothesis.
• Reject the null hypothesis if
\[ \bar{x} > \bar{x}_L \]
Determining the Critical Value for the Rejection Region
• Allow the probability of committing a Type I error to be α (also called the significance level).
• Find the value of the sample mean that is just large enough so that the actual probability of committing a Type I error does not exceed α. Watch…
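Determining the Critical Value for a Right-Tail Test
Example 11.1 – solution continued
P(commit a Type I error) = P(reject H0 given that H0 is true) = P(x̄ > x̄_L given that H0 is true), which is allowed to be α.
Since P(Z > z_α) = α, we have:
\[ z_\alpha = \frac{\bar{x}_L - 170}{65/\sqrt{400}} \]
[Figure: sampling distribution of x̄ centered at 170, with the critical value x̄_L marked.]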
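Determining the Critical Value for a Right-Tail Test
Example 11.1 – solution continued
\[ z_\alpha = \frac{\bar{x}_L - 170}{65/\sqrt{400}} \quad\Rightarrow\quad \bar{x}_L = 170 + z_\alpha \frac{65}{\sqrt{400}} \]
If we select α = 0.05, then z_{.05} = 1.645, and
\[ \bar{x}_L = 170 + 1.645 \cdot \frac{65}{\sqrt{400}} \approx 175.34 \]
[Figure: sampling distribution of x̄ centered at 170, with area α = 0.05 beyond x̄_L.]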
Determining the Critical Value for a Right-Tail Test
Reject the null hypothesis if x̄ > 175.34.
Conclusion: Since the sample mean (178) is greater than the critical value of 175.34, there is sufficient evidence to infer that the mean monthly balance is greater than $170 at the 5% significance level.
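The critical-value calculation for Example 11.1 can be sketched in a few lines of Python. This is only an illustration using the slide's numbers (hypothesized mean 170, σ = 65, n = 400, α = 0.05); the function name `critical_sample_mean` is hypothetical, and the standard library's `statistics.NormalDist` stands in for the z-table.

```python
from math import sqrt
from statistics import NormalDist


def critical_sample_mean(mu0, sigma, n, alpha):
    """Critical value of the sample mean for a right-tail z-test (sigma known)."""
    # z_alpha cuts off an area of alpha in the right tail of the standard normal.
    z_alpha = NormalDist().inv_cdf(1 - alpha)
    return mu0 + z_alpha * sigma / sqrt(n)


# Example 11.1 values taken from the slides.
x_bar_L = critical_sample_mean(mu0=170, sigma=65, n=400, alpha=0.05)
reject_h0 = 178 > x_bar_L  # compare the observed sample mean to the critical value

print(f"critical value: {x_bar_L:.2f}, reject H0: {reject_h0}")
```

Because the code uses an unrounded z-value rather than the table value 1.645, the critical value it reports may differ from the slide's 175.34 in the last digit; the decision is the same.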
The standardized test statistic
– Instead of using the statistic x̄, we can use the standardized value z:
\[ z = \frac{\bar{x} - \mu}{\sigma/\sqrt{n}} \]
– Then, the rejection region becomes
\[ z > z_\alpha \quad \text{(one-tail test)} \]
The standardized test statistic
• Example 11.1 – continued
– We redo this example using the standardized test statistic. Recall:
H0: μ = 170
H1: μ > 170
– Test statistic:
\[ z = \frac{\bar{x} - \mu}{\sigma/\sqrt{n}} = \frac{178 - 170}{65/\sqrt{400}} = 2.46 \]
– Rejection region: z > z_{.05} = 1.645.
The standardized test statistic
• Example 11.1 – continued
Reject the null hypothesis if Z > 1.645.
Conclusion: Since Z = 2.46 > 1.645, reject the null hypothesis in favor of the alternative hypothesis.
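As a rough check of the standardized version of Example 11.1, the following Python sketch computes the test statistic and compares it with the right-tail critical value. The helper name `z_statistic` is hypothetical, and `statistics.NormalDist` is used in place of the z-table.

```python
from math import sqrt
from statistics import NormalDist


def z_statistic(x_bar, mu0, sigma, n):
    """Standardized test statistic for a mean when sigma is known."""
    return (x_bar - mu0) / (sigma / sqrt(n))


# Example 11.1 values taken from the slides.
z = z_statistic(x_bar=178, mu0=170, sigma=65, n=400)
z_crit = NormalDist().inv_cdf(1 - 0.05)  # right-tail critical value for alpha = 0.05
reject_h0 = z > z_crit

print(f"z = {z:.2f}, critical z = {z_crit:.3f}, reject H0: {reject_h0}")
```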
P-value Method
– The p-value provides information about the amount of statistical evidence that supports the alternative hypothesis.
– The p-value of a test is the probability of observing a test statistic at least as extreme as the one computed, given that the null hypothesis is true.
– Let us demonstrate the concept on Example 11.1.
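P-value Method
The probability of observing a test statistic at least as extreme as 178, given that μ = 170, is…
\[ P(\bar{x} > 178 \text{ when } \mu = 170) = P\!\left(Z > \frac{178 - 170}{65/\sqrt{400}}\right) = P(Z > 2.4615) = .0069 \]
[Figure: sampling distribution of x̄ centered at 170, with x̄ = 178 marked; the tail area beyond 178 is the p-value.]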
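The p-value for Example 11.1 can be reproduced with a short Python sketch. The function name `right_tail_p_value` is hypothetical; the normal tail area comes from `statistics.NormalDist` rather than a table, so the result should only agree with the slide's .0069 up to rounding.

```python
from math import sqrt
from statistics import NormalDist


def right_tail_p_value(x_bar, mu0, sigma, n):
    """P(sample mean exceeds x_bar) when the true mean is mu0 and sigma is known."""
    z = (x_bar - mu0) / (sigma / sqrt(n))
    return 1 - NormalDist().cdf(z)  # area to the right of z under the standard normal


# Example 11.1 values taken from the slides.
p_value = right_tail_p_value(x_bar=178, mu0=170, sigma=65, n=400)
print(f"p-value = {p_value:.4f}")
```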
Interpreting the p-value
• Because the probability that the sample mean will assume a value of more than 178 when μ = 170 is so small (.0069), there are reasons to believe that μ > 170.
• In addition, note that observing a value of 178 when the true mean is 170 is rare, but under the alternative hypothesis, observing a value of 178 becomes more probable.
• We can conclude that the smaller the p-value, the more statistical evidence exists to support the alternative hypothesis.
Interpreting the p-value
• Describing the p-value
– If the p-value is less than 1%, there is overwhelming evidence that supports the alternative hypothesis.
– If the p-value is between 1% and 5%, there is strong evidence that supports the alternative hypothesis.
– If the p-value is between 5% and 10%, there is weak evidence that supports the alternative hypothesis.
– If the p-value exceeds 10%, there is no evidence that supports the alternative hypothesis.
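The descriptive scale for p-values can be written as a small lookup. This is a sketch only: the function name `describe_p_value` is hypothetical, and the handling of values exactly at 1%, 5%, and 10% is an assumption, since the slide does not say which side of each boundary they fall on.

```python
def describe_p_value(p):
    """Map a p-value to the slide's verbal description of the evidence for H1."""
    if not 0 <= p <= 1:
        raise ValueError("a p-value must lie between 0 and 1")
    if p < 0.01:
        return "overwhelming"
    if p <= 0.05:  # boundary handling is an assumption
        return "strong"
    if p <= 0.10:  # boundary handling is an assumption
        return "weak"
    return "none"


# The p-value found for Example 11.1 on the slides.
print(describe_p_value(0.0069))
```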
The p-value and the Rejection Region Methods
– The p-value can be used when making decisions based on rejection region methods as follows:
• Define the hypotheses to test, and the required significance level α.
• Perform the sampling procedure, calculate the test statistic and the p-value associated with it.
• Compare the p-value to α. Reject the null hypothesis only if p-value < α; otherwise, do not reject the null hypothesis.
[Figure: illustration with α = 0.05.]
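The following Python sketch walks through this decision procedure for a right-tail test and checks that comparing the p-value to the significance level gives the same decision as the rejection region. The function name `right_tail_z_test` and its return layout are illustrative choices, not taken from the slides.

```python
from math import sqrt
from statistics import NormalDist


def right_tail_z_test(x_bar, mu0, sigma, n, alpha):
    """Return (z, p_value, decision by p-value, decision by rejection region)."""
    std_normal = NormalDist()
    z = (x_bar - mu0) / (sigma / sqrt(n))
    p_value = 1 - std_normal.cdf(z)
    reject_by_p = p_value < alpha
    reject_by_region = z > std_normal.inv_cdf(1 - alpha)
    return z, p_value, reject_by_p, reject_by_region


# Example 11.1 values taken from the slides, tested at the 5% level.
z, p_value, by_p, by_region = right_tail_z_test(178, 170, 65, 400, alpha=0.05)
print(f"z = {z:.2f}, p-value = {p_value:.4f}, reject H0: {by_p}")
```

The two decisions agree because p-value < α holds exactly when z lies beyond the critical value.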
Conclusions of a Test of Hypothesis
• If we reject the null hypothesis, we conclude that there is enough evidence to infer that the alternative hypothesis is true.
• If we do not reject the null hypothesis, we conclude that there is not enough statistical evidence to infer that the alternative hypothesis is true.
• Remember, the truth of the alternative hypothesis is what we are investigating. The conclusion focuses on the validity of the alternative hypothesis.
A Two-Tail Test
• Example 11.2
– AT&T has been challenged by competitors who argued that their rates resulted in lower bills.
– A statistics practitioner determines that the mean and standard deviation of monthly long-distance bills for all AT&T residential customers are $17.09 and $3.87 respectively.
– A random sample of 100 customers is selected and customers’ bills recalculated using a leading competitor’s rates. The sample mean of customers’ bills is $17.55. Assuming the standard deviation is the same (3.87), can we infer at the 5% significance level that there is a difference between AT&T’s bills and the competitor’s bills (on the average)?
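The worked solution to Example 11.2 does not appear in this transcript, so the following Python sketch is only one way to set it up. It assumes the hypotheses H0: μ = 17.09 and H1: μ ≠ 17.09 and a standard two-tail z-test; the function name `two_tail_z_test` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_tail_z_test(x_bar, mu0, sigma, n, alpha):
    """Two-tail z-test for a mean with known sigma: returns (z, p_value, reject)."""
    std_normal = NormalDist()
    z = (x_bar - mu0) / (sigma / sqrt(n))
    # Both tails count as "at least as extreme", so double the one-tail area.
    p_value = 2 * (1 - std_normal.cdf(abs(z)))
    z_crit = std_normal.inv_cdf(1 - alpha / 2)  # about 1.96 for alpha = 0.05
    return z, p_value, abs(z) > z_crit


# Example 11.2 values taken from the slide.
z, p_value, reject_h0 = two_tail_z_test(17.55, 17.09, 3.87, 100, alpha=0.05)
print(f"z = {z:.2f}, p-value = {p_value:.3f}, reject H0: {reject_h0}")
```

Under these assumptions the test statistic falls inside the two critical values, so the null hypothesis is not rejected at the 5% level.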
Two tail tests and C.I.
• Note that both tests and C.I. are computed based on the sampling distribution of the mean. To illustrate, in Example 11.2 the 95% C.I. for the population mean is [16.79, 18.31], which includes 17.09.
• Thus we cannot conclude that there is sufficient evidence to infer that the population mean differs from 17.09.
• Use of C.I. has the advantage of simplicity but has two important drawbacks:
– Lack of correspondence to one-tail tests
– No p-value type information
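The interval quoted above can be checked with a short Python sketch, assuming it is the usual z-interval x̄ ± z·σ/√n built from the Example 11.2 figures (x̄ = 17.55, σ = 3.87, n = 100). The function name `z_confidence_interval` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def z_confidence_interval(x_bar, sigma, n, confidence=0.95):
    """Confidence interval for a mean when sigma is known."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # about 1.96 for 95%
    half_width = z * sigma / sqrt(n)
    return x_bar - half_width, x_bar + half_width


# Example 11.2 values taken from the slides.
lower, upper = z_confidence_interval(17.55, 3.87, 100)
contains_null_value = lower <= 17.09 <= upper

print(f"95% C.I. = [{lower:.2f}, {upper:.2f}], contains 17.09: {contains_null_value}")
```

When the hypothesized mean lies inside the 95% interval, the matching two-tail test at the 5% level does not reject the null hypothesis.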
Problem 11.54
Many Alpine ski centers base their projections of revenues and profits on the assumption that the average Alpine skier skis 4 times per year. To investigate the validity of this assumption, a random sample of 63 skiers is drawn and each is asked to report the number of times they skied the previous year. Assume that the population standard deviation is 2, and the sample mean is 4.84. Can we infer at the 10% level that the assumption is wrong?
• Suggested Problems: 11.6, 11.42, 11.44
• Next Time: Finish Chapter 11 (Section 11.4), Begin Chapter 12