introduction to hypothesis testing - computer scienceketelsen/files/courses/csci... ·...

31
Introduction to Hypothesis Testing

Upload: others

Post on 16-Apr-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Introduction to Hypothesis Testing

Page 2: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Introduction to Hypothesis Testing

Page 3: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Administriviao Homework 5 posted. Due Friday Nov 10

o Good Milestones: o Problems 1-4 this week o Problems 5 and 6 next week

Page 4: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Previously on CSCI 3022

We’ve looked at ways to compute confidence intervals for several different statistics: E.g. a confidence interval for the mean with known sd. is given by 100(1� ↵)% µ �

X̄ � z↵/2

�pn, X̄ + z↵/2

�pn

Proposition: If X is a normally distributed random variable with mean and standard deviation , then Z is a standard normal distribution if

µ�

Z =X � µ

�or X = �Z + µ

Fact: If Z is a standard normal random variable, then we can compute probabilities using the standard normal CDF

P (Z z) =

Z z

�1f(x) dx = �(z)

Page 5: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

A Thought ExperimentExample: After the introduction of the Euro, Polish mathematicians claimed that the new Belgian 1 Euro coin is not a fair coin. Suppose I hand you a Belgian 1 Euro coin. How could you decide whether or not it is fair?

Page 6: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

A Thought ExperimentExample: After the introduction of the Euro, Polish mathematicians claimed that the new Belgian 1 Euro coin is not a fair coin. Suppose I hand you a Belgian 1 Euro coin. How could you decide whether or not it is fair?

Page 7: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Statistical HypothesesDef: A statistical hypothesis is a claim about the value of a parameter of a population characteristic.

Examples:

o Suppose the recovery time of a person suffering from disease D is normally distributed with mean and standard deviation . Hypothesis: days.

o Suppose is the recovery time of a person suffering from disease D and given treatment for D. Hypothesis:

o Suppose is the mean internet speed for Comcast and is the mean internet speed for Century Link. Hypothesis:

µ1

µ2

�1 µ1 > 10

µ2 < µ1

µ1 µ2µ1 6= µ2

= =

Page 8: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Null vs. Alternative HypothesisIn any hypothesis-testing problem, there are always two competing hypotheses under consideration:

1.

2.

The objective of hypothesis testing is to decide, based on sampled data, if the alternative hypothesis is actually supported by the data.

-

NULL HYPOTHESIS ,Ho

Alt Hypothesis ,H

,( or Ha )

Page 9: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

The Classic Jury AnalogyConsider a jury in a criminal trial

When a defendant is accused of a crime, the jury (is supposed to) presume that he or she is not guilty (not guilty: that’s the Null Hypothesis)

The jury is then presented with evidence. If the evidence seems implausible under the assumption of non-guilt, we might reject non-guilt and claim that the defendant is (likely) guilty.

Page 10: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Null vs Alternative HypothesisIs there strong evidence for the alternative?

The burden of proof is placed on those that believe the alternative claim.

The initially favored claim will not be rejected in favor of the alternative claim unless the sample evidence provides a lot of support for the alternative.

Two possible conclusions:

1.

2.

(H0) (H1)=

=

Reject Ho ( in Favor It , )FAIL to Reject THE NUH Hypothesis Ito

Page 11: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Null vs Alternative HypothesisWhy assume the Null Hypothesis? o Sometimes we don’t want to accept a particular assertion unless (or until) data can be

shown to strongly support it

o Reluctance (cost, time) to change

Example: Suppose a company is considering hiring a new outside advertising company to help generate traffic to their website. Under their current advertising they get, on average, 200 thousand hits per day. With denoting the true average number of hits they’d get per day under the new company’s advertising, they would not want to switch companies (because it would be costly) unless evidence strongly suggested that exceeds 200 thousand.

µ

µ

Page 12: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Null vs Alternative HypothesisExample: Suppose a company is considering hiring a new outside advertising company to help generate traffic to their website. Under their current advertising they get, on average, 200 thousand hits per day. With denoting the true average number of hits they’d get per day under the new company’s advertising, they would not want to switch companies (because it would be costly) unless evidence strongly suggested that exceeds 200 thousand.

µ

µ

An appropriate problem formulation would involve testing:

The conclusion that change is justified is identified with the alternative hypothesis and it would take conclusive evidence to justify rejecting and switching to the new companyH0

H .: µ= 200 Ho : µ=2O0

H , :p> 200 H , :µ > 205

Page 13: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Null vs Alternative HypothesisThe alternative to the Null hypothesis will look like one of the following assertions

H0 : ✓ = ✓0

o The equals sign is always in the Null hypothesis

o The alternative hypothesis is the one for which we are seeking statistical evidence

factual #

1.

Q > Qo

2 ,

-04 -00

3 . O # Qo

Page 14: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Test Statistics and EvidenceDef: A test statistic is a quantity derived from the sample data and calculated assuming that the Null hypothesis is true. It is used in the decision about whether or not to reject the Null hypothesis.

Intuition: o We can think of the test statistics as our evidence about the competing hypotheses.

o We consider the test statistic under the assumption that the Null hypothesis is true by asking questions like How likely would we obtain this evidence if the Null were true?

Example: To determine if the Belgian 1 Euro coin is fair you flip it 100 times and record the number of Heads. What is the test statistics? What are the Null and alternative hypotheses?

-LT

Ho :p= . 5

It , :p # . 5

Page 15: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Test Statistics and EvidenceExample: To determine if the Belgian 1 Euro coin is fair you flip it n times and record the number of Heads. What is the test statistic? What are the Null and alternative hypotheses?

Ho :p= . 5 Test STATISTIC

H , :p # .5 §= In Xnthncnitz)

It n is large THEN Dist OF p"

is

Approximately normal :c- variance

pnincp ,P±I ' )⇒ Noll .

. p= or ⇒ pa - Nc .5,•5Y÷5)

Page 16: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Test Statistics and EvidenceExample: To determine if the Belgian 1 Euro coin is fair you flip it n times and record the number of Heads. What is the test statistic? What are the Null and alternative hypotheses?

Question: What would it take to convince you that the coin is not fair?

= , F.tl• 5 • 6

XYh.ph#Enm.5iFy

Page 17: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Test Statistics and EvidenceExample: To determine if the Belgian 1 Euro coin is fair you flip it n times and record the number of Heads. What is the test statistic? What are the Null and alternative hypotheses?

Question: What would it take to convince you that the coin is not fair?

H , :p # . 5

*

HI"i ,

Page 18: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Rejection Regions and Significance LevelExample: To determine if the Belgian 1 Euro coin is fair you flip it n times and record the number of Heads. What is the test statistic? What are the Null and alternative hypotheses? Def: The rejection region is a range of values of the test statistic that would lead you to reject the Null hypothesis.

Def: The significance level indicates the largest probability of the test statistic occurring under the Null hypothesis that would lead you to reject the Null hypothesis

a. 05

frejectpnegnonIs = •

025 fTo f F = . 025

< . *\|Ky

Page 19: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Detecting Biased CoinsExample: To test if the Belgian 1 Euro coin is fair you flip it 100 times and observe 38 Heads. Do you reject the Null at the .05 significance level or not? .

Ho :p' . 5

pm - NC .5,

.si#I)=o/p=30goH , :p # °

5z .ozs= norm .ppf( . 975 ) = 0,38-

1 = 1.96-Z={3g8.TI#s, '

|Zh - Zxizorz > Zan

Too -2.4<-1.96 )= - 2.4 L

•025 p•orI Conclusion7 Evidence supportsin 'tka ,

all -1.96 1.96 Rejecting Null It . .

Page 20: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Detecting Biased CoinsExample: To test if the Belgian 1 Euro coin is fair you flip it 100 times and observe 38 Heads. Do you reject the Null at the .01 significance level or not?

Z. 005

= 2.58

in

fe .005

) ← I = .005

I v. .i|1 ty

- 2.58 2.58

Page 21: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Different Tests for Different HypothesesThe coin example was an exampled of a two-tailed hypothesis test, because we would have rejected the Null hypothesis had the coin been been biased towards heads OR tails

Alternative Hypothesis Rejection Region for Level Test

H1 : ✓ > ✓0

H1 : ✓ < ✓0

H1 : ✓ 6= ✓0

z � z↵

z �z↵

(z �z↵/2) or (z � z↵/2)

- 7

d& I

f dh /P%% * H Inza - Zx

Page 22: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Switching Advertising StrategiesExample: Suppose a company is considering hiring a new outside advertising company to help generate traffic to their website. Under their current advertising they get, on average, 200 thousand hits per day with a standard deviation of 50 thousand hits per day. You decide to hire the new ad company for a 30 day trial. During those 30 days, your website gets 210 thousand hits per day. Perform a hypothesis test to determine if the new ad campaign outperforms the old one at the .05 significance level.

he LT

a

Ho : µ=2oo XT ~ N ( 200 , 50310 )H ,

: µ > 200 @ I -200

FAIREY 1.1×1.64 ¥430 ⇒

Ho .

1.1 •05 210 . 200t.lt#5oTEo=

"

7.05=1.64

Page 23: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Switching Advertising StrategiesExample: Suppose a company is considering hiring a new outside advertising company to help generate traffic to their website. Under their current advertising they get, on average, 200 thousand hits per day with a standard deviation of 50 thousand hits per day. You decide to hire the new ad company for a 30 day trial. During those 30 days, your website gets 210 thousand hits per day. Perform a hypothesis test to determine if the new ad campaign outperforms the old one at the .05 significance level.

Page 24: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Important AssumptionsQuestion: What assumptions did we make in the previous examples?

Page 25: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Errors in Hypothesis TestingDefinitions: o A Type I Error occurs when the Null hypothesis is rejected, but the Null hypothesis is

in fact true (False Positive)o A Type II Error occurs when the Null hypothesis is not rejected, but the Null

hypothesis is in fact false (False Negative)

Question: What is the probability that we commit a Type I Error?

LT =

Page 26: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

Errors in Hypothesis TestingDefinitions: o A Type I Error occurs when the Null hypothesis is rejected, but the Null hypothesis is

in fact true (False Positive)o A Type II Error occurs when the Null hypothesis is not rejected, but the Null

hypothesis is in fact false (False Negative)

Question: What is the probability that we commit a Type I Error?Answer: This is exactly the significance level Consequence: We set by considering how willing we are to risk a Type I Error

Page 27: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia

OK! Let’s Go to Work! Get in groups, get out laptop, and open the Lecture 17 In-Class Notebook

Let’s: o Work through some more hypothesis test examples

Page 28: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia
Page 29: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia
Page 30: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia
Page 31: Introduction to Hypothesis Testing - Computer Scienceketelsen/files/courses/csci... · 2017-11-06 · Introduction to Hypothesis Testing. Introduction to Hypothesis Testing. Administrivia