mat 1000 mathematics in today's world. last time 1.two types of observational study 2.three...

30
MAT 1000 Mathematics in Today's World

Upload: ralf-miller

Post on 03-Jan-2016

219 views

Category:

Documents


0 download

TRANSCRIPT

MAT 1000

Mathematics in Today's World

Last Time

1. Two types of observational study

2. Three methods for choosing a sample

Last Time

Population: the collection of all individuals being studied

Census: an observational study that observes the entire population

Sample survey: an observational study that only observes some of the population

Sample: the group of individuals chosen in a sample survey

Last Time

Methods of choosing a sampleVoluntary response sample: the individuals in the sample volunteer to participate

Convenience sample: the individuals in the sample are chosen because they are the easiest to reach

Simple random sample: every group in the population has the same chance of being the sample

Today

1. What does a sample tell us about the

population?

2. Practical problems in sample surveys.

What do samples tell us?

Example

Suppose we are studying the ages of Wayne State students.

Structure of the data:

Individuals: WSU students

Variable: Age

Population: all WSU students

What do samples tell us?

Example

We could try and determine the age of every individual in the population, but this is time consuming and expensive.

Better to use a sample.

What do samples tell us?

Example

We hope the mix of ages of students in our sample gives a good reflection of the mix of ages of all WSU students.

With a good sampling method (like SRS), we can be reasonably sure this is the case.

What happens after we collect data on our sample?

What do samples tell us?

Example

That data will be a big list of numbers—one age for each student in the sample.

A big list of numbers is not very informative. Better to try and summarize.

One way to summarize a list of numbers is to take the average (mean).

If our sample is good, the average age of the students in our sample ought to be close to the average age of all WSU students.

What do samples tell us?

The goal in a sample survey is to use information about a sample to “infer” information about the entire population.

The word “infer” means “derive” or “estimate.”

Here “information” refers to some kind of numbers (“average age” in the example).

So we can restate the goal of a sample survey slightly…

What do samples tell us?

The goal in a sample survey is to use numbers that describe a sample to estimate the value of a number that describes the entire population.

Numbers that describe a sample are called statistics.

Numbers that describes a population are called parameters.

In the example, the average age of all WSU students is the parameter, and the statistic is the average age of the students in our sample.

What do samples tell us?

One common type of sample survey is the opinion poll.

Example

On November 4th there will be an election for the Governor of Michigan. A recent poll of 750 Michigan voters found that 45% would vote for Rick Snyder (R) and 42% for Mark Schauer (D).

What do samples tell us?

Population: all Michigan voters

Variable: which candidate a person will vote for

Unlike “age,” the variable here is not numeric. So we can’t summarize the survey results with an average. Instead we use a proportion (percent).

Statistic: proportion of the sample who support Rick Snyder

Parameter: proportion of the population who support Rick Snyder (unknown)

What do samples tell us?

Goal: make a good estimate of the parameter

The sampling method we use matters.

Convenience sample: ask WSU professors who they plan to vote for

Voluntary response sample: Rush Limbaugh asks listeners to call in

Will these samples give a good representation of the population of all Michigan voters?

What do samples tell us?

We say that convenience sampling and voluntary response sampling are biased sampling methods.

The samples we get using these methods are probably not representative of the population.

Bias: in repeated samples, the sample statistic consistently misses the population parameter in the same direction

To avoid bias, use a simple random sample.

What do samples tell us?

If we use a simple random sample of 750 Michigan voters, every group of 750 people has the same chance of being chosen.

Could we just by chance get a simple random sample of 750 WSU professors?

Sure.

We can never know for sure that our sample represents the whole population. If we use an SRS, the best we can say is that we are confident that our sample represents the whole population.

What do samples tell us?

A good way to understand this: think of taking a SRS many many times.

Each time we get a different sample.

In each sample the proportion of people who plan on voting for Rick Snyder will be slightly different.

So we get different statistics.

But, with an SRS all of these different statistics should be close to each other, and to the parameter.

What do samples tell us?

For any sampling method we use, there will be some variability in the statistics we get.

Variability: different samples from the same population may yield different values of the sample statistic

Variability and bias are two different ways a sample can fail to represent the population.

Bias and Variability

Consider shooting arrows (statistics) at a target (the parameter):Bias means the archer systematically misses in the same direction. Variability means that the arrows are scattered.

What do samples tell us?

To reduce bias, use random sampling.

To reduce variability, use larger samples.

What do samples tell us?

In a sample survey the discrepancy between the statistic and the parameter is called “error” (doesn’t mean mistake).

What are some sources of error?

• Bias

• Variability

These are called “sampling errors.”

What do samples tell us?

There are other types of errors in a sample survey called “nonsampling errors.”

Processing errors: data entry, calculations

Response error: individuals answer incorrectly (lie or misremember)

Even the wording of questions can be a source of error!

Concerns when Asking Survey Questions

• Deliberate bias

• Unintentional bias

• Desire to please

• Asking the uninformed

• Unnecessary complexity

• Ordering of questions

• Confidentiality and anonymity

Deliberate Bias

• “If you found a wallet with $20 in it, would you return the money?”

• “If you found a wallet with $20 in it, would you do the right thing and return the money?”

Unintentional Bias

• “I have taught several students over the past few years.”o How many students do you think I have taught?o How many years am I referring to?

• “Over the past few days, how many servings of fruit have you eaten?”o How many days are you considering?o What constitutes a serving?

Desire to Please

• “Is your instructor doing a good job presenting the course material in a clear and interesting way?”

Yes No

• A 1978 poll done in Cincinnati asked people whether they “favored or opposed repealing the 1975 Public Affairs Act.”o There was no such act!o About one third of those asked expressed an

opinion about it.

Washington Post National Weekly Edition (April 10-16, 1995, p. 36)

Asking the Uninformed:Case Study

Unnecessary Complexity

• “Do you sometimes find that you have arguments with your family members and co-workers?”o Arguments with family memberso Arguments with co-workerso “sometimes find” (vague or unclear)

Ordering of Questions

• “How often do you normally go out on a date? about ___ times a month.”

• “How happy are you with life in general.”o Strong association between these questions.o If the ordering is reversed, then there would be no

strong association between these questions

Confidentiality and Anonymity

• Confidential answero respondent is known, but the information is a secreto facilitates follow-up studies

• Anonymous answero the respondent is not known, or cannot be linked to

his/her responseo usually yields more truthful answers