psychological statistics psychology is a quantitative endeavor. operationalism demands that we...

Post on 22-Dec-2015

213 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Psychological Statistics

Psychology is a quantitative endeavor.

Operationalism demands that we specify the means of measurement of concepts we study

Psychological statistics come in two varieties:

Descriptive StatisticsInferential Statistics

Raw Data

For Example - A psychologist conducts an experiment on learning with a sample of 40 rats.

Each rat is run through a complex maze with many turns and the number of incorrect turns (Errors) is recorded. Each rat is run repeatedly through the maze until “mastery” is attained (defined operationally as three consecutive trials without an error).

Raw Data

For Example - A psychologist conducts an experiment on learning with a sample of 40 rats.

Each rat is run through a complex maze with many turns and the number of incorrect turns (Errors) is recorded. Each rat is run repeatedly through the maze until “mastery” is attained (defined operationally as three consecutive trials without an error).

The measure of principal interest is the “Number of Trials to Mastery”

Descriptive Statistics

Statistics that describe or summarize collections of data

Description can be tabular (Table), pictorial (Graph), or might involve the computation of an index (e.g., the mean)

Raw Data Frequency Distribution

15 14 22 14

20 8 13 15

12 20 12 13

9 19 16 15

16 18 17 11

18 11 14 13

7 10 15 17

11 13 14 21

12 16 9 10

14 12 21 14

Score Frequency

7 1

8 1

9 2

10 2

11 3

12 4

13 4

14 6

15 4

16 3

17 2

18 2

19 1

20 2

21 2

22 1

Measures of Central TendencyMeasures of central tendency provide us with an index of where the center of the distribution is located

Mean (X) – The arithmetic average

ΣX

NX =

Computing the Mean

15 14 22 14

20 8 13 15

12 20 12 13

9 19 16 15

16 18 17 11

18 11 14 13

7 10 15 17

11 13 14 21

12 16 9 10

14 12 21 14

ΣX = 571

N = 40

X = 14.275

Measures of Central Tendency

Median – A point in the distribution at which exactly half of the scores are higher and exactly half are lower

If the distribution has an odd number of scores, arrange them in ascending order and simply select the middle score

If the distribution has an even number of scores, arrange them in ascending order and take the average of the middle two

Mode – The most commonly incurred score

Unimodal

Bimodal

Measures of Variability

These measures indicate how dispersed or spread out the scores are from the center of the distribution

Measures of Variability

These measures indicate how dispersed or spread out the scores are from the center of the distribution

Range = Highest Score – Lowest Score

Measures of Variability

The Standard Deviation is a more sophisticated measure of variability than the range. Range can be

significantly distorted by outliers.

σ =

Normal Distributions

These measures indicate how dispersed or spread out the scores are from the center of the distribution

Virtually all characteristics studied by psychologists (as well as other scientists) are likely to be normally distributed.

Skewed Distributions

Positive Skew

Skewed Distributions

Negative Skew

Correlation

Correlation refers to a relationship that exists between pairs of measures. Knowledge of the strength and direction of the relationship allows us to predict one variable from the other with an accuracy greater than chance.

For example, you can guess someone’s weight more accurately if you know how tall they are because height and weight are positively correlated.

When two variables are positively correlated that means they tend to both move higher or both move lower at the same time.

Generally, taller people weigh more than shorted people.

When two variable are negatively correlated that means that they change in value inversely. Higher scores on one generally go with lower scores on the other.

For example, the outdoor temperature and the weight of one’s clothing are negatively correlated. The higher the temperature is, the less clothing we wear. The lower the outdoor temperature, the more clothing we wear.

Correlation

Correlations can be represented graphically by the construction of a special type of graph called a scatter-plot diagram. Pairs of

scores are obtained from each subject.

For example, suppose we wanted to look at the relationship between height and self esteem in men. Perhaps we have a

hypothesis that how tall you are effects your level of self esteem. So we collect pairs of scores from twenty male

individuals. Height, measured in inches, and Self Esteem based on a self-rating scale (where higher scores mean

higher self esteem).

Man HeightSelf

Esteem

1 68 4.1

2 71 4.6

3 62 3.8

4 75 4.4

5 58 3.2

6 60 3.1

7 67 3.8

8 68 4.1

9 71 4.3

10 69 3.7

11 68 3.5

12 67 3.2

13 63 3.7

14 62 3.3

15 60 3.4

16 63 4.0

17 65 4.1

18 67 3.8

19 63 3.4

20 61 3.6

Scatter-Plot Diagrams

Coefficient of Correlation

A statistical computation that indicates the strength and direction of an underlying correlation

Always results in a signed number in the range from -1.00 to +1.00 If the sign is positive, that indicates the underlying

relationship is a positive correlation. If the sign is negative, it indicates an underlying negative correlation. The closer the value is to “1” (either positive or negative) the stronger is the

indicated underlying relationship.

Inferential Statistics

Inferential statistics Interpret or in other words allow us to draw conclusions about the meaning of our data.

We want to be able to draw conclusions that apply universally – that is to the population at large.

Since we can’t work with all members of a population, we draw a small subset from it called a “sample”.

Extrapolating from our findings using subjects from the sample, we then draw conclusions about the overall population.

Our success depends upon the representativeness of our sample. We rely on Randomization as a selection procedure,

large Sample Sizes, and Replication (Multiple Samples) to insure representativeness.

Hypothesis Testing

The ultimate test of any hypothesis would be to subject it to an experimental test.

The experiment is a research method in which the investigator manipulates a variable (IV) under

carefully controlled conditions and observes whether any changes occur in behavior (DV) as a result.

Example of an Experiment

Population – Insomniacs (people who report difficulty in falling asleep)

Research Hypothesis – A new drug (hypnoxin) may reduce the delay of sleep onset in this population.

Null Hypothesis Testing – The Null Hypothesis is the very conservative assumption that our drug will not produce any effective relief. We start with assumption and stick

with it unless the data overwhelming convince us otherwise.

Example of an Experiment

Subjects are randomly assigned to two groups. The experimental group includes subjects that are to receive

our new drug. A control group would also be tested under identical circumstances except they don’t receive

the drug.

To protect against a “placebo effect”, all subject receive an identically appearing pill with the instructions “Here

take this it may help you sleep”. For subjects in the Experimental Group the pill contains Hypnoxin, For

subjects in the Control Group the pill contains an inert ingredient. Subjects don’t know which condition they

have been assigned to.

The number of minutes to sleep onset after taking the pill and retiring is recorded as the DV.

Subject Hynoxin Control

1 43 42

2 12 88

3 22 19

4 45 41

5 55 74

6 14 66

7 23 24

8 44 17

9 15 29

10 35 38

11 14 51

12 51 57

Mean 31.08 45.50

Delay of Sleep Onset with and without Hypnoxin

Is the difference between the two conditions sufficiently large for us to reject the Null Hypothesis? Or could the difference be due to chance alone? If the probability that this is due to chance alone is

less than .05 we consider the outcome to be statistically significant.

Science Does Not Discover Truth

Science is probabilistic.

DNA Testing from a crime scene – Conclusion is probabilistic “There is a 1 in 1 billion chance that the DNA recovered might be from someone other than

the suspect”

top related