descriptive stats saturday
TRANSCRIPT
-
8/14/2019 Descriptive Stats Saturday
1/66
Educational Research
Descriptive Statistics
-
8/14/2019 Descriptive Stats Saturday
2/66
Preparing data for analysis Types of descriptive statistics
Central tendency Variation Relative position Relationships
Calculating descriptive statistics
-
8/14/2019 Descriptive Stats Saturday
3/66
Preparing Data for Analysis
Issues Scoring procedures Tabulation and coding Use of computers
-
8/14/2019 Descriptive Stats Saturday
4/66
Scoring Procedures
Instructions Standardized tests detail scoring instructions Teacher made tests require the delineation of scoring
criteria and specific procedures
Types of items Selected response items - easily and objectively
scored
Open-ended items difficult to score objectively witha single number as the result
-
8/14/2019 Descriptive Stats Saturday
5/66
Tabulation and Coding
Tabulation is organizing data Identifying all relevant information to the
analysis Separating groups and individuals within
groups Listing data in columns
-
8/14/2019 Descriptive Stats Saturday
6/66
Tabulation and Coding
Coding Assigning identification numbers to subjects Assigning codes to the values of non-
numerical or categorical variables Gender: 1=Female and 2=Male Subjects: 1=English, 2=Math, 3=Science, etc.
-
8/14/2019 Descriptive Stats Saturday
7/66
Computerized Analysis
Need to learn how to calculate descriptivestatistics by hand Creates a conceptual base for understanding the
nature of each statistic Exemplifies the relationships among statistical
elements of various procedures
Use of computerized software
SPSS Windows Other software packages
-
8/14/2019 Descriptive Stats Saturday
8/66
Descriptive Statistics
Purpose to describe or summarize datain a parsimonious manner
Four types Central tendency Variability Relative position
Relationships
-
8/14/2019 Descriptive Stats Saturday
9/66
Descriptive Statistics
Graphing data afrequency polygon Vertical axis
represents thefrequency with which ascore occurs
Horizontal axisrepresents the scores
themselves
-
8/14/2019 Descriptive Stats Saturday
10/66
Central Tendency
Purpose to represent the typical scoreattained by subjects
Three common measures Mode Median Mean
-
8/14/2019 Descriptive Stats Saturday
11/66
Central Tendency
Mode The most frequently occurring score Appropriate for nominal data
Median The score above and below which 50% of all scoreslie (i.e., the mid-point)
Characteristics Appropriate for ordinal scales
Doesnt take into account the value of each and every scorein the data
-
8/14/2019 Descriptive Stats Saturday
12/66
Central Tendency
Mean The arithmetic average of all scores Characteristics
Advantageous statistical properties Affected by outlying scores Most frequently used measure of central tendency
Formula
-
8/14/2019 Descriptive Stats Saturday
13/66
Variability
Purpose to measure the extent to whichscores are spread apart
Four measures Range Quartile deviation Variance
Standard deviation
-
8/14/2019 Descriptive Stats Saturday
14/66
Variability
Range The difference between the highest and
lowest score in a data set
Characteristics Unstable measure of variability Rough, quick estimate
-
8/14/2019 Descriptive Stats Saturday
15/66
Variability
Quartile deviation One-half the difference between the upperand lower quartiles in a distribution
Characteristic - appropriate when the
median is being used
-
8/14/2019 Descriptive Stats Saturday
16/66
Variability
Variance
The average squared deviation of allscores around the mean Characteristics
Many important statistical properties
Difficult to interpret due to squared metric Formula
-
8/14/2019 Descriptive Stats Saturday
17/66
Variability
Standard deviation The square root of the variance Characteristics
Many important statistical properties Relationship to properties of the normal curve Easily interpreted
Formula
-
8/14/2019 Descriptive Stats Saturday
18/66
The Normal Curve
A bell shaped curve reflecting the
distribution of many variables of interestto educators
See Figure 14.2
See the attached slide
-
8/14/2019 Descriptive Stats Saturday
19/66
The Normal Curve
Characteristics Fifty-percent of the scores fall above the mean and
fifty-percent fall below the mean The mean, median, and mode are the same values Most participants score near the mean; the further a
score is from the mean the fewer the number ofparticipants who attained that score
Specific numbers or percentages of scores fall
between 1 SD, 2 SD, etc.
-
8/14/2019 Descriptive Stats Saturday
20/66
The Normal Curve
Properties Proportions under the curve
1 SD 68%
1.96 SD 95%2.58 SD 99%
Cumulative proportions and percentiles
-
8/14/2019 Descriptive Stats Saturday
21/66
Skewed Distributions
Positive many low scores and few high scores Negative few low scores and many high
scores
Relationships between the mean, median, andmode Positively skewed mode is lowest, median is in the
middle, and mean is highest Negatively skewed mean is lowest, median is in the
middle, and mode is highest
-
8/14/2019 Descriptive Stats Saturday
22/66
Measures of Relative Position
Purpose indicates where a score is inrelation to all other scores in thedistribution
Characteristics Clear estimates of relative positions Possible to compare students performances
across two or more different tests provided
the scores are based on the same group
-
8/14/2019 Descriptive Stats Saturday
23/66
Measures of Relative Position
Types Percentile ranks the percentage of scores
that fall at or above a given score
Standard scores a derived score based onhow far a raw score is from a reference pointin terms of standard deviation units Z-score T-score Stanine
-
8/14/2019 Descriptive Stats Saturday
24/66
Measures of Relative Position
Z-score The deviation of a score from the mean in standard
deviation units The basic standard score from which all other
standard scores are calculated Characteristics
Mean = 0 Standard deviation = 1 Positive if the score is above the mean and negative if it is
below the mean Relationship with the area under the normal curve
-
8/14/2019 Descriptive Stats Saturday
25/66
Measures of Relative Position
Z-score (continued)
Possible to calculate relative standings likethe percent better than a score, the percentfalling between two scores, the percentfalling between the mean and a score, etc.
Formula
-
8/14/2019 Descriptive Stats Saturday
26/66
Measures of Relative Position
T-score a transformation of a z-score
where t = 10(Z) + 50 Characteristics
Mean = 50 Standard deviation = 10 No negative scores
-
8/14/2019 Descriptive Stats Saturday
27/66
Measures of Relative Position
Stanine a transformation of a z-scorewhere the stanine = 2(Z) + 5 rounded tothe nearest whole number Characteristics
Nine groups with 1 the lowest and 9 the highest Categorical interpretation Frequently used in norming tables
-
8/14/2019 Descriptive Stats Saturday
28/66
Measures of Relationship
Purpose to provide an indication of therelationship between two variables
Characteristics of correlation coefficients Strength or magnitude 0 to 1 Direction positive (+) or negative (-)
Types of correlations coefficients
dependent on the scales of measurement ofthe variables Spearman Rho ranked data Pearson r interval or ratio data
-
8/14/2019 Descriptive Stats Saturday
29/66
Measures of Relationship
Interpretation correlation does notmean causation
Formula for Pearson r
-
8/14/2019 Descriptive Stats Saturday
30/66
Calculating Descriptive Statistics
Symbols used in statistical analysis General rules form calculating by hand
Make the columns required by the formula Label the sum of each column Write the formula Write the arithmetic equivalent of the problem Solve the arithmetic problem
-
8/14/2019 Descriptive Stats Saturday
31/66
Calculating Descriptive Statistics
Using SPSS Windows Means, standard deviations, and standard
scores
The DESCRIPTIVESprocedures Interpreting output
Correlations The CORRELATIONprocedure Interpreting output
-
8/14/2019 Descriptive Stats Saturday
32/66
-
8/14/2019 Descriptive Stats Saturday
33/66
Formula for the Mean
-
8/14/2019 Descriptive Stats Saturday
34/66
Formula for Variance
-
8/14/2019 Descriptive Stats Saturday
35/66
Formula for Standard Deviation
-
8/14/2019 Descriptive Stats Saturday
36/66
-
8/14/2019 Descriptive Stats Saturday
37/66
Educational Research
Inferential Statistics
-
8/14/2019 Descriptive Stats Saturday
38/66
Concepts underlying inferential statistics Types of inferential statistics
Parametric T-tests ANOVA
One-way Factorial Post-hoc comparisons
Multiple regression ANCOVA
Non-parametric Chi-Square
-
8/14/2019 Descriptive Stats Saturday
39/66
Important Perspectives
Inferential statistics Allow researchers to generalize to a population of
individuals based on information obtained from asample of those individuals
Assesses whether the results obtained from asample are the same as those that would have
been calculated for the entire population Probabilistic nature of inferential analyses
-
8/14/2019 Descriptive Stats Saturday
40/66
Underlying Concepts
Sampling distributions Standard error Null and alternative hypotheses
Tests of significance Type I and Type II errors One-tailed and two-tailed tests
Degrees of freedom Tests of significance
-
8/14/2019 Descriptive Stats Saturday
41/66
Sampling Distributions
A distribution of sample statistics A distribution of mean scores A distribution of the differences between two mean
scores A distribution of the ratio of two variances
Known statistical properties of samplingdistributions The mean of the sampling distribution of means isan excellent estimate of the population mean The standard error of the mean is an excellent
estimate of the standard deviation of the
-
8/14/2019 Descriptive Stats Saturday
42/66
Standard Error
Sampling error the expected random orchance variation of means in samplingdistributions
The calculation of standard errors toestimate sampling error Standard error of the mean
Standard error of the differences between twomeans
-
8/14/2019 Descriptive Stats Saturday
43/66
Null and Alternative Hypotheses
The null hypothesis represents astatistical tool important to inferential
tests of significance The alternative hypothesis usually
represents the research hypothesis
related to the study
-
8/14/2019 Descriptive Stats Saturday
44/66
Null and Alternative Hypotheses
Comparisons between groups Null: no difference between the means scores of the
groups Alternative: differences between the mean scores of
the groups Relationships between variables
Null: no relationship exists between the variablesbeing studied
Alternative: a relationship exists between thevariables being studied
-
8/14/2019 Descriptive Stats Saturday
45/66
Null and Alternative Hypotheses
Acceptance of the nullhypothesis The difference between
groups is too small toattribute it to anything butchance
The relationship betweenvariables is too small toattribute it to anything butchance
Rejection of the nullhypothesis The difference between
groups is so large it can beattributed to something
other than chance (e.g.,experimental treatment) The relationship between
variables is so large it canbe attributed to somethingother than chance (e.g., a
real relationship)
-
8/14/2019 Descriptive Stats Saturday
46/66
Tests of Significance
Statistical analyses to help decide whether toaccept or reject the null hypothesis
Alpha level An established probability level which serves as the
criterion to determine whether to accept or reject thenull hypothesis
Common levels in education .01 .05 .10
-
8/14/2019 Descriptive Stats Saturday
47/66
Tests of Significance
Specific tests are used in specificsituations based on the number ofsamples and the statistics of interest One sample tests of the mean, variance,proportions, correlations, etc. Two sample tests of means, variances,
proportions, correlations, etc.
-
8/14/2019 Descriptive Stats Saturday
48/66
Type I and Type II Errors
Correct decisions The null hypothesis is true and it is accepted
The null hypothesis is false and it is rejected Incorrect decisions
Type I error - the null hypothesis is true and it isrejected
Type II error the null hypothesis is false and it isaccepted
-
8/14/2019 Descriptive Stats Saturday
49/66
Type I and Type II Errors
Reciprocal relationship between Type I andType II errors
Control of Type I errors using alpha level As alpha becomes smaller (.10, .05, .01, .001,
etc.) there is less chance of a Type I error
Value and contextual based nature ofconcerns related to Type I and Type II errors
-
8/14/2019 Descriptive Stats Saturday
50/66
One-Tailed and Two-Tailed Tests
One-tailed an anticipated outcome in a specificdirection Treatment group is significantly higher than the
control group
Treatment group is significantly lower than the controlgroup Two-tailed anticipated outcome not directional
Treatment and control groups are equal
Ample justification needed for using one-tailedtests
-
8/14/2019 Descriptive Stats Saturday
51/66
Degrees of Freedom
Statistical artifacts that affect thecomputational formulas used in tests ofsignificance
Used when entering statistical tables toestablish the critical values of the test
statistics
-
8/14/2019 Descriptive Stats Saturday
52/66
Tests of Significance
Parametric and non-parametric Four assumptions of parametric tests
Normal distribution of the dependent variable Interval or ratio data Independence of subjects Homogeneity of variance
Advantages of parametric tests More statistically powerful More versatile
-
8/14/2019 Descriptive Stats Saturday
53/66
Types of Inferential Statistics
Two issues discussed Steps involved in testing for significance Types of tests
-
8/14/2019 Descriptive Stats Saturday
54/66
Steps in Statistical Testing
State the null and alternative hypotheses Set alpha level Identify the appropriate test of significance Identify the sampling distribution Identify the test statistic Compute the test statistic
-
8/14/2019 Descriptive Stats Saturday
55/66
Steps in Statistical Testing
Identify the criteria for significance If computing by hand, identify the critical value of the
test statistic If using SPSS Windows, identify the probability level
of the observed test statistic Compare the computed test statistic to the
criteria for significance If computing by hand, compare the observed test
statistic to the critical value If using SPSS Windows, compare the probability level
of the observed test statistic to the alpha level
-
8/14/2019 Descriptive Stats Saturday
56/66
Steps in Statistical Testing
Accept or reject the null hypothesis Accept
The observed test statistic is smaller than the criticalvalue
The observed probability level of the observed statistic issmaller than alpha
Reject The observed test statistic is larger than the critical value The observed probability level of the observed statistic is
smaller than alpha
-
8/14/2019 Descriptive Stats Saturday
57/66
Specific Statistical Tests
T-test for independent samples Comparison of two means from independent samples
Samples in which the subjects in one group are not related tothe subjects in the other group
Example - examining the difference between themean pretest scores for an experimental and controlgroup
Computation of the test statistic
SPSS Windows syntax
-
8/14/2019 Descriptive Stats Saturday
58/66
Specific Statistical Tests
T-test for dependent samples Comparison of two means from dependent
samples
One group is selected and mean scores are comparedfor two variables Two groups are compared but the subjects in each group
are matched
Example examining the difference betweenpretest and posttest mean scores for a singleclass of students
Computation of the test statistic SPSS Windows syntax
-
8/14/2019 Descriptive Stats Saturday
59/66
Specific Statistical Tests
Simple analysis of variance (ANOVA) Comparison of two or more means Example examining the difference between
posttest scores for two treatment groups anda control group Computation of the test statistic SPSS Windows syntax
-
8/14/2019 Descriptive Stats Saturday
60/66
Specific Statistical Tests
Multiple comparisons Omnibus ANOVA results
Significant difference indicates whether a difference existsacross all pairs of scores
Need to know which specific pairs are different
Types of tests A-priori contrasts Post-hoc comparisons
Scheffe Tukey HSD Duncans Multiple Range
Conservative or liberal control of alpha
-
8/14/2019 Descriptive Stats Saturday
61/66
Specific Statistical Tests
Multiple comparisons (continued) Example examining the difference
between mean scores for Groups 1 & 2,Groups 1 & 3, and Groups 2 & 3
Computation of the test statistic
SPSS Windows syntax
-
8/14/2019 Descriptive Stats Saturday
62/66
Specific Statistical Tests
Two factor ANOVA Comparison of means when two independent
variables are being examined
Effects Two main effects one for each independent
variable One interaction effect for the simultaneous
interaction of the two independent variables
-
8/14/2019 Descriptive Stats Saturday
63/66
Specific Statistical Tests
Two factor ANOVA (continued)
Example examining the mean scoredifferences for male and female students inan experimental or control group
Computation of the test statistic
SPSS Windows syntax
-
8/14/2019 Descriptive Stats Saturday
64/66
Specific Statistical Tests
Analysis of covariance (ANCOVA) Comparison of two or more means with statistical
control of an extraneous variable Use of a covariate Advantages
Statistically controlling for initial group differences (i.e.,equating the groups)
Increased statistical power Pretest is typically the covariate
Computation of the test statistic SPSS Windows syntax
-
8/14/2019 Descriptive Stats Saturday
65/66
Specific Statistical Tests
Multiple regression Correlational technique which uses
multiple predictor variables to predict asingle criterion variable Characteristics
Increased predictability with additional variables
Regression coefficients Regression equations
-
8/14/2019 Descriptive Stats Saturday
66/66
Specific Statistical Tests
Multiple regression (continued)
Example predicting college freshmensGPA on the basis of their ACT scores, highschool GPA, and high school rank in class
Computation of the test statistic SPSS Windows syntax