descriptive stats saturday

Upload: thomas-adam-johnson

Post on 30-May-2018

215 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/14/2019 Descriptive Stats Saturday

    1/66

    Educational Research

    Descriptive Statistics

  • 8/14/2019 Descriptive Stats Saturday

    2/66

    Preparing data for analysis Types of descriptive statistics

    Central tendency Variation Relative position Relationships

    Calculating descriptive statistics

  • 8/14/2019 Descriptive Stats Saturday

    3/66

    Preparing Data for Analysis

    Issues Scoring procedures Tabulation and coding Use of computers

  • 8/14/2019 Descriptive Stats Saturday

    4/66

    Scoring Procedures

    Instructions Standardized tests detail scoring instructions Teacher made tests require the delineation of scoring

    criteria and specific procedures

    Types of items Selected response items - easily and objectively

    scored

    Open-ended items difficult to score objectively witha single number as the result

  • 8/14/2019 Descriptive Stats Saturday

    5/66

    Tabulation and Coding

    Tabulation is organizing data Identifying all relevant information to the

    analysis Separating groups and individuals within

    groups Listing data in columns

  • 8/14/2019 Descriptive Stats Saturday

    6/66

    Tabulation and Coding

    Coding Assigning identification numbers to subjects Assigning codes to the values of non-

    numerical or categorical variables Gender: 1=Female and 2=Male Subjects: 1=English, 2=Math, 3=Science, etc.

  • 8/14/2019 Descriptive Stats Saturday

    7/66

    Computerized Analysis

    Need to learn how to calculate descriptivestatistics by hand Creates a conceptual base for understanding the

    nature of each statistic Exemplifies the relationships among statistical

    elements of various procedures

    Use of computerized software

    SPSS Windows Other software packages

  • 8/14/2019 Descriptive Stats Saturday

    8/66

    Descriptive Statistics

    Purpose to describe or summarize datain a parsimonious manner

    Four types Central tendency Variability Relative position

    Relationships

  • 8/14/2019 Descriptive Stats Saturday

    9/66

    Descriptive Statistics

    Graphing data afrequency polygon Vertical axis

    represents thefrequency with which ascore occurs

    Horizontal axisrepresents the scores

    themselves

  • 8/14/2019 Descriptive Stats Saturday

    10/66

    Central Tendency

    Purpose to represent the typical scoreattained by subjects

    Three common measures Mode Median Mean

  • 8/14/2019 Descriptive Stats Saturday

    11/66

    Central Tendency

    Mode The most frequently occurring score Appropriate for nominal data

    Median The score above and below which 50% of all scoreslie (i.e., the mid-point)

    Characteristics Appropriate for ordinal scales

    Doesnt take into account the value of each and every scorein the data

  • 8/14/2019 Descriptive Stats Saturday

    12/66

    Central Tendency

    Mean The arithmetic average of all scores Characteristics

    Advantageous statistical properties Affected by outlying scores Most frequently used measure of central tendency

    Formula

  • 8/14/2019 Descriptive Stats Saturday

    13/66

    Variability

    Purpose to measure the extent to whichscores are spread apart

    Four measures Range Quartile deviation Variance

    Standard deviation

  • 8/14/2019 Descriptive Stats Saturday

    14/66

    Variability

    Range The difference between the highest and

    lowest score in a data set

    Characteristics Unstable measure of variability Rough, quick estimate

  • 8/14/2019 Descriptive Stats Saturday

    15/66

    Variability

    Quartile deviation One-half the difference between the upperand lower quartiles in a distribution

    Characteristic - appropriate when the

    median is being used

  • 8/14/2019 Descriptive Stats Saturday

    16/66

    Variability

    Variance

    The average squared deviation of allscores around the mean Characteristics

    Many important statistical properties

    Difficult to interpret due to squared metric Formula

  • 8/14/2019 Descriptive Stats Saturday

    17/66

    Variability

    Standard deviation The square root of the variance Characteristics

    Many important statistical properties Relationship to properties of the normal curve Easily interpreted

    Formula

  • 8/14/2019 Descriptive Stats Saturday

    18/66

    The Normal Curve

    A bell shaped curve reflecting the

    distribution of many variables of interestto educators

    See Figure 14.2

    See the attached slide

  • 8/14/2019 Descriptive Stats Saturday

    19/66

    The Normal Curve

    Characteristics Fifty-percent of the scores fall above the mean and

    fifty-percent fall below the mean The mean, median, and mode are the same values Most participants score near the mean; the further a

    score is from the mean the fewer the number ofparticipants who attained that score

    Specific numbers or percentages of scores fall

    between 1 SD, 2 SD, etc.

  • 8/14/2019 Descriptive Stats Saturday

    20/66

    The Normal Curve

    Properties Proportions under the curve

    1 SD 68%

    1.96 SD 95%2.58 SD 99%

    Cumulative proportions and percentiles

  • 8/14/2019 Descriptive Stats Saturday

    21/66

    Skewed Distributions

    Positive many low scores and few high scores Negative few low scores and many high

    scores

    Relationships between the mean, median, andmode Positively skewed mode is lowest, median is in the

    middle, and mean is highest Negatively skewed mean is lowest, median is in the

    middle, and mode is highest

  • 8/14/2019 Descriptive Stats Saturday

    22/66

    Measures of Relative Position

    Purpose indicates where a score is inrelation to all other scores in thedistribution

    Characteristics Clear estimates of relative positions Possible to compare students performances

    across two or more different tests provided

    the scores are based on the same group

  • 8/14/2019 Descriptive Stats Saturday

    23/66

    Measures of Relative Position

    Types Percentile ranks the percentage of scores

    that fall at or above a given score

    Standard scores a derived score based onhow far a raw score is from a reference pointin terms of standard deviation units Z-score T-score Stanine

  • 8/14/2019 Descriptive Stats Saturday

    24/66

    Measures of Relative Position

    Z-score The deviation of a score from the mean in standard

    deviation units The basic standard score from which all other

    standard scores are calculated Characteristics

    Mean = 0 Standard deviation = 1 Positive if the score is above the mean and negative if it is

    below the mean Relationship with the area under the normal curve

  • 8/14/2019 Descriptive Stats Saturday

    25/66

    Measures of Relative Position

    Z-score (continued)

    Possible to calculate relative standings likethe percent better than a score, the percentfalling between two scores, the percentfalling between the mean and a score, etc.

    Formula

  • 8/14/2019 Descriptive Stats Saturday

    26/66

    Measures of Relative Position

    T-score a transformation of a z-score

    where t = 10(Z) + 50 Characteristics

    Mean = 50 Standard deviation = 10 No negative scores

  • 8/14/2019 Descriptive Stats Saturday

    27/66

    Measures of Relative Position

    Stanine a transformation of a z-scorewhere the stanine = 2(Z) + 5 rounded tothe nearest whole number Characteristics

    Nine groups with 1 the lowest and 9 the highest Categorical interpretation Frequently used in norming tables

  • 8/14/2019 Descriptive Stats Saturday

    28/66

    Measures of Relationship

    Purpose to provide an indication of therelationship between two variables

    Characteristics of correlation coefficients Strength or magnitude 0 to 1 Direction positive (+) or negative (-)

    Types of correlations coefficients

    dependent on the scales of measurement ofthe variables Spearman Rho ranked data Pearson r interval or ratio data

  • 8/14/2019 Descriptive Stats Saturday

    29/66

    Measures of Relationship

    Interpretation correlation does notmean causation

    Formula for Pearson r

  • 8/14/2019 Descriptive Stats Saturday

    30/66

    Calculating Descriptive Statistics

    Symbols used in statistical analysis General rules form calculating by hand

    Make the columns required by the formula Label the sum of each column Write the formula Write the arithmetic equivalent of the problem Solve the arithmetic problem

  • 8/14/2019 Descriptive Stats Saturday

    31/66

    Calculating Descriptive Statistics

    Using SPSS Windows Means, standard deviations, and standard

    scores

    The DESCRIPTIVESprocedures Interpreting output

    Correlations The CORRELATIONprocedure Interpreting output

  • 8/14/2019 Descriptive Stats Saturday

    32/66

  • 8/14/2019 Descriptive Stats Saturday

    33/66

    Formula for the Mean

  • 8/14/2019 Descriptive Stats Saturday

    34/66

    Formula for Variance

  • 8/14/2019 Descriptive Stats Saturday

    35/66

    Formula for Standard Deviation

  • 8/14/2019 Descriptive Stats Saturday

    36/66

  • 8/14/2019 Descriptive Stats Saturday

    37/66

    Educational Research

    Inferential Statistics

  • 8/14/2019 Descriptive Stats Saturday

    38/66

    Concepts underlying inferential statistics Types of inferential statistics

    Parametric T-tests ANOVA

    One-way Factorial Post-hoc comparisons

    Multiple regression ANCOVA

    Non-parametric Chi-Square

  • 8/14/2019 Descriptive Stats Saturday

    39/66

    Important Perspectives

    Inferential statistics Allow researchers to generalize to a population of

    individuals based on information obtained from asample of those individuals

    Assesses whether the results obtained from asample are the same as those that would have

    been calculated for the entire population Probabilistic nature of inferential analyses

  • 8/14/2019 Descriptive Stats Saturday

    40/66

    Underlying Concepts

    Sampling distributions Standard error Null and alternative hypotheses

    Tests of significance Type I and Type II errors One-tailed and two-tailed tests

    Degrees of freedom Tests of significance

  • 8/14/2019 Descriptive Stats Saturday

    41/66

    Sampling Distributions

    A distribution of sample statistics A distribution of mean scores A distribution of the differences between two mean

    scores A distribution of the ratio of two variances

    Known statistical properties of samplingdistributions The mean of the sampling distribution of means isan excellent estimate of the population mean The standard error of the mean is an excellent

    estimate of the standard deviation of the

  • 8/14/2019 Descriptive Stats Saturday

    42/66

    Standard Error

    Sampling error the expected random orchance variation of means in samplingdistributions

    The calculation of standard errors toestimate sampling error Standard error of the mean

    Standard error of the differences between twomeans

  • 8/14/2019 Descriptive Stats Saturday

    43/66

    Null and Alternative Hypotheses

    The null hypothesis represents astatistical tool important to inferential

    tests of significance The alternative hypothesis usually

    represents the research hypothesis

    related to the study

  • 8/14/2019 Descriptive Stats Saturday

    44/66

    Null and Alternative Hypotheses

    Comparisons between groups Null: no difference between the means scores of the

    groups Alternative: differences between the mean scores of

    the groups Relationships between variables

    Null: no relationship exists between the variablesbeing studied

    Alternative: a relationship exists between thevariables being studied

  • 8/14/2019 Descriptive Stats Saturday

    45/66

    Null and Alternative Hypotheses

    Acceptance of the nullhypothesis The difference between

    groups is too small toattribute it to anything butchance

    The relationship betweenvariables is too small toattribute it to anything butchance

    Rejection of the nullhypothesis The difference between

    groups is so large it can beattributed to something

    other than chance (e.g.,experimental treatment) The relationship between

    variables is so large it canbe attributed to somethingother than chance (e.g., a

    real relationship)

  • 8/14/2019 Descriptive Stats Saturday

    46/66

    Tests of Significance

    Statistical analyses to help decide whether toaccept or reject the null hypothesis

    Alpha level An established probability level which serves as the

    criterion to determine whether to accept or reject thenull hypothesis

    Common levels in education .01 .05 .10

  • 8/14/2019 Descriptive Stats Saturday

    47/66

    Tests of Significance

    Specific tests are used in specificsituations based on the number ofsamples and the statistics of interest One sample tests of the mean, variance,proportions, correlations, etc. Two sample tests of means, variances,

    proportions, correlations, etc.

  • 8/14/2019 Descriptive Stats Saturday

    48/66

    Type I and Type II Errors

    Correct decisions The null hypothesis is true and it is accepted

    The null hypothesis is false and it is rejected Incorrect decisions

    Type I error - the null hypothesis is true and it isrejected

    Type II error the null hypothesis is false and it isaccepted

  • 8/14/2019 Descriptive Stats Saturday

    49/66

    Type I and Type II Errors

    Reciprocal relationship between Type I andType II errors

    Control of Type I errors using alpha level As alpha becomes smaller (.10, .05, .01, .001,

    etc.) there is less chance of a Type I error

    Value and contextual based nature ofconcerns related to Type I and Type II errors

  • 8/14/2019 Descriptive Stats Saturday

    50/66

    One-Tailed and Two-Tailed Tests

    One-tailed an anticipated outcome in a specificdirection Treatment group is significantly higher than the

    control group

    Treatment group is significantly lower than the controlgroup Two-tailed anticipated outcome not directional

    Treatment and control groups are equal

    Ample justification needed for using one-tailedtests

  • 8/14/2019 Descriptive Stats Saturday

    51/66

    Degrees of Freedom

    Statistical artifacts that affect thecomputational formulas used in tests ofsignificance

    Used when entering statistical tables toestablish the critical values of the test

    statistics

  • 8/14/2019 Descriptive Stats Saturday

    52/66

    Tests of Significance

    Parametric and non-parametric Four assumptions of parametric tests

    Normal distribution of the dependent variable Interval or ratio data Independence of subjects Homogeneity of variance

    Advantages of parametric tests More statistically powerful More versatile

  • 8/14/2019 Descriptive Stats Saturday

    53/66

    Types of Inferential Statistics

    Two issues discussed Steps involved in testing for significance Types of tests

  • 8/14/2019 Descriptive Stats Saturday

    54/66

    Steps in Statistical Testing

    State the null and alternative hypotheses Set alpha level Identify the appropriate test of significance Identify the sampling distribution Identify the test statistic Compute the test statistic

  • 8/14/2019 Descriptive Stats Saturday

    55/66

    Steps in Statistical Testing

    Identify the criteria for significance If computing by hand, identify the critical value of the

    test statistic If using SPSS Windows, identify the probability level

    of the observed test statistic Compare the computed test statistic to the

    criteria for significance If computing by hand, compare the observed test

    statistic to the critical value If using SPSS Windows, compare the probability level

    of the observed test statistic to the alpha level

  • 8/14/2019 Descriptive Stats Saturday

    56/66

    Steps in Statistical Testing

    Accept or reject the null hypothesis Accept

    The observed test statistic is smaller than the criticalvalue

    The observed probability level of the observed statistic issmaller than alpha

    Reject The observed test statistic is larger than the critical value The observed probability level of the observed statistic is

    smaller than alpha

  • 8/14/2019 Descriptive Stats Saturday

    57/66

    Specific Statistical Tests

    T-test for independent samples Comparison of two means from independent samples

    Samples in which the subjects in one group are not related tothe subjects in the other group

    Example - examining the difference between themean pretest scores for an experimental and controlgroup

    Computation of the test statistic

    SPSS Windows syntax

  • 8/14/2019 Descriptive Stats Saturday

    58/66

    Specific Statistical Tests

    T-test for dependent samples Comparison of two means from dependent

    samples

    One group is selected and mean scores are comparedfor two variables Two groups are compared but the subjects in each group

    are matched

    Example examining the difference betweenpretest and posttest mean scores for a singleclass of students

    Computation of the test statistic SPSS Windows syntax

  • 8/14/2019 Descriptive Stats Saturday

    59/66

    Specific Statistical Tests

    Simple analysis of variance (ANOVA) Comparison of two or more means Example examining the difference between

    posttest scores for two treatment groups anda control group Computation of the test statistic SPSS Windows syntax

  • 8/14/2019 Descriptive Stats Saturday

    60/66

    Specific Statistical Tests

    Multiple comparisons Omnibus ANOVA results

    Significant difference indicates whether a difference existsacross all pairs of scores

    Need to know which specific pairs are different

    Types of tests A-priori contrasts Post-hoc comparisons

    Scheffe Tukey HSD Duncans Multiple Range

    Conservative or liberal control of alpha

  • 8/14/2019 Descriptive Stats Saturday

    61/66

    Specific Statistical Tests

    Multiple comparisons (continued) Example examining the difference

    between mean scores for Groups 1 & 2,Groups 1 & 3, and Groups 2 & 3

    Computation of the test statistic

    SPSS Windows syntax

  • 8/14/2019 Descriptive Stats Saturday

    62/66

    Specific Statistical Tests

    Two factor ANOVA Comparison of means when two independent

    variables are being examined

    Effects Two main effects one for each independent

    variable One interaction effect for the simultaneous

    interaction of the two independent variables

  • 8/14/2019 Descriptive Stats Saturday

    63/66

    Specific Statistical Tests

    Two factor ANOVA (continued)

    Example examining the mean scoredifferences for male and female students inan experimental or control group

    Computation of the test statistic

    SPSS Windows syntax

  • 8/14/2019 Descriptive Stats Saturday

    64/66

    Specific Statistical Tests

    Analysis of covariance (ANCOVA) Comparison of two or more means with statistical

    control of an extraneous variable Use of a covariate Advantages

    Statistically controlling for initial group differences (i.e.,equating the groups)

    Increased statistical power Pretest is typically the covariate

    Computation of the test statistic SPSS Windows syntax

  • 8/14/2019 Descriptive Stats Saturday

    65/66

    Specific Statistical Tests

    Multiple regression Correlational technique which uses

    multiple predictor variables to predict asingle criterion variable Characteristics

    Increased predictability with additional variables

    Regression coefficients Regression equations

  • 8/14/2019 Descriptive Stats Saturday

    66/66

    Specific Statistical Tests

    Multiple regression (continued)

    Example predicting college freshmensGPA on the basis of their ACT scores, highschool GPA, and high school rank in class

    Computation of the test statistic SPSS Windows syntax