cfa1 3.3 significance testing

Upload: cnvb-alsk

Post on 25-Feb-2018

217 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/25/2019 CFA1 3.3 Significance Testing

    1/21

    1

    3.3 Significance Testing

    The topics in this section concern with the second course objective.

    The topics in this section heavily depend on those in the last section. I

    strongly recommend reading the last section at least one more time before

    starting on this section.

    The very hypotheses that were tested by hypothesis testing can be tested by

    significance testing. Significance testing uses the same test statistics (to test

    the same corresponding hypotheses) as those used in hypothesis testing.

    Ironically, as you will see, significance testing has no significance level.

    Here is the difference between significance testing and hypothesis testing.

    The conclusion of a significance test is always reject Ho with the p-value.However, the part reject Ho is always implicitly understood. As a result,

    only the p-value is given as the conclusion of significance testing. See the

    diagram of significance testing given below.

    At this point, your question should be What the heck is p-value?

    In significance testing, after the computation of the observed value from

    data, a probability is found using this computed observed value. This

    probability is called the p-value. The p stands for probability. It has

    Test StatisticData P-Value

    Significance Testing

  • 7/25/2019 CFA1 3.3 Significance Testing

    2/21

    2

    nothing to do with bodily functions. In fact, the following is the definition

    of the p-value.

    A p-value is the maximum possible probability for data falsely rejecting a

    null hypothesis.

    It would be truly helpful if you understand that p-value is probability-

    value. Read Appendix: P-Values given below in this section.

    Now, at this point, you should be asking how to find a p-value.

    You compute the observed value (from data using an appropriate test

    statistic). Generally, this observed value splits the entire probability of one

    into two probabilities, one probability on the left and another probability onthe right of the observed value. See the diagram given below.

    We are doing significance testing and not hypothesis testing. However, if

    we were doing hypothesis testing, it would be either a left-tailed test, right-

    tailed test or two-tailed test, depending on the hypotheses being tested. By

    the way, please read Appendix: Recommendationsgiven below.

    The observed value (ov)Note: An observed value is a number,

    a point on the horizontal axis.

    The probability onthe right of the

    observed value ov

    The probability on the leftof the observed value (ov)

    A probability is the area between the curve and horizontal axis when

    the probability distribution is continuous. All the test statistics inthis section have continuous distributions (such as the standard

    Normal, Students t, and Chi-square distributions).

  • 7/25/2019 CFA1 3.3 Significance Testing

    3/21

    3

    Let us called left-tailed case if it were a left-tailed test, right-tailed case

    if it were a right-tailed test, and two-tailed case if it were a two-tailed test

    in hypothesis testing. These cases will determine which one of the two split

    probabilities is the p-value.

    For a right-tailed case, the p-value is the probability on the right of the

    observed value.

    For a left-tailed case, it is the probability on the left of the observed value.

    For a two-tailed case, twice the smaller probability is the p-value.

    See the diagram given below.

    Generally in a two-tailed case, a probability is multiplied by twoto obtain

    the p-value. An observed value splits the entire probability (of one) into two

    probabilities, one is smaller than the other. We multiply the smaller

    probability with two to find the p-value. You would not have any other

    The observed value (ov)

    This probability is the p-value for aleft-tailed case since it is on the left

    of the observed value (ov).

    This probability times two

    is the p-value for a two-tailed case since this is

    smaller than the other

    probability.

    The probability is

    the p-value for aright-tailed case

    since it is on theright of the

    observed value.

  • 7/25/2019 CFA1 3.3 Significance Testing

    4/21

    4

    choice if you are multiplying one of them by two to obtain the p-value

    because a p-value is a probability. Read Appendix: P-valuesagain.

    Of course, if the two probabilities are equal to each other, you can multiply

    either probability by two to obtain the p-value in a two-tailed case.

    Why?????!!!!!! Well, you could try this with each of the two equal

    probabilities and compare their p-values.

    As you may have noticed, there is no significance level given in significance

    testing. However, the p-value is also called observed significance level,

    and the term significance in significance testing has come from the term

    significance in observed significance level. Do not confuse with an

    observed significance level (p-value) and a significance level ().

    When you hear significance level, you should think of hypothesistesting.

    When you hear significance testing, you should think of observed

    significance level (p-value).

    An observed significance levelis a probability determined by the observed

    value and the alternative hypothesis.

    A p-value is an observed significance level.

    Here is actually how to find numbers (values) for p-values. You need to

    compute observed values from test statistics. As you remember from the last

    section, the test statistics are

    ( x - o)/(s/ n ) for testing hypotheses on ,

    (^

    p - po)/ o op (1 - p )/n for testing hypotheses on p, and

    (n 1)(s/o)2for testing hypotheses on .

    Also, recall, the probability distribution of the test statistics for testing and

    p is the standard Normal distribution (with large data). The probability

    distribution of the test statistic for testing is the Chi-square distribution

  • 7/25/2019 CFA1 3.3 Significance Testing

    5/21

    5

    with n 1 degree of freedom (with large data). Now, read Appendix: How

    to Find P-Valuesgiven below.

    Let us have a numerical example by conducting significance testing on

    Ho: 20.0

    vs

    Ha: > 20.0

    from the Orange Crate Example. The test statistic is ( x - o)/(s/ n ) whichis the Normal standardization of the sample average, x , the estimator for .The sampling distribution of the test statistic is, of course, the standard

    Normal distribution. The observed value is 1.2086 = (20.22

    20.00)(0.997/ 30 ) . This is a right-tailed case so the p-value is

    P(Z > 1.2086) = 0.1134

    where Z is the standard Normal variable. This probability is found using the

    website given in Appendix: How to Find P-Values, and it is the p-value

    for this significance test. This significance testing concludes by stating

    The p-value is 0.1134 for the significance testing on the hypotheses givenabove.

    By the way, if you were testing

    Ho: = 20.0

    vs

    Ha: 20.0,

    then the p-value would be 0.2268 (= 0.1134*2) since it is a two-tailed case.

    If this were a left-tailed case, then the p-value would be 0.8866. Can you

    tell why it is 0.8866? At any rate, this means that you cannot pick a smaller

    probability for the p-value for a one-tailed test all the time.

  • 7/25/2019 CFA1 3.3 Significance Testing

    6/21

    6

    Significance testing is a system whose input is data and its output is the p-

    value (see the first diagram given above in this section). That is,

    significance testing ends with the p-value. However, with the p-value,

    hypothesis testing can be done. That is, you can conduct hypothesis testing

    after significance testing.

    Namely, once a significance testing is conducted and concluded with its p-

    value, a hypothesis testing can be conducted with the p-value if so desired.

    If the p-value is less than or equal to the significance level (p-value ),

    then the null hypothesis Ho is rejected at .

    This is because, if the p-value is less than or equal to , then the observed

    value must be in the rejection area. See the diagram given below.

    If the p-value is greater than the significance level (p-value > ), then the

    null hypothesis Ho is not rejected .

    This is because, if the p-value is greater than , then the observed value

    must be outside the rejection area. See the diagram given below.

    The observed value (ov)

    The p-value

    A right-tailed case: Thep-value (the area in blue)is less than (the area in the red;

    that is, the area to the left of the p-value and p-value itself together)and, hence, the

    observed value is in the rejection area. As a result, the Ho is rejected at .

    The critical point

  • 7/25/2019 CFA1 3.3 Significance Testing

    7/21

    7

    Let us conduct a hypothesis testing at 5% (= ) significance level on

    Ho: 20.0

    vs

    Ha: > 20.0

    with the p-value of 0.1134 from the significance testing. Since the p-value

    0.1134 is greater than = 0.05, the null hypothesis cannot be rejected at 5%

    significance level. By the way, it cannot be rejected at 1% level either.

    If you are testing

    Ho: 0.75

    vs

    Ha: < 0.75

    in the Orange Crate Example, the observed value is (30 1)(0.997376/0.75)2

    = 51.285 and its p-value is close to 99.35%. It is a left-tailed case with

    The critical point

    A right-tailed case: The p-value (the area in the red; that is, the area to the left of the and itself together)is greater than the (the area in blue)and, hence, the

    observed value is outside the rejection area. As a result, the Ho is not rejected at .

    The observed value (ov)

    The p-value

  • 7/25/2019 CFA1 3.3 Significance Testing

    8/21

    8

    hypotheses on with 29 degree of freedom. See Appendix: How to Find

    P-Values given below to find the p-value.

    If you wish, hypothesis testing can be conducted at, say, 1% significance

    level. This p-value is great than 1% and, hence, the null hypothesis is not

    rejected at 1% significance level (or any significance level less than 0.9935).

    Note that a p-value can be large, like 99.35% in this example. That is, you

    cannot automatically pick a smaller probability of the two probabilities split

    by an observed value for a p-value for a one-tailed test.

    A reason for significance testing is that a statistician does not have to do

    hypothesis testing. Suppose a pharmaceutical company has spent 10 years

    and 300 million dollars to develop a new medicine. To market the medicine

    (so that the company can recover the investment and makes some profit),this medicine must be tested and proven to be safe and effective by

    conducting clinical trials and hypothesis testing. Everyone involved,

    including statisticians, comes under a heavy pressure to approve the

    medicine.

    A statistician might be asked (forced) to alter the data or to increase the

    significance level so that null hypothesis is rejected. With significance test,

    a statistician does not have to worry about this kind of pressure. He does not

    even know what the significance level is in the significance testing (because

    there is none). The only thing for him to do is to compute the observed

    value from data and its p-value from the observed value and to give the p-

    value to the management so that they can conduct hypothesis testing

    themselves.

    Another advantage for significance testing is for computer programming. It

    is simpler and easier to program significance testing than to program

    hypothesis testing. It does not concern the different levels of significance,

    and the output is a single number. There are many computer statistical

    packages which perform F-tests, t-tests and z-tests with input data andproduce p-values for the output, letting the users do hypothesis testing.

    Let us have another numerical example form the Political Poll Example.

    You want to test

    Ho: P({D}) = 0.40

  • 7/25/2019 CFA1 3.3 Significance Testing

    9/21

    9

    vs

    Ha: P({D}) 0.40.

    Then, this is a two-tailed case, and the test statistic is

    (^

    p - po)/ o op (1 - p )/n for testing the hypotheses on p = P({D}) where po=

    0.40,^

    p = P({D}) = 357/850 = 0.42 and n = 850. The observed value is

    (0.42 0.40)/ 0.40(1 0.40) / 850 = 1.1902. This results in the p-value of

    0.234. See Appendix: How to Find P-Values given below to find the p-

    value.

    If hypothesis testing is to be performed on these hypotheses, then Ho cannotbe rejected at the 1%, or even 5%, significance level.

    Finally, the p-value is the maximum probability for Ha to be true in light of

    the obtained data. For instance, a significance test on

    Ho: P(D) P(R)

    vs

    Ha: P(D) < P(R)

    is conducted in the Political Poll Example. This test produces the p-value of

    3.6%. This p-value of 3.6% means that the maximum chance for the Ho:

    P(D) P(R) to be true is 3.6% given the data. If the hypothesis testing is

    conducted, the null hypothesis is rejected and accepted is the alternative

    hypothesis of the P(R) is greater than P(D) at 5%. However, it fails to reject

    the null hypothesis at 1%.

    As you have realized, if you can do significance testing, you can dohypothesis testing with the p-value. Significance testing is performed by

    finding the right test statistic and its probability distribution. The observed

    value, computed from the test statistic and data, gives you the p-value (using

    tables or a website). If the p-value is less than a significance level, reject

    Ho. Otherwise, Ho cannot be rejected.

  • 7/25/2019 CFA1 3.3 Significance Testing

    10/21

    10

    You can do any significance and hypothesis testing on any parameter by

    finding the appropriate test statistic and its probability distribution for testing

    the hypotheses in question. With data, you can compute an observed value

    and find the p-value by using tables or a website. With the p-value you can

    perform hypothesis testing.

    Testing on hypotheses on parameters of two independent groups is given in

    Appendix(Optional): More Testing given below.

    Appendix: P-Values

    A p-value is the maximum possible probability for data falsely rejecting a

    null hypothesis. Thus, a p-value is a probability value. Understanding of

    this fact would prevent you from making a mistake like giving a numbergreater than one or a negative number for a p-value like many students do.

    Why is this so? Well, did you understand what probability is numerically?

    Also, you multiply a probability by two when you find a p-value in a two-

    sided case. You would not multiply a probability greater than 0.5 by two to

    find a p-value. Why? If this is not obvious to you, please think through. I

    hope you will find the reason why one should not multiply a number greater

    than 0.5 by two to find a p-value.

    By the way, this is where the difference between memorization and

    understanding comes in play (to help you or to kill you).

    Appendix: Recommendations

    At this point, if you are not sure what we are talking about in this section,

    please go back and read the last section, 3.2 Hypothesis Testing. You

    should have learned and understood test statistics, observed values, left-

    tailed tests, right-tailed tests, two-tailed tests and such in the last section.These topics (concepts) are used in the subject (significance testing) and its

    related topics of this section. Lack of understanding of the topics/concepts

    in the last section would impede your understanding of the subject and the

    related topics in this section. On the other hand, their understanding would

    enable you to understand significance testing and its related topics.

  • 7/25/2019 CFA1 3.3 Significance Testing

    11/21

    11

    Generally, this is true in mathematics, applied mathematics and sciences. A

    current topic (or a subject) is developed using (or based on) the previous

    topics (subjects). Thus, understanding of previous topics (subjects) is

    necessary for learning and understanding of a current topic or subject. Lack

    of understanding of past topics would severely impede understanding of the

    current topics.

    The key to all this is understanding of topics and subjects. Memorizing

    them would not be of much help. As Confucius said, He who memorizes is

    buying a car with no engine. He does not go too far.

    Appendix: How to Find P-Values

    For testing hypotheses on and p, the p-values are

    P(Z > ov) for a right-tailed case,

    P(Z < ov) for a left-tailed case, and

    2*Min{ P(Z > ov), P(Z < ov)} for a two-tailed case,

    where ov is an observed value and Min{ P(Z > ov), P(Z < ov)} is the smaller

    of P(Z > ov) or P(Z < ov). Now go to the website,

    http://surfstat.anu.edu.au/surfstat-home/tables/normal.php

    or

    http://sites.csn.edu/mgreenwich/stat/normal.htm

    For a right-tailed case, click on the circle under the second picture, put the

    ov under z value and click on the button . This should give you the p-

    value under probability.

    For a left-tailed case, click on the circle under the first picture and repeat theabove. You should get the p-value under probability.

    For a two-tailed case, click on the circle under the fourth picture and repeat

    the above. You should get the p-value under probability. Do not multiply

    this p-value by two. It is the p-value, given under probability for a two-

    tailed test. This is because, see the picture, the probability given under

    http://surfstat.anu.edu.au/surfstat-home/tables/normal.phphttp://surfstat.anu.edu.au/surfstat-home/tables/normal.phphttp://sites.csn.edu/mgreenwich/stat/normal.htmhttp://sites.csn.edu/mgreenwich/stat/normal.htmhttp://sites.csn.edu/mgreenwich/stat/normal.htmhttp://surfstat.anu.edu.au/surfstat-home/tables/normal.php
  • 7/25/2019 CFA1 3.3 Significance Testing

    12/21

    12

    probability is P(Z < ov) + P(Z > ov) = 2* Min{ P(Z > ov), P(Z < ov)} =

    the p-value.

    Let us have some exercises. Conduct significance testing (that is, find the p-

    value) on Ho: p 0.23 vs Ha: p > 0.23 based on data of size 800 whose

    estimate for p is 0.21.

    The answer: The p-value is 0.9106. Hint: The ov = -1.3442 and it is a

    right-tailed case.

    Conduct significance testing on Ho: = 41.85 vs Ha: 41.85 based on

    data of size 36 whose sample mean and sample standard deviation are 42.57

    and 2.5 respectively.

    The answer: The p-value is 0.084. Hint: The ov = 1.728 and it is a two-tailed case. Remember, you do not multiply the probability by two when

    you use the website.

    For testing hypotheses on , the p-values are

    P (n-12> ov) for a right-tailed case,

    P(n-12< ov) for a left-tailed case, and

    2*Min{ P(n-12> ov), P(n-1

    2< ov)} for a two-tailed case,

    where n-12 is the Chi-square variable with n 1 degree of freedom. Now

    go to the website,

    http://surfstat.anu.edu.au/surfstat-home/tables/chi.php

    or

    http://sites.csn.edu/mgreenwich/stat/chi.htm

    For a right-tailed case, click on the circle under the second picture, put thedegree of freedom under d.f. and put the ov under

    2value. Then, click

    on thebutton which should give you the p-value under probability.

    For a left-tailed case, click on the circle under the first picture, and the repeat

    the above. You should find the p-value under probability.

    http://surfstat.anu.edu.au/surfstat-home/tables/chi.phphttp://surfstat.anu.edu.au/surfstat-home/tables/chi.phphttp://sites.csn.edu/mgreenwich/stat/chi.htmhttp://sites.csn.edu/mgreenwich/stat/chi.htmhttp://sites.csn.edu/mgreenwich/stat/chi.htmhttp://surfstat.anu.edu.au/surfstat-home/tables/chi.php
  • 7/25/2019 CFA1 3.3 Significance Testing

    13/21

    13

    For a two-tailed case, you repeat the above for both right-tailed and left-

    tailed cases. Find the smaller probability under probability and multiply it

    by two, which is the p-value. This time, you have to multiply it by two.

    Let us have some exercises. Conduct significance testing on Ho: > 35.0

    vs Ha: 35.0 based on data of size 35 whose sample mean and sample

    standard deviation are -35.7 and 32.5 respectively.

    Answer: The p-value is 0.3035. Hint: The ov = 29.316 and the degree of

    freedom is 34. It is a left-handed case.

    Conduct significance testing on Ho: = 3.7 vs Ha: 3.7base on data of

    size 48 whose sample mean and sample standard deviation are 3.4 and 4.2.

    The answer: The p-value is 0.1768. Hint: The ov = 60.561 and the degreeof freedom is 47. The two probabilities are 0.9116 and 0.0884.

    Appendix (Optional): More Testing

    When there are two independent populations (groups) or two independent

    variables, their parameters are often compared.

    Let 1and 2be the means of Groups 1 and 2 respectively. The test statistic

    ( x

    1- x

    2 o)/ 2 21 1 2 2(s ) /n +(s ) /n

    is use to test hypotheses on the difference of the means of two independent

    groups such as those given below.

    Right-tailed case;

    Ho: 1- 2 o

    vs

    Ha: 1- 2 > o

    Left-tailed case:

  • 7/25/2019 CFA1 3.3 Significance Testing

    14/21

    14

    Ho: 1- 2 o

    vs

    Ha: 1- 2 < o

    Two-tailed case:

    Ho: 1- 2 = o

    vs

    Ha: 1- 2 o

    where x

    1and x

    2are respectively sample means of Data 1 of size n1taken

    from Group 1 and Data 2 of size n2 taken from Group 2 and s1 and s2 are

    sample standard deviations of Data 1 and Date 2.

    The hypothesized difference between two means is indicated by o(the null

    value). The probability distribution of the test statistic is the standard

    Normal distribution.

    Let us have a numerical example by modifying the Orange Crate Example. Ihave received another crate of oranges from the supplier who claims that the

    oranges in the second crate are 5 oz heavier, on average, and have 80% less

    variation in their weight measurements. So, we are testing,

    Ho: 1- 2 -5.00

    vs

    Ha: 1- 2 < -5.00

    Note that the alternative hypothesis is that the average (mean) weight of

    oranges in the second crate is greater than that in the first crate by 5 oz or

    more. Thus, Ha is 1- 2 < -5.00, and this is a left-tailed case.

  • 7/25/2019 CFA1 3.3 Significance Testing

    15/21

    15

    We randomly sampled 40 oranges from the second crate and took the weight

    measurements of these 40 oranges. Their sample mean is 29.87 oz, and their

    sample standard deviation is 0.679 oz. That is, x

    2= 25.87, s2= 0.679, n2=

    40 and o= - 5.00. Of course, x

    1= 20.22, s1= 0.997 and n1= 30 from the

    original data from the first crate.

    Then, (20.22 25.87 (-5.00))/ 2 2(0.997) / 30 (0.679) / 40+ = -6.054.

    You can go to the website,

    http://surfstat.anu.edu.au/surfstat-home/tables/normal.php

    or

    http://sites.csn.edu/mgreenwich/stat/normal.htm

    and find the p-value or you realize that it must be very close to zero (since

    ov is -6.054). We have the p-value = 0. The Ho is rejected at 5% or even at

    1% level. There is very strong evidence that, on average, the oranges in the

    second crate are heavier than those in the first crate by at least 5 oz.

    Let us, now, test the claim on the variation. The hypotheses are

    Ho: 2/10.80

    vs

    Ha: 2/1< 0.80.

    The test statistic to test hypotheses on standard deviations of two

    independent groups is

    ((n1 1)s12/12)/((n2 1)s22/ 22)

    which has an F distribution. Please read Appendix: F Distributions & P-

    valuesgiven below.

    http://surfstat.anu.edu.au/surfstat-home/tables/normal.phphttp://surfstat.anu.edu.au/surfstat-home/tables/normal.phphttp://sites.csn.edu/mgreenwich/stat/normal.htmhttp://sites.csn.edu/mgreenwich/stat/normal.htmhttp://sites.csn.edu/mgreenwich/stat/normal.htmhttp://surfstat.anu.edu.au/surfstat-home/tables/normal.php
  • 7/25/2019 CFA1 3.3 Significance Testing

    16/21

    16

    By the way, this test statistic consists of a test statistic for testing hypotheses

    on one sigma over another test statistic for testing hypotheses on another

    sigma.

    At any rate, we have 2= 0.80*1, s1 = 0.997, and s2 = 0.679. As a result,

    ((30 1)(0.997)2/ 1

    2)/((40 1)(0.679)

    2/( 0.80*1)

    2) = 1.02604,

    which is the observed value. By going to the website,

    http://stattrek.com/Tables/F.aspx,

    you find the p-value of 0.46. This is a right-tailed case and, to learn how to

    find a p-value, please read Appendix: F Distributions & P-values again.

    The test statistic

    ( 1^

    p - 2^

    p o)/ 1 1 1 2 2 2^ ^ ^ ^

    (1- )/n + (1- )/np p p p

    is use to test hypotheses on the difference between probabilities

    (proportions) of two independent groups such as those given below.

    Right-tailed case;

    Ho: p1- p2 o

    vs

    Ha: p1- p2 > o

    Left-tailed case:

    Ho: p1- p2 o

    vs

    Ha: p1- p2 < o

    http://stattrek.com/Tables/F.aspxhttp://stattrek.com/Tables/F.aspxhttp://stattrek.com/Tables/F.aspx
  • 7/25/2019 CFA1 3.3 Significance Testing

    17/21

    17

    Two-tailed case:

    Ho: p1- p2 = o

    vs

    Ha: p1- p2 o

    The test statistic has the standard Normal distribution for large n1 and n2.

    Let us have a numerical example. In Indiana, a political poll of 850 was

    taken, and 54% of the likely voters surveyed would vote for the Republican

    candidate (recall the Political Poll Example). In Nevada, a political poll of

    900 was conducted, and 51% of the likely voters surveyed would vote for

    the Republican candidate. I am wondering whether P({R}) = p1 in Indianais greater than P({R}) = p2by at least 2% or not. So, the hypotheses to be

    tested are

    Ho: p1- p2 0.02

    vs

    Ha: p1- p2 > 0.02

    We also have 1^

    p = 0.54, 2^

    p = 0.51, n1 = 850, n2 = 900.The observed value

    is (0.54 0.51 0.02)/ (0.54)(1 0.54)/850 (0.51)(1 0.51) /900 + =

    0.41889. It is a right-tailed test so, by going to the website

    http://surfstat.anu.edu.au/surfstat-home/tables/normal.php

    or

    http://sites.csn.edu/mgreenwich/stat/normal.htm

    you find the p-value of 0.3376. You cannot reject the null hypothesis at 5%

    or 1%, and there is no evidence that the Republican candidate in Indiana has

    at least a 2% point edge over the Republican candidate in Nevada in spite of

    the estimates.

    http://surfstat.anu.edu.au/surfstat-home/tables/normal.phphttp://surfstat.anu.edu.au/surfstat-home/tables/normal.phphttp://sites.csn.edu/mgreenwich/stat/normal.htmhttp://sites.csn.edu/mgreenwich/stat/normal.htmhttp://sites.csn.edu/mgreenwich/stat/normal.htmhttp://surfstat.anu.edu.au/surfstat-home/tables/normal.php
  • 7/25/2019 CFA1 3.3 Significance Testing

    18/21

    18

    You should not compare P({R}) and P({D}) in the same state since these

    estimates P({D}) = 0.42 and P({R}) = 0.54 were obtained from the same

    political poll (only one group); that is, we do not have two independent

    groups. Any comparison of two proportions, by the hypotheses given above,

    must be conducted over two independent groups or a huge, huge group if itis only one group.

    Finally, having two collections of data does not mean that they came from

    two independent groups. If they are not independent, then, of course, you

    cannot use significance (hypothesis) testing for two independent groups.

    For example, there are thirty test scores (pre-scores) of students before

    taking a course and thirty test scores (post-scores) of the same students after

    the course. There are two collections of data, but there is only one group

    (the thirty students). So, how do you test the effectiveness of the course?

    If you think that students should learn from the course to improve the test

    scores at least ten points, then the following hypotheses should be tested.

    Ho: d -10.0

    vs

    Ha: d< -10.0

    using the data which are the differences of the pre-scores and post-scores

    (the pre-score of one student the post score of the student, for the thirty

    students). That is, create a collection of data by subtracting a post-score

    from the pre-score of each student for all the thirty students and test the

    hypotheses with the created data of differences. See the table given below.

    By the way,d is the mean of the differences between the pre-scores and the

    post-scores. The null value is -10 because the difference is obtained as pre-score post-score for every student. You are speculating that, on average,

    the post-scores are higher than the pre-scores by at least 10 points.

    Also, because of that, the alternative hypothesis is that the mean is less than

    -10 since, if the speculation is true, the mean of the differences must be less

  • 7/25/2019 CFA1 3.3 Significance Testing

    19/21

    19

    than -10. If it were greater than -10, then the post-scores would not be

    higher than the pre-scores by at least 10 points on average as speculated.

    The hypotheses are tested by using the test statistic ( x - o)/(s/ n )with the

    standard Normal distribution just like a regular significance or hypothesistesting on . By the way, x and s are the sample average and standarddeviation of the 30 differences given in the last column of the table given

    below, o is -10 and n is 30 in the example.

    Example: Some pre-scores, post-score

    Students Pre-scores Post-scores Differences

    Student 1 65 78 -13

    Student 2 43 83 -40Student 3 54 81 -27

    .

    .

    .

    .

    Student 28 67 87 -20

    Student 29 36 32 4

    Student 30 42 65 -23

    Use these differences to test the hypotheses.

    Using differences between two collections of data is common in before-after

    situations with diet programmes, educational programmes medical

    treatments/procedures and such where each person (in a group) produces one

    measurement before it and another measurement after it. To correctly test

    hypotheses, you need to look at the number of groups of objects from which

    data are taken from, not the number of collections of data.

    Appendix: F Distributions & P-Values

  • 7/25/2019 CFA1 3.3 Significance Testing

    20/21

    20

    Hypotheses concerning variances of two independent groups can be tested

    by a test statistic ((n1 1)s12/1

    2)/((n2 1)s2

    2/ 2

    2). The hypotheses come in

    the following forms.

    A right-tailed case:

    Ho: 2/1ro

    vs

    Ha: 2/1< ro.

    A left-tailed case:

    Ho: 2/1ro

    vs

    Ha: 2/1> ro.

    A two-tailed case:

    Ho: 2/1= ro

    vs

    Ha: 2/1 ro.

    Why the first set of hypotheses is a right-tailed case? Look at the Ha. If it is

    true, then 2 should be small, which leads to a small s2. Now, look at the test

    statistic, a small value of s2would result in a large observed value. That is,

    the rejection area must be on the right hand side of the real number line.

    The test statistic has an Fd1,d2distribution. This F distribution with d1andd2degrees of freedom is constructed as a quotient of two independent Chi-

    square distributions, Chi-square distribution with d1degree of freedom in the

    numerator and Chi-square distribution with d2 degree of freedom in the

    denominator.

  • 7/25/2019 CFA1 3.3 Significance Testing

    21/21

    This is the reason why an F distribution has two degrees of freedom, d1and

    d2. The first one is always from the denominator and the second one from

    the denominator. For the test statistic, it is an Fn1-1, n2-1. Go to the website,

    http://stattrek.com/Tables/F.aspx,

    put the first degree of freedom at Degrees of freedom (v1),the second degree offreedom at Degrees of freedom (v2)and the observed value atf value and clickon the button Calculate. You find the p-value for a left-tailed case at

    Cumulative probability. That is, Cumulative probability = P(Fn1-1, n2-1< observed

    value) = p-value for a left-tailed case. Thus, the p-value for a right-tailed

    case is 1 Cumulative probability and the p-value for a two-tailed case is

    2*Min{Cumulative probability,1 - Cumulative probability}.

    For example, you are testing Ho: 2/10.80 vs Ha: 2/1< 0.80 with n11 = 30 1 = 29, n2 1 = 40 1 = 39, and the observed value of 1.02604.

    Then, put 29 at Degrees of freedom (v1),39 at Degrees of freedom (v2),1.02604 atfvalue and click on the button Calculate. You find 0.54 at Cumulative

    probability. However, this 0.54 is the p-value in a left-tailed case. For the

    right-tailed case, which is our case here, the p-value is 0.46 = 1 0.54. The

    Ho cannot be rejected at 5% or 1% significance level.

    Copyrighted by Michael Greenwich, 08/2013

    http://stattrek.com/Tables/F.aspxhttp://stattrek.com/Tables/F.aspxhttp://stattrek.com/Tables/F.aspx