statistical tests for categorical data

33
Statistical tests for categorical data Dr. S. A. Rizwan, M.D. Public Health Specialist SBCM, Joint Program – Riyadh Ministry of Health, Kingdom of Saudi Arabia

Upload: rizwan-s-a

Post on 21-Apr-2017

44 views

Category:

Health & Medicine


6 download

TRANSCRIPT

Page 1: Statistical tests for categorical data

Statistical tests for categorical dataDr. S. A. Rizwan, M.D.

PublicHealthSpecialistSBCM, JointProgram– Riyadh

MinistryofHealth,KingdomofSaudiArabia

Page 2: Statistical tests for categorical data

Learningobjectives

Demystifying statistics! SBCM, Joint Program – RiyadhSBCM, Joint Program – Riyadh

• Examinetherelationshipbetweencategoricalvariables

• Constructacontingencytablefortwocategoricalvariables

• Describetheapproachtostatisticaltestingofcategoricalvariables

Page 3: Statistical tests for categorical data

Revise:Categoricalvariables

Demystifying statistics! SBCM, Joint Program – RiyadhSBCM, Joint Program – Riyadh

• Categorical(qualitative)

• Nominal(noorder)• Dichotomous,binary,binomial• Polychotomous

• Ordinal(ordered)

• Answers“what?”• Qualitativedataiscategorised

Page 4: Statistical tests for categorical data

Revise:Categoricalvariables

Demystifying statistics! SBCM, Joint Program – RiyadhSBCM, Joint Program – Riyadh

Page 5: Statistical tests for categorical data

Revise:Prerequisitesforatest

Demystifying statistics! SBCM, Joint Program – RiyadhSBCM, Joint Program – Riyadh

• Howmanyvariablesarethere?

• Whatisthenatureofdependentandindependentvariable?

• Howmanycategoriesarethereinthecategoricalvariable?

• Doesthecontinuousvariablefollownormaldistribution?

• Isthereanypairinginthedata/variables?

Page 6: Statistical tests for categorical data

Revise:DV,IV,Paireddata

Demystifying statistics! SBCM, Joint Program – RiyadhSBCM, Joint Program – Riyadh

Page 7: Statistical tests for categorical data

Statisticaltests:Bivariate

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

Forunpaireddata Forpaireddata

• IfassumptionsforChisquarearemet• Chi-square(>=2levels)

• IfassumptionsforChisquareNOTmet• Fisher’sexact(>=2levels)

• Ifthegroupsarepaired• McNemar (if2levels)• RMlogisticregression (if>2levels)• Interrater reliabilityanalysis

Page 8: Statistical tests for categorical data

Statisticaltests:Multivariate

SBCM, Joint Program – RiyadhSBCM, Joint Program – Riyadh

Forunpaireddata Formatcheddata

Demystifying statistics!

• IfDVisbinaryand>1IV• Binarylogisticregression

• IfDVispolychotomousand>1IV• Multinomiallogisticregression

• IfDVisordinaland>1IV• Ordinalregression

• Ifthegroupsarematched• Conditionallogisticregression

• Ifrepeatedmeasurements• RMlogisticregression

Page 9: Statistical tests for categorical data

Statisticaltests:Special

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

Forstratifieddata

• Cochran-Mantel-Haenszel test

Page 10: Statistical tests for categorical data

Statisticaltests:Special

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

Fororderedcategoricalvariable

• ChisquaretestfortrendPassed Failed Total

R1 100 78 178

R2 175 173 348

R3 42 59 101

Total 317 310 627

Page 11: Statistical tests for categorical data

Measuresofassociation

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• Oddsratio• Relativerisk• Interrater reliabilityanalysis

Page 12: Statistical tests for categorical data

Contingencytable

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• Usedinbivariatesituations• Usecounts,notpercentages• Noone-sidedtests• Eachsubjectcountedonlyonce• Explainsignificantfindings

Page 13: Statistical tests for categorical data

Someselectedtopics

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• Coveredinotherclasses• Chisquaretest• Cochran-Mantel-Haenszel test• Regression

• Inthisclasswewillcoverbasicsof:• Fisher’sexacttest• McNemar test• Interrater reliabilityanalysis(Agreementstatistics)

Page 14: Statistical tests for categorical data

Thoughtexercise1

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• Inastudyaresearchertestedaperfumeon9ratsandusedwaterasthecontrolon9otherrats.Amongtheperfumegroup1ratshowedrestlessnesswhereasamongthecontrolgroup4ratsshowedrestlessness.Determineifthereisanassociationbetweenperfumeandrestlessness.

Page 15: Statistical tests for categorical data

Thoughtexercise2

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• 22pairsoftwinswereenrolledinthestudy.Oneofthetwinssmoked,theotherdidn’t.Thetwinswerefollowedtoseewhichtwindiedfirst.For17pairsoftwins,thesmokingtwindiedfirstandfor5pairsoftwins,thenon-smokingtwindiedfirst.

Page 16: Statistical tests for categorical data

Thoughtexercise3

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• All100pathologicalslideswereobservedby2pathologists.Theweresupposedtoclassifythediseaseasmild,moderateandsevere.Pathologist1classified60,30,10andpathologist2classified50,30,20asmild,moderateandsevere.Bothpathologistsagreedthat44weremild,20weremoderateand6weresevereanddisagreedontheremainingslides.Calculatetheagreementbetweenthetwopathologists.

Page 17: Statistical tests for categorical data

Fisher’sexacttest

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• Usedintheplaceofchisquaretestforindependencewhenthecellcountsaresparse

• Morethan20%ofthecellshaveexpected frequenciesof<5

Page 18: Statistical tests for categorical data

Fisher’sexacttest

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

Page 19: Statistical tests for categorical data

Fisher’sexacttest

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• 6possibletablesfortheobservedmarginaltotals:9,9,5,13.

• p-valueiscalculatedbysummingallprobabilitieslessthanorequaltotheprobabilityoftheobservedtable

Page 20: Statistical tests for categorical data

Fisher’sexacttest

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• Theobservedtable(TableII)hasprobability=0.132

• P-valuefortheFisher’sexacttest=Pr (TableII)+Pr (TableV)+Pr(TableI)+Pr (TableVI)

• =0.132+0.132+0.0147+0.0147=0.293

Page 21: Statistical tests for categorical data

McNemar test

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• Whendataarepairedandtheoutcomeofinterestisaproportion,theMcNemar Testisused

• Pair-Matcheddatacancomefrom• Case-controlstudieswhereeachcasehasamatchingcontrol

(matchedonage,gender,race,etc.)• Twinsstudies– thematchedpairsaretwins

• Before- Afterdata• Outcomeispresence(+)orabsence(-)ofsomecharacteristic

measuredonthesameindividualattwotimepoints

Page 22: Statistical tests for categorical data

McNemar test:matchedcase-control

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• a- numberofcase-controlpairswherebothareexposed• b- numberofcase-controlpairswherethecaseisexposedandthe

controlisunexposed• c- numberofcase-controlpairswherethecaseis• unexposedandthecontrolisexposed• d- numberofcase-controlpairswherebothareunexposed• Thecountsinthetableforacase-controlstudyarenumbersofpairs

notnumbersofindividuals.

Page 23: Statistical tests for categorical data

McNemar test:before-afterstudy

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• a- numberofsubjectswithcharacteristicpresentbothbeforeandaftertreatment

• b- numberofsubjectswherecharacteristicispresentbeforebutnotafter

• c- numberofsubjectswherecharacteristicispresentafterbutnotbefore

• d- numberofsubjectswiththecharacteristicabsentbothbeforeandaftertreatment.

Page 24: Statistical tests for categorical data

McNemar test

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• Calculatedusingthecountsinthe‘b’and‘c’cellsofthetable

• ThesamplingdistributionChi-squaredistribution,thedegreesoffreedom=1

• Foratestwithalpha=0.05,thecriticalvaluefortheMcNemar statistic=3.84.

Page 25: Statistical tests for categorical data

McNemar test

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

Page 26: Statistical tests for categorical data

McNemar test

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• CriticalvalueforChi-squaredistributionwith1df =3.84,pvalue=0.01

• Conclusion:Asignificantlydifferentproportionofsmokingtwinsdiedfirstcomparedtotheirnon-smokingtwinindicatingadifferentriskofdeathassociatedwithsmoking(p=0.01)

Page 27: Statistical tests for categorical data

Agreementstatistics

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• Manytypesofagreementstatistics dependingon• Datatype• Typeofrepetition• Internalconsistency

Page 28: Statistical tests for categorical data

Agreementstatistics

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

• Cohen’skappa

• Measurestheagreementbetweentworaters whoeachclassifyNitemsintoCmutuallyexclusivecategories

• Usedwhenresponsesarecategorical

Page 29: Statistical tests for categorical data

Agreementstatistics

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

Page 30: Statistical tests for categorical data

Agreementstatistics

SBCM, Joint Program – RiyadhSBCM, Joint Program – RiyadhDemystifying statistics!

𝐾𝑎𝑝𝑝𝑎 =0.70 − 0.411 − 0.41 = 0.491

Page 31: Statistical tests for categorical data

Advancedlearning

Demystifying statistics! SBCM, Joint Program – RiyadhSBCM, Joint Program – Riyadh

• Chisquaretestfortrend• Specialcasesoflogisticregression• Repeatedmeasureslogisticregression• Weightedkappa• Othermeasuresofagreementanalysis

Page 32: Statistical tests for categorical data

Takehomemessages

Demystifying statistics! SBCM, Joint Program – RiyadhSBCM, Joint Program – Riyadh

• Manyapproachesareavailableforanalysingcategoricaldata• Chooseamethodappropriateforyourproblem• Checkthattheassumptionsofthemethodarevalid• Makeconclusionsbasedontheresultsofthetest