extracting chf information from clinical text using...

18
Extracting CHF information from clinical text using CLAMP Hua Xu, PhD pSCANNER 2016 1

Upload: others

Post on 12-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

Extracting CHF information from clinical text using CLAMP

Hua Xu, PhD

pSCANNER 2016

1

Page 2: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

What is CLAMP - Clinical Language Annotation, Modeling, and Processing?

• AgeneralpurposeclinicalNLPsystem– “CLAMPCMD”– Builtonprovenmethods– Goodperformance,highspeed

• AnIDE(integrateddevelopmentenvironment)forbuildingcustomizedclinicalNLPpipelinesviaGUIs– “CLAMPGUI”– Annotating/analyzingclinicaltext– TrainingofML-basedmodules– Specifyingrules

• AnenterprisesolutionforNLPneedsinhealthcareorganizations–“CLAMPEnterprise”– Taskmanagement– Visualanalytics

2

Page 3: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

CLAMP CMD – built on proven methods

3

NLP Tasks Ranking

Namedentityrecognition

2009i2b2,medication #2

2010 i2b2problem,treatment,test #2

2013SHARe/CLEFabbreviation #1

UMLSencoding 2014SemEval, disorder #1

Relationextraction

2012 i2b2Temporal #1

2015 SemEval Disease-modifier #1

2015BioCREATIVE Chemical-induced disease #1

Page 4: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

CLAMP GUI - Efficiently build customized NLP pipelines for individual applications!

Page 5: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

Annotating/Re-training

Page 6: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

Specifying rules

Page 7: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

Extracting CHF information using CLAMP built-in components

7

Page 8: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

CHF information• CHFTerms/Concepts

– AnytermsmentionedregardingtoCongestiveHeartFailure– E.g.“congestiveheartfailure”,“systolicheartfailure”or“diastolicheart

failure”etc.• Labtests

– LaboratorytestaimingtoevaluateifpatienthasCongestiveHeartFailure– E.g.“BUN”,“RBC”etc.

• Medications– AnymedicationstotreatCongestiveHeartFailure– E.g.ACEinhibitors,Angiotensinreceptorblockersetc.

• Imagetests– imageteststoevaluateifpatienthasCongestiveHeartFailure– E.g.“EjectionFraction”,“chestx-ray”etc.

Page 9: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

CLAMP built-in components • Basicnameentityrecognition

– RegularexpressionbasedNER- Numbers– MachinelearningbasedNER- problems,treatmentsandtests

• Medicationandsignatureidentification– Wraptherule-basedMedEx-UIMAasaCLAMPcomponent

• Labtestandvalueidentification– Rulebasedmodule

• UMLSEncoding– AssignUMLScodetopreviouslyidentifiednameentities

• CustomizeittoCHFusingpost-processingrules– Fltering byCUIsandnames

Page 10: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

CLAMP screen shots

Page 11: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

Evaluation• MIMIC-IIIcorpus– 10,000+patientwithCHFdiagnosiscode– Randomlyselected30dischargesummariesfromthesepatients

– AnurseannotatedCHFrelatedinformation• Sixcategoriesofnameentities:CHFTerm,Labtestname,Labtestname+value,Imagename,Drugname,Drugname+signatures

• Matchingcriteria:exactvs.inexact• Metrics:precision,recall,andF1

Page 12: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

Preliminary ResultsCategory #ofGold #ofSys Recall(%) Precision(%) F-1measure(%)

CHFTerm 80 79 80.0/93.7 81.0/94.9 80.5/94.3

Labtestname 318 331 89.6/95.6 86.1/91.8 87.8/93.6

Labtestname+value

318 331 84.5/92.7 81.2/89.1 82.8/90.8

Image 136 139 85.2/99.2 83.4/97.1 84.2/98.1

Drugname 388 392 83.5/96.6 82.6/95.6 83.0/96.0

Drugname+signature

637 642 81.4/95.7 80.8/95.0 81.1/95.3

All 1693 1691 84.7/94.3 84.8/94.5 84.8/94.4

Page 13: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

Adapting VA EF-Extractor to CLAMP

13

Page 14: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

EF-Extractor by VA• Leftventricularejectionfraction(EF) isakeycomponentofheartfailurequalitymeasuresusedwithintheDepartmentofVeteranAffairs(VA)

• EF-Extractor useregularexpressionsandrulestocapturetheEF;

• ItisUIMAbasedandcanrunonUIMAAS;• WeintegratetheEF-Extractor asan‘user-defined-component’intoCLAMP;

Page 15: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

Integrate EF_Extractor into clamp

Page 16: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

Further Improvement with clampAddmorerulesandRuta scripttofurtherimprovetheperformance;

Page 17: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

Test on UT dataset• Extract200sentencesfromUTnotesthatcontainkeywords

like‘ejectionfraction’or‘EF’..• Evaluation:

– Conceptlevel:RecognizeallEFmentionsandrelatedvalues;

Algorithm Dataset Precision Recall F1

Original VA* 95% 88.9% 0.919Original UTHealth 83.0% 84.7% 0.838Customized UTHealth 98.2% 89.1% 0.934

*GarvinJHetalJAMIA2012

Page 18: Extracting CHF information from clinical text using …pscanner.ucsd.edu/sites/pscanner.ucsd.edu/files/uploads...– Task management – Visual analytics 2 CLAMP CMD – built on proven

CLAMP – It’s all about transportability

• Buildingcustomizedpipelinesfordifferentapplicationsusingdefaultcomponents

• Integratingexistingcomponents/toolsintoCLAMP

• ExportingCLAMPpipelinestootherNLPplatforms