w ehr d duke u h s : what is it h d i do it data slides.pdf · working with ehr data from duke...
TRANSCRIPT
-
WORKING WITH EHR DATA FROM DUKEUNIVERSITY HEALTH SYSTEM: WHAT IS IT
AND HOW DO I DO IT?
Benjamin A. Goldstein PhD, [email protected]
Department of Biostatistics & BioinformaticsSchool of Medicine
Duke University
May 13th, 2020
1 / 74
-
TALK AGENDA
What are Electronic Health RecordsWhat are EHR data elementsTypes of studies we can do with EHR dataSome analytic considerations with EHR dataA case study in an EHR based studyOptions for accessing Duke EHR data
2 / 74
-
WHAT ARE ELECTRONIC HEALTH RECORDS?
“An Electronic Health Record (EHR) is an electronic version of apatient’s medical history, that is maintained by the provider overtime” (Centers for Medicaire & Medicaid Services (CMS) website)HITECH Act was part of the 2009 stimulus geared to incentivizethe use of EHRsSynonyms: Electronic Medical Record (EMR), Patient HealthRecord (PHR)
3 / 74
-
GROWTH OF EHR USAGE
https://dashboard.healthit.gov/quickstats/quickstats.php
4 / 74
https://dashboard.healthit.gov/quickstats/quickstats.php
-
EHR VENDORS
https://dashboard.healthit.gov/quickstats/pages/
FIG-Vendors-of-EHRs-to-Participating-Professionals.php
5 / 74
https://dashboard.healthit.gov/quickstats/pages/FIG-Vendors-of-EHRs-to-Participating-Professionals.phphttps://dashboard.healthit.gov/quickstats/pages/FIG-Vendors-of-EHRs-to-Participating-Professionals.php
-
FRONT END OF EHRS
6 / 74
-
BACK END OF EHRS
7 / 74
-
COMPLEXITY OF EPIC BACKEND
Chronicles - ~95,000 Data ElementsData stored immediately
Clarity - ~17,000 Tables & 125,000 columns
Data stored overnight
Caboodle - 19 Tables & 76 Dimensions
Disease Specific Registries- Diabetes- Afib- Etc
Analytic Tools- Provider Dashboards- Predictive models- Etc
8 / 74
-
DATA ELEMENTS
Patient DemographicsEncounters (Outpatient/Inpatient)DiagnosesProceduresLab ResultsVital SignsMedicationsSocial HistoryProvider InformationRadiological ResultsDoctor Notes
9 / 74
-
DEMOGRAPHICS
Patients have a single ID that follows them across all encounters- medical record number (MRN)Basic information: Age, Sex, Race/Ethnicity that is typically staticTime varying elements include: Payer, address
10 / 74
-
ENCOUNTER TYPE
Encounters have an encounter ID that links the encountercontext to what happened (diagnoses, tests etc.)Three Basic Encounters:
Outpatient (AV - Ambulatory Visit)Inpatient (IP)Emergency Department (ED)
Other types of encounters can include telephone consults,emails etc.
11 / 74
-
CONTEXTUALIZING INFORMATION FOR ENCOUNTERS
When someone seen (i.e. time stamps for arrival and departure)Who the patient saw (i.e. provider specialty, provider type)Where the patient was seen (i.e. clinic location, facility type)What happened (i.e. vital signs, labs taken, diagnoses made)We don’t have good information on Why — diagnoses don’toften relate to “chief complaint”
12 / 74
-
INTERNATIONAL CLASSIFICATION OF DISEASES (ICD)CODES
Hierarchical system to code all diagnoses that are made during ahealth encounterIn 2015, the US switched to the ICD-10 system (previouslyICD-9)ICD-9 had ∼13,000 unique codes, ICD-10 has ∼68,000Since these are used as billing codes, the codes can bemanipulated to increase billingCodes don’t always represent the primary concern
13 / 74
-
STRUCTURE OF ICD-10 CODES
Myocardial Infarction:I21⇒ Acute Myocardial Infarction
Subsequent numbers designate location of event, e.g. I21.01⇒ MIof left main coronary artery
I22⇒ Subsequent MII23⇒ Complications of MIhttps:
//www.icd10data.com/ICD10CM/Codes/I00-I99/I20-I25
14 / 74
https://www.icd10data.com/ICD10CM/Codes/I00-I99/I20-I25https://www.icd10data.com/ICD10CM/Codes/I00-I99/I20-I25
-
ROLLING UP ICD-CODES
Dealing with 68,000 unique codes is not realistic or efficientAgency for Healthcare Research and Quality (AHRQ) developedClinical Classification Software (CCS) systemAllows researchers to roll codes up to appropriate levels
https://www.hcup-us.ahrq.gov/toolssoftware/ccs/
AppendixCMultiDX.txt
15 / 74
https://www.hcup-us.ahrq.gov/toolssoftware/ccs/AppendixCMultiDX.txthttps://www.hcup-us.ahrq.gov/toolssoftware/ccs/AppendixCMultiDX.txt
-
CURRENT PROCEDURAL TERMINOLOGY (CPT) CODES
CPT is coding system for what happened during an encounter,e.g., surgeries, x-rays, etc.∼ 10,000 in useAlso tied to reimbursementsSimilar systems for organizing CPTs as ICDs
16 / 74
-
MEDICATIONS
EPIC has > 100 medication-related tablesMedications are often organized as:
PrescribedAdministeredReconciliation
For prescribed medication dosages may be messyLike diagnoses and procedures medications can become overlygranular
RxNorm is a system for rolling up medications into hierarchieshttps://mor.nlm.nih.gov/RxNav/search?searchBy=String&
searchTerm=acetaminophen
17 / 74
https://mor.nlm.nih.gov/RxNav/search?searchBy=String&searchTerm=acetaminophenhttps://mor.nlm.nih.gov/RxNav/search?searchBy=String&searchTerm=acetaminophen
-
LABORATORY TEST RESULTS
Laboratory tests (along with vitals) differentiate EHR data fromadministrative dataThere may be multiple tests panels used which can be labeleddifferentlyModern systems have standardized the nomenclature oflaboratory tests
Duke has a catalog of the laboratory tests used:https://testcatalog.duke.edu/
Typically will see time stamps for when test was ordered andresultedAn analytic concern is that these measurements are irregularlycaptured across encounters
18 / 74
https://testcatalog.duke.edu/
-
VITALS SIGNS
Most encounters will capture blood pressure, weight andtemperatureIn the hospital vitals may be documented every couple hoursICU monitors can capture very dense data: minute-by-minute oreven waveformData will typically be stored in long running tallies called“flowsheets”
19 / 74
-
SOCIAL HEALTH
Data such as smoking status, drug and alcohol use, employmentstatus, marital status, etc., may be reported, but is frequentlyunreliableSocioeconomic status typically doesn’t exist but proxies can beused via primary payer or neighborhood addressThere is a growing emphasis on capturing patient reportedoutcomes (PROs)
Food insecurity, PROMIS, pain, depression inventories
20 / 74
-
OTHER DATA ELEMENTS
Problem ListsDate stamped indicators for when someone has different conditions- not always reliable
Admission-Discharge-Transfer (ADT) DataTime stamps are recorded every time a patient moves in thehospital
Provider DataInformation on who a patient saw and interacted with
User DataEvery time someone signs into EPIC a log is generated
21 / 74
-
UNSTRUCTURED DATA
Structured Data refer to quantitative data in a ready-to-analyzeformatGrowing emphasis on incorporating unstructured data whichrequire some processingExamples include:
NotesImagesGenetic data
22 / 74
-
ORGANIZING DATA
DATA LAKESLoose organization of dataAble to maintain all data elementsNo explicit linkage between data elementsCan be complicated to work with
DATA MARTSStructured data in a relational formatEasier to access dataDesigned for particular use case(s)Results in loss of informationHigher maintenance cost
23 / 74
-
NEED FOR DATA MODELS
24 / 74
-
PCORNET DATA MODEL
25 / 74
-
WORKING WITH DATA MODELS
ADVANTAGESSimpler data organization, making it easier to accessUniform set of decisions so that data are consistent acrossinstitutions
DISADVANTAGESA general loss of granularity
Not all data elements fit within the data modelMany measures are grouped together
26 / 74
-
WHY WE WANT TO USE EHR DATA FOR CLINICALRESEARCH
Data Readily AvailableOften 100,000’s of PatientsInformation collected over a variety of fieldsAbility to study many different clinical questionsRepresentative population
27 / 74
-
WHY WE MAY Not WANT TO USE EHR DATA FORCLINICAL RESEARCH
DATA ARE NOT ORGANIZED FOR RESEARCHData exist in disparate placesAll patients have different pieces of informationObservational Data
28 / 74
-
EHR VS CLINICAL TRIALS DATA
RCT Data EHR DataWhy are data Data are collected for Data are collected forcollected? the study clinical careWhen are data Pre-planned study visits Random clinicalcollected? encountersWho/Where are Research staff enter Entered by cliniciansdata entered? into CRFsWhat data are Same data for Only information deemedentered? all patients important by clinicianHow are data Statistician pulls Informaticist extractsextracted? from RedCap dataHow are studies Top-down - start with study Bidirectional design -designed? and collect relevant data start with study but assess
available data
29 / 74
-
WHAT WE CAN DO WITH ELECTRONIC HEALTHRECORDS
1 Risk PredictionNear term prediction - Risk of in-hospital mortalityLong(er) term risk - 30 Day Revisit
2 Population HealthHealth Service Utilization - Assessment of high utilizersDisease Epidemiology - Experience of incident diabetes in DurhamCounty
3 Comparative Effectiveness Research (CER)Retrospective Studies - Assessment of community intervention fordiabeticsProspective Studies - Point of care randomization, Pragmatic Trials
4 Association AnalysesRisk factors for disease - Phenome Wide Association StudiesData mining - Drug-Drug interactions
30 / 74
-
EHR BASED STUDIES
Retrospective ProspectiveRisk I Returning to Hospital I Implement alert forPrediction 30 days after discharge Readmissions RiskIntervention I Compare Medical vs I Point of CareAssessment Surgical Treatment RandomizationPopulation I Experience of Incident I Screening forHealth Diabetics Diabetes
31 / 74
-
ADVANTAGES OF STUDIES BASED ON EHR DATA
POPULATION HEALTHEHRs often capture data on particular communities of interest
Estimated that ∼ 80% of Durham County residents receive healthservice at Duke University Health System affiliated providers
COMPARATIVE EFFECTIVENESSOpportunity to see real-world usage of medical treatmentsCan assess both adoption of therapies and effectiveness &safety of therapies
RISK PREDICTIONContains granular information capturing patients’ clinical stateDirect pipeline to implement models into clinical practice
32 / 74
-
WHO IS NEEDED FOR AN EHR BASED STUDY?
Biostatistician Informaticist
Epidemiologist/Clinician
Data Manipulation
Study Design
Data Extraction
Analysis
Research Question
Variable Definition
Collaborative Clinical Research
Clinical Research
33 / 74
-
SUMMARY POINTS
EHR systems are complex database systems to store patienthealth dataFor clinical researchers there are a lot of appealing reasons towant to work with EHR dataThe raw data elements often need to be processed - typicallythrough hierarchical structuresOrganizing the data into data models can aid analytics
34 / 74
-
Analytic Challenges with EHR Data
35 / 74
-
CREATING ANALYTIC DATA SETS
Don’t work directly off datamarts but create analytic data setsCreating analytic datasets requires many decisions, most ofwhich can’t be tested via sensitivity analysesNeed to define granularity of the analysis (encounter, patient,etc.)Larger projects may need datamart sub-extracts to generatemultiple analytic data sets
36 / 74
-
THREE WAYS EHR DATA DIFFER FROMTRADITIONAL CLINICAL DATA
1 We don’t have everything we want2 Outcomes are not defined - need to phenotype data3 Data are irregularly and potentially densely captured
37 / 74
-
CHALLENGE 1:WE DON’T HAVE EVERYTHING WE WANT
Patients may seek care at multiple facilitiesChanges in standard of careMissing information on when individuals are healthyMost social health information is not recorded or reliableCannot expect death to be reliably captured
Most people don’t die in the hospitalEHRs have only 20-50% sensitivity
38 / 74
-
LINKING EHR DATA
Data from other facilities (PCORNet)Claims: Center for Medicaire & Medicaid Services (CMS)Mortality: National Death Index (NDI) & Social Security DeathIndex (SSDI)Genetic DataGeocoded Information: housing, environmental, censusPersonal Tracking Data: wearables, sensors
39 / 74
-
MISSING DATA IN EHR DATA SETS
Data in EHRs are typically not missing but not collectedData are Not Missing At Random (NMAR)
Typical imputation strategies would not be appropriateWe’ve termed this Informed Presence
Inclusion of proxy measures:Missingness categoriesNumber of previous encountersNumber of times a lab was tested or code was documented
40 / 74
-
ADDRESSING INCOMPLETENESS VIA DESIGN
Inpatient analyses are typically well contained but outpatientanalyses can lead to loss to follow-upDefine local patient population
Live in the catchment of the health systemRequire a certain a number of primary care appointments beforeeligible for study
Contextual and proxy information can be linked inNeighborhood for SESClaims data for additional encountersNDI/SSDI for death
41 / 74
-
CHALLENGE 2NEED FOR COMPUTABLE PHENOTYPES
EHR data do not have direct information on disease states“Problem Lists” exist but are not always reliableWe can develop algorithms to define disease statesThe algorithms typically have high positive predictive value andspecificity but not always high sensitivity
42 / 74
-
ISSUES OF DATA DEFINITION:WHAT IS A DIABETIC?
Richesson RL, Rusincovitch SA, Wixted D, Batch BC, Feinglos MN, Miranda ML, Hammond WE, Califf RM, Spratt SE. A Comparison of Phenotype Definitions for Diabetes Mellitus. J Am Med Inf Assoc 2013 (epub ahead of print). http://www.ncbi.nlm.nih.gov/pubmed/24026307
43 / 74
-
ISSUES OF DATA DEFINITION:WHAT IS A DIABETIC?
ICD-9 250.x0 Expand. ICD-9ICD-9 & 250.x2 (249.xx, 357.2, Abnormal Diabetes250.xx (exclude type I) 362.0x, 366.41) HbA1c Glucose OGTT Meds
ICD-9 250.xx XCMS CCW X* X*NYC A1c Registry XMeds XDDC X X X X X XSUPREME-DM X* X* X X X X•eMERGE X* X X X
* Distinction between Inpatient and Outpatient Visits
44 / 74
-
DEFINITION DIFFERENCES
ANY TYPE2 TYPE2unsp
0.6
0.7
0.8
0.9
1.0
0.00 0.01 0.02 0.03 0.04 0.01 0.02 0.03 0.04 0.05 0.01 0.02 0.03 0.04 0.051−Specificity (FPF)
Sen
sitiv
ity (
TP
F)
AuthoritativeSource 250 A1C CCW DDC4 MED NW SUP A1C_OR_MED
Diabetes Validation Results faceted by Endpoint
Spratt et al. JAMIA 201745 / 74
-
ADDITIONAL PHENOTYPING CHALLENGES
Death: Need to include external dataDisease Incidence: Need to apply ‘burn-in’ periodsCensoring: Need to apply ‘burn-out’ periods
46 / 74
-
CHALLENGE 3:DATA ARE BOTH LONGITUDINAL AND CROSS-SECTIONAL
EHR data consists of cross-sections of longitudinal dataMost data are stored in datamarts that cover fixed periods of time
Need to use methods for longitudinal data to model updatingexposures
We most often use time-varying Cox ModelsMost analyses don’t account for a patient’s trajectory - just mostrecent value
Since data are a cross-section no notion of time 0Define “burn-in” periods to define eligibilityUse “burn-out” periods to define censoring
47 / 74
-
MULTIPLE MEASUREMENTS PER PERSON
OPPORTUNITIESGet to observe patient’s evolving health statusMore frequent visits than a typical longitudinal studyDenser visit information
CHALLENGESVisits are irregularly spacedDifferent ways to aggregateYou don’t know which data are not captured in your dataset
48 / 74
-
ASSESSING DATA QUALITY
Weiskopf et al. EGEMS 2017
49 / 74
-
TAKE HOMES
Turning EHR data into analytic data is an involved process thatrequires many choicesWhen analyzing data we may sit downstream of these choicesand not get to test their impactThe temporal structure of EHR data opens up the need fordifferent analytic methods
50 / 74
-
CASE STUDY: ENVIRONMENTAL IMPACTS ON ASTHMAEXACERBATION IN CHILDREN
Research Question: How do the built and natural environmentsimpact children with asthmaStudy Population: Children (ages 5-18) who have a diagnosisof asthma and live in Durham CountyExposure: Weather, air quality, distance to highway, etc.Outcome: Asthma exacerbationsData Source: Duke EHR data linked with publicly availabletemporal-spatial data
51 / 74
-
ANALYTIC STEPS
1 Abstract all children with asthma within Duke EHRDefined computable phenotype based on diagnosis codes andmedication prescription
2 Identify period of time children had address in Durham County3 Capture asthma exacerbations based on encounter type (e.g.,
outpatient, urgent care, ED, inpatient), diagnosis code andprescription for rescue medication
4 Link in exposure data based on patient address (spatial factors)and dates of service (temporal factors)
5 Extract other clinical data such as comorbidities, BMI, andlaboratory values
Data elements used: Demographics, Diagnoses, Medications,Laboratory Test, Vitals, Address, Service Location, Geo-Coded Data
52 / 74
-
DISTANCE TO HIGHWAY AND ASTHMA EXACERBATION
53 / 74
-
PREDICTING DAILY EXACERBATIONSBASED ON ENVIRONMENTAL FACTORS
54 / 74
-
OTHER ANALYSES
Protective effect of Well-Child visitsFactors associated with medication escalationBuilt environment’s impact on asthma exacerbationImpact of COVID exposure and stay at home order on kids withasthma
55 / 74
-
Accessing EHR Data at Duke
56 / 74
-
EHR DATA AT DUKE
Duke University Health System (DUHS) consists of 3 hospitals (2in Durham, 1 in Raleigh) and a network of outpatient andspecialty clinicsThe EHR system is managed by Duke Health TechnologySolutions (DHTS)
They are responsible for meeting both operations and researchdata needs
Duke finished switching to an integrated EPIC based system inAugust 2013Before this different departments had their own EHR systemsLegacy (pre-2014) data are available but may be less reliable
57 / 74
-
EHR DATA ARE Big DATA
Since 2014 there have been:>1.7 million Unique Patients>400,000 Inpatient Encounters>27 million Outpatient Encounters
These patients each have diagnoses, vital signs, labs, medicationorders, etc.
58 / 74
-
POPULATION HEALTH AT DUKE
DUHS is the primary provider inDurham County
Estimated that ∼ 80% of DurhamCounty residents get health servicesat DUHS
One “hole” is Lincoln CommunityHealth Clinics which services anunderserved population
They share an EHR system withDUHS but special permission isneeded to access their data.
Durham Neighborhood Compass isa CTSI initiative to use Duke EHRdata to inform about public health inDurham County https://compass.durhamnc.gov/en/
compass/DIABETES_TOTAL/tract
59 / 74
https://compass.durhamnc.gov/en/compass/DIABETES_TOTAL/tracthttps://compass.durhamnc.gov/en/compass/DIABETES_TOTAL/tracthttps://compass.durhamnc.gov/en/compass/DIABETES_TOTAL/tract
-
THE NEED FOR PRE-REQUIREMENTSWith Great Power Comes Great Responsibility
Most forms of EHR data contain Protected Health Information(PHI)While data are available for minimal risk research purposes, it isimportant to protect the identity of patients, many of whom aremembers of the local communityData can exist as:
Fully Identified - names, DOB, Address, etcLimited Data - minimal PHI (e.g. Dates of Service)Deidentified - no PHI
One should use the minimal amount of PHI necessary forresearch purposes
60 / 74
-
SOME PRE-REQUIREMENTS
CITI IRB TrainingAn active IRB approved by the School of MedicineData stored in a secured server (PACE)Depending on degree of direct access a DHE (Duke Health)Account
61 / 74
-
PROTECTED ANALYTICS COMPUTING ENVIRONMENT(PACE)
HIPAA/FISMA Compliant Virtual Machine (Windows and Linuxenvironments)You can easily get data in but need to go through an honestbroker to get data outCannot connect to internet but preloaded with R, Python, SAS,GIS etc.Each user has resources equivalent to a laptop
Can connect to GPU Machines through Microsoft Azure
Directory structure is based off the approved IRB protocol -effective for project teams to share data and code
https://pace.ori.duke.edu/
62 / 74
https://pace.ori.duke.edu/
-
3 POINTS OF ACCESS(IN REALITY MORE)
Self ServiceGUI Based Tools (DEDUCE)Code Based Tools (CRDM)
Expert Assisted Access - ACE Fee For Service
63 / 74
-
3 POINTS OF ACCESS(IN REALITY MORE)
Self ServiceGUI Based Tools (DEDUCE)Code Based Tools (CRDM)
Expert Assisted Access - ACE Fee For Service
63 / 74
-
DEDUCE
GUI based system to query data - easiest way to access dataBuild ”cohort” via hierarchical queries and indicate whichelements to extractOrder of operations can impact the way cohort is builtGUI based cannot “run” jobs repeatedlyData go back to the ’90s but data pre-2014 (pre-EPIC data) maybe less reliable
64 / 74
-
CLINICAL RESEARCH DATAMART (CRDM)
Newer offering - launched earlier this year with support fromCHDI, TDH, CTSI, DHTShttps://sites.duke.edu/crdm
Designed to support clinical research and development of clinicalregistriesMeet the principles that:
Data pulls need to reproducibleProvide code based access to broader range of analystsMost data queries don’t need the most up-to-date dataMost studies use many of the same data elements (ICDs, Labs,Medications, etc.)Be linkable to other data assets such as department specificdatamarts and publicly available contextual data (e.g.neighborhood data, weather etc.)
Active development of new data tables to meet researcherneeds, e.g. family linkage table
65 / 74
https://sites.duke.edu/crdm
-
ORGANIZATION OF CRDM
Data organized into an extension of the PCORnet Common DataModel (CDM)
PCORnet CDM contains most standard data elements (labs,medication, diagnoses, etc.)Added additional tables such as providers, encounter details etc.Most useful when you don’t need details of the hospital encounters(e.g. no patient bed flow)
Only contains data starting in 2014 (when Duke switched toEPIC system)Data are refreshed and QA’d quarterly (working on daily refresh)Data are in a Oracle database that can be accessed from withinPACE via R/Python/SAS
Since code-based system reproducible data pulls
66 / 74
-
CRDM STRUCTURE
PCORnetData Model
Electronic Health Records
Data
All patientsStructured data
elements, curated to match a common
data modelMetadataCode base
Project 1 Project 2 Cohort
Department/Specialty Datamarts
e.g., Stork,Transplant,Cardiology
Other datasetse.g., claims data
Data Sidecars
IRB approval
Geospatial Data
67 / 74
-
ACCESSING THE CRDM
1 Obtain IRB/RPRR approval to access EHR data2 Get PACE account3 Fill out RedCap to register project and get initial access
https://redcap.duke.edu/redcap/surveys/?s=CFKLE9EKLY
4 Set up your initial Database Connection within PACE
68 / 74
https://redcap.duke.edu/redcap/surveys/?s=CFKLE9EKLY
-
SOME CRDM USE CASES
Population HealthMapping pediatric diseases in Durham CountyDeveloping a phenotype for patients with NASH & NAFLD
Comparative EffectivenessEvaluation of opioid prescriptions in post-surgical settingUse and effectiveness of biologic therapies for patients with asthma
Epidemiological StudiesMultifactorial analysis of pediatric asthma outcomesLipoprotein A testing in patients with cardiovascular disease risk
Predictive ModelingDeveloping a risk score for 30-day readmissionsPrediction of healthcare utilization for patients with Type 2 diabetes
RegistriesHealth outcomes and healthcare utilization patterns in pediatricpatients with epilepsyGrowth trajectories in pediatric patients who are obese
69 / 74
-
ACE FEE-FOR-SERVICE
Component of Duke Health Technology Solutions (DHTS)Custom data pulls and data services (e.g. building outdatamarts, APIs etc.)Cost and time is largely dependent on complexity of the datadomains requestedMost useful when you need unstructured data elements ordetailed hospital flow data
70 / 74
-
COMPARING SERVICES
DEDUCE CRDM ACE FFSAccess: GUI Interface Direct SQL Query Work with Informaicist
Reproducibility: Moderate High High(need to query in (need to reengagesame way) informaticist)
Automatable Queries: No Yes Yes
Data Elements: Most Structured PCORnet CDM + AllData Elements Duke Specific Side Cars
Data Refresh: Daily Quarterly Daily(Working towards daily)
Cost: Free Currently Free $(Creating CostRecovery Model)
Time to Access Data: Always Available Always Available Variable
Ideal use: Cohort creation Reproducible data Need to accessw/out needing to code extraction “harder” data elements
71 / 74
-
NAVIGATING THE SYSTEM:DUKE DATASHARE CATALOG
https://medschool.duke.edu/research/
data-science-information-technology/data/
data-services-catalog72 / 74
https://medschool.duke.edu/research/data-science-information-technology/data/data-services-cataloghttps://medschool.duke.edu/research/data-science-information-technology/data/data-services-cataloghttps://medschool.duke.edu/research/data-science-information-technology/data/data-services-catalog
-
NAVIGATING THE SYSTEM:EHR ENABLED RESEARCH SUPPORT GROUP (EERS)
Investigators can request a freeconsultation to review theinvestigator’s request in order to:
Define a technical approach for aproject timelineConnect the investigator withDuke resourcesRefine budget estimates
https://ctsi.duke.edu/
ehr-enabled-research-support-eers
73 / 74
https://ctsi.duke.edu/ehr-enabled-research-support-eershttps://ctsi.duke.edu/ehr-enabled-research-support-eers
-
CONCLUSIONS
The Duke EHR system contains a wealth of information onpatient healthThe data present opportunities to study population health withinDurham CountyEHR data can be quite complex both structurally andepidemiologicallyThere are multiple ways to access EHR data at Duke
74 / 74