making obamacare work with big data
DESCRIPTION
Bob Rogers, PhD, Chief Scientist and Co-founder at Apixio, and Vishnu Vyas, Principal Scientist at Apixio will be presenting on October 30, 2013. They will describe use cases in which Apixio is using NoSQL and Hadoop to deliver powerful risk assessment results based on unstructured data in electronic health record systems.TRANSCRIPT
![Page 1: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/1.jpg)
Making ObamaCare Work With Big Data
Healthcare Use Cases
![Page 2: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/2.jpg)
About Bob
![Page 3: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/3.jpg)
About Vishnu
![Page 4: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/4.jpg)
Overview
• What is wrong with healthcare?• What is ObamaCare?• What does patient data look like?• Risk Adjustment use case• Care Network use case• Apixio’s Big Data solutions
![Page 5: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/5.jpg)
Poll
Are you a:A. ProgrammerB. Data scientistC. ManagerD. Health IT technologistE. Other?
![Page 6: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/6.jpg)
What Is Wrong With Healthcare?
Fee-For-Service
![Page 7: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/7.jpg)
Skyrocketing Cost
84 Million People Under- or Un-Insured
![Page 8: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/8.jpg)
Healthcare Reform
• Liberate Clinical Data• Care Coordination• Efficiency• Risk Adjustment
![Page 9: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/9.jpg)
What Does a Patient Look Like To A Data Scientist?
![Page 10: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/10.jpg)
Structured Data
![Page 11: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/11.jpg)
Text
![Page 12: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/12.jpg)
Scanned Documents
![Page 13: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/13.jpg)
% with Pneumococcal
Vaccine
54%
17%
No CodedHistory in EHR
Coded History
Decision Support Fails Without Access to Required Clinical Data
3 x lower
Coded29%
Non-coded71%
How is Splenectomy documented?
04/11/2023
![Page 14: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/14.jpg)
Poll:
What percent of the key clinical data to you think is missing from the coded layer?
A. 10-25 %B. 25-50 %C. 50-75%D. 75+ %
![Page 15: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/15.jpg)
>63%Missing
![Page 16: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/16.jpg)
Jonathan Everett & Bob Rogers
Real Patient Example
Coded Data
Free Text
Scanned Documents
Other Data Silos
![Page 17: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/17.jpg)
17
Question your assumptions about data.
![Page 18: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/18.jpg)
18
![Page 19: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/19.jpg)
“Heart Failure”in EHR problem list
Is it Heart Failure?
![Page 20: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/20.jpg)
Heart Failure No Heart Failure
… or Chart Failure?
![Page 21: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/21.jpg)
Where is the valuable data?
![Page 22: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/22.jpg)
How Much Data Is There?
Sources: EHR Structured, EHR Text, EHR Scanned, Claims, RAPS
200,000 Pts over 5 years 10 TBStructured: 13 M unique codes
4.8 M CPT, 4.8 M ICD9
Narrati ve: 338 M unique codes98 M CPT, 120 M ICD9
![Page 23: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/23.jpg)
Use Case: Risk Adjustment
![Page 24: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/24.jpg)
How risk scores are used1.01 1.20
![Page 25: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/25.jpg)
Risk Assessment & Risk Scores
CADICD-9 746.85
HCC 138Score: 0.312
Total Score: 0.6
+0.312-------------0.912
![Page 26: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/26.jpg)
Risk Assessment & Risk Scores
Decubitus UlcerICD-9 707.14
HCC 217Score: 0.954
Type II DiabetesICD-9 250.00
HCC 19Score: 0.215
Total Score: 0.6
+0.215+0.954-------------1.769
![Page 27: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/27.jpg)
Where’s the beef?
AssessMonitor Evaluate Treat
MEAT
![Page 28: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/28.jpg)
Manual Chart Audit
1 hour per chart100,000 patients=11.4 PERSON-YEARS!
![Page 29: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/29.jpg)
Use Case: Care Network
Referring MD represented in GreenConsulting MD represented in Blue
29
![Page 30: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/30.jpg)
Referring MD represented in GreenConsulting MD represented in Blue
30
Care Network- Referrals of Interest
![Page 31: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/31.jpg)
How Do We Solve These Problems?
![Page 32: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/32.jpg)
Apixio Architecture High Level
EHR coded data
EHR text documents
EHR scan documents
Claims
ParseOCR
Norm.Load
Client Ingest Pipeline
Patient ObjectModel
GeneralEvent Stream
HCCEvent Stream
QualityEvent Stream
ReferralEvent Stream
3rd PartyEvent Stream
API
Clinical Knowledge Exchange
CareOptimizer
Quality Optimizer
HCCOptimizer
3rd PartyEvent Stream
Application
Eligibility
Provider files
![Page 33: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/33.jpg)
Apixio Platform Physical Architecture
Audit(Trace CF)
Logging(Hive/Trace CF)
Metrics (Graphite)
Apixio Pipeline Receiver (HTTP)
Cassandra Hive/HDFS S3
Apixio REST API
Web TierJava/Python
External Clients End Users
Persistence
ComputeJob Control Pipeline
ApplicationsExperimental Infrastructure
Logging
![Page 34: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/34.jpg)
Apixio Platform Logical Architecture
• Append Only Model in Cassandra• Document Based
L0
![Page 35: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/35.jpg)
L0 – Document Level• Stored in cassandra• 2 Column Family / Customer• Append only
ApixioID DOCID1 DOCID2 DOCID3
Partial Patient Object
Partial Patient Object
Partial Patient Object
Documents Column Family
DocID:<DOCID> ApixioID
APIXIOID
Indices Column Family (2 types of data)
DocHash:<HASH> ApixioID
APIXIOID
![Page 36: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/36.jpg)
L1 – Event Streams
An event is an assertion (fact) about a specific subject (patient) at a specific time
Cassandra
Event Extraction&
Inference
HIVE/HDFS
![Page 37: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/37.jpg)
Event Extraction & Inference
Cassandra
Event Extractors
Event Transformer
HIVE/HDFS
Mapper Reducer
Event Extractors
Event Transformer
Converts Documents/Patients to Events
Combines multiple events to create new events
![Page 38: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/38.jpg)
Event Extraction & Inference
Stacking Composition
Event Extractors
Event Extractors
Event Extractors
Event Extractors
Event Extractors
Event Extractors
Sequencing Composition
Functional Composition of extractors/transformers gives us a scalable flexible inference engine.
![Page 39: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/39.jpg)
Auditing• Access information stored in a tracing CF in cassandra• Append only• Keyed by document
DocID Timestamp1 Timestamp2 Timestamp2
Activity Info Activity Info Activity Info
Audit Column Family
Parsing User Access Timeline
We can reconstruct the timeline of activity on any document once it hits our system.
![Page 40: Making obamacare work with Big Data](https://reader037.vdocuments.us/reader037/viewer/2022110306/554b8431b4c90561588b4f46/html5/thumbnails/40.jpg)
What happens when something goes wrong?
• Comprehensive Logging through custom appenders (log4j)• All pipeline level events are logged to a trace column family• Real-time metrics logged through graphite.