machine learning-based predictions of prognosis in cystic ... · machine learning-based predictions...

Machine Learning-based predictions of prognosis in Cystic Fibrosis A HMED M. A LAA 1 ,T HOMAS DANIELS 2 , R. A NDRES F LOTO 3 , AND M IHAELA VAN DER S CHAAR 1,4 1 UCLA, 2 U NIVERSITY H OSPITAL S OUTHAMPTON , 3 D EPARTMENT OF M EDICINE ,U NIVERSITY OF C AMBRIDGE &ROYAL P APWORTH H OSPITAL ,C AMBRIDGE , 4 A LAN T URING I NSTITUTE This work was supported by the UK Cystic Fibrosis Trust. O BJECTIVES Using an automated machine learning framework (Auto- Prognosis) to build a model for predicting 3-year mortality in CF patients using data from the UK CF registry. ————————————————————— Cruical for man y clinical decisions : • Establishing the optimal timing for referring patients for lung transplantation. • Administring different types of treatments. ————————————————————— Our method: AutoPro gnosis • Automatically constructs ensembles of prognostic modeling pipelines, provides “clinical explanations” for the learned models, and can easily update its learned models as more data is collected over time. • A prognostic modeling pipeline: data imputation, feature processing and classification algorithms. P ATIENT C OHORT Inclusion criteria : • Enrolled in the UK CF registry with annual follow- up data available on the 31 st of December, 2012. • Adult patients who are over 18 years of age. • Follow-up data on the 31 st of December, 2015. Cohort statistics : • A total of 4,064 patients included. • A total of 115 variables for each patient. • Mortality rate was 9.4%. D IAGNOSTIC ACCURACY How should dia gnostic accurac y be evaluated? —————————————————————————- • Most previous works focused on AUC-ROC, but AUC-ROC can be deceptively large because true negatives can be trivially predicted + AUC-ROC does not account for imbalanced outcomes. • Alternative: area under the precision-recall curve (average precision) focuses only on positive cases. —————————————————————————————————————————————- Impact on lung transplant referral decisions ——————————————————————————- • Operating point: fix the negative predictive value (NPV) at the one achieved by the "FEV 1 < 30%" criterion. • Improvement in the positive predictive value (PPV) from 48% to 65%. —————————————————————————————————————————————- R ISK F ACTORS Which patient variables best explain accuracy gains? ——————————————————————– • Variable importance rankings depend on the diagnostic accuracy metric used. • Oxygen therapy is the variable used by machine learning to improve precision. —————————————————————————————————————————————- C ONCLUSIONS • The area under precision-recall curve is a more appropriate metric than AUC-ROC for evaluating prognostic scores. This fact was overlooked by previous works. • Our results indicate that competitive machine learning approaches significantly improve prognostic forecasting and will support optimized referral for lung transplantation. • Incorporating variables related to gas exchange (Oxygenation) into predictive models in addition to spirometric variables can significantly boost the precision of lung transplant referral decisions.

Upload: others

Post on 21-Aug-2020

3 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: Machine Learning-based predictions of prognosis in Cystic ... · Machine Learning-based predictions of prognosis in Cystic Fibrosis AHMED M. ALAA1,THOMAS DANIELS2, R.ANDRES FLOTO3,

Machine Learning-based predictions of prognosis in Cystic FibrosisAHMED M. ALAA1,THOMAS DANIELS2, R. ANDRES FLOTO3, AND MIHAELA VAN DER SCHAAR1,4

1UCLA, 2 UNIVERSITY HOSPITAL SOUTHAMPTON, 3 DEPARTMENT OF MEDICINE, UNIVERSITY OF CAMBRIDGE & ROYAL PAPWORTH HOSPITAL, CAMBRIDGE, 4 ALAN TURING INSTITUTE

This work was supported by the UK Cystic Fibrosis Trust.

OBJECTIVESUsing an automated machine learning framework (Auto-Prognosis) to build a model for predicting 3-year mortal-ity in CF patients using data from the UK CF registry.—————————————————————Cruical for many clinical decisions:• Establishing the optimal timing for referring patients forlung transplantation.• Administring different types of treatments.—————————————————————Our method: AutoPrognosis• Automatically constructs ensembles of prognostic mod-eling pipelines, provides “clinical explanations” for thelearned models, and can easily update its learned models asmore data is collected over time.

• A prognostic modeling pipeline: data imputation, featureprocessing and classification algorithms.

PATIENT COHORTInclusion criteria:• Enrolled in the UK CF registry with annual follow-up data available on the 31st of December, 2012.• Adult patients who are over 18 years of age.• Follow-up data on the 31st of December, 2015.

Cohort statistics:• A total of 4,064 patients included.• A total of 115 variables for each patient.• Mortality rate was 9.4%.

DIAGNOSTIC ACCURACYHow should diagnostic accuracy be evaluated? —————————————————————————-• Most previous works focused on AUC-ROC, but AUC-ROC can be deceptively large because true negativescan be trivially predicted + AUC-ROC does not account for imbalanced outcomes.• Alternative: area under the precision-recall curve (average precision) focuses only on positive cases.—————————————————————————————————————————————-

Impact on lung transplant referral decisions ——————————————————————————-• Operating point: fix the negative predictive value (NPV) at the one achieved by the "FEV1 < 30%" criterion.• Improvement in the positive predictive value (PPV) from 48% to 65%.—————————————————————————————————————————————-

RISK FACTORSWhich patient variables best explain accuracy gains? ——————————————————————–• Variable importance rankings depend on the diagnostic accuracy metric used.• Oxygen therapy is the variable used by machine learning to improve precision.—————————————————————————————————————————————-

CONCLUSIONS• The area under precision-recall curve is a more appropriate metric than AUC-ROC for evaluatingprognostic scores. This fact was overlooked by previous works.

• Our results indicate that competitive machine learning approaches significantly improve prognostic forecastingand will support optimized referral for lung transplantation.

• Incorporating variables related to gas exchange (Oxygenation) into predictive models in addition to spirometricvariables can significantly boost the precision of lung transplant referral decisions.

Lecture 27 Cystic Fibrosis Cho CYSTIC FIBROSIS: …...Lecture 27 Cystic Fibrosis Cho Class III opening of the channel- CYSTIC FIBROSIS: CFTR MUTATATION CLASSES Autosomal recessive

1 Prognosis

Cystic Pancreatic Neoplasms Diagnosis of Cystic and

disease prognosis

Series Of Fetal Structural Abnormalities Detected By … · The prognosis of congenital cystic adenomatoid malformation is variable, depending on the size rather than histological

CONFLICT PROGNOSIS

Diagnosis, Prognosis and Personalized Treatment of Cystic ... · Valter (e novos membros!), ao Tio João e família, por torcerem sempre por mim e celebrarem todos os meus pequenos

THE CANADIAN CYSTIC FIBROSIS REGISTRY Report 2015/2015...Canadian Cystic Fibrosis Registry 2015 Annual Report • 2 THE CANADIAN CYSTIC FIBROSIS REGISTRY The Canadian Cystic Fibrosis

Cystic Pancreatic Neoplasms Diagnosis of Cystic and Intraductal

ALS Prognosis

Prognosis - Hope

PROGNOSIS management for financial applications - IRgo.prognosis.com/rs/prognosis/images/ProductSheet-PROGNOSIS... · processors, ATM ISOs and retailers. ... PROGNOSIS management