Navigating the Chemical Universe with ChemMaps and OPERA

Nicole C. Kleinstreuer
NICEATM Deputy Director
PI, Comp Tox Group, DIR/BCBB

Understanding and Applying Read-Across for Human Health Risk Assessment
CalEPA-OEHHA
2nd May, 2019

National Institutes of Health, U.S. Department of Health and Human Services

Chemical space

“…‘Chemical space’ is a term often used in place of ‘multi-dimensional descriptor space’: it is a region defined by a particular choice of descriptors…”

Dobson CM (2004) Nature 432:824–828

Chemical space

Lipinski C, Hopkins A (2004) Nature 432:855–861.

• Locate chemical of interest
• Define/identify analogues
• Investigate ADME/Tox properties
• Optimize molecules, identify replacements
• Define, visualize domains

Efficient navigation tool

ChemMaps.com

Borrel, Kleinstreuer, and Fourches. Bioinformatics, 2018

Google Maps approach

• Interactive

• Easy to use

• Informative

• Responsive

• ….

Chemical space

DrugMap: pharmaceutical compounds

~8,000 drug entries (release 12-2018):

• ~2,500 FDA-approved small molecule drugs

• Over 5,000 experimental drugs.

https://www.drugbank.ca/

Environmental Chemical Space

v1: EnvMap

~48,000 chemicals with 3D descriptors

Informed by regulatory lists*:

• Endocrine Disruptor Screening Program

• Toxic Substances Control Act Inventory

• Canadian Domestic Substances List

• Swedish Chemicals Agency

~12,000 chemicals with acute systemic toxicity data

*not inclusive

v2 (under development): Extended Universe: Distributed Structure-Searchable Toxicity (DSSTox) Database (EPA – EPA comptox)

~800,000 chemicals (chemical infrastructure for EPA’s Safer Chemicals Research, including the ToxCast and Tox21 high-throughput toxicology efforts)

OPERA predictions for physchem properties and tox endpoints

Tox21map and PFASmap underway

OPERA approach

• Curated open access datasets (https://doi.org/10.1186/s13321-018-0263-1)

• Open-source code (github.com/NIEHS/OPERA)

• Transparent unambiguous algorithms (https://qsardb.jrc.ec.europa.eu/qmrf/)

• Transparent validated performances (https://doi.org/10.1080/1062936X.2016.1253611)

• Defined applicability domain and limitations of the models

• Predictions available through:

• The EPA’s CompTox Dashboard (https://comptox.epa.gov/dashboard)

• Free and open-source standalone application (github.com/NIEHS/OPERA)

QSARs for regulatory purposes

The 5 OECD Principles

1) A defined endpoint
2) An unambiguous algorithm
3) A defined domain of applicability
4) Appropriate measures of goodness-of-fit, robustness and predictivity
5) Mechanistic interpretation, if possible

* http://www.oecd.org/chemicalsafety/risk-assessment/37849783.pdf

OPERA modeling steps and considerations

Step | Description
Curation of the data | Flagged and curated files available for sharing
Preparation of training and test sets | Inserted as a field in SDFiles and csv data files
Calculation of an initial set of descriptors | PaDEL & CDK 2D descriptors and fingerprints
Selection of a mathematical method | Several approaches tested: KNN, PLS, SVM…
Variable selection technique | Genetic algorithm
Validation of the model’s predictive ability | 5-fold cross validation and external test set
Define the Applicability Domain | Local (nearest neighbors) and global (leverage) approaches
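The validation step above (an external test set plus 5-fold cross-validation on the training set) can be sketched roughly as follows. This is only an illustration: the function names, the 25% hold-out fraction, and the interleaved fold assignment are assumptions, not details taken from the slides.

```python
import random

def split_train_test(ids, test_fraction=0.25, seed=0):
    """Shuffle the ids and hold out a fraction as an external test set (fraction is assumed)."""
    shuffled = list(ids)
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

def k_fold_indices(n_items, k=5):
    """Yield (train_idx, valid_idx) pairs so that each item is validated exactly once."""
    folds = [list(range(i, n_items, k)) for i in range(k)]
    for i, valid in enumerate(folds):
        train = [j for f_idx, fold in enumerate(folds) if f_idx != i for j in fold]
        yield train, valid

# Hypothetical dataset of 100 chemical ids.
train_ids, test_ids = split_train_test(range(100), test_fraction=0.25)
folds = list(k_fold_indices(len(train_ids), k=5))
```

The external test set is never touched during cross-validation, so the two performance estimates stay independent.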

KNIME Workflow to Evaluate the Data

Quality FLAGS and curated structures

Examples of Errors

• Valence errors
• Mismatching structures
• Duplicate structures
• Covalent halogens

QSAR-ready KNIME workflow

Structure standardization procedure

• Removal of duplicates
• Normalization of tautomers
• Cleaning of salts and counterions
• Removal of inorganics and mixtures
• Final inspection
• QSAR-ready structures

Indigo

Aim of the workflow:

• Combine different procedures and ideas
• Minimize the differences between the structures used for prediction
• Produce a flexible, free and open-source workflow to be shared

Mansouri et al. (http://ehp.niehs.nih.gov/15-10267/)
Fourches et al. J Chem Inf Model, 2010, 29, 476 – 488
Wedebye et al. Danish EPA Environmental Project No. 1503, 2013
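To make the standardization idea concrete, here is a toy sketch that works on plain SMILES strings. It is not the KNIME/Indigo workflow: the function names are hypothetical, the tokenizer is deliberately simplistic, and it skips tautomer normalization, neutralization, and canonicalization. It also treats any record with two carbon-containing fragments as a mixture, which would wrongly drop salts with organic counterions.

```python
import re

# Toy atom tokenizer: bracket atoms plus the SMILES organic subset (not a full parser).
_ATOM = re.compile(r"\[[0-9]*([A-Z][a-z]?|[a-z]{1,2})|Cl|Br|[BCNOPSFI]|[bcnops]")

def has_carbon(fragment):
    """True if any atom token in the SMILES fragment is carbon."""
    for match in _ATOM.finditer(fragment):
        symbol = match.group(1) or match.group(0)
        if symbol in ("C", "c"):
            return True
    return False

def qsar_ready(smiles_list):
    """Drop inorganics and mixtures, strip carbon-free counterions, remove exact duplicates."""
    seen, cleaned = set(), []
    for smiles in smiles_list:
        organic = [frag for frag in smiles.split(".") if has_carbon(frag)]
        if len(organic) != 1:   # no organic fragment (inorganic) or several (mixture)
            continue
        parent = organic[0]     # carbon-free counterions such as [Na+] were filtered out
        if parent not in seen:  # exact-string match only; no canonicalization
            seen.add(parent)
            cleaned.append(parent)
    return cleaned

records = ["CC(=O)[O-].[Na+]", "CC(=O)[O-]", "[Na+].[Cl-]", "CCO.c1ccccc1", "CCO"]
cleaned = qsar_ready(records)
```

A production workflow would rely on a cheminformatics toolkit for structure parsing and duplicate detection rather than on string handling.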

Curation to QSAR Ready Files

Mansouri et al. OPERA models. (https://link.springer.com/article/10.1186/s13321-018-0263-1)

Property | Initial file | Curated data | Curated QSAR-ready
AOP | 818 | 818 | 745
BCF | 685 | 618 | 608
BioHC | 175 | 151 | 150
Biowin | 1265 | 1196 | 1171
BP | 5890 | 5591 | 5436
HL | 1829 | 1758 | 1711
KM | 631 | 548 | 541
KOA | 308 | 277 | 270
LogP | 15809 | 14544 | 14041
MP | 10051 | 9120 | 8656
PC | 788 | 750 | 735
VP | 3037 | 2840 | 2716
WF | 5764 | 5076 | 4836
WS | 2348 | 2046 | 2010

LogP Model: weighted kNN

• Weighted 5-nearest neighbors
• 9 descriptors
• Training set: 10531 chemicals
• Test set: 3510 chemicals

5-fold CV: Q2=0.85, RMSE=0.69
Fitting: R2=0.86, RMSE=0.67
Test: R2=0.86, RMSE=0.78

Mansouri et al. OPERA models. (https://link.springer.com/article/10.1186/s13321-018-0263-1)
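A weighted k-nearest-neighbors prediction can be sketched in a few lines. The inverse-distance weighting and the Euclidean metric below are assumptions chosen for illustration; the slides do not state which weighting scheme OPERA uses, and real descriptors would need scaling first.

```python
import math

def weighted_knn_predict(query, train_x, train_y, k=5):
    """Predict a property as the inverse-distance-weighted mean of the k nearest neighbors."""
    nearest = sorted(
        (math.dist(query, x), y) for x, y in zip(train_x, train_y)
    )[:k]
    # An exact descriptor match has zero distance; return its value directly.
    if nearest[0][0] == 0.0:
        return nearest[0][1]
    weights = [1.0 / d for d, _ in nearest]
    return sum(w * y for w, (_, y) in zip(weights, nearest)) / sum(weights)

# Hypothetical one-descriptor training set where the property equals the descriptor.
train_x = [(0.0,), (1.0,), (2.0,), (3.0,), (4.0,), (10.0,)]
train_y = [0.0, 1.0, 2.0, 3.0, 4.0, 10.0]
midpoint = weighted_knn_predict((2.5,), train_x, train_y, k=2)
exact = weighted_knn_predict((2.0,), train_x, train_y, k=5)
```

Closer neighbors contribute more to the prediction, so a query that sits next to a training chemical is dominated by that chemical's measured value.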

Chemical space and AD definition

Descriptor space based the response domain:

• Global applicability domain (leverage)
• Local applicability domain (kNN)
• Accuracy estimate based on the 5NN

Reliable predictions for structurally similar chemicals.
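As a rough illustration of the global (leverage) check, the sketch below computes the leverage h = xᵀ(XᵀX)⁻¹x of a query descriptor vector and compares it with a threshold of 3p/n. That threshold is a common convention in the QSAR literature and is assumed here, as are the function names; the slides do not give OPERA's exact rule, and descriptors are used uncentered for brevity.

```python
def solve(a, b):
    """Solve a @ v = b by Gauss-Jordan elimination with partial pivoting (small systems)."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def leverage(query, train_x):
    """Leverage h = x^T (X^T X)^-1 x of a query descriptor vector x."""
    p = len(query)
    xtx = [[sum(row[i] * row[j] for row in train_x) for j in range(p)] for i in range(p)]
    v = solve(xtx, list(query))
    return sum(q * vi for q, vi in zip(query, v))

def in_global_domain(query, train_x, factor=3.0):
    """Inside the domain if the leverage does not exceed factor * p / n (assumed threshold)."""
    threshold = factor * len(query) / len(train_x)
    return leverage(query, train_x) <= threshold

# Hypothetical training set with two descriptors per chemical.
train_x = [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (2.0, 1.0), (1.0, 2.0), (2.0, 2.0)]
h_inside = leverage((1.0, 1.0), train_x)
```

A local check would complement this by looking at the distance from the query to its nearest training neighbors.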

OPERA Standalone Application

Command line and Graphical User Interface

https://github.com/NIEHS/OPERA

OPERA Standalone Application

Model | Property
AOH | Atmospheric Hydroxylation Rate
BCF | Bioconcentration Factor
BioHL | Biodegradation Half-life
RB | Ready Biodegradability
BP | Boiling Point
HL | Henry's Law Constant
KM | Fish Biotransformation Half-life
KOA | Octanol/Air Partition Coefficient
LogP | Octanol-water Partition Coefficient
MP | Melting Point
KOC | Soil Adsorption Coefficient
VP | Vapor Pressure
WS | Water solubility
RT | HPLC retention time

OPERA v1.5: Physchem & Env. fate

• Structural properties: Hybridization Ratio, nHBAcc, nHBDon, LipinskiRule, Topo PSA, Molar refractivity, Polarizability, electronegativity…
• pKa
• Log D
• ER activity (CERAPP): Agonist, Antagonist, Binding (https://ehp.niehs.nih.gov/15-10267/)
• AR activity (CoMPARA): Agonist, Antagonist, Binding (https://doi.org/10.13140/RG.2.2.19612.80009, https://doi.org/10.13140/RG.2.2.21850.03520)
• Acute toxicity (CATMoS): NT, VT, EPA categories, GHS categories, LD50 (https://doi.org/10.1016/j.comtox.2018.08.002)
• ADME: FUB, Clint

New in OPERA v2.2:

Models versioned separately from the tool

Supporting Regulatory Decisions

Far too many chemicals to test with standard animal-based methods

– Cost (~$1,000,000/chemical), time, animal welfare
– ~10,000 chemicals to be tested for EDSP, >50,000 for TSCA
– Fill the data gaps and bridge the lack of knowledge

Alternative

Endpoints predicted:

Endocrine Disruption: Estrogen (ER) & Androgen (AR)

• Binding
• Agonism
• Antagonism

Acute Systemic Toxicity: Oral LD50s

• Toxic/Very toxic
• LD50 point estimates
• EPA Categories
• GHS Categories

Global Collaborative Projects

CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity (2017/18)

CATMoS: Collaborative Acute Toxicity Modeling Suite (2018/19)

Endocrine Disruptor Screening Program (EDSP)

ICCVAM Acute Systemic Toxicity Workgroup

Over 100 international participants representing academia, industry, and government contributed.

ICCVAM, NICEATM

CERAPP (ER) & CoMPARA (AR)

Judson et al. Toxicol. Sci. (2015) 148: 137-154
Kleinstreuer N. C. et al. 2017 30 (4), 946-964.

Tox21/ToxCast ER Pathway Model: CERAPP consensus

Metric | Binding Train | Binding Test | Agonist Train | Agonist Test | Antagonist Train | Antagonist Test
Sn | 0.93 | 0.58 | 0.85 | 0.94 | 0.67 | 0.18
Sp | 0.97 | 0.92 | 0.98 | 0.94 | 0.94 | 0.90
BA | 0.95 | 0.75 | 0.92 | 0.94 | 0.80 | 0.54

Tox21/ToxCast AR Pathway Model: CoMPARA consensus

Metric | Binding Train | Binding Test | Agonist Train | Agonist Test | Antagonist Train | Antagonist Test
Sn | 0.99 | 0.69 | 0.95 | 0.74 | 1.00 | 0.61
Sp | 0.91 | 0.87 | 0.98 | 0.97 | 0.95 | 0.87
BA | 0.95 | 0.78 | 0.97 | 0.86 | 0.97 | 0.74
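For readers less familiar with the metrics in these tables, sensitivity (Sn), specificity (Sp), and balanced accuracy (BA) can be computed from binary calls as sketched below. The function name and the toy labels are hypothetical; only the standard definitions are used, with BA taken as the mean of Sn and Sp.

```python
def classification_metrics(y_true, y_pred):
    """Return (sensitivity, specificity, balanced accuracy) for binary labels (1 = active)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sn = tp / (tp + fn)          # fraction of true actives that were predicted active
    sp = tn / (tn + fp)          # fraction of true inactives that were predicted inactive
    return sn, sp, (sn + sp) / 2

# Hypothetical evaluation set: 4 actives, 6 inactives.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
sn, sp, ba = classification_metrics(y_true, y_pred)
```

Balanced accuracy is preferred over plain accuracy here because most chemicals are inactive, so a model that predicts "inactive" for everything would otherwise look deceptively good.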

Acute Oral Toxicity: CATMoS

Endpoints predicted:

• Binary models: Very Toxic (VT), Non-Toxic (NT)
• EPA Categories: I, II, III, IV
• GHS Categories: I, II, III, IV, NC
• LD50 point estimates (mg/kg)

Metric | Very Toxic (32 models) Train | Very Toxic Eval | Non-Toxic (33 models) Train | Non-Toxic Eval | EPA (26 models) Train | EPA Eval | GHS (23 models) Train | GHS Eval
Sn | 0.87 | 0.67 | 0.93 | 0.70 | 0.73 | 0.50 | 0.63 | 0.45
Sp | 0.94 | 0.96 | 0.96 | 0.88 | 0.96 | 0.91 | 0.91 | 0.92
BA | 0.93 | 0.81 | 0.94 | 0.79 | 0.83 | 0.71 | 0.77 | 0.68

In vivo (one value per endpoint, in the column order above): 0.81, 0.89, 0.82, 0.79

LD50 values (25 models)

Metric | Train | Eval | In Vivo
R2 | 0.84 | 0.64 | 0.80
RMSE | 0.32 | 0.51 | 0.42
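The R2 and RMSE figures reported for the LD50 models follow the usual regression definitions, sketched below with hypothetical observed and predicted values. The function names and data are illustrative only, and the units of the inputs are whatever scale the model was trained on.

```python
import math

def rmse(observed, predicted):
    """Root-mean-square error between observed and predicted values."""
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))

def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

# Hypothetical values, not data from the slides.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]
fit_rmse = rmse(observed, predicted)
fit_r2 = r_squared(observed, predicted)
```

Comparing the evaluation-set values with the "In Vivo" column gives a sense of how close the models come to the reproducibility of the animal data themselves.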

OPERA on the EPA Dashboard

Dashboard: https://comptox.epa.gov

• Calculation result for a chemical
• Model performance with full QMRF
• Nearest neighbors from training set
• Prediction report: prediction, AD and accuracy estimates

Video Demonstration

• Uploading SMILES in DrugMap

• Identifying priority chemicals in PFASMap

Virtual Bioprofiling of PFAS (D. Fourches, NCSU)

Molecular Docking: Attempt to predict interactions between two biologically relevant molecules (protein and ligand)

• Maximize Interactions
• Minimize Complex Energy

Three Step Process

• Protein Preparation
• Ligand Preparation
• Molecular Docking

Nuclear Receptors Under Investigation

• Estrogen Alpha Agonist
• Estrogen Alpha Antagonist
• Estrogen Beta Agonist
• Androgen Agonist
• Androgen Antagonist
• Progesterone
• Insulin
• Thyroid
• PPAR Gamma
• PPAR Delta
• PPAR Alpha

Underway: Dock the entire library of ~5,000 PFAS derivatives against ~350 protein targets covering a panel of kinases, HLA variants, and all human NRs with known 3D structures and/or amenable to homology modeling

Acknowledgments

• Alexandre Borrel

• Kamel Mansouri

• Denis Fourches (NCSU)

• ILS/NICEATM

• ICCVAM partners

• Richard Judson (NCCT)

• Modeling consortium participants

Extra Slides

Batch download of predictions

OPERA on the EPA Dashboard

New features to be implemented

QSAR-ready SMILES from the EPA CompTox Dashboard: https://comptox.epa.gov/dashboard/dsstoxdb/batch_search

1. Integrate the QSAR-ready workflow to process any chemical structure
2. Calculate predictions using ONLY a chemical ID:

• CASRN
• DTXSID
• InChIKey
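Accepting a bare chemical ID means first working out which kind of identifier was supplied. The sketch below shows one way this could be done; it is not how OPERA or the Dashboard implement it. The function names are hypothetical, the patterns check format only (plus the standard CASRN check digit), and no lookup against any database is performed.

```python
import re

CASRN = re.compile(r"^\d{2,7}-\d{2}-\d$")
DTXSID = re.compile(r"^DTXSID\d+$")
INCHIKEY = re.compile(r"^[A-Z]{14}-[A-Z]{10}-[A-Z]$")

def casrn_checksum_ok(casrn):
    """CAS check digit: digits weighted 1, 2, 3... from the right, summed, modulo 10."""
    digits = casrn.replace("-", "")
    body, check = digits[:-1], int(digits[-1])
    total = sum(int(d) * w for w, d in enumerate(reversed(body), start=1))
    return total % 10 == check

def identifier_type(value):
    """Classify a string as 'CASRN', 'DTXSID', 'InChIKey', or 'unknown' by its format."""
    value = value.strip()
    if CASRN.match(value) and casrn_checksum_ok(value):
        return "CASRN"
    if DTXSID.match(value):
        return "DTXSID"
    if INCHIKEY.match(value):
        return "InChIKey"
    return "unknown"

examples = ["80-05-7", "DTXSID7020182", "IISBACLAFKSPIT-UHFFFAOYSA-N", "80-05-8"]
kinds = [identifier_type(e) for e in examples]
```

Rejecting a CASRN with a bad check digit early avoids sending mistyped identifiers to a structure lookup.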

Chemical Prioritization

• Most models predict most chemicals as inactive
• 757 chemicals have >75% active concordance
• Only a small fraction of chemicals are prioritized for further testing

Figure labels: Actives, Inactives, Prioritization

Mansouri et al. (2016) EHP 124:1023–1033, DOI:10.1289/ehp.1510267
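The concordance-based prioritization described on this slide can be illustrated with a short sketch. Here concordance is taken to be the fraction of models that call a chemical active, and chemicals above 75% are prioritized. The function names, the handling of missing calls, and the example data are assumptions for illustration, not the published procedure.

```python
def active_concordance(calls):
    """Fraction of models calling a chemical active (1), ignoring models with no call (None)."""
    made = [c for c in calls if c is not None]
    return sum(made) / len(made) if made else 0.0

def prioritize(predictions, threshold=0.75):
    """Return chemicals whose active concordance exceeds the threshold, highest first."""
    scored = {chem: active_concordance(calls) for chem, calls in predictions.items()}
    selected = [chem for chem, score in scored.items() if score > threshold]
    return sorted(selected, key=lambda chem: -scored[chem])

# Hypothetical calls from five models for four chemicals (1 = active, 0 = inactive).
predictions = {
    "chem_A": [1, 1, 1, 1, 1],
    "chem_B": [1, 1, 1, 1, 0],
    "chem_C": [0, 0, 1, 0, 0],
    "chem_D": [1, 1, 1, None, 0],
}
priority_list = prioritize(predictions)
```

Because most models predict most chemicals as inactive, a high threshold leaves only a small set for follow-up testing.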

Informing Regulatory Decisions