ebi is an outstation of the european molecular biology laboratory. overview of chembl database...

47
EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University 16 th October 2012

Upload: pearl-shields

Post on 18-Dec-2015

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

EBI is an Outstation of the European Molecular Biology Laboratory.

Overview of ChEMBL Database

Gareth Owen, ChEBI group, EMBL-EBI

Northwestern University

16th October 2012

Page 2: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

What is ChEMBL?

• Open access database for drug discovery• Freely available (searchable and downloadable) • Content:

• 2D structures & calculated properties (logP, MW, Lipinski, etc.) • Associated bioactivity data extracted from the primary medicinal

chemistry journals such as J. Med. Chem.• Deposited data from neglected disease screening (e.g. malaria)• Subset of data from PubChem

• Covers ~30 years of compound synthesis and testing • Annotated FDA-approved drugs• Secure searching (https://www.ebi.ac.uk/chembldb )

2

Page 3: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

ChEMBL Database• Content

3

Assays are classified as:• Binding measurements• Functional assays• ADME/toxicity data

60% proteins

20% organisms

20% cell lines

ChEMBL14Targets: 9,003Compounds: 1,376,469Activities: 10,129,256*Publications: 46,133

* Includes:~5,900,000 (PubChem)~100,000 (Deposited malaria screening sets)

3

Page 4: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

ChEMBL Assays – Binding, Functional, ADMET

Binding Assays• Assays which directly measure the binding of a

compound to a particular target• E.g., competition binding assays with a radioligand

• Various endpoints measured, but most commonly reported are:• IC50 (half maximal inhibitory concentration)• Ki (binding affinity)• MIC (minimum inhibitory concentration)• % Inhibition (of activity)

4

Page 5: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Functional Assays

Whole organism assays

(e.g., anti-infectives/parasitics)

Disease-derived cell-line

(e.g., human ovarian cancer cell line cytotoxicity)

Tissue or cell-based disease model

(e.g., glucose uptake by adipocytes)

Tissue or cell-based assay for target effect

(e.g., contraction of guinea-pig ileum)

Cell-based assay over-expressing target

(e.g., GPCR calcium mobilisation)

Target association

Disease association

5

Page 6: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

ADMET Assays

• Assays measuring: Absorption, Distribution, Metabolism, Excretion, Toxicity properties of compounds

• Examples include:• Half-life of compound in rats• Tissue distribution of compound• Levels of metabolites

6

Page 7: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

ChEMBL Targets:

Protein Protein complex Protein family Nucleic Acid

e.g., Nicotinic acetylcholine receptor e.g., Muscarinic receptors

Cell Line Tissue Sub-cellular Fraction Organism

e.g., DNA

e.g., Mitochondriae.g., Nervouse.g., HEK293 cells e.g., Drosophila

e.g., PDE5

7

Page 8: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Protein Targets

• Each protein target linked to a sequence in UniProt• Information from UniProt used in ChEMBL to allow

searching:• Protein name/description• Synonyms and gene names• Organism (and NCBI Tax ID)

• Proteins in ChEMBL also classified according to family (e.g., Receptor, Kinase, Protease, Transporter etc).• Used for searching by target tree (Browse Targets)

8

Page 9: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

ChEMBL Compounds

• Chemical structures are stored as .mol files• If the stereochemistry is known it is drawn as a specific

enantiomer

• Identifying unique compounds is done using standard Inchis• Salts and parent molecules are grouped together for

displaying bioactivity data although activity data is recorded against the specific salt

9

• Tautomers of the same compound are treated as the same compound. The form shown is as in the paper

Page 10: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

ChEMBL Home Page

10 https://www.ebi.ac.uk/chembldb

Page 11: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

ChEMBL Main Search Page

11

Page 12: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

12Small molecule resources at the EBI

Clickable structure

Drug Information

Parent and Salt Forms

Page 13: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

13

Page 14: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

14

Click to display data

Page 15: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

15

Page 16: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

16

Page 17: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

ChEBI Link:

18

Page 18: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

This will take you back to ChEMBL

19

Page 19: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

ChemSpider Links:

20

The link works both ways. They link TO ChemSpider and FROM ChemSpider.

They link on Standard_Inchi

Page 20: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Wikipedia Links:

21

We also have links with Wikipedia. These also use the Standard_Inchi as the common identifier. These links will link to the Compound Report Card in ChEMBL.

The links are added by a ChemoBot and can be updated with each release, if required.

Page 21: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Use Case 1 - Searching by Target

• What is known about chemical structures that bind to a specific protein (Adenosine A2a)?

• What is known about their potency/selectivity/ADMET Properties

• Is there any protein structure data?

22

Page 22: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Use Case 1 Searching by Target in ChEMBL

Choose Sources to include in search

23

Page 23: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Retrieving Bioactivity Data - Single Target

Bioactivity data for target Display all

bioactivity data for target

Click pie chart to retrieve particular end-points

3D Structures

Assay data for target

24

Page 24: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Filtering Bioactivities

Select required activity types and define cut-offs e.g Ki<100nM

Select targets of interest

25

Page 25: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Bioactivity Results

Compound structures

Target detailsActivity values Assay details References

26

Page 26: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Selectivity Data

27

For example:Can search ChEMBL for all data on compounds that have adenosine A2a Ki values <100nM

Page 27: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

ADMET Data

Summary of ChEMBL bioavailability data for compounds with A2a Ki values <100nM

Example of Bioavailability data

28

Page 28: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Use Case 2 – Searching by Structure

• What compounds contain a particular substructure?

• What is known about their bioactivities?

• Known drugs/clinical Trials

29

Page 29: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

30

Different sketchers

name

Lists of Identifiers

Types of synonyms:• Research codes• Trade names• INN, USAN

Page 30: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Similarity and Substructure Searching

31 Display/Download Bioactivity Data

Page 31: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Filtering Data on Lipinski Properties etc

Display Bioactivities of subset32

Page 32: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

33

names

Structure

Bioactivities

Page 33: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

34

BioactivitiesClinical Trials

Properties

Cross-references

Page 34: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Links to Other Resources

35

Page 35: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Links to Other Resources

36

PDBe - http://www.ebi.ac.uk/pdbe

Page 36: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Marketed Drugs

Select set of interestExport to Excel orExport SDF37

Page 37: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Use Case 3 – Similar Targets

• Are there any available data on compounds that bind to proteins similar to IRAK2?

• For these compounds what bioactivity data is there on compounds with related sub-structures?

• Is there any crystal structure data on these proteins?

38

Page 38: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Protein Sequence Search

• More precise method for identifying targets

• Input is a protein sequence of interest

• Uses BLAST* algorithm to perform pair-wise comparisons between input sequence and all proteins in the Target Dictionary, to find most closely related matches

• Results are scored according to similarity to input sequence (determined by number of amino acids that are identical or have similar properties)

*Altschul SF et al., J Mol Biol. 215(3), p403-10 (1990)39

Page 39: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Data on IRAK1,IRAK3 and IRAK4 but not IRAK2

Use Case 3 – Similar Targets

Protein Sequence of Interest e.g from UniProthttp://www.uniprot.org

40

Page 40: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

IRAK1, IRAK3 and IRAK4 data

Identify sub-structure of interestWhat other data available on compounds with this sub-structure?

41

Page 41: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Use Case 4 - Assay keyword search

• Some ChEMBL data (e.g., functional assays) may not be mapped against molecular targets

• May want to perform a more general search (e.g., for a disease process, animal model, cell type of interest)

• Examples:1. What compounds have been tested in disease models (cholesterol

lowering)?

2. What data is available for brain penetration (brain to plasma ratio)?

42

Page 42: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Assay Search for “Cholesterol Lowering”

43

Page 43: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Assay Search for “Brain to Plasma”

44

Page 44: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Accessing ChEMBL Data

45

Page 45: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

Useful Links

46

ChEMBL Blog:http://chembl.blogspot.com

If you would like help:[email protected]

For ChEMBL news and data releases subscribe to:http://listserver.ebi.ac.uk/mailman/listinfo/chembl-announce

Page 46: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

AcknowledgementsChEMBL Group

John Overington

Anne Hersey

Anna Gaulton

Mark Davies

Jon Chambers

Louisa Bellis

Kazuyoshi Ikeda

Patricia Bento

Shaun McGlinchey

Yvonne Light

Felix Krueger

Ben Stauch

Ruth Akhtar

Francis Atkinson

Rita Santos

EMBL-EBI

Samuel Kerrien, Sandra Orchard, Bruno Aranda, Rafael Jimenez, Reactome, UniProt and ChEBI teams

Collaborators

Imperial Cancer Research, University of Dundee, University of Cambridge, Sanger Centre, University of Maryland, NCBI, TDR, IUPHAR, Bayer-Schering, Pfizer, GSK, Schering-Plough, MMV, Novartis, St Jude Children’s Research Hospital

Former Inpharmatica colleagues

47

Page 47: EBI is an Outstation of the European Molecular Biology Laboratory. Overview of ChEMBL Database Gareth Owen, ChEBI group, EMBL-EBI Northwestern University

48

Exercises!