screening automatization @dsfmms.dsfarm.unipd.it/files/lezioni/psf/pdf/lbdd_1_2018_pdf.pdf · m m s...
TRANSCRIPT
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
ScreeningScreening automatizationautomatization @DSF@DSF
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
whenwhen potencypotency isis aa goodgood potency?potency?
David C. Rees, Miles Congreve,, Christopher W. Murray & Robin Carr Nature Reviews Drug Discovery 3, 660-672, 2004
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
WhatWhat aboutabout RiluzoleRiluzole??
MW = 234.2
logP = 2.3
IC50 = 16.1 M
Stephanie FedericoUNITS
Maicol BissaroMMS@UNIPD
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
xx.000 to 1?xx.000 to 1?Is this ratio really acceptable Is this ratio really acceptable
for a for a pharmapharma company?company?
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
WeWe surelysurely needneed informaticsinformatics butbut::WeWe surelysurely needneed informaticsinformatics butbut::
RandomRandom ScreeningScreening
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
MarketingPhase IV
ClinicalTrials
Lead to
Candidate
Hit to Lead
Screen to Hit
Target to
Validated Target
Gene to Function
VirtualScreening (VS)
HTS / in vitroScreening
Focused Synthesis/in vitro Screening
Lead Lead toto
CandidateCandidate
Hit Hit to Leadto Lead
Screen Screen to Hitto Hit
Virtual LeadOptimization
FlowchartsFlowcharts inin DrugDrug DiscoveryDiscovery::FlowchartsFlowcharts inin DrugDrug DiscoveryDiscovery::
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
VirtualScreening (VS)
HTS / in vitroScreening
Focused Synthesis/in vitro Screening
Lead Lead toto
CandidateCandidate
Hit Hit to Leadto Lead
Screen Screen to Hitto Hit
Virtual LeadOptimization
TargetStructure Determination
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
VirtualScreening (VS)
Lead Lead toto
CandidateCandidate
Hit Hit to Leadto Lead
Screen Screen to Hitto Hit
TargetStructure Determination
Structure-basedVirtual Screening (SBVS)
HT Molecular Docking
Ligand-basedVirtual Screening (LBVS)
Similarity searchPharmacophore matching
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Lead Lead toto
CandidateCandidate
Hit Hit to Leadto Lead
Screen Screen to Hitto Hit
Virtual LeadOptimization
TargetStructure Determination
Ligand-based Optimization
PharmacophoreQSAR
Structure-based Optimization
Molecular DockingMolecular Dynamics
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Menu (touristic) of our PSF course:Menu (touristic) of our PSF course:Menu (touristic) of our PSF course:Menu (touristic) of our PSF course:
••LigandLigand--based “Drug” Design (LBDD)based “Drug” Design (LBDD)
••StructureStructure--based “Drug” Design (SBDD) based “Drug” Design (SBDD)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
VirtualScreening (VS)
HTS / in vitroScreening
Lead Lead toto
CandidateCandidate
Hit Hit to Leadto Lead
Screen Screen to Hitto Hit
WeWe dodo thisthis firstfirst fundamentalfundamentalconsiderationconsideration togethertogether::WeWe dodo thisthis firstfirst fundamentalfundamentalconsiderationconsideration togethertogether::
REALREAL
VIRTUALVIRTUAL
•• hitshits– active in assay– defined and confirmed
structures– drug-like potential
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
TheThe firstfirst realreal helphelp thatthat informaticsinformaticsgivesgives toto chemistrychemistry andand ininparticularparticular medicinalmedicinal chemistrychemistry isistoto ''virtualizevirtualize'' molecularmolecular structuresstructures..
TheThe firstfirst realreal helphelp thatthat informaticsinformaticsgivesgives toto chemistrychemistry andand ininparticularparticular medicinalmedicinal chemistrychemistry isistoto ''virtualizevirtualize'' molecularmolecular structuresstructures..
butbut……
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
WeWe dodo NEVERNEVER forgetforget::WeWe dodo NEVERNEVER forgetforget::
O
O
O
OH
This is one of itspossible chemical
representations
This is achemical
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
…… followfollow meme inin thisthis logiclogic comparisoncomparison::…… followfollow meme inin thisthis logiclogic comparisoncomparison::
Chemical Structure (CS)Chemical Structure (CS)Chemical Structure (CS)Chemical Structure (CS)
Chemical Compound (CCChemical Compound (CC))Chemical Compound (CCChemical Compound (CC))
Real worldReal world
Chemical Properties (CP)Chemical Properties (CP)Chemical Properties (CP)Chemical Properties (CP)
Numerical Numerical representations of CSrepresentations of CS
Numerical Numerical representations of CSrepresentations of CS
Virtual worldVirtual world
Molecular Descriptors (MD)Molecular Descriptors (MD)Molecular Descriptors (MD)Molecular Descriptors (MD)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
…again the salicylic acid?
C9H8O4O
O
O
OH
WithWith howhow manymany chemicalchemicalrepresentationsrepresentations wewe cancan deal?deal?WithWith howhow manymany chemicalchemicalrepresentationsrepresentations wewe cancan deal?deal?
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
TheThe crucialcrucial informaticsinformatics differencesdifferences::TheThe crucialcrucial informaticsinformatics differencesdifferences::
C9H8O4
… this is a simple string (sequence) ofalphanumeric characters and it is veryeasy to manage… informatically!!!
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
JustJust aa simplesimple exampleexample:: areare thesethese twotworepresentationsrepresentations identical?identical?JustJust aa simplesimple exampleexample:: areare thesethese twotworepresentationsrepresentations identical?identical?
C9H8O4
C9H8O4
Time of answer (sec): Time of answer (sec):
O
O
O
OH
O
O
O
OH
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
O
O
O
OH
BeBe carefulcareful toto thethe chemicalchemical slangslang……BeBe carefulcareful toto thethe chemicalchemical slangslang……
O
O
O
OH
HH
H
HO
O
O
OH
HH
H
H
HH
HO
O
O
O
HH
H
H
HH
HH
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
O
O
O
OH
Remember,Remember, allall ofof thesethese areare notnotidenticalidentical…… informaticallyinformatically specking!specking!Remember,Remember, allall ofof thesethese areare notnotidenticalidentical…… informaticallyinformatically specking!specking!
O
O
O
O O
O
CH3
O
OH
O
O
CH3
O
OH
HH
H
H ……
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
andand thisthis isis amazingamazing::andand thisthis isis amazingamazing::
… draw is geometry…… geometry is numbers in space!!!
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
HereHere isis thethe informaticsinformatics anatomyanatomy ofofthisthis representationrepresentation::HereHere isis thethe informaticsinformatics anatomyanatomy ofofthisthis representationrepresentation::
O
O
O
OHy
x0
... the introduction of Connection Table:
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
ISIS
13 131.0986 -13.6500 0.0000 C1.0974 -14.4773 0.0000 C 1.8122 -14.8902 0.0000 C2.5287 -14.4769 0.0000 C2.5258 -13.6463 0.0000 C1.8104 -13.2372 0.0000 C 1.8080 -12.4122 0.0000 O1.0923 -12.0018 0.0000 C1.0898 -11.1768 0.0000 O0.3790 -12.4165 0.0000 C3.2387 -13.2311 0.0000 C3.9547 -13.6409 0.0000 O3.2356 -12.4061 0.0000 O
1 2 26 7 1 3 4 27 8 18 9 24 5 18 10 12 3 15 11 1 5 6 2
11 12 26 1 1
11 13 1M END
O
O
O
O
Atom typeNumber of atoms
Number of bonds
Coordinates:
x y z
The first bond is betweenatom 1 and 2 and its bondorder is 2 (double bond). 6
54
3
2
1
78
9
1011
12
13
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
justjust forforinformaticsinformaticsaddicted!addicted!
justjust forforinformaticsinformaticsaddicted!addicted!
Transformations:
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Now,Now, thisthis isis perfectperfect understandable!understandable!Now,Now, thisthis isis perfectperfect understandable!understandable!
y
x0
z
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
MOE2007 3D
21 21 1.1522 0.5113 -0.0537 C0.2183 1.2057 -0.8346 C0.5665 2.3445 -1.5679 O
-1.0985 0.7524 -0.9667 C-1.8050 1.2920 -1.5911 H-1.5038 -0.3936 -0.2868 C-2.5276 -0.7473 -0.3775 H-0.5938 -1.0870 0.5099 C-0.9101 -1.9816 1.0411 H0.7269 -0.6400 0.6244 C1.4145 -1.2086 1.2469 H0.5056 3.5095 -0.8113 C0.9559 4.6611 -1.6596 C0.9409 5.5777 -1.0629 H0.2773 4.7842 -2.5073 H 1.9783 4.4900 -2.0055 H0.1372 3.6103 0.3517 O 2.5419 0.9700 0.0441 C3.0443 1.9108 -0.5504 O3.2755 0.2254 0.8972 O4.1677 0.6310 0.8800 H
1 10 2 1 2 1 1 18 1...20 21 1
M END
……clear?clear?……clear?clear?
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
TheThe crucialcrucial questionquestion:: whowho givesgives ususthethe goodgood coordinates?coordinates?TheThe crucialcrucial questionquestion:: whowho givesgives ususthethe goodgood coordinates?coordinates?
y
x0
z
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
List of Commonly Used Abbreviationsof coordinates archives (files):List of Commonly Used Abbreviationsof coordinates archives (files):
xxx.xxx.mol mol :: MDL (Molecular Design Limited, Inc) Molfile
xxx.xxx.mol2 mol2 : : SYBYL SYBYL MolfileMolfile
xxx.xxx.sdfsdf : : structure-data file
xxx.xxx.pdbpdb :: proteinprotein--datadata bankbank filefile
xxx.xxx.chmchm : : ChemDraw file
…
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
OpenBabel: a precious tool to convertone format to another… and much more!!OpenBabel: a precious tool to convertone format to another… and much more!!
http://openbabel.org/wiki/Main_Page
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
OpenBabel: a precious tool to convertone format to another… and much more!!OpenBabel: a precious tool to convertone format to another… and much more!!
http://openbabel.org/wiki/Main_Page
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Object:Object: string (1D)string (1D)
xC9H8O4
Object:Object: 2D graph2D graphyO
OHO
O
Object:Object: 3D graph3D graph
z
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
AreAre wewe fullyfully satisfied?satisfied?AreAre wewe fullyfully satisfied?satisfied?
O
O
O
OH
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
WhatWhat wewe areare reallyreally missingmissing……WhatWhat wewe areare reallyreally missingmissing……
x
y
z
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
…… thethe illusionillusion ofof time!time!…… thethe illusionillusion ofof time!time!
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
CanCan youyou understandunderstand whatwhat isis this?this?CanCan youyou understandunderstand whatwhat isis this?this?
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
…… butbut thisthis illusionillusion isis incrediblyincrediblyexciting!!!exciting!!!…… butbut thisthis illusionillusion isis incrediblyincrediblyexciting!!!exciting!!!
G. Deganutti A. Cuzzolin
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
…… butbut thisthis illusionillusion isis incrediblyincrediblyexciting!!!exciting!!!…… butbut thisthis illusionillusion isis incrediblyincrediblyexciting!!!exciting!!!
V. Salmaso M. Sturlese
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
AnotherAnother importantimportant informaticsinformaticsdifference!difference!AnotherAnother importantimportant informaticsinformaticsdifference!difference!
C9H8O4O
O
O
OH
1.299 byte 2.051 byte6 byte
1 byte = 8 bit (1 bit = 0 o 1, true o false)
73.728 byte
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
DoDo youyou remember?remember?DoDo youyou remember?remember?
C9H8O4
C9H8O4
Time of answer (sec): Time of answer (sec):
O
O
O
OH
O
O
O
OH
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
String_of_characters
O
O
O
OH
CombiningCombining businessbusiness withwith pleasurepleasure ??CombiningCombining businessbusiness withwith pleasurepleasure ??
… possibly, using only a keyboard?… possibly, using only a keyboard?
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
String_of_charactersO
O
O
OH
CombiningCombining businessbusiness withwith pleasurepleasure ??CombiningCombining businessbusiness withwith pleasurepleasure ??
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Using simple rules it is possible to representthe “connections” between “molecularfragments” (as in the structural formula) in asimple “string” of “alphanumericcharacters” (as in the bruta formula).
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
Here is some examples:
The original SMILES specification was initiated by DavidWeininger at the USEPA Mid-Continent Ecology DivisionLaboratory in Duluth in the 1980s.
Anderson E, Veith GD, Weininger D (1987). SMILES: A line notation and computerized interpreter forchemical structures. Duluth, MN: U.S. EPA, Environmental Research Laboratory-Duluth. Report No.EPA/600/M-87/021.Weininger D "SMILES, a chemical language and information system. 1. Introduction to methodologyand encoding rules". Journal of Chemical Information and Modeling. 28 (1): 31–6.
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
C methane
CC ethane
CCC propane
CC(C)C 2-methil-propane
C1CCCCC1 cycloesane
HH
H
H
H
H
HH
H
HH
H
HH
HH
HH
H
H
H
HH
HH
HHH
Some SMILES rules:
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
11
11
C1CCCCC1 cycloesane
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
C=C ethene (ethylene)
C=CC propene
C/C=C/C trans (E)-2-butane
C/C=C\C cis (Z)-2-butane
C#C ethyne (acetylene)
H
HH
H
CH3
HH
H
CH3
HCH3
H
CH3
HH
CH3
HH
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
c1ccccc1 benzene
Cc1ccccc1 toluene
Cc1ccc(N)cc1 4-methil-anilineNc1ccc(C)cc1
c12c(cccc1)cccc2 naphthalene
CH3
CH3
NH2
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Examples for naphtalene:
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
CO
CC=O
CC(=O)C
CC(=O)O
OHH
H
H
OH
OH
H
H
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
OH
H
H
H
HH
H
OH
H
H
methanol
ethanal
acetone
acetic acid
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
In SMILES, tetrahedral centers may be indicated by asimplified chrial specification (@ or @@) written as anatomic property following the atomic symbol of the chiralatom.
Looking at the chiral center from the direction of the "from" atom (as peratom order in SMILES), @ means "the other three atoms are listed anti-clockwise; @@ means clockwise.
C[C@@H](N)C(=O)ONH2 COOH
NH2 COOH C[C@H](N)C(=O)O
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
Some examples:
Melatonin (C13H16N2O2) CC(=O)NCCc1c[nH]c2ccc(OC)cc12
Nicotine (C10H14N2) CN1CCC[C@H]1c2cccnc2
Glucose (glucopyranose) (C6H12O6)
OC[C@@H](O1)[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O)1
Thiamine (C12H17N4OS+) (vitamin B1) OCCc1c(C)[n+](cs1)Cc2cnc(C)nc2N
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
Some examples:
Cephalostatin-1, a steroidic trisdecacyclic pyrazine with the empirical formula C54H74N2O10
Starting with the left-most methyl group in the figure:
CC(C)(O1)C[C@@H](O)[C@@]1(O2)[C@@H](C)[C@@H]3CC=C4[C@]3(C2)C(=O)C[C@H]5[C@H]4CC[C@@H](C6)[C@]5(C)Cc(n7)c6nc(C[C@@]89(C))c7C[C@@H]8CC[C@@H]%10[C@@H]9C[C@@H](O)[C@@]%11(C)C%10=C[C@H](O%12)[C@]%11(O)[C@H](C)[C@]%12(O%13)[C@H](O)C[C@@]%13(C)CO
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
Generation of SMILES:Break cycles, then write as branches off a mainbackbone. (Ciprofloxacin)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
NON CANONICALNON CANONICALSMILESSMILES
CANONICALCANONICALSMILESSMILES
OCCC(O)CCCO
CCO
“CANONICAL” AIMS:“CANONICAL” AIMS:••One chemical one SMILES string;One chemical one SMILES string;••Same SMILES string coming from different programs;Same SMILES string coming from different programs;••Improving search provess in chemical databases.Improving search provess in chemical databases.
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
Morgan’s algorithmMorgan’s algorithmH. L. Morgan, J. Chem. Doc., 1965, 5, 107; LG, S.7
FROM CHEMICAL STRUCTURE… …TO MOLECULAR GRAPH
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
H. L. Morgan, J. Chem. Doc., 1965, 5, 107; LG, S.7
Replace the value of a given node with the sumof the values of the nodes connected
Morgan’s algorithmMorgan’s algorithm
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
H. L. Morgan, J. Chem. Doc., 1965, 5, 107; LG, S.7
Morgan’s algorithmMorgan’s algorithm
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
H. L. Morgan, J. Chem. Doc., 1965, 5, 107; LG, S.7
Said… and done!
Morgan’s algorithmMorgan’s algorithm
O(O=)Cc1ccccc1OC(=O)C
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
C9H8O4O
O
O
OH
SMILESSMILES (SSimplified MMolecular IInput LLine EEntry SSpecification)
reassuming:
Three faces of the same medal!!!Three faces of the same medal!!!
O(O=)Cc1ccccc1OC(=O)C
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
DoDo youyou needneed aa SMILES’sSMILES’s translator?translator?DoDo youyou needneed aa SMILES’sSMILES’s translator?translator?
http://mms.dsfarm.unipd.it/MMsINC/search/
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
ChemicalChemical archivesarchives……
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
TheThe secondsecond majormajor aidaid thatthatinformaticsinformatics givesgives toto thethe medicinalmedicinalchemistrychemistry isis thethe 'intelligent'intelligent storagestorage''ofof molecularmolecular structuresstructures..
TheThe secondsecond majormajor aidaid thatthatinformaticsinformatics givesgives toto thethe medicinalmedicinalchemistrychemistry isis thethe 'intelligent'intelligent storagestorage''ofof molecularmolecular structuresstructures..
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Why:
archiviationarchiviation…… thethe firstfirst crucialcrucialvirtualizationvirtualization processprocess..
to ensure the uniqueness of representation inthe virtual world of an object in the real world!
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
andand itit isis clear,clear, too!too!
p-Phenylenediamine
PM 108.14
CAS 106-50-3
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Let'sLet's startstart buildingbuilding ourour molecularmoleculardatabase?database?Let'sLet's startstart buildingbuilding ourour molecularmoleculardatabase?database?
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
O
O
O
OH
QueryQuery
SearchingSearching inin aa databasedatabase isis nownow veryveryeasy!easy!SearchingSearching inin aa databasedatabase isis nownow veryveryeasy!easy!
O(O=)Cc1ccccc1OC(=O)C
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
AA bitbit ofof…… sequencesequence alignmentalignment..AA bitbit ofof…… sequencesequence alignmentalignment..
Sequence A: C3H6OSequence B: C6H6O
11.. IdentityIdentity searchsearch::
aa.. IfIf thethe numbernumber ofof characterscharacters thethe same?same?
GoGo on!on!notnotidenticalidentical
NONO YESYES
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
AA bitbit ofof…… sequencesequence alignmentalignment..AA bitbit ofof…… sequencesequence alignmentalignment..
bb.. IfIf thethe numbernumber ofof characterscharacters isis thethe samesame::
C 3 H6 OC3H6O
The main diagonal represents inthe correspondence matrix theplace where we find the highest'similarity' between the twosequences.
C3H6O|||||C3H6O
100%identity!!!
CORRISPONDENCECORRISPONDENCEMATRIXMATRIX
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
…… importantimportant molecularmolecular Haystack!Haystack!…… importantimportant molecularmolecular Haystack!Haystack!
Date 21/11/2018 10:55:40 ESTCount 144,815,888 Commercially available organic and
inorganic chemicals
Chemical Abstract Service(www.cas.org)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
httphttp:://pubchem//pubchem..ncbincbi..nlmnlm..nihnih..gov/gov/
httphttp:://www//www..emoleculesemolecules..com/com/
httphttp:://www//www..chemspiderchemspider..com/com/
httphttp:://mms//mms..dsfarmdsfarm..unipdunipd..it/MMsINCit/MMsINC..htmlhtml
…… otherother importantimportant molecularmolecular Haystacks!Haystacks!…… otherother importantimportant molecularmolecular Haystacks!Haystacks!
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
AA veryvery commoncommon situationsituation:: wewe havehave onlyonly oneonecompoundcompound thatthat wewe knowknow toto bebe ''activeactive'' againstagainstourour therapeutictherapeutic targettarget::
Could you suggest a strategy to find a Could you suggest a strategy to find a novel possible candidate?novel possible candidate?
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
SimilaritySimilarity:: structurestructure versusversus propertiesproperties……
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
DoDo togethertogether thisthis simplesimple exerciseexercise:: cancanyouyou describedescribe whatwhat youyou areare lookinglooking ??
SimilaritySimilarity lawlaw:: similarsimilar itemsitems tendtend toto bebegroupedgrouped togethertogether..
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
and,and, sometimes,sometimes, thisthis isis truetrue alsoalso inin medicinalmedicinalchemistrychemistry::
NH
ON
COOH
COOH NH
ON N
H
ON
NH
ON N
H
ONH
NH
ONH2
N+
C
CH2O NHNH
ON
Lidocaine
Lidofenin Bupivacaine Ropivacaine
Aptocaine Tocainide Prilocaine
+ +
NH
ON
NH
ON
COOMe
NH
ON
Pyrrocaine Butanilicaine Quatacaine
Mepivacaine Articaine Clodacaine
Tolycaine Trimecaine Etidocaine
NH
ON N
H
ON
Tolycaine Trimecaine Etidocaine
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
NH
ON
COOH
COOH NH
ON N
H
ON
NH
ON N
H
ONH
NH
ONH2
N+
C
CH2O NHNH
ON
Lidocaine
Lidofenin Bupivacaine Ropivacaine
Aptocaine Tocainide Prilocaine
+ +
NH
ON
NH
ON
COOMe
NH
ON
Pyrrocaine Butanilicaine Quatacaine
Mepivacaine Articaine Clodacaine
Tolycaine Trimecaine Etidocaine
NH
ON N
H
ON
Tolycaine Trimecaine Etidocaine
IfIf wewe wantwant toto virtualizevirtualize chemicalchemical similaritysimilarity searching,searching,wewe havehave toto describedescribe exactlyexactly whatwhat similaritysimilarity is!is!
AreAre youyou ableable toto scalescale chemicalchemical similaritysimilarity ofof thesetheseanalogsanalogs respectrespect lidocaine?lidocaine?
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
TellTell meme whatwhat youyou areare thinkingthinking::
NH
ON
Lidocaine
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
ExactlyExactly ““molecularmolecular fragmentationfragmentation””couldcould bebe aa possiblepossible solution!solution!ExactlyExactly ““molecularmolecular fragmentationfragmentation””couldcould bebe aa possiblepossible solution!solution!
O
O
O
OHO
O
O
OHO
O
O
OH
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
WeWe needneed somesome rulesrules::WeWe needneed somesome rulesrules::
S
CF3
N
N
OH
S
CF3
N
N
OHRingsRings
11::
S
CF3
N
N
OHLinkersLinkers
22::
S
CF3
N
N
OH
33::
SideSide chainschainsG.W. Bemis, M. A. Murcko, J. Med. Chem. 1996, 39, 2887 –2893.
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
TheThe sequencesequence ofof fragmentsfragments isis likelike aamolecularmolecular barbar codecode……
OO
O
OH
HowHow wewe cancan buildbuild itit upup……HowHow wewe cancan buildbuild itit upup……
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
AnyAny standard?standard?AnyAny standard?standard?
MACCSMACCS (Molecular(Molecular ACCessACCess System)System) aacollectioncollection ofof 166166 functionalfunctional groupsgroupsrepresentingrepresenting thethe mostmost accessibleaccessiblemedchemmedchem chemicalchemical spacespace..
MACCS keys (1979), MDL Information Systems Inc. (originally named Molecular Design Limited, Inc.),San Leandro, CA.
There is no publication describing the MACCS 166 public keys. All of the citations for it either say avariation of "MDL did it"
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
A useful concept: A useful concept: structural keysstructural keysA useful concept: A useful concept: structural keysstructural keys
1 1 0 0 1 0 0 0
1. Define all possible chemical fragments (structural keys):
2. Assigne un bit for the each corrispondence in our molecule:
OH
OAr
OHO
ORNH2 O S
O
O
O
OH
3. Reiterate this procedure for each molecule of the database :
1 1 0 0 1 0 0 0
1 1 0 1 0 0 0 0
O
O
O
OH
OH
O
OH
OH
OAr
OH O
ORNH2 O S ……
OH
OAr
OHO
ORNH2 O S
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
whatwhat aboutabout thisthis parallelism?!parallelism?!whatwhat aboutabout thisthis parallelism?!parallelism?!
0000000100000001000000100010000000000010000010000100001000001000O
O
O
OH
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
…… so,so, nownow itit isis veryvery easyeasy toto transformtransform mymymolecularmolecular databasedatabase inin aa collectioncollection ofof““structuralstructural keyskeys”!”!
…… so,so, nownow itit isis veryvery easyeasy toto transformtransform mymymolecularmolecular databasedatabase inin aa collectioncollection ofof““structuralstructural keyskeys”!”!
000000010000110100000010101000000000011000001000010000100000100001000101100100100101100110100111001111010000001100000001100010000100010100011101010000110000101000010011000010100000000100100000000110111001110111111010000010001000011011011000000010011010000001000101001101000100000000100000000100100000001001000010001010000100011100011101000100001011101100110110010010001101001100001000010111010011010101011111100001000001111110001000010000100010100001000101001111010100001000100000000100100000101001000010001010000001000100010100010100100000000000001010000010000100000100000000010001010001001100000000000000000001010000001000000000000000000001000101000101000000000000001010000100100000000001000000000000000101010101111100111110100000000000011010100011100100001100101000010001010001100001000001100000000001000100000011000000000110000000000001000000000100001000000000000010101000000001000001001000000100010100010100000000100000000000010000000000000100001000011000000100010000110001001010000001010010101110001000010000100010100001000111000101000100001000010011100100100000100011000000001010000101010100010100010100100000000000010010000010010100100100010000
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
…… well,well, butbut whywhy thisthis transformationtransformation couldcouldhelphelp usus inin measuringmeasuring chemicalchemical similarity?similarity?…… well,well, butbut whywhy thisthis transformationtransformation couldcouldhelphelp usus inin measuringmeasuring chemicalchemical similarity?similarity?
000000010000110100000010101000000000011000001000010000100000100001000101100100100101100110100111001111010000001100000001100010000100010100011101010000110000101000010011000010100000000100100000000110111001110111111010000010001000011011011000000010011010000001000101001101000100000000100000000100100000001001000010001010000100011100011101000100001011101100110110010010001101001100001000010111010011010101011111100001000001111110001000010000100010100001000101001111010100001000100000000100100000101001000010001010000001000100010100010100100000000000001010000010000100000100000000010001010001001100000000000000000001010000001000000000000000000001000101000101000000000000001010000100100000000001000000000000000101010101111100111110100000000000011010100011100100001100101000010001010001100001000001100000000001000100000011000000000110000000000001000000000100001000000000000010101000000001000001001000000100010100010100000000100000000000010000000000000100001000011000000100010000110001001010000001010010101110001000010000100010100001000111000101000100001000010011100100100000100011000000001010000101010100010100010100100000000000010010000010010100100100010000
O
O
O
OHOH
O
OH
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
DoDo youyou rememberremember thethe EuleroEulero--VennVenn diagrams?diagrams?DoDo youyou rememberremember thethe EuleroEulero--VennVenn diagrams?diagrams?
…… andand thethe intersectionintersection setset AAB?B?…… andand thethe intersectionintersection setset AAB?B?
AA BB
AA BB
AABB
AABB
AABB
AABBAABB
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Now,Now, II cancan introduceintroduce youyou thethe JaccardJaccard index!index!Now,Now, II cancan introduceintroduce youyou thethe JaccardJaccard index!index!
BABABA
solosolo
Index Jaccard
==44
11 ++ 11 ++ 44== 00..6666
Originally coined coefficient de communauté by Paul Jaccard, JaccardJaccard,, PP.. ((19011901)) BulletinBulletin deldel lala SociétéSociété VaudoisedesVaudoisedes SciencesSciences NaturellesNaturelles 3737,, 241241--272272 ..
AABB
AABB
AABB
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Now,Now, II cancan introduceintroduce youyou thethe JaccardJaccard index!index!Now,Now, II cancan introduceintroduce youyou thethe JaccardJaccard index!index!
BABABA
solosolo
Index Jaccard
Originally coined coefficient de communauté by Paul Jaccard, JaccardJaccard,, PP.. ((19011901)) BulletinBulletin deldel lala SociétéSociété VaudoisedesVaudoisedes SciencesSciences NaturellesNaturelles 3737,, 241241--272272 ..
AA BB
AA BB
JIJI
JIJI
= 1= 1
= 0= 0
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
1 1 0 0 1 0 0 0
1 1 0 1 0 0 0 0
O
O
O
OH
OH
O
OH
AA
BBFragmentsFragments onlyonly inin AA == 11FragmentsFragments onlyonly inin BB == 11FFrraaggmmeennttss inin AABB == 22
TanimotoTanimoto similaritysimilarity indexindex ==22
11 ++ 11 ++ 22== 00..55
FromFrom JacardJacard…… toto TanimotoTanimoto index!index!FromFrom JacardJacard…… toto TanimotoTanimoto index!index!
OH
OAr
OHO
ORNH2 O S
TanimotoTanimoto, , T.T.T.T. (1957) IBM (1957) IBM InternalInternal Report 17th Report 17th NovNovIn that paper, a "similarity ratio" is given over bitmaps, where each bit of a fixed-size array represents the presence orabsence of a characteristic in the plant being modelled.
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
SimilaritySimilarity…… isis aa symmetricalsymmetrical property?property?SimilaritySimilarity…… isis aa symmetricalsymmetrical property?property?
Usually we think that if A is similar to B, thenB is similar to A. An example:
SimSim [[ ,, ]] == SimSim [[ ,, ]]
BABABA
solosolo
IndexTversky
but it not always the case in particular when thetwo ensembles are very different populated. Inthis case:
where and are user-defined parameters.
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Asymmetric (Tversky) SimilarityAsymmetric (Tversky) Similarity
BABABA
solosolo
IndexTversky
• if = = 1, equation reduces to Tanimoto coefficient;• if , T becomes asymmetric
– where = 1 and = 0, T = AB / Ai.e. the fraction of A which it is has in common with B• when T = 1.0, it indicates that A is a substructure of
B (at the level of fingerprint matching)• when T1.0 it indicates that A is almost a
substructure of B
TT ==11 TT 11
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
BeforeBefore askingasking thethe chemicalchemical meaningmeaning ofofTanimotoTanimoto similaritysimilarity index,index, let'slet's seesee aacleverclever useuse::
000000010000110100000010101000000000011000001000010000100000100001000101100100100101100110100111001111010000001100000001100010000100010100011101010000110000101000010011000010100000000100100000000110111001110111111010000010001000011011011000000010011010000001000101001101000100000000100000000100100000001001000010001010000100011100011101000100001011101100110110010010001101001100001000010111010011010101011111100001000001111110001000010000100010100001000101001111010100001000100000000100100000101001000010001010000001000100010100010100100000000000001010000010000100000100000000010001010001001100000000000000000001010000001000000000000000000001000101000101000000000000001010000100100000000001000000000000000101010101111100111110100000000000011010100011100100001100101000010001010001100001000001100000000001000100000011000000000110000000000001000000000100001000000000000010101000000001000001001000000100010100010100000000100000000000010000000000000100001000011000000100010000110001001010000001010010101110001000010000100010100001000111000101000100001000010011100100100000100011000000001010000101010100010100010100100000000000010010000010010100100100010000
O
O
O
OH
TanimotoTanimoto0.450.630.750.640.860.340.770.980.960.230.540.560.330.860.650.890.670.56
TanimotoTanimoto0.980.960.890.860.860.770.750.670.650.640.630.560.560.540.450.340.330.23
search all compounds that have a Tanimoto similarity index > 0.85!
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
AA bitbit ofof…… sortingsorting algorithmalgorithm..AA bitbit ofof…… sortingsorting algorithmalgorithm..
AA sortingsorting algorithmalgorithm isis anan algorithmalgorithm thatthat putsputselementselements ofof aa listlist inin aa certaincertain orderorder..
…… andand exampleexample::…… andand exampleexample::
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
HereHere isis thethe firstfirst importantimportant informaticsinformaticsapplicationapplication inin medicinalmedicinal chemistrychemistry::
SearchSearch allall chemicalchemicalrepresentationrepresentation closeclose totothethe targettarget oneone withwith aaTanimotoTanimoto similaritysimilarity indexindex>> 00..8585
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
…… asas forfor anyany tool,tool, therethere areare limitations!limitations!…… asas forfor anyany tool,tool, therethere areare limitations!limitations!
O
O
O
OHO
O
O
OH
O
O
O OH
1 1 0 0 1 0 0 0
1 1 0 0 1 0 0 0
1 1 0 0 1 0 0 0
OH
OAr
OHO
ORNH2 O S
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
…… butbut wewe cancan alsoalso runrun aa subsub--structurestructuresearch!search!…… butbut wewe cancan alsoalso runrun aa subsub--structurestructuresearch!search!
00000000000010000000000000000000000000000000000000000000000000000
DatabaseDatabase
000000010000110100000010101000000000011000001000010000100000100001000101100100100101100110100111001111010000001100000001100010000100010100011101010000110000101000010011000010100000000100100000000110111001110111111010000010001000011011011000000010011010000001000101001101000100000000100000000100100000001001000010001010000100011100011101000100001011101100110110010010001101001100001000010111010011010101011111100001000001111110001000010000100010100001000101001111010100001000100000000100100000101001000010001010000001000100010100010100100000000000001010000010000100000100000000010001010001001100000000000000000001010000001000000000000000000001000101000101000000000000001010000100100000000001000000000000000101010101111100111110100000000000011010100011100100001100101000010001010001100001000001100000000001000100000011000000000110000000000001000000000100001000000000000010101000000001000001001000000100010100010100000000100000000000010000000000000100001000011000000100010000110001001010000001010010101110001000010000100010100001000111000101000100001000010011100100100000100011000000001010000101010100010100010100100000000000010010000010010100100100010000
OH
OAr
OH ONH2 O S
QueryQuery
O
CH3 O
OO
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
R Cl
O
R ClS
OO R-NO2 R1N
NR2
R1
R2
P X
Restriction rulesRestriction rulesDatabase In
Database Out
…… oror wewe cancan easilyeasily REMOVEREMOVE compoundcompound withwithundesirableundesirable fragmentsfragments (chemical(chemical ororbiologicalbiological reactivity,reactivity, toxicitytoxicity etcetc..)!)!
…… oror wewe cancan easilyeasily REMOVEREMOVE compoundcompound withwithundesirableundesirable fragmentsfragments (chemical(chemical ororbiologicalbiological reactivity,reactivity, toxicitytoxicity etcetc..)!)!
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Baell J. and Walters M.A. Nature 513, 481-483 (2014)
Pan-Assay Interference compouNdS
AA veryvery nicenice example!example!AA veryvery nicenice example!example!
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
Pan-assay interference compounds (PAINS) encompasssome 400 structural classes, but more than half of PAINS ina typical library fall into just 16 easily recognizablecategories..Baell J. and Walters M.A. Nature 513, 481-483 (2014)
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
BackBack toto thethe horriblehorrible ratioratio xxxx..000000::11Discover a new drug is equal to find a…Discover a new drug is equal to find a…
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
AnalyzeAnalyze thethe nearestnearest chemicalchemical space?space?
SimilarSimilar chemicalchemical propertiesproperties similarsimilar chemicalchemicalstructuresstructures andand similarsimilar chemicalchemical properties?properties?
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
AnalyzeAnalyze thethe nearestnearest chemicalchemical space?space?
|x1 – x2| + |y1 – y2|
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
HereHere isis thethe secondsecond importantimportant informaticsinformaticsapplicationapplication inin medicinalmedicinal chemistrychemistry::
SearchSearch allall chemicalchemicalrepresentationrepresentation closeclose totothethe targettarget oneone withwith aaTanimotoTanimoto similaritysimilarity indexindex>> 00..8585 andand withwith aa PMPM ==360360 ±± 100100 andand aa logPlogP == 33..88±± 11
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
……andand don’tdon’t forgetforget::
1- similarity = diversity
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
AnAn alsoalso thethe RANDOMRANDOM SCREENINGSCREENING isis lesslessRANDOM,RANDOM, now!now!
RandomRandom ScreeningScreening
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
“Progettazione e Sviluppodi un Farmaco”
Stefano Moro
Chimica e Tecnologia Farmaceutiche
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
EvenEven ifif wewe MUSTMUST rememberremember thethe ““similaritysimilarityparadoxparadox””::
LL--DOPADOPA DD--DOPADOPA
OH
O
NH2
OH
OH
OH
O
NH2
OH
OH
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
MMsINC webMMsINC web--platform:platform:
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy
MM S S.MORO – PSF 2018/2019Confidential and Property of ©2005 Molecular Modeling Section
Dept. Pharmaceutical and Pharmacological Sciences – University of Padova - Italy