signature redactedraineslab.com/sites/default/files/labs/raines/pdfs/thesis_windsor.pdf · in...

HIV-1 Protease as a Target for Antiretroviral Therapy

by

Ian William Windsor

B.S. BiochemistryUniversity of Minnesota Twin Cities, 2011

M.S. BiochemistryUniversity of Wisconsin-Madison, 2017

Submitted to the Department of Chemistryin Partial Fulfillment of the Requirements for the Degree of

DOCTOR OF PHILOSOPHYIN CHEMISTRY

at the

Massachusetts Institute of Technology

February 2019

2019 Massachusetts Institute of Technology. All rights reserved.

Signature of Author:Signature redacted

Department of ChemistryDecember 17, 2018

-Signature redactedCertified by:

Ronald RainesFirmenich Professor of Chemistry

Thesis Supervisor

Accepted by:

MASSACHUSETS INSTOF TECHNOLOGY

MAR 212019

LIBRARIESARCHIVFS

Signature redactedRobert W. Field

ITUTE Haslam and Dewey Professor of ChemistryChairman, Departmental Committee on Graduate Students

1II

This doctoral thesis has been examined by a committee of professorsfrom the Department of Chemistry as follows:

Signature redactedNovartis Professor of

Alexander M. KlibanovChemistry and Biological Engineering

Thesis Committee Chair

-Signature redacted

Ronald T. RainesFirmenich Professor of Chemistry

Thesis Supervisor

Signature redacted

Elizabeth M. NolanAssociate Professor of Chemistry

Thesis Committee Member

2

HIV-1 Protease as a Target for Antiretroviral Therapy

By

Ian William Windsor

Submitted to the Department of Chemistry onJanuary 7, 2019 in Partial Fulfillment of the Requirements

for the Degree of Doctor of Philosophy in Chemistry

Abstract

Human immunodeficiency virus (HIV) is the causative agent of acquiredimmunodeficiency syndrome (AIDS). HIV employs three enzymes in its lifecycle, including aprotease that enables maturation of polyprotein precursors. Despite decades of progress studyingthe lifecycle of HIV and elaboration of therapeutics targeting nearly every aspect of the viral lifecycle, a cure remains elusive. Breakthroughs in HIV research have occurred alongsidefoundational advances of molecular biology, biotechnology, and medicinal chemistry, highlightingthe importance revisiting old questions with new approaches. The goal of this thesis is to advanceour biochemical knowledge of HIV-I protease and develop novel therapeutics targeting this keyviral enzyme.

In Chapter 1, I introduce HIV and the role that HIV-1 protease plays in life cycle andcurrent treatment strategies. In Chapter 2, I describe an assay that enables the determination ofsub-picomolar inhibition constants for competitive inhibitors of HIV-1 protease. This advance wasmade possible by a peptide substrate selected by phage display. I report in Chapter 3 the enhancedhydrogen bonding in the recognition of this peptide by HIV-1 protease as revealed by X-raycrystallography. The mechanism of aspartic proteases, including HIV-1 protease, has been thesubject of numerous enzymology studies spanning over half a century. In Chapter 4, I revealunappreciated non-covalent interactions within substrates of aspartic proteases that assist incatalysis. In addition to biochemical studies, this thesis includes chapters that account thedevelopment of novel antivirals. In Chapter 5, I describe the rational drug design of a boronic acidanalog of the clinical inhibitor darunavir with improved potency. A limitation of boronic acids ismetabolic instability; in Chapter 6, I reveal an intramolecular protecting group that can conferoxidative stability to boronic acids. Finally, in Chapter 7, I describe an engineering approach toinactivate human RNase 1. The inactivation relies on installing a substrate for HIV- I protease, the

cleavage of which unmasks cytotoxic activity. Together these chapters describe new ways forward

and novel therapeutics targeting HIV-1 protease. My thesis also includes an Appendix, whichdescribes the elaboration of boronic acid-based covalent pharmacological chaperones of humantransthyretin.

3

Thesis Supervisor: Ronald T. Raines

Title: Firmenich Professor of Chemistry

4

Acknowledgments

The Raines Laboratory has been an incredible place to receive training at the interface betweenchemistry and biology. I thank Prof. Ronald Raines for the opportunity to work on a variety ofexciting projects with relevance to human health. His approach to mentoring has made me anindependent researcher and instilled in me an attention to detail that I will carry on in my futureresearch endeavors.

The work presented here in my thesis would not be possible without the support of mycoworkers in the Raines Group. I thank Dr. Chelcie Eller, Dr. Kevin Desai, Dr. Trish Hoang, Dr.Joelle Lomax, and Dr. James Vasta, for helping me get started in the lab and being a constantsource of advice and support. I value the short time I overlapped with Dr. Michael Palte and hismany project ideas. My chemical training was made possible through numerous collaborationswithin the Raines Group. I thank Dr. John Lukesh, III, Dr. Thomas Smith, and Brian Graham fortheir hands-on training in organic synthesis. I also thank Dr. Brian Gold for his mentoring incomputational chemistry. Certainly, each member of the Raines group has contributed to my workthrough group meetings and everyday conversations.

I have been privileged to receive substantial training outside of the Raines Group. I amgrateful for Prof. Katrina Forest at the University of Wisconsin-Madison. Early in my studies, Iwas able to learn alongside her students, and she provided me with the resources necessary toprepare structures by X-ray crystallography. I must also thank Dr. Kittikhun Wangkanont who alsotrained with Prof. Forest for sharing his expertise and friendship during numerous trips to ArgonneNational Lab. I am also grateful for the training I received from Prof. David O'Connor at the AIDSVaccine Research Lab at UW-Madison. His staff scientist Dr. Dawn Dudley has helped mecharacterize live HIV in experiments that are ongoing and did not make it into my thesis. Thatentire group is friendly and helpful and made me feel safe working with a deadly virus. I have alsobeen fortunate to attend workshops including the NAMD molecular dynamics workshop hostedby the Theoretical and Computational Biophysics Group at the University of Illinois Urbana-Champaign, the NMRFAM workshop at UW-Madison, and a Rosetta Workshop hosted by Prof.Jens Meiler at Vanderbilt University.

I originally began my graduate studies at UW-Madison. In addition to my collaboratorsthere, I must thank my previous thesis committee members Prof. Julie Mitchell and Prof. AnjonAudhya for their support over several years. Transferring to MIT in 2017 was an easy decisionthat I made with a heavy heart. My arrival to MIT was delayed as I conducted viral studies duringthe summer of 2017 and I thank Val Ressler and Dr. Emily Garnett for setting up the biochemicallaboratory at MIT. I am grateful for the support from my new thesis committee members Prof.Elizabeth Nolan and Prof. Alexander Klibanov. I am also grateful for my ongoing collaborations

5

at MIT with Christine Isabella and Dr. Caitlin McMahon of the Kiessling Group, Dr. Ivan Buslovof the Buchwald and Pentelute Groups, and Scott Sheppard of the Cummins Group. The staff inthe Chemistry Office has been instrumental in navigating the process of transferring to MIT and

graduation.

It has been remarkable to be part of the Raines Group through two cycles of grant renewals

and see the process that enabled Ron to fund many projects over many years. My work has beenfunded by ROl GM044783 and ROl CA073808 from the NIH. I was privileged to receive an NIHBiotechnology Training Program Fellowship (T32 GM008349) and a pre-doctoral fellowship fromGenentech while at UW-Madison.

I must also thank Prof. Claudia Schmidt-Dannert and Prof. Jeffrey Gralnick at theUniversity of Minnesota Twin Cities for providing me with my first research experiences andhelping me enroll in graduate school. I thank Dr. Swati Choudary and Dr. Maureen Quin for theirhands-on training in biochemistry and helping me find my passion for science. I also thank Dr.Raghunath Padiyath at 3M for his mentoring and advice.

I have had tremendous support from friends and family during my graduate studies. I valuemy teammates from the Discertators ultimate frisbee team and the Warthogs intramural hockeyteams for their companionship and helping me stay active. I thank my parents William Windsorand Starla Guckenberg and my sister Autumn Windsor for their love and encouragement. By far,the most pivotal event I experienced during graduate school was meeting my loving partner SusanBright. She has been a constant source of support and inspiration and I cannot imagine my lifewithout her.

6

Table of Contents

PREFACE

Abstract ................................................................................................................................... 3

A cknowledgm ents................................................................................................................... 5

Table of Contents .................................................................................................................... 7

List of Tables ....................................................................................... 15

List of Figures ....................................................................................................................... 18

List of Charts......................................................................................................................... 22

List of Schem es..................................................................................................................... 22

Abbreviations........................................................................................................................ 23

CHAPTER 1

Introduction: Current and Future Prospects for the Treatment of HIV ..................................... 26

1. 1 Significance..................................................................................................................... 27

1.2 Hum an Im m unodeficiency Virus................................................................................. 28

1.2.1 The Lifecycle of HIV ............................................................................................ 28

1.2.2 Treatm ent of HIV ................................................................................................. 30

1.2.3 H IV M utations and Setbacks for a Cure .............................................................. 31

1.2.4 H IV Latency............................................................................................................. 32

1.3 Biochem istry of HIV -1 Protease................................................................................. 33

1.3.1 The Structure of HIV-1 Protease ......................................................................... 33

1.3.2 Catalysis by HIV Protease ..................................................................................... 34

1.3.3 Substrate Recognition by HIV -1 Protease............................................................ 36

7

1.4 Inhibition of HIV- 1 Protease ....................................................................................... 38

1.4.1 Natural Product-inspired Inhibitors of Aspartic Proteases ................................... 38

1.4.2 The First Generation of Clinical Inhibitors.......................................................... 39

1.4.3 The Second Generation of Protease Inhibitors ..................................................... 40

1.4.4 Drug Resistance and the Future of Protease Inhibitors........................................ 41

1.5 Versatile Recognition by Boronic Acids ..................................................................... 43

1.5.1 Chemistry of Boronic Acids ................................................................................ 43

1.5.2 Covalent Recognition of Proteins by Boronic Acids............................................ 45

1.5.3 Non-covalent Recognition of Proteins by Boronic Acids.................................... 46

1.6 Ribonucleases as Antiviral Therapeutics..................................................................... 48

1.6.1 Structure of RNases .............................................................................................. 48

1.6.2 Catalysis by RNases.............................................................................................. 49

1.6.4 Evasion of Ribonuclease Inhibitor........................................................................ 50

1.6.5 Antiviral Activity of RNases ................................................................................ 51

CHAPTER 2

Fluorogenic Assay for Inhibitors of HIV- 1 Protease with Sub-picomolar Affinity .................. 62

2.1. Introduction.................................................................................................................... 64

2 .2 . R esu lts............................................................................................................................ 6 5

2.2.1. Substrate Design. ................................................................................................. 65

2.2.2. Assay Design. ......................................................................................................... 66

2.2.3. M ichaelis-M enten Kinetics. ............................................................................... 67

2.2.4. Determination of Ki Values with M orrison's Equation. ...................................... 67

8

23 , Di scun..................................................................................................................... 60

2.4. M aterials and M ethods................................................................................................. 69

2.4.1. M aterials. ................................................................................................................ 69

2.4.2. Plasm id Preparation ............................................................................................ 69

2.4.3. Protein Purification .............................................................................................. 70

2.4.4. Enzym atic Activity Assays................................................................................... 71

2.4.5. Data Analysis.......................................................................................................... 72

2.5. A cknow ledgm ents.......................................................................................................... 73

CHAPTER 3

Substrate Selected by Phage Display Exhibits Enhanced Side-chain Hydrogen Bonding with HIV-

1 Protease ...................................................................................................................................... 82

3.1. Introduction.................................................................................................................... 84

3.2 Results and D iscussion ................................................................................................ 84

3.2.1 SGIFLETS Binds in Alternative Orientations ..................................................... 84

3.2.2 SerlA and Gly2A at the P4 and P3 Positions Occupy Alternative Conformations. 85

3.2.3 Glu6 and Ser8 Form a N etwork of Hydrogen Bonds .............................................. 86

3.2.4 Thr7 Plays a Lim ited Role..................................................................................... 87

3.3 M aterials and M ethods................................................................................................ 88

3.3.1. Protein..................................................................................................................... 88

3.3.2. Peptide..................................................................................................................... 89

3.3.3. Crystallization......................................................................................................... 89

3.3.4. Data Collection and Processing ........................................................................... 89

9

3.3.5 Structure Solution and Refinement..................................................................... 89

3 .4 C on clu sion ...................................................................................................................... 9 1

3.5 Acknowledgements..................................................................................................... 91

CHAPTER 4

An n--+7* Interaction in the Bound Substrate of Aspartic Proteases Replicates the Oxyanion Hole

112

104

106

106

108

109

112

112

113

113

114

114

.....................................................................................................................................................

4.1 Introduction...................................................................................................................

4.2 Results and Discussion .................................................................................................

4.2.1 Conform ation of the P1 Residue............................................................................

4.2.2 Conformation of the P 1' Residue ...........................................................................

4.2.3 M echanistic Insights ..............................................................................................

4.3 M aterials and M ethods..................................................................................................

4.3.1 Structural Analyses ................................................................................................

4.3.2 DFT Optim ization..................................................................................................

4.3.3 NBO Analysis ........................................................................................................

4.4 Conclusions...................................................................................................................

4.5 Acknowledgem ents.......................................................................................................

CHAPTER 5

Sub-Picom olar Inhibition of HIV- 1 Protease with a Boronic Acid ............................................

5.1 Introduction...................................................................................................................

5.2 Results and Discussion .................................................................................................

5.3.1 General...................................................................................................................

10

134

136

137

141

5. . C Syn arthe ................................................................................................. 142

5.3.3 Protein Preparation ................................................................................................. 145

5.3.4 Enzym e K inetics and. M ate........ ..... ...................... ............................ ..... 146

5.3.4.1 I ha enis nten K in e is .......................................................................... 146

5.3.4.2 M ichaelis-M enten Kinetics ......................................... ................................... 146

5.3.4.3 Inhibition K inetics.......................................................................................... 147

5.3.5 Cytotoxicity.............................................................................. 147

5.3.6 X -Ray Crystallography .......................................................................................... 148

5.3.6.1 Protein Crystallization .................................................................................... 148

5.3.6.2 X -Ray D iffraction ........................................................................................... 148

5.3.6.3 Structure Solution and Refinem ent................................................................. 149

5.3.7 Com putational A nalysis......................................................................................... 150

5.3.7.1 D FT Optim ization w ith G aussian ................................................................... 150

5.3.7.2 N atural Bonding Orbital A nalysis .................................................................. 150

5.3.7.3 A tom s in M olecules (A IM ) A nalysis.............................................................. 151

5.4 A cknow ledgm ents......................................................................................................... 152

5.5 H ydrogen Optim ization Coordinates ............................................................................ 153

5.6 N M R Spectra ................................................................................................................ 172

CHAPTER 6

Benzoxaborolone A Boronic Acid with Remarkable Oxidative Stability in Aqueous Solution

..................................................................................................................................................... 1 7 6

6.1 Introduction................................................................................................................... 178

11

6.2 Results and D iscussion ................................................................................................. 180

6.2.1 pKa V alues..............................................................................................................180

6.2.2 Susceptibility to Oxidation. ................................................................................... 181

6.2.3 A ffinity for Saccharides.........................................................................................182

6.2.4 Com putational A nalyses. ....................................................................................... 182

6.3 M aterials and M ethods.................................................................................................. 185

6.3.1 G eneral................................................................................................................... 185

6.3.2 pKa Determ ination...............................................................................................185

6.3.3 X -Ray Crystallography.......................................................................................... 186

6.3.4 Chem ical K inetics.................................................................................................. 186

6.3.4.1 Instrum entation and M aterials ........................................................................ 186

6.3.4.2 W avelength Optim ization and Calibration ..................................................... 187

6.3.4.3 Initial V elocity K inetics.................................................................................. 187

6.3.4.4 Evaluation of pH -D ependence........................................................................ 188

6.3.5 Saccharide Binding................................................................................................ 188

6.3.6 Com putational A nalyses........................................................................................ 189

6.3.6.1 D FT Optim ization w ith Gaussian................................................................... 189

6.3.6.2 N atural Bonding Orbital A nalysis .................................................................. 189

6.4 Conclusions................................................................................................................... 190

6.5 A cknow ledgm ents......................................................................................................... 190

6.6 A tom ic Coordinates of Optim ized Structures............................................................... 232

12

CHAPTER 7

Strain Inactivation in a Circular RNase 1 Zym ogen................................................................... 242

7.1 Introduction................................................................................................................... 244

7.2 Results........................................................................................................................... 246

7.2.1 Design of a Circular Zym ogen Construct .............................................................. 246

7.2.2 Design and Assessment of Glycine (G) Series of Zymogens ................................ 246

7.2.3 Design and Assessment of Strained (Str) Series of Zymogens.............................. 247

7.2.4 A ssessing Zym ogen Activation by HIV -1 Protease .............................................. 247

7.2.5 Modelling the Structural Basis of Str2 Zymogen Inactivation .............................. 248

7.2.6 Modelling the Structural Basis of Proteolytic Activation of Str2 by HIV-1 Protease

......................................................................................................................................... 2 4 8

7.3 D iscussion..................................................................................................................... 249

7.4 M aterials and M ethods.................................................................................................. 251

7.4.1 Protein Expression and Purification....................................................................... 251

7.4.2 Enzym e Activity A ssays........................................................................................ 252

7.4.3 Therm al Stability A ssays ....................................................................................... 253

7.4.4 M odelling with Rosetta.......................................................................................... 254

APPENDIX A

Stilbene Boronic Acids Form a Covalent Bond with Human Transthyretin and Inhibit its

Aggregation.................................................................................................................................272

A . 1 Introduction..................................................................................................................274

A .2 Results..........................................................................................................................277

13

A .3 D iscussion .................................................................................................................... 28 1

A .4 M aterials and M ethods................................................................................................. 285

A .4.2 M aterials .. ynthe............................................................................................... 285

A .4.2 Chem ical Synthesis.............................................................................................. 285

A .4.2.1 M aterials.m.. a.... .............................................. .......................................... 285

A .4.2.2 Conditions p ...... ........................................... ........................................ 285

A .4.2.3 Solvent Rem oval ......................................................................................... 286

A .4.2.4 N M R Spectroscopy................................................ ........................................ 286

A .4.2.5 M ass Spectrom etry......................................................................................... 286

A .4.2.6 M elting Points..................................................... ........................................... 286

A.4.2.7 Compound Purity .................................................................. 287

A.4.2.8 Synthesis............................................................................ 287

A .4.3 Protein Expression and Purification...................................................................... 305

A .4.4 Com petitive Fluorescence A ssay.......................................................................... 306

A .4.5 Fibril-Form ation A ssay ......................................................................................... 307

A.4.6 Protein Crystallization and X-ray Structure Determination.................................. 308

A .5 Conclusion ................................................................................................................... 3 10

A .5 H PLC Traces................................................................................................................ 365

A .6 N M R Spectra................................................................................................................ 366

R E FE R E N C E S .............................................................................................................. 395

14

List of Tables

Table 2.1. Inhibition Constants Reported for Amprenavir, Darunavir, and Tipranavir............ 74

Table 2.2. Kinetic Parameters of Popular HIV- 1 Protease Substrates and Substrate 1 ........... 75

Table 3.1. Endogenous and Optimized HIV-1 Protease Substrate Sequences.......................... 92

Table 3.2. Crystallographic Data Collection and Refinement Statistics ................................... 93

Table 4.S1. Measured Parameters from PDB Entries ................................................................ 115

Table 4.S2. Measured Parameters from CSD Entry SOWJUL .................................................. 117

Table 4.S3. Coordinates Extracted from PDB Entry 6bra and Optimized Hydrogen Atoms ... 118

Table 4.S4. Coordinates Extracted from PDB Entry 3b80 and Optimized Hydrogen Atoms ... 120

Table 5.1. Values of Ki for Inhibition of HIV-1 Protease .......................................................... 158

Table 5.S1. Crystallographic Data Collection and Refinement Statistics.................................. 159

Table 6.1. Rate Constants for the Oxidation of Boronic Acids and Biological Thiols at pH ~7.4

..................................................................................................................................................... 1 9 2

Table 6.2. Experimental and Computational Kinetic Parameters for the Oxidation of Boronic

A cid s ........................................................................................................................................... 19 3

Table 6.3. Values of Ka (M-1) for Boronic Acids and Saccharides............................................ 194

Table 6.S1. Crystal Data and Structure Refinement .................................................................. 195

Table 6.S2. Atomic Coordinates (x 104 ) and Equivalent Isotropic Displacement Parameters (A2 x

103) for P 180 6 5 ........................................................................................................................... 19 6

Table 6.S3. Anisotropic Displacement Parameters (A 2 x 103) for P 18065................................ 197

T able 6.S4. B ond Lengths for P 18065 ....................................................................................... 198

T able 6.S5. B ond A ngles for P 18065......................................................................................... 199

15

Table 6.S6. Hydrogen Coordinates (x 104 ) and Isotropic Displacement Parameters (A 2 x 103 ) for

P 18 0 6 5 ........................................................................................................................................ 2 0 1

Table 6.S7. Hydrogen Bonds for P18065 .................................................................................. 202

Table 6.S8. Values of Ac During Oxidation Reactions (M-1cm ) ............................................ 203

Table 6.S9. Second-Order Rate Constants for the Oxidation of Boronic Acids (M-'s-').......... 204

Table 6.S10. Calculated Energies of Starting Materials, Transition States, and Products (Hartrees)

..................................................................................................................................................... 2 0 5

Table 6.S11. Calculated Free Energies of Activation and Reaction (kcal/mol) ........................ 206

Table 6.S12. Boron-Oxygen and Competing Donor-Acceptor Interaction Energies ............... 207

Table 7.1. G and Str Zym ogen D esigns ..................................................................................... 255

Table 7.2. Properties of Str Series Zym ogens ............................................................................ 256

Table 7.S1. Activity and Stability of RNase 1 and Zymogens .................................................. 257

Table 7.S2. Str Zymogen Kinetic Parameters at pH 5.0 ............................................................ 258

Table 7.S3. Total Scores of Models Calculated by Rosetta....................................................... 259

Table A.1. Interaction of Diphenols 1-4 with Wild-type TTR and its V30M Variant...... 312

Table A.2. Interactions of Carboxylic Acids 5-8 with Wild-type TTR and its V30M Variant. 313

Table A.3. Interactions of Compounds 9-12 with Wild-type TTR and its V30M Variant ....... 314

Table A.S1. Bond Angles and Bond Lengths of Planar Boronic Esters in Small-molecule Crystal

S tru ctu res .................................................................................................................................... 3 15

Table A.S2. Crystallographic Data Collection and Refinement Statistics for the TTR-2 Complex.

..................................................................................................................................................... 3 16

Table A.S3. Data Collection and Refinement Statistics for the TTR-3 Complex. ................... 317

16

Table A.S4. Crystallographic a and Refine et Statistics for the TTr 4 Complex

Table A.S. Crystallographic Data Collection and Refinement Statistics for the TTR5 Complex.

..................................................................................................................................................... 3 19


..................................................................................................................................................... 32 0

Table A.S7. Crystallographic Data Collection and Refinement Statistics for the TTR 7 Complex.

..................................................................................................................................................... 3 2 1


..................................................................................................................................................... 3 2 2

Table A.S9. Crystallographic Data Collection and Refinement Statistics for the TTR 10 Complex.

..................................................................................................................................................... 3 2 3

Table A.S10. Crystallographic Data Collection and Refinement Statistics for the TTR 11

C o m p lex ...................................................................................................................................... 3 2 4

Table A.S11. Non-covalent Interactions and Distances in TTR Ligand Complexes ................ 325

Table A.S12. Observed c--hole Bond Lengths and Bond Angles in TTR Ligand Complexes .. 327

17

List of Figures

Figure 1.3. Structures of Aspartic Proteases Bound to Ligands. ............................................. 53

Figure 1.4.1. Structure of Pepstatin, Statine, and a Tetrahedral Intermediate .......................... 55

Figure 1.4.2. FDA Approved Inhibitors of HIV Protease........................................................ 57

Figure 1.5. Equilibria of Boronic Acids in Aqueous Solution................................................. 59

Figure 1.6. Structures of RNases Bound to Ligands................................................................. 61

Figure 2.1. Structure of Substrate 1 ......................................................................................... 77

Figure 2.2. Catalysis of the Hydrolysis of Substrate 1 by HIV-1 Protease............................... 79

Figure 2.3. Inhibition of HIV- 1 Protease by Amprenavir, Darunavir and Tipranavir .............. 81

Figure 3.1. Structure of the D25N HIV-1 Protease CA/p2 Complex...................................... 95

Figure 3.2. Electron Density and Interactions of SGIFLETS Bound in the Active Site of D25N

H IV -1 P rotease.............................................................................................................................. 97

Figure 3.3. Alternative Conformations of P3 and P4 Residues .............................................. 99

F igure 3.4. R ole of A sp30 .......................................................................................................... 101

Figure 4.1. Structural Features of Aspartic Proteases................................................................ 123

Figure 4.2. Ramachandran Plot of P1 and P1' Residues in Substrates Bound to The Active Site of

Inactivated A spartic Proteases .................................................................................................... 125

Figure 4.3. Main-chain Conformation of the P1 Residue .......................................................... 127

Figure 4.4. Main-chain Conformation of the P1' Residue ......................................................... 129

Figure 4.5. Putative Mechanism of Catalysis by HIV- 1 Protease ............................................. 131

Figure 4.6. Orbital Interactions of the Scissile Peptide Bond During Catalysis by HIV-1 Protease

..................................................................................................................................................... 13 3

18

Figure 5.1 Interaions witb a bstraP, Drunavir, or its Analg '-aN d the 2' Susie of HIV-1

P ro tea se ....................................................................................................................................... 16 3

Figure 5.2. Orbital Interactions in a Model of Boronic Acid 1 and Residue 30 of HIV- 1 Protease

..................................................................................................................................................... 1 6 5

Figure 5.S1. Raw and Processed Kinetic Data from Assays of HIV- 1 Protease Activity ......... 167

Figure 5.S2. Depiction of Electron Density of HIV- 1 Proteases with Bound Boronic Acid 1.. 169

Figure 5.S3. Toxicity of Darunavir and Boronic Acid 1 for Human Cells................................ 171

Figure 6.1. Structure of Phenylboronic Acid, Benzoxaborole, Benzoxaborolone, and Hydration

and Protonation States that are Relevant in Aqueous Solution................................................... 209

Figure 6.2. pH-Dependence of the Rate Constant for the Oxidation of Boronic Acids by Hydrogen

P ero x id e ...................................................................................................................................... 2 1 1

Figure 6.3. Calculated Structure of the Transition State for the 1,2-aryl Shift During the Oxidation

of PBA, 2-HMPBA, and 2-CPBA by Hydrogen Peroxide ......................................................... 213

Figure 6.4. Images of Key Orbitals in the Transition State for the 1,2-aryl Shift During the

Oxidation of PBA , 2-HM PA, and 2-CPBA ................................................................................ 215

Figure 6.S1. Representative "B-NMR Spectra Used to Determine the pKa Values of 2-CPBA

..................................................................................................................................................... 2 1 7

Figure 6.S2. UV-Spectroscopic Basis for Assays of the Oxidation of Boronic Acids .............. 219

Figure 6.S3. Kinetic Traces of the Oxidation of Boronic Acids by Hydrogen Peroxide at pH 7.4

..................................................................................................................................................... 2 2 1

Figure 6.S4. 1 H-NMR Spectra Acquired During the Oxidation of Boronic Acids ................... 227

19

Figure 6.S5. Optimized Structure of Each Hydrogen Peroxide Complex and Product During the

O xidation of B oronic A cids ........................................................................................................ 229

Figure 6.S6. Graph of the Extent of Protodeboronation of Boronic Acids at Three pHs.......... 231

Figure 7.1. Activity and Stability of Circular Zymogens of RNase 1 ...................................... 261

Figure 7.2. Modelling of Str2 Zymogen and Complex with HIV-1 Protease............................ 263

Figure 7.S1. Expression of 3G Zym ogen................................................................................... 265

Figure 7.S2. Proteolytic Treatment of RNase 1 with HIV- 1 Protease....................................... 267

Figure 7.S3. A ctivation K inetics of Str2.................................................................................... 269

Figure 7.S4. Top 10 Scoring Rosetta Models of Str2 and Complex with HIV- 1 Protease........ 271

Figure A.1. Three-dimensional Structure of the TTR Resveratrol Complex ............................ 329

Figure A.2. Three-dimensional Structures of TTR-Ligand Complexes that Contain a Boronic Acid

G ro u p .......................................................................................................................................... 3 3 1

Figure A.3. Three-dimensional Structures of TTR Ligand Complexes that do not Contain a

B oronic A cid G roup .................................................................................................................... 333

Figure A.4. Halogen-bonding Interactions in the TTR 10 and TTR 11 Complexes................. 335

Figure A.S1. Graphs Showing the Results of ANS Competition Assays .................................. 337

Figure A.S2. Graphs Showing the Results of 96-h Fibril-formation Assays............................. 339

Figure A.S3. MALDI-TOF Mass Spectra to Probe the Reversibility of TTR Inhibition ......... 341

Figure A.S4. Electron Density in the TTR-2 Complex.............................................................. 343

Figure A.S5. Electron Density in the TTR 3 Complex.............................................................. 345



20

Figure A.S8. Flectron nntv in the TTR 6 Coomppe x 31

Figure A.S9. Electron Density in the TTR7 Complex.............................................................. 353

Figure A.S1O. Electron Density in the TTR8 Complex........................................................ 355

Figure A.S11. Electron Density in the TTR-1 Complex...................................................... 357

Figure A.S12. Electron Density in the TTR 11 Complex.......................................................... 359

21

List of Charts

C hart A .1. D iphenol Ligands..................................................................................................... 360

Chart A.2. Carboxylic Acid Ligands......................................................................................... 361

Chart A.3. Diboronic Acid and Related Ligands....................................................................... 362

List of Schemes

Scheme 5.S1. Route for the Synthesis of Boronic Acid 1. Overall yield: 54% (unoptimized).. 157

Scheme 6.1. Boronic Acid Oxidation Pathway .......................................................................... 191

Scheme A.1. Routes for the Synthesis of Stilbenes 2-6............................................................. 363

Scheme A.2. Routes for the Synthesis of Stilbenes 7-11........................................................... 364

22

bbre-viationsT .

2-CPBA ............................................................................................. 2-carboxyphenylboronic acid2-HM PBA ............................................................................. 2-hydroxym ethylphenylboronic acidA ...................................................................................................................... Angstr6m (10-10 m )ACE ................................................................................................ angiotensin converting enzym eAID S .................................................................................. acquired im m unodeficiency syndrom eAIM .................................................................................................................. Atom s in M oleculesAM I ........................................................................................................................ Austin m odel IAN S ....................................................................................... 8-anilinonaphthalene-1-sulfonic acidART ................................................................................................................ antiretroviral therapyAZT ......................................................................................................................... azidothym idineBCA .................................................................................................................... bicinchoninic acidBCP ..................................................................................................................... bond critical pointCCR ............................................................................................................ CC chem okine receptorCD ............................................................................................................. cluster of differentiationCXCR ..................................................................................................... CXC chem okine receptorsDABCYL .............................................................. 4-((4-(dim ethylam ino)phenyl)azo)benzoic acidDCM ..................................................................................................................... dichlorom ethaneDEPC ............................................................................................................ diethyl pyrocarbonateDFT .......................................................................................................... density functional theoryDM F ................................................................................................................. dim ethylform am ideDM SO ................................................................................................................. dim ethylsulfoxideDNA .............................................................................................................. deoxyribonucleic acidDTT ............................................................................................................................. dithiothreitolEDAN S .......................................................... 5-(2-am inoethylam ino)- I -naphthalenesulfonic acidEDTA ............................................................................................. ethylenediam inetetraacetic acidE .................................................................................................................... extinction coefficientEPR .............................................................................................. electron param agnetic resonanceER ............................................................................................................... endoplasm ic reticulumESCRT .......................................................... endosom al sorting com plexes required for transportESI ............................................................................................................... electrospray ionizationFAM ............................................................................................................................... fluoresceinFDA ............................................. Food and Drug Adm inistration of the United States of Am ericaFRET ........................................................................................... Fbrster resonance energy transferG .......................................................................................................................................... glycine

gp .................................................................................................................................. glycoproteinHAART .................................................................................... highly active antiretroviral. therapyHIV .................................................................................................. hum an im m unodefiency virusHPLC .................................................................................... high pressure liquid chrom atographyHRM S ....................................................................................... high-resolution m ass spectrom etryIEFPCM ........................ integral equation formalism variant of the polarizable continuum model

23

IN ....................................................................................................................................... integraseIPTG ................................................................................... isopropyl p-D-1-thiogalactopyranosideITC ................................................................................................. isotherm al titration calorim etryKa.................................................................................................. equilibrium association constantkcat.......................................................................................................................... turnover num ber

kcat/KM .................................................................................................................. catalytic effiencyKd................................................................................................. equilibrium dissociation constantK ........................................................................................................................ inhibition constantKm ...................................................................................................................... M ichaelis constantkobs ................................................................................................................ observed rate constantLBHB .................................................................................................... low-barrier hydrogen bondlogP ........................................................................................... octanol-water partition coefficientLS-CAT......................................................................... Life Sciences Collaborative Access TeamM ............................................................................................................................................. m olarM ALDI ........................................................................ m atrix-assisted laser desorption/ionizationM D ................................................................................................................... m olecular dynam icsM DR .............................................................................................................. m ultidrug-resistantM R ............................................................................................................... m olecular replacem entm RNA .................................................................................................................... m essenger RNAMTS .............. 3-(4,5-dimethylthiazol-2-yl)-5-(3-carboxymethoxyphenyl)-2-(4-sulfophenyl)-2H-

tetrazoliumM W CO ....................................................................................................... m olecular weight cutoffN2(g).............................................................................................................................nitrogen gasN2(l).......................................................................................................................... iquid nitrogenNBO ............................................................................................................ natural bonding orbitalNIAID .......................................................... National Institute of Allergy and Infectious DiseasesNIH ..................................................................................................... National Institutes of HealthNM R .................................................................................................... nuclear m agnetic resonanceNM RFAM ........................................................ National M agnetic Resonance Facility at M adisonNN RTI .................................................................... non-nucleoside reverse transcriptase inhibitorNpu....................................................................................................................Nostoc punctiformeNRTI .............................................................................. nucleoside reverse transcriptase inhibitorN S ................................................................................................................. non-structural protein

C ............................................................................................................................ degrees CelsiusONC ............................................................................................................................... Onconase*ORTEP........................................................................................ Oak Ridge therm al-ellipsoid plotPAGE ....................................................................................... polyacrylam ide gel electrophoresisPBA ................................................................................................................... phenylboronic acidPBS ......................................................................................................... phosphate-buffered salinePCR ........................................................................................................ polym erase chain reactionPDB .................................................................................................................... Protein Data BankPDE ..................................................................................................................... phosphodiesterasepH ............................................................... negative of the logarithm of the hydrogen ion activityPI..........................................................................................................................protease inhibitor

24

V_ I Tfl nf t1l 0- 1 I-N " :l r;tl'% "-I rVF fk -11--.1 A 'SOCi.1+;f_%" CO"Starita ....................................................... nia..", Ai V % %-,P-L L,11%w 1%-15L41 ILI" I I kJ.L L,11% LL%. I $.I d ica $ IGLLIVII %., 113L

PPII ................................................................................................................ poly-l-protein type 11P R ........................................................................................................................................ p ro tea sePSI ............................................................................................................... pounds per square inchQM /M M ........................................................................ quantum m echanic s/m olecular m echanicsRFU ........................................................................................................ relative fluorescence unitsRI ................................................................................................................... ribonuclease inhibitorRM SD .................................................................................................. root-m ean- square deviationRNA ....................................................................................................................... ribonucleic acidRNase ........................................................................................................................... ribonucleaseRT .................................................................................................................... reverse transcriptaseS ....................................................................................................................................... S v ed b ergSDS ............................................................................................................. sodium dodecyl sulfateSiv ................................................................................................ sim ian imm unodeficiency virusSPPS .................................................................................................. solid-phase peptide synthesisSSHB ............................................................................................................................ short-strongS tr ........................................................................................................................................ strain edT .......................................................................................................................................... th y m u sT 4 ....................................................................................................................................... th yro x in eTAM RA ........................................................................................... carboxytetram ethy1rhodam ineTEA ............................................................................................................................. triethylam ineTFA .................................................................................................................... trifluoroacetic acidTHF ......................................................................................................................... tetrahydrofuranTLC ....................................................................................................... thin-layer chrom atography

TM ................................................................................................ m idpoint of therm al denaturationTOF ............................................................................................................................. tim e of flightTris ........................................................................... 2-am ino-2-(hydroxym ethyl)propane- 1,3-diolTTR .............................................................................................................................. transthyretinUV .................................................................................................................................... ultravioletv is .......................................................................................................................................... v isib le

VMax ...................................................................................................................... maxim al velocityVO ............................................................................................................................... initial velocity

25

Chapter 1

Introduction: Current and Future Prospects for the

Treatment of HIV

26

1.1 Significance

An estimated 37 million individuals are infected with HIV worldwide in 2017 and AIDS associated

complications have resulted in 35 million deaths.' Since the AIDS epidemic began in 1981,

academia and industry, sometimes in collaboration, have sought to find a cure for HIV.2 The war

against AIDS began when a trio of labs independently discovered HIV.4 6 Identification of this

new class retrovirus, the lentiviruses, responsible for AIDS unlocked the possibility of testing and

the development of targeted therapeutics. Though HIV was first clinically observed in the United

States, the virus is thought to have originated in Africa through human adaption of SIV transmitted

through contact with an infected animal.7 The majority of infected individuals reside in

economically disadvantaged regions of Africa and India, making accessibility to antivirals a

prominent goal of public health organizations. Antivirals with greater potency that require smaller

dosage and compounds with enhanced shelf lives conferred by greater chemical stability are highly

desired. Virological differences amplify shortcoming in the treatment of HIV in the developing

world as therapeutic have been optimized to treat HIV-1, type M, subtype B, which is common in

the US and Europe, and not subtype C that infects the majority of people and is prominent in

Africa.8 Finally, the emergence of drug resistance has transformed HIV treatment from a general

strategy to personalized medicine that requires tailored therapeutics. 9

27

1.2 Human Immunodeficiency Virus

HIV can be transmitted via several mechanisms. Sexual intercourse is the prominent route,

although infection can arise from contact with contaminated blood, from mother to child via birth,

and intravenous drug use. One quarter of infected individuals are not aware they are infected and

contribute to nearly half of new infections, making testing and prevention a prominent feature of

eradication efforts.10 Differential presentation of acute infection and the symptoms being similar

to common pathogens lead to challenges in diagnosis. Following the acute stage, viral replication

is repressed by the immune system during the chronic infection stage." Treatment outcomes are

more favorable with earlier commencement of antiretroviral therapy (ART). The progression of

HIV to AIDS occurs within a few years without treatment to well over a decade with treatment.

AIDS is the result of depletion of the immune system, which makes the patient susceptible to

pathogens that are not problematic to an otherwise healthy individual.

1.2.1 The Lifecycle of HIV

HIV is unlike any other pathogen facing humanity due to the cell types that are permissive hosts

to infection. The lifecycle of HIV begins by a recognition event called attachment where the viral

protein gp120 binds the CD4 receptor of a CD4+ T-cell.1 2 Activated T-cells express greater

quantities of receptors utilized by HIV upon antigenic stimulation as part of the adaptive immune

response; thus, HIV infects and kills a subset of cells tasked with fighting the infection." Binding

of the co-receptor, either CXCR4 or CCR5, leads the other viral membrane protein gp4l extending

an a-helical domain into the host-cell membrane.' 4 This process takes roughly an hour and leads

28

to fusion of the viral and host cell membrane.15 Fusion deposits the viral RNA genome and viral

enzymes into the cytosol.

Historically, the first targets for therapeutic intervention were the viral enzymes. The

enzyme reverse transcriptase (RT) synthesizes a DNA copy of the viral genome in a template-

dependent manor using the viral RNA.1 6 Cyclization of the DNA through the action of integrase

in conjunction with accessory proteins forms the preinitiation complex which transduces into the

nucleus where the proviral genome becomes covalently integrated into the host cell genome. 7

Association with nuclear proteins and the accessibility of transcriptionally active regions promote

integration.1 8 Evolution of retrovirus has achieved compact, approximately 10 kilobase viral

genomes.

Instead of independently regulating transcription and translation of viral RNA and proteins,

the viral proteins are expressed as long poly-proteins known as gag and gag-pol. The process of

maturation separates the individually folded domains of the poly-proteins, enabling the assembly

of the virus particle. The final viral enzyme in the life cycle, HIV protease hydrolyzes a main-

chain amide in the unfolded, extended regions between viral proteins allowing them to fold and

assemble into their quaternary structures. 19-20 Additionally, frame shifting enables differential

abundances of viral mRNA to properly titer the amount of viral protein expressed.21

Prior to maturation, the viral poly-proteins assemble on the inner leaflet of the host cell

plasma membrane. The gp4l and gpl20 proteins are expressed as a tandem precursor protein

called env that is translated into the ER. Endogenous protease separate the two before transport to

the plasma membrane.22 Here, the membrane proteins and the cytosolic polyproteins coordinate

29

with host cell proteins from the ESCRT pathway to begin the process of budding.23 Much of the

maturation occurs post-budding, though studies with a photo-cleavable HIV- 1 protease inhibitor

reveal budding can occur without HIV-1 protease activity and upon degradation of the inhibitor,

budded, immature particles can mature to infection particles. 24 Nonetheless, HIV protease is

reported to target endogenous proteins in the cytosol that contributes to pathogenesis.25

1.2.2 Treatment of HIV

Numerous aspects of the viral life cycle have been successfully targeted in the clinic. The first HIV

inhibitor, of any class, was azidothymdine (AZT), an RNA analog that indirectly inhibits reverse

transcription. This class of drugs is called nucleotide RT inhibitors (NRTIs) and works by serving

as a substrate for RT to act as chain terminators by preventing further polymerization after

incorporation into nascent DNA.26 Another class of RT inhibitors target a hydrophobic cavity that

is required for proper enzyme dynamics. These non-nucleotide RT inhibitors (NNRTIs) fill this

hole and trap the enzyme preventing additional turnover.2 7 Protease inhibitors (PIs) are another

major class, which will be extensively reviewed in Section 1.4. Historically, the last enzyme

targeted for antiviral therapy was integrase (IN). Inhibitors of IN target the enzyme bound to DNA

and interact with active site magnesium ions.28 The fourth and most recent IN inhibitor received

FDA approval in 2018.29

Fusion and entry of HIV is also a prominent clinical target. HIV must first bind the primary

receptor, CD4. An antibody selected to bind CD4 was found to inhibit post-attachment co-receptor

engagement and provides the basis for the entry inhibitor ibalizumab, which achieved FDA

approval early in 2018 .30 Additionally, small molecules were elaborated to antagonize the CCR5

30

co-receptor leading to the development of maraviroc. 3 1 Finally, entry requires a conformational

chain of the env trimer in which the pair of 3-helix bundles of gp4l form a six-helix bundle.32

Peptides based on the C-terminal helix traps the intermediate and acts as inhibitor of viral

replication by preventing fusion.33

HIV inhibitors are used in combinations. This approach is referred to as ART but also

highly active ART (HAART). Typically, ART starts with a NRTI, a NNRTI, and an IN inhibitor,

and later includes other classes, like protease inhibitors. As the duration of infection progresses,

the drug combination is optimized to maintain low viremia.34 These molecule must be taken daily

and in large quantities. A prominent issue with HIV treatment is accessibility as well as

adherence. 35 Halting therapy can be detrimental and allow the infection more rapidly become drug-

resistant.

1.2.3 HIV Mutations and Setbacks for a Cure

Mutations in the HIV genome accumulate in patients over the course of treatment. A selection

occurs that enables viral proteins to continue the lifecycle without binding the inhibitors. The low

fidelity of RT is responsible for the mutagenesis. Unlike the robust proofreading of DNA

polymerases, which makes a mistake one in every 109-10 bases replicated, HIV RT has an error

rate of one in 104 bases.36 The adaptability of HIV can lead to drug resistance and treatment failure,

but also enables HIV to evade the adaptive immune response. The only viral proteins assessable

for recognition by the immune system are found on the surface of the viral membrane: gp120 and

gp41. gp 120 has can dramatically change to avoid neutralization by antibodies yet still maintain

affinity to CD4.37 gp120 is additionally masked by glycans when it is produced by the host cell,

31

limiting recognition by the immune system.38 An encouraging target for HIV vaccines is the helix

used by gp4l to insert into the membrane. This feature of the protein is highly conserved but only

solvent accessible for a short time. There are new frontiers in understanding immune recognition

that are promising in developing HIV antibodies that are broadly neutralizing. 39

1.2.4 HIV Latency

A confounding factor of HIV treatment is the lack of viral clearance. Antivirals can suppress HIV

infection to below detectable limits; however, halting therapy will lead to rebounding viremia.

This is due to latency. Latency is the result of cells that become infected but harbor the provirus in

a transcriptionally inactive state. A prominent example is quiescent CD4+ T-cells.40 Latent cells

inefficiently produce viral RNA and are reported to make low levels of virus in patients receiving

treatment.41 Latency is also responsible for reestablishing infection after suspension of antiviral

treatment. Emerging approaches seek to treat HIV by eradication of latent viral reservoirs. 42 These

approaches employ epigenetic modification to increase transcription of the virus.4 3 Viral infection

results in cytopathic effects that can kill the once latent cell.44 Current setbacks of this approach

are global transcriptional activation and cytopathic effects appear to be insufficient to kill all

infected cells.45

32

1.3 Biochemistry of HIV-1 Protease

Understanding the manner in which enzymes operate is crucial in the development of targeted

therapeutics. Kinetic-based assays and X-ray crystallography have been workhorse methodologies

in illuminating the mechanism of HIV-1 protease. The study of proteases has relied heavily on

solid-phase peptide synthesis (SPPS). 4 6 Prior to SPPS, the scope of accessible substrates for

proteolytic studies were limited.47 Additionally, incorporation of chromophores and FRET pairs

into substrates enabled continuous quantitation by absorbance and fluorescence spectroscopy.48-49

Kent and coworkers synthesized HIV-1 protease by SPPS that yielded diffraction quality crystals,

providing among the first publicly available structures, after heterologous expression approaches

failed to provide sufficient materials.5 0 5 1 The clinical relevance of HIV-1 protease and the tools

developed for its study have made it a model protein.

1.3.1 The Structure of HIV-1 Protease

HIV-1 protease, like all retroviral proteases, are homodimeric aspartic proteases (Figure 1.3.A).

Recently, a retroviral protease homolog was discovered in yeast.53 Eukaryotes utilize numerous

monomeric aspartic proteases with varied specificity suited for their biological functions. 54 For

example, pepsin is a non-specific protease that degrades dietary protein (Figure 1.3.B). Meanwhile,

renin specifically cleaves angiotensinogen to liberate the angiotensin precursor peptide, which in

turn, regulates blood pressure (Figure 1.3 .C). 55-56 These two classes, the monomeric eukaryotic

and the homodimeric retroviral protease, have limited sequence and structural similarity, but they

share a conserved active site. That active site consists of a pair of aspartic acid residues for which

this family is named. These residues are found at the beginning of a "DTG" sequence that is

33

arranged by a 2-fold axis as the result of inter-domain hydrogen bonding. This motif is called the

fireman's grip and is responsible for much of the affinity between the monomers of retroviral

proteases.57

Aspartic proteases employ flaps to assist in binding substrates. Monomeric proteases use a

single flap to form hydrogen bonds directly with the carbonyl oxygens of the amides flanking the

scissile amide. Retroviral proteases instead donate hydrogen bonds to a conserved water that

donates hydrogen bonds to these groups. EPR and MD studies have revealed these flaps are

dynamic but favor the closed state when a ligand is bound. 58-59 The flaps contribute to the potency

of HIV protease inhibitors and will be discussed in Section 1.4.3.

1.3.2 Catalysis by HIV Protease

Enzymes catalyze reactions by lowering the kinetic barrier to chemical reactions. HIV protease

shares a common mechanism with all aspartic proteases. Aspartic proteases provide a roughly

101 '-fold rate enhancement compared to the non-catalyzed reaction. 0 This family facilitates amide

hydrolysis by activating water for nucleophilic attack using acid-base catalysis. The mechanism

of aspartic proteases has been studied for several decades. 61 Though a rather complete picture of

catalysis has emerged, there are still unanswered questions. The reaction begins by the formation

of a Michaelis complex where a mono-protonated active site binds a water molecule and the

peptide substrate. The aspartate deprotonates water leading to nucleophilic while the other aspartic

acid protonates the scissile carbonyl oxygen to form the geminal-diol intermediate. The proton

abstracted from water is then transferred to the scissile amide nitrogen leading to a cationic gem-

diol. Scission of the bond leads to formation of the products. The second proton transfer and

34

sciss1IIo Stepa are the least unDetood and the irdeay bei i.Lery to Lthe..' tranition1L1V.J

states and may constitute a single step.62 A proton transfer occurs between the nascent termini prior

to product dissociation.

Crystallography has provided substantial insights into the mechanism. Flap dynamics are

important for substrate binding and the protease has been observed with both open and closed

flaps. A structure with open flaps reveals how the active site is solvated prior to the peptide

substrate binding. 63 A catalytically inactivate variant of HIV protease can be prepared by installing

the D25N mutation. This single atom change alters the active-site residue but still enables the

protein to attain the proper fold.64 Many structures of inactivated HIV protease bound to substrates

have be solved to reveal the Michaelis complex.65 Several structure of inefficiently cleaved

substrates have trapped the gem-diol intermediate. 66-67 Finally, a structure has been solved with

the product bound. 68 The only missing structure is the protonated gem-diol, however, this species

is likely too unstable to capture.

Kinetic-based assays have also revealed signification mechanistic details. Specificity

constants for the cleavage of peptides are dependent on substrate sequence and range from 101 to

105 M-s-1 for endogenous substrates. 69 An activated substrate containing non-canonical amino

acids exhibits the highest observed rate constant of kcat/Km = 2.1 x 107 M-s'-1. 70 Another substrate,

selected by phage display, is the best substrate containing only canonical amino acids, with kcat/KM

= 5.0 x 105 M's-1. Kinetic isotope effects have provided unique insights into the mechanism.

Hydrolysis studies with [ 180]H20 have revealed that HIV-1 protease can incorporate 180 into

substrates. 71 This suggests the first step of forming the gem-diol intermediate is reversible.

Additional incorporation of 13C and 180 into the carbonyl, 5N into the scissile amide, and 2H into

35

C" of the P1 residue have revealed that wild-type and drug-resistant mutants have the same rate-

determining step. 72 Modelling and computational prediction of the kinetic isotope effects suggests

that step is the N protonation.

1.3.3 Substrate Recognition by HIV-1 Protease

Structural biology has also provided insights into recognition by HIV protease. Aspartic protease

sequence specificity depends on the biological function. The most extensively studied human

aspartic protease is renin. Renin cleaves angiotensinogen to generate the angiotensin I peptide that

is next cleaved by ACE to creating the active hormone angiotensin II, which lowers blood

pressure.73 Renin can only efficiently cleave a single substrate.74 Conversely, the digestive protease

pepsin must break down dietary protein by cleaving many sites. Its specificity is limited, but

prefers cleaving between hydrophobic residues. 75 HIV-1 protease is semi-specific and can

efficiently cleave the variety of substrates found in the HIV polyproteins.

Crystal structures of numerous substrates bound to HIV protease revealed recognition of

the sidechains is based on sterics instead of specific donor-acceptor interactions with the

subsites.65, 76 Most protease substrates utilize at least a single hydrogen bond; however, they are

not well conserved between substrates. 65 The residues in protease substrates are named based on

the position relative to the scissile bond and a prime denotes residues after (that is, C-terminal to)

the scissile amide. The most frequently observed polar interaction occurs between Asp30 and a

glutamate or glutamine residue in the P2' position.77 Curiously, bulky hydrophobic groups like

leucine and isoleucine can also occupy the S2' subsite. Though hydrophobic groups are most

commonly found at the P1 and P1' positions, substrates come in two types. Most substrates employ

36

a pair of primary amino acids where both, either, or neither are aromatic rcsiducs and constitute

type 1 sequences. Type 2 sequences utilize a P1 aromatic residue and proline at the P1' position. 78

The PI/Pi' residues further dictate the selection of P2/P2' residues and add additional features to

the two types of cleavage sites. 79-80 Nonetheless, the bulk of recognition comes from hydrogen

bond interactions with the main chain of substrates.81-82 Insights into substrate recognition by HIV

protease have been critical in tailoring the specify of protease inhibitors.

37

1.4 Inhibition of HIV-1 Protease

Protease inhibitors are an important class of antiviral drugs that have dramatically improved

clinical outcomes for HIV patients.83 As of 2018, the FDA has approved ten small molecules

inhibitors of HIV protease for the treatment of HIV, eight of which still find clinical utilization.

The standard of care utilizes protease inhibitors as part of ART. Initial implementation of ART

required patients to take a combination of pills, but an emerging trend is a single pill containing

multiple drugs to simplify dosing. Recent interest in reducing daily intake of drugs has motivated

studies demonstrating protease inhibitor monotherapy can manage viremia after initial suppression

by ART.84 Though lack of adherence is still a major driver of treatment failure, decades of use

have selected for viruses with proteases that are no longer inhibited by even the best of clinical

inhibitors.85 Drug resistance and reducing the pill burden continues to motivate the development

of improved members of this class of drug.

1.4.1 Natural Product-inspired Inhibitors of Aspartic Proteases

Clinical inhibitors are a classic example of rational drug design where nature provided a lead

compound. In 1970, a nature product secreted by actinomycetes was discovered to inhibit the

digestive protease pepsin. 86 This inhibitor, named pepstatin, is a peptide that contains the non-

canonical amino acid statine (Figure 1.4.1 .A). An analog of leucine, statine when incorporated into

peptide substrates at the P1 position, generate potent competitive inhibitors of aspartic proteases. 87

Statine is a y-amino acid with a single S-configured hydroxyl group substituted on the P-position

(Figure 1.4.2.B). Investigations revealed the stereochemistry of the hydroxyl is important for

inhibition. The S-enantiomer is 100-fold more potent than the R-enantiomer.88 This moiety works

38

by mimicking the tetrahedral gemdiol intermediate (Figure 1.4.1.C). Jncorporation of this

substrate into substrates of renin and later HIV- 1 protease generated potent inhibitors with values

of Ki near 10-10 M.89-90 These seminal studies identified the pharmacophore employed in the

majority of clinical inhibitors.

1.4.2 The First Generation of Clinical Inhibitors

Following the success of peptide models employing the hydroxyethylene tetrahedral-intermediate

isostere, research pushed forward to identify peptidic scaffolds with enhanced stability in vivo.

The first protease inhibitor to gain regulatory approval was saquinavir. Its design was based on the

unusual property of HIV protease being able to accommodate a proline residue at the P1' position. 91

The second approved inhibitor was ritonavir. Though ritonavir is an efficient inhibitor of HIV-1

protease, its current role in treatment is a pharmacokinetic enhancer by inhibiting metabolic

enzymes that degrade protease inhibitors.92 Indinavir was the third FDA approved PI; however, its

use is no longer recommended due to side effects.93 Further elaboration of saquinavir improved

solubility of the P2 targeting group leading to the improved the potency of nelfinavir. 94 Finally,

amprenavir was designed to avoid bulking hydrophobic groups and amide linkages to make one

of most compact protease inhibitors.95 -96 The first generation of HIV protease inhibitors suffered

from poor pharmacokinetics, several side effects, and high susceptibility to drug resistance

mutations.97-98 These setbacks motivated the second generation of inhibitors.

39

1.4.3 The Second Generation of Protease Inhibitors

Taking lessons learned from the first generation, the design of the second generation of inhibitors

sought to enhance stability and maintain affinity in the presence of resistance mutations (Figure

1.4.3.). A key issue with amprenavir was the lack of solubility and poor bioavailability. This

limitation was remedied by appending a phosphate ester to the hydroxyethylene pharmacophore

to make the prodrug fosamprenavir. 99 Fosampreanvir dramatically reduced the pill burden of

amprenavir.1 00 Redesign efforts to address drug resistance of ritonavir lead to the discovery of

lopinavir.101

Moving away from amide linkages, atazanavir was designed with a hydrazine group

connecting the Pl' and P2' targeting groups.' 02 All clinical inhibitors of HIV protease utilize the

hydroxyethylene isostere with the exception of tipranvir. Tipranavir was discovered through an

extensive SAR campaign following the discovery that a hydroxyl-substituted coumarin derivative

inhibited HIV-1 protease.1 03 Tipranavir is effective against viruses resistant to other protease

inhibitors.1 04 The best-in-class and best-characterized protease inhibitor is the most recent to

achieve approval by the FDA. Darunavir is an analog of amprenavir with a modified P2-targeting

moiety.' 05 Initially studied in a saquinavir analog, the bis-THF group can accept a pair of hydrogen

bonds from the main chain amides of Asp29 and Asp30.1 06-107 Darunavir also has the unique ability

to inhibit protease dimerization.' 08 Inhibition of HIV-1 protease, by darunavir in particular, has

provided an exceptional model system to study drug resistance.

40

1 A A Th Resistancead the Fture of Pr-otes Inbhbtars

Several strategies are operative in the manifestation of drug resistance through HIV protease

mutations. 109 The most easily understood variety are the result of perturbation of the interface

between the protease and the ligand. Expansion of the active site can make an unfilled cavity

between the inhibitor and the subsite, which is disfavored by entropy.' 1 0 Alternatively, contraction

of the active site leads to steric occlusion of the inhibitor. These ideas combined with structural

analysis led to the substrate envelop hypothesis that posits inhibitors designed to closely match the

volume occupied by substrates are least susceptible to drug resistance mutations.76 Mutations of

polar side chains can also disruption non-covalent interactions with ligands. Such mutations can

be combated by targeting the main chain instead of side chains as exemplified by darunavir. Drug

resistance mutations frequently reduce viral fitness and compensatory mutations are required to

maintain infectivity." 1-112 These distal mutations can help restore protease activity lost in the

selection of primary mutations." 3 Substrates in the polyproteins also change to enhance catalysis

of the mutant protease.11 4

Despite conforming to contemporary design paradigms aimed at reducing drug resistance,

treatment failure has been observed in patients taking darunavir over an extended period." 5 The

protease flaps play a prominent role in darunavir resistance.116 An isolate named MDR 769 has

numerous mutation in the hinge region that connects the flap to the core domain. This variant was

the first protease crystallized with open flaps and provided insights into targeting the

conformation."1 7 A highly darunavir-resistant protease named PR20 has a 1000-fold reduced

affinity for darunavir."1 8 This variant has been crystallized in both open and semi-open forms and

includes similar hinge mutations to MDR769. EPR studies of MDR769 and PR20 revealed that

41

mutations in the hinge leads to changing the thermodynamically favored conformation from the

closed form to the semi-open form.1 9 MD simulations and NMR studies provided additional

structural details how the flap dynamics of PR20 are altered by hinge mutations. 120-121

Additionally, the variant P51 has been observed to bind darunavir in an alternative conformation

with open flaps.1 22 An emerging path to targeting these variants is enhancing interactions with the

flaps.1 2 3 New subsite targeting groups and core scaffolds are needed to continue fighting drug

resistance.

42

1.5 Versatile Recognition by Boronic Acids

Found to the left of carbon on the periodic table, boron exhibits features of both metals and non-

metals.1 2 4 As a p block element, the structure and bonding configuration of boron are similar to

those of organic compounds. Boron can, however, form stable ions like metals. Boron is neutral

when it forms three sigma bonds and becomes negatively charged with the addition of a fourth

bond that fills the octet. Boric acid is the thermodynamically favored form of boron in aqueous

solution. Oxidized species of boron with a single organic substituent are called boronic acids.

Boronic acids have emerged as an important intermediate in chemical synthesis.' 2 ' The unique

reactivity has also attracted the attention of medicinal chemists. Of note, boronic acids have been

extensively studied as protease inhibitors.1 26

1.5.1 Chemistry of Boronic Acids

The chemistry of boronic acids provides unique opportunities for medicinal chemists. Boronic

acids form a single bond with a carbon atom and two to three bonds with oxygen atoms. When

trivalent, a vacant p orbital exists on the boron leading to sp2 hybridization and a planar geometry.

This vacant orbital can form bonds with nucleophilic species including primarily 0 and but also

N, F, and S atoms. The formation of the tetravalent species creates an anionic, sp3 hybridized boron

with a tetrahedral geometry. In aqueous solution, boronic acids are in equilibrium between the

trigonal and tetrahedral states as described by the pKa (Figure 1.5).

Boronic acid pKa values are typically between 7 and 10.124 Functionalities substituted on

the organic component that act as electron withdrawing groups will limit donation to the vacant p

43

orbital on the boronic acid and reduce the pKa.127 Conversely, electron donating groups will add

to the p orbital and compete with hydration, thereby increasing the pKa. The exchange between

trigonal and tetrahedral states is fast and the two states are equally populated when the pH is equal

to the pKa.1 28 In addition to water, boronic acids can form bonds with alcohols and carboxylic acids

(Figure 1.5).129 Complexes with these ligands are referred to as boronic esters. The formation of

esters in aqueous contexts depends both on the solution pH and the pKa's of the boronic acid and

the ligand.13 0 The Kd of boronic esters is a function of pH. At pH near the boronic acid pKa, the

tetrahedral state and boronic ester formation is favored. Ester formation drops the pKa of the

boronic acid complex favoring the tetrahedral species. Planar esters typically have diminished

equilibrium constants relative to tetrahedral ones.' 3' Finally, ligands with greater acidity form

more stable complexes.

A special class of boronic acids form intramolecular esters. Phenylboronic acid with an

ortho benzyl alcohol group forms a cyclic boronic ester as a 5-membered ring. This ring is called

a "benzoxaborole". Benzoxaboroles have attracted substantial attention from medicinal and

carbohydrate chemists due to the enhanced sugar-binding capabilities. 3 2 The intramolecular ester

of benzoxaborole reduces its pKa to 7.4, which favors complex formation with ligands. Sugars

present many diols, which can form esters with boronic esters. Benzoxaboroles and boronic acids

in general, have the greatest affinity for 1,2-diols, specifically in the cis configuration, such as

those found on fructose and ribonucleotides. 33

44

1 < I DfD +- 1,U , A ,.AqI .. J ./-. %-%JU V CtlI%.4L %J L J'L J1 I VJL I L1 Uy 1.JL ji I%, I %ALI.a

The intriguing properties of boronic acids have found utility in the covalent recognition of

proteins. 134 A class of drugs that rely on conventional recognition strategies to position drugs at a

target site and, once bound, evoke the formation of a covalent bond are known as covalent

inhibitors. 3 5 Covalent inhibitors carry a distinct advantage: the dissociation rate constant of these

molecules is limited by the rate at which the covalent bond between the drug and the target breaks.

However, these types of molecules infrequently find clinical utility due to off-target reactivity. A

special class of covalent inhibitors forms bonds with active site residues in a reversible manner.

Boronic acids typify this group.

Several amino acids present side chains that can form complexes with boronic acids. The

majority of reported protein-boronic esters formed with hydroxyl groups of serine and threonine

sidechains. These residues are often employed as nucleophiles during catalysis.1 36 The formation

of complexes between serine and threonine sidechains has served as the basis of covalent

inhibitors. 3 7 In an extreme case, a tetravalent complex between two serine sidechains and a lysine

side chain was observed in inhibitors of penicillin-binding protein. 138 These groups typically only

form tetrahedral complexes as boronic ester formation reduces the pKa.

A classic example of boronic acid-based inhibitors are covalent inhibitors of proteases.1 39

A number of protease families employ serine or threonine as a nucleophile in amide hydrolysis.

The attack leads to a tetrahedral intermediate, which collapses as the nascent amino termini group

leaves to form an acyl-enzyme intermediate. Next, water hydrolyzes the ester linkage between the

intermediate and the active site serine through another tetrahedral intermediate. This transition

state is isosteric with a tetrahedral boronic acid ester. 4 0 The first FDA-approved boronic acid,

45

bortezomib, targets the 20S proteasome for the treatment of multiple myeloma. The boronic acid

is key to activity as the potency is dramatically reduced upon deboronation, a common form of

metabolism in vivo.141 In 2015, the orally administered boronic acid ixazomab received FDA

approval.1 4 2 As of 2018, delanzomib is in cancer clinical trials.143 These boronic acids utilize the

same mode of inhibition as bortezomib.1 4 4 Peptidic inhibitors of HIV-1 protease have been

reported; however, whether they form covalent bonds with the active site remains to be

determined. 145

Benzoxaboroles and esters thereof are an emerging class of clinical boronic acid inhibitors.

These compounds form boronic esters with the ribose ring of nucleotide cofactors. This

complexation in the active site prevents dissociation of the product, thus reducing the off rate to

trap the enzyme in an intermediate state. The first demonstration of this class came from inhibitors

of tRNA synthetases.1 46 Fungal variants of these enzymes are sufficiently orthogonal to humans

resulting in selective inhibition as exemplified by tavaborole which received FDA approval in

2014.147 Benzoxaboroles have also been employed as ligands for the metal center of PDE4 to act

as inhibitors.1 4 8 This work lead to the crisaborole, which received FDA approval in 2016 for the

treatment of psoriasis.1 4 9

1.5.3 Non-covalent Recognition of Proteins by Boronic Acids

In biological contexts, boronic acids present two hydroxyl groups. No other functional group can

so economically present this many hydrogen bond donors and acceptors. Molecular encapsulation

studies have revealed boronic acids more efficiently form hydrogen bond mediated dimers than

46

amides and carboxylates. 15 0 A be7oxaborole can present a hydroxyl grp nA kriAi ng xy

This group has been employed as a P-strand mimic in an inhibitor of rho-activated kinase 2.151

Non-covalent recognition is, however, an underutilized application of boronic acids. 3 4

47

1.6 Ribonucleases as Antiviral Therapeutics

The cells of vertebrates secrete a unique family of ribonucleases called pancreatic-type

ribonucleases (RNases). RNases exhibit a number of desirable features making them an attractive

protein to serve as the basis of chemotherapeutics.15 2 First, RNases share a conserved set of

disulfide bonds that confer resounding stability. Second, they possess cytotoxic ribonucleolytic

activity. Third, RNases do not require delivery agents, as they have the endogenous ability to gain

entry to the cytosol. These properties would make RNase highly cytotoxic, but cells are

safeguarded by the cytosolic ribonuclease inhibitor (RI) protein. Diminishing interactions with RI

without detriment to the desirable properties has been the focus of extensive work on the

engineering RNases as cancer chemotherapeutics.153

1.6.1 Structure of RNases

The RNase homolog found in the bovine pancreas, RNase A, is among the best studied

proteins.1 54 RNase A was the subject of seminal protein folding studies that demonstrated the

primary amino acid sequence contains enough information to direct the spontaneous folding of the

tertiary structure. Key to the folding of RNases is disulfide bond formation. This family of enzymes

have four conserved disulfide bonds that are important for maintaining the remarkable stability of

these proteins as well as organizing the active site residues.' 5 5 -15 6 Overall, RNases have a kidney

shape with a central P-sheet with three a-helices and several loops and turns (Figure 1.6.A).1 57

Cellular production of RNases occurs by translation of the protein into the ER where PDI can assist

in the formation of disulfide bonds prior to secretion.' 58 As indicated, these disulfides can also be

efficiently formed in vitro enabling preparative scale production of engineered variants.' 59

48

1.6.2 Catalysis by RNases

Catalysis by RNase A is also historically significant, as it was the first enzyme to have its

mechanism deduced.1 60 The active site consists of two key histidine residues and an accessory

lysine. Catalysis begins with His12 acting as a base and deprotonating the 2'-hydroxyl group of

the substrate RNA. RNases bind RNA to preorganize them for an intramolecular nucleophilic

attack by the ensuing 2'-alkoxide in-line with the 5'-oxygen of the phosphodiester, which becomes

the leaving group. His 119 protonates the nascent group to generate the 5'-hydroxyl group of the

product. The negative charge developing in the transition site is stabilized by a hydrogen bond

with Lys41. The first step produces a cyclic phosphodiester between the 2' and 3' oxygens that can

be hydrolyzed to a 3' monophosphate in a second step.' 61 There is limited sequence specificity,

however, including a preference for pyrimidine nucleotides upstream of the scissile phosphoryl

group. 162

RNase A and many of the homologs, including human RNase 1, are potent catalysts and

have catalytic efficiencies around kcat/KM = 10' M's-1, which is near the limit of diffusion. 163

Several intriguing RNases have limited ribonucleolytic activity. RNase 5, also known as

angiogenin, exhibits a second order rate constant of 102 M-Is-1.164 Remarkably, this relatively low

activity can stimulate neovascularization. Recent studies have identified angiogenin specifically

interacts with the promoter associated RNA of ribosomal RNA.1 65 Another weakly active RNase

is ranapirnase, also called Onconase* (ONC), from the oocytes of the northern leopard frog. This

unique variant employs an N-terminal pyroglutamate in the active site. Additionally, this enzyme

49

is among the most stable with a Tm value of 900 C, likely needed to withstand harsh terrestrial

conditions. 166

1.6.3 Internalization of RNases

RNase A helped pave the route to understanding the role of charge in the cellular uptake of

proteins. The glycocalyx is a 0.5-3.0 ptM think coat of glycans and glycoproteins that protects the

10-nm thick plasma membrane.1 67 This admixture of functional groups prominently features

carboxylates and sulfates and imparts a negative charge to the glycocalyx. Ribonucleases and other

cationic proteins bind the cell surface through Coulombic interactions. 168 -169 This property can also

be conferred to proteins through the addition of a terminal tag by recombinant expression or

chemical means.1 70-171 Once associated with the cell surface, endocytosis brings cationic proteins

into the cell through one of a variety of processes, including macropinocytosis and clathrin-

dependent endocytosis.' 72 The contents of the endosome are destined for destruction as the vesicle

matures into a lysosome.1 73 Acidification is thought to enable RNases and other cell-penetrating

proteins to escape the endosome through a process that is not well understood.1 74 Unlike many

therapeutically desirable proteins, RNases have the innate ability to access the cytosol. Unchecked

RNase activity in the cytosol leads to apoptosis.17 5

1.6.4 Evasion of Ribonuclease Inhibitor

Cellular RNA is safeguarded from cytotoxic RNase activity by RI.1 76 Ribonucleases form among

the tightest known protein-protein interactions with RI. Human RNase 1 and RI form a complex

50

with a Kd value of 10-16 NM (Figure 1.6.B).1 77 Sties of n s s affinities riwealed that

differences in RNases have led to compensatory changes in RI to maintain high affinities.78179

Two key properties explain the remarkable affinity between RNases and RI: shape

complementarity and charge distribution. An edge of the horseshoe-shaped RI fits snugly in the

active site of RNases to competitively inhibit substrates binding.' 80 Onconase, a remote homolog,

has a unique shape and is only weakly inhibited under low salt conditions which enhances

Coloumbic interactions.181 In addition to the complementarity of shape, RI and RNase have pairs

of charged residues arranged to enhance binding. 8 2 Disruption of these interactions has been a

prominent approach in efforts to make mammalian RNases cytotoxic to evade RI and engender

cytotoxicity. 153, 183

1.6.5 Antiviral Activity of RNases

The biological functions of RNases are emerging, but their biology remains a fertile area of

research.1 84 RNases are involved in innate immunity. 185 Human RNases 2, 3, and 7 are employed

by the immune system as cytotoxic agents toward invading pathogens.18 6-187 Cells secrete

angiogenin and RNase 4 to suppress HIV infection.1 88 Additionally, antiviral activities of several

RNases has been observed.' 8 9-'90 These encouraging results make RNases promising agents

serving as the basis of novel antivirals.

51

Figure 1.3

52

A

HIV-1 protease-JG-365B

renin-aliskirenC

pepsin-pepstatin

Figure 1.3. Structures of Aspartic Proteases Bound to Ligands. Retroviruses like HIV utilize a

homodimeric protease has part of their lifecycle. A. The structure of HIV-1 protease is shown in

complex with an early inhibitor (pdb: 7hvp).1 91 Alternatively, eukaryotes possess monomeric

aspartic proteases. Renin cleaves angiotensin and subsequent activation leads to increased blood

pressure in human. B. The drug aliskiren inhibits renin an acts to lower blood pressure (pdb:

2vOz). 192 C. Pepsin is responsible for digestion and can be inhibited by the natural product,

pepstatin (pdb: Ipso).1 93 Inhibitors of aspartic proteases will be discussed in greater detail in

Chapter 4.

53

Figure 1.4.1

A

H0 OH0 H OH O

N N NJ-KN NyQIQ OHpeptat H 0

pepstatin

B OH OH2N OH

statine

HO OH

NH

tetrahedral intermediate

54

Figure 1.4.1. Structure of Pepstatin, Statine, and a Tetrahedral Intermediate. A. Pepstatin is a

natural product that contains two statine residues. B. Statine is resembles leucine but contains an

additional hydroxylethlyene group. C. The hydroxyethylene group mimics the tetrahedral

intermediate formed after the first step of catalysis by HIV protease.

55

Figure 1.4.2

First generation

H2N 0

H0 H PH H,, H

Nz N NNHH0

0 HHb 0;

saquinavir

N:N OH H OHOH

Ndv

indinavir

OH

H "HHO-L%

0/ +

H OH H

nelfinavir

H 1 0N N Ny< H)~

amprenavir r n vritonavir

Second generation

HOH r

0 00 _y\ NNK NHO N N

0 N W NH& 2 0 0

H 0

-/ \ lopinavirca2

+

fosamprenavir H 0 0 _

0 -,, N NH2H H OH(,r

0 OH 0ONN

H 0

0 Nz N0

atazanavir

darunavir

. tipranavir

OH H I F

*tipranavir F

56

Figure 1.4.2. FDA Approved Inhibitors of HIV Protease. As of 2018, ten protease inhibitors have

been approved for the treatment of HIV. Arrows indicate the progression in design of inhibitors

with a common core scaffold.

57

HO, BOH

RBoronic Acid (sp9)

+H 20 HO OHHO-B-

RBoronate (sp)

II +HOR'

HO 0'B' 0R'R

Boronic Ester (sp2)

+H 20 HO 0HO-B- R'

R

Boronate Ester (sp3)

58

Figure 1.5

Figure 1.5. Equilibria of Boronic Acids in Aqueous Solution. Boronic acids are planar at pH below

the pKa. Boronates are favored at pH above the pKa. Boronic acids can also accept organic oxygen

species to form esters and the coordination state is driven by the boronic ester pKa which is

typically lower than the corresponding boronic acid.

59

Figure 1.6

A scissilebond

Lys41

His118

Hisl2

RNase A-d(ApTpApA)

B

RNase 1-RI

60

Figure 1.6. Structures of RNases Bound to Ligands. RNases bind nucleic acids with a cationic

active site cleft. A. This is demonstrated by a structure of RNase A bound to a substrate analog,

DNA (pdb: lrcn).1 94 RNases are potently inhibited by RI. B. Human RNase 1 and RI form a highly

complementary complex that blocks the substrate from binding the active site.1 82

61

Chapter 2

Fluorogenic Assay for Inhibitors of HIV-1 Protease

with Sub-picomolar Affinity

Contribution:

I designed and performed all experiments. I wrote the initial manuscript and contributed to the

revision of the final manuscript.

This chapter has been published in part, under the same title. Reference:

Windsor, I. W.; Raines, R. T., Fluorogenic assay for inhibitors of HIV-1 protease with sub-

picomolar affinity. Sci. Rep. 2015, 5, 11286.

62

Abstract

A fluorogenic substrate for HIV- 1 protease was designed and used as the basis for a hypersensitive

assay. The substrate exhibits a kcat of 7.4 s-1, KM of 15 tM, and an increase in fluorescence intensity

of 104-fold upon cleavage, thus providing sensitivity that is unmatched in a continuous assay of

HIV- 1 protease. These properties enabled the enzyme concentration in an activity assay to be

reduced to 25 pM, which is close to the Kd value of the protease dimer. By fitting inhibition data

to Morrison's equation, Ki values of amprenavir, darunavir, and tipranavir were determined to be

135, 10, and 82 pM, respectively. This assay, which is capable of measuring Ki values as low as

0.25 pM, is well-suited for characterizing the next generation of HIV- 1 protease inhibitors.

63

2.1. Introduction

HIV-l protease is a dimeric, aspartic acid protease. This enzyme is not only an important target

for chemotherapeutic agents but also has been a key model for the development of structure-based

drug design and in studies of drug resistance.195-197 Hundreds of small molecules that bind to the

enzymic active site and inhibit proteolyic activity have been characterized, ten of which have

achieved regulatory approval and clinical relevance for the treatment of HIV/AIDS. Three

inhibitors in particular, amprenavir, darunavir, and tipranavir, have garnered significant attention

due to their picomolar inhibition constants and reduced susceptibility to drug-resistance mutations

in the viral genome. 96, 103, 118, 198-200

Different workers have used different assays to assess inhibition by tight-binding inhibitors

of HIV-l protease. The ensuing values of inhibition constant (Ki) for the same inhibitor range over

several orders-of-magnitude (Table 2.1). These discrepancies, along with the pending emergence

of even more potent inhibitors 2 01 requiring even more sensitive characterization techniques,

provided an impetus for our work.

Solution-phase assays based on enzymatic activity are the oldest and still the most popular

approach for characterizing the inhibition of HIV-I protease.48-4 9 The loss of enzymatic activity

upon addition of inhibitor is employed as a metric to assess the formation of an enzyme inhibitor

complex. Classical methods employ graphical analysis to estimate a value of IC5 o and its

conversion to a value for the inhibition constant, Ki.202 This approach is limited to assay conditions

in which the Ki value is above or near the enzyme concentration.

High-affinity inhibitors possess Ki values below the usable enzyme concentrations of

traditional assays. As little free inhibitor is present under sub-saturating conditions, the observed

64

PnfzyfmtiC ctivity deC linrly with inhibitcr etrtion. This hgh- 1t inh;itor

problem was remedied by curve fitting as described by Morrison, 203 but a computational analysis

has revealed that the reliability of his methodology is limited to the determination of Ki values that

are no less than 100-fold below the enzyme concentration. 204

An alternative assay for characterizing high-affinity inhibitors of HIV-1 protease is

isothermal titration calorimetry (ITC). 20 -20 6 ITC is made powerful by its label-free nature and

provision of a full thermodynamic characterization. Still, ITC has notable disadvantages compared

with activity-based assays. For example, ITC has a theoretical limit in the low nanomolar range

for the direct measure of an equilibrium disassociation constant (Kd).

The key to assessing the next generation of HIV-1 protease inhibitors is reducing the

enzyme requirement in activity-based inhibition assays. Towards that end, we report here on the

design and characterization of a novel fluorogenic substrate for HIV-1 protease. Its attributes-

high kcat and kcat/KM values and high signal-to-noise ratio-are unprecedented, and enable the

rapid, facile determination of sub-picomolar values of Ki.

2.2. Results

2.2.1. Substrate Design

The first major design criterion for an improved fluorogenic substrate was identifying a peptide

that was bound by HIV protease with high affinity and cleaved rapidly in a catalytic manner. To

meet this criterion, we employed a peptide substrate that had been selected by phage display. The

sequence GSGIFLETSL was reported to have kcat/KM = 1.3 iM-s-1 for cleavage between the

phenylalanine and leucine residues.207 These values were determined by a discontinuous HPLC

65

method. Under the same conditions, an endogenous cleavage site in the HIV polyprotein has

kcat/KM = 0.022 pM-Is-1. 2 07

The second major design criterion was employing a sensitive method to detect substrate

turnover in a continuous manner. We focused on the loss of Fbrster resonance energy transfer

(FRET), which underlies many useful assays,2 08 and considered three donor/acceptor moieties.

Appending the fluorophore p-aminonitrobenzoic acid (Abz) and installing a nitro group in the para

position of phenylalanine as a quenching chromophore is the basis for a popular HIV protease

substrate.49 This pair, however, lacks sensitivity, as Abz is a weak fluorophore and its use provides

a relatively low signal-to-noise ratio. FRET between fluorescein and rhodamine is the basis for

some of the most sensitive known assays for enzymatic activity, 209 but protonation of a fluorescein

moiety at pH 5, which is optimal for catalysis by HIV-l protease,4 8 compromises the utility of this

pair for our purpose, and others.2 10-2 11 We chose 5-((2-aminoethyl)amino)naphthalene-l-sulfonic

acid (EDANS) and 4-(4-dimethylaminophenylazo)benzoic acid (DABCYL) as a FRET pair.4 8, 209

We installed these moieties into GSIFLETSL by replacing the glycine residue at the N-terminus

with a glutamate-EDANS conjugate and by replacing the leucine residue at the C-terminus with a

lysine-sDABCYL conjugate.48 Finally, we added an arginine residue to each terminus to enhance

aqueous solubility at pH 5, thereby generating substrate 1 (Figure 2.1).

2.2.2. Assay Design

Our assay was designed to minimize the enzyme concentration while maintaining a high signal-

to-noise ratio. An inherent complexity is that HIV-1 protease is an obligate dimer. Significant

attention has been paid to the Kd value of the HIV- 1 protease dimer, and conflicting ideas abound

66

regaruing an appropriaL Cnzymev CInCInRLaL11 f01 aCtivity assays. 1 re1ir anu coworkers

employed a rigorous thermodynamic approach to determine a dimer Kd value of 23 pM.2 13 We

used this value as a lower limit for the enzyme concentration in our assays. Additionally, initial

velocities require measurements from <10% substrate turnover, and a convenient upper limit for

the enzyme concentration in our assays was determined to be 6.5 nM. Upon cleavage by HIV-1

protease, the fluorescence intensity (I) of substrate 1 increases by I/Jo = 104.

2.2.3. Michaelis-Menten Kinetics

Michaelis-Menten kinetics were used to evaluate the performance of substrate 1 as a substrate for

HIV-1 protease. The initial velocity was directly proportional to enzyme concentration at a fixed

substrate concentration (Figure 2.2A), and increased with substrate concentration at a fixed

enzyme concentration (Figure 2.2B). The latter data fitted well to the Michaelis-Menten equation

(Figure 2.2C). The observed Vmax value of 1.58 nM s- 1 for substrate 1 at an enzyme concentration

of 214 pM afforded a kcat value of (7.4 0.2) s-1; the KM value was (14.7 1.0) [tM (Table 2).

2.2.4. Determination of Ki Values with Morrison's Equation

Substrate 1 was used as the basis for assays of the inhibition of HIV-1 protease by amprenavir,

darunavir, and tipranavir (Figure 2.3). The data fitted well to Morrison's equation, and afforded

values of Ki values (Table 2.1). Darunavir and tipranavir exhibited time-dependent inhibition;

amprenavir did not. Pre-equilibrium data were omitted from the initial-velocity data fitted by linear

regression.

67

2.3. Discussion

An assay of high sensitivity is critical for assessing the efficacy of high-affinity inhibitors of

enzymatic activity. The HIV- 1 protease substrate described herein provides initial velocity data of

unmatched quality and sensitivity. Substrate 1 has a 1.5-fold higher kcat value, 7-fold lower KM

value, and higher signal-to-noise ratio than does the parent substrate developed by Matayoshi and

coworkers (Table 2.2).48 The Abz-based substrate developed by Toth and Marshall has a similar

KM value, though substrate 1 provides a 17-fold greater signal-to-noise ratio.49 These

improvements in kinetic parameters have enabled us to reduce the concentration of enzyme in

standard assays to values close to the Kd value of dimeric HIV- 1 protease.

To quantify the utility of a substrate, we define the sensitivity (S) of an assay as the increase

in fluoresence intensity brought about by the action of an enzyme on a low concentration of

substrate. We express sensitivity (S) as the product of the kinetic parameter kcat/KM and the

spectroscopic parameter If/Io: S= (kcat/KM)(If/Io). By this measure, the sensitivity of an assay that

uses substrate I is >I 0-fold greater than any fluorescence-based assay for HIV-I protease activity

(Table 2.2).

The values of Ki for amprenavir, darunavir, and tipranavir derived from initial velocities

for the cleavage of substrate 1 are consistent with literature values (Table 2.1). Notably, the Ki

value for darunavir determined herein is comparable to literature values and much closer to values

reported by other activity-based methods than is the value determined by an indirect, competitive

displacement method, 201 which is two orders of magnitude lower. The high signal-to-noise, low

variation in initial velocity measurements and low standard error for fits of inhibition data by

Morrison's equation makes substrate 1 a useful probe for assaying high-affinity inhibitors.

68

LO a co11puLaL1na1 assessment, Morrison's equaL1in ca11n Ue useU Lo determine

values of Ki that are up to 100-fold lower than the concentration of enzyme in an assay.204 Because

enzyme concentrations as low as 25 pM provided valuable data herein, we believe that our assay

can be used to determine Ki values that are >250 fM. We anticipate the use of substrate 1 in the

development of the next generation of HIV- 1 protease inhibitors.

2.4. Materials and Methods

2.4.1. Materials.

The HIV-1 protease inhibitors darunavir (from Tibotec, Inc), amprenavir, and tipranavir were

obtained through the AIDS Reagent Program, Division of AIDS, NIAID, NIH. Substrate 1 was

synthesized and purified by HPLC to 99.5% by Biomatik (Wilmington, DE). All inhibitors and

peptides were used without further purification.

2.4.2. Plasmid Preparation.

Double-stranded DNA encoding a pseudo-wild-type HIV-1 protease and flanked by regions of

homology near the T7 promoter and terminator found in the pET32b vector was obtained from

IDT (Coralville, IA). This HIV-1 protease had Q7K, L331, L631, C67A, and C95A substitutions. 2 14

Linear pET32b was prepared by PCR using primers that were the reverse complements of the

DNA encoding HIV-1 protease. Gene and plasmid fragments were combined with Gibson

assembly. 215

69

2.4.3. Protein Purification.

BL-21 codon-plus RIL from Agilent Technologies (Santa Clara, CA) was transformed freshly with

the pET32b-HIV protease. A single colony was used to inoculate 1 L of Luria-Bertani medium

containing ampicillin (200 [M) in a Fernbach flask shaken at 37 'C. Expression was induced by

the addition of IPTG (to 2 mM) upon reaching saturation (OD600 nm 2.8-3.4), and the culture was

grown for an additional hour. HIV-1 protease was purified and folded as described previously. 2 16

Cells were pelleted, resuspended in 20 mM Tris-HCl buffer, pH 7.4, containing EDTA (1 mM)

and lysed at 18 kPSL using a cell disruptor from Constant Systems (Kennesaw, GA). Inclusion

bodies were isolated by centrifugation at 1 0,000g for 10 min. The pelleted inclusion bodies were

washed with resuspension buffer containing urea (1.0 M) and Triton X-100 (1% v/v), and again

with resuspension buffer. Inclusion bodies were isolated by centrifugation and lyophilized.

Inclusion bodies were dissolved by sonication in aqueous acetic acid (50% v/v) at a

concentration of 5 mg/mL. The solution was clarified by centrifugation, and soluble protein was

applied to a Superdex 75 gel-filtration column from GE Healthcare Bio-Sciences (Pittsburgh, PA)

that had been pre-equilibrated with aqueous acetic acid (50% v/v). Unfolded HIV- 1 protease that

eluted as major peak near one column-volume was pooled and lyophilized. HIV-1 protease was

folded at a concentration of 0.1 mg/mL in 100 mM sodium acetate buffer, pH 5.5, containing

ethylene glycol (5% v/v) and glycerol (10% v/v). The solution of folded HIV-1 protease was

clarified by centrifugation and concentrated with an Amicon stirred-cell concentrator equipped

with a 10K MWCO membrane from EMD Millipore (Billerica, MA). The concentrated protease

was applied again to a Superdex 75 gel-filtration column that had been pre-equilibrated with the

folding buffer. A new major peak containing dimeric HIV- 1 protease was pooled and concentrated.

70

The folding buffer was exchanged for 1 mM sodium acetate buffer, pH 5.0, containing NaCl (2

mM) using a PD-10 desalting column. A solution (~1.5 mg/mL) of purified HIV-1 protease was

flash-frozen in liquid nitrogen and stored at -80 'C until use.

2.4.4. Enzymatic Activity Assays.

Substrate 1 was dissolved at a concentration of 1.0 mM in DMF containing TFA (0.1% v/v).

Fluorescence of the EDANS moiety was measured on a M1000 Pro plate reader from Tecan

(Maennedorf, Switzerland) by excitation at 340 nm and observation of emission at 490 nm. A

fluorophore calibration was performed to enable quantitation of assay data. The product exhibits

a fluorescence of 70 RFU/nM at a gain setting of 216, and all assays were performed at this gain

setting unless indicated otherwise. Assays were performed in a Coming black, flat bottom, non-

binding surface, 96-well plate. Assays were conducted at room temperature in 200 tL of 50 mM

sodium acetate buffer, pH 5.0, containing NaCl (0.10 M), DMF (2% v/v), substrate 1 (1-40 pM),

and HIV-1 protease (25 pM-6.5 nM). Assays with 30 and 40 pM of substrate 1 required 3% and

4% v/v DMF, respectively. Inhibition assays were conducted with picomolar-nanomolar inhibitor

(depending on the enzyme concentration and Ki value) and 10 pM substrate 1. Inhibition assays

were monitored for until <7% of the substrate was converted to product. Initial velocities were

measured in quadruplicate.

Solution concentrations of HIV-1 protease (10.7 kDa) was determined by measuring the

absorbance at 280 nm and estimating the extinction coefficient as 12,500 M-cm-1 with software

from ExPASy.2 17 The fraction of active enzyme was determined by active-site titration and found

to be 76% with respect to the value based on the A 280 nm. Fluorescence was monitored over the

71

linear range of the detector, which corresponds to 700 nM of product formation at a gain setting

of 216.

2.4.5. Data Analysis.

The velocity (v) of all enzyme-catalyzed reactions was obtained by linear fit of initial-velocity data

using Prism 6 software from Graphpad (La Jolla, CA). Pre-equilibrium values from the beginning

of data sets were removed to provide fluorescence measurements that were linear as a function of

time (Figure 2.3).

Values of v in the absence of an inhibitor were fitted to the Michaelis-Menten equation

(eq 2.1) by non-linear regression using Prism 6 software.

k [E] [S] (2.1)KM + [S]O

In eq 2.1, [S]. refers to the concentration of substrate 1 prior to the addition of enzyme.

Values of v in the presence of an inhibitor were fitted to Morrison's equation (eq 2.2) by non-linear

regression using Prism 6 software.

[E]+[I]+K. 1-+ IO [E]+[I]+K. 1+ ISO -4[E] [1]V KM KM (2.2)

v_ 2[E]

72

In eq 2.2, v0 refers to the reaction in the absence of inhibitor. Enzymatic activity measured

in the absence of an inhibitor was used to determine the enzyme concentration for data obtained

in the presence of an inhibitor. These enzyme concentrations, which agreed ( 10%) with values

estimated by active-site titration, were used as constraints for the non-linear regression analysis.

2.5. Acknowledgments

I.W.W. was supported by BiotechnologyTraining Grant T32 GM008349 (NIH). Inhibitors were

obtained from the NIH AIDS Reagent Program. This work was supported by Grant RO1

GM044783 (NIH).

73

Table 2.1

74

Table 2.1. Inhibition Constants (pM) Reported for Amprenavir, Darunavir, and Tipranavir

Assay type Amprenavir Darunavir Tipranavir Reference

Enzymatic activity 100 8 88 198

(Unknown) 600 96

Enzymatic activity 8 103

(Unknown) 16 199

ITC (competitive displacement) 390 4.5 200

fluorescence (competitive displacement) 36 0.147 - 201

Enzymatic activity 57 - - 206

Enzymatic activity 135 6 10 1 82 6 This work

Table 2.2

75

Table 2.2. Kinetic Parameters of Popular HIV- 1 Protease Substrates and Substrate 1

kcat/Km

Substrate kcat (S-1) Km (pM) (pM'-s-') If/io (pM-'s)-1

DABCYL-SQNYPIVQ-EDANSb 4.9 0.2 103 8 0.048 0.004 40 2

Abz-TINleF(p-NO2)QR 8.2 0.4c 13 1 0.63 0.06 6d 4

Substrate iY 7.4 0.2 14.7 1.0 0.50 0.04 104 52

aSensitivity: S = (kcat/KM)(I/Io). bValues ( SD) are from ref. 48. Values ( SE) are from ref. 206.

dValue is from ref. 49. 'Values ( SE) are from this work.

Figure 2.1

-0 3 S

NH O

NHHN

0RESGI FLETSKR

N-

N"N

HIV-1 protease,

-03 S

NH

HN0

RESGIF

N-

NN

+ ONH

LETSKR

Substrate 1

76

Figure 2.1. Structure of Substrate 1. Substrate 1 includes EDANS and DABCYL on opposite sides

of the scissile bond. Hydrolysis catalyzed by HIV-1 protease relieves quenching by the DABCYL

moiety, enabling quantitation of the product (which contains the EDANS moiety) with

fluorescence spectroscopy.

77

Figure 2.240-

A20-

10-

200- 0 200

[HIV-1 Protease] (pM)

IL100-

0-00 2 4 6

[HIV-1 Protease] (nM)B 50000 -

40 pM

30 pM

20 pM30000 - 15 pM

10 pM2of7.5 pM

5.0 pM10000 ........ 2.5 pM

1.o pM0

0 60 120 180 240 300Time (s)

C

1.0-

Cl)

0.5 -

0.00 10 20 30 40

[Substrate 1] (pM)

78

Figure 2.2. Catalysis of the Hydrolysis of Substrate 1 by HIV-1 Protease. Initial velocities were

measured in 50 mM sodium acetate buffer, pH 5.0, containing NaCl (0.10 M), DMF (2% v/v),

TFA (0.002% v/v), substrate 1, and HIV-1 protease. (A) Plot of initial velocities at 10 pM substrate

1 and 130 pM-6.5 nM HIV-1 protease (3 replicates with a gain setting of 180). Inset: Plot of initial

velocities at 10 pM substrate 1 and 25 pM-250 pM HIV-1 protease (4 replicates). Data were fitted

by linear regression. (B) Progress curves at 1-40 pM substrate 1 and 214 pM HIV-1 protease (4

replicates). Data were fitted by linear regression to give initial velocities. (C) Plot of initial

velocities from panel B. Data were fitted by non-linear regression to the Michaelis-Menten

equation (eq 1) to derive the kinetic parameters listed in Table 2.2.

79

Figure 2.3

0 pM

15000 100 pM

250 pM

10000- 500 pM

1000 PM1500 PM

50001 2000 pM

0 120 240 360 480 600Time (s)

0 pM

20 pM40 pM

70 pM90 pM 100 pM

110 pM120 pM150 PM200 pM

40000 -

30000-

20000-

1200Time (s)

1800600

30000 -

20000-

10000-

0 pM50 pM

100 pM200pM300 pM

500 pM750 pM

1000 pM2000 pM

480120 240 360Time (s)

1.04

I /V0

0.5-

0.0-0

1.0-

0.5-

0.0-

1.0-4

V/V0

0.5 -

0.0 -

500 1000 1500[amprenavir] (pM)

2000

S

) 50 100 150[darunavir] (pM)

2200

IS

S

0 500 1000 1500 2000[tipranavir] (pM)

80

A

U-

B

DLL

C

LL

Figure 2.3. Inhibition of HIV- 1 Protease by Amprenavir, Darunavir and Tipranavir. Plots showing

the inhibition of HIV-1 protease in 50 mM sodium acetate buffer, pH 5.0, containing NaCl (0.10

M), DMF (2 % v/v), TFA (0.002% v/v), and substrate 1 (10 tM) by (A) amprenavir at 120 pM

enzyme (R2 = 0.99), (B) darunavir at 100 pM enzyme (R 2 = 0.96), and (C) tipranavir at 200 pM

enzyme (R 2 = 0.98) (4 replicates). Data were fitted by non-linear regression to Morrison's equation

(eq 2) to derive the values of Ki listed in Table 2.1.

81

Chapter 3

Substrate Selected by Phage Display Exhibits

Enhanced Side-chain Hydrogen Bonding with HIV-1

Protease

Contribution:

I performed all experiments and wrote the initial manuscript. I contributed to structural analysis

and revision of the final manuscript.


Windsor, I. W.; Raines, R. T., A substrate selected by phage display exhibits enhanced side-chain

hydrogen bonding to HIV-1 protease. Acta Crystallogr. 2018, D74, 690-694.

82

Abstract

Crystal structures of inactive variants of HIV- 1 protease bound to peptides have revealed how the

enzyme recognizes its endogenous substrates. The best of known substrates is, however, a

nonnatural one that was identified by directed evolution. We report the crystal structure of the

complex between that substrate and the D25N variant of the protease at a resolution of 1.1 A. The

structure has several unprecedented features, especially the formation of additional hydrogen

bonds between the enzyme and the substrate. This work expands the understanding of molecular

recognition by HIV- 1 protease and informs the design of new substrates and inhibitors.

83

3.1. Introduction

Elaboration of how HIV- 1 protease recognizes its endogenous substrates has been a triumph of

structural biology. 65',218-220 The homodimeric protease is known to bind peptidic substrates between

its core and flaps through formation of a mixed P-sheet-like motif. These conserved interactions

with the main chain diminish reliance on substrate side chains for recognition. The side chains of

bound substrates are buried in subsites (Figure 3.1A) through hydrophobic and non-conserved

hydrogen bonding interactions. Accordingly, HIV-1 protease substrates lack a rigid consensus

sequence (Table 3.1). This variability could provide spatial and temporal regulation of proteolytic

processing.221

Endogenous substrates exhibit modest affinity for of HIV- 1 protease, having values of KM

in the millimolar to high micromolar range. Despite extensive efforts, few good substrates for HIV-

1 protease have emerged from rational design.222 In contrast, directed evolution has generated

excellent substrates. 70, 207 In previous work, we employed a substrate for HIV-1 protease with a

low micromolar KM value, SGIFLETS, as the basis for a hypersensitive assay of catalytic

activity. 223 Here, we report on the high-resolution X-ray crystal structure of the complex of that

substrate with an inactivated protease variant.

3.2 Results and Discussion

3.2.1 SGIFLETS Binds in Alternative Orientations

Unlike analogous complexes, the substrate in the D25N HIV- 1 protease-SGIFLETS complex lies

in two antiparallel orientations (Figures 3.2a and 3.2b). These orientations are not of equal

occupancy (0.6 and 0.4 for A and B, respectively). Chemical symmetry (Table 3.1) of the residues

84

i" the P3 through P+' pksii n aa n rsiu at hrth the PA and the P4' 'positions areillI LJLL% I. .J tLJ LXJL1 . X..J yF.J Lamla~& "L1%_& ". a3,1LL I L ' L X'L LJLA'% X. T ".LII'.4 LIIJ% _F 1-"-'XI 1,%

characteristics of SGIFLETS that could have led to this redundancy. Moreover, the protease flaps

in the complex with SGIFLETS are in a previously unreported conformation in which a bridging

water molecule (wat254) accepts hydrogen bonds from the main-chain amides of both Ile50 and

Gly5 1, which are residues in the flaps. In other HIV- 1 protease structures, an intersubunit hydrogen

bond forms between main-chain atoms of Ile50 and Gly51. Despite unique interactions with its

side chains, SGIFLETS is recognized by the protease through conserved interactions (Figure

3.2C).

3.2.2 SerlA and Gly2A at the P4 and P3 Positions Occupy Alternative Conformations

Elder and coworkers used directed evolution (i.e., phage display) with the intent of diversifying

the P3 to P3' residues. 207 Instead, they found that the residues of SGIFLETS varied in the P2 to

P4' positions. Elder and coworkers postulated that a marked preference for serine in the P4 position

led to a high frequency of serine and glycine at the P4 and P3 positions in the selected substrates.

Yet, both of these residues occupy alternative conformations in the major substrate orientation

(conformation A) of the protease SGIFLETS structure.

Few endogenous substrates exhibit alternative binding modes for P3 and P4 residues. 65

Unlike the conserved recognition strategy wherein the side-chain of Asp29 accepts a hydrogen

bond from the P3 main-chain N-H and the N-H of Gly48 donates a hydrogen bond to the P4 main-

chain carbonyl oxygen (Figure 3A), the P3/P4 amide of the pl/p6 substrate interacts with the

carbonyl oxygen of Gly48 and the side-chain N12-H of Arg8 (Figure 3.3B). The

protease-SGIFLETS complex employs both recognition strategies (conformations A/B and C).

85

Though found in opposite orientations relative to the protease, conformations A and B of Serl and

Gly2 share the conserved P-sheet mode of main-chain recognition with the side-chain hydroxyl

group of SerlA/B forming a unique hydrogen bond with Lys45 in the protease flap (Figure 3.3C).

The alternative conformer (conformation C) of the major orientation is reminiscent of the

alternative recognition mode observed in the p l/p6 complex with SeriC instead forming a unique

hydrogen bond with the carbonyl oxygen of Gly49 (Figure 3.3D). Alternative recognition of the

P4 main-chain carbonyl group in the p I/p6 and selected substrates occur through both direct and

water-mediated interactions with Arg8. The unique hydrogen bonding exhibited by Serl provides

a structural explanation for the preference of serine and glycine at the P4 and P3 positions.

The tips of the protease flaps were also resolved in a previously unreported interaction

where a bridging water molecule accepts hydrogen bonds from the main-chain N-H of both Ile50

and Gly51 (Figure 3.3D). The occupancy of this novel water-bridge correlates with the previously

unreported hydrogen bond between the side-chain of SerC and the carbonyl oxygen of Gly49B.

Rotation of Gly49B to accept the hydrogen bond appears to move the tip of the flap into a

conformation that is incompatible with the inter-flap hydrogen bond, thus enabling water-bridge

formation.

3.2.3 Glu6 and Ser8 Form a Network of Hydrogen Bonds

Weber and coworkers identified hydrogen bonds between the side-chain carboxyl group of a

glutamic acid residue at position P2' of the CA/p2 substrate and the side chain of Asp30 of the

protease. 77 This interaction is also apparent in the protease-SGIFLETS complex (Figure 3.4A).

Given the pH of 5 at which these crystals were grown, a plausible explanation for the 2.7 A

86

interatomic distance between of Glu6 (P2') and 02 of Asp3(chain A) is fnrmat;in F n

intraresidue hydrogen bond (Figure 3.4B). Such a hydrogen bond is consistent with the substantial

increases in the Michaelis constant (KM) of the peptide substrate and the inhibition constant (Ki)

of an analogous inhibitor upon increasing the pH from 5.6 to 6.7.207 The different interatomic

distances (2.7 A and 3.3 A) between 081 and 0,2 of Glu6 and 062 of Asp30 suggest a single

hydrogen bond with the proximal oxygen and not a bifurcated hydrogen bond.224

Serine residues at P4/P4' also form hydrogen bonds with the carboxyl group of Asp30

(Figure 3.4A). Few polar interactions have been revealed between Asp30 and residues in the P4/P4'

positions, including arginine and serine. Neither of these interactions occur alongside a P2/P2'

interacting side chain (Figure 3.4B). In the P-strand conformation of bound substrates, the side

chains of adjacent residues (i ... i + 1) are farther from each other than are the side chains of two

residues with an intervening one (i . i + 2).225 Bulky groups can lead to a steric clash between side

chains and only spatially compatible amino acids are found at the i and i + 2 positions. The

structure of the protease -SGIFLETS complex reveals the interdependence of P2' and P4' residues

where, in addition to sterics, the identity of the residue is constrained by donor-acceptor

interactions.

3.2.4 Thr7 Plays a Limited Role

Endogenous protease substrates employ 2-5 polar residues in the core recognition sequence. Yet,

only some of these side chains participate in hydrogen bonds, with an average utilization of about

60%.226 In the protease SGIFLETS complex, three of the four polar side chains of SGIFLETS

form hydrogen bonds. The exception is Thr7. The protease buries little of the P3/P3' side chain,

87

leaving residues in this position largely exposed to solvent. Although threonine is a residue in the

P3' position of HIV- 1 protease substrates identified by phage display,207 that position seems to

have a limited role in substrate specificity 69 and could be a site for further optimization.

3.3 Materials and Methods

3.3.1. Protein

The expression plasmid for D25N HIV-1 protease was prepared as described previously 223 with

modifications. An initiating methionine codon was placed directly before the native N-terminal

proline residue, and an AAC codon was used for residue 25. D25N HIV protease was produced

heterologously in Escherichia coli cells grown in Luria-Bertani medium. Expression was induced

when the OD reached 1.5 at 600 nm, and cells were grown for an additional 4 h. Cell pellets were

suspended in 20 mM Tris-HCl buffer, pH 7.4, containing EDTA (1 mM), lysed with a cell

disrupter from Constant Systems, and collected by centrifugation at 10,500g for 30 min. The cell-

pellet was dissolved in 20 mM Tris-HCl buffer, pH 8.0, containing urea (9 M), and this solution

was clarified by centrifugation at 30,000g for 1 h. The supernatant was flowed through a 0.2-tm

filter and a Hitrap Q column from GE Healthcare, which removes anionic contaminants 227 . To fold

the protease, the resulting solution was diluted 20-fold by dropwise addition into 50 mM sodium

acetate buffer, pH 5.0 containing NaCl (100 mM), ethylene glycol (5% v/v), and glycerol

(10% v/v). The solution of folded protease was concentrated by using a stirred cell concentrator

from Amicon and applied to a G75 gel-filtration chromatography column (GE Healthcare) that

had been equilibrated with the folding buffer. The protease, which eluted near 0.5 column-

volumes, was concentrated to 10 mg/mL. The purity of the ensuing protein was verified by SDS-

PAGE.

88

3.3.2. Peptide

The SGIFLETS peptide with free N and C termini was synthesized and purified to be >99% pure

by Biomatik (Wilmington, DE). Stock solutions in DMSO containing TFA (0.1% v/v) were

prepared at a concentration of 1 mM for crystallization.

3.3.3. Crystallization

Protease and peptide stock solutions were mixed at a volume ratio of 4:1, respectively. Crystals

were grown by vapor diffusion in 2-jiL drops hanging over a mother liquor of 100 mM sodium

acetate buffer, pH 5.0, containing NaCl (1.0 M). Crystals, which grew within 24 h, were

cryoprotected in mother liquor containing glycerol (10% v/v) by flash-freezing with N2(l).

3.3.4. Data Collection and Processing

Single-crystal diffraction data were collected at the Advanced Photo Source at Argonne National

Laboratory in Sector 21 (LS-CAT) at Station G. Data were indexed, integrated, and scaled using

HKL-2000 (HKL Research). Details regarding diffraction and data reduction can be found in Table

3.2.

3.3.5 Structure Solution and Refinement

Molecular replacement was conducted with Phaser implemented in Phenix228 using PDB entry

lkjf as a starting model. Model building was conducted with COOT2 29 . Refinement with Phenix

following initial substrate placement revealed an additional anti-parallel orientation of the

89

substrate. Subsequent refinement estimated occupancies of approximately 0.6 (conformation A)

and 0.4 (conformation B) for the major and minor orientations and revealed other alternative

conformations for residues Serl and Gly2 in conformation A (Figures 3.2a and 3.2b). Because of

the complexity of constraining alternative conformations of some residues simultaneously with

other residues that occupy subsites fully (i.e., 1.0), occupancies were set manually. The residues

in conformation A with alternative conformations were assigned occupancies of 0.4 (conformation

C) with the original conformer retaining 0.2 of the total occupancy (0.6) of conformation A. Details

regarding refinement and the statistics of the final model are listed in Table 3.2.

90

3.4 Conclusion

The endogenous substrates of HIV-1 protease represent but a small subset of sequences that can

be cleaved by the enzyme. Among the best of known substrates, SGIFLETS, was derived by phage

display. Its structure bound to the protease reveals the formation of many hydrogen bonds with its

glutamic acid and serine side chains. Thus, hydrogen-bond formation could serve as the basis for

the design of optimal substrates and, perhaps, inhibitors of HIV-1 protease.

3.5 Acknowledgements

I.W.W. was supported by Biotechnology Training Grant T32 GM008349 (NIH) and a Genentech

Predoctoral Fellowship. This work was supported by Grant ROl GM044783 (NIH).

91

Table 3.1

Table 3.1. Endogenous and Optimized HIV-1 Protease Substrate Sequences

Substratea P4 P3 P2 P1 P1' P2' P3' P4'

MA/CA S Q N Y P I V Q

CA/p2 A R V L A E A M

p2/NC T A I M M Q K G

NC/pt R Q A N F L G K

P1/p6gag P G N F L Q S R

NC/TFP R Q A N F L R E

TFP/p6pol N L A F Q Q G E

p6pol/PR S F S F P Q I T

PR/RTp51 T L N F P I S P

RT/RTp66 A E T F Y V D G

RTp66/INT R K V L F L D G

Nef D C A W L E A Q

Phage displayb S G I F L E T S

aRef 230bResidues with a grey background areby phage display.

shared with the substrate identified

92

Table 3.2

Table 3.2. Crystallographic Data Collection and Refinement Statistics

PDB Code 6bra

Data CollectionX-Ray SourceDetectorWavelength, AResolution, A (last shell)Space groupa,b,c in Aa,p8, y in 0

No. of ReflectionsNo. of Unique Reflections (last shell)Redundancy (last shell)Mean I/- (last shell)Completeness (last shell)Rmeas (last shell)Wilson B-factor

RefinementWorking Set (last shell)Test Set (last shell)Rwork (last shell)Rfree (last shellRMS deviation bond lengths, in ARMS deviation bond angles, in

Total Number of AtomsProtein ResiduesProteinLigandWater

Avg. B-factorProteinLigandWater

Ramachandran Favored, Allowed, Outliers, in %(MolProbity)

LS-CAT 21-ID-GMAR 300 CCD0.9785726.0-1.11 (1.15-1.11)P212i258.033, 85.767, 46.1390,90,9061299690088 (8193)6.8 (3.9)33.6 (1.5)98.63 (91.12)0.065 (0.691)10.99

90066 (8181)1998 (181)0.1708 (0.2536)0.1840 (0.2732)0.0040.78

210120618224275

15.2513.4619.5827.04

99, 1, 0

93

Figure 3.1

Flaps

(a)

P2

P3

PI, PT,

0%P4'

Q

(b)

94

P4

P5

Figure 3.1. Structure of the D25N HIV-1 Protease CA/p2 Complex. Protease residues from chain

A are shown in white, chain B in grey, and substrate CA/p2 in ball-and-stick representation (PDB

entry 1f7a). (a) Substrate CA/p2 binds in the active site of the protease (white and grey) in an

extended conformation between the two flaps and core domain. (b) Substrate side chains are

numbered relative to the scissile bond.

95

Figure 3.2

lle3A/Glu6B Leu5A/Phe4B

Ser1N/Ser8B * "Thr7A

Gly2NCGly2B

Thr7B

Ser8AISerlBSeriC Phe4AILeu5B Ile3AIGlu6B

(a)

Ile3AJGlu6B Leu5AIPhe4B

Ser1N/Ser8B Thr7A

Gly2 IAC Gly2B

Thr7B

Ser8A/SerlBSeriC Phe4AILeu5B lle3A/Glu6B

(b)

Ile50 11950Gly48

Gy4 Ile4711047 X

Gly49 GSer8

i Gly2 4 l3 Wat102 Glu6

'A& Th7

SPhe4 WtSWat242 Wat259

Ala28 4An5 Gy7Asp29Aw6 Gly27

As29 AAla28

Wat234 Gy27 Wat240

Arg87hr26

Arg87 T s2

(C)

96

Figure 3.2. Electron Density and Interactions of SGIFLETS Bound in the Active Site of D25N

HIV-1 Protease. Protease residues from chain A are labeled in white, chain B in grey, and

SGIFLETS in black. Maps of 2Fo - F, (contoured at 1 a) (a) and Fo - Fc after simulated-annealing

refinement with the substrate excised (contoured at 3a) (b) are depicted as a mesh around the

substrate in the final structure. (c) Conformation A of SGIFLETS showing hydrogen bonds with

HIV-1 protease residues (yellow, direct hydrogen bonds; magenta, water-mediated hydrogen

bonds).

97

Gly51

Gly51

li11950110

Gly48 Gy4O

of 2 3

'AI I

Ala2 Ap29(P4)

(a)

Gly51

IleSO Gy1

G"y48

Pro2

G y3(P3) if)

WatS37

Arg8

(b)

Wft2S4A

GGySlA yA

Gb,4A

G y2A

SerlA

(N)4

(P3)

*00

(c)

X GlyGl1B

Ile50BXw,

IleSOBC GlyOBWat254B

G y40B

Serl C(PN)

Gly2C(P3) ?'%I

Wat1O9

Arg8

(d)

98

Figure 3.3

Figure 3.3. Alternative Conformations of P3 and P4 Residues. Protease residues from chain A are

shown in white, chain B in grey, and substrates in black. (a) CA/p2 complex (PDB 1 f7a). Ala2

(P4) and Arg3 (P3) form P-sheet-like interchain hydrogen bonds with Asp29 and Gly48. (b) p l/p6

complex (PDB lkjf). Gly48 and Arg8 alternatively recognize the P3/P4 amide. (c) Alternative

orientations A and B of SGIFLETS exhibit the conserved P-sheet conformation and a unique

hydrogen bond between the side-chains of SerlA/B (P4) and Lys45. (d) Alternative conformation

C is similar to that in panel b and has a unique hydrogen bond between the side-chain hydroxyl

group of SerC (P4) and the main-chain oxygen of Gly49.

99

Thr7B(P3')

(PT) ( )r8B(P4')

(a)

6n AS(P3)

(P4)

(b)

100

Figure 3.4

Figure 3.4. Role of Asp30. (a) Network of hydrogen bonds formed by Glu6 (P2') and Ser8 (P4')

of SGIFLETS. Protease residues from chain A are shown in white and SGIFLETS in orientation

B in black. (b) Analogous hydrogen bonds formed by Ser2 (P4) and Asn4 (P2) of the MA/CA

substrate (Ikj4), though these residues interact with each other and only Ser2 interacts with Asp30.

101

Chapter 4

An n-+r* Interaction in the Bound Substrate of

Aspartic Proteases Replicates the Oxyanion Hole

Contribution:

I performed all bioinformatics analysis and computational experiments, and prepared the initial

manuscript. I contributed to the design of computational experiments and revision of the final

manuscript.


Windsor, I. W.; Gold, B.; Raines, R. T., An n-+* interaction in the bound substrate of aspartic

proteases replicates the oxyanion hole. ACS Catal. 2018, In press.

102

Abstract

Aspartic proteases regulate many biological processes and are prominent targets for therapeutic

intervention. Structural studies have captured intermediates along the reaction pathway, including

the Michaelis complex and tetrahedral intermediate. Using a Ramachandran analysis of these

structures, we discovered that residues occupying the P1 and P ' positions (which flank the scissile

peptide bond) adopt the dihedral angle of an inverse y-tum and polyproline type-IL helix,

respectively. Computational analyses reveal that the polyproline type-I helix engenders an n--z*

interaction in which the oxygen of the scissile peptide bond is the donor. This interaction stabilizes

the negative charge that develops in the tetrahedral intermediate, much like the oxyanion hole of

serine proteases. The inverse y-turn serves to twist the scissile peptide bond, vacating the carbonyl

7r* orbital and facilitating its hydration. These previously unappreciated interactions entail a form

of substrate-assisted catalysis and offer opportunities for drug design.

inverse

N NI'-WH dinteraction interaction

H

103

4.1 Introduction

Aspartic proteases (EC 3.4.23) have played a central role in the history of protein science. The

human digestive enzyme pepsin was the subject of seminal studies in mechanistic enzymology

and protein crystallography.5 6,23 ' Fueled by aspartic proteases being therapeutic targets,

endothiapepsin and HIV- 1 protease emerged as model systems and together account for over

1,000 entries in the Protein Data Bank. In accord with a common reaction mechanism, aspartic

proteases share susceptibility to inhibition by the peptidic natural product pepstatin. Its

hydroxyethylene pharmacophore, which mimics an intermediate in the enzyme-catalyzed

reaction, forged the route to clinical inhibitors of aspartic proteases. 90, 195

Aspartic proteases employ a pair of aspartic acid residues to activate a water molecule for

nucleophilic attack on a peptide bond. This catalytic dyad arises from a pair of "DTG" motifs at

the interface between two globular domains (Figure 4. 1A).232 Eukaryotic aspartic proteases are

monomeric and consist of two unique domains,52 whereas retroviral homologs are homodimeric

(Figures 4.1B and 4.1 C). 233 -234 In addition to the DTG motifs, aspartic proteases share other

structural elements, including a p-barrel domain and flap that close upon polypeptide substrates.

The enzymes vary, however, in their substrate promiscuity and biological niche. For example,

renin cleaves angiotensinogen with exquisite specificity to elicit vasoconstriction, 23 5-2 36 whereas

HIV- 1 protease recognizes a variety of substrates to enable maturation of new virions.237

The catalytic mechanism of retroviral proteases has been informed by the use of substrate

mimetics8 1 and, especially, inactive variants. In seminal work, Wlodawer and coworkers employed

site-directed mutagenesis to inactivate FIV protease and then determined the structure of the

inactive variant bound to an actual substrate. 238 Schiffer and coworkers extended this strategy to

104

HJV-1 protease while developing their shape-complementary model of side-chain reognition.65'

82

Typically, the main-chain conformation of the substrates of aspartic proteases is thought to

resemble that of a P-strand.239 We have examined the structures of inactivated protease substrate

complexes, focusing on the main chain. We have discovered that the substrates do not resemble a

P-strand near the scissile peptide bond. Instead, the substrate adopts a conformation in which the

oxygen of the scissile peptide bond donates an n--+ir* interaction to the next carbonyl group in the

main chain. The discovery of this interaction, as well as the formation of an inverse y-turn within

the substrate, suggests that aspartic proteases rely on substrate-assisted catalysis 240 to effect the

cleavage of peptide bonds.

105


We began our investigation of the role of main-chain conformation in the mechanism of aspartic

proteases by gathering structures of substrates bound to inactivated enzymes. 24 1 Since 1997, 35

structures of inactivated retroviral proteases with flaps closed on peptidic substrates have been

deposited in the PDB (Table 4.S 1).65, 82, 218, 222, 238, 242-247 Next, we measured the # and qL angles of

residues in the P1 and P1' position to generate a Ramachandran plot (Figure 4.2). The peptide bond

between these two residues is the one cleaved by the enzymes. We found distinct clusters of P1

and Pl' residues, with mean values of #= -95.5' 10.10 and / = 44.9' 10.4' for P1 residues,

and # = -56.3' 14.5' and v = 140.0' 7.5' for Pl'residues.

4.2.1 Conformation of the P1 Residue

The P1 residue of bound substrates adopt conformations in the broad region of the Ramachandran

plot associated with f-strands. An analysis by Rose and coworkers identified a subset of this region

with # angles between -100' to -70' and V angles between 50' to 100' populated by members of

a coil-library that contain an inverse y-turn, called the "y-basin" (Figure 4.3A).248 Its signature

inverse y-tum motif was described nearly a half-century ago 249 and is characterized by a hydrogen

bond between the i - 1 main-chain oxygen and i + 1 main-chain nitrogen, forming a 7-membered

ring.250 The donor-acceptor angle deviates from linearity, making the hydrogen-bond energies of

inverse y-turns weaker than those of other secondary structural elements, such as P-turns and p-

sheets. The inverse y-turn centered on the P1 residue results from the P2 (i - 1) residue accepting

a hydrogen bond from the P1' (i + 1) residue (Figure 4.3B). 1-Branched amino acids are excluded

from inverse 7-turns, as their side-chain would clash with the main-chain nitrogen of residue i.248

106

Notably, the incorporation of fp-branched residues at the P1 position yields inefficient substrates

for HIV- 1 protease, consistent with the importance of an inverse y-turn conformation for

catalysis.2 51

An inverse y-turn is incompatible with the /4 of endogenous HIV protease substrates that

contain a P1' proline residue, which lacks the requisite main-chain N-H.2 0 Segregating the P1

Ramachandran plot based on sequence revealed to us that substrates with a P1' proline residue

have expanded # angles that move the conformation of the P1 residue out of the y-basin (Figure

4.3A). These structures have intramolecular hydrogen bonds between the side chains of the P2 and

P4 residues (Figure 4.3C). Their proline-containing substrates all have P2 and P4 residues with

polar groups, which could serve to increase the q angle and move the P2 carbonyl group away

from the P1' pyrrolidine ring. An asparagine residue is found most commonly at the P2 position

of substrates with a P1' proline residue, but is also found at the P2 position of those substrates

known as "p l/p6". In p I/p6 substrates, the side-chain oxygen of the asparagine residue, rather than

its main-chain oxygen, accepts a hydrogen bond from the i + 1 main-chain nitrogen (Figure

4.3D).242 In addition to an increased # angle, the P1 residue of pI/p6 substrates exhibits a reduced

y angle, which serves to rotate the main-chain N-H towards the side-chain of the asparagine

residue in the P2 position (Figure 4.3A).

The inverse y-turn in the main-chain of P1 residues has not gone unnoticed. Medicinal

chemists, guided by knowledge that proline is often found in inverse y-turns, discovered that

replacing the main-chain of substrates at the P2 to P1' positions with a seven-membered ring can

generate potent inhibitors of HIV protease.2 5 2 Although this strategy was intended to mimic the

107

inverse y-turn, structural analyses revealed that the ring shifted by a "half' residue relative to that

in substrates. 2 -2

4.2.2 Conformation of the Pl' Residue

The main chain of the P1' residue occupies a region of the Ramachandran plot associated with the

polyproline type-II (PPII) helix (Figure 4.4A) and exhibits structural features conducive for

preorganization (Figure 4.4B, Table 4.S1). Comparing the main-chain dihedral angles of Pl'

residues (Figure 4.4C) with the structure of a PPII helix solved by Wennemers and coworkers

using direct methods reveals a striking similarity (Figure 4.4D).25 The lack of hydrogen bond-

donating groups in the main-chain of polyproline entices donation of oxygen electron density to

an alternative acceptor-the r* orbital of the i + 1 carbonyl group.25 1-25 1 (For reviews of the n-**

interaction, see refs. 258-259.) In a-helices, donor carbonyl groups employ n-+w* interactions with

the si face of the adjacent carbonyl group, whereas in PPII helices the interaction is with the re

face. Previous systematic, energetic analysis by our group estimated that a residue occupying the

dihedral angle near the average for residues in the Pl' position ( = -55' and V = 140.0') has an

n-T* interaction with an energy of 1.0 kcal/mol.260

The PPII-helical conformation is not limited to proline-rich sequences. Unfolded protein

sequences likewise occupy this conformation. 2 61 Protease substrates must be sufficiently

disordered to avoid adopting a secondary structural element like an a-helix or P-sheet, which

would preclude binding, but must adopt a conformation complementary to the enzymic active

site. 2 62 An analysis by Brown and Zondlo identified proline, leucine, and alanine as the residues

with the greatest propensity to form PPII helices when flanked by pairs of proline residues. 263

108

These three amino acids constitute half of the Pl' residues in endogenous substrates. Thir analysis

also identified P-branched and small polar residues as being disfavored in a PPII helix, and those

residues lead to inefficient substrates of HIV- 1 protease when in the P1' position.66

4.2.3 Mechanistic Insights

Informed by our bioinformatics analysis, we sought insight on the catalytic mechanism of aspartic

proteases. The catalytic mechanism of amide hydrolysis proceeds first via nucleophilic attack of

water at the P1 carbonyl group to generate a tetrahedral intermediate that is a geminal diol (Figure

4.5). Proteolytic cleavage is completed upon protonation of the P1' nitrogen and subsequent C-N

bond scission. To investigate this mechanism, we performed quantum chemical calculations on

coordinates extracted from high-resolution co-crystal structures of a substrate and a gem-diol

intermediate bound to HIV- 1 protease. To represent the Michaelis complex, we chose a recent co-

crystal structure of a highly efficient substrate bound to an inactivated protease (PDB entry

6bra2 64), which has the highest resolution of any such structure. To represent the gem-diol

intermediate, we chose a structure by Weber and coworkers that employed threonine as the Pl'

residue (PDB entry 3b80 66).

When extracting coordinates, we sought to preserve noncovalent interactions between the

P2/P 1 and the P1'/P2' carbonyl groups near the scissile peptide bond. To do so, we extracted the

coordinates of the P1 and Pl' residues to the P-carbon of their side chains, the distal amides and

C' atoms from the P2 and P2' residues along with a conserved water molecule, the main-chain

amides of the protease flap that form hydrogen bonds with the conserved water molecule, and the

aspartic acid or asparagine side chains of protease residues 25 and 25'. Next, we optimized the

109

positions of hydrogen atoms with density functional theory (DFT) calculations. Hydrogen atoms

were added to the structure to create a neutral structure of the Michaelis complex and a

monoanionic structure of the tetrahedral intermediate. There was no ambiguity in assigning

hydrogen atoms in the substrate structures. The protonation of the gem-diol intermediate was

modeled to match neutron diffraction structures in which both hydroxy groups form hydrogen

bonds with an aspartate residue,265 as is proposed for the tetrahedral intermediate. Finally, we

performed Natural Bonding Orbital (NBO) analysis of the ensuing models to estimate the strength

of noncovalent interactions. 266-267

In the Michaelis complex, the inverse y-turn-PPII-like motif activates the n* orbital of the

P1 carbonyl group for nucleophilic attack by water. The two flanking carbonyl groups (i.e., those

of the P2 and P1' residues) form hydrogen bonds, either with a conserved water molecule in

retroviral proteases or with the flap of monomeric proteases. The ensuing constraint increases the

electrophilicity of the P1 carbonyl group by both enhancing the n-+* interaction with the Pl'

carbonyl group and preventing n->*7r donation from the P2 carbonyl group, which would

otherwise raise the energy of the ;r* orbital of the P1 carbonyl group and diminish its

electrophilicity. 268

As the reaction proceeds to the tetrahedral intermediate, the n--+x* interaction grows

stronger, increasing from 0.66 to 2.01 kcal/mol. This latter energy, which is remarkably high,

serves to delocalize developing negative charge (Figure 4.6A and 6B). Thus, the n-+-** interaction

between the P1 and P1' carbonyl groups effectively acts like the renowned oxyanion hole of serine

proteases, which delocalizes developing negative charge through the formation of a pair of

hydrogen bonds with enzymic N-H groups. 269 -273 As has been postulated for hydrogen-bonding

110

1 ih the xninbn hr1l,273-274 fh - tit-rcticon is ctrnnar in the te-tr1hedral intermediate thanoxyLI an1on kiJ"A%"L 1J% Lll% It IILI iJiL%1LL%'L1J1Ai 1k.1 AX J A-

in the Michaelis complex. This differential binding of the Michaelis complex and tetrahedral

intermediate is likely to enhance catalysis.2 75 2 8 0

Like the n--7c* interaction, the inverse y-turn is important in the enzymatic reaction

mechanism. In the first step, amidic resonance must be overcome. At first glance, a hydrogen bond

to the main-chain nitrogen of the P1' residue might be thought to increase amidic resonance and

thereby decrease the electrophilicity of the scissile peptide bond. Upon forming the inverse y-tum,

however, the scissile peptide bond is twisted out of plane, decreasing amidic resonance and

increasing electrophilicity. 281 -28 s In the Michaelis complex, the amide nitrogen is pyramidalized

slightly and the lone pair hybridizes to a small degree (~3% s-character, versus <1% s-character

in adjacent amides) because of the inverse y-turn hydrogen bond, which has an energy of a 0.66

kcal/mol (Figure 4.6C). This interaction is absent in substrates that have proline in the P1' position,

and these substrates are cleaved only slowly by the protease. 286 Similarly, the asparagine side chain

in the P2 position of the p I/p6 substrate competes for the y-turn, resulting in efficient cleavage. 79

In the tetrahedral intermediate, the energy of the inverse y-tum hydrogen bond is

maintained at 0.67 kcal/mol (Figure 4.6D). Although the scissile peptide bond is now hydrated, its

nitrogen lone pair still contains only ~7% s-character, which is much less than the 25% s-character

of an sp3-hybridized orbital. We find that the low hybridization is due to the presence of the

neighboring gem-diol, in which a higher energy p-rich lone pair of the nitrogen is a better donor

of electron density to the two o*c-o orbitals. Nonetheless, the inverse y-tum also aligns the nitrogen

lone pair towards the protonated oxygen of Asp25, resulting in a hydrogen bond with an energy of

0.79 kcal/mol (Figure 4.6E). This aspartic acid residue also maintains a hydrogen bond of energy

111

5.36 kcal/mol with the gem-diol (Figure 4.6F). As the proton transfer to the N-H proceeds, the

energies of the two nN-*cy*C-O interactions likely decrease and the energy of the no--+U*C-N

interaction likely increases, promoting C-N bond scission to form the products.

Computational chemists have devoted much attention to the catalytic mechanism of

aspartic acid proteases, especially that of HIV-1 protease. 287 -290 These studies have tended to

employ a hybrid quantum mechanical/molecular mechanical (QM/MM) approach in which the

higher level of theory is used to describe molecular fragments that are smaller than those analyzed

herein. 7 2 , 291-292 Our findings suggest that previous analyses have missed critical details: the n-*

interaction and inverse y-tum. New computational strategies are needed to select fragments that

capture all of the interactions that make significant contributions to catalysis by aspartic acid

proteases and other enzymes. 293


4.3.1 Structural Analyses

The atomic coordinates of X-ray crystal structures that were deposited in the Protein Data Bank as

of August 9, 2018 and that contain an inactivated retroviral aspartic proteases were downloaded.

The structure SOWJUL25 5 was downloaded from the Cambridge Structural Database. Angles and

distances in these structures were measured to the nearest 0.1 A and 0.1 0, respectively, with the

program PyMOL from Schr6dinger (New York, NY). Structures containing multiple protease

dimers in the asymmetric unit or alternative conformations within a single protease molecule were

measured individually and given equal weight in calculations of the mean. PDB codes and chain

112

idntfiers are listed, An Qal .1 alo ' N + ith the mesred AvA I1- L-s Sftuctures we~re de-picte-d wtMI~L1LLL11..'1_a "4L.v 1L L 11 I. .L L I~ "]ionrr uft. IIIIItL11'lA.L V "L4%L'.3.. "JLI LL%'LL4I% .) VV

the program PyMOL.

4.3.2 DFT Optimization

Atoms important for determining the conformation of the scissile peptide bond in the Michaelis

complex and gem-diol intermediate were extracted from PDB entries 6bra264 and 3b80,66

respectively. All termini were made into amides. Hydrogen atoms were added to extracted

structures in idealized geometries with the program GaussView 6 from Gaussian (Wallingford,

CT) and optimized at the M06-2X/6-311 +G(d,p) level of theory along with the integral equation

formalism variant of the polarizable continuum model (IEFPCM) model for water-solvation 294 by

using Gaussian 16, Revision A.03 software from Gaussian.29 s Optimized structure coordinates are

listed in Tables S3 and S4.

4.3.3 NBO Analysis

Optimized structures were subjected to Natural Bonding Orbital analysis using the NBO 6.0

software from the Theoretical Chemistry Institute of the University of Wisconsin-Madison

(Madison, WI). 2 96 Orbital interaction energies were calculated by second-order perturbation

analysis. Orbitals were depicted with the program NBOView 1. 1.297

113

4.4 Conclusions

Ramachandran analysis of the residues in substrates bound to aspartic proteases has revealed

previously unappreciated noncovalent interactions. When bound, the substrate adopts a

conformation with an n--+7c* interaction and an inverse y-tum, both of which activate the scissile

peptide bond for nucleophilic attack by a water molecule. To our knowledge, the use of these

interactions in substrate-assisted catalysis had not been described previously. Notably, these

findings extend the reach of n--7r* interactions between adjacent main-chain carbonyl groups to

enzymatic catalysis. 298

4.5 Acknowledgements

B.G. was supported by an Arnold 0. Beckman Postdoctoral Fellowship. This work was supported

by Grant RO 1 GM044783 (NIH).

114

M1

I L6 67 ZEI 9*Et- 8*6E 97,6- A 18SOA/AN~d96 87 L*9E1 I*8t,- 6*9E 9'101- a dAHSOA/JN~d4I 09*1 9!bt

9*E6 87Z Z*6EI Wt'- t717 E7001- D daHNO-4/ANDcNI ~99 1 qoj7

CtV6 v*z 6 ! Z*1Vt- 17L 101- D dXNO'1AN~dH 9vi I qot

El16 87 9*S1 1*9t,- 8'H 9*L01- A n"sO&J/ onl -FE6 87Z '9E1 WLA'- 8*8E 1*101- 3l dNISOdANDd" 98* 1 qqot

1*001 67Z t,*M 8*tl- 91't 9'E6- A 'dsol/AN4cfI -

17,6 87 9*9E 1 E~t- 68 E 9*011- 3 d"SO1/ANDcfl 8' 5qot

9*0O1 67 91 ! 6'6E- 7 E86- A 'HxoiNOtAND--FE6 87 9*t I F*it- Ot, Z'01- 3 d)Jo'1/ANDd) 89*1 jqot'

V6 87 E71 91 t- LV9E I*E01- A NJN6A/JINDd --

VE6 87 ~99c Z*9- 6 V901- 3 JaN0d/AN~d 06*1 pqoj7

L'86 WE 8L'M E 09- LVtg &06- Ui WAV3A/IA)1 O17 JtlJ11l tl* E C'Ml 98- E'978- Ul D-owII/I'Nl OE*Z ;DJ17

L17,6 0' 9,1 1 L EL- 9*Jt7 E'gL- Ul 1UXHV/IAIUN117,6 WE 9*191 L EL- VLI7 1~L- D) IUNHV/IAUN 98*I1 AIA

V9~6 WE 6'9t1 Z*8- Z*9 96- d "1S0'71/ANtD 087 IREtZ*001 V E 9*Vl E*ZL~- 09 tM~O!- 9 ;D13ooAtw 09*1 j~xqE

9'L01 WE V9ZI 17,9- 9*Lt Vi10l- d DUIAA/JIOV gzz aixuz

670 1 WE LEI Z*99- 9*19 6- d VDUA1k/AA3V 007 Ixuz

E'101 Z' Z*fI g*E9- 6'89 866- d DU1AJAlIv ooz pxu:

9*8L 87 'L9 I 6*9t- L'8E 6*06- VIIV'd Dr1A/NAON ltJ li978 67 9*tg I 8*t79- 9*E1 t8- d DIA/NVO1 98'1 SUJZ8*6L 87 LZ9 V09- WV t E, 19- d N)IA/NqVO) ovz nsil

601 t" E 9*ti L76- L L W,8/J- d NDID-I/NAO' 007 bslIi

788 87 V6N' E"89- 91717l 676- d dHlSO1/JNDd-J 007 61w IE796 67 Lt, 6*89- 179 9,88- d WAVHV/1A)IV 98*1 81tH!IViol1 WE 9,8 t,1 9*L- L*8t' 9'LO I- d Ald/ANOSA 06*1 LilU IV96 WE ~991 L89- V6t' 976- d lIUA/l11x4J 007 4I)11

1*901 FE 9,8EI 6*tL- S*9 Z6- d VDUAA/AI3V 07 1 I9'06 87Z Vtl*9,I 8*S- t'Lt, 6*L6- d d)JSO'1/ANtDdA% 07 JfI

01 67Z L*8E1 Z89- 8Otl p*88- d D-owI'/w'Nv 00-z L[31Z'011 0W 8 1 U CL- 8717, V801- S Ald/ANOSA

9'E01 WE VLI7I 6*6L- L'8 Lt d OA~d/ANOsA 067 t[FNI

VL6 67 Lt, 6'69- U*19 Wl88- d VIV/I7A)IV-N 007 tLJI

(O)o (yp (0) 1 (O)q5 (0) 1h (O)o 1~ 3oumnbm (Y) UT

11d Id *S;D>J LGd

SOUJ13 Ud UJOJJ SJ;Dlwu~htd poJflsuoW, IS-t' alqtjL

IS1', alq'u

4ij7 1.67 PGNF/LQSS E -91.6 -43.5 138.7 2.8 96.3

- PGNF/LQSSP F -101.5 39 -41.1 130.4 2.7 97.6

4cj8 2.00 RPGNF/LQSRL E -102.2 42.2 -41.6 130.2 2.7 98.3

RPGNF/LQSRL F -105.3 36.4 -40.9 132.4 2.7 95.7

4qj9 1.83 RPGNF/LQSSP P -93.7 44.2 -45 131.2 2.8 97.1

4qja 1.54 RPGNF/LQSRL P -99.5 36.1 -40.7 135.3 2.7 95.3

6bra 1.11 SGIF/LETS S, AltA -95.4 43.2 -48.7 137.1 2.9 96.8

- SGIF/LETS S, AltB -96.3 49.1 -52.9 139.9 2.9 96.9

Mean -95.5 44.9 -56.3 140.0 2.9 96.8

SD 10.1 10.4 14.5 +7.5 0.2 7.0

116

Table 4.52

Table 4.S2. Measured Parameters from CSResidue 0 (0) y (0)

1 -67.1 146.52 -65.7 138.43 -73.1 163.94 -72.8 152.05 -72.5 165.76 -69.0 152.5

Mean SD -70.0 3.2 153.2 10.4

D Entry SOWJULd (A)

3.02.93.13.13.23.0

3.1 0.1

117

O (0)

102.9106.589.898.988.298.2

97.4 7.2

Table 4.S3

Table 4.S3. Coordinates Extracted from PDB Entry 6bra and Optimized Hydrogen Atoms

-7.24337900-5.80699300-4.80021800-5.29355300-3.59684100-6.27593900-7.93901300-5.61028800-4.68357900-7.46746800-5.59438500-7.459219004.503157005.34987000

6.512720004.342846003.535911005.054321004.779132005.560863005.955380003.936433006.399491004.91144200

-7.17082300-5.71364400-4.76266000-3.48856200-5.15569000-3.20799100-7.41367500-5.51747800-2.75724100-7.38023900-5.47457500-7.826774004.523789004.931442005.581182003.993352005.421174003.881802004.55309600

2.591745003.004540002.070421001.028135002.255594000.914223003.291071003.034978000.414384001.603380004.003120002.59959500-2.66943900-2.76866700-3.17583400-3.67535700-2.19653500-2.10169600-2.39460500-2.34637600-3.33569000-1.82897900-1.65184500-2.02229800-2.71314600-3.06179800-2.06995400-2.42722900-0.99479400-3.31462600-1.73125900-3.09730000-1.79518600-2.69378500-4.05755700-3.451077001.885302002.646338003.690052000.960250001.666490002.529634002.13944900

1.992045002.280060001.671150000.995418001.808227000.817659002.458716003.354135000.467424002.397042001.891608000.921921002.179544000.932732000.996107002.571492002.004803002.92973800-0.21005500-1.43920100-1.67071400-0.17010400-1.34663600-2.24986800-1.97665500-2.23923800-1.63863300-1.59960800-1.19167900-1.98804600-2.38298500-3.31616500-1.28692500-0.90596000-1.85569700-2.44070200-2.31648200-1.07263700-1.17225000-2.09544400-2.89670200-2.919578000.10396200

118

4.869995003.982854004.449488004.444743005.950142000.370234000.552342001.58140000

-0.642979000.631456001.06865400

-0.44006000-0.28425800-0.78651700-1.45469300-0.87481900-1.262926000.79572600

-0.70112300-1.95233900-0.39749100-0.46524300-0.512834000.209295001.280318000.202904000.15368400

-1.55446100-0.289750000.180511001.24559700

-0.388558000.28427600-1.271922000.466960001.23860500

-0.352689002.894091002.208896002.43829800

2.833933001.293499002.264310003.839478002.91615100

-0.69843300-0.77105800-0.34060000-0.944206000.30418000

-1.40411200-1.32007100-1.67021900-0.59667800-0.87688100-3.04365700-1.65377400-1.71863800-3.31902500-3.05404000-3.779020000.657054001.713998001.259976000.649419002.943051000.801576001.946168003.275553003.757392002.698814001.548763001.300476002.035685000.232823001.827520001.65364200

-0.051398000.20227300

-0.14724500

1.35 1446000.119268002.178185001.352682001.484382004.534080003.026134002.484609004.846854004.869502004.987407002.329057000.91765700

-0.03793000-1.038016000.637096002.811519000.73918500-0.402329000.817011001.284236000.26996700

-0.72645700-1.98784400-1.92448200-0.164639001.06044800

-0.958493000.74989100

-0.888952000.05629500

-3.13939800-4.40653100-3.13055400-4.53155200-4.43859500-5.21321200-0.03173400-0.690199000.82216300

119

Table 4.S4. Coordinates Extracted from PDB Entry 3b80 and Optimized Hydrogen Atoms

7.144451005.695564004.740019005.185072003.512748005.524412007.770721005.506653007.260118007.44646700

-4.88670300-5.84044200-7.01061400-4.93341000-5.16320200-3.87256400-5.24390200-6.09137500-5.91998700-4.24254800-7.13316700-5.866430007.104292005.622926004.664937005.153619003.447049005.436858007.206119005.431581007.552681007.61095500

-4.58914100-5.38652300-6.34267800-3.68157000-5.19536800-4.32201100-4.95617200-5.54338400-4.84241500-4.20955300

2.434650002.775215001.821774000.742139002.158737002.729958002.825849003.774763001.352509002.88159300-2.90194400-3.11528900-3.45335100-1.86292900-3.55427200-3.13656200-2.88638700-2.81548900-3.68547000-2.76818000-2.79615400-1.90824500-2.47808900-2.76242500-1.81667600-0.78549200-2.16615000-2.69045600-1.57263600-3.75738300-3.31695200-2.340040002.265796002.911550003.647032001.816458001.496048003.026130002.608335003.231697003.170744001.93674000

-2.11890900-2.39618600-1.71029600-1.22960900-1.64786200-3.47221500-2.91984800-2.00507900-2.06190000-1.17077000-1.80800600-0.66000300-0.79196300-2.13138800-2.63690200-1.485252000.496587001.644241002.277720000.565882001.325370002.205352002.056258002.302639001.587710001.054748001.601225003.374169001.458295001.900409001.525010003.012137001.621902000.488661000.717745001.218740002.100535002.35696300

-0.70598800-1.88395000-2.71595700-0.81436400

120

-5.76467600-6.46494400-0.07444500-0.21829900-1.296113000.539257000.39939600

-1.060163000.78858200

-0.32451500-1.372059000.880864001.383127001.430523001.446433002.732498000.578092000.53812600

-0.03958500-0.89314300-0.308163002.078173000.72722500

-0.248185001.779410001.686431003.337859001.60183200

-0.356181000.753610002.310595000.26264200

-0.69001100-0.576747001.24667100

-1.70767400-0.48838800-2.68350000-2.11674400-2.146388002.90752400

4.277773002.71239500

-1.49178600-1.23103700-0.81880500-2.37894500-0.63248900-1.64990100-1.609922001.315565000.69171300

-1.48066500-0.36281400-2.95758000-0.75256400-0.083158000.835287001.828426003.251922003.159121000.73948100

-1.469218003.86119100-1.63137000-1.57861400-2.92308000-0.451630001.950569003.72308800

-3.79904400-3.094527001.663363001.706071000.807367001.868908001.757740002.584689000.03434700

-0.265968000.252187001.66595800

-2.14718700-4.33948300-2.84087100-2.38899200-4.49753300-4.81469400-4.77640400-0.626792002.053819001.92693100

-2.087069000.08419500

-0.231871001.44040900

-0.31069000-0.198445000.872663000.55723400-0.11389600-0.762195001.538743000.07996200-0.29043400-2.537316000.826873000.337195001.075789001.48799300

-0.38138200-0.860496003.245102004.370811004.977190003.340202003.984439004.98273100-0.30178000-1.016062000.46350200

-1.08886900

121

Figure 4.1

C

)

122

Figure 4.1. Structural Features of Aspartic Proteases. (A) Aspartic proteases share the "DTG"

motif with an inter-domain hydrogen bond. Here, the residues from a eukaryotic pepsin (PDB

entry ipsol 93) and HIV-1 protease (PDB entry 5hVp2 99) are aligned with an RMSD of 0.272 A. (B)

The eukaryotic pepsin consists of two unique domains and has only one flap. (C) HIV- 1 protease

consists of identical monomers that associate with two-fold symmetry

123

135 -

90

451l

O)o

--45

-0

-135

18Q-180 -135 -90 -45 U

o(o)

124

Figure 4.2

40 VU 135 IOU

Figure 4.2. Ramachandran Plot of P1 and P1' Residues in Substrates Bound to The Active Site of

Inactivated Aspartic Proteases.

125

Figure 4.3

100- r -- --- ---- --n -- -A 100y-basin

80-

60- Pl'Pro 4 :

y() P2Asn,,, - -.' -- ----40 - a

P2 Asn20

0-120 -100 -80 -60

B P1'P2

P4

P3 APi

C P4P2 P'

P3

P1

P2 i

P3

P4 01

126

Figure 4.3. Main-chain Conformation of the P1 Residue. (A) Ramachandran plot of P1 residues

in substrates bound to the active site of inactivated aspartic proteases. Data points are colored by

the presence of a proline residue at the P1' position and an asparagine residue at the P2 position

(red), an asparagine residue at the P2 position (blue), or neither (black). All data points are shown

relative to the y-basin. (B) The P2, P1, and Pl' residues engage in an inverse y-turn (NC/pl

substrate; PDB entry ltsq).244 (C) A proline residue at the P1' position precludes the formation of

an inverse y-turn but is found with polar residues at the P2 and P4 positions that form hydrogen

bonds (MA/CA substrate; PDB entry lmt7).218 (D) The side chain of an asparagine residue at the

P2 position forms a hydrogen bond with the nitrogen of the scissile peptide bond (pl/p6 substrate;

PDB entry 4obf).242

127

Figure 4.4

A 170.

160-160 polyproline

150- . 0

(0) . P1'

140 0 00 * 0

130-

-90 -70 -50 -30

B CP2

R P1l

P1i

D

polyproline0

128

Figure 4.4. Main-chain Conformation of the P ' Residue. (A) Ramachandran plot of P1' residues

substrates bound to the active site of inactivated aspartic proteases (black) and the residues in a

PPII helix (red).25 5 (B) Parameters used to assess n-+rw* interactions between adjacent main-chain

carbonyl groups in a protein (Tables SI and S2). (C) Structure of a P1 residue bound to an

inactivated HIV-1 protease and showing its n-*r* interaction (PDB entry lkjh; d = 3.0 A, 0 =

95.4 ).65 (D) Structure of a polyproline fragment showing its n--+7w* interactions (CSD entry

SOWJUL; d = 3.1 A 0.1 A, 0= 97.40 7.2 ).2

129

Figure 4.5

Inversey-tumn 00--mH NL" N "

N H-n0H P, interaction- 0 H

% OH, 0

0 >

Asp25 Asp23'

Michaelis Complex

H P

4--.O\

C H H

0 ",b01-

Asp25 Asp25'

gem-Dial Intermediate

0--- HH, 4N

H pfjK HO\ HH

0- 0-'

0 bASp25 ASp25'

gem-Diol Intermediate(N-protonated)

-4-

0

0H2Nij0 H7

H pHHOH

0- 0

'0 0Asp25 Ap25'

Product Complex

130

4-

Figure 4.5. Putative Mechanism of Catalysis by HIV-1 Protease. The hydrolysis reaction proceeds

through at least three discrete steps. Upon binding the peptidic substrate and lytic water, Asp25 of

one monomer deprotonates the water while Asp25' of the other monomer transfers a proton to the

oxygen of the scissile peptide bond. This oxygen also forms an n-+7* interaction (red dashed line)

with the main-chain carbonyl group of the next residue in the substrate. The n--7r* interaction is

even stronger in the tetrahedral intermediate. The inverse y-turn (blue dashed line) disturbs the

planarity of the scissile peptide bond and orients the nitrogen lone pair for protonation by Asp25.

131

Figure 4.6

R

P1 P1

Asn25'

Michaelis Complex(n-r* interaction)

P1

P1,

Asp25'

Asp25

gem-Diol Intermediate(inverse y-turn)

gem-Diol Intermediate(n-+r* interaction)

E

P1

P1',

Asp25

Asp25

gem-Diol Intermediate(0-H - N hydrogen bond)

gem-Diol Intermediate(0-H ---0 hydrogen bond)

132

D

CP1,A I

P1P1

Asn25'

Michaelis Complex(inverse y-turn)

F

iP

Asp25'

Asp25

Figure 4.6. Orbital Interactions of the Scissile Peptide Bond During Catalysis by HIV-1 Protease.

An n--+r* interaction occurs within the substrate (A) and tetrahedral intermediate (B). An inverse

y-turn likewise forms within the substrate (C) and tetrahedral intermediate (D). The hydrogen bond

of the inverse y-turn enforces a hydrogen bond between the lone pair of the nitrogen of the scissile

peptide bond and an active-site aspartic acid residue (E), despite a competing hydrogen bond of

the aspartic acid residue with the gem-diol (F).

133

Chapter 5

Sub-Picomolar Inhibition of HIV-1 Protease with a

Boronic Acid

Contribution:

I performed all experiments, prepared all materials accept for the synthetic compounds, and

wrote the initial manuscript. I contributed to the inhibitor design, preparation of the final

crystallographic model, and revision of the final manuscript.


Windsor, I.W.; Palte, M.J.; Lukesh 1II, J.C.; Gold, B.; Forest, K.T.; Raines, R.T., Sub-picomolar

inhibition of HIV-l protease with a boronic acid. J. Am. Chem. Soc. 2018, 140, 14015-14018.

134

Abstract

Boronic acids have been typecast as moieties for covalent complexation and are employed only

rarely as agents for noncovalent recognition. By exploiting the profuse ability of a boronic acid

group to form hydrogen bonds, we have developed an inhibitor of HIV-1 protease with

extraordinary affinity. Specifically, we find that replacing an aniline moiety in darunavir with a

phenylboronic acid leads to 20-fold greater affinity for the protease. X-Ray crystallography

demonstrates that the boronic acid group participates in three hydrogen bonds, exceeding that of

the amino group of darunavir or any other analog. Importantly, the boronic acid maintains its

hydrogen bonds and its affinity for the drug-resistant D30N variant of HIV-1 protease. The

BOH - OC hydrogen bonds between the boronic acid hydroxy group and Asp30 (or Asn30) of the

protease are short (ro ...o = 2.2 A), and density functional theory analysis reveals a high degree of

covalency. These data highlight the utility of boronic acids as versatile functional groups in the

design of small-molecule ligands.

HHHN'

NH

H B

H

135

5.1 Introduction

Clinical inhibitors of HIV- 1 protease are quintessential triumphs of structure-based drug design.97 ,

300 The protease cleaves diverse sequences that connect individual domains of viral polyproteins,

recognizing four substrate residues on each side of the scissile bond.65 The components of most

effective inhibitors-a tetrahedral-intermediate mimetic flanked by subsite-targeting groups

have undergone iterative optimization for 30 years. 30 1 The discovery of the bis-THF moiety of

darunavir, which targets the enzymic S2 subsite, was a major breakthrough.1 05 Its two bis-THF

oxygen atoms accept hydrogen bonds from the main-chain amides of Asp29 and Asp30, leading

to low picomolar affinity (Table 5. 1).199, 302 Mutations that overcome such main-chain interactions

are rare, 11, 122, 303-304 and darunavir is among the most resilient of protease inhibitors.3 05 -3 07

136


Despite countless attempts at optimization, an ideal functional group for the S2' subsite has been

elusive. Inspection of structures of complexes between substrates and darunavir analogs (Table

5.1, Figure 5.1) in conjunction with biochemical characterization revealed opportunities to us. Half

of endogenous substrates occupy the S2' subsite with a glutamine or glutamic acid residue.65, 7 7,2 07

These side-chains have been observed to form hydrogen bonds with both the main-chain N-H and

the side-chain carboxylate group of Asp30 (Figure 5. 1A). The aniline nitrogen of darunavir and

the methoxy group of an anisole analog form only a single hydrogen bond (Figures 5.1 B and 5.1 C).

Benzyl alcohol and cyclopropyl-amino-benzothiazole groups can form two hydrogen bonds with

Asp30, one with the main-chain N-H and another with the side-chain (either via a water-bridge or

directly) (Table 5.1, Figures 5.11D and 5.1E). Other aryl sulfonamide substituents, including

benzoic acid and benzamide, form a water-bridge with Gly48 in addition to accepting a hydrogen

bond from the main chain of Asp30, but again exhibit a <2-fold increase in affinity, but provide

<2-fold increases in affinity (Table 5.1, Figure 5.1 F). This water-bridge with Gly48 is another

interaction that can be exploited to recognize the main chain. Yet, no extant protease inhibitor

interacts with all three of these targets: main chain and side chain of Asp30, and a water molecule

that bridges to the main chain of Gly48.

We reasoned that an optimal functional group for targeting the S2' subsite would serve as

both a donor and an acceptor of hydrogen bonds. We were aware that the two hydroxy groups

presented by boronic acids are versatile in this manner. 13 4 , 308-309 These hydroxy groups display

four lone pairs and two hydrogen-bond donors. No other functional group provides six

opportunities to form hydrogen bonds so economically. We anticipated that one hydroxy group of

137

a boronic acid could form both interactions with Asp30 while allowing the other hydroxy group to

form a water-bridge with Gly48. Accordingly, we synthesized boronic acid 1, in which the

4-sulfonylaniline moiety of darunavir is replaced with a 4-sulfonylphenylboronic acid (Table 5.1,

Scheme 5.S 1).

Boronic acid 1 is a competitive inhibitor of catalysis by HIV-1 protease. By using a

hypersensitive assay of catalytic activity,2 23 we found its inhibition constant (Ki) to be 0.5 0.3

pM, which is indicative of 20-fold greater affinity compared to darunavir itself (Table 5.1, Figure

5.S ID). Because the boronic acid moiety of 1 is anticipated to interact with Asp30, we suspected

that D3ON HIV-1 protease, which is a common variant that endows resistance, could compromise

the affinity of boronic acid 1. For example, the D30N substitution entices darunavir to form a

water-bridge between its aniline nitrogen and the nascent asparagine, diminishing affinity by 30-

fold.310 Remarkably, the affinity of boronic acid 1 for the D30N variant (Ki = 0.4 0.3 pM) is

indistinguishable from that for wild-type HIV- 1 protease.

To understand the basis for the extraordinary affinity and resiliency of boronic acid 1, we

determined the X-ray crystal structures of its complexes with both wild-type HIV- 1 protease and

the D30N variant. The two structures were solved at resolutions of 1.60 A (Rfree = 0.1967) and 1.94

A (Rfree = 0.2203), respectively (Table 5.S1, Figure 5.S2). True to its design, the boronic acid

participated in all three hydrogen-bonding interactions (Figures 5.1G and 5.1H). Of special note

are BOH... OC hydrogen bonds observed in both structures (Figures 5.1G and 5.1H). The

interatomic distance of 2.2 A between the boronate oxygen and side-chain 06 of residue 30 is

reminiscent of a low-barrier hydrogen bond (LBHB).311312

138

We analyzed the atypically short hydrogen bonds between boronic acid 1 and HIV-1

protease with computational methods. First, we optimized the hydrogen atoms by applying density

functional theory (DFT) to a simple model extracted from the crystal structure. We examined the

electronic structure by using Natural Bonding Orbital (NBO) analysis. 266 NBO analysis revealed

an interaction energy of 69.8 kcal/mol between boronic acid 1 and the wild-type protease. The

typically non-hybridized p-type lone pair of the carboxylate oxygen hybridizes to sp3 -99 in the

hydrogen-bonded complex. This large interaction energy and hybridization suggest a large degree

of covalency in the BOH OC hydrogen bond. Next, we assessed the covalency of the short

hydrogen bond between boronic acid 1 and the wild-type protease with quantum theory of atoms

in molecules (AIM). 313 AIM calculations-specifically, structural elements at the bond critical

point (BCP)-enable quantification of the covalency between neighboring atoms. At the

BOH.-. OC BCP, we calculated an electron density (p) of 0.174 eV-3, a Laplacian (V 2p) of -0.08

e- 5 , and a bond index of 0.22. Typical OH - OC hydrogen bonds display p < 0.2 eA 3 , positive

V2p values, and a bond index <0.1.314 Instead, the attributes of BOH OC are consistent with the

attributes of an LBHB."-31

An LBHB arises from functional groups with closely matched pKa values.311-312, 315 This

requirement can be met by a carboxylic acid and a boronic acid,1 8 which are isoelectronic. In the

enzyme- inhibitor complex (Figure 5.1G), the boronic acid group of 1 displays an no,p-p3B

interaction (i.e., resonance) of 89.1 kcal/mol, and the carboxylic acid group of Asp30 in HIV- 1

protease displays a comparable nop--+r*c=o interaction of 87.9 kcal/mol. The ensuing

hyperconjugative interaction between a boronic acid and carboxyl acid is reminiscent of a

139

resonance-assisted hydrogen bond.3 1 6-3 18 Such hyperconjugation is absent in other inhibitors, such

as the benzyl alcohol analog of darunavir (Figure 5.1 D).

Boronic acids possess attractive properties beyond their versatile hydrogen bonding.

Boronic acid 1, like darunavir, is not toxic to human cells at concentrations up to 1 mM (Figure

5.S3). In vivo, aniline moieties can exhibit problematic genotoxicity as a result of metabolic

activation. 319-321 In contrast, the major metabolite of boronic acids is the oxidative deboronation

product, an alcohol, which is modified further in phase II metabolism for efficient excretion.14 1'

322,323

We conclude that a boronic acid group in a ligand can be profuse and versatile in forming

hydrogen bonds with a protein. These attributes are especially valuable in the design of ligands for

proteins that are under the selective pressure of drug resistance. In those instances, the ability of

boronic acids to form multiple hydrogen bonds enhances affinity, and the admixture of hydrogen-

bond acceptors and donors enables adaption to mutations.

140


5.3.1 General

Commercial reagents were used without further purification. (2S,3S)-1,2-epoxy-3-(Boc-

amino)-4-phenylbutane was from Sigma-Aldrich (St. Louis, MO). All glassware was oven- or

flame-dried, and reactions were performed under N2(g) unless stated otherwise. Dichloromethane

was dried over a column of alumina. Triethylamine was dried over a column of alumina and

purified further by passage through an isocyanate scrubbing column. Flash chromatography was

performed with columns of 40-63 A silica, 230-400 mesh (Silicycle, Quebec City, Canada). Thin-

layer chromatography (TLC) was performed on plates of EMD 250-pim silica 60-F2 54.

The term "concentrated under reduced pressure" refers to the removal of solvents and other

volatile materials using a rotary evaporator at water aspirator pressure (<20 torr) while maintaining

the water-bath temperature below 40 'C. Residual solvent was removed from samples at high

vacuum (<0.1 torr). The term "high vacuum" refers to vacuum achieved by a mechanical belt-

drive oil pump.

NMR spectra were acquired with a Bruker DMX-400 Avance spectrometer at the National

Magnetic Resonance Facility at Madison (NMRFAM) and referenced to TMS or residual protic

solvent. Electrospray ionization (ESI) mass spectrometry was performed with a Micromass LCT

at the Mass Spectrometry Facility in the Department of Chemistry at the University of Wisconsin-

Madison.

All procedures were performed in air at ambient temperature (~22 'C) and pressure (1.0

atm) unless indicated otherwise.

141

5.3.2 Chemical Synthesis

0

0 N NHH OH

SI O

11 11i Br

Et3 N, C Br Br

O< S0 N N

DCM H OH

S2

A round-bottom flask containing compound S1 (1.791 g, 5.323 mmol; synthesized as described

previously 324) was dissolved in 60 mL of DCM, and the resulting solution was cooled to 0 'C.

Triethylamine (1.2 mL, 8.6 mmol) and 4-bromobenzenesulfonyl chloride (1.362 g, 5.330 mmol)

were then added, and the reaction mixture was left to stir overnight under an atmosphere of dry

N2(g). After 16 h, the reaction mixture was concentrated under reduced pressure, and the product

was purified by column chromatography (silica, 30% v/v EtOAc in hexanes), resulting in

compound S2 as a white solid (2.543 g, 86%). 'H NMR (400 MHz, CDCl 3, 6): 7.67-7.62 (m, 4H),

7.33-7.22 (m, 5H), 4.64 (d, J= 8.4 Hz, 1H), 3.87-3.76 (m, 3H), 3.11 (d, J= 6.1 Hz, lH), 3.03-

2.84 (m, 4H), 1.91-1.81 (m, 1H), 1.36 (s, 9H), 0.91 (d, J= 6.6 Hz, 3H), 0.88 (d, J= 6.6Hz, 3H).

13C NMR (100 MHz, CDCl 3, 6): 156.2, 137.8, 137.7, 132.5, 129.6, 128.9, 128.6, 127.9, 126.6,

79.9, 72.7, 58.4, 54.9, 53.4, 35.6, 28.4, 27.2, 20.2, 20.0. HRMS-ESI (m/z): [M + Na]+ calcd for

C25H35BrN205SNa, 577.1343; found, 577.1364.

N' N Br

0 N NH OH

S2

KOAc, Pd(dppf)C1 2-CH 2C 2 ,

o 0B-B

Dioxane, 80 0C

B90

'N BNH OH

S3

142

Compound S2 (0.262 g, 0.472 mmol), KOAc (0.139 g,1.416 mmol), bis(pinacolato)diboron

(0.708 g, 2.78 mmol), and Pd(dppf)C12-CH2Cl2 (34.54 mg, 0.0472 mmol) were placed in a dry

Schlenk tube. The reaction flask was then evacuated and backfilled with N2(g) five times. Freshly

degassed 1,4-dioxane (5 mL) was then added, and the reaction mixture was heated to 80 'C and

stirred for 24 h under a N2(g) atmosphere. After 24 h, the reaction was vacuum filtered through a

pad of Celite and concentrated under reduced pressure, and the product was purified by column

chromatography (silica, 30% v/v EtOAc in hexanes), giving compound S3 as a white solid (0.253

g, 89%). 1H NMR (400 MHz, CDCl 3, 6): 7.94 (d, J= 8.2 Hz, 2H), 7.75 (d, J= 8.2 Hz, 2H), 7.3 1-

7.28 (m, 2H), 7.25-7.21 (m, 3H), 4.67 (d, J= 8.6 Hz, 1H), 3.93-3.91 (m, 1H), 3.84-3.81 (m, lIH),

3.78-3.74 (m, 1H), 3.13-3.06 (m, 2H), 3.01-2.81 (m, 4H), 1.89-1.81 (m, 1H), 1.36 (s, 12H), 1.35

(s, 9H), 0.89 (d, J= 6.6 Hz, 3H), 0.86 (d, J= 6.6 Hz, 3H). 13C NMR (100 MHz, CDC13, 6): 156.1,

140.6, 137.9, 135.5, 129.7, 128.6, 126.5, 126.4, 84.6, 79.8, 72.8, 58.6, 54.9, 53.7, 35.5, 29.8, 28.4,

27.2, 25.0, 20.2,20.0. HRMS-ESI (m/z): [M + Na]' calcd for C3IH47BN207SNa, 624.3126; found,

624.3151.

9 1) HCI, DioxaneB- H 0 0 0 H 0 9

0 N NN2) E'SHB

OH S4 OHS3 DCM S5

To a round-bottom flask containing compouind S3 (0.327 g, 0.543 mmol) was added 15 mL of 4.0

M HCl in dioxane. After stirring for 4 h, the reaction mixture was purged with N2(g) to remove

excess HCl(g). Once the evolution of HCL(g) ceased, the reaction mixture was concentrated under

143

reduced pressure and dried overnight under high vacuum. The residue was then dissolved in 10

mL of DCM and placed under an inert atmosphere. Triethylamine (0.38 mL, 2.7 mmol) and

compound S4 (0.147 g, 0.543 mmol; synthesized as described previously 324 ) were then added, and

the reaction mixture was stirred overnight. After reacting for 16 h, the reaction was concentrated

under reduced pressure, and the product was purified by column chromatography (silica, 5% v/v

MeOH in DCM), yielding compound S5 as a white solid (0.293 g, 82%). 'H NMR (400 MHz,

CDCl 3, 6): 7.94 (d, J= 7.6 Hz, 2H), 7.75 (d, J= 7.6 Hz, 2H), 7.30-7.26 (m, 2H), 7.22-7.20 (m,

3H), 5.65 (d, J= 5.1 Hz, 1H), 5.04-4.99 (m, 1H), 4.95-4.92 (m, 1H), 3.96 (t, J= 8.2 Hz, IH),

3.88-3.83 (m, 3H), 3.72-3.68 (m, 2H), 3.63-3.59 (m, 1H), 3.20-3.14 (m, 1H), 3.09-3.05 (m, 1H),

3.02-2.97 (m, 2H), 2.93-2.87 (m, 1H), 2.83-2.78 (m, 2H), 1.87-1.79 (m, 1H), 1.49-1.43 (m, 1H),

1.36 (s, 12H), 1.26-1.24 (m, 1H), 0.93 (d, J = 6.5 Hz, 3H), 0.87 (d, J= 6.5 Hz, 3H). '3 C NMR

(100 MHz, CDCl 3, 5): 155.4, 140.1, 137.5, 135.4, 129.3, 128.6, 126.6, 126.3, 109.3, 84.5, 73.4,

72.7, 70.7, 69.6, 58.0, 55.1, 53.7, 45.3, 35.6, 27.3, 25.8, 24.9, 24.8, 20.1, 19.8; HRMS-ESI (m/z):

[M + NH4] +calcd for C33H51BN309S, 676.3431; found, 676.3440.

HH OH

0,11,-a NaIO 4, NH 4OAc N j

0 H HOH Acetone/H 20 K/H H OHS5 1

A round-bottom flask was charged with compound S5 (0.150 g, 0.228 mmol), which was dissolved

in acetone (10 mL) and H20 (10 mL). The resulting solution was placed under an atmosphere of

dry N2(g), and sodium periodate (0.195 g, 0.911 mmol) and ammonium acetate (70.2 mg, 0.911

mmol) were added. After stirring for 12 h, the reaction mixture was concentrated under reduced

144

nresseir, and the nroiot wvs pirifuar by ncliim-n t-hrnmatngrn- (si1a 20% v/v MeOH in

DCM), giving rise to compound 1 as an off-white solid (0.113 g, 86%). An analytically pure

sample of compound 1 was obtained by reverse-phase HPLC using a preparatory C18 column and

a linear gradient of 10-80% v/v acetonitrile (0.1% v/v TFA) in water (0.1% v/v TFA) over 45 min.

Compound 1 eluted at 38 min and, after lyophilization, was isolated as a white powder. 1H NMR

(400 MHz, methanol-d4, 6): 7.81 (s, 4H), 7.26-7.21 (m, 4H), 7.20-7.15 (in, 1H), 5.59 (d, J= 5.0

Hz, 1H), 4.93 (q, J= 6.4 Hz, 1H), 3.93 (dd, J= 9.9, 6.2 Hz, 1H), 3.84-3.65 (m, 5H), 3.46-3.43

(m, 1H), 3.21-3.10 (m, 2H), 2.99-2.84 (in, 3H), 2.53 (dd, J= 14.3, 10.4 Hz, 1H), 2.07-1.99 (m,

1H), 1.54-1.46 (m, 1H), 1.37-1.32 (m, 1H), 0.94 (d, J= 6.6 Hz, 3H), 0.88 (d, J= 6.6 Hz, 3H). "C

NMR (100 MHz, methanol-d4, 6): 157.7, 140.3, 135.2, 135.1, 130.5, 129.3, 127.4, 127.2, 110.8,

74.6, 74.5, 72.1, 70.6, 58.9, 57.4, 53.9, 46.9, 37.2, 28.0, 27.0, 20.5, 20.4. HRMS-ESI (m/z): [M +

OMe]- calculated for C3 0H4 4BN20 10S, 635.2814; found, 635.2821.

5.3.3 Protein Preparation

Wild-type HIV-1 protease and its D3ON variant were produced heterologously in Escherichia coli

and purified as described previously. 223, 264 This HIV-1 protease had Q7K, L331, L631, C67A, and

C95A substitutions. Protease solutions in 1 mM sodium acetate buffer, pH 5.0, containing NaCl

(2 mM) were concentrated to 4.0 mg/mL (wild-type) or 15 mg/mL (D30N), flash-frozen in N2(l),

and stored at -80 'C. The flash-freezing was done within 30 min of elution from the gel-filtration

column. On the day of an assay, a concentrated stock solution was thawed rapidly, incubated on

ice and diluted with room-temperature assay buffer (50 mM sodium acetate buffer, pH 5.0,

145

containing 0.10 M NaCi) to create a 20x stock solution, which was then used immediately for

assays.

5.3.4 Enzyme Kinetics

5.3.4.1 Instrumentation and Materials

Assays for the catalytic activity of wild-type HIV-1 protease and its D30N variant were

conducted with a fluorogenic substrate, RE(Edans)SGIFLETSK(Dabcyl)R, and an M1000 plate

reader (Tecan) as described previously.2 23 In these assays, all reactions were initiated nearly

simultanteously, that is, within a few seconds. All linear and non-linear regression analyses were

performed with Prism 6 software from GraphPad (La Jolla, CA).

5.3.4.2 Michaelis-Menten Kinetics

Michaelis-Menten kinetics were performed to assess the impact of the D30N substitution

on the catalytic activity of HIV-1 protease. As observed previously, 3 the D30N substitution

decreases the value of kcat by about an order of magnitude, from 7.4 s-1 to 0.50 s-1, and increases

the value of KM slightly, from 15 ptM to 16 ptM. 223 These values were used as parameters in the

fitting of data acquired in the presence of inhibitors. Reaction progress curves were fitted by linear

regression to determine reaction velocities (Figure 5.S lA). Non-linear regression analyses of the

resulting data were performed with the mean reaction velocity, its standard deviation, and the

number of reactions per data point (n) (Figure 5.S1B). Reactions were performed in quadruplicate

(n = 4).

146

5.3.4.3 Inhibition Ki netiCS

The Morrison equation 203-204, 325 was used to evaluate values of Ki that were lower than the

concentration of enzyme used in assays, essentially as described previously.223 In these assays, the

initial fluorescence of assay buffer containing substrate and inhibitor in a combined volume of 190

[pL was measured. Measurement was paused briefly for the addition of the 20x stock solution of

enzyme. Data collected immediately after the addition of enzyme and prior to achieving enzyme-

inhibitor equilibrium were omitted from the analyzed data set. The mean velocity and its standard

deviation (derived from linear regression) at a particular concentration of inhibitor were

normalized to the velocity in the absence of inhibitor. These normalized values were fitted with

the "Morrison Ki" subroutine in Prism 6 software. The Morrison equation can be used to determine

the enzyme concentration, but doing so led to wild-type enzyme concentrations that differed by

>20% from concentrations measured by active-site titration with darunavir. Accordingly, wild-

type enzyme concentrations were set to the value derived from the Michaelis-Menten equation in

the absence of inhibitor, as described previously.223 These values agreed well (<10% variation)

with those measured by active-site titration. In contrast, the D3ON enzyme concentrations provided

by the Morrison equation agreed well (<10% variation) with those measured by active-site titration

with darunavir. Accordingly, D3ON enzyme concentrations were fitted simultaneously with values

of Ki by using the "Morrison Ki" subroutine in Prism 6 software.

5.3.5 Cytotoxicity

MT-4 cells (catalog #120) and darunavir (catalog #11447) were obtained from the AIDS

Reagent Program of the NIAID (NIH). Cells were grown at 37 'C in RPMI 1640 containing fetal

147

bovine serum (10% v/v) and Hyclone antibiotic/antimycotic solution (1% v/v) from GE Healthcare

(Chicago, IL). Cells were plated at a density of 2.5 x 104 cells per 100 pLL in a flat-bottomed, 96-

well plate from Coming (Coming, NY). Concentrated protease inhibitor stocks were prepared at

200 mM in DMF and diluted by >200-fold with final a solvent content of 0.5% by volume in each

condition. After incubation at 37 'C for 48 h, cells were assayed for viability using the CellTiter

96® AQueous One Solution Cell Proliferation MTS Assay from Promega (Madison, WI) by

monitoring absorbance at 450 nm with a GloMax multidetection plate reader from Promega. Each

condition was assayed in triplicate.

5.3.6 X-Ray Crystallography

5.3.6.1 Protein Crystallization

Compound 1 was dissolved in DMF at a concentration of 15 mg/mL. Hanging drop vapor-

diffusion plates were setup by mixing a protease stock solution with the ligand stock solution at

ratios of 1:9 (wild-type:1) or 1:4 (D3ON:1) in 2-pL drops over a reservoir of 100 mM Tris-HCl

buffer, pH 7.4, containing NaCl (200 mM for wild-type; 400 mM for D3ON). Crystals grew within

2-3 days. Crystals were cryo-protected in mother liquor containing 25% v/v glycerol and flash-

frozen in N2 (l).

5.3.6.2 X-Ray Diffraction

Diffraction experiments were conducted at the Advanced Photon Source of Argonne

National Lab with the Life Sciences Collaborative Access Team at sector 21. Frames were indexed,

148

Intrted and scaled with HKL2000 (HKL Researh) Details ofthe diffraction experiment are

listed in Table 5.S1.

5.3.6.3 Structure Solution and Refinement

Molecular replacement (MR) was conducted using Phaser software as implemented in

Phenix using 3nu3 as a reference model with solvent, ligand, and alterative conformations

removed.228 The MR solution was refined with Phenix Refine, and further model building was

conducted with Coot. 229 The atomic coordinates of boronic acid 1 in an idealized conformation

were prepared using WebMO (WebMO, LLC). The final geometry was optimized by the AMI

method and restraints were prepared with eLBOW in Phenix. Compound 1 bound to the wild-type

protease in two antiparallel conformations with unequal occupancy (Figure 5.S2A). The boronic

acid was only well resolved in the major conformation with 0.7 occupancy. Two conformations of

Asp30 were also resolved with major and minor conformers. The conformers of the ligand and

Asp30 that had similar occupancies are believed to be the pair that participate in a hydrogen bond

(Figures 5.1G and 5.S2A). There was sufficient density only in the lower-resolution structure of

the D30N protease complex to place a single molecule of boronic acid 1 and determine the

conformation of Asn30 (Figure 5.S2B). The major conformers in the wild-type structure are nearly

identical to the single conformation in the structure of the D30N protease complex (Figures 5.2G

and 5.2H). Details of the final refined structures are listed in Table 5. S1.

149

5.3.7 Computational Analysis

5.3.7.1 DFT Optimization with Gaussian

Simplified models of the low-barrier hydrogen bonding (LBHB) interactions were

extracted from the X-ray structures of compound 1 bound to wild-type HIV-1 protease and its

D3 ON variant. Average coordinate errors for the structures can be found in Table 5.1. The ligand

was truncated after the sulfonamide, leaving behind two methyl groups on the nitrogen to maintain

sterics. The entire Asp30 or Asn30 residue along with flanking main-chain amide bonds with the

distal C' as a methyl group were included. DFT calculations were used to optimize hydrogens with

Gaussian 16, Revision A.03 software from Gaussian (Wallingford, CT).326 Proton optimization

was conducted at the M06-2X/6-3 11 +G(d,p) level of theory employing the IEFPCM solvation

model.

5.3.7.2 Natural Bonding Orbital Analysis

NBO analysis was conducted with NBO 6.0 software, 296 using the $CHOOSE keyword to

give the Lewis structure where the boronic acid acts as the proton donor (the proton is bonded to

the boronic acid oxygen). When the $CHOOSE keyword is not utilized, the NBO program

generates a low valence proton sandwiched between two oxyanions. NBO analysis shows the

dominant interaction is with the in-plane, predominantly p-type lone pair of the carboxylate

oxygen. The strength of the LBHB interaction is likely largely overestimated at 69.8 kcal/mol

(taken as the 2nd-order perturbation energy minus the steric exchange). The D30N substitution

weakens the interaction to 58.3 kcal/mol. To model attenuation of the carboxylate charge via

external H-bonding, the strength of the interaction with the acid was investigated and found to be

150

40.7 keal/nol The highly covalent nature of this hydrogen bond is apparent. In addition to large

interaction energies, hybridization of the carboxylate lone pair from p to sp3.99 (sp 3-93 for the D30 N

amide) is observed in the NBO analysis. Lone pairs involved in hydrogen bonding on each oxygen

hybridize when $CHOOSE is not used.

5.3.7.3 Atoms in Molecules (AIM) Analysis

To investigate further the covalency of the SSHB, we turned to an alternative description

of chemical bonding-the bond critical point (BCP) concept stemming from the topological

quantum theory of atoms in molecules (AIM) developed Bader. Here, structural elements are

identified by critical points of electron density (Vp = 0), with atoms at maxima and bonds at

minima. The sign of the Laplacian (V 2p) at the BCP gives insights into bonding character, with

covalent bonds showing a negative value and ionic bonds positive. Previous investigations have

found that LBHB have a substantial degree of covalency and in extreme cases can exist as a 2-

center, 3-center bond.3 ' 27

AIMAll (Version 17.01.25) was used for atoms in molecules (AIM) calculations (Table

5.S2). We found a covalent character between the proton and both oxygens, with a calculated

electron density of 0.270 eK-3 and 0.174 e^ 3 at the BCP for the BO-H and BOH - OC bonds,

respectively. Additionally, the V2p was negative for both bonds at -1.38 eA-5 and -0.08 e^-,

respectively. Bond indices were 0.31 and 0.22, respectively. Typical 0... H hydrogen bonds

display bond orders <0.1, p < 0.2 e^ 3 , and positive Laplacian values.327 Thus, we report a short,

asymmetric hydrogen bond with substantial covalent character. The bond in the D30N variant

gives similar values, although the covalency is slightly diminished.

151

5.4 Acknowledgments


predoctoral fellowship. M.J.P. was supported by Molecular and Cellular Pharmacology Training

Grant T32 GM008688 (NIH) and American Heart Association predoctoral fellowship

09PRE2260125. B.G. was supported by an Arnold 0. Beckman Postdoctoral Fellowship. This

work was supported by Grants RO1 GM044783 (NIH) and MCB 1518160 (NSF), and made use

of NMRFAM (University of Wisconsin-Madison), which is supported by Grant P41 GM103399

(NIH). This research used resources of the Advanced Photon Source, a U.S. Department of Energy

(DOE) Office of Science User Facility operated for the DOE Office of Science by Argonne

National Laboratory under Contract No. DE-AC02-06CH11357. Use of the LS-CAT Sector 21

was supported by the Michigan Economic Development Corporation and the Michigan

Technology Tri-Corridor (Grant 085P1000817).

152

5.5 Hydrogen Optimization Coordinates

Wild-Type HIV- 1 Protease H-opt

2.252662003.239154004.338821002.879147003.787691003.195697002.006527004.104311004.376624003.556846005.415608004.064402003.72329800

-5.14279700-5.24254700-5.01267200-4.93421600-4.99906600-5.934297000.559584001.56085700

-3.40535400-3.24347700-2.03415200-0.85659700-0.95695400-2.206390000.460161002.082072001.301968002.708561004.702729003.282810005.002961005.029310003.375396004.610381002.93628400

-5.77261400

-0.622126000.114190000.480363000.288976000.847874002.107283002.40464800

-0.15093900-1.55330000-2.45545100-1.750136002.844520004.07932400-0.399971000.735726000.484126001.988689001.73728200

-0.49749600-3.61106600-1.51851600-0.20819400-1.58515100-2.19963700-1.49815900-0.075894000.52101800

-2.25397700-0.03053600-0.81852900-1.567725001.09645200

-0.166495000.183136002.550038004.831473004.44901800

3.90230900-0.22908800

3.276686002.385971002.819532001.117288000.13252900

-0.47630900-0.33961800-0.98900400-0.47286100-0.755904000.19012900

-1.14662100-1.831904002.033238001.16671100

-0.453539001.76162400

-1.10641600-0.87425200-0.94613300-0.86930700-0.62101900-0.69608700-0.79049300-0.80740500-0.71027600-0.62403800-0.899470004.176084002.781522003.573726000.67047500-1.70546900-1.51185000-1.16676900-1.12171300-2.33967400-2.566167002.90786100

153

H

HH

HHHHHH

H

HH

HF:

-5.50228500-4.11114200

-5.61718600-5.07506100-3.905559001.476512002.47420200

-4.13898700-2.00887000-0.06258900-2.257050002.02704500

-1774.6501795

-1.29879500-0.565428002.152591002.797266002.02556000

-3.90241200-2.00591500-2.20073300-3.282640000.537568001.60540500

-0.13026100

154

1.532355002.370066002.597985001.047427002.14457500

-0.94910500-0.94438700-0.69582600-0.85591300-0.71249800-0.566821000.75827100

2.130450003.122923004.237561002.738151003.627842003.068097001.872796003.815788004.404887005.652155003.751143003.961081003.60732200

-5.16375400-5.33696700-5.05922900-5.09386600-5.04022900-5.990624001.833807000.67580300

-3.39102100-2.19298300-0.91487300-0.74862100-1.94851600-3.186401000.646445004.574788002.918115006.176150006.093556004.578111001.166575001.876775002.572106001.992808004.934199004.500931002.850584003.21443400

-0.683385000.07919400

0.399970000.357029000.887266002.185041002.46825100

-0.11316000-1.43411700

-1.40038800-2.483348002.957408004.20749400-0.468337000.673690000.468091001.897309001.74737400

-0.45708700-1.59797000-3.63479900-0.251439000.48039300

-0.12499600-1.52913700-2.24427900-1.63261600-2.260064000.32773700

-0.22740300-0.54069300-2.252551001.08958300

-0.83388000-0.04808900-1.65124200-0.130113002.691498004.615050004.027241004.92568700

3.268275002.392798002.829036001.148194000.12987600

-0.43967400-0.36044000-1.02318200-0.57497500-0.10038800-0.67465300-1.04330800-1.705279002.016608001.14702600

-0.503535001.76276900

-1.13490700-1.03186700-0.87131800-0.88195700-0.61264200-0.49372600-0.56782000-0.73696700-0.83438900-0.77080900-0.82806100-1.68218000-1.60345000-0.051890000.214942000.627195002.782784000.804409003.508741004.19686600

-1.02315100-2.17066600-2.46940300-0.98405600

155

D310N HII-1I P r ot e ase H- optt

H -5.43906800H -4.12879000H -5.82713500H -5.25551800H -4.07429700H -5.79401500H 2.61656500H -0.17255900H -2.23509200H -0.04674800H -1.95850700H -4.06516600

HF: -1755.2029447

-1.38981800-0.54718800-0.347898002.725047001.977045002.02211400

-2.23063300-4.074524001.558648000.52472300

-3.31934100-2.26481000

1.502207002.371259002.873904001.073508002.168285002.59327000-0.93202500-0.80524400-0.36903200-0.49893700-0.97807700-0.86938500

156

Scheme 5.S1. Route for the Synthesis of Boronic Acid 1. Overall yield: 54% (unoptimized).

Et3N, CI I-& Br0 0

N NH DCM

S1 OH 86%

KOAc, Pd(dppf)C 2-CH 2C 2,

Br 0__B-B B0_9

H Dioxane, 80 0C HH OH 89% H OH

S2

S3

1) HCI, Dioxane

2) Et3N, H ,, 0S4 0

DCM

82% (2 steps)

O

0 , 0,'C

0 N N Na1O 4, NH40AcH H OH Acetone/H 20

S5 86%

OHH

OH

H0 N OH H O

157

Table 5.1

Table 5.1. Values of Ki for Inhibition of HIV-1 Protease

Asp30,T,, 0 0

O NH--- "0O N N'S R

S 2 N OH S2,Asp29 NH H--- 81ZS

R

NH2

darunavir

01

- OH

OH

0

NSI

HN-<

NH2

0

OH

OH

Ki (pM)

10 1

14

12

12.7

10

8.9

0.5 0.3

Relative Affinity'

1.0

1.2

1.3

1.6

1.8

20

a Values of Ki can depend on assay conditions. Here, values are compared by using darunavir

as a benchmark with Relative Affinity = Ki,danavir /Ki,analog as reported in the indicated

reference.

158

Ref.

223

97

97

328

329

328

This work

Tnhe 5.SI

Table 5.S1. Crystallographic Data Collection and Refinement StatisticsComplex Wild-Type D30NPDB code 6c8x 6c8y

Data CollectionX-ray sourceDetectorWavelength, AResolution A (last shell)Space groupa, b, c (A)a,f8, y (0)

No. of ReflectionsNo. Unique ReflectionsRedundancy (last shell)Mean /- (last shell)Completeness (last shell)Rmeas (last shell)Rsym

Rpim

Wilson B-factor

RefinementWorking set (last shell)Test set (last shell)Rwork (last shell)Rfree (last shell)RMS deviation bond lengths (A)RMS deviation bond angles (0)Coordinate error (maximum likelihood

Total number of atomsProtein residuesProteinLigandSolvent

Mean B-factorProteinLigandSolvent

21-ID-DMAR 300 CCD0.9785327.0-1.60 (1.63-1.60)P 2 121258.7, 86.2, 46.29 90, 90437,30930,670 (1507)14.3 (13.2)38.3 (3.5)100 (100)0.095 (0.667)0.092 (0.642)0.025 (0.181)15.5

30,625 (2,779)1,508 (132)0.174 (0.196)0.197 (0.206)0.0080.940.14 A

1867198158788192

18.517.613.428.2

21-ID-FMAR 225 CCD0.9787248.8-1.94 (1.97-1.94)P 2 1212

59.1, 86.4, 45.890, 90,90128,39117,966 (874)7.1 (7.0)13.9 (2.7)100 (100)0.119 (0.671)0.111 (0.621)0.044 (0.250)21.7

17,927 (1,760)877 (88)0.1733 (0.190)0.2203 (0.270)0.0080.920.22 A

1776198156149166

23.122.224.231.5

159

Ramachandran favored, allowed, outliers (%) 100, 0, 0 98.8, 1.0, 0

160

Ramachandran favored, allowed, outliers (%) 100, 0, 0 98.8, 1.0, 0

Table 5.S2

Table 5.S2. Interaction Energies for Boronic Acid 1 and HIV-1Protease Residue 30

HIV- 1 Protease Bond p (eA-3) V2p (eA-5) BondIndex

Wild-type BO .H 0.270 -1.38 0.31BOH OC 0.174 -0.08 0.22

D30N BO --H 0.295 -1.68 0.32BOH- OC 0.172 0.03 0.20

161

Figure 5.1

CAsp3 Asp3O Asp3O

Asp3OAsp3s Asp3O

Asp3 A Asn30

u2.2 At2.2 A

48

162

Figure 5.1. Interactions with a Substrate, Darunavir, or its Analogs and the S2' Subsite of HIV-1

Protease. (A) A substrate (PDB entry lkj7). (B) Darunavir (4hla). (C) Anisole analog (2i4u). (D)

Benzyl alcohol analog (3o9g). (E) Cyclopropyl-amino-benzothiazole analog (5tyr).

(F) Benzamide analog (4i8z). (G) Boronic acid 1 bound to wild-type HIV-1 protease (6c8x). (H)

Boronic acid 1 bound to D3ON HIV-I protease (6c8y). Major conformers are shown for inhibitors

that bound in non-symmetry-related conformations.

163

A

(~..

w *

164

Figure 5.2

Figure 5.2. Orbital Interactions in a Model of Boronic Acid 1 and Residue 30 of HIV-1 Protease

Derived from X-ray Crystal Structures (PDB entries 6c8x and 6c8y). NBO rendering of the

hydrogen bond between a boronic acid hydroxy group and 06 of Asp30 (A) and Asn30 (B) with

hydrogen atoms optimized at the M06-2X/6-311 +G(d,p) level of theory employing the IEFPCM

solvation model.

165

Figure 5.S1

-milE

U U - -

:e*O eas:O 6;

40 pM

30 pM

20 pM

15 pM

10 pM

5 pM

1 pM

A15000-

10000-

U--

5000-

0

C50000-

40000-

L, 30000-

20000-

500 1000 1500

Time (s)2000 2500

0 pM2WPM

40 pM

70 pM*0 ~9 Px0M

YU~AI~1100 pM

2000

B 20-

1 5-

V 110-(nM/s)

0.5-

0.0-

D

1.0-

0.0

F1.0-

V0V0.5-

2500 3000

Time (s)

10 20[Substrate] (pM)

50

[1] (pM)

50

(1] (pM)

166

1100 1200 1300 1400 1500 1600 1700 1800

Time (s)

0 pM

20 pM

40 pM

70 pM90 pM100 pM110 pM

E

D-

.3,0 . . . . . . .

8000-

7000

6000

500

15 0

100

100

,

Figure 5.S1. Raw and Processed Kinetic Data from Assays of HIV-1 Protease Activity. (A)

Reaction progress curves of 5 nM D3ON HIV-1 protease titrated with a fluorogenic substrate. (B)

Non-linear fitting of reaction velocities to the Michaelis-Menten equation provides values of kcat

= 0.50 0.01 s-1 and KM = 16 1 tM (R2 = 0.99). (C) Reaction progress curves of 85 pM wild-

type HIV-1 protease titrated with boronic acid 1 and 10 VM substrate. (D) Non-linear fitting of

reaction velocities to Morrison's equation, providing a value of Ki = 0.5 0.3 pM (R2 = 0.96). (E)

Reaction progress curves of 100 pM D3 ON HIV- 1 protease titrated with boronic acid 1 and 10 pM

substrate. (F) Non-linear fitting reaction velocities to Morrison's equation, providing a value of Ki

= 0.4 0.3 pM (R2 = 0.98).

167

Figure 5.S2

B

168

r

Figure 5.S2. Depiction of Electron Density of HIV- 1 Proteases with Bound Boronic Acid 1. Fo -

Fe density maps resulting from simulated-annealing refinement with the ligand removed and

residues proximal to the boronic acid replaced with glycine were overlayed on final models from

(A) wild-type HIV-l protease (PDB entry 6c8x) and (B) D3ON HIV-l protease (6c8y). Density is

depicted at 3.0a- within 1.5 A of side-chains proximal to boronic acid 1.

169

Figure 5.S3

A

0.6-

0.4-so

0.2-

0. I I I INo Compound 107 104 10-5 104 10-3 Medium Alone

[darunavir] (M)

B

0.6-

0.4-A450

0.2 -

0.0-

4-0

j-~7 4--- 0--

0

I I I I

No Compound 10-7 10- 10-5 10-4

[boronic acid 1] (M)

0

S

-*44-

10-3 Medium Alone

170

Figure 5.S3. Toxicity of Darunavir and Boronic Acid 1 for Human Cells. Graphs show the

absorbance at 450 nm from a tetrazolium dye-based assay of the viability of MT-4 cells (adult T-

cell leukemia) after incubation with darunavir (A) or boronic acid 1 (B) for 48 h. A high A450 value

is indicative of viable cells.

171

5.6 NMR Spectra

'H NMR Spectrum of Compound S2 in CDC13

(4.o 55 9.0 8.5 8.0 7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0

f, (ppm)

13C NMR Spectrum of Compound S2 in CDC1 3

I V

tO 200 190 180 570 160 150 140 530 120 110 (ppm100 90fl (PP-~)

.'/ I %I I \/' V

80 70 60 5o 4b 30 20 10 0

172

0 '10 10 , 400 -lp AW -k - -wo VOMOdA - ipwfw A -. 1 - I-.-.. , I M , 1044 Mb.OIW-

UL(

'H NMR Spectrum of Compound S3 in CDC13

IiiK.0 9.5 9.0 8.5 8.0 7.5 7.0 6.5 6.0 5.5 5.0 4.5

fs (ppm) 2.S 2.0 1.s 1O 0.5 0.0 -0

13C NMR Spectrum of Compound S3 in CDC1 3

I I I

- i~~J59 ~ m..,dh

LO 200 AO d I& 1 ' 140 1 60 ' 10 ' 20 1 60 o10fi (PP-~)

90 so 70 60 so 40 30

173

ofmi inv

also

I~~~1A I

1120 10 0

' ...'

771:7 - -- - - - =-:; I

. M . ! . ! . ! I ! . I I --

4.0o 3.s 3.0

I I

.1II I

H NMR Spectrum of Compound S5 in CDC13

.0 6.5s 9.0 ,8.5 .8.0 .7.5 .7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 .3.0 2.5 20 15 10 05 00 -fi. (PP-)

13 C NMR Spectrum of Compound S5 in CDC1 3

| \~ \ \ -

200 190 180 170 160 150 140 130 120 Oil 100 0fl (ppm)

PR I \

V IiL~

60 70 6b 50 40 30 20 10 0

174

P I i ll -- I ll I 1 -111- 1 - -I

'H NMR Spectrum of Compound 1 in Methanol-d4

V

0 9.5 9.0 8.5 8.0 7.5 7.0 6.5 6.0 5.5 5.0 45fi (ppm)

1 3 C NMR Spectrum of Compound 1 in Methanol-d4

I Iaa se E

I..,'r 'r'' ki-ri.. . . -. . .,-- - - ,,TA-6h.., M

200 19 0 180 170 160 I 150 140 13010

LliD 000 50 80 70 60 50 40 30 20

fl (ppms)

/l"A A I

AiL I Il

a10 -55/ I

175

I 0 1

AMMIE 4w-nnMosskqo-t-" 1 11 6d--, 1-4 A-,,

1.0 0.5 04.0 3.5 3.0 2*5 2.0 1.5

'M I 1 -1 1- .- M PIR r W, qi-- W-Yr ,-r w - , 1 7 - -* - T-, I

90 io 70 6b so 40 30 2010 0

Chapter 6

Benzoxaborolone-A Boronic Acid with Remarkable

Oxidative Stability in Aqueous Solution

Contribution:

I performed kinetic analysis and computational experiments. I contributed to performing kinetic

experiments, designing computational and pH titration experiments, preparing the initial

manuscript, and revising the final manuscript.

This chapter has been prepared for publication as:

Graham, B.J*; Windsor, I.W.*; Gold, B.; Raines, R.T., Benzoxaborolone-A Boronic Acid with

Remarkable Oxidative Stability in Aqueous Solution. In preparation.

*Denotes equal contributions

176

Abstract

Oxidative deboronation can limit the utility of boronic acids in aqueous contexts. We measured

the oxidative stability of phenylboronic acid and two derivatives: benzoxaborole (which has an o-

hydroxymethyl group) and benzoxaborolone (which has an o-carboxyl group). At pH 7.4,

phenylboronic acid and benzoxaborole have similar stability towards oxidation by hydrogen

peroxide. In contrast, benzoxaborolone is 2 x 1 04-fold more stable. Computational analyses reveal

that the resistance to oxidation arises from diminished stabilization of the p-orbital of boron that

develops during the rate-limiting step. Like phenylboronic acid and benzoxaborole,

benzoxaborolone binds to saccharides, albeit with slightly less affinity. These findings establish

benzoxaborolone as a pharmacophore of potential utility in chemical biology, medicinal chemistry,

and related areas.

177

6.1 Introduction

The utility of Suzuki-Miyaura coupling has made boronic acids common reagents in synthetic

organic chemistry. 125,309, 330 The electronic properties of boronic acids also impart a unique ability

to bind covalently, yet reversibly, to proteins and carbohydrates. This attribute has found

widespread application in chemical biology and medicinal chemistry,1 34, 331-333 as well as in

materials chemistry.334-337

Despite its beneficial attributes, only four drugs in clinical use contain boron.33 8 The clinical

implementation of boron began with the proteasome inhibitor bortezomib, which contains an

alkylboronic acid. 339 Arylboronic acids are more stable than alkylboronic acids, and can likewise

form cyclic esters with 1,2- and 1,3-diols. In 2006, Hall reported that benzoxaborole, which is the

cyclic ester of 2-(hydroxymethyl)phenylboronic acid (2-HMPBA), forms more stable esters than

does phenylboronic acid (PBA) (Figure 6.1). 132, 340-341 Subsequently, ixazomib was approved for

the treatment of multiple myeloma and two benzoxaboroles, tavaborole and crisaborole, were

approved for the treatment of fungal infections and dermatitis, respectively. 342

We suspected that the sparsity of boronic acids in the clinic might be attributable to their

having poor metabolic stability. In aqueous solution, boronic acids degrade by two major

pathways: (1) protodeboronation, in which a C-B bond is replaced with a C-H bond, and (2)

oxidative deboronation, in which a C-B bond is replaced with a C-O bond. Protodeboronation

requires an extreme pH, high temperature, or a metal catalyst,343 conditions that are not relevant

physiologically. In contrast, the primary metabolite of boronic acids in vivo is the oxidative

deboronation product, an alcohol,322, 34 reactivity that can be replicated in vitro by using reactive

oxygen species or P450 enzymes.1 41' 345 Boronic acids also react with hydrogen peroxide in

178

cellulZ,346 dA this 1110 1"s been epy in scnensors cf hydrngen nrivde nd ;int~~t, r-L~ LII & "t" L-1" A 113 i~1 sIIIJ re iii c ;on1 sLII I~J'i

the unmasking of potential chemotherapeutic agents.347-349

Herein, we provide the first detailed characterization of the oxidative stability of

phenylboronic acid and two simple derivatives. One is benzoxaborole. The other is

benzoxaborolone, which is the cyclic ester of 2-carboxyphenylboronic acid (2-CPBA) (Figure

6.1). Unlike phenylboronic acid and benzoxaborole, benzoxaborolone has not been utilized in the

context of chemical biology, medicinal chemistry, or materials chemistry.2 10 Our findings

incentivize its use.

179


6.2.1 pKa Values.

An intrinsic property of boronic acids is their pKa value, which is governed by the transition

between the neutral trigonal species and the tetrahedral anionic species (Figure 6.1).350 The pKa

values of phenylboronic acid (pKa = 8.7)133 and benzoxaborole (pKa = 7.3)127 are known. Before

embarking on an analysis of oxidative stability, we sought to determine the pKa values of

benzoxaborolone. An initial potentiometric pH titration revealed that the pKa values of the boronic

acid and carboxylic acid groups of 2-CPBA are extreme. In accord, "B-NMR-titration

experiments showed that a single species is dominant from pH 0.9-12, with a chemical shift

consistent with the monoanion. 2-CPBA was, however, converted quantitatively to the dianion at

high pH, affording a pKa2 value of 13.0 (Figure 6.S1). No 1 B chemical shift consistent with a

neutral species was observed at any pH value, even in 6 M HCl.

To provide further insight, we crystallized 2-CPBA at low pH and used X-ray diffraction

analysis to determine its structure (Tables 6.Sl--6.S7). Crystalline 2-CPBA was a zwitterion in

which the monoanion was protonated on a hydroxyl group (Figure 6.1). Thus, the cyclic ester was

maintained, rather than dissociated into a boronic acid and carboxylic acid. 3 This structure

provides additional evidence for the stability of 2-CPBA species containing a lactone and a

tetrahedral boron atom. We conclude that the predominant form of 2-CPBA in water is its

monoanion (Figure 6.1).

180

6. 2.2 Sus4ceptibility ti Oxidtion.

As an oxidant, we chose hydrogen peroxide. Discovered 200 years ago, 354 hydrogen peroxide is

now known to be the major reactive oxygen species in humans. 355 The oxidation of phenylboronic

acids by aqueous hydrogen peroxide yields a phenol and boric acid.13 2,356-359 As the phenol product

exhibits a significant blue-shift in UV absorbance relative to the boronic acid starting material, we

were able to employ a continuous UV-absorbance assay to monitor the oxidation of PBA, 2-

HMPBA, and 2-CPBA (Table 6.S8; Figure 6.S2).360 We found that the oxidation rates were first-

order with respect to hydrogen peroxide and each boronic acid (Figure 6.S3). The ensuing second-

order rate constants varied with pH (Table 6.S9) and were half-maximal for PBA and 2-HMPBA

when pH = pKa. We found that oxidation rates increase in a pH-dependent manner after and before

a pH-independent regime (Figure 6.2). This observation is consistent with the existence of two

pH-independent rate constants, ki and k2. Whereas the oxidation rates of PBA and 2-HMPBA at

physiological pH are similar to those for biological thiols, the comparative stability of CPBA to

oxidation is extraordinary (Table 6.1).

The reaction mechanism for boronic acid oxidation is consistent with complexation of

peroxide anion to the boron atom, a 1,2-aryl shift to form a borate ester and hydroxide ion, and

hydrolysis of the borate ester to form a phenol and borate salt.357 -359 The nucleophilicity of

hydrogen peroxide in water is high.36' Moreover, the rate constants for oxidation (Figure 6.2) are

much less than those for complexation of ligands to boronic acids (k= 103-10 5 M-1s-1). 3 62 -3 64 Thus,

the complexation of the peroxide is not the rate-determining step in the oxidation reaction. This

conclusion is supported by 'H-NMR spectra, which reveal the accumulation of covalent

intermediates during oxidation (Figure 6.S4).

181

For PBA, 2-HMPBA, and 2-CPBA, the transition from a neutral trigonal state to a

monoanionic tetrahedral state occurs when the pH surpasses the pKa (Figure 6.1). Hence, the

oxidation rate at pH >> pKa is largely that of the tetrahedral species. (The borolone is differentiated

by its existence as a monoanionic tetrahedral species above pH 1.) The second-order rate constants

for the oxidation for the monoanionic tetrahedral states of PBA, 2-HMPGA, and 2-CPBA are listed

in Table 6.2. Whereas the borole moiety provides an order-of-magnitude in resistance to oxidation,

the borolone provides six orders-of-magnitude.

6.2.3 Affinity for Saccharides.

PBA and 2-HMPBA are employed often for their ability to form esters with saccharides. 134 , 331-337

As the affinity of boronic acids for saccharides is generally greatest near their pKa value,35 0 the low

pKai of 2-CPBA portends a low affinity. The results of 'H-NMR titration experiments with D-

fructose, D-glucose, and N-acetylneuraminic acid (Neu5Ac) are consistent with this expectation

(Table 6.3). The loss in affinity of 2-CPBA for saccharides is, modest (~4-fold versus PBA and

- 13-fold versus 2-HMPBA) compared to the gain in oxidative stability (Table 6.1).

6.2.4 Computational Analyses.

We employed computational analyses to elucidate the physical origin of the remarkable oxidative

stability of 2-CPBA. Specifically, we performed density functional theory calculations to model

stationary points along each oxidation reaction coordinate, enabling energetic comparisons with

the pH-independent rate constant, kmonoanion (Table 6.2). We began by modelling the requisite

tetrahedral intermediate of each boronate with hydrogen peroxide and the boric ester products

182

(h 1 Tab1l'c Q1 () an Q 1 1 ; ire' . 5) Next we c1a1cnted transit;on stafte strucituires c'fI .3. A , X %1 3 " XJ'J tI. L W. " I% " L I% X.J ,..) JI VV %L4XiWL4& 4 L XX x "&" LX.I'&

the oxidation reaction, which replicated the results of similar systems (Figure 6.3).358 The

calculated relative rate constants for oxidation match exceptionally well to the experimental ones

(Table 6.2). Optimized transition state structures show a concerted mechanism in which the 1,2-

aryl shift from boron to the proximal peroxide oxygen is concomitant with peroxide

oxygen-oxygen bond cleavage, in agreement with previous mechanistic work.357' 365-366 As the

reaction progresses, the tetrahedral boronate is transformed into a planar, trigonal boric acid.

We turned to Natural Bonding Orbital (NBO) analysis to reveal interactions that are

responsible for the increased activation energies of 2-HMPBA and, especially, 2-CPBA (Table

6.S 12). In PBA, a p-type lone pair (nop) of each hydroxyl groups stabilizes the developing p-

orbital on boron (pB) via a classical (hyper)conjugative no--pB z-bonding interaction (Figure

6.4A). These interactions are absent in the starting tetrahedral boronate (where a much weaker

no-*u*B-C interaction of 8.0 kcal/mol is present) but are worth 35.2 kcal/mol in the transition state

and 69.3 kcal/mol in the product.

Intramolecular boronic ester formation affects both the donor abilities of the no,p lone pairs

and no,p-PB orbital overlap, resulting in a substantially weaker n,p-+PB interaction of 20.4

kcal/mol in the transition state for 2-HMPBA (Figure 6.4B). Despite the presence of only a

relatively weak C-H u-acceptor, a significant nO,p-+-U*C-H anomeric effect of 9.8 kcal/mol is

present in the transition state for 2-HMPBA (Figure 6.4D), reducing the no,p--+PB interaction. The

bulk of the -15 kcal/mol decrease in the no,p+PB interaction, arises from a decrease in orbital

overlap due to geometric constraints enforced by the 5-membered ring. Despite the large decrease

in stabilization of the developing empty p-orbital on boron, the rate of 2-HMPBA oxidation is only

183

slightly less than that of PBA, perhaps because of strain-relief upon ring-expansion, reflected in

the more exergonic reaction (Table 6.S 11).

Much more dramatic effects are observed during the oxidation of 2-CPBA. There, the

no,pPB interaction is diminished to 4.0 kcal/mol. More importantly, the no,p-*PB interaction

competes with no,p-,r*c=o ester resonance of 66.3 kcal/mol (Figure 6.4E). In addition to the large

decrease in its donor ability, the no,p lone pair is almost orthogonal to the pB orbital that develops

in the transition state. As a result, donation from the other lone pair, which resides in a lower-

energy sp2.2 orbital, is worth 9.1 kcal/mol (Figure 6.4D). Altogether, the transition state for 2-

CPBA oxidation is -7 kcal/mol greater in energy than that for PBA oxidation, suggesting that

strain-relief also contributes to the energetics of the 2-CPBA transition state.

We note that analogous arguments regarding donation to the PB orbital have been proposed

to be important in protodeboronation pathways.8,343,367 Indeed, experimental data indicate similar

trends in protodeboronation and oxidative deboronation pathways, with 2-CPBA being

significantly more stable than PBA or 2-HMPBA near physiological pH (Figure 6.S6).

184


6.3.1 General

Commercial reagents were used without further purification. Phenylboronic acid and

2-carboxyphenylboronic acid were from Combi-Blocks (San Diego, CA). 2-

(Hydroxymethyl)phenylboronic acid and aqueous hydrogen peroxide (30% v/v) were from

Sigma-Aldrich (St. Louis, MO).

NMR spectra were obtained with an Avance-400 spectrometer from Bruker (Billerica,

MA). The pH of buffers (10 mM) was determined with an Accumet XL50 pH-meter from Fischer

Scientific (Hampton, NH), calibrated with a pH 3.00 reference standard buffer from VWR

International (Radnor, PA) and pH 12.00, 12.45, and 13.00 reference standard buffers from Ricca

Chemicals (Arlington, TX).

All procedures were performed in air at ambient temperature (~22 'C) and pressure (1.0

atm) unless indicated otherwise.

6.3.2 pKa Determination

A 100 mM solution of 2-CPBA was prepared in 10 mL of 1 M NaOH containing D 20 (5% v/v).

The pH of the solution was measured and adjusted with 10 M NaOH and concentrated HCl, and

0.65-mL samples were transferred at the indicated pH values to 5-mm quartz NMR tubes from

Norell (Morganton, NC)) and analyzed by 'H and "B NMR spectroscopy, referencing to the

solvent peak. For the samples in 6 M DCl in D20 and 5 M NaOH in D20, the pH was outside of

the useful range of the pH probe. The acidic sample gave poor referencing from the solvent peak

due to an extreme chemical shift of the water peak, and was referenced to an external standard of

185

BF3 Et2 0 using a sealed melting-point capillary filled with the reference compound and placed

inside the NMR tube.

6.3.3 X-Ray Crystallography

Crystals of 2-CPBA grew out of the extremely acidic conditions (6 M DCl) in deuterium oxide

employed during NMR-titration experiments. Crystals were roughly 0.5 mm x 0.5 mm x 0.1 mm.

Crystals were mounted in mineral oil. X-ray diffraction data were collected from a single crystal

using a Bruker D8 Venture Kappa Duo diffractometer equipped with a Bruker Photon2 CPAD

detector and MoKa radiation (k = 0.71073 A). Diffraction data were reduced with SAINT software

from Bruker. The structure was solved with SHELXT and refined with SHELXL. Details

regarding the refinement and quality of the final structure are listed in Table 6.S 1.

6.3.4 Chemical Kinetics

6.3.4.1 Instrumentation and Materials

The oxidation reactions were monitored by UV absorbance using a Cary 60 UV-vis

spectrophotometer from Agilent. Some extinction coefficients were determined with an M1000

plate reader from Tecan. The extinction coefficient of PBA in PBS was found to be identical in

both instruments. Buffers were degassed prior to use. Measurements were performed with 1.0-mL

sample volumes in disposable, plastic, UV-transparent cuvettes. All measurements were

performed in triplicate (at least). Stocks of boronic acids and phenolates were prepared by

dissolving the compounds in buffer, assisted by sonication if necessary. All stock solutions were

prepared on the day of their use.

186

6.3. 4.2 Wavelength Optimization and Calibration

Initial wavelength scans in the range of 250-350 nm revealed a bathochromic shift between

boronic acids and phenolate (exemplified at pH 7.4 in Figure 6.S1). Wavelengths with optimal

signal-to-noise ratios were determined from these spectra and used for single-wavelength kinetic

assays. The hydrogen peroxide used as an oxidant also absorbs light at these wavelengths.

Extinction coefficients for the two reactants and the product were determined at each pH (Table

6.S8). Extinction coefficient for the oxidation of boronic acids were calculated as the difference in

absorbance between the product and substrates with the equation:

Acoxidation = phenol - (Eboronic acid - &hydrogen peroxide) (6.1)

Initial boronic acid concentrations were <1 mM, and the final pH of reaction mixtures was

confirmed not to change by more than 0.1 units.

6.3.4.3 Initial Velocity Kinetics

Initial velocity kinetic assays were performed by measuring the UV absorbance of 990 [tL of a

boronic acid solution and then initiating the reaction by the addition of 10 tL of a 100 x hydrogen

peroxide stock solution. Rates were measured in triplicate. Slopes from the first 10% of the

reaction (or from 10 min of a slow reaction: kobs < 10-2 M-s-1) were calculated and converted to

units of M/s by dividing by the value of Acoxidation. Reactions were slow enough not to require a

stopped-flow apparatus, but the initial velocity (vo) for the fastest reactions (kobs ;> 1 M's-1) could

only be calculated from a few dozen data points. Rates at three different concentrations of boronic

187

acid and hydrogen peroxide were measured and determined to be first-order with respect to both

substrates (Figure 6.S3). Second-order rate constants were determined with the equation:

kobs - V" (6.2)[boronic acid] [hydrogen peroxide]

6.3.4.4 Evaluation ofpH-Dependence

The pH-dependence of oxidation reactions was assessed by using the buffers listed in Table 6.S8.

Data were fitted to the equation:

kobs =ki +k2(1+1OpKapH) (6.3)

with pKa values shown in Figure 6.1. Reactions conducted at pH > 11.7 (which is the pKa of

hydrogen peroxide) engendered the evolution of gas. These rates, which were slower than

expected, were omitted from the analysis, though the relative rates of the different boronic acids

at pH >11.7 fitted the expected trend.

6.3.5 Saccharide Binding

The affinity of boronic acids to saccharides was assayed by 1H-NMR-titration experiments as

described previously.368 Due to its low affinity, D-glucose was used at twice the concentrations of

other saccharides.

188

%J-' -J %-%J L L"L-Lk'JL'" vJL "t.Y 1 3%10J

6.3.6.1 DFT Optimization with Gaussian

Optimizations were performed using Gaussian 16 software from Gaussian at the M06-2X/6-

31 1+G(d,p) level of theory and employing the IEFPCM solvation model for water.326 Frequency

calculations were performed to confirm each stationary point as a minimum or first-order saddle

point. Energies for each structure were determined with the freq keyword and are listed in Table

6.S10.

6.3.6.2 Natural Bonding Orbital Analysis

Natural Bonding Orbital (NBO) analysis of the optimized structures were conducted with NBO

6.0 software. 29 6 Interaction energies and the off-diagonal NBO Fock matrix element, which is

proportional to the overlap integral, are listed in Table 6.S 12.

189

6.4 Conclusions

The metabolic stability of a compound is a critical determinant of its utility in chemical biology

and medicinal chemistry. We have found that the oxidative stability of boronic acids can be

enhanced greatly by the formation of a borolone. The endowed stability results from

stereoelectronic constraints imposed by the ring and carbonyl group. The ensuing "twisting" and

"pulling" renders stabilization of the developing p-orbital on boron much less efficient than in the

more reactive phenylboronic acid and benzoxaborole. This functional group, which retains affinity

for saccharides and tightly clusters many hydrogen-bond donating and accepting groups, could

have a broad range of applications.

6.5 Acknowledgments


predoctoral fellowship. B.G. was supported by an Arnold 0. Beckman Postdoctoral Fellowship.

This work was supported by Grant RO1 GM044783 (NIH). We are grateful to Dr. Peter Mtller

(Department of Chemistry, Massachusetts Institute of Technology) for determining the crystal

structure of benzoxaborolone.

190

Scheme 6.1. Boronic Acid Oxidation Pathway

HO 0 HR 01HHOB' 'R HOOH 01

OH - OH HO, PH1- 01'3 '0 0'B

HO' 00 H'2Computationally Investigated Pathway

R = H, CHraryl, CO-aryl

191

Table 6.1

Table 6.1. Rate Constants for the Oxidation of Boronic Acids and Biological Thiols at pH ~7.4

Compound kobs (M-s-')

PBA 2.4

2-HMPBA 2.4

2-CPBA 0.00015

Cysteinea 2.9

N-Acetylcysteinea 0.16

Glutathionea 0.89

aValues are from ref. 230, 369.

192

T able 6.2

Table 6.2. Experimental and Computational Kinetic Parameters for the Oxidation of BoronicAcids

experimental computational

kmonoaniona kmonoanion AGI k

(M-s-1) (rel) (kcal/mol) (rel)

PBA 49 1 27.0 1

2-HMPBA 4.2 0.12 28.3 0.085

2-CPBA 0.00016 0.0000033 34.3 0.0000034

aValues of kmonoanion (which is k2 for PBA and 2-HMPBA, and ki for 2-CPBA) are derived from

the data in Figure 6.2 and refer to the monoanionic species depicted in Figure 6.1.

193

Table 6.3

Table 6.3. Values of Ka (M-1) for Boronic Acids and Saccharides

D-fructose D-glucose Neu5Ac

PBAa 128 20 5 1 13 1

2-HMPBAa 336 43 28 4 43 5

2-CPBA 23 2 1.7 0.1 6.0 0.5

aValues are from ref. 368. bEach value is the mean standard deviation (SD) for >5

measurements in 0.10 M sodium phosphate buffer (pH 7.4) containing 2% (v/v) D20.

194

Thi. A.Q1

Table 6.SL Crystal Data and Structure Refinement

Identification code

Empirical formula

Formula weight

Temperature

Wavelength

Crystal system

Space group

Unit cell dimensions

Volume

Z

Density (calculated)

Absorption coefficient

Fooo

Crystal size

Theta range for data collection

Index ranges

Reflections collected

Independent reflections

Completeness to theta = 25.242'

Absorption correction

Refinement method

Data / restraints / parameters

Goodness-of-fit on F2

Final R indices [I> 2sigma(I)]

R indices (all data)

Extinction coefficient

Largest diff. peak and hole

P18065 (In-house, CCDC ID pending)

C7 H7 B0 4

165.94

100(2) K

0.71073 AMonoclinic

P21/c

a = 13.3947(5) Ab = 7.9080(3) Ac = 7.0529(2) A

743.84(4) A3

4

1.482

0.119

a 90'

b = 95.3376(12))

g= 900

Mg/m 3

mm'

344

0.450 x 0.400 x 0.080 mm 3

1.527 to 36.330'

-22 < h < 22, -13 < k < 13, -11 < l < 11

51761

3598 [Rint = 0.0680]

99.9%

Semi-empirical from equivalents

Full-matrix least-squares on F2

3598/3/ 118

1.072

R1 = 0.0365, wR2 = 0.0990

R1 = 0.0388, wR2 = 0.1012

n/a

0.629 and -0.320 e.A-3

195

Table 6.S2

Table 6.S2. Atomic Coordinates (x 104) and Equivalent Isotropic Displacement Parameters

(A2 x 10') for P18065. Ueq is defined as one third of the trace of the orthogonalized Uij tensor.

Atom x y z Ueq

B(1) 1461(1) 3951(1) 4044(1) 9(1)

0(1) 1110(1) 2160(1) 3907(1) 10(1)

0(2) 1382(1) 4478(1) 5992(1) 11(1)

0(3) 741(1) 5015(1) 2725(1) 9(1)

0(4) 796(1) 7021(1) 518(1) 13(1)

C(1) 2511(1) 4345(1) 3192(1) 9(1)

C(2) 2315(1) 5593(1) 1804(1) 10(1)

C(3) 3055(1) 6373(1) 848(1) 13(1)

C(4) 4041(1) 5863(1) 1323(1) 15(1)

C(5) 4263(1) 4622(1) 2713(1) 16(1)

C(6) 3506(1) 3857(1) 3654(1) 13(1)

C(7) 1227(1) 5960(1) 1592(1) 9(1)

196

Table 6. S3

Table 6.S3. Anisotropic Displacement Parameters (A 2 x 10) for P18065. The anisotropic

displacement factor exponent takes the form: --2u2[h2a* U1 + ... + 2hka*b* U12].

Atom U 1 U2 2 U33 U23 U13 U12

B(3) 10(1) 8(1) 9(1) 1(1) 3(1) 0(1)

0(l) 11(l) 8(l) 10(l) 0(1) 4(l)-()

C(2) 15(1) 9(1) 9(1) 0(1) 5(1) 1(1)

0(3) 9(1) 9(l) 10(1) 2(l) 3(l) 0(1)

C(4) 13(1) 11(1) 14(1) 4(1) 3(1) 2(1)

C(7) 9(1) 8(1) 10(1) 0(1) 3(1) 0(1)

C(2) 10(l) 9(1) 10(l) 1 (1) 4(l) 0(l)

C(3) 13(l) 13(l) 15(l) 3(l) 6(l) -1 (1)

C(4) 11(1) 16(l) 19(1) I(1 7(l) -2(l)

C(5) 10(l) 19(1) 19(1) I(1 4(l) 1 (1)

C(6) 10(l) 16(l) 14(l) 3(l) 2(l) 2(l)

C(7) 10(l) 8(l) 10(l) 0(l) 3(l) 0(l)

197

Table 6.S4

Table 6.S4. Bond Lengths for P18065.

Atom Atom Length (A)

BI 02 1.4487(7)

BI 01 1.4934(7)

BI 03 1.5284(7)

BI C1 1.6095(8)

01 HA 0.927(10)

01 HIB 0.890(10)

02 H2 0.819(10)

03 C7 1.3109(7)

04 C7 1.2377(7)

Cl C6 1.3964(8)

Cl C2 1.3976(8)

C2 C3 1.3931(8)

C2 C7 1.4797(8)

C3 C4 1.3914(9)

C3 H3 0.9500

C4 C5 1.3996(9)

C4 H4 0.9500

C5 C6 1.3996(8)

C5 H5 0.9500

C6 H6 0.9500

198

Table 6 .S5

Table 6.S5. Bond

Atom Atom

02 BI

02 BI

01 BI

02 BI

01 BI

03 BI

BI 01

BI 01

HIA 01

BI 02

C7 03

C6 Cl

C6 Cl

C2 Cl

C3 C2

C3 C2

Cl C2

C4 C3

C4 C3

C2 C3

C3 C4

C3 C4

C5 C4

C6 C5

C6 C5

C4 C5

Cl C6

Cl C6

C5 C6

Angles

Atom

01

03

03

Cl

Cl

Cl

HIA

HIB

HIB

H2

BI

C2

BI

BI

Cl

C7

C7

C2

H3

H3

C5

H4

H4

C4

H5

H5

C5

H6

H6

for P18065.

Angle (0)

106.48(4)

108.72(4)

107.76(4)

116.22(5)

116.14(4)

100.97(4)

118.5(7)

111.3(7)

108.4(10)

110.6(8)

111.30(4)

118.09(5)

135.00(5)

106.54(4)

123.81(5)

127.02(5)

109.14(4)

117.17(5)

121.4

121.4

120.46(5)

119.8

119.8

121.27(5)

119.4

119.4

119.19(5)

120.4

120.4

199

C7 03 121.97(5)

04 C7 C2 126.50(5)

03 C7 C2 111.53(5)

200

04

Table 6.S6

Table 6.S6. Hydrogen Coordinates (x 104 ) and Isotropic

Displacement Parameters (A 2 x 103) for P18065.

Atom x y z Ueq

H(1A) 1204(8) 1554(14) 2813(14) 14

H(1B) 468(8) 2077(14) 4129(15) 14

H(2) 1207(9) 5469(13) 6029(16) 16

H(3) 2893 7217 -89 16

H(4) 4567 6361 700 18

H(5) 4940 4292 3024 19

H(6) 3667 3017 4596 16

201

Table 6.S7

Table 6.S7. Hydrogen Bonds for P18065.a

D-H A dD-H (A) dH.- -A (A) dD. A (A) LDHA (0)

O(1)-H(1A)- -O(2)#1 0.927(10) 1.559(10) 2.4860(6) 178.9(11)

O(1)-H(1B) 0(3)#2 0.890(10) 2.569(11) 3.1332(6) 122.0(9)

O(l)-H(lB) - 0(4)#2 0.890(10) 1.734(10) 2.6230(6) 176.7(11)

O(2)-H(2) O(4)#3 0.819(10) 2.082(10) 2.8885(6) 168.0(11)

aSymmetry transformations used to generate equivalent atoms:

#1: x, -y +,/2 z - '2; #2: -x, y -,/2 -z + /2; #3: x, -y +3 /2, Z + '/2

202

Table 6. S8

Table 6.S8. Values of Ac During Oxidation Reactions (M-'cm-')

pH Buffer PBA 2-HMPBA 2-CPBA

3.2 sodium citrate 1069 2 1336 3 3386 15

5.0 sodium acetate 962 2 1448 3 3364 9

6.2 sodium citrate 816 5 1521 7 2830 7

7.3 sodium phosphate 1220 8 1474 9 3332 10

7.3 PBS 1029 33 1516 35 2568 266

9.0 sodium borate 1210 6 2025 9 3147 46

10.5 sodium citrate 1791 9 1940 9 3469 40

203

Table 6.S9

Table 6.S9. Second-Order Rate Constants for the Oxidation of Boronic Acids (M-ls-1)a

pH PBA 2-HMPBA 2-CPBA

3.2 (1.3 0.1) x 10-2 (3.8 0.2) x 10- 3 (2.2 0.1) x 104

5.0 (1.2 0.01) x 10-2 (2.4 0.1) x 10-2 (1.7 0.1) x 1 0 -4

6.2 (1.8 0.01) x 10-' (2.0 0. 1) x 10-1 (1.4 0. 1) x 10-4

7.3 2.4 0.2 2.4 0.2 (1.5 0.1) x 10-4

7.3a 6.5 0.2 3.5 0.1 (2.8 0.5) x 10-4

9.0 24 3 4.5 0.4 (7 .1 2 .6 ) x 10-4

10.5 53 1 4.7 0.2 (4.1 0.1) x 10-2

aEach value is the mean SD from initial-velocity experiments performed in triplicate

with at least three different concentrations of boronic acid and peroxide. bDetermined

in PBS.

204

TabMe I 1m A

Table 6.S10. Calculated Energies of Starting Materials, Transition States,

and Products (Hartrees)

Structure HF Zero Point H G ImaginaryFrequency

PBASM -559.3340606 -559.192210 -559.181182 -559.227363 -

PBATS -559.2857740 -559.147518 -559.135260 -559.184357 748.1166

PBAProd -483.4941140 -483.364085 -483.354294 -483.399097 -

2-HMPBA _SM -597.4252050 -597.275288 -597.263948 -597.311131 -

2-HMPBA _TS -597.3757670 -597.229576 -597.217704 -597.26600 743.2438

2-HMPBAProd -521.5891240 -521.450972 -521.441425 -521.485137 -

2-CPBA _SM -671.4918544 -671.360385 -671.348323 -671.397383 -

2-CPBA _TS -671.4329050 -671.305185 -671.292760 -671.342679 757.4783

2-CPBAProd -595.6372030 -595.517838 -595.507956 -595.552297 -

205

Table 6.S11

Table 6.S11. Calculated Free Energies ofActivation and Reaction (kcal/mol)

Boronic Acid AG+ AGreaction

PBA 27.0 -61.0

2-HMPBA 28.3 -62.4

2-CPBA 34.3 -50.4

206

T tl 6.1

Table 6.S12. Boron-Oxygen and Competing Donor-Acceptor Interaction Energies

Boronic Acid Donor Acceptor E2 (kcal/mol) F1

PBA noip PB 35.2 0.118n02,p PB 37.4 0.122noi,p PB 20.4 0.089

2-HMPBA n02,p PB 43.9 0.131noi,p U*C-H 9.8 0.080

noi,sp2.2 PB 9.1 0.069

noi,p PB 4.0 0.0392-CPBA n02,p PB 47.7 0.136

noi,sp2.2 r*C=O 3.3 0.057nO,p x*C=O 66.3 0.151

207

HO'B.OH + H20+

Hg, OHHO-B-

APhonylboronic Acid (PSA) monoanlon

HO ,BOH HOl *H 20 HO- H20 B-0 H. HOpo

OH pN x73

2-(Hydmxym-thyl)phen lbornlc Acid Benzoxaborole monoanlon(244MP8A)

HO, OHaB 0 - H2 0

~fOH

2-Carboxyphonwbomrnlc Add(2-CPSA)

H O ,H + H20

H O

dianlon

HO, B-0

Benoxaboroloe 81

t

0

3

+H* OilC 2 C1 C7

HO C K

onC4 C3

monoonlon

208

Figure 6.1

Figure 6.1. Structure of Phenylboronic Acid (top), Benzoxaborole (middle), Benzoxaborolone

(bottom), and Hydration and Protonation States that are Relevant in Aqueous Solution. pKa values

are given for phenylboronic acid,1 33 benzoxaborole,' 2 7 and benzoxaborolone (this work). The

ORTEP diagram of the protonated monoanion was determined by X-ray crystallography.

209

PBA

2-HMPBA

physiological pH

- 2-CPBA*

3 4 5 6 7 8pH

9 10 11 12

210

Figure 6.2

102

10

100

10

10-4

2

Figure 6.2. pH-Dependence of the Rate Constant for the Oxidation of Boronic Acids by Hydrogen

Peroxide. Data were fitted to eq 3 with pKa values in Figure 6.1 to give ki = (1.1 0.2) x 10-2 M-

Is- and k2= 49 4 M--'s- 1 for PBA; ki = (3.4 0.4) x 10-' M-'s-1 and k2 = 4.2 0.2 M-'s-' for 2-

HMPBA; and ki (1.6 0.1) x 10-4 M-'s-1 and k2 = 9.7 1.3 M-1s-' for 2-CPBA.

211

Figure 6.3

B C

p

6

212

Figure 6.3. Calculated Structure of the Transition State for the 1,2-aryl Shift During the Oxidation

of (A) PBA, (B) 2-HMPBA, and (C) 2-CPBA by Hydrogen Peroxide. Structures were optimized

at the M06-2X/6-31 1+G(d,p) level of theory using the IEFPCM solvation model.

213

Figure 6.4

A tno

B no,

PBA

B

2-HMPBA

CD

n.2-CPA2kc-HMmP l

2-CPBA 2-HMPBA

nokc.Vmo

2-CPBA

214

Figure 6.4. Images of Key Orbitals in the Transition State for the 1,2-aryl Shift During the

Oxidation of PBA, 2-HMPA, and 2-CPBA. Images were created with NBOView 1.1. The energy

is for the depicted donor-acceptor interaction (Table 6.S12). (A) A hydroxyl oxygen of PBA

donates electron density from a p-orbital into the vacant p-orbital of boron. (B) The bridging

oxygen of 2-HMPBA donates less electron density from its p-orbital than in panel A. (C) The

bridging oxygen of 2-CPBA donates still less electron density from its sp2.2 orbital than in panels

A or B. (D) The bridging oxygen of 2-HMPBA engages in a competing interaction with the U-*

orbital of a benzylic C-H bond. (E) The bridging oxygen of 2-CPBA donates strongly into the

Z* c=o orbital (i.e., ester resonance); its p-orbital is nearly orthogonal to the p-orbital of boron.

215

Figure 6.S1

HO B OHo XOH

e dianion

SF JEt Of

5 M NsOH

pH-13.0

\\ pH=12.8

pH=12.7

pH-12.6

pH=12.0

\\ pH-10 4

pH=8.8

pH-7.3

pH-G I

pH-5,8

pH=4.8

pH-3.2

f1 (ppm)

216

Figure 6.S1. Representative 11B-NMR Spectra Used to Determine the pKa Values of 2-CPBA. A

solution of 2-CPBA (0.10 M) was prepared in 10 mL of 1 M aqueous NaOH containing D20 (5%

v/v). The solution pH was adjusted with 10 M NaOH or concentrated HC1, and 650 PL samples

were removed at the designated pH values and analyzed by 1 B NMR spectroscopy, referencing to

the solvent peak except in the case specified. The pKa was calculated by fitting the fraction of

dianion/monoanion to the expected speciation based on pH. The shift observed in the low pH

samples is likely affected by a strong shift in the solvent reference peak, as the sample with external

reference under extremely low pH conditions does not show the same shift.

217

Figure 6.S2

300Wavelength (nn

- PBA.--- phenol- 2-HMPBA- - -- 2-hydroxymethylphenol

2-CPBA-- -- sialic acid

350

1.0 mM PBA + 5 mM H202

0.5 mM 2-HMPBA + 5 mM H202

1.0 mM 2-CPBA + 25 mM H2O2

2Time (min)

4 6

218

I' ~./I - 'a

A

CD

cc0

B

20Ch

5-

4-

3-

2-

1 -

0-2

1.5 -

1.0-

0.5-

i0

0.0 00

Figure 6.S2. UV-Spectroscopic Basis for Assays of the Oxidation of Boronic Acids. (A) UV

spectra of PBA, 2-HMPBA, 2-CPBA and their phenolic oxidation products. (B) Representative

raw data showing the change in absorbance upon oxidation in PBS buffer.

219

Figure 6.S3

35-

30

25-C(D

20-

0.4

PBA Oxidation (5 mM H 2 0 2)

R2 = 1.000

0.6 0.8 1.0[PBA] (mM)

2-HMPBA Oxidation (5 mM H 20 2)

R2 = 0.9981

0.25 0.35 0.45 0.55[2-HMPBA] (mM)

40-

-30

20-0

Q.10 -

0

PBA Oxidation (1 mM PBA)

2 = 0.9999

2 4 6

[H 202] (mM)

,2-HMPBA Oxidation (0.5 mM 2-HMPBA)

.2 18-

o 16

E 12 R2 = 0.9994

02 10

)1%

6 8[H202] (mM)

10

2-CPBA Oxidation (50 mM H202)

R2 = 0.9982

2-CPBA Oxidation (1 mM 2-CPBA)

R2= 1.000

0

CL>4

10

30 50

[H202] (mM)70

220

0

CL

E

0

10-

8-

6-

A.

0

CL

A)

15-

10-

5-

0.2 0.4 0.6 0.8 1.0[2-CPBA] (mM)

Figure 6.S3. Kinetic Traces of the Oxidation of Boronic Acids by Hydrogen Peroxide at pH 7.4.

Second-order rate constants derived from these data are listed in Table 6.S 1.

221

Figure 6.S4

A

8 .5 8.0 7.5 7.0 6 .5 6.0 5.5

f1 (ppm)50 4.5 4.0 3.5 3.0 2.5

223

72 h

Phenol

45 min

Initial

72 h

B

8.4 8.2 8.0 7.8 7.6 7.4 7.2 7.0 6.8 6.6 6.4 6.2 6.0 5.8 5.6 5.4 5.2 5.0

f1 (ppm)4.8 4.6 4.4 4.2 4.0 3.8 3.6 3.4 3.2 3.0 2.8 2.6

224

o-Hydroxymethylphenol

45 min

Initial

.....................

8.0 7.5 7.0 6.5 6.0

fi (ppm)

5.5 5.0 4.5

225

C

72 h

Salicylic Acid

45 min

Initial

4.0 3.5

AA 2-CPBA

v Salicylic Acid

1 Week

1 Day

-42 Minutes

-40 Minutes

~38 Minutes

-36 Minutes

-34 Minutes

-32 Minutes

~30 Minutes

-28 Minutes

~26 Minutes

~24 Minutes

-22 Minutes

-20 Minutes

-18 Minutes

-16 Minutes

-14 Minutes

~12 Minutes

-~10 Minutes

-8 Minutes

-6 Minutes

-4 Minutes

~2 Minutes

Initial

8.3 8.2 8.1 8.0 7.9 7.8 7.7 7.6 7.5 7.4 7.3 7.2 7.1 7.0 6.9 6.8 6.7 6.6 6.5

f1 (ppm)

226

Figure 6.S4. 1H-NMR Spectra Acquired During the Oxidation of Boronic Acids: (A) PBA, (B)

2-HMPBA, and (C) 2-CPBA. Solutions (50 mM) were prepared in PBS. A solution of sodium

bisulfite (0.5 M) in D20 was prepared as to quench the oxidation reaction. A 2-mL aliquot of the

boronic acid solution was placed in a 1-dram vial, and 100 pL was added to 600 pL of the quench

solution to provide an initial sample. A 50-ptL aliquot of 30% (v/v) hydrogen peroxide solution

was added to each boronic acid solution and the resulting solutions were stirred for 45 min. A 100-

[tL aliquot of the reaction mixture was added to 600 [IL of the quench solution, and the samples

were analyzed by 'H-NMR spectroscopy. Solutions of phenol, 2-hydroxymethylphenol, and

salicylic acid were subjected to the same quench and analysis. The reaction mixtures were

incubated for 72 h and another aliquot was quenched and analyzed. No 1H-NMR peaks

corresponding to oxidation of the phenols were observed. (D) Reaction conditions: 0.99 mL of 2-

CPBA (40 mM) in 0.10 M sodium phosphate buffer pH 7.3, containing D20 (10% v/v) was added

to an NMR tube, and the sample was locked and shimmed in an NMR spectrometer. Hydrogen

peroxide (10 [pL of a 30% v/v solution) was added to the tube, the contents were mixed by thorough

shaking, and the tube was returned to the spectrometer and analyzed (16 scans, 5-s relaxation

delay) obtaining a spectrum approximately every 2 min. Spectra of the starting material and

product alone were obtained in the same buffer solution.

227

Figure 6.S5

A

B

228

Figure 6.S5. Optimized Structure of Each Hydrogen Peroxide Complex (A) and Product (B)

During the Oxidation of Boronic Acids. Left: PBA; middle: 2-HMPBA; right: 2-CPBA. The

structure of each transition state is shown in Figure 6.3.

229

Figure 6.S6

-0- PBA--- 2-HMPBA

+ 2-CPBA

115

0cc

C

20m0CDcIn

50-

40-

30-

20-

10-

0-0 5

pH10

230

. I .

Figure 6.S6. Graph of the Extent of Protodeboronation of Boronic Acids at Three pHs. Solutions

of PBA, 2-HMPBA, and 2-CPBA (50 mM) were prepared in 1.0 M sodium phosphate buffer

containing D 20 (5% v/v) at pH 2.2, 7.3, and 12.2. A 1.0-mL aliquot of each solution was placed

in sealed 1-dram vials. The vials were heated at 95 'C with shaking at 650 rpm in a VWR

temperature-controlled heating block/shaker. After 2 h, the vials were cooled on ice, and the

solutions were transferred to NMR tubes. Protodeboronation was quantified by "B-NMR

spectroscopy, comparing the area under the boric acid peak with the area under the sum of the

boric acid peak plus the PBA, 2-HMPBA, or 2-CPBA peak.

231

6.6 Atomic Coordinates of Optimized Structures

PBASM

2.61827900

1.23227000

0.44567400

1.11039300

2.49341900

3.25563200

3.20433000

0.73206100

0.51653000

2.98178700

4.33412300

-1.70824700

-1.80290300

-1.58202600

-2.44895400

-1.72543100

-3.15761200

-3.20705000

-1.17147900

1.12521200

1.19868800

0.05242800

-1.17977300

-1.27019400

-0.11300800

2.03264900

2.16218300

-2.07939900

-2.24040600

-0.17649900

-0.91796700

-0.50654400

1.46575300

1.62446400

-0.20430700

-0.28048700

-0.85707400

0.13131400

-0.19495500

-0.05840900

0.09478700

0.10838800

-0.03071300

-0.18366700

-0.30985700

-0.05960600

0.24074700

-0.01828700

-0.29045500

1.16823200

2.02803100

0.69708600

0.31474600

-1.09532100

-0.94207200

-0.16201400

0.24414200

232

2.49121700

1.09967700

0.38654400

1.07729300

2.46838500

3.17775000

3.03998500

0.57097000

0.52012800

3.00043200

4.26020500

-1.67877900

-1.99296900

-1.73479800

-1.69954100

-1.30653200

-3.04840300

-3.07996000

-1.33526700

1.20031600

1.20925000

0.01131700

-1.19829000

-1.21213800

-0.01176700

2.13549300

2.15698900

-2.12904900

-2.15603800

-0.02070800

-1.25390300

-1.10141400

1.10398800

1.90932900

0.05944500

0.10635700

-0.82699600

-0.01736500

0.10719200

0.04165700

0.00406800

0.08020200

0.14777700

0.15992400

0.12162200

0.00923200

0.06568400

0.19375500

0.21479500

-1.12350200

-2.01739700

-1.29646400

-0.77570800

0.87929500

1.51030400

1.73848500

-0.51049100

233

PBA Tq

-2.22122800

-0.90317800

-0.25258900

-0.90317400

-2.22122400

-2.88371500

-2.73216800

-0.36702000

-0.36701400

-2.73216200

-3.91111800

2.56740400

3.29641300

2.56736900

3.29637700

1.04154500

2.08335800

1.20337100

1.20945000

-0.00000300

-1.20945600

-1.20337500

-0.00000200

2.14381700

2.13258900

-2.13259600

-2.14382100

-0.00000100

-1.21056000

-1.18775000

1.21057300

1.18777300

-0.00000400

0.00000300

-0.19541500

0.24789000

0.46768800

0.24787500

-0.19543000

-0.41849200

-0.36487400

0.43154100

0.43151400

-0.36490100

-0.76124800

-0.37254100

-0.99412300

-0.37256200

-0.99414700

0.93510400

0.03430700

234

PBAProd

2-C3n A _NM

0.41434200

1.04169400

2.42042500

3.19465600

2.58908500

1.20401700

2.89200400

4.27135300

3.20184800

0.74758100

-1.17588200

0.01643200

-0.05817500

0.28932000

-1.21183800

-1.80319300

-1.93483900

-1.91029400

-3.23159400

-3.38642600

-0.37053600

0.86602700

0.98129800

-0.17857500

-1.42620900

-1.52100900

1.95151400

-0.11212000

-2.32115000

-2.49947000

-0.08785700

1.97900200

2.44215100

2.77229100

1.39468800

-0.63774800

-0.66346200

-0.05883500

-0.47335800

-0.67604800

0.11937500

-0.04542400

-0.19748200

-0.18016600

-0.01565400

0.12998700

-0.32313500

-0.29106000

-0.00157700

0.25482000

0.25203200

-0.03121800

-1.02562100

0.67474600

0.34513100

-0.98448600

1.37108700

2.11518100

-0.90353800

0.03442300

235

2-CPBATS

0.38149400

0.92588600

2.29987100

3.12209500

2.57446800

1.19349600

2.73241500

4.19592200

3.22592700

0.76283200

-1.30986200

-0.13613600

-0.43168200

0.21352400

-1.23675000

-1.33459300

-1.97462900

-2.28724000

-3.09004200

-2.94131500

-0.39892900

0.86752600

1.02811000

-0.09856700

-1.36686900

-1.52371800

2.01395100

0.01360700

-2.23189900

-2.51075800

-0.00773600

1.94260200

2.18900700

2.86088400

1.42299300

-0.64591800

-0.57191900

-1.44928100

-0.65071900

0.01673600

0.08455300

-0.12218500

-0.25411100

-0.18823700

0.01097400

0.13747600

-0.39344700

-0.28426100

0.06349000

0.27272800

0.52607600

-0.10908600

-1.13779200

0.36923400

0.61291800

-0.71374900

1.63449500

1.40052800

-1.31895600

-1.99482600

236

2CPBA_ DroA

-0.29330200

-0.52874300

-1.84036900

-2.89871000

-2.64392300

-1.34005700

-2.03018100

-3.91480500

-3.46267100

-1.11251500

2.04705400

0.63547600

0.71313500

0.49436300

1.87347000

0.98709900

3.29467300

3.29324300

-0.60220800

0.76917600

1.21960000

0.32098800

-1.04717300

-1.51335300

2.28734300

0.68413200

-1.75477000

-2.57162300

-0.22325500

1.71795800

2.09172100

2.57706000

1.11547100

-1.08468100

-0.74172200

-1.70007100

0.10280000

0.05950300

-0.05803200

-0.11572500

-0.06467300

0.03875000

-0.10224200

-0.20477300

-0.11273800

0.07101200

-0.03707000

0.18050500

1.20750400

-0.47749100

-0.16693600

0.19912500

-0.15154100

-0.09987100

237

2-HMPBASM

C 0.43032800

0.96250100

2.31914100

3.18193700

2.67433000

1.30709300

2.67766600

4.24907700

3.35861700

0.93063000

-1.16769100

-1.28424600

-1.84652700

-1.72503900

-2.67912400

-3.20738200

-3.07902700

-0.12504800

0.04805800

-0.65426600

0.62903900

0.88150800

-0.20812000

-1.50807200

-1.73456000

1.90365000

-0.05348000

-2.34932100

-2.75029000

-0.50145000

1.06743900

-0.91129700

-1.06354700

-1.08773800

-0.46044100

0.49909600

1.66108900

2.86105000

238

0.10103600

0.01405900

-0.14134900

-0.21309300

-0.12696100

0.02932100

-0.20194000

-0.33335900

-0.18207200

0.09748600

0.27329600

0.29394700

-0.96697900

1.46168100

1.35036800

-0.84432200

-0.81299000

0.12093600

0.05721500

2-HMPBATS

0.33428700

0.94073000

2.32028200

3.09347700

2.48660700

1.10088000

2.77370000

4.17135400

3.10292600

0.63262700

-1.35158200

-1.22477200

-1.36384500

-1.99240700

-2.27069500

-3.16610300

-2.97411700

-0.02563400

0.22442000

-0.63323100

0.60943300

0.73247900

-0.42430100

-1.67319400

-1.78881000

1.71175300

-0.35815700

-2.56413800

-2.76175300

-0.14543100

1.31811700

-0.90695700

-0.49048700

-1.40981800

-0.92140200

-0.49538900

1.74270500

2.90250100

0.08100900

-0.07942200

-0.19326300

-0.14757800

0.01298900

0.11644900

-0.29874900

-0.23101700

0.04985100

0.21252000

0.49782900

0.37094100

-0.64321500

1.68258000

1.65485100

-1.22776600

-2.06806000

-0.01058100

-0.24331100

239

2-HMBPAProd

-0.29330200

-0.52874300

-1.84036900

-2.89871000

-2.64392300

-1.34005700

-2.03018100

-3.91480500

-3.46267100

-1.11251500

2.04705400

0.63547600

0.71313500

0.49436300

1.87347000

0.98709900

3.29467300

3.29324300

-0.60220800

0.76917600

1.21960000

0.32098800

-1.04717300

-1.51335300

2.28734300

0.68413200

-1.75477000

-2.57162300

-0.22325500

1.71795800

2.09172100

2.57706000

1.11547100

-1.08468100

-0.74172200

-1.70007100

0.10280000

0.05950300

-0.05803200

-0.11572500

-0.06467300

0.03875000

-0.10224200

-0.20477300

-0.11273800

0.07101200

-0.03707000

0.18050500

1.20750400

-0.47749100

-0.16693600

0.19912500

-0.15154100

-0.09987100

240

Chapter 7

Strain Inactivation in a Circular RNase 1 Zymogen

Contribution:

I performed all experiments, designed the constructs, and wrote the initial manuscript. I

contributed to the preparation of all the materials and analysis of the data.

This chapter has been prepared for publication as:

Windsor, I. W.; Graff, C.; Raines, R. T., Strain inactivation of a circular RNase 1 zymogen. In

preparation.

242

A hstr2ct

Latent reservoirs thwart a reliable cure for HIV as viremia rebounds upon halting antiretroviral

therapy, making HIV infection a chronic illness. The strategy nature devised to enable cellular

production of toxic proteins - expression of inactivated, zymogen forms that are proteolytically

activated at the site of action - provides a framework to engineer prodrug variants that target HIV

infected cells. De novo engineering of zymogens has only achieved limited success. Here, we

report the design and characterization of HIV protease-activated zymogens of RNase 1. Following

our initial success of sterically occluding the active site by circularly permuting RNase A to

connect the native termini with a protease substrate, we created a circular variant of RNase 1.

Installing a linker of insufficient length to permit folding of both the N- and C-terminal secondary

structural elements imposes strain on the active site and inactivates RNase zymogens up to 28,000-

fold. Disulfide bonds limit distortion of the fold to the termini, allowing these zymogens to

maintain conformational stability above physiological temperature. Though a moderate reduction

in efficiency is observed, the strained zymogens are still readily cleaved by HIV- 1 protease, which

restores wild-type ribonucleolytic activity. Zymogens of RNase 1 hold the promise to engender

cytotoxicity specifically to HIV infected cells and could find utility as part of a combinatorial viral

eradication therapy.

243

7.1 Introduction

The treatment of HIV remains a fundamental challenge in medicine. Numerous antiviral

compounds that target the many aspects of the viral lifecycle have achieved clinical utility.370

Antiviral compounds can suppress a patient's viral load to below detectable limits, but treatment

must continue as viremia rebounds upon suspension of therapy. 37 1 These treatments have

transformed HIV from a death sentence to a chronic illness; however, a cure remains elusive.3 72 A

key challenge is latency of the virus. Early in HIV infection, a subset of CD4+ cells become

infected with the virus that quiesce to harbor the integrated HIV provirus without producing viral

RNA or proteins. 373 An emerging eradication approach is stimulation of virus production to induce

cytopathic effects; however, activation alone is proving insufficient to kill latently infected cells. 43

Recently, it has been proposed that strategies to target cells that produce viral proteins with

cytotoxic therapeutics are a missing piece of eradication approaches.3 74

Intracellular production of cytotoxic proteins requires precise regulation and often includes

safeguards to attenuate activity until arriving at the intended location. Proteases involved in blood

coagulation and digestion as well as the apoptotic capsases are expressed as zymogens: an inactive

precursor that requires subsequent proteolysis to obtain the mature, active form. The concept of a

zymogen provides a conceptual framework to engineer cytotoxic enzymes as pro-drugs activated

by viral proteins, specifically proteases. Altering caspase 3 for activation by HIV-1 protease has

demonstrated the potential of zymogens as antivirals. 375 Our lab engineered the first zymogen de

novo by circularly permuting ribonuclease (RNase) A to connect the native N- and C-termini with

a linker containing a protease recognition sequence. 376 Several iterations of this strategy produced

RNase A zymogens activated by plasmepsin II, NS3 protease, and HIV-1 protease.377-378 We

244

procs-d the linker blocks I the Activ s;+i -dA 1cudes sbra cte birng as A - t% 964-fold%_LA%' ' LJ L'.% J 1 %''1Xxo LaiJ'. LV..'LI v % 3 1 IL.A IL.. "~J~ LILLLI.L UL1JAA'II& "0 1J L%' __ J - .LL%&

inactivation was achieved. This circular permutation strategy was replicated by Rib6 and

coworkers to create zymogens of the amphibian RNase homolog, Onconase"m, though minimal

inactivation, up to 1.3-fold, was observed.379

Strain has also been employed in zymogen designs. Ozawa and coworkers created circular

variants of firefly luciferase using intein-mediated cis-splicing to connect the termini with a

caspase 3 substrate. 380 The ensuing zymogen is an apoptosis probe with a <10-fold change in signal

in cellulo. Loh and coworkers later pursued strain to inactivate the bacterial RNase, barnase;

however, these proteins were found to be folded and active only at low temperatures. 381 Here, we

extend our original zymogen engineering strategy of sterically occluding the active site with the

imposition of strain to create zymogens with unprecedented inactivation that is relieved by the

action of HIV- 1 protease.

245

7.2 Results

7.2.1 Design of a Circular Zymogen Construct

Circular RNase 1 zymogens were created by intein-mediated cis-splicing with the Nostoc

punctiforme (Npu) DnaE split intein.382 The Npu intein utilizes a "CFN" N-terminal splice

junction.383 We utilized the "CFQ" sequence found starting at residue 58 of human RNase 1. We

omitted the 4-residue C-terminal extension (125-128) of RNase 1 that is not important for

activity.384

7.2.2 Design and Assessment of Glycine (G) Series of Zymogens

Previous RNase A zymogens exhibited modest inactivation suggesting a 14-residue linker would

not effectively block the active site of RNase 1. We designed a series of zymogens to test this

hypothesis by bringing the linker closer to the active site with progressively shorter linkers.

Additionally, we employed a phage display substrate that is among the most efficiently cleaved by

HIV-1 protease (SGIFLETS).207 We flanked this sequence with varying numbers of glycine

residues to create the G series of zymogens (Table 7.1). A plasmid encoding the intein fragments

flanking a circularly permuted RNase 1 connected by the substrate linker directed the expression

of the two soluble proteins and another insoluble one corresponding to the intein fragments and

the circular RNase 1 (Figure 7.S1), indicative of intein-mediated cyclization. 382 These zymogens

all exhibited roughly two orders of magnitude less activity than wild-type RNase 1 and no trend

in inactivation was observed despite a progressive reduction in thermal stability (Figure 7.1).

246

72 . Desicn and A ssisscmnt nf Stmraned (tvr) s c-f 7 mgns-tr-c

We next sought to remove residues from the termini of RNase 1 to bring the linker closer to active

site with the intent of imposing strain. Crystallization optimization of RNase 1 revealed the first 7

N-terminal residues can be removed with only an order of magnitude loss in activity.385 Lys7 is

important for binding the phosphoryl group of the RNA backbone in the P2 subsite.1 94 386 Folding

studies of RNase A demonstrated that residues beyond 122 can be removed without diminishing

activity or stability. 387 We therefore constrained truncation to residues 7 through 122. The terminal

residues of our protease substrate are both serine. Removal of the three N-terminal (AKES) and

two C-terminal (ASV) residues and connecting them with the protease substrate would replace the

truncated serine residues. This construct was selected as a starting point for the strained (Str) series

of zymogens and further truncated by progressively removing residues 4, 5, and 6 (Table 7.1). The

Stri zymogen showed a similar two orders of magnitude reduction in activity similar to that of the

glycine series. Further truncations substantially reduced the activity of the zymogens with a

disproportionately smaller reduction in stability (Figure 7.1).

7.2.4 Assessing Zymogen Activation by HIV-l Protease

The zymogens exhibiting substantial inactivation, i.e., kcat/KM < 104 M-ls-1, were subjected to

activation studies with HIV-1 protease. Proteolysis by the pathogenic protease must be limited to

the inactivating linker of the zymogen. We determined that wild-type RNase 1 is not cleaved by

HIV-1 protease (Figure 7.S2). Next, we digested zymogens with HIV-1 protease and measured the

final activity. Proteolysis restored near wild-type activity to the Str2-4 zymogens (Table 7.2).

247

Finally, we developed a continuous assay to measure the kcat/KM value for HIV-1 protease cleavage

of the zymogens (Table 7.2 and 7.S2, Figure 7.S3).

7.2.5 Modelling the Structural Basis of Str2 Zymogen Inactivation

The zymogen Str2 possessed the best combination of properties: catalytic inactivation, thermal

stability, and proteolytic activation. Accordingly, we modelled the Str2 zymogen with Rosetta to

reveal the structural origins of catalytic inactivation in our zymogen design. We examined the top

ten scoring models of four-hundred and found the N-terminal a-helix was nearly intact while the

C-terminal P-strand was distorted in each case (Figure 7.2A and 7.2B, Figure 7.S4A).

7.2.6 Modelling the Structural Basis of Proteolytic Activation of Str2 by HIV- 1 Protease

We again used modelling with Rosetta to reveal the structural basis of recognition of the

inactivating linker by HIV-1 protease. We examined the top ten scoring models of two-hundred

and found the N-terminal a-helix and C-terminal P-strand must be completely unfolded to

accommodate HIV- 1 protease with its flaps closed upon a substrate installed in the linker (Figure

7.2A and 7.2C, Figure 7.S4B).

248

'7. Discussion

The mechanism of our zymogen inactivation relies on two key strategies: steric occlusion and

strain. Roughly two orders of magnitude in activity are lost by installing the linker across the active

site, as observed with previous circularly permuted RNase A zymogens. We suspect this is the

limit of inactivation imposed by blocking the active site with a flexible linker. Further inactivation

was achieved through local installation of strain, a violation of design criteria in the previous

circularly permuted zymogens enabled here by intein-mediated cyclization.37 6 The key catalytic

residues are located in secondary structural elements at the termini that are not stabilized by

disulfide bonds, unlike the core structure (Figure 7.2). Through installation of strain by progressive

truncation, we were able to perturb active site residues specifically to achieve unprecedented

inactivation of a de novo engineered zymogen without compromising global stability. Our

structural modelling revealed the how steric occlusion and strain are operational in our zymogen

design. The top 10 scoring structures are predicted to be similar in energy, yet conformationally

divergent in the linker. This suggests the zymogen linker is conformationally dynamic but also

unambiguously demonstrates that both termini cannot simultaneously adopt the wild-type fold.

Despite the increased proximity of our linker to the active site compared to previous

zymogens, our designs maintained efficient activation. We observed two-orders of magnitude

reduction of the second order rate constant of HIV- 1 protease toward the zymogens with respect

to a fluorogenic peptide substrate, which is cleaved with a kcat/KM value of 5.0 x 105 M1 s-1.223 Our

chosen substrate is among the most efficiently hydrolyzed by HIV-1 protease and the activity is

reduced to rate constants similar to endogenous substrates when incorporated into zymogens. 2 86

As indicated by our modelling, the loss in catalytic efficiency results from the energetic cost of

249

unfolding the N- and C-terminal structural elements of the zymogen to accommodate the flaps of

the protease. Not all proteases employ flaps and zymogens may be more efficiently activated by

other pathogenic proteases like plasmepsins which have an open active site cleft.

RNases are also a privileged class of enzymes for the creation of zymogens due to their

therapeutically desirable properties: cell permeability and cytotoxic activity. 152 We anticipate

translation of these results will depend on successful evasion of the cytosolic ribonuclease inhibitor

protein and the inactivating linker being sufficiently orthogonal to avoid cleavage by endogenous

proteases while lacking immunogenicity.1 83

250

7A.4 M e-1is and Methods

7.4.1 Protein Expression and Purification

The RNase 1 gene was obtained from a previously reported molecular clone.388 Intein fragments

were obtained from previously described vectors provided by Tom Muir. 3 89 The circular RNase 1

zymogen construct was initially prepared using the linker sequence from a circularly permuted

RNase A zymogen containing the p2/NC cleavage site. PCR was used to prepare five DNA

fragments with terminal homology: the pET32b plasmid, NpuC, a C-terminal RNase fragment, an

N-terminal RNase fragment, and the NpuN fragment. Gibson assembly was used to combine

fragments into the final expression constructs. Modification of the original plasmid was done by

producing RNase fragments with PCR by including the new linkers and truncations and combining

it with a plasmid fragment that contained both intein fragments.

RNase were expressed and purified as previously. 388 All chromatography and assay buffers

were treated with DEPC prior to use, with the exception of Tris which was added from RNase free

stocks (Invitrogen). Zymogens were folded in the presence of 0.5 M arginine-HCl and were

purified with additional chromatography on a MonoS column (GE Healthcare) to ensure purity

and removal of contaminating RNases. HIV-1 protease was expressed and purified as described

previously. 2 23 Protein purity was confirmed by SDS-PAGE and concentrations were determined

using the PierceM BCA assay kit (ThermoFischer Scientific). HIV-1 protease was also treated

with 10 mM DTT at 4' C for 4 h to inactivate contaminating RNases. The ensuing prep was

desalted with a HiTrapo column (GE Healthcare) and did not possess detectable RNase activity.

Analysis of protein purity and other gel-based assays were performed with Any kDTM Mini-

PROTEAN® TGX TM precast protein gels (Bio-Rad).

251

7.4.2 Enzyme Activity Assays

The kinetics of RNA hydrolysis was monitored by the increase in fluorescence intensity of a

doubly labeled fluorogenic substrate, 6-FAM-dArUdAdA-6-TAMRA, upon exposure to

RNases.3 90 Initial (1o) and final (If) intensities along with linear slopes (m = AI/At) or the second

derivative of quadratic fits (2a = A2I/At2) were measured and used to calculate values kcat/KM.

Assays required 10 pM to 1 pM to achieve velocities that reached 10% RNA turnover within

several minutes. Activated zymogens were prepared for kinetic testing by digesting 50 !LM

zymogen with 52 nM HIV-1 protease at 370 C. Continuous activation assays were performed with

2.6 nM HIV-1 protease at room temperature. Fluorescence intensity was measured with a M1000

microplate reader (Tecan) by monitoring emission at 515 nm with excitation at 493 nm. Assays

were performed in quadruplicate in a flat, black 96 well plate (Corning). Assay buffers were DEPC

treated and consisted of 50 mM of either Tris pH 7.4 or Acetate pH 5.0 with 100 mM NaCl.

Kinetics of RNase activity of were determined with equation 1 by assaying initial velocities

under second-order conditions.

kcat __ At (7.1)KM [RNase] (If-Io)

Kinetics of zymogen cleavage by HIV-protease was determined with equation 7.2 by

assaying the increase in RNase activity observed upon the addition of protease.

A2

Jkcat k __ _tz_(7.2)KM HIV Protease K kcat [Zymogen][HIV Protease](If -I,,)

KM Zymogen, Activated

252

V\alues rof ka/K' At H- 7.4 thJ I rpItimna Nf Ras 1 are reportetd n the main tpvt 391T " -7 - 'VLdL' A LIVI Lk J A I ._, LI'W, yJ. IF JL LI I IL4 %J L -L A. I , L4,I, "X% X' If

These values were also determined at pH 5, the optima of HIV- 1 protease, and used as parameters

in fitting equation 7.2 (Table 7.S2). Values of kcat/KM are reported as the mean of quadruplicates

and the standard deviation.

7.4.3 Thermal Stability Assays

Thermal stabilities of RNases were determined by differential scanning fluorometry. G series

zymogen thermal stabilities were measured with a Bio-Rad CFX connect RT-PCR machine.

Samples of 5 [tM protein with 1% v/v sypro orange (Sigma-Aldrich) in 25 uL in PBS were heated

from 25-95 'C at 1 'C/min. Single fluorescent measurements per degree were reported. Change

in fluorescence per degree was calculated by AFt = Ft+i - Ft-1 with Excel software (Microsoft).

The maximal change in fluorescence and the 5 flanking measurements above and below were fit

with the Gaussian function built into Prism 6 (Graphpad). Values of Tm are reported as the mean

and standard deviation of triplicate measurements.

Str zymogens melts were conducted with 0.6% (v/v) Sypro Orange and 30 ptg zymogen in

a 20 pL final volume of PBS. Melts were observed with a ViiA 7 RT PCR machine (Applied

Biosystems) by increasing the temperature from 20 'C to 96 'C at 1 'C/min in steps of 1 'C. The

Tm was determined with the Protein Thermal Shift software (Applied Biosystems) using the

Boltzman model and reported as the mean and standard error.

253

7.4.4 Modelling with Rosetta

Modeling of the Str2 zymogen and its complex with HIV-1 protease were conducted with the

Rosetta.392 RNase 1 from PDB entry lz7x was extracted from chain A. 182 RNase 1 was circularly

permuted at residue 89 to include the linker between the termini. The the 15 N-terminal and 12 C-

terminal residues were removed to be computationally predicted. KIC loop modelling was

performed using fragments picked by the Robetta Server to prepare 400 models of the zymogen

linker and proximal residues of the native termini.

Modelling of the complex was conducted in two steps. RosettaDock was employed to dock

HIV- 1 protease in the closed conformation bound to the "SGIFLETS" from chain A of PDB entry

6bra into the actives site cleft of the N- and C-terminally truncated structure of RNase 1.264, 395

Docking models were manually inspected to identify orientations with proximal termini of RNase

1 and the substrate in the active site of HIV- 1 protease. Five docking models compatible with a

zymogen-protease complex were subjected to further loop modelled. The docking model that

generated the lowest energy complex model by loop modelling was used to prepare an additional

200 models. The top 10 lowest energy models (Table 7.S3) are shown in Figure 7.S4.

254

Table 7.1

Table 7.1. G and Str Zymogen Designs

Zymogen Linker N-truncation C-truncation

3G GGGSGIFLETSGGG None EDST2G GGSGIFLETSGG None EDSTIG GSGIFLETSG None EDSTOG SGIFLETS None EDSTStrI SGIFLETS KES SVEDSTStr2 SGIFLETS KESR SVEDSTStr3 SGIFLETS KESRA SVEDSTStr4 SGIFLETS KESRAK SVEDST

255

Properties of Str

kcat/Km

(activated)

6.7 0.08 x 107

3.2 t 0.02 x 106

1.4 0.02 x 107

Series Zymogens

Relative

Activity

11,000

2,800

24,000

kcat/KM

(HIV- 1 Protease)

3.9 0.3 x 10'

1.3 0.1 x 103

2.5 0.1 X 103

256

Table 7.2

Table 7.2.

Zymogen

Str2

Str3

Str4

Table 7. Si

Table 7.S1. Activity and Stability of RNase

RNase/Zymogen kcat/KM (M-'s-')

RNase Ia 2.1 0.2 x 107

3G 2.8 0.6 x 10'

2G 4.6 0.2 x 104

1G 3.2 0.2 x 10'

OG 2.1 0.2 x 10'

Str1 8.5 0.2 x 104

Str2 5.8 0.05 x 103

Str3 1.1 10.01 x 103

Str4 5.7 0.03 x 102

N.D. = Not determined.aValues are from ref. 183.

1 and Zymogens

Tm (OC)

57 2

50.4 0.4

48.9 0.3

48.1 0.1

46.9 0.2

N.D.

47.5 0.1

42.6 0.1

42.0 0.1

257

Table 7.S2

Table 7.S2. Str Zymogen Kinetic Parameters at pH 5.0

Zymogen kcat/Km (M-'s-') kcat/Km (M-'s- )Zymogen Inactive Activated

Str2 2.9 0.06 x 10 5.2 0.2 x 105

Str3 7.3 0.08 x 102 7.0 0.3 x 105

Str4 9.2 0.3 x 102 2.4 0.4 x 106

258

TabIc 7.S3

Table 7.S3. Total Scores of Models

Str2Model

(REU)

1 -197.697

2 -196.236

3 -193.333

4 -192.159

5 -191.876

6 -191.655

7 -191.556

8 -191.252

9 -191.203

10 -190.923

REU Rosetta Energy Units

Calculated by Rosetta

Complex

(REU)

-522.837

-518.158

-511.494

-466.197

-457.239

-456.408

-454.677

-453.33

-451.303

-451.246

259

108,

107

106

kt/K 105'(M- 1s-1)

104

103

in2

Zymogen

Figure 7.1

-60

-55

T-50 Tm

(*C)

-45

40

260

Figure 7.1. Activity and Stability of Circular Zymogens of RNase 1. Plotting catalytic efficiency

(kcat/KM) and thermal stability (Tm) of wild-type RNase 1 and zymogen reveals an exponential loss

in activity is associated with a linear loss of stability. Wild-type values are taken from Reference

183. Errors bars for report standard deviation, with the exception of Str zymogen stabilities which

report standard error.

261

C

:4

C

262

Figure 7.2

A B

Figure 7.2. Modelling of Str2 Zymogen and Complex with HIV-1 Protease. The active site

residues H12, K41, and H118 and disulfide-bonded cysteine sidechains are shown as sticks.

Connecting the termini of A. wild-type RNase 1 with a flexible linker and truncating 4 residues

from the N-terminus and 6 residues from the C-terminus prevents both terminal secondary

structure elements from folding. B. The lowest energy structures generated by Rosetta are

predicted to have an unfolded C-terminal P-strand. C. Unfolding of both terminal secondary

structural elements are required for HIV- 1 protease to recognize the substrate containing linker as

modelled by Rosetta.

263

SpectraTMBR

LadderInsoluble

TotalSoluble P dtl

Pre-induction

25 kDa

15 kDa

10 kDa

264

Figure 7.S1

Figure 7.S1. Expression of 3G Zymogen. Induction of expression of the single open-reading-

frame zymogen construct results in a single band in the insoluble fraction (the circular zymogen)

and two bands in the soluble fraction (intein fragments), neither of which are observed pre-

induction, as analyzed by SDS-PAGE.

265

Figure 7.S2

SpectraTM + + RNase 1BR + - HIV-1 Protease

Ladder

25 kDa

15 kDa

10 kDa

266

Figure 7.S2. Proteolytic Treatment of RNase 1 with HIV-1 Protease. After 3 hours of treatment at

370 C, no detectable cleavage of wild-type RNase 1 was observed by SDS-PAGE.

267

Figure 7.S3

-- Str2 zymogenStr2 zymogen +HIV-1 protease

4000

5000-

4000-

3000-

2000-

1000-

U-

00 1000 2000

time (s)

.. 3..3000

268

'

Figure 7.S3. Activation Kinetics of Str2. Hydrolysis of 20 nM of fluorogenic RNase substrate at

pH 5.0 by 10 nM Str2 zymogen lead to a linear increase in product formation (as reported by RFU)

at substrate turnover below 10%. Addition of 2.6 nM HIV-1 protease leads to a time dependent

increase in activity, which is fit by a second-order polynomial.

269

Figure 7.S4

A B

270

Figure 7.S4. Top 10 Scoring Rosetta Models of Str2 and Complex with HIV- 1 Protease. A. The

linker inactivated the zymogen is predicted to occupy multiple conformations which indicates this

region may be conformationally dynamic. B. HIV-1 protease requires partial unfolding of RNase

1 in order to close its flaps on the substrate containing linker.

271

Appendix A

Stilbene Boronic Acids Form a Covalent Bond with

Human Transthyretin and Inhibit its Aggregation

Contribution:

I prepared proteins, established biochemical assays, and performed crystallization and diffraction

analysis. I contributed to ligand design, preparation of final crystallographic models, preparation

of the initial manuscript, and revision of the final manuscript.

This appendix has been published in part, under the same title. Reference:

Smith, T.P.*; Windsor, I.W.*; Forest, K.T.; Raines, R.T., Stilbene boronic acids form a covalent bond

with human transthyretin and inhibit its aggregation. J. Med Chem. 2017, 60, 7820-7834.

*Denotes equal contributions

272

A

Transthyretin (TTR) is a homotetrameric protein. Its dissociation into monomers leads to the

formation of fibrils that underlie human amyloidogenic diseases. The binding of small molecules

to the thyroxin-binding sites in TTR stabilizes the homotetramer and attenuates TTR amyloidosis.

Herein, we report on boronic acid-substituted stilbenes that limit TTR amyloidosis in vitro. Assays

of affinity for TTR and inhibition of its tendency to form fibrils were coupled with X-ray

crystallographic analysis of nine TTR ligand complexes. The ensuing structure-function data led

to a symmetrical diboronic acid that forms a boronic ester reversibly with serine 117. This

diboronic acid inhibits fibril formation by both wild-type TTR and a common disease-related

variant, V30M TTR, as effectively as does tafamidis, a small-molecule drug used to treat TTR-

related amyloidosis in the clinic. These findings establish a new modality for covalent inhibition

of fibril formation and illuminate a path for future optimization.

273

A.1 Introduction

Amyloidosis is a disease caused by the aggregation of a normally soluble protein.396-397

Endogenous proteins can be causal for these diseases, which include Alzheimer's, Huntington's,

and Parkinson's.398 One such protein, transthyretin (TTR),399 is a homotetrameric protein

comprised of four identical monomer units, each consisting of 127 amino-acid residues that fold

into a P-sandwich (Figure A. 1).400~401 The dissociation of the TTR tetramer and aggregation of the

ensuing monomers underlies familial amyloid polyneuropathy, familiar cardiomyopathy, and

senile systemic amyloidosis.3 98, 4 02

TTR is present in both blood (0.3 g/L = 4 pM) and cerebrospinal fluid (0.1-0.4 'M). 399,403

A primary role of TTR in vivo is to transport thyroxin (T4) and retinol, a hydrophobic hormone

and fat-soluble vitamin Al, respectively. Due to the abundance of other lipid-binding proteins (e.g.,

thyroid-binding globulin and albumin), most of the T4-binding sites of TTR are empty in blood. In

cerebrospinal fluid, TTR also binds to $-amyloid, attenuating the neurotoxicity that underlies

Alzheimer's disease.4 03-4 0 5

The binding of a ligand can stabilize the folded state of a protein. 406-409 Evidence for the

coupling of binding and stability appeared as early as 1890, when O'Sullivan and Thompson

demonstrated that cane sugar increases markedly the thermostability of invertase, which is an

enzyme that catalyzes the hydrolysis of sucrose. 4 10 Since then, ligands have been used to enhance

the conformational stability of countless proteins, including TTR. Many small molecules have

been synthesized and tested as putative TTR ligands, and several have demonstrated efficacy in

attenuating amyloidosis. 4 114 12 Most efforts have focused on ligands that bind to the two identical

T4-binding sites at a dimer-dimer interface (Figure A. 1), as such ligands discourage dissociation

274

to the monomeric state. 413 A few of these compounds have become viable treatment options,

including diflunisal, which is an FDA-approved non-steroidal anti-inflammatory drug that has had

limited adoption due to long-term gastrointestinal side-effects associated with cyclooxygenase

inhibition, 414-4 and tafamidis, which is used in the clinic to treat TTR-related amyloidosis.416418

An attractive approach to increase the potency and pharmacokinetics of a ligand is to evoke the

formation of a covalent bond. 4 19-42' This strategy is well suited for TTR amyloidosis, not only

because with an optimized dosage there might be no appreciable competition in serum with the

natural ligand, T4, but also because sustained stabilization of the TTR tetramer deters the

accumulation of monomers that leads to a cascade of aggregation. 422 Previous work by Kelly and

coworkers has shown that small molecules can modify TTR chemoselectively by targeting the c

amino group of Lys 15/Lys 15' at the entry to the T4-binding binding site. This work employed

irreversible reactions, including conjugate addition with activated esters and thioesters 4 23-4 24 and

vinyl sulfonamides, 42 5 and sulfation with aryl fluorosulfates. 42 64 27 Such ligands can, however,

react irreversibly with other plasma proteins,42 5 leading to potential immunogenic responses to the

protein-ligand adduct419 and the generation of potential toxic byproducts. 424

To accrue the benefits of covalent binding without the liabilities, we sought ligands for

TTR that bind in a covalent but reversible manner. Boronic acids interact with Lewis bases in

aqueous media.1 28,428 Boronic acids (including the FDA-approved drug Bortezomib 429 ) are well

known as serine/threonine protease inhibitors, 12 6, 139, 430-431 anti-microbial and anti-cancer

agents, 3 3 8 , 341 and delivery vehicles. 3 68 , 432-434 Boronic acid-based fluorogenic probes have been

developed for sensing both saccharides 435 and reactive oxygen species, 348-349 as well as for

molecular recognition 4 3 6 and protein conjugation. 4 374 38 These applications arise from the ability

275

of boronic acids to form a covalent bond with a Lewis base that is reversible under physiological

conditions.309,439 Notably, boronic acids are benign, as their metabolic byproduct, boric acid, is

present in a normal diet.440

Here, we report on the development of boronic acid-based ligands for the T4-binding site

of the TTR tetramer. An iterative strategy involving chemical synthesis and structure-function

analysis led us to covalent inhibitors of TTR aggregation. This strategy serves as a model for a

new class of amyloidosis inhibitors.

276

A.2 Results

We chose stilbene as a scaffold for the design of an initial series of boronic acid-containing TTR

ligands (Chart A. 1). This scaffold is present in the natural product resveratrol (1), and has been

employed in other TTR ligands. 400 , 412, 424, 441443 Moreover, stilbenes are readily accessible by a

convergent synthetic route, as the two rings can be functionalized separately and then joined with

a Wittig reaction (Schemes A. 1 and A.2).

Resveratrol occupies the T4-binding site with moderate affinity (Table A. 1 and Figure

A. 1).400 We began by replacing the phenolic hydroxyl group, which is known to form a hydrogen

bond with Ser 17/117', with a boronic acid group to generate stilbene 2 (Chart A. 1). Halogen

substitution is known to provide additional van der Waals interactions within the inner pocket of

the T4-binding site, enhancing the affinity of TTR ligands.444 Accordingly, we added a chloro

group meta to the boronic acid moiety to generate stilbenes 3 and 4.

We performed competitive binding assays to compare affinities among the diphenol series

of ligands (Table A.1 and Figure A.SlA). We observed no change in values of Kd,2 between

stilbene 1 and 2. Interestingly, chlorinated stilbene 3 showed a higher value of Kd,2 relative to its

non-halogenated counterpart, stilbene 1. This decrease in affinity contrasts with stilbene 4, which

exhibited increased affinity as a result of chlorination and showed the strongest affinity for TTR

of stilbenes 1-4, having a Kd,2 value of 441 nM.

Next, we assessed the ability of these molecules to inhibit fibril formation by both wild-

type TTR and the common V30M variant, which is associated with familial amyloid

polyneuropathy (Table A. 1 and Figure A.S2A). We found that all of the stilbenes in this series

inhibited aggregation at 7.2 pM and at a 2:1 ligand/protein ratio. Stilbenes 1 and 2 showed 25%

277

and 23% fibril formation for V30M TTR, whereas stilbenes 3 and 4 showed 14% and 11%,

respectively. Herein, we consider compounds that limit aggregation to <1 0% over 96 h as potent

inhibitors, and stilbenes 1-4 did not achieve this threshold.

Next, we solved co-crystal structures of TTR and resveratrol analogs 2-4 to discern if the

boronated stilbenes formed a covalent bond with TTR. To our surprise, each phenylboronic acid

moiety was observed in the "reverse" binding mode (Figure A.2A and 2B) relative to its parent

phenol, resveratrol (Figure A. 1 and 3A). In other words, the boronic acid group resided in the outer

pocket of the T4-binding site No indication of boronic ester formation with amino acid residues

was apparent (Figure A.S4 and S6).

In an attempt to reconfigure this unexpected orientation, we designed a second series of

stilbenes in which a carboxylic acid group was installed at the meta position of the ring not

modified in the first series (Chart A.2). Previous work had shown that incorporating an anionic

substituent into ligands could introduce advantageous electrostatic interactions with Lys15/15' of

TTR.417 445 We hypothesized that this interaction would orient the boronic acid to the inner

pocket of the T4-binding site, and perhaps promote boronate-ester formation.

The first pair, stilbenes 5 and 6, exhibited Kd,2 values in the low micromolar range, 3- and

2-fold higher than those of stilbenes 1 and 2, respectively (Table A.2 and Figure A.S1B). The

installation of a chloro group in stilbenes 7 and 8 restored values of Kd,2 to the high nanomolar

range. Consistent with a decreased affinity apparent in the value of Kd,2, the non-halogenated

stilbenes 5 and 6 also showed a diminished ability to prevent fibril formation under acidic fibril-

forming conditions (Table A.2 and Figure A.S2B). Incubation of wild-type TTR with phenol 5

showed 77% fibril formation and 112% at 7.2 [tM for the V30M variant, respectively. Boronic

278

acid 6 nerformed better as an inhibitor than did phenol 5 for both wild-type TTR and the V3OM

variant (11% and 28%, respectively). The chlorinated pair 7 and 8 showed potent fibril inhibition

(<10%) for both TTRs. These differences were, however, within experimental error. Again, co-

crystallography revealed that boronic acids 6 and 8 were bound in the reverse mode, relative to

their paired phenols. In other words, the carboxylic acid group was in the inner pocket, near

Ser 17/117', and the boronic acid group was in the outer pocket, near Lys15/15' (Figures A.2C

and A.2D, and Table A.S 11). As with boronic acids 2 and 4, boronic acids 6 and 8 did not exhibit

covalent interactions with binding-pocket residues. The structure of 5 in complex with TTR may

include alternative conformations for the phenolic group occupying the inner pocket but we were

not confident placing those conformations (Figures A.S7A and A.S7B).

The final series of stilbenes contained a boronic acid moiety on each ring (Chart A.3). We

synthesized C2 symmetrical diboronic acid 10, which has two meta-chloro and para-boronic acid

groups (relative to the stilbene olefin), as well as the analogous C2 symmetrical diphenol 9.

Additionally, we synthesized stilbene 11, which has one boronic acid group meta to the linker, to

investigate whether this position of the boronic acid enhances interactions with Lys 15/15' in the

outer pocket. We also included tafamidis (12) in this series as a benchmark for our assays.

In the competitive binding assay, diphenol 9 had a Kd,2 value of 819 nM (Table A.3 and

Figure A.SlC). The analogous diboronic acid, 10, had a Kd,2 value of 469 nM, which was the

largest decrease in Kd,2 value that we observed between a boronic acid and its paired phenol. These

two stilbenes comprise the only pair that can be compared directly, as their C2 symmetry precludes

alternative binding orientations. The difference in affinity for TTR was amplified in the fibril

formation assay, where diboronic acid 10 exhibited more potent inhibition of fibril formation than

279

did diphenol 9, both for wild-type TTR (3% versus 12% at 7.2 pM) and for the V30M variant (8%

versus 27% at 7.2 ptM) (Table A. 1 and Figure A.S2C). The binding of diboronic acid 10 with TTR

was reversible, as assessed with mass spectrometry (Figure A.S3). Asymmetric diboronic acid 11

had a value of Kd,2 similar to that of diphenol 9 (864 nM) as well as a similar ability to inhibit fibril

formation, suggesting that the location of the boronic acid plays a role in optimizing interactions

within the binding pocket. Gratifyingly, co-crystallographic data of TTR with either diboronic acid

10 or diboronic acid 11 showed the formation of a boronic ester with Ser 17/117' (Figures A.2E

and A.2F).

280

A.3 Diselsinn

We sought a new class of small-molecule ligands for TTR, which is a validated target for

pharmacological intervention. 4 02 , 416-418 Towards that goal, we investigated the effects of

incorporating a boronic acid substituent on the well-known stilbene scaffold, embodied in

resveratrol (1). The results enabled us to reach two conclusions. First, a boronic acid group can

enhance the potency of a TTR ligand. Second, a boronate ester can form with a weakly nucleophilic

amino-acid residue.

The installation of a boronic acid group tends to increase the affinity of a stilbene for TTR.

The value of Kd,2 for each boronic acid ligand for TTR is lower than (or equivalent to) that of its

analogous phenol (Tables A. 1-3). Moreover, a boronic acid group enhances the ability of a stilbene

to deter the formation of TTR fibrils (Tables A. 1-3). Nonetheless, our structural studies revealed

that in each complex between TTR and a stilbene containing a single boronic acid group, that

group was in the outer pocket of the T4-binding site, exposed to solvent (Figures A.2A-A.2D,

A.S4, A.S6, A.S8, and A.S10).

The modest increase in affinity incurred by adding a single boronic acid group could be

due to a weak noncovalent interaction between the 6-amino group of Lys15/15' and the vacant

p-orbital of the boron. Kelly and coworkers demonstrated that this amino group, which likely has

a low pKa, can act as a nucleophile. 423 In our complexes, the relevant B N distances are 3.3-

3.5 A for stilbenes 2, 4, 6, 8, and 10 (Table A.S1 1). We did not, however, observe electron-

densities or atomic geometries consistent with the formation of a dative N-*B bond between these

two functionalities, nor were any additional hydrogen bonds apparent between TTR residues and

the boronic acid group (Figures A.2A-A.2D, A.S4, A.S6, A.S8, and A.S 10). Additional hydrogen

281

bonds did, however, arise elsewhere in the complexes with a single boronic acid. For example, the

two meta hydroxyl groups in stilbenes 2 and 4 (but not 1 and 3) interact closely (2.0-2.3 A) with

Ser 17/117' (Table A.S1 1).

Stilbenes and similar compounds bind to TTR in one of two modes.4 14 , 422, 427, 446-449

Although a consensus explanation is not apparent, the polarity of pendant functional groups can

play a role in ligand orientation.443 450 In our stilbenes, however, the relevant logP values of phenol

(1.46), benzoic acid (1.87), and phenylboronic acid (1.59) are similar,4 51 suggesting that

hydrophilicity alone contributes little to their orientation in the T4-binding site. The installation of

a chloro group on one ring, as in stilbenes 4 and 8, led to increased affinity and enhanced efficacy

(Tables A. 1 and A.2), but did not affect binding orientation (Figures A.2B and A.2D).

To negate binding orientation as a factor, we designed a class of molecules containing a

boronic acid substituent on each stilbene ring. If such a ligand were to bind to TTR, then a boronic

acid group would necessarily be in the inner pocket of the T4-binding site. We were gratified that

this strategy was successful, as the boronic acid group of both stilbenes 10 and 11 that bound in

the inner pocket formed an ester with Ser 17/117'. Boronic esters have demonstrable utility as

mimics of the high-energy tetrahedral intermediate in reactions catalyzed by serine/threonine

proteases.126, 139,430-431 In those covalent complexes, an active-site serine or threonine residue forms

a tetrahedral, sp3-hybridized boronate ester with a boronic acid group.45 2

Despite the hydration of boronic acid groups in aqueous solution 428 and in marked contrast

to other known boronate esters with proteins, we observe planar, sp2-hybridized boronate esters

with Ser 17/117' of TTR (Figures A.2E and A.2F). We are aware of only one other structure in

which a planar boronate ester is formed with a hydroxyl residue of a protein (PDB entry 1p06).453

282

That other structure is, however, distinct because alone pair of electrons from a proximal histidine

residue appears to participate in a dative bond with the vacant p-orbital of the boron. The presence

of a planar ester could indicate that formation of a tetrahedral adduct is hindered sterically, unlike

in the active site of a serine/threonine protease that has evolved to bind tightly to a tetrahedral

intermediate. Hence, our results could demarcate the lower limit of affinity enhancement that can

be realized from boronic acid-based inhibitors.

Chloro groups have a variable contribution to the affinity of stilbenes for TTR. A chloro

group can fill unoccupied cavities, which would otherwise compromise affinity.4 544 55 Second, the

position of the chloro group in both of the ester-forming boronates enables the formation of a

halogen bond.4 56 The relevant ... Cl distance is close to the sum of the van der Waals radii (ro +

rci = 3.27 A; Figure A.4).444 The geometries observed in the TTR complexes with stilbenes 10 and

11 (Table A.S12) suggest a halogen-bond energy of 0.7-0.9 kcal/mol.457 Still, the consequences

of installing a chloro group on the stilbene scaffold are unlike those of installing a boronic acid

group, which consistently increases affinity for TTR (vide supra). For example, the addition of a

chloro group to stilbene 1 to form stilbene 3 decreases affinity, whereas the addition of a chloro

group to stilbene 5 to form stilbene 7, increases affinity. Notably, the phenolic ring of stilbene 5

was found to occupy two conformations in the inner pocket of the T4-binding site, which could

explain, in part, the lower affinity of stilbene 5 relative to stilbene 7 (Figures A.3B, A.S7, and

A.S9).

The judicious use of halogen substituents in boronated stilbenes merits further

investigation. For example, a chloro group positioned meta to the boron atom has been found to

decrease the pKa of phenyl boronic acid from 8.8 to 8.2.362 Such a more acidic boronic acid can

283

form more stable boronate esters.458 An analogous difference in Lewis acidity could be responsible

for some of the differences observed in the affinity of boronic acids for TTR.

Likewise, the esterification of TTR by boronic acid ligands warrants additional

optimization. In particular, a mono-boronate analog of compound 10 that binds with its lone

boronic acid group in the inner pocket could be used to reveal the precise contribution of a boronate

ester to the thermodynamics and kinetics 459 of binding. For proper orientation, such a ligand would

likely require installation of a highly polar functional group that demands the solvation attainable

in the outer pocket.

284

A.4 Materials and Methods

A.4.1 Materials

Resveratrol (1), 8-anilino- 1 -naphthalenesulfonic acid (ANS), and other reagents for biochemical

assays were from Sigma-Aldrich (St Louis, MO). Tafamidis (2-(3,5-dichlorophenyl)-6-

benzoxazolecarboxylic acid) (12) was from Carbosynth Limited (Berkshire, UK). DNA

oligonucleotides were from Integrated DNA Technologies (Coralville, IA).

A.4.2 Chemical Synthesis

Stilbenes 2-11 were synthesized by the routes shown in Schemes A. 1 and A.2.

A. 4.2.1 Materials

Reagents and solvents were from Sigma-Aldrich (Milwaukee, WI) and were used without further

purification. All glassware was flame-dried, and all reactions were performed under an atmosphere

of N2(g). Reagent-grade solvents: dichloromethane (DCM), tetrahydrofuran (THF), triethylamine

(TEA), and dimethylformamide (DMF) were dried over a column of alumina and were removed

from a dry still under an inert atmosphere. Flash column chromatography was performed with

Silicycle 40-63 A silica (230-400 mesh), and thin-layer chromatography (TLC) was performed

with EMD 250-pm silica gel 60-F254 plates.

A. 4.2.2 Conditions

All procedures were performed in air at ambient temperature (~22 'C) and pressure (1.0 atm)

unless indicated otherwise.

285

A. 4.2.3 Solvent Removal

The phrase "concentrated under reduced pressure" refers to the removal of solvents and other

volatile materials using a rotary evaporator at water aspirator pressure (<20 torr) while maintaining

a water bath below 40 'C. Residual solvent was removed from samples at high vacuum (<0.1 torr).

A. 4.2.4 NMR Spectroscopy

'H and 13C NMR spectra were acquired with a Bruker Avance III 500i spectrometer at the National

Magnetic Resonance Facility at Madison (NMRFAM). Chemical shift data are reported in units of

6 (ppm) relative to residual solvent.

A. 4.2.5 Mass Spectrometry

Mass spectra of small molecules were acquired with an LCT electrospray ionization (ESI)

instrument from Waters. Mass spectra of proteins were acquired with a microflex LRF desorption

ionization-time-of-flight (MALDI-TOF) instrument from Bruker (Billerica, MA). Both

instruments were in the Paul Bender Chemical Instrumentation Center at the University of

Wisconsin-Madison.

A. 4.2.6 Melting Points

Melting points were determined with an OptiMelt MPA100 instrument from Stanford Research

Systems (Sunnyvale, CA) over a range of 100-400 'C with a 0.5 'C/min heating rate. Melting

286

points values are given as the sngLe Mnis-Ics -nt Clponds 2, 3, andA 5 had mlting pIints

>400 OC.

A. 4.2. 7 Compound Purity

The purity of all compounds was judged to be >95%, as assessed by 'H and 13 C NMR

spectroscopy, mass spectrometry, melting-point determination, and reversed-phase high-

performance liquid chromatography (HPLC) using a C18 column and 515/717/996 analytical

instrument from Waters (Milford, MA) with a gradient of 10-80% v/v acetonitrile in water over

40 min.

A.4.2.8 Synthesis

5-[(1E)-2-(4-Bromo)ethenyl]-1,3-dimethoxybenzene (2a). 3,5-Dimethoxybenzyl bromide (1.2 g,

5.5 mmol) was dissolved in neat triethylphosphite (1.2 mL, 6.6 mmol) and heated to 150 'C for 4

h. The reaction mixture was cooled to 0 'C and diluted with DMF (40 mL). NaH (60% w/v in

mineral oil, 0.28 g, 7.12 mmol) was added to the resulting solution, and the reaction mixture was

stirred at 0 'C for 20 min. A solution of 4-bromo-benzaldehyde (1.0 g, 5.5 mmol) in DMF (15 mL)

was then added dropwise. The reaction mixture was allowed to warm to room temperature and

stirred overnight. The reaction mixture was then diluted with EtOAc (20 mL), and washed with

10% w/v citric acid (30 mL), followed by brine (30 mL). The organic layer was separated, dried

with Na2SO4(s), and filtered. The solvent was removed under reduced pressure, and the crude

product was purified by flash column chromatography (10% v/v EtOAc in hexanes) to afford

compound 2a as a white solid (1.56 g, 89%). 'H NMR (500 MHz, CDCL 3, 6): 3.83 (s, 6H), 6.41 (s,

287

1H), 6.65 (s, 2H), 6.98-7.01 (d, J= 16.28 Hz, 1H), 7.01-7.04 (d, J= 16.34 Hz, 1H), 7.35-7.37 (d,

J= 8.48 Hz, 2H), 7.46-7.48 (d, J= 8.51 Hz, 2H); '3C NMR (100 MHz, CDCl3, 6): 55.42, 100.31,

104.74, 121.57, 128.03, 128.16, 129.50, 131.92, 136.19, 139.07, 161.12; ASAP-MS (m/z): [M +

H] caled for C16HI 5BrO2, 319.0329; found, 319.0331.

5-[(JE)-2-(4-Bromo)ethenyl]-1,3-benzenediol (2b). Compound 2a (0.5 g, 1.6 mmol) was

dissolved in DCM (7 mL), and the resulting solution was cooled to 0 'C. A solution of 1.0 M BBr3

in DCM (7.8 mL) was added dropwise at 0 'C. The reaction mixture was allowed to warm to room

temperature and stirred for 4 h. The reaction mixture was then poured carefully into a separation

funnel containing ice water (-15 mL). The mixture was extracted with DCM (3 x 15 mL). The

organic layers were combined and washed with brine (15 mL), dried with Na2SO4(s), and filtered.

The solvent was removed under reduced pressure, and the crude product was suspended in ice-

cold DCM. The resulting precipitate was isolated by filtration to afford compound 2b as a white

solid (0.326 g, 70%). 'H NMR (500 MHz, CD30D, 6): 6.22 (s, 1H), 6.51 (s, 2H), 7.00-7.03 (d, J

16.33 Hz, 1H), 7.04-7.07 (d, J= 16.33 Hz, 1H), 7.45-7.47 (d, J= 8.5 Hz, 2H), 7.50-7.51 (d, J

8.5 Hz, 2H); 13C NMR (125 MHz, CD30D, 6): 103.43, 106.15, 121.97, 128.05, 129.17, 130.97,

132.78, 138.04, 140.42, 159.80; HRMS-ESI (m/z): [M - H]- calcd for C14HilBrO 2, 288.9870;

found, 288.9869.

5-[(1E)-2-(4-Boronic Acid Pinacol Ester) ethenylU-1,3-benzenediol (2c). Compound 2b (0.250 g,

0.858 mmol), KOAc (0.245 g, 2.57 mmol), bis(pinacolato)diboron (0.652 g, 2.57 mmol), and

Pd(dppf)C12 (0.062 g, 0.0858 mmol) were added to a Schlenk flask, which was then evacuated and

backfilled with N2(g). Dioxane (8.5 mL) was deoxygenated by sonication under high vacuum and

backfilled with N2(g). The deoxygenated dioxane was then added by cannula into the reaction

288

flask, and the reaction mixture was heated to 80 0C and stirred overnight. The reaction mixture

was then filtered, and the solvent was removed under reduced pressure. The crude product was

purified by flash column chromatography (20% v/v EtOAc in hexanes) to afford compound 2c as

a white solid (0.245 g, 85%). 'H NMR (500 MHz, CD 30D, 6): 1.34 (s, 12H), 6.22 (s, 1H), 6.51 (s,

2H), 7.02-7.06 (d, J== 16.27 Hz, 1H), 7.06-7.10 (d, J= 16.36 Hz, 1H), 7.50-7.52 (d, J= 7.93 Hz,

2H), 7.71-7.72 (d, J= 7.79 Hz, 2H); 13C NMR (125 MHz, CD30D, 6): 25.19, 85.04, 103.42,

160.21, 126.77, 129.23, 131.16, 136.08, 140.54, 141.63, 159.75; HRMS-ESI (m/z): [M - H]' calcd

for C20H23BO4 , 336.1653; found, 336.1653.

5-[(]E)-2-(4-Boronic acid)ethenyl]-1,3-benzenediol (2). Compound 2c (0.050 g, 0.147 mmol)

was dissolved in 4:1 THF/H 20 (1.5 mL). NaIO4 (0.157 g, 0.739 mmol) was added to the resulting

solution, followed by 1.0 M HCl (36 ptL, 0.036 mmol). The reaction mixture was allowed to stir

overnight. The reaction mixture was then diluted with H20 (2 mL) and extracted with EtOAc (3 x

4 mL). The organic layers were combined and washed with brine (10 mL), dried with Na2SO4(s),

and filtered. The solvent was removed under reduced pressure, and the crude product was purified

by flash column chromatography (3% v/v CH30H in DCM) to afford compound 2 as a white

crystalline solid (0.027 g, 67%) with mp >400 'C. 1H NMR (500 MHz, CD30D, 6): 6.20 (s, 1H),

6.50 (s, 2H), 7.95 (bs, 2H), 7.51-7.52 (d, J= 8.17 Hz, 2H), 7.60-7.62 (d, J= 8.13 Hz, 2H); 13C

NMR (125 MHz, CD30D, 6): 103.32, 106.13, 126.69, 129.26, 130.67, 135.09, 140.06, 140.64,

159.78; HRMS-ESI (m/z): [M - H]- calcd for the single methyl boronic ester C, 5 Hi5 BO4 ,

268.1027; found, 268.1027.

5-[(]E)-2-(2-Chloro-4-bromo)ethenyl]-1, 3-dimethoxybenzene (3a). 3,5-Dimethoxybenzyl

bromide (0.74 g, 3.4 mmol) was dissolved in neat triethylphosphite (0.7 mL, 4.0 mmol), and the

289

resulting solution was heated to 150 'C for 4 h. The reaction mixture was cooled to 0 'C and added

to DMF (20 mL). NaH (60% w/v in mineral oil, 0.17 g, 4.36 mmol) was added to the resulting

solution, and the reaction mixture was stirred at 0 'C for 20 min. A solution of 4-bromo-2-chloro

benzaldehyde (0.74 g, 3.36 mmol) in DMF (10 mL) was then added dropwise. The reaction

mixture was allowed to warm to room temperature and stirred overnight. The reaction mixture was

then diluted with EtOAc (15 mL), and washed with 10% w/v citric acid (20 mL), followed by brine

(20 mL). The organic layer was separated, dried with Na2SO4(s), and filtered. The solvent was

removed under reduced pressure, and the crude product was purified by flash column

chromatography (5% v/v EtOAc in hexanes) to afford compound 3a as a white solid (1.10 g, 91%

over 2 steps). 'H NMR (400 MHz, CDCl3, 6): 3.84 (s, 6H), 6.44 (s, 1H), 6.69 (s, 2H), 6.97-7.01

(d, J= 16.24 Hz, 1H), 7.36-7.40 (m, 2H), 7.50-7.52 (d, J= 8.42 Hz, 1H), 7.55 (s, 1H); 13C NMR

(100 MHz, CDCl 3, 5): 55.51, 100.60, 105.11, 121.37, 124.27, 127.60, 130.27, 131.89, 132.43,

134.20, 134.41, 138.81, 161.12; ASAP-MS (m/z): [M + H]' calcd for C1 6HI4BrClO2, 352.9939;

found, 352.9939.

5-[(1E)-2-(2-Chloro-4-bromo)ethenyl]-1,3-benzenediol (3b). Compound 3a (0.500 g, 1.42

mmol) was dissolved in DCM (14 mL), and the reaction mixture was cooled to 0 'C. A solution

of 1.0 M BBr3 (7.10 mmol) in DCM (7.1 mL) was then added dropwise at 0 'C. The reaction

mixture was allowed to warm to room temperature and stirred for 4 h. The reaction mixture was

then poured carefully into a separation funnel containing ice water (~10 mL) and extracted with

DCM (3 x 10 mL). The organic layers were combined and washed with brine (15 mL), dried with

Na2SO4(s), and filtered. The solvent was removed under reduced pressure, and the crude product

was suspended in cold DCM (5 mL). The resulting precipitate was filtered to afford compound 3b

290

as a white solid (0.335 g, 72%). IH NMMR (500 '-"z, CD3OD, (5): 6.23, (s, 1ITh) 6.51 (s, '),' 7.03-

7.07 (d, J= 16.23 Hz, 1H), 7.31-7.35 (d, J= 16.23 Hz, 1H), 7.45-7.47 (d, J= 8.31 Hz, 1H), 7.60

(s, 1H), 7.67-7.69 (d, J= 8.47 Hz, 1H); 13 C NMR (125 MHz, CD30D, 6): 103.79, 106.34, 124.89,

126.71, 133.26, 133.34, 133.90, 135.91,137.59,140.29, 159.89; HRMS-ESI (m/z): [M - H]- calcd

for C14HIoBrClO 2, 322.9480; found, 322.9480.

5-[(]E)-2-(2-Chloro-4-boronic Acid Pinacol Ester)ethenyl]-1,3-benzenediol (3c). Compound 3b

(0.100 g, 0.308 mmol), KOAc (0.088 g, 0.926 mmol), bis(pinacolato)diboron (0.235 g, 0.926

mmol), and Pd(dppf )C1 2 (0.0225 g, 0.0308 mmol) were added to a flame-dried Schlenk flask,

which was then evacuated and backfilled with N2(g). Dioxane (3.5 mL) was deoxygenated by

sonication under high vacuum and backfilled with N2(g). The deoxygenated dioxane was then

added by cannula into the reaction flask, and the reaction mixture was heated to 80 'C and stirred

overnight. The reaction mixture was filtered and the solvent was removed under reduced pressure.

The crude product was purified by flash column chromatography (2% v/v CH30H in DCM) to

afford compound 3c as a white solid (0.080 g, 69%). 'H NMR (500 MHz, CD30D, 6): 1.33 (s,

12H), 6.23 (s, 1H), 6.53 (s, 2H), 7.05-7.09 (d, J= 16.29 Hz, lH), 7.40-7.44 (d, J= 16.28 Hz, 1H),

7.60-7.62 (d, J= 7.98 Hz, 1H), 7.70 (s, 1H), 7.72-7.74 (d, J= 7.83 Hz, 1H); 13C NMR (125 MHz,

CD 30D, 6): 25.18, 75.82, 85.42, 103.90, 106.42, 124.82, 126.88, 133.83, 133.90, 134.11, 136.74,

139.11, 140.17, 159.84; HRMS-ESI (m/z): [M - H]- caled for C 2 0H2 2BC10 4, 370.1263; found,

370.1263.

5-[(E)-2-(2-chloro-4-hydroxyphenyl)ethenyl]-1,3-benzenediol (3). Compound 3 was derived

from the oxidation of compound 3c by an aryl N-oxide in one step as described previously. 460 Here,

compound 3c (0.020 g, 0.053 mmol) was dissolved in DCM (0.6 mL). NN-Dimethyl-p-toluidine-

291

N-oxide (0.012 g, 0.081 mmol) was added to the resulting solution, and the reaction mixture was

stirred for 1 h. The solvent was removed under reduced pressure, and the crude product was

purified by flash column chromatography (15% v/v EtOAc in hexanes) to afford compound 3 as a

pale yellow solid (0.010 g, 70%) with mp >400 'C. 1 H NMR (500 MHz, CD30D, 6): 6.18-6.19 (t,

J= 2.15 Hz, 1H), 6.47 (s, 2H), 6.74-6.76 (d, J= 8.66 Hz, 1H), 6.82 (s, 1H), 6.82-6.85 (d, J=

14.57 Hz, 1H), 7.31-7.34 (d, J= 16.16 Hz, 1H), 7.58-7.59 (d, J= 8.64 Hz, 1H); 13C NMR (125

MHz, CD30D, 6): 103.10, 105.96, 115.92, 117.03, 124.95, 127.75, 128.32, 129.78, 134.77,

140.91, 159.08, 159.77; HRMS-ESI (m/z): [M - H]- calcd for C 14Hi 1C10 3, 261.0324; found,

261.0325.

5-[(]E)-2-(2-Chloro-4-boronic Acid)ethenyl]-1, 3-benzenediol (4). Compound 3c (0.050 g,

0.134 mmol) was dissolved in 4:1 THF/H20 (1.3 mL). NaIO4 (0.143 g, 0.670 mmol) was added to

the resulting solution, followed by 1.0 M HCl (33 tL, 0.033 mmol). The reaction mixture was then

stirred overnight. The reaction mixture was diluted with H20 (1.0 mL) and extracted with EtOAc

(3 x 2 mL). The organic layers were combined and washed with brine (6 mL), dried with

Na2SO4(s), and filtered. The solvent was removed under reduced pressure, and the crude mixture

was purified by flash column chromatography (4% v/v CH30H in DCM) to afford compound 4 as

a white crystalline solid (0.022 g, 88%) with mp 186.0 'C. 1H NMR (500 MHz, CD30D, 6): 6.23

(s, 1H), 6.52 (s, 2H), 7.05-7.09 (d, J= 16.20 Hz, 1H), 7.41-7.45 (d, J= 16.23 Hz, 1H), 7.54-7.56

(d, J= 7.85 Hz, 1H), 7.63 (s, 1H), 7.75-7.76 (d, J= 7.81 Hz, 1H); 13C NMR (125 MHz, CD 30D,

6): 103.77, 106.32, 124.88, 126.71, 133.26, 133.34, 133.90, 135.91, 137.59, 140.29, 159.89;

HRMS-ESI (m/z): [M - H]- calcd for C14HI 2BClO 4, 288.0481; found, 288.0482.

292

3-[(!E )-2-4Boohnlehn!-ezi AA//d Mthy Es 7 "4tr' (I. ) E/Cy I3fI-

(bromomethyl)benzoate (2.0 g, 8.7 mmol) was dissolved in neat triethylphosphite (1.78 mL,

10.4 mmol) and heated to 150 'C for 4 h. The reaction mixture was cooled to 0 'C and diluted with

DMF (87 mL). NaH (60% w/v in mineral oil, 0.69 g, 17 mmol) was added to the resulting solution,

and the reaction mixture stirred at 0 'C for 20 min. A solution of 4-bromobenzaldehyde (1.62 g,

8.73 mmol) in DMF (87 mL) was then added dropwise. The reaction mixture was allowed to warm

to room temperature and stirred overnight. The reaction mixture was then diluted with EtOAc (50

mL), and washed with 10% w/v citric acid (20 mL), followed by brine (20 mL). The organic layer

was separated, dried with Na2SO4(s), and filtered. The solvent was removed under reduced

pressure, and the crude product was purified by flash column chromatography (10 % v/v EtOAc

in hexanes) to afford the 5a as a white solid (1.75 g, 63% over 2 steps). 'H NMR (500 MHz,

CD30D, 6): 3.84 (s, 3H), 6.94-6.96 (d, J= 8.70 Hz, 2H), 7.09-7.12 (d, J= 16.28, 1H), 7.20-7.25

(d, J= 16.41 Hz, 1H), 7.45-7.48 (t, J= 7.67, 1H), 7.53-7.55 (d, J= 8.62 Hz, 2H), 7.76-7.78 (d, J

= 7.85 Hz, 1H), 7.88-7.90 (d, J= 7.72, 1H), 8.18 (s, 1H); 13C NMR (125 MHz, CD30D, 6): 54.31,

113.73, 124.97, 126.93, 127.58, 127.80, 128.40, 129.13, 130.01, 138.21, 159.72, 168.62; HRMS-

ESI (m/z): M calcd for C16H13BrO 2 , 316.0094; found, 316.0081.

3-[(JE)-2-(4-Bromophenyl)ethenyl]-benzoic Acid (5b). Compound 5a (1.5 g, 4.7 mmol) was

dissolved in 3:1 THF/EtOH (47 mL). A solution of 2 M NaOH (4.7 mL, 9.4 mmol) was added to

the resulting solution, and the reaction mixture was stirred overnight. The reaction mixture was

then diluted with EtOAc (20 mL), and washed with 10% w/v citric acid (20 mL), followed by brine

(20 mL). The organic layer was separated, dried with Na2SO4(s), and filtered. The solvent was

removed under reduced pressure, and the crude product was purified by flash column

293

chromatography (20% v/v EtOAc in hexanes) to afford compound 5b as a white solid (1.36 g,

96%). 'H NMR (500 MHz, DMSO, 6): 7.33-7.36 (d, J= 16.52 Hz, 1H), 7.40-7.43 (d, J= 16.44

Hz, 1H), 7.50-7.54 (t, J = 7.68 Hz, 1H), 7.58-7.60 (d, J= 8.64 Hz, 2H), 7.63-7.61 (d, J= 8.66

Hz, 2H), 7.84-7.88 (t, J= 8.63 Hz, 2H), 8.16 (s, 1H), 13.10 (s, 1H); 13 C NMR (125 MHz, DMSO,

6): 121.27, 127.81, 128.68, 128.92, 129.00, 129.10, 129.51, 131.04, 132.09, 136.61, 137.66,

167.69; HRMS-ESI (m/z): M caled for CI 5Hl1BrO2, 301.9937; found, 301.9944.

3-[(]E)-2-(4-Boronic acid Pinacol Ester) ethenyl]-benzoic Acid (Sc). Compound 5b (1.0 g, 3.3

mmol), KOAc (0.971 g, 9.9 mmol), bis(pinacolato)diboron (2.5 g, 9.9 mmol), and Pd(dppf)C12

(0.24 g, 0.33 mmol) were added to a flame-dried Schlenk flask, which was then evacuated and

backfilled with N2(g). Dioxane (33 mL) was deoxygenated by sonication under high vacuum and

backfilled with N2(g). The deoxygenated dioxane was then added by cannula into the reaction

flask, and the reaction mixture was stirred overnight at 80 'C. The reaction mixture was filtered,

and the solvent was removed under reduced pressure. The crude product was purified by flash

column chromatography (20% v/v EtOAc in hexanes) to afford compound 5c as a white solid

(0.850 g, 73%). 'H NMR (500 MHz, CDC13, a): 1.36, (s, 12H), 7.22 (s, 2H), 7.46-7.50 (t, J= 7.44

Hz, 1H), 7.54-7.55 (d, J= 7.72 Hz, 1H), 7.76-7.77 (d, J= 8.12 Hz, 2H), 7.81-7.83 (d, J= 8.10

Hz, 2H), 8.00-8.01 (d, J= 8.11 Hz, 1H), 8.27-8.28 (d, J= 7.72 Hz, 1H), 8.28 (s, 1H); 13 C NMR

(125 MHz, CDCl 3, 6): 24.90, 83.87, 125.97, 128.19, 128.32, 128.94, 129.28, 129.70, 130.06,

131.67, 135.23, 137.70, 139.50; HRMS-ESI (m/z): [M + NH4] calcd for C2 1H2 3BO4, 367.2064;

found, 367.2062.

3-[(]E)-2-(4-Hydroxyphenyl)ethenyl]-benzoic Acid (5). Compound 5c (0.050 g, 0.142 mmol)

was dissolved in DCM (1.4 mL). NN-Dimethyl-p-toluidine-N-oxide (0.032 g, 0.213 mmol) was

294

added to the resulting solution, and the reaction mixture was allowed to stir for 1 h. The

mixture was filtered, and the solvent was removed under reduced pressure. The crude product was

purified by flash column chromatography (2% v/v MeOH in DCM) to afford compound 5 as a

white solid (0.030 g, 88%) with mp >400 0C. 'H NMR (500 MHz, CD30D, 5): 6.78-6.80 (d, J=

8.61 Hz, 2H), 7.02-7.05 (d, J= 16.32 Hz, 1H), 7.15-7.19 (d, J= 16.34 Hz, 1H), 7.42-7.44 (d, J=

8.26 Hz, 2H), 7.42-7.45 (t, J= 7.28 Hz, 1H), 7.73-7.75 (d, J= 7.82 Hz, 1H), 7.85-7.87 (d, J=

7.79 Hz, 1H), 8.15 (s, 1H); 13C NMR (125 MHz, CD 30D, 6): 116.52, 125.56, 128.25, 129.04,

129.10, 129.80, 130.04, 130.90, 131.42, 132.30, 139.80; HRMS-ESI (m/z): [M - H]- calcd for

Ci5 H,20 3, 239.0714; found, 239.0716.

3-[(]E)-2-(4-Boronic acid)ethenyl]-benzoic Acid (6). Compound 5c (0.5 g, 1.48 mmol) was

dissolved in 4:1 THF/H20 (15 mL). NaIO4 (0.405 g, 1.89 mmol) was added to the resulting

solution, followed by 1.0 M HCl (0.15 mL, 0.158 mmol). The reaction mixture was then stirred

overnight. The reaction mixture was diluted with H2 0 (10 mL) and extracted with EtOAc (3 x 15

mL). The organic layers were combined and washed with brine (20 mL), dried with Na2SO4(s),

and filtered. The solvent was removed under reduced pressure, and the crude product was purified

by flash column chromatography (10% v/v CH30H in DCM) to afford compound 6 as a white

crystalline solid (0.345 g, 90%) with mp 174.2 'C. 'H NMR (500 MHz, CDCl 3, 6): 7.24-7.27 (d,

J= 16.43 Hz, 1H), 7.28-7.32 (d, J= 16.45 Hz, 1H), 7.49-7.76 (t, J= 7.70 Hz, 1H), 7.58-7.59 (d,

J= 7.94 Hz, 2H), 7.62-7.64 (d, J= 7.92 Hz, 2H), 7.80-7.82 (d, J= 7.87 Hz, 1H), 7.90-7.92 (d, J

= 7.81 Hz, 1H), 8.21 (s, 1H); 13C NMR (125 MHz, CDCl 3, 6): 125.51, 127.32, 127.86, 128.37,

128.51, 129.41, 130.43, 131.07, 133.70, 137.77, 138.27, 168.32; HRMS-ESI (m/z): [M - H]- calcd

for single methyl boronic acid C1 6H1 5BO4, 280.1026; found, 280.1033.

295

Methyl 4-Bromo-2-chlorobenzoate (7a). 4-Bromo-2-chlorotoluene (1.0 g, 4.8 mmol) was

dissolved in 1:1 water/tert-butanol (20 mL). KMnO4 (1.53 g, 9.7 mmol) was added to the resulting

solution, and the reaction mixture was heated to 70 'C with a reflux condenser for 2 h. The reaction

mixture was then allowed to cool to room temperature, and more KMnO4 (1.5 g, 9.7 mmol) was

added. The reaction mixture was then reheated to 70 'C in a flask with a reflux condenser and

stirred overnight at 70 'C. The warm reaction mixture was filtered, and the resulting KMnO4 cake

was rinsed with water (~10 mL). The filtrate was acidified to pH 3 with concentrated HCl and

extracted with EtOAc (3 x 20 mL). The organic layers were combined, dried with NaSO 4 (s), and

filtered. The solvent was removed under reduced pressure to afford the carboxylic acid precursor

as a white solid. This precursor was dissolved in 3 M HCl in MeOH (15 mL) and heated to reflux

for 12 h. The reaction mixture was allowed to cool to room temperature, and N2(g) was bubbled

through the solution for 20 min to remove excess HCl(g). The solvent was removed under reduced

pressure, and the crude product was purified by flash column chromatography (20% v/v EtOAc in

hexanes) to afford compound 7a as a colorless oil (1.15 g, 96% yield over 2 steps). IH NMR (500

MHz, CD30D, 6): 3.93 (s, 3H), 7.59-7.61 (d, J= 8.47 Hz, 1H), 7.76-7.77 (m, 2H); 13 C NMR (125

MHz, CD30D, 6): 53.07, 127.34, 130.51, 131.41, 133.71, 134.63, 135.53, 166.72; HRMS-ESI

(m/z): M calcd for C8H6BrClO 2, 247.9235; found, 247.9237.

4-Bromo-2-chlorobenzyl Alcohol (7b). Compound 7a (1.00 g, 4.01 mmol) was dissolved in

THF (47 mL), and the resulting solution was cooled to 0 'C. 2 M LiBH4 in THF (12 mL, 23.5

mmol) was then added dropwise, followed by methanol (4 mL). The reaction mixture was allowed

to warm to room temperature and stirred overnight. The reaction mixture was quenched by adding

EtOAc (20 mL) dropwise, followed by water (15 mL), and then acidification to pH 5 with 1.0 M

296

HCl. The resulting lithium salts wTere removed by filtration, and the filtrate iwas extracted with

EtOAc (3 x 15 mL). The organic layers were combined, dried with NaSO4(s), and filtered. The

solvent was removed under reduced pressure, and the crude product was purified by flash column

chromatography (20% v/v EtOAc in hexanes) to afford compound 7b as a white solid (0.843 g,

95%). 'H NMR (500 MHz, CDCl 3, 6): 1.91 (bs, 1H), 4.74 (s, 2H), 7.38-7.39 (d, J= 8.22 Hz, 1H),

7.42-7.44 (d, J= 8.23, 1H) 7.53 (s, 1H); 13 C NMR (500 MHz, CDCl 3, 6): 62.48, 121.68, 129.93,

130.42, 132.05, 133.54, 137.43; HRMS-ESI (m/z): M calcd for C7H6OBrCl, 219.9286; found,

219.9282.

4-Bromo-2-chlorobenzaldehyde (7c). Compound 7b (0.5 g, 2 mmol) was dissolved in DCM

(22 mL). Pyridinium dichromate (PDC, 2.57 g, 6.84 mmol) was added to the resulting solution,

and the reaction mixture was allowed to stir ovemight. The reaction mixture was then filtered

through a pad of Celite*, and the solvent was removed under reduced pressure. The crude product

was purified by flash column chromatography (10% v/v EtOAc in hexanes) to afford compound

7c as a white solid (0.440 g, 88%). 'H NMR (500MHz, CDCl 3, a): 7.54-7.5 5 (d, J= 8.25 Hz, 1H),

7.66 (s, 1H), 7.78-7.80 (d, J=8.32 Hz, IH), 10.42 (s, 1H); 13C NMR (125 MHz, CDCl3,A) 129.78,

130.55, 131.09, 131.45, 133.49, 138.71, 188.95; ASAP-MS (m/z): [M+H]* calcd for C7H4BrClO,

218.9207; found, 218.9216.

3-[(JE)-2-(2-Chloro-4-bromophenyl)ethenyl]-benzoic Acid Methyl Ester (7d). Ethyl 3-

(bromomethyl)benzoate (1.04 g, 4.55 mmol) was dissolved in neat triethylphosphite (0.9 mL, 5.46

mmol), and the resulting solution was heated to 150 'C for 4 h. The reaction mixture was cooled

to 0 'C and diluted with DMF (40 mL). NaH (60% w/v in mineral oil, 0.23 g, 5.91 mmol) was

added to the resulting solution, and the reaction mixture was stirred at 0 0C for 20 min. A solution

297

of 4-bromo-2-chlorobenzaldehyde (7c; 1.0 g, 4.6 mmol) in DMF (5 mL) was added dropwise. The

reaction mixture was then allowed to warm to room temperature and stirred overnight. The reaction

mixture was diluted with EtOAc (5 mL), and washed with 10% w/v citric acid (5 mL), followed

by brine (5 mL). The organic layer was separated, dried with Na2SO4(s), and filtered. The solvent

was removed under reduced pressure, and the crude product was purified by flash column

chromatography (10% v/v EtOAc in hexanes) to afford compound 7d as a white solid (1.3 g, 83%

over 2 steps). 1H NMR (500MHz, CDCl 3, 6): 3.95 (s, 3H), 7.08-7.11 (d, J= 16.31 Hz, 1H), 7.39-

7.41 (d, J= 8.53 Hz, 1H), 7.44-7.47 (t, J= 8.02 Hz, 1H), 7.45-7.49 (d, J= 16.56 Hz, 1H), 7.53-

7.55 (d, J= 8.45 Hz, 1H), 7.56 (s, 1H), 7.71-7.73 (d, J= 7.73 Hz, 1H), 7.96-7.97 (d, J = 7.70 Hz,

lH), 8.19 (s, 1H); 13C NMR (125 MHz, CDCl 3, 6): 52.43, 121.70, 125.02, 127.66, 128.15, 129.02,

129.35, 130.39, 130.88, 131.12, 132.55, 134.26, 134.35, 137.19, 167.02; HRMS-ESI (m/z): [M +

NH4] calcd for C16HI 2BrClO2, 368.0048; found, 368.0053.

3-[(]E)-2-(2-Chloro-4-bromophenyl)ethenyl]-benzoic Acid (7e). Compound 7d (1.34 g, 3.83

mmol) was dissolved in 3:1 THF/EtOH (40 mL). 2.0 M NaOH (3.3 mL, 7.7 mmol) was added to

the resulting solution, and the reaction mixture was stirred overnight. The reaction mixture was

diluted with EtOAc (20 mL), and washed with 10% w/v citric acid (30 mL), followed by brine (30

mL). The organic layer was separated, dried with Na2SO4(s), and filtered. The solvent was removed

under reduced pressure, and the crude product was purified by flash column chromatography (20%

v/v EtOAc in hexanes) to afford compound 7e as a white solid (1.22 g, 95%). 'H NMR (500 MHz,

DMSO, 6): 7.42-7.45 (d, J= 16.39 Hz, 1H), 7.47-7.50 (d, J= 16.41 Hz, 1H), 7.53-7.56 (t, J=

7.71 Hz, 1H), 7.60-7.62 (d, J= 8.54, 1H), 7.79 (s, 1H), 7.86-7.88 (d, J= 8.63 Hz, 1H), 7.88-7.92

(t, J= 7.86, 2H), 8.15 (s, 1H), 13.01 (bs, IH); 13 C NMR (125 MHz, DMSO, 6): 121.99, 124.85,

298

128.89, 129.10, 129.94 130.17, 131.41, 131 73, 12),Q 132.3Q 13Q7, 13A61, 1322 1Q()7,

167.31; HRMS-ESI (m/z): [M - H]- caled for Ci5H oBrClO 2, 334.9479; found, 334.9476.

3-[(]E)-2-(2-Chloro-4-boronic Acid Pinacol Ester)ethenyl]-benzoic Acid (7f). Compound 7e

(0.200 g, 0.580 mmol), KOAc (0.171 g, 1.791 mmol), bis(pinacolato)diboron (0.45 g, 1.79 mmol),

and Pd(dppf)C1 2 (0.043 g, 0.059 mmol) were added to a flame-dried Schlenk flask, which was then

evacuated and backfilled with N2(g). Dioxane (6 mL) was deoxygenated by sonication under high

vacuum and backfilled with N2(g). The deoxygenated dioxane was then added by cannula into the

reaction flask, and the reaction mixture was stirred overnight at 80 'C. The reaction mixture was

filtered, and the solvent was removed under reduced pressure. The crude product was purified by

flash column chromatography (1% v/v CH30H in DCM) to afford compound 7f as a white solid

(0.183 g, 82%). 'H NMR (500 MHz, CD30D, &): 1.36 (s, 12H), 7.33-7.36 (d, J= 16.30 Hz, 1H),

7.49-7.52 (t, J= 8.41 Hz, 1H), 7.61-7.64 (d, J= 16.47 Hz, 1H), 7.66-7.67 (d, J= 7.46, 1H), 7.73

(s, 1H), 7.83-7.85 (d, J= 7.75 Hz, 2H), 7.95-7.97 (d, J= 7.73, 1H), 8.24 (s, 1H); 13C NMR (125

MHz, CD30D, 6): 25.19, 85.51, 126.33, 127.18, 129.00, 130.07, 130.41, 132.16, 132.61, 134.18,

136.79, 138.74, 138.86, 169.56; HRMS-ESI (m/z): [M + NH4]'calcd for C21H22BClO4, 401.1675;

found, 401.1666.

3-[(JE)-2-(2-Chloro-4-hydroxyphenyl)ethenyl]-benzoic Acid (7). Compound 7f (0.100 g,

0.259 mmol) was dissolved in DCM (2.6 mL). NN-Dimethyl-p-toluidine-N-oxide (0.060 g, 0.389

mmol) was added to the resulting solution, and the reaction mixture was then stirred overnight.

The solvent was removed under reduced pressure, and the crude product was purified by flash

column chromatography (2% v/v MeOH in DCM) to afford compound 7 as a white solid (0.051

g, 72%) with mp 213.6 0C. 'H NMR (500 MHz, CD30D, 6): 6.77-6.68 (d, J= 8.63 Hz, 1H), 6.84

299

(s, 1H), 7.06-7.10 (d, J= 16.34 Hz, 1H), 7.45-7.48 (t, J= 7.72 Hz, 1H), 7.49-7.52 (d, J= 16.31

Hz, 1H), 7.65-7.67 (d, J= 8.63 Hz, 1H), 7.75-7.77 (d, J= 7.91 Hz, 1H), 7.89-7.91 (d, J= 7.78

Hz, 1H), 8.17 (s, 1H); 13C NMR (125 MHz, CD30D, 6): 115.98, 117.09, 126.42, 127.43, 128.47,

128.48, 128.58, 129.60, 129.92, 131.70, 132.43, 135.06, 139.41, 159.45, 169.74; HRMS-ESI

(m/z): [M - H]- caled for C15 HIClO3, 273.0324; found, 273.0327.

3-[(JE)-2-(2-Chloro-4-boronic Acid)ethenyl]-benzoic (8). Compound 7f (0.100 g, 0.260 mmol)

was dissolved in 4:1 THF/H20 (2.6 mL). NaIO4 (0.28 g, 1.30 mmol) was added to the resulting

solution, followed by 0.02 mL of 1.0 M HCl (0.02 mmol). The reaction mixture was then stirred

overnight. The reaction mixture was diluted with H20 (2 mL) and extracted with EtOAc (3 x 3

mL). The organic layers were combined and washed with brine (5 mL), dried with Na2SO4(s), and

filtered. The solvent was removed under reduced pressure, and the crude product was purified by

flash column chromatography (1-3% v/v CH30H in DCM) to afford compound 8 as a white

crystalline solid (0.058 g, 75%) with mp 161.0 'C. 'H NMR (500 MHz, CD30D, 6): 7.32-7.35 (d,

J= 16.37 Hz, 1H), 7.51-7.54 (t, J= 7.73 Hz, 1H), 7.51-7.67 (in, 3H), 7.84-7.85 (d, J= 7.29 Hz,

2H), 7.97-7.98 (d, J= 7.76 Hz, 1H), 8.25 (s, 1H); 13 C NMR (125 MHz, CD30D, 6): 126.41,

126.94, 128.93, 130.04, 130.29, 132.07, 132.63, 133.31, 134.12, 135.95, 137.29, 138.82, 169.61;

HRMS-ESI (m/z): [M - H]- calcd for the single methyl boronic ester C1 6H 14BC10 4 , 314.0637;

found, 314.0635.

1,1'-(]E)-(1,2-Ethenediyl)bis[2-chloro-4-bromo]-benzene (9a). 4-Bromo-1-(bromomethyl)-2-

chloro-benzene (0.300 g, 1.05 mmol) was dissolved in neat triethylphosphite (0.217 mL, 1.26

mmol) and heated to 150 'C for 4 h. The reaction mixture was then cooled to 0 'C and diluted with

DMF (8 mL). NaH (60% w/v in mineral oil, 0.054 g, 1.36 mmol) was added to the resulting

300

schltinn and the reactinn Miviir Aw7as ctrreA at 0 C 4fror '2 m;i. A -lft f_4-rrr'o m -chrf--on, -t e r~qC xx x XXLIx% VV O L 'J L11I% %.L ".L WJ '.- lk L Z1.j .J 111I. 1 1. 13%JL4LI.J11 JL

benzaldehyde (7c; 0.230 g, 1.05 mmol) in DMF (2.5 mL) was then added dropwise. The reaction

mixture was allowed to warm to room temperature and stirred overnight. The reaction mixture was

diluted with EtOAc (8 mL), and washed with 10% w/v citric acid (10 mL), followed by brine (10

mL). The organic layer was then separated, dried with Na2SO 4 (s), and filtered. The solvent was

removed under reduced pressure, and the crude product was suspended in cold DCM (10 mL). The

resulting precipitate was collected by filtration to afford compound 9a as a white solid (0.306 g,

72% yield over 2 steps). 'H NMR (500 MHz, CDCl 3, 6): 7.39 (s, 2H), 7.41-7.43 (d, J= 8.42 Hz,

2H), 7.58 (s, 2H), 7.58-7.59 (d, J= 7.12 Hz, 2H); 3C NMR (125 MHz, CDCl 3, 6): 121.99, 126.70,

127.83, 130.35, 132.45, 133.89, 134.30; ASAP-MS (m/z): M' calcd for C14H8Br2C2, 403.8365;

found, 403.8367.

1,] '-(JE)-(1,2-Ethenediyl)bis[2-chloro-4-boronic Acid Pinacol Ester]-benzene (9b). Compound

9a (0.050 g, 0.123 mmol), KOAc (0.071 g, 0.742 mmol), bis(pinacolato)diboron (0.187 g,

0.742 mmol), and Pd(dppf)C12 (9 mg, 0.012 mmol) were added to a flame-dried Schlenk flask,

which was then evacuated and backfilled with N2(g). Dioxane (2 mL) was deoxygenated by

sonication under high vacuum and backfilled with N2(g). The deoxygenated dioxane was then

added by cannula into the reaction flask, and the reaction mixture was heated to 80 'C and stirred

overnight (Note: higher yields of the diboronated product were found at more dilute reaction

concentrations). The reaction mixture was filtered, and the solvent was removed under reduced

pressure. The crude product was purified by flash column chromatography (50% v/v DCM in

hexanes) to afford 9b as a white solid (0.056 g, 92%). 'H NMR (500 MHz, CDCl3, a): 1.35 (s,

24H), 7.56 (s, 2H), 7.67-7.69 (d, J= 7.76 Hz, 2H), 7.73-7.75 (d, J= 7.85 Hz, 2H), 7.83 (s, 2H);

301

"C NMR (125 MHz, CDCl 3, 6): 24.88, 84.18, 126.17, 127.94, 133.02, 133.45, 136.06, 137.45;

ASAP-MS (m/z): [M + H]f calcd for C26H32B2C1204, 499.2009; found, 499.2001.

1,1 '-(JE)-(1,2-Ethenediyl)bis[2-chloro-4-hydroxy]-benzene (9). Compound 9b (0.020 g,

0.040 mmol) was dissolved in DCM (0.5 mL). NN-Dimethyl-p-toluidine-N-oxide (0.0 18 g, 0.120

mmol) was added to the resulting solution, and the reaction mixture was stirred for 1 h. The solvent

was removed under reduced pressure, and the crude product was purified by flash column

chromatography (2% v/v MeOH in DCM) to afford compound 9 as a white solid (9 mg, 80%) with

mp 203.9 'C. 'H NMR (500 MHz, CD 30D, 6): 6.75-6.77 (d, J= 8.61 Hz, 2H), 6.83 (s, 2H), 7.24

(s, 2H), 7.55-7.57 (d, J= 8.59 Hz, 2H); 13 C NMR (125 MHz, CD 30D, a): 115.93, 117.07, 125.22,

128.00, 128.36, 134.73, 159.09; HRMS-ESI (m/z): [M - H]- calcd for C1 4H1 0C1202 , 278.9985;

found, 278.9985.

1,1'-(JE)-(1,2-Ethenediyl)bis[2-chloro-4-boronic acid]-benzene (10). Compound 9b (0.020 g,

0.040 mmol) was dissolved in 4:1 THF/H20 (0.6 mL). NaIO4 (0.042 g, 0.200 mmol) was added to

the resulting solution, followed by a few drops of 1.0 M HCl. The reaction mixture was then stirred

overnight. The reaction mixture was then diluted with H20 (1 mL) and extracted with EtOAc (3 x

2 mL). The organic layers were combined and washed with saturated brine (3 mL), dried with


was purified by flash column chromatography (3% v/v CH30H in DCM) to afford compound 10

as a white crystalline solid (0.010 g, 74%) with mp 234.2 0 C. 'H NMR (500 MHz, CD 30D, a):

7.59 (s, 1H), 7.59-7.60 (d, J= 8.42 Hz, 2H), 7.66 (s, 1H), 7.78-7.80 (d, J= 7.78 Hz, 2H); '3 C

NMR (125 MHz, CD30D, 6): 127.14, 128.69, 133.38, 134.25, 135.98; MALDI-MS (m/z): M

calcd for C14H12 B2C12 0 4 , 336.03; found, 336.00.

302

2-Bromo-1-[1-72-(3-bromophenyflethenyl]-4ch/orobenzene (1a). 3 -Bromobenzyl bromide

(0.500 g, 2.00 mmol) was dissolved in neat triethylphosphite (0.411 mL, 2.4 mmol), and the

resulting solution was heated to 150 'C for 4 h. The reaction mixture was cooled to 0 'C and then

diluted with DMF (10 mL). NaH (60% w/v in mineral oil, 0.096 g, 2.4 mmol) was added to the

resulting solution, and the reaction mixture stirred at 0 'C for 20 min. A solution of 4-bromo-2-

chloro-benzaldehyde (7c; 0.447 g, 2.04 mmol) in DMF (10 mL) was then added drop-wise. The

reaction mixture was allowed to warm to room temperature and stirred overnight. The reaction

mixture was diluted with EtOAc (20 mL), and washed with 10% w/v citric acid (30 mL) followed

by brine (30 mL). The organic layer was separated, dried with Na2SO4(s), and filtered. The solvent

was removed under reduced pressure, and the crude product was suspended in ice-cold DCM (5

mL). The resulting precipitate was collected by filtration to afford compound IIa as a white solid

(0.514 g, 69% over 2 steps). 'H NMR (500 MHz, CD30D, 6): 7.09-7.13 (d, J= 16.3 Hz, 1H),

7.26-7.29 (d, J= 12.53 Hz, 2H), 7.35-7.39 (t, J= 7.61 Hz, 1H), 7.53 (s, 3H), 7.61-7.63 (d, J=

7.74 Hz, 1H), 7.79 (s, 1H); 13C NMR (125 MHz, CD30D, 6): 124.42, 125.32, 126.40, 127.54,

127.67, 131.61, 131.80, 131.80, 132.97, 134.94, 135.71, 141.09; ASAP-MS (m/z): M calcd for

C14H9Br2Cl, 369.8754; found, 369.8740.

2-Boronic Acid Pinacol Ester-1-[(1E)-2-(3-Boronic acid pinacol ester)ethenylU-4-chloro-benzene

(11b). Compound 1 la (0.100 g, 0.270 mmol), KOAc (0.152 g, 1.62 mmol), bis(pinacolato)diboron

(0.410 g, 1.62 mmol), and Pd(dppf)C12 (0.0270 g, 0.020 mmol) were added to a flame-dried

Schlenk flask, which was then evacuated and backfilled with N2(g). Dioxane (3.6 mL) was

deoxygenated by sonication under high vacuum and backfilled with N2(g). The deoxygenated

dioxane was then added by cannula into the reaction flask, and the reaction mixture was heated to

303

80 'C and stirred overnight. The reaction mixture was filtered, and the solvent was removed under

reduced pressure. The crude product was purified by flash column chromatography (2% v/v DCM

in hexanes) to afford 11b as a white solid (0.111 g, 87%). 'H NMR (500 MHz, CD30D, 6): 1.36

(s, 12H), 1.38 (s, 12H), 7.27-7.30 (d, J= 16.29 Hz, 1H), 7.38-7.41 (t, J= 7.47 Hz, lH), 7.55-7.58

(d, J= 16.30 Hz, 1H), 7.64-7.66 (d, J= 7.73 Hz, lH), 7.67-7.70 (t, J= 7.88 Hz, 2H), 7.72 (s, lH),

7.81-7.82 (d, J= 7.75 Hz, 1H), 7.96 (s, 1H); 13C NMR (125 MHz, CD30D, 6): 26.45, 26.48, 50.43,

50.60, 86.55, 86.74, 126.43, 128.29, 130.59, 132.18, 134.79, 135.29, 135.31, 135.40, 136.90,

138.03, 138.91, 140.44; HRMS-ESI (m/z): [M + NH4]* calcd for C2 6 H33B2C1O4, 484.2607; found,

484.2600.

2-Boronic Acid-1-[(JE)-2-(3-boronic Acid)ethenyl]-4-chloro-benzene (11). Compound 11b

(0.100 g, 0.214 mmol) was dissolved in 4:1 THF/H20 (2.1 mL). NaIO4 (0.227 g, 1.07 mmol) was

added to the resulting solution, followed by a few drops of 1.0 M HCl. The reaction mixture was

then stirred overnight. The reaction mixture was diluted with H20 (2 mL) and extracted with

EtOAc (3 x 2 mL). The organic layers were combined and washed with brine (4 mL), dried with


was purified by flash column chromatography (2% v/v CH30H in DCM) to afford compound 11

as a white crystalline solid (0.040 g, 63%) with mp 146.3 0C. 'H NMR (500 MHz, CD 30D, 6):

7.24-7.27 (d, J= 16.3 Hz, 1H), 7.37-7.40 (t, J= 7.57 Hz, 1H), 7.52-7.57 (m, 3H), 7.63-7.64 (d,

J= 4.84 Hz, 2H), 7.79 (s, 2H); 13C NMR (125 MHz, CD 30D, 6): 125.12, 126.78, 128.85, 129.02,

129.16, 129.80, 133.24, 133.95, 134.31, 135.93, 137.64; HRMS-ESI (m/z): [M - H]- caled forthe

single methyl boronate ester C15H1 4B2CIO4, 315.0772; found, 315.0772.

304

A.4.3 Protein Expression and Purification

Plasmids that direct the expression of wild-type TTR and its V30M variant were prepared in the

pET32b vector from Merck KGaA (Darmstadt, Germany) by standard methods. 223 To create the

plasmid encoding V30M TTR, two double-stranded DNA fragments were prepared by PCR using

complementary primers containing V30M-generating substitutions and gene-specific primers

targeting opposite termini for assembly with the plasmid fragment. Wild-type TTR and its V30M

variant were produced in Escherichia coli strain BL-21 from Merck KGaA cultured in Luria-

Bertani medium containing ampicillin (200 iM) at 37 'C. Gene expression was induced when

OD6oo nm reached ~2.0, and cells were then grown for an additional 4 h at 37 'C. Cell pellets were

resuspended in 20 mM Tris-HCl buffer, pH 7.4, containing EDTA (1.0 mM) and lysed with a

high-pressure cell disruptor from Constant Systems (Kennesaw, GA). The soluble fraction was

isolated by centrifugation for 10 min at 10,500g and for 1 h at 30,000g.

Wild-type TTR and its V30M variant were purified as described previously, 461 with minor

modifications. The lysate was fractionated with aqueous ammonium sulfate at 60-85% saturation.

The precipitate was dissolved in 20 mM Tris-HCl buffer, pH 7.8, containing EDTA (1.0 mM) and

dialyzed overnight against this same buffer. The isolate was clarified at 30,000g for 30 min,

filtered, and applied to a Hitrap Q HP column from GE Healthcare Life Sciences (Pittsburgh, PA)

that had been equilibrated with the dialysis buffer. TTR was eluted with the same buffer containing

NaCl (1.0 M) and was subjected to gel-filtration chromatography on a Superdex 75 column from

GE Healthcare Life Sciences (Pittsburgh, PA) that had been pre-equilibrated with 10 mM sodium

phosphate buffer, pH 7.6, containing KCl (100 mM). Pure tetrameric TTR eluted at -0.6 column

volumes as monitored by SDS-PAGE. The concentration of wild-type TTR and its V30M variant

305

was determined from the A280 .m by using e = 18.5 x 10' M'cm' and confirmed with a

bicinchoninic acid (BCA) assay using a kit from Pierce Biotechnology (Rockford, IL).

A.4.4 Competitive Fluorescence Assay

Fluorescence measurements were performed with a Photon Technology International

Quantamaster spectrofluorometer (Edison, NJ). Wild-type TTR was found to form a complex with

ANS that has a Kd value of 3.2 pM (data not shown).462 To determine the affinity for ligands, wild-

type TTR tetramers (500 nM) were incubated in 2 mL of 10 mM sodium phosphate buffer, pH 7.6,

containing KCl (100 mM) and ANS (5.0 pM) until the fluorescence signal (excitation: 410 nm;

emission: 460 nm) was stable (~30 min). A ligand (1 nM-10 pM) was then added in aliquots (5

[tL) from a stock solution in dimethyl formamide (DMF). The fluorescence intensity at each ligand

concentration was recorded before adding the next dose. Control experiments with only the solvent

(DMF) revealed that the fluorescence intensity dropped proporationally with the dilution factor.

Accordingly, average intensities were adjusted for the dilution incurred upon adding ligand and

were expressed as a percent change from the initial measurement. The final DMF concentration

was 6.3% by volume after 27 additions of ligand.

The data did not fit well to the one- or two-site competitive binding models used previously

to describe other systems. 462 The asymmetric behavior is likely a consequence of two distinct

binding events and a modest decrease in fluorescence intensity resulting from the first ANS

dissociation event. The steepest inflection point at higher ligand concentrations can be attributed

to the half-maximal concentration required to compete ANS from the second binding site. The

Prusoff-Cheng relation was used to account for this competition, resulting in a logistic equation: 463

306

% I = %O + %sf S (1)

1 '2 1+ ANS - [ligand])

where Kd,2 is the equilibrium dissociation constant of the second site, n is the Hill coefficient, and

S is the symmetry parameter. Values of Kd,2 were determined with Prism 6 software from Graphpad

(La Jolla, CA) by holding constant the ligand concentration, ANS concentration, and Kd,ANS

3.2 ptM and varying other parameters to maximize the value of R2, which was >0.99 for all datasets.

The value of Kd,2= (373 10) nM for the TTR-tafamidis complex determined with this method is

similar to the value of Kd,2 = 278 nM determined with isothermal titration calorimetry. 41" Values

of Kd,1 were not obtainable by this method, though modest inflection points were observed in the

low nanomolar range (Figure A.S 1). Reported values of K,1 for other ligands fall within this range,

however, our efforts to improve kinetic stabilization properties focus on reducing the Kd,2 value.

A.4.5 Fibril-Formation Assay

Light-scattering at 400 nm was used to assess the formation of fibrils under acidic conditions, as

described previously. 464 In this assay, turbidity is assumed to be proportional to the mass of the

protein converted to fibrils. Ligands (7.2 or 14.4 pM) were incubated with TTR tetramers (7.2

pM) for 30 min prior to twofold dilution with 50 mM sodium acetate buffer, pH 4.4, containing

KCl (100 mM). Absorbance at 400 nm was measured ten times with four replicates in clear, flat-

bottomed, 96-well plates at 0 and 96 h using an M1000 plate reader from Tecan (Maennedorf,

Switzerland). Percent fibril formation was calculated from the difference in the light scattering that

307

accumulated after 96 h in ligand-containing wells versus wells containing only vehicle (DMF),

using the equation:

% fibril formation = A 4 0 0 nm,ligand,96 h- A 4 0 0 nmigand,o h X 100% (2)A 4 0 0 nm,DMF,96 h- A 4 0 0 nm,DMF,0 h

Standard deviations from the four replicates were propagated through this calculation.

A.4.6 Protein Crystallization and X-ray Structure Determination

Crystals were grown by vapor diffusion using a solution of wild-type TTR (-6.0 mg/mL in 10 mM

sodium phosphate buffer, pH 7.6, containing 100 mM KCl) and stilbenes 2-8, 10, or 11 (7.2 mM,

added from a stock solution in DMF). Hanging drops (2 !IL of solution containing protein and

ligand plus 2 ptL of mother liquor) were equilibrated with a reservoir of 1.0-1.3 M sodium citrate

buffer, pH 5.5, containing glycerol (1-3% v/v). 4 00 Crystals appeared after 1-3 days. TTR

precipitated in the presence of stilbene 9 (7.2 mM). Crystals were cryoprotected by brief transfer

into a solution of 1.5 M sodium citrate buffer, pH 5.5, containing glycerol (10% v/v). Diffraction

data were collected at Sector 21 of the Life Sciences Collaborative Access Team (LS-CAT) at the

Advanced Photon Source of Argonne National Laboratory (Argonne, IL). Data were reduced using

HKL2000 (Tables A.S2-A.S 1 0).465 Boronate ester restraints were obtained by measuring the bond

lengths and angles from ten CSD small-molecule structures (Table A.S1). Initial phases were

obtained by molecular replacement using the protein atoms of Protein Data Bank (PDB) entry

2qgb as a model. 450 Refinement and model building were conducted with the programs Phenix and

Coot (Tables A.S2-A.S 10 and Figures A. S4-A.S 12).228-229 Ligand models in idealized geometries

308

we-re prepanredr withi thei prngram WebAM ,nd res:1i"t weipipdin Phenix w i he rrgraMI - ' I-~ - - - ' P ..-' %4A9 TV L IVAJ', (LLI.L L%'.JLAL Z1iIL VV% AL F' ,JJJ. ,%. AAAL '. A I 1 '..L ,~ VVALL LAII %.'LJI L I

eLBOW. Restraints generated by eLBOW were modified to impose planarity on the four carbon

atoms in the olefin of the stilbene and on the carbon, boron, and two oxygen atoms in a boronic

acid group. Short interatomic distances in TTR ligand complexes are listed in Table A.S11.

Atomic coordinates and structure factors for all nine TTRdligand complexes have been deposited

in the PDB.

309

A.5 Conclusion

A series of paired stilbenes was designed, synthesized, and tested as ligands for TTR. Each ligand

contained either a phenolic hydroxyl group or a phenylboronic acid group, which was intended to

bind within the inner pocket of the T4-binding site in the dimer-dimer interface. We found that the

functional groups on the stilbene can alter the binding mode, precluding a rigorous thermodynamic

analysis. Nevertheless, our boronic acids are the first ligands observed to form a reversible

covalent bond with TTR-one with a serine residue deep in the inner pocket. Crystal structures

also revealed the first sp2-hybridized boronic esters observed with a protein. These stilbene boronic

acids inhibit the TTR fibril-formation that leads to amyloidosis. Their efficacy extends to V30M

TTR, which is a common disease-related variant. We envision that the unique attributes of boronic

acid groups could find utility in pharmacological stabilizers of other proteins.

310

A . A k nowedg*ets

We are grateful to Drs. R. M. Murphy and C. L. Jenkins for contributive discussions. I.W.W. was

supported by Biotechnology Training Grant T32 GM008349 (NIH) and a Genentech Predoctoral

Fellowship. This work was supported by Grants RO 1 GM044783 (NIH) and MCB 1518160 (NSF),

and made use of NMRFAM (University of Wisconsin-Madison), which is supported by Grant P41

GM103399 (NIH).

311

Table A.1

Table A.1. Interaction of Diphenols 1-4 with Wild-type TTR and its V30M Variant

wild-type TTR V30M TTR

compound Kd,2 (pM) %FF 1:la %FF 2:la %FF 1:la %FF 2:la binding modeb

1 (resveratrol) 0.47 0.03 28 8 9 2 53 12 26 4 forward

2 0.47 0.02 21 4 9 2 48 11 23 6 reverse

3 0.73 0.02 22 4 7 3 44 10 14 4 forward

4 0.44 0.01 19 4 4 2 36 8 12 3 reverse

a%FF, percent fibril formation. b"forward": upper phenyl ring depicted in Chart A.2 lies in the

outer pocket of the T4-binding site, according to X-ray diffraction analysis (Figures A.2 and A.3);

"reverse": upper ring lies in the inner pocket.413

312

Tnhh- A.7

Table A.2. Interactions of Carboxylic Acids 5-8 with Wild-type TTR and its V30M Variant


compound Kd,2 (pM) %FF 1:la %FF 2:la %FF 1:la %FF 2:la binding modeb

5 1.8 0.1 75 24 77 14 112 25 120 32 forward

6 0.99 0.02 21 4 11 2 48 12 28 8 reverse

7 0.47 0.01 21 4 3 1 41 10 9 2 forward

8 0.45 0.01 16 4 4 1 32 9 10 2 reverse

a%FF, percent fibril formation. b,"forward": upper phenyl ring depicted in Chart A.2 lies in the

outer pocket of the T4-binding site, according to X-ray diffraction analysis (Figures A.2 and

A.3); "reverse": upper ring lies in the inner pocket.413

313

Table A.3

Table A.3. Interactions of Compounds 9-12 with Wild-type TTR and its V30M Variant


compound Kd,2(PM) %FF 1:la %FF2:1a %FF 1:la %FF2:la bindingmodeh

9 0.82 0.01 30 6 12 2 62 16 27 10 NDC

10 0.47 0.03 20 3 3 3 32 9 8 3 covalent

11 0.79 0.03 26 4 10 3 54 16 16 5 forward/covalent

12 (tafamidis) 0.37 0.01 22 5 0 6 5 13 8 7 forwardd

a%FF, percent fibril formation. b,"forward": upper phenyl ring depicted in Chart A.3 lies in the

outer pocket of the T4-binding site, according to X-ray diffraction analysis (Figure A.2); 413

"covalent": formation of a boronate ester with Ser 17. CND, not determined. dRef. 417.

314

Table A.S1

Table A.S1. Bond Angles and Bond Lengths of Planar Boronic Esters in Small-molecule CrystalStructures in the Cambridge Structural Database (CSD) and in Protein Co-crystal Structures Reportedin this Work. The parameter n refers to the number of B-OR' bonds in the structure. The means valuesof the bond angles a and f, and the bond length rB-OR' from these small-molecule structures were used inthe refinement of X-ray diffraction data from the TTR-10 and TTR-11 complexes with the programphenix refine.

SR'

CSD entry

DOFLIUDOFLOADOFLUGHOXPOALUKWUKNEYVIXQEHMATREZYEATOMKAJWUMCAK

a (0)124125122120118119119119118118

/f (0)123124123120120121120122112109

rB-OR' (A)1.371.371.381.371.361.361.351.361.391.37

n2242I11421

Mean SD 120 3 119 5 1.37 0.02

Ligand Chain a (0) 6 (0) rB-OR' (A)10 A 116 109 1.44

10 B 116 108 1.3711 A 121 116 1.4211 B 124 109 1.43

315

Table A.S2

Table A.S2. Crystallographic Data Collection and Refinement Statistics

for the TTR 2 Complex.

Complex 2PDB Code 5u48

Data CollectionX-Ray SourceDetectorWavelength (A)Resolution, last shell (A)Space group

Unit cell, a, b, c (A)Unit cell, a, p6, y (0)No. of ReflectionsNo. of Unique ReflectionsRedundancy (last shell)Mean I/a (last shell)Completeness (last shell)R-meas (last shell)R-pim (last shell)Wilson B-factorAverage Mosaicity (0)

RefinementWorking Set (last shell)Test Set (last shell)Rwork (last shell)Rftee (last shell)RMSD of Bond Lengths (A)RMSD of Bond Angles (0)


Average B-factorProteinLigandWater

Ramachandran Favored, Allowed, Outliers(%) from MolProbity

LS-CAT 21-ID-GMAR 300 CCD0.9785735.5-1.50 (1.55-1.50)P21 21 21a = 43.228, b = 84.947c = 64.606a =,8= y = 90284,04039,026 (1904)7.3 (7.3)28.6 (2.6)99.9 (100.0)0.056 (0.738)0.021 (0.272)18.160.2

35,141 (3422)3843 (376)0.178 (0.233)0.205 (0.260)0.0071.05198523118263812123.923.422.431.9

98.7, 1.3, 0

316

Tahle A S3

Table A.S3. Data Collection and Refinement Statistics for the TTR 3

Complex.

ComplexPDB Code

35u49


Unit cell, a, b, c (A)Unit cell, a, f, y (0 )No. of ReflectionsNo. of Unique ReflectionsRedundancy (last shell)Mean I/ (last shell)Completeness (last shell)R-meas (last shell)R-pim (last shell)Wilson B-factorAverage Mosaicity (0)

RefinementWorking Set (last shell)Test Set (last shell)Rwork (last shell)Rfree (last shell)RMSD of Bond Lengths (A)RMSD of Bond Angles (0)




LS-CAT 21-ID-DDetris Eiger 9M1.23980133.0-2.22 (2.30-2.22)12 2 2a = 44.517, b 65.895c = 84.594a ='8 = y = 9039,1095956 (300)6.6 (4.4)19.2 (2.3)98.7 (98.0)0.395 (1.722)0.151 (0.757)38.31.3

5598 (1311)621 (146)0.202 (0.249)0.275 (0.318)0.0081.04925116896181141.241.339.538.5

96.5, 3.5, 0

317

Table A.S4

Table A.S4. Crystallographic Data Collection and Refinement Statisticsfor the TTR 4 Complex.Complex 4PDB Code 5u4a


Unit cell, a, b, c (A)Unit cell, a, ,8, y (0)No. of ReflectionsNo. of Unique ReflectionsRedundancy (last shell)Mean I/- (last shell)Completeness (last shell)R-meas (last shell)R-pim (last shell)Wilson B-factorAverage Mosaicity (0)





LS-CAT 21-ID-GMAR 300 CCD0.9785735.5-1.90 (1.96-1.90)P21 2j 21a = 42.822, b 84.928c = 64.19a =,8 = y = 90137,45619,212 (936)7.2 (7.2)19.4 (2.3)99.8 (100.0)0.097 (0.787)0.037 (0.289)33.61.4

17,310 (1291)1859 (143)0.223 (0.247)0.276 (0.279)0.0071.0319072321792407539.038.932.444.6

97.4, 2.2, 0.4

318

Table A.ql


for the TTR-5 Complex.

ComplexPDB Code

55u4b


Unit cell, a, b, c (A)Unit cell, a,,8, y (0)

No. of ReflectionsNo. of Unique ReflectionsRedundancy (last shell)Mean I/- (last shell)Completeness (last shell)R-meas (last shell)R-pim (last shell)Wilson B-factorAverage Mosaicity (0)

RefinementWorking Set (last shell)Test Set (last shell)Rwork (last shell)Rfee (last shellRMSD of Bond Lengths (A)RMSD of Bond Angles (0)




LS-CAT 21-ID-GMAR 300 CCD0.9785738.5-1.45 (1.5-1.45)P 2 1 2, 2,a = 43.052, b = 85.446c = 64.107a='p=y=90311,25242,801 (2112)7.3 (7.2)25.4 (2.7)99.9 (100.0)0.065 (0.582)0.024 (0.214)15.40.2

38,552 (3756)4230 (405)0.186 (0.241)0.207 (0.269)0.0071.03205123118583615719.618.827.327.2

99.2, 0.8, 0

319

Table A.S6


for the TTR-6 Complex.Complex 6PDB Code 5u4c


Unit cell, a, b, c (A)Unit cell, a, p, y (0)No. of ReflectionsNo. of Unique ReflectionsRedundancy (last shell)Mean I/- (last shell)Completeness (last shell)R-meas (last shell)R-pim (last shell)Wilson B-factorAverage Mosaicity (0)





LS-CAT 21-ID-GMAR 300 CCD0.9785736.0-1.7 (1.76-1.70)P 21 2, 2,a = 42.924, b = 85.580c = 63.762a =8= y = 90191,62326,541 (1303)7.2 (7.1)23.9 (2.0)99.6 (99.1)0.065 (0.818)0.024 (0.301)19.40.6

23,900 (2334)2593 (224)0.176 (0.225)0.204 (0.271)0.0071.061204323118694013423.823.730.830.1

97.9. 1.7. 0.4

320

Thh A.S7


for the TTR-7 Complex.

Complex 7PDB Code 5u4d

Data CollectionX-Ray SourceDetectorWavelength (A)Resolution, last shell (A)Space groupUnit cell, a, b, c (A)

Unit cell, a,f8, y (')No. of ReflectionsNo. of Unique ReflectionsRedundancy (last shell)Mean I/- (last shell)Completeness (last shell)R-meas (last shell)R-pim (last shell)Wilson B-factorAverage Mosaicity (0)





LS-CAT 21-ID-FMAR 225 CCD0.9787238.5-1.55 (1.60-1.55)P2, 21 2a = 43.207, b = 84.814c = 64.618a =#= y = 9021,607435,258 (1735)6.1 (5.2)23.7 (2.1)99.8 (99.5)0.072 (0.675)0.028 (0.291)19.20.2

31740 (3067)3474 (364)0.206 (0.303)0.243 (0.356)0.0071.038198923118213813026.425.826.634.1

97.5, 2.5, 0

321

Table A.S8


for the TTR 8 Complex.

Complex 8PDB Code 5u4e


Unit cell, a, b, c (A)Unit cell, a, 8, ' (0)No. of ReflectionsNo. of Unique ReflectionsRedundancy (last shell)Mean I/a (last shell)Completeness (last shell)R-meas (last shell)R-pim (last shell)Wilson B-factorAverage Mosaicity (0)





LS-CAT 21-ID-GMAR 300 CCD0.9785726.0-1.45 (1.50-1.45)P21 21 21a = 43.029, b = 85.861c = 63.831a=/=y=90309,84942,788 (2094)7.2 (7.1)30.2 (2.5)100.0 (99.9)0.055 (0.659)0.20 (0.243)15.40.4

38,504 (3747)4229 (405)0.18 (0.259)0.198 (0.290)0.0071.163215423119018416919.819.220.128.6

98.4, 1.2, 0.4

322

Tabhh A.Q9

Table A.S9. Crystallographic Data Collection and Refinement Statisticsfor the TTR -10 Complex.

ComplexPDB Code

105u4f


Unit cell, a, b, c (A)Unit cell, a, ,3, y (0)No. of ReflectionsNo. of Unique ReflectionsRedundancy (last shell)Mean I/- (last shell)Completeness (last shell)R-meas (last shell)R-pim (last shell)Wilson B-factorAverage Mosaicity (0)





LS-CAT 21 -ID-DDetris Eiger 9M1.23980138.5-1.50 (1.55-1.50)P21 21 21a = 42.920, b = 85.223c = 63.942a =8= y = 90228,24338,097 (1877)6.0 (6.1)26.3 (3.9)99.5 (99.6)0.089 (0.466)0.035 (0.184)22.220.6

36,057 (3506)2000 (195)0.193 (0.247)0.220 (0.300)0.0071.12197723118124212326.926.429.335.0

98.7, 1.3, 0

323

Table A.S1O


for the TTR-11 Complex.ComplexPDB Code

115u4g


Unit cell, a, b, c (A)Unit cell, a, ,, y (0)No. of ReflectionsNo. of Unique ReflectionsRedundancy (last shell)Mean I/- (last shell)Completeness (last shell)R-meas (last shell)R-pim (last shell)Wilson B-factorAverage Mosaicity (0)

RefinementWorking Set (last shell)Test Set (last shell)Rwork (last shell)Rfree (last shellRMSD of Bond Lengths (A)RMSD of Bond Angles (0)




LS-CAT 2 1-ID-FMAR 225 CCD0.9787238.5-1.8 (1.86-1.80)P21 21 21a = 43.019, b = 85.118c = 64.023a =,8 = y = 90163,19022,450 (1087)7.3 (7.3)29.4 (3.1)99.7 (99.4)0.057 (0.513)0.021 (0.187)22.80.6

20,232 (1898)2189 (199)0.188 (0.246)0.223 (0.281)0.0091.207194723117994010827.627.330.532.6

97.9, 1.7, 0.4

324

Table A.1I

Table A.S11. Non-covalent Interactions and Distances in TTR Ligand Complexes.Ligand Interaction type Ligand atom Protein atom Distance (A)2 Dative BO1/A NZ, A/15 Lys/A 2.82 Dative BO1/B NZ, A/15 Lys/B 3.22 Dative Average 3.02 Hydrogen bond 004/A NZ, A/15 Lys'/A 2.62 Hydrogen bond 003/B NZ, A/15 Lys'/B 2.22 Hydrogen bond Average - 2.4

Hydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bond

DativeDativeDative

Hydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHdr b d A

002/A002/A001/A001/A002/A002/A001/A001/A

Average002/ABO1/ABO1/B

Average003/A003/B

Average001/A021/A002/B001/B

Average001,A/A001,A/A001,B/A001,B/A001,A/B001,A/B00IB/B001,B/B

OGA/117 Ser/AOG,B/117 Ser/AOGA/117 Ser'/AOG,B/117 Ser'/AOGA/117 Ser/BOGB/I 17 Ser/BOG,A/117 Ser'/BOG,B/117 Ser'/B

OGA/1 17 Ser/ANZ, A/15 Lys/ANZ, A/15 Lys/B

222222222344444444444555555555

OGA/1 17OGB/1 17OGA/1 17OGB/I 17OG,A/1 17OGA/1 17OGA/1 17OG,A/1 17

2.73.02.72.73.02.72.92.62.82.43.44.43.93.43.03.22.23.02.12.82.52.72.92.42.53.02.63.32.8

Ser'/BSer'/BSer'/BSer'/BSer'/ASer'/ASer'/ASer'/A

I y urgen Unu Ave rmage - .85 Hydrogen bond 003/B NZ, A/15 Lys/A 2.55 Hydrogen bond 002/B NZ, A/15 Lys'/A 3.65 Hydrogen bond Average - 3.16 Dative BOI/A NZ, A/15 Lys/A 2.96 Dative BO1/B NZ, A/15 Lys'/B 3.06 Hydrogen bond Average - 3.0

Hydrogen bond 002/A NZ, A/15 Lys'/A 3.0

325

NZ, A/15 Lys'/ANZ, A/15 Lys'/B

OGA/117 Ser/AOGA/1 17 Ser'/AOG,A/1 17 Ser/BOGA/117 Ser'/B

6

11 Hydrogen bond Average

NZ, A/15 Lys/BHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bond

DativeDativeCative

Hydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bondHydrogen bond

DativeHydrogen bondHydrogen bondHydrogen bondHydrogen bond

DativeHydrogen bondHydrogen bond

001/BAverage003/A003/A004/A004/A003/B003/B004/B004/B

Average003/A003/B

Average001/A002/A

AverageBO1,A/ABOIA/BAverage002,A/A

001/BAverage003/A003/A004/A004/A003/B003/B004/B004/B

AverageB02/A003/A002/A003/B

AverageBOi/A002/A0/04/B

OG, A/i 17 Ser'/AOG, A/i 17 Ser/B

NZ, A/1- Lys/ANZ, A/15 Lys'/A

NZ, A/15 Lys/ANZ, A/- Lys'/B

NZ, A/15 Lys'/ANZ, A/15 Lys/B

OGA/117 Ser'/AOG,B/117 Ser'/AOGA/117 Ser'/AOG,B/117 Ser'/AOGA/117 Ser/BOG,B/117 Ser/BOG,A/117 Ser/BOG,B/117 Ser/B

NZ-, A/i Lys/ANZ, A/15 Lys'/ANZ, A/15 Lys'/ANZ, A/15 Lys'/B

NZ, A/15 Lys/A 3.2OG,B/117 Ser'/A 2.8OGB/1i7 Ser'/B 3.0

2.9

326

111111

OGA/1 17OG,B/1 17OG,A/1 17OG,B/1 17OG,A/1 17OG,B/1 17OGA/1 17OG,B/1 17

Ser'/ASer'/ASer'/ASer'/ASer/BSer/BSer/BSer/B

4.03.53.12.92.72.33.03.12.42.52.82.22.32.32.72.32.52.92.92.93.03.03.02.92.62.73.02.83.02.52.72.83.53.62.72.62.7

Tahle A.'1)

Table A.S12. Observed o-hole Bond Lengths and Bond Angles in TTR-Ligand Complexes

Ligand Ligand atom Protein residue rci... o (A) Oc-ci o ( ) *.o=c ( )

10 C102/A 108 Ala'/A 3.6 175.4 83.410 C102/B 108 Ala'/B 3.6 174.9 87.811 CIO1/A 108 Ala'/A 3.7 173.4 85.111 C1OI/B 108 Ala'/B 3.7 175.3 84.3Average 3.7 174.8 85.2

327

Figure A.1

rN

C

328

Figure A.1. Three-dimensional Structure of the TTR-Resveratrol Complex. TTR monomers (tan,

red, green, and purple ribbons) have a P-sandwich fold and assemble into a tetramer, which binds

to two molecules of resveratrol (ball-and-stick). The rings of resveratrol (1) occupy inner and outer

pockets of the two T4-binding sites at the dimer-dimer interfaces. The image was created with the

program PyMOL and PDB entry 1 dvs. 4 00

329

Figure A.2

A TTR-2

C TTR-6

E TTR-10

B TTR-4

LU I -oo

F TTR-11

330

Figure A.2. Three-dimensional Structures of TTR Ligand Complexes that Contain a Boronic Acid

Group. One monomer (chain B) of the TTR tetramer is shown, and is in the same orientation in

each panel. The main chain of TTR is rendered as a ribbon, and the side chains of Lys 15 and

Serl 17 are shown explicitly. Ligands are depicted in a ball-and-stick rendition with CPK coloring

and boron atoms labeled explicitly. Alternative conformations of Ser 17 or the ligand are shown

in some panels. Arrows indicate the O117 -B bond in the TTR- 10 and TTR-11 complexes. Images

were created with the program PyMOL. (A) TTR 2 (PDB entry 5u48). (B) TTR-4 (5u4a). (C)

TTR-6 (5u4c). (D) TTR-8 (5u4e). (E) TTR-10 (5u4f). (F) TTR-11 (5u4g).

331

Figure A.3

A TTR-3

B TTR-5

C TTR-7

332

Figure A.3. Three-dimensional Structures of TTR-Ligand Complexes that do not Contain a

Boronic Acid Group. One monomer of the TTR tetramer is shown, and is in the same orientation

in each panel. The main chain of TTR is rendered as a ribbon, and the side chains of Lys15 and

Ser 17 are shown explicitly. Ligands are depicted in a ball-and-stick rendition with CPK coloring.

Alternative conformations of Ser 17 or the ligand are shown in panels B and C. Images were

created with the program PyMOL. (A) TTR-3 (PDB entry 5u49, chain A). (B) TTR 5 (5u4b,

chain B). (C) TTR -7 (5u4d, chain B).

333

Figure A.4

B

TTR-11

334

A

TTR-10

Figure A.4. Halogen-bonding Interactions in the TTR 10 and TTR 11 Complexes. One monomer

(chain B) of the TTR tetramer is shown. Chloro groups in the two ester-forming boronates exhibit

C-Cl -0108' angles that are nearly linear and Cl - 0108' distances (dashed yellow lines) that are

3.6-3.8 A (Table S 12). Images were created with the program PyMOL. (A) TTR -10 (PDB entry

5u4f). (B) TTR 11 (5u4g).

335

Figure A.S1

100- Diphenols 1-4

80

cj 60-

40-D2

20 ~--- 3-64

0--9 -8 -7 -6 -5

log[ligand]

100- Carboxylic Acids 5-8

80-

D60-

- 540-

-+- 6

c 20---8

0 --9 -8 -7 -6 -5

log[ligand]

100- Compounds 9-12

80-

60-

40 --10

S0)

20 -#- 12

0--9 -8 -7 -6 -5

log[ligand]

336

Figure A.S1. Graphs Showing the Results of ANS Competition Assays. (A) Compounds 1-4.

(B) Compounds 5-8. (C) Compounds 9-12. Data were fitted to eq 1 to derive values of K,2 (Table

1); R2 > 0.99 for each dataset.

337

Figure A.S2

100- Diphenols 1-4 M 1

80m4

60-

40 d

20-

0-

wild-type TTR V30M HTR

160 Carboxylic Acids 5-8 m140

S120CUmE 1000 S80-

60-

40-

20

0-1:1wild-

100-

80-

60-

40-

20d

0-

wild-ty~

1:2 1:1 1:2type HTR V30M TTR

Compounds 9-12

m

1p:2 1 31 1:2)e TTR V30M TR

9I10

1112

338

C0

1-

0U-

U-

C:0

0L

-o

Figure A.S2. Graphs Showing the Results of 96-h Fibril-formation Assays at Two Different

TTR:Compound Ratios. (A) Compounds 1-4. (B) Compounds 5-8. (C) Compounds 9-12. Error

bars represent the standard deviation of 16 measurements propagated through eq 2.

339

Figure A.S3

13952.400

13942.163

a ~ m12500 13000 13500 14600 14500 15000 15500 16600 16500

28237.175

42359.733

56344.407

42236.33256152.383

340

A

5-

C',CcDC

2000-

1500-

1000-

500-

B

300

200.

100

0-

20000 25000 30000 35000 40000 45000 50000 55000 60000

m/z

. . - -.. I i

"I , 1.

I IIIIII'm I

Figure A.S3. MALDI-TOF Mass Spectra to Probe the Reversibility of TTR Inhibition. TTR (3.6

tM) was treated with compound 10 (360 ptM) (red) or with buffer alone (black), both containing

DMF (5% v/v). No significant shift in mass was observed for monomeric TTR (A) or its higher

molecular mass oligomers (B), indicating that the TTR -10 complex dissociates.

341

Figure A.S4

A

Er,

k

B

Er

alAW

kI

342

S

Figure A.S4. Electron Density in the TTR-2 Complex. (A) Final structure. Blue: 2Fo - Fc

contoured at 1.0; red and green: Fo - F, contoured at -3.0O and 3.O, respectively. (B) Final

structure after ligand removal and refinement by simulated annealing. Green: F - F. contoured at

3.0a.

343

Figure A.S5

A B

344

Figure A.S5. Electron Density in the TTR-3 Complex. (A) Final structure. Blue: 2F - Fe

contoured at 1.0a; red and green: Fo - F, contoured at -3.Ou and 3.0a, respectively. (B) Final

structure after ligand removal and refinement by simulated annealing. Green: Fo - Fc contoured at

3.0-.

345

Figure A.S6

A B

'(

I

I

346

Ni

N

[a AM]

Figure A.S6. Electron Density in the TTR-4 Complex. (A) Final structure. Blue: 2F - Fc

contoured at 1.0a; red and green: Fo - Fc contoured at -3.0a and 3.0u, respectively. (B) Final

structure after ligand removal and refinement by simulated annealing. Green: Fo - Fe contoured at

3.0u.

347

Figure A.S7

A

V

/

N4

I

B

IIA

I

I

348

Am

6

Figure A.S7. Electron Density in the TTR-5 Complex. (A) Final structure. Blue: 2Fo - F,

contoured at 1.0a; red and green: F, - Fc contoured at -3.0a and 3.0u, respectively. (B) Final


3.0u.

349

Figure A.S8

A

w

N

'F

350

7v tB

/

Y

/-7 //

44

I


contoured at 1.0a; red and green: Fo - Fc contoured at -3.0a and 3.0a, respectively. (B) Final


3.0a.

351

Figure A.S9

A

04

/

~1

6

B

352

J a,

'All4

.1;4:V

ii

480


contoured at 1.0u; red and green: Fo - Fc contoured at -3.0O and 3.0u, respectively. (B) Final


3.0u.

353

Figure A.S1O

A

4~

B

Ai

4~

Ai

Ioa

AA

354

Q jb

0

Figure A.S1O. Electron Density in the TTR-8 Complex. (A) Final structure. Blue: 2Fo - Fc

contoured at 1.0u; red and green: Fo - F, contoured at -3.0a and 3.0u, respectively. (B) Final


3.0u.

355

Figure A.S11

A B

356

Figure A.S11. Electron Density in the TTR 10 Complex. (A) Final structure. Blue: 2F - Fc

contoured at 1.0a; red and green: F - F, contoured at -3.0c and 3.0u, respectively. (B) Final


3.0u.

357

Figure A.S12

A B

4

[

4

y

358


contoured at 1.0a; red and green: Fo - Fc contoured at -3.0a and 3.0u, respectively. (B) Final

structure after ligand removal and refinement by simulated annealing. Green: Fo - Fe contoured at

3.0u.

359

Chart A.l. Diphenol Ligands

HO OH

NI

OH

I (resveratrol)

HO OH

NI

HOB'H

2

HO OH

IN

NI

OH

3

HO OH

I

BH0'B'OH

4

360

Chart A.2. Carboxylic Acid Ligands

0

OHI-

OH

5

0

OH

I

HO' BOH

6

0

OHI-

C

OH

0OH

'N OH

HO' ~OH

7 8

361

Chart A.3. Diboronic Acid and Related Ligands

OH

CI

CI

OH

OH

9

HO, B'OH

CI

CI

HO

H O'BOH

10

OHBB OH

CI

HO 'BOH

11

0OH

N O

CI CI

12 (tafamidis)

362

Scheme A.1. Routes for the Synthesis of Stilbenes 2-6

I I1. triethylphosphlite (neat) 0 N HO OH

150C B8r3 N2. NaH, DMF - CM

0 *C to rt O*C to rt

89% (2 steps) 70%

Br Br2a

1. trtethylphosphlta (neat) 0 0 H150 *C BBr 32. NaH, DMF DCM0 *C to rt 0 C tort

91% (2 steps) 72%CI

Br3a

2b

0 OH

C

Br3b

bis(pinacolato)diboron HO OHKOAc, Pd(dppf)C1

2dioxane80 IC

85%

2c

bis(pinacolato)dlboron HO OHKOAc, Pd(dppf)C1

2dioxane80 *C

89%1 C

c

HO OHNaIb 4I M HCIH2O/THF

67%

Ho' BOH2

0-N+ HO OH

DCM

70%|CI

OH3

HO OHNaO

4 N1MHCIH2OrTHF

88%

HO' B'OH

1. triethylphosphite (neat)0 150*C

2. NaH, DMF0 0 C to rt

Br 83% (2 steps)

Br Br

0N ~

Br5a

0

OH2 M NaOHTHF/EtOH

96%

Br

5b

bls(pinacolato)diboronKOAc, Pd(dppf)C6dioxane80 *C

73%

0

OH

Sc

0-yN+

DCM

88%

0

OH

OH5

0

NaNO4 OH

1 M HCIH2OTTHF

90%

HO'B,OH

6

363

B

lr Br

Br

Cl

Br

Scheme A.2. Routes for the Synthesis of Stilbenes 7-11

r C

Br1. KMnO4tBuOH/H20 96%2. HCtIMeOH (2 steps)

reflux

0 LIBH4 HO PDC 0THF/Ma0H CM

C1 0 .C to rt r95% 88%

Br Br Br7a 7b 70

1. 0

-~ 0'

Br

triehylphosphite (neat) 0150 1C

2. NsH, DMF 2 M NaOH0 *C to rt THF/EtOH

83% (2 steps) 95%

Br7d

1. Br 1. Br

Cl 89% BrBr 72% (2 steps)

(2 steps) triethylphosphite (neat)iethylphasphite (neat) 150 C150 1C 2. NaH, DMF

2. NaH, DMF 0 *C to rtOCtort

Br

Cl %

C1

Br9a

bls(pinacotato)dlboronKOAc, Pd(dppf)C

2 92%dioxaneBa .C .1

0'B

Cl

- CI

B'

S1Bi1

Br bIs(pInacotuto)dlboronKOAc, Pd(dpPf)C 2

- diaxane80 1C

87%- Ct

Br11a

N.

Cl

0,B,0

11b

OH

C

Br

7.

NatO41 M HCI

H 2O/THF83%

bis(pinacolato)diboronKOAc, Pd(dppf)C 2dioxane80 *C

82%

0- 0

DCM72%

0 aOH

OHN. 7

C1 0

N. NatO4 N. OH

0, , H20THF N

75% a

7f

HO'B OH

OHB'OH

Cl

HO' ,OH

11

0- OHN.

C

DCM80% C1

H

HO, BOH

NatO41 MHCI CIH2OiTHF N

74% C1

HO' 'OH10

364

A.5 HPLC Traces

Compound 2 2.12.01.91.81.71.6.1.51.41.31.01.11.00.90.80.70.6

N 0.50.40.30.20.1

L - 0.0

5 10 15 20 25 30 35 40

time (min)

Compounc 3 2.1.2.01.9.1.81.7-1.8.1.51.41.31.21.11.00.90.80.70.50.50.40.30.20.1

.asi Za 0.0

5 10 15 20 25 30 35 40

time (min)

Compound 5

5 10 15 20 25 30 35 40

time (min)

365

2.12.01.91.81.71.61.5

. 1.4

1.31.2

-1.0Q 0.9

0.80.70.60.50.40.30.20.10.0

A.6 NMR Spectra

'H NMR (CDC13) and 13 C NMR (CDC13) of Compound 2a

II 1 1I

0

1i 11 I v I8 .5 8.0 7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 3.0 2.5s 2.0 1.5 O ~ 0. 0.0 -0.5 -1.0

(n 40 -0 arc I" Li(p~

0%t . -40 5

165 160 155 150 145 140 135 130 125 120 115 110 105 100 915 9'0 85 00 75 70 65 600 5'5fl (PPMt)

366

'H NMR (CD30D) and 13 C NMR (CD30D) of Compound 2bI0 "It US R

U 3

LM14 1-

9.5 9.0 8.5 8.0 7.S 7.0 6.5 6.0 5.5fl (ppm)

44 a "N N LAN -L4.. fi 7! ! 'o 0 -

a* r4 0v M g

5.0 4.5 aC D I0004

-A M~ -

11 I

210 200 190 180 170 160150 140 130 120 110 10 90 80 70 60 50 40 30 20 10 0 -10f6 (ppm)

367

7

SD O 3.0S000

aq ID V100 00

I

2.5 2.0 1.5

11

'H NMR (CD 30D) and 13C NMR (CD30D) of Compound 2c

rlNO 8O NP% N N40 41-4 u~ v 0 1-1.9- N

I I r I

III ~ itYY A l 407

9.0 8.5 g8.0 75 ! G 6.0

170 160 150 140 130 120 110 100 90 80fl (pp.)7

3.5 3.0 2.5 2.0 1.5 1.0 0.5

70 60 50

0.0 -0.5

40 30 20 10 0

368

T

n p l s i g gp msg n r-1 , ,ni sy , , , my ,r4

fl (ppmV

'H NMR (CD30D) and 13 C NMR (CD30D) of Compound 2

C4 0*M - | o a*6*0

I I

it I~ I

TNv -4 -4 4 0

InInInW!l

, I I"

4 4 mm N4 N4-

11-1 1 1-1\

5.5 4.5 , 0fl (Im)

3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0.

in

1......LLI i__________________________ _______________

170 160 150 140 130 120 110 100 90 80Ri (PPM*)

70 60 50 40 30

369

8.5 8.00; 7.5

20 10 0

-4 04

II

'H NMR (CDC13) and 13 C NMR (CDCL3) of Compound 3aU, " 0acMtow o- N M vr!rn! nt!nI": ft! In asC!'R V l

n t w 8 SW 0c-4e -4-4 O

y'a6

0000 0000000o 000

.YS -9 ) -2 2 T . - !

9.0 8.5 8.0 7.5 7.0 1 6.5 560 0 I M W 5.0 fl4M 4.0 3.5 3."- rtvZID

AM em C) 0\ \ O0

J.,wN W INSl mtwPM*ruPI t. lTIt.

(n 1.5 1.0 0.5 0.0

210 200 190 180 170 160 150 140 130 120 110 100f, (ppst)

90 80 70 60 50 40 30 20 10 0 -10

370

A&I

'H NMR (CD 30D) and 13C NMR (CD30D) of Compound 3b

47k 3 N, . A - NP M

/fI//

KIsLs's- r vfm rqc - 0q~~~ c.RR

7. P a 5.5004 ++4+ -40 r 4

M.L4 \4 r4 r4 V

.0 a 4.5 4.0 9.5

II

3.0 2.5 2.0 1.5 1.0 0.5

-l fl -b s a- ,w , i . ..a -n .. s - - - .mn

570 160 150 140 130 120 110 100 90 80fi (ppm)

70 60 50 40 30 20 50

371

8.5 8.0 0.0

0

AFAW" W*Wdolv#~ Aft

LI! I

7.5

I

'H NMR (CD 30D) and 1 3 C NMR (CD 30D) of Compound 3cM5 N0

CoI

I

T,

5.0 4.S 4 0 3.5f1 (pp.4)

NO/

I I

8.5 8.0 7.S 7.0 6.5 6.0 5.5

v0 .-4 \ -4 / 0 M "

Lq \l /ic

0 0(I!

2.5 .

000

190 110 170 160 150 140 130 120 110 100 90 80 70 60 50 40 30 20 10 0fl (ppm)

372

V 05. " " O $A M M &AV4: t q Ui wi N %!N 4040404wto 0 t

/ /

4,4ll , lI l

At 10 1 . . . I I I I I . I .

3.0 1.5 1.0 0 .5 0.0

'H NMR (CD 3OD) and 13 C NMR (CD3OD) of Compound 3- " 11 M"9N 91 a, 0SS0

S WWCON..-l.iN %D w w ws

y %ell ' -~

/

- ;XF-U

I

LryO LAi WA N"

CSP~ a00Nr IL

7n r4ss ry P

5.0 4.5fl (pp.)4.0 3.S 3.0

000'

oh o ow77

2.5 2.0 1.5 1.0 0.5 0.0

0 0 a000

00 00v0V, -

T= = -- .- -- , -- 1 "

i8o 170 160 150 140 130 120 110 000 90 80fl (pp.)

70 60 50 40 30

373

9.5 9.0 8.S 0.0 7.5 7.0 6.5 6.0 5.5

'34at a

LA U

01 0

i

I

20 10 0

U.-

'H NMR (CD 30D) and "C NMR (CD 30D) of Compound 4

0e - C @.

8.0 7.5 7.0 6.5 6.0 55 5.0 4.51 (p )40 3.

00

*MMM"V6N ccr4 -1 /1 4r 4- 4

a0

210 200 190 180 170 160150 140 130 120 110 100 90 00 70 60 50 40 30 20 10 0 -10f1 (PPM)

374

;*. d- M 0V

111/ P.i i .P r ,

Ch

LA

-7 71 :-! i -:= L 1 ,

I

I . I L

3.0 2.5 2.0 1.5 1.0 O.5

'H NMR (CD 30D) and 1 3 C NMR (CD 30D) of Compound 5a

as Limos A A s0 "

If i fHllI 1

I" itj7. 70 6.N5A

@4 NLA 404M5IM4000 0 Lfl0 V V4 CN N. N N4 NCN

I I 'iii

5.5 5.0 4.5 0f1 (pow)

00 eN,

I0 N

L

U0 3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0

4- q

7777 ... - ...assa nna aI s m M aa A-I

70 160 150 140 130 120 110 100 90 80fl (ppm)

70 60 50 40 30 20 10 0

375

7

..0 8.5 8.0 6.0

.1

'H NMR (DMSO) and 3C NMR (DMSO) of Compound 5b

It'13.0 12.5 12.0 11.5 11.0 10.5 10.0 9.5 9.0 8.5 8.0 7.S 7.0 6S 6.0 5.5 S.0 4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0

81 (Pp )

IR Pl iq 7 R0

..P . . II. wi M .- 00 00

376

7

~~~2~~

6

180 170 100 150 040 130 120 010 100 90 80 70 60 50 40 30 20 10 0fl (ppm)

'H NMR (CDC13) and ' 3 C NMR (CDCL 3) of Compound 5c

/ 1 / // I

I I I .!R *

.0 8.5 8.0 7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0Qi (pp.)

3.0 2.5 2.0 1.5 1.0 0 .5 0.0

0 1(U)4

d;3 rmL Am ad a go N&N

III

70 160 150 140 130 120 IN 110 ;0 o) 70 60 50 40 30 20 00 0

377

C

11 L"T - - -- 7'=7=Z--7-1'1 L.

8

c;I

I3.5

P.

'H NMR (CD 30D) and 3 C NMR (CD30D) of Compound 5

l i i v i I le_ _ __ _ _ Ir I-

p 8d4,b

6.5 6.0 5.5 5.0 4.5 40 3.5 3.0 2.5 2.0 15 1.0 0.5 0.0f5 lPp5)

a I05

0; 0f%4 0 0 $A U

0

0

10 170 100 150 140 130 120 110 100 90 ' 0 70 60 50 40 30 20 10

fl(0058)

378

'H NMR (CD 30D) and "C NMR (CD30D) of Compound 6

$I 7

8 8 I_

8.5 8.0 7.5 7.0 6.5 6.0 5.5 5.0 4.5 .0

K3 N .4

mm M MN "'I

3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0

~wawm-aa-

170 160 150 140 130 120 110 100 90 80 70 60 00 40 30 00 10 0f1 (ppM6)

379

NSN*m.elrlrlSNNNNNNNNNN N N N N N

Jill'

--.- J-" P, -Ml 0 0 " ON A j I I -04000APOP

'H NMR (CD 30D) and 13 C NMR (CD 30D) of Compound 7a

3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.1

00000000000000w w vN 0 4

w413 to rDOMaca co 40 0

r" en ~ Mv

Li~/S--

70 160 150 140 130 120 110 100 90 80 70 60fl (ppm)

380

t, - v M M

I'

4.5 1pp4.8.0 7.5 7.0

50 40 30 20 00 C

6.5 6.0 5.5 5 .0

20 10C

'HNMR (CDC13) and 13 CNMR (CDC1 3) of Compound 7b

MZ a. N N

II I

c ct~

7.0 6.5 6. 5.5 5.0 4.5 40

U

8:0 7'5

P'. VMS fm M4 wIN- r

I I

3.S 3.0 2.5 2.0 1.5 1.0 0.5 0.0

L)

140 130 120 110 100 90 0 70fl (PPMo)

60 S0 40 30 20 10

381

S1.4 F%

i

LM

0

'H NMR (CDC1 3) and '3C NMR (CDC13) of Compound 7c

%D LAr4N

8.0 7.5 7.0 6.S 6.0 S.S 5.0 4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0.(I (Pppw

LA

382

-I.0

190 180 170 160 150 140 130 120 110 0 90 00 70 60 50 40 30 20 ;0 0fl (p.)

. 10.5 10.O 9,5 9,0 8.5

Z

ch LA LA foo

'H NMR (CDC13) and 13C NMR (CDCL3) of Compound 7d

I I I L I

I I1k

1~

epil.-JI-,M 4d 3i-.' ;

9.0 8. 8.0 7.5 7.06.5 6.0 5.5 5.0 4.5 4.0f1 (pV)

3.5 3.0 2.5 2.0 0.5 1.0 0.5 0.0

0IA fa 61%N a M @OANf M0 NOsMNM A -4 4 Mm 0-440

170 160 150 140 130 120 00 1;0 0 0f1 (ppoo)

70 60 50 40 00 20 10 0

383

S

08uuNUUU0

I

i

'H NMR (DMSO) and "C NMR (DMSO) of Compound 7e

fl ~ ~ ~ ~ & C VOOOfd~tfO * -N

7C

illi _______

13.5 13.0 12.0 12.0 10.0 0.0 10.5 10.0 9.0 9.0 8.0 8.0 7.5 7.0 0.0 6.0 5.5 5.0 4.5 4.0 3.5f1 (pp;;)

3.0 2.5 2.0 1.5 1.0 0.5 0.0

N RCm.-8-4*CCLAN MM-dN

-wdl--OMAN-

110 100 90 80 70 60 50 40 30 20 10 0fl (pp.)

C

g

I1111i170 160 I0 140 130 120

384

1 - I .- [ 117 77= _7 -_

I

I

- L_ ... 91.J

'H NMR (CD 30D) and 1 3 C NMR (CD 30D) of Compound 7f

q;; 4 4 ;

T

/ Ill//I /

________________________

00000

9.0 8.5 8.0 7.5 7.0 6.5 6.0 5.5 5.0 45f1 (Pp )

gg w104N scorn0

.0- !I - t

A-'- i6-@5

210 200 190 180 170 160 150 140 130 120 110 100 90 80 70 60 50 40 30 20 10 0 -10fl (ppm)

385

MD

14

4-0 3.S 3.0 i.s 2.0 1.5 1.0 05 0.0

'H NMR (CD 30D) and 13C NMR (CD30D) of Compound 7

rr0 N P N N P% P.18

9.0 8.5 8.0 7.5 7.0 6.S 6.0 5.5 5.0 4.54 40fl (pp.,)

2 15 . 0

0o0

Ol W l

I I, -,- I [ ll 1 1,

.0

'0

200 1;0 180 170 160 150 140 130 120 110 10 80fl(0743)8070605040302010

0

386

3.S 3 0 2.5 2.0 1.5 1.0 D.5 0.0

j, 7 O 6 'O 5 4 3'0 20 10 0

'H NMR (CD 30D) and 13 C NMR (CD30D) of Compound 8

Oi

0.~

0 15 1. s 00

0

03

---- U--

100 170 160 150' - ' - - - ' I ' ' ' " '

140 130 120 110 100 90 80 70 60 S0 40 30 20 0 0f, (pp.s)

387

1I11

"I i - -i

S. 1 4 ( 40 . 30 .

.4

-. 0 Mviomwevvin

I

9:0 8's 8,0 7'5 7'0 6'5 6'0 5's

N M &A N 14 M N M M 1-0Nli IR R . O V:

N &A V M N fM 0 0 0 0 0

4 4 -4 14 14 4 -4 -4 4 4 -6

'H NMR (CDC13) and 13 C NMR (CDC13) of Compound 9a

tM vvNNNN,Ln ' t In r

1-4 f-,

od~r F% r4q

.0 9.5 8.0 7:S 7:0 6'S 6.0 5.s 5 0 4.5 4,0

22

U

MWOM -I -- .j -I P. -ift

IDID

150 140 130 120 10 100 90 80 70fl (ppM) 60 s0 40 30 20 10 0

388

6- - - -- I " - -- 6, !-2- -

3.s 3.0 2.S 2 0 1.5 1.0 O.S 0.0 -O.5

I"

F" f" M r" r4 C4

1H NMR (CDC1 3) and 13C NMR (CDCb3) of Compound 9b

Mm @ \

*.J~l~ 1,

5.5 0.0 7.5 7.0

l l-' 1

6.5 6.0 5.5 s.0 4.s 4*0)

3.5 3.0 2.5 2.0 Is

MMMUUE

150 140 130 120 110 100 90 80 70

389

80

6A

1:0 O's 00

6 ;0 s ;0 40 30 20 10 0

'H NMR (CD30D) and 13C NMR (CD30D) of Compound 9

Fn W F

-z-

8.0 7.5 7.0 6.5 6.0 55 5.0 4.5 4 3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0

coo000

M toN 0 f F i

M r41 r,.,4

180 170 160 150 140 130 120 110 100 90 8 70 6'0 50 40 3'0 20 1 0

390

'H NMR (CD30D) and 13C NMR (CD30D) of Compound 10

SO

9.5 9.0 8.5 8.0 7.5 7.0 6.5 6.0 5.5 50 4.5 4.0 3.5 3.0

00

WN1 M L

2.5 2.0 1.5 1.0 0.5 0.0 -

cl

180 170 160 150 140 030 120 110 100 00 80 70 00 00 40 30 20 10 0

180 170 160 150 140 1;0 120 110 100 90 80 70 60 50 4'0 3'0 2'0 1031 (PPM)

391

'H NMR (CD 30D) and 13C NMR (CD30D) of Compound 11a

I I lI i

6.0 55 50 45 40 3.5 3'0 2'5 2.0 1.5 1.0 0.5 0.0

®R MMO W 0 *S 1 --- ---

160 ISO 140 130 120 110 100 90 s0 70 60 50R0 (pp.s)

392

A 4

v y N

c:i M r4

id-0 A R-m-- - ---- 0 41PV444"MM IN ww

| ' | ' | ' | '

L- -.0 85 8.0 7.5 7.0 6.5

40 30 20 10 0

'H NMR (CD 30D) and 13C NMR (CD30D) of Compound lib

7

liii, 1/

8.0 7. 5 7.0 f.5 6.0 5.5

'll " 'A , I"

4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0

inoIYA

0 0i0

toI

II II

130 120 110 00 90 00 A)f0 (ppos) 00 50 40 30 20 10 0

393

I

140

W! Wi

I

5.0

-JJ A

'H NMR (CD30D) and 13C NMR (CD 30D) of Compound 11

Ch Mr AM"$6 N Pl. *

I P L 1 I""CYY

cJ a R4.1

4.0

4.51(pp)4.08. .. 0 7.5 7.0 6.5 6.0

10 t L !L ol:

11,1

3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0

a

.11II

0

394

170 160 150 140 130 120 110 100 90 80 70 10 50 40 30 20 10 0f1 (p.0

" C4 q

S.5 5.0

1. UNAIDS, Fact Sheet - World AIDS Day 2018. 2018.2. Centers for Disease, C., Pneumocystis pneumonia--Los Angeles. MMWR Morb. Mortal.Wkly. Rep. 1981, 30 (21), 250-2.3. Jaffe, H. W., The early days of the HIV-AIDS epidemic in the USA. Nat. Immunol. 2008,9 (11), 1201-3.4. Barre-Sinoussi, F.; Chermann, J. C.; Rey, F.; Nugeyre, M. T.; Chamaret, S.; Gruest, J.;Dauguet, C.; Axler-Blin, C.; Vezinet-Brun, F.; Rouzioux, C.; Rozenbaum, W.; Montagnier, L.,Isolation of a T-lymphotropic retrovirus from a patient at risk for acquired immune deficiencysyndrome (AIDS). Science 1983, 220 (4599), 868-71.5. Levy, J. A.; Hoffman, A. D.; Kramer, S. M.; Landis, J. A.; Shimabukuro, J. M.; Oshiro, L.S., Isolation of lymphocytopathic retroviruses from San Francisco patients with AIDS. Science1984, 225 (4664), 840-2.6. Popovic, M.; Sarngadharan, M. G.; Read, E.; Gallo, R. C., Detection, isolation, andcontinuous production of cytopathic retroviruses (HTLV-III) from patients with AIDS and pre-AIDS. Science 1984, 224 (4648), 497-500.7. Faria, N. R.; Rambaut, A.; Suchard, M. A.; Baele, G.; Bedford, T.; Ward, M. J.; Tatem, A.J.; Sousa, J. D.; Arinaminpathy, N.; Pepin, J.; Posada, D.; Peeters, M.; Pybus, 0. G.; Lemey, P.,HIV epidemiology. The early spread and epidemic ignition of HIV-1 in human populations.Science 2014, 346 (6205), 56-61.8. Derache, A.; Wallis, C. L.; Vardhanabhuti, S.; Bartlett, J.; Kumarasamy, N.; Katzenstein,D., Phenotype, genotype, and drug resistance in subtype C HIV-1 infection. J Infect. Dis. 2016,213 (2), 250-6.9. Mu, Y.; Kodidela, S.; Wang, Y.; Kumar, S.; Cory, T. J., The dawn of precision medicinein HIV: state of the art of pharmacotherapy. Expert Opin. Pharmacother. 2018, 19 (14), 1581-1595.10. Hall, H. I.; Holtgrave, D. R.; Maulsby, C., HIV transmission rates from persons living withHIV who are aware and unaware of their infection. AIDS 2012, 26 (7), 893-6.11. Pilcher, C. D.; Eron, J. J., Jr.; Galvin, S.; Gay, C.; Cohen, M. S., Acute HIV revisited: newopportunities for treatment and prevention. J Clin. Invest. 2004, 113 (7), 937-45.12. Dalgleish, A. G.; Beverley, P. C.; Clapham, P. R.; Crawford, D. H.; Greaves, M. F.; Weiss,R. A., The CD4 (T4) antigen is an essential component of the receptor for the AIDS retrovirus.Nature 1984, 312 (5996), 763-7.13. Berger, E. A.; Doms, R. W.; Fenyo, E. M.; Korber, B. T.; Littman, D. R.; Moore, J. P.;Sattentau, Q. J.; Schuitemaker, H.; Sodroski, J.; Weiss, R. A., A new classification for HIV-1.Nature 1998, 391 (6664), 240.14. Mao, Y.; Wang, L.; Gu, C.; Herschhorn, A.; Xiang, S. H.; Haim, H.; Yang, X.; Sodroski,J., Subunit organization of the membrane-bound HIV-1 envelope glycoprotein trimer. Nat. Struct.Mol. Biol. 2012, 19 (9), 893-9.15. Gallo, S. A.; Reeves, J. D.; Garg, H.; Foley, B.; Doms, R. W.; Blumenthal, R., Kineticstudies of HIV-1 and HIV-2 envelope glycoprotein-mediated fusion. Retrovirology 2006, 3, 90.

395

16. di Marzo Veronese, F.; Copeland, T. D.; DeVico, A. L.; Rahman, R.; Oroszlan, S.; Gallo,R. C.; Sarngadharan, M. G., Characterization of highly immunogenic p66/p51 as the reversetranscriptase of HTLV-III/LAV. Science 1986, 231 (4743), 1289-9 1.17. Bowerman, B.; Brown, P. 0.; Bishop, J. M.; Varmus, H. E., A nucleoprotein complexmediates the integration of retroviral DNA. Genes Dev. 1989, 3 (4), 469-78.18. Schroder, A. R.; Shinn, P.; Chen, H.; Berry, C.; Ecker, J. R.; Bushman, F., HIV-1integration in the human genome favors active genes and local hotspots. Cell 2002, 110 (4), 521-9.19. Louis, J. M.; Clore, G. M.; Gronenborn, A. M., Autoprocessing of HIV-1 protease is tightlycoupled to protein folding. Nat. Struct. Biol. 1999, 6 (9), 868-75.20. Mervis, R. J.; Ahmad, N.; Lillehoj, E. P.; Raum, M. G.; Salazar, F. H.; Chan, H. W.;Venkatesan, S., The gag gene products of human immunodeficiency virus type 1: alignment withinthe gag open reading frame, identification of posttranslational modifications, and evidence foralternative gag precursors. Journal of virology 1988, 62 (11), 3993-4002.21. Jacks, T.; Power, M. D.; Masiarz, F. R.; Luciw, P. A.; Barr, P. J.; Varmus, H. E.,Characterization of ribosomal frameshifting in HIV-1 gag-pol expression. Nature 1988, 331(6153), 280-3.22. Checkley, M. A.; Luttge, B. G.; Freed, E. 0., HIV-1 envelope glycoprotein biosynthesis,trafficking, and incorporation. Journal of molecular biology 2011, 410 (4), 582-608.23. Morita, E.; Sandrin, V.; McCullough, J.; Katsuyama, A.; Baci Hamilton, I.; Sundquist, W.I., ESCRT-III protein requirements for HIV-1 budding. Cell Host Microbe 2011, 9 (3), 23 5-242.24. Schimer, J.; Pavova, M.; Anders, M.; Pachl, P.; Sacha, P.; Cigler, P.; Weber, J.; Majer, P.;Rezacova, P.; Krausslich, H. G.; Muller, B.; Konvalinka, J., Triggering HIV polyproteinprocessing by light using rapid photodegradation of a tight-binding protease inhibitor. Nat.Commun. 2015, 6, 6461.25. Yang, H.; Nkeze, J.; Zhao, R. Y., Effects of HIV-1 protease on cellular functions and theirpotential applications in antiretroviral therapy. Cell Biosci 2012, 2 (1), 32.26. Quan, Y.; Rong, L.; Liang, C.; Wainberg, M. A., Reverse transcriptase inhibitors canselectively block the synthesis of differently sized viral DNA transcripts in cells acutely infectedwith human immunodeficiency virus type 1. Journal of virology 1999, 73 (8), 6700-7.27. De Clercq, E., The role of non-nucleoside reverse transcriptase inhibitors (NNRTIs) in thetherapy of HIV-1 infection. Antiviral Res. 1998, 38 (3), 153-79.28. Espeseth, A. S.; Felock, P.; Wolfe, A.; Witmer, M.; Grobler, J.; Anthony, N.; Egbertson,M.; Melamed, J. Y.; Young, S.; Hamill, T.; Cole, J. L.; Hazuda, D. J., HIV-1 integrase inhibitorsthat compete with the target DNA substrate define a unique strand transfer conformation forintegrase. Proc. Natl. Acad. Sci. USA 2000, 97 (21), 11244-9.29. Tsiang, M.; Jones, G. S.; Goldsmith, J.; Mulato, A.; Hansen, D.; Kan, E.; Tsai, L.; Bam,R. A.; Stepan, G.; Stray, K. M.; Niedziela-Majka, A.; Yant, S. R.; Yu, H.; Kukolj, G.; Cihlar, T.;Lazerwith, S. E.; White, K. L.; Jin, H., Antiviral activity of bictegravir (GS-9883), a novel potentHIV-1 integrase strand transfer inhibitor with an improved resistance profile. Antimicrob. AgentsChemother. 2016, 60 (12), 7086-7097.30. Jacobson, J. M.; Kuritzkes, D. R.; Godofsky, E.; DeJesus, E.; Larson, J. A.; Weinheimer,S. P.; Lewis, S. T., Safety, pharmacokinetics, and antiretroviral activity of multiple doses of

396

ibalizumab (formerly TNX-355), an anti-CD4 monoclonal antibody, in huiman immunodeficiencyvirus type 1-infected adults. Antimicrob. Agents Chemother. 2009, 53 (2), 450-7.31. Van Der Ryst, E., Maraviroc - a CCR5 antagonist for the treatment of HIV-1 infection.Front Immunol. 2015, 6, 277.32. Furuta, R. A.; Wild, C. T.; Weng, Y.; Weiss, C. D., Capture of an early fusion-activeconformation of HIV-1 gp4l. Nat. Struct. Biol. 1998, 5 (4), 276-9.33. Wild, C. T.; Shugars, D. C.; Greenwell, T. K.; McDanal, C. B.; Matthews, T. J., Peptidescorresponding to a predictive alpha-helical domain of human immunodeficiency virus type 1 gp4lare potent inhibitors of virus infection. Proc. Natl. Acad Sci. USA 1994, 91 (21), 9770-4.34. Saag, M. S.; Benson, C. A.; Gandhi, R. T.; Hoy, J. F.; Landovitz, R. J.; Mugavero, M. J.;Sax, P. E.; Smith, D. M.; Thompson, M. A.; Buchbinder, S. P.; Del Rio, C.; Eron, J. J., Jr.;Fatkenheuer, G.; Gunthard, H. F.; Molina, J. M.; Jacobsen, D. M.; Volberding, P. A., Antiretroviraldrugs for treatment and prevention of HIV infection in adults: 2018 recommendations of theInternational Antiviral Society-USA Panel. JAMA 2018, 320 (4), 379-396.35. Mills, E. J.; Nachega, J. B.; Bangsberg, D. R.; Singh, S.; Rachlis, B.; Wu, P.; Wilson, K.;Buchan, I.; Gill, C. J.; Cooper, C., Adherence to HAART: a systematic review of developed anddeveloping nation patient-reported barriers and facilitators. PLoS Med 2006, 3 (11), e438.36. Abram, M. E.; Ferris, A. L.; Shao, W.; Alvord, W. G.; Hughes, S. H., Nature, position, andfrequency of mutations made in a single cycle of HIV- 1 replication. Journal of virology 2010, 84(19), 9864-78.37. Chen, L.; Kwon, Y. D.; Zhou, T.; Wu, X.; O'Dell, S.; Cavacini, L.; Hessell, A. J.; Pancera,M.; Tang, M.; Xu, L.; Yang, Z. Y.; Zhang, M. Y.; Arthos, J.; Burton, D. R.; Dimitrov, D. S.; Nabel,G. J.; Posner, M. R.; Sodroski, J.; Wyatt, R.; Mascola, J. R.; Kwong, P. D., Structural basis ofimmune evasion at the site of CD4 attachment on HIV-1 gp120. Science 2009, 326 (5956), 1123-7.38. Zhou, T.; Doria-Rose, N. A.; Cheng, C.; Stewart-Jones, G. B. E.; Chuang, G. Y.;Chambers, M.; Druz, A.; Geng, H.; McKee, K.; Kwon, Y. D.; O'Dell, S.; Sastry, M.; Schmidt, S.D.; Xu, K.; Chen, L.; Chen, R. E.; Louder, M. K.; Pancera, M.; Wanninger, T. G.; Zhang, B.;Zheng, A.; Farney, S. K.; Foulds, K. E.; Georgiev, I. S.; Joyce, M. G.; Lemmin, T.; Narpala, S.;Rawi, R.; Soto, C.; Todd, J. P.; Shen, C. H.; Tsybovsky, Y.; Yang, Y.; Zhao, P.; Haynes, B. F.;Stamatatos, L.; Tiemeyer, M.; Wells, L.; Scorpio, D. G.; Shapiro, L.; McDermott, A. B.; Mascola,J. R.; Kwong, P. D., Quantification of the impact of the HIV-1-glycan shield on antibodyelicitation. Cell Rep. 2017, 19 (4), 719-732.39. Barouch, D. H.; Whitney, J. B.; Moldt, B.; Klein, F.; Oliveira, T. Y.; Liu, J.; Stephenson,K. E.; Chang, H. W.; Shekhar, K.; Gupta, S.; Nkolola, J. P.; Seaman, M. S.; Smith, K. M.;Borducchi, E. N.; Cabral, C.; Smith, J. Y.; Blackmore, S.; Sanisetty, S.; Perry, J. R.; Beck, M.;Lewis, M. G.; Rinaldi, W.; Chakraborty, A. K.; Poignard, P.; Nussenzweig, M. C.; Burton, D. R.,Therapeutic efficacy of potent neutralizing HIV-1-specific monoclonal antibodies in SHIV-infected rhesus monkeys. Nature 2013, 503 (7475), 224-8.40. Murray, A. J.; Kwon, K. J.; Farber, D. L.; Siliciano, R. F., The latent reservoir for HIV-1:how immunologic memory and clonal expansion contribute to HIV-1 persistence. J Immunol.2016, 197 (2), 407-17.

397

41. Ryscavage, P.; Kelly, S.; Li, J. Z.; Harrigan, P. R.; Taiwo, B., Significance and clinicalmanagement of persistent low-level viremia and very-low-level viremia in HIV-1-infectedpatients. Antimicrob. Agents Chemother. 2014, 58 (7), 3585-98.42. Margolis, D. M.; Garcia, J. V.; Hazuda, D. J.; Haynes, B. F., Latency reversal and viralclearance to cure HIV-1. Science 2016, 353 (6297), aaf6517.43. Shirakawa, K.; Chavez, L.; Hakre, S.; Calvanese, V.; Verdin, E., Reactivation of latentHIV by histone deacetylase inhibitors. Trends Microbiol. 2013, 21 (6), 277-85.44. Costin, J. M., Cytopathic mechanisms of HIV-1. Virol. J 2007, 4, 100.45. Darcis, G.; Van Driessche, B.; Van Lint, C., HIV latency: should we shock or lock? TrendsImmunol. 2017, 38 (3), 217-228.46. Mitchell, A. R., Bruce Merrifield and solid-phase peptide synthesis: a historicalassessment. Biopolymers 2008, 90 (3), 175-84.47. Inouye, K.; Voynick, I. M.; Delpierre, G. R.; Fruton, J. S., New synthetic substrates forpepsin. Biochemistry 1966, 5 (7), 2473-83.48. Matayoshi, E. D.; Wang, G. T.; Krafft, G. A.; Erickson, J., Novel fluorogenic substratesfor assaying retroviral proteases by resonance energy transfer. Science 1990, 247 (4945), 954-8.49. Toth, M. V.; Marshall, G. R., A simple, continuous fluorometric assay for HIV protease.International journal ofpeptide and protein research 1990, 36 (6), 544-50.50. Miller, M., The early years of retroviral protease crystal structures. Biopolymers 2010, 94(4), 521-9.51. Wlodawer, A.; Miller, M.; Jaskolski, M.; Sathyanarayana, B. K.; Baldwin, E.; Weber, I.T.; Selk, L. M.; Clawson, L.; Schneider, J.; Kent, S. B., Conserved folding in retroviral proteases:crystal structure of a synthetic HIV-1 protease. Science 1989, 245 (4918), 616-21.52. Dunn, B. M.; Goodenow, M. M.; Gustchina, A.; Wlodawer, A., Retroviral proteases.Genome Biol. 2002, 3 (4), REVIEWS3006.53. Siva, M.; Svoboda, M.; Veverka, V.; Trempe, J. F.; Hofmann, K.; Kozisek, M.; Hexnerova,R.; Sedlak, F.; Belza, J.; Brynda, J.; Sacha, P.; Hubalek, M.; Starkova, J.; Flaisigova, I.;Konvalinka, J.; Saskova, K. G., Human DNA-damage-inducible 2 protein is structurally andfunctionally distinct from Its yeast ortholog. Sci. Rep. 2016, 6, 30443.54. Revuelta, M. V.; van Kan, J. A.; Kay, J.; Ten Have, A., Extensive expansion of Al familyaspartic proteinases in fungi revealed by evolutionary analyses of 107 complete eukaryoticproteomes. Genome Biol. Evol. 2014, 6 (6), 1480-94.55. Sparks, M. A.; Crowley, S. D.; Gurley, S. B.; Mirotsou, M.; Coffman, T. M., Classicalrenin-angiotensin system in kidney physiology. Compr. Physiol. 2014, 4 (3), 1201-28.56. Fruton, J. S., A history of pepsin and related enzymes. Q. Rev. Biol. 2002, 77 (2), 127-47.57. Strisovsky, K.; Tessmer, U.; Langner, J.; Konvalinka, J.; Krausslich, H. G., Systematicmutational analysis of the active-site threonine of HIV-I proteinase: rethinking the "fireman's grip"hypothesis. Protein Sci 2000, 9 (9), 1631-41.58. Galiano, L.; Bonora, M.; Fanucci, G. E., Interflap distances in HIV-1 protease determinedby pulsed EPR measurements. J Am. Chem. Soc. 2007, 129 (36), 11004-5.59. Hornak, V.; Okur, A.; Rizzo, R. C.; Simmerling, C., HIV-1 protease flaps spontaneouslyopen and reclose in molecular dynamics simulations. Proc. Natl. Acad Sci. USA 2006, 103 (4),915-20.

398

60. Radzicka, A.; Wolfenden, R., Rates of uncatalyzed peptide bond hydrolysis in neutralsolution and the transition state affinities of proteases. J Am. Chem. Soc. 1996, 118 (26), 6105-6109.61. Szecsi, P. B., The aspartic proteases. Scand J Clin. Lab. Invest. Suppl. 1992, 210, 5-22.62. Rodriguez, E. J.; Angeles, T. S.; Meek, T. D., Use of nitrogen-15 kinetic isotope effects toelucidate details of the chemical mechanism of human immunodeficiency virus 1 protease.Biochemistry 1993, 32 (46), 12380-5.63. Liu, F.; Kovalevsky, A. Y.; Louis, J. M.; Boross, P. I.; Wang, Y. F.; Harrison, R. W.;Weber, I. T., Mechanism of drug resistance revealed by the crystal structure of the unligandedHIV-1 protease with F53L mutation. Journal of molecular biology 2006, 358 (5), 1191-9.64. Sayer, J. M.; Liu, F.; Ishima, R.; Weber, I. T.; Louis, J. M., Effect of the active site D25Nmutation on the structure, stability, and ligand binding of the mature HIV-1 protease. J Biol.Chem. 2008, 283 (19), 13459-13470.65. Prabu-Jeyabalan, M.; Nalivaika, E.; Schiffer, C. A., Substrate shape determines specificityof recognition for HIV-1 protease: analysis of crystal structures of six substrate complexes.Structure 2002, 10 (3), 369-381.66. Kovalevsky, A. Y.; Chumanevich, A. A.; Liu, F.; Louis, J. M.; Weber, I. T., Caught in theact: the 1.5 A resolution crystal structures of the HIV-1 protease and the 154V mutant reveal atetrahedral reaction intermediate. Biochemistry 2007, 46 (51), 14854-64.67. Shen, C. H.; Tie, Y.; Yu, X.; Wang, Y. F.; Kovalevsky, A. Y.; Harrison, R. W.; Weber, I.T., Capturing the reaction pathway in near-atomic-resolution crystal structures of HIV- 1 protease.Biochemistry 2012, 51 (39), 7726-32.68. Das, A.; Prashar, V.; Mahale, S.; Serre, L.; Ferrer, J. L.; Hosur, M. V., Crystal structure ofHIV- 1 protease in situ product complex and observation of a low-barrier hydrogen bond betweencatalytic aspartates. Proc. Natl. Acad Sci. USA 2006, 103 (49), 18464-9.69. Tozser, J.; Weber, I. T.; Gustchina, A.; Blaha, I.; Copeland, T. D.; Louis, J. M.; Oroszlan,S., Kinetic and modeling studies of S3-S3' subsites of HIV proteinases. Biochemistry 1992, 31(20), 4793-800.70. Szeltner, Z.; Polgar, L., Rate-determining steps in HIV-1 protease catalysis. The hydrolysisof the most specific substrate. J Biol. Chem. 1996, 271 (50), 32180-32184.71. Hyland, L. J.; Tomaszek, T. A., Jr.; Roberts, G. D.; Carr, S. A.; Magaard, V. W.; Bryan,H. L.; Fakhoury, S. A.; Moore, M. L.; Minnich, M. D.; Culp, J. S.; et al., Human immunodeficiencyvirus-I protease. 1. Initial velocity studies and kinetic characterization of reaction intermediatesby 180 isotope exchange. Biochemistry 1991, 30 (34), 8441-53.72. Kipp, D. R.; Hirschi, J. S.; Wakata, A.; Goldstein, H.; Schramm, V. L., Transition states ofnative and drug-resistant HIV-1 protease are the same. Proc. Natl. Acad. Sci. USA 2012, 109 (17),6543-8.73. Wang, W.; Liang, T. C., Substrate specificity of porcine renin: P1', P1, and P3 residues ofrenin substrates are crucial for activity. Biochemistry 1994, 33 (48), 14636-41.74. Dhanaraj, V.; Dealwis, C. G.; Frazao, C.; Badasso, M.; Sibanda, B. L.; Tickle, I. J.; Cooper,J. B.; Driessen, H. P.; Newman, M.; Aguilar, C.; et al., X-ray analyses of peptide-inhibitorcomplexes define the structural basis of specificity for human and mouse renins. Nature 1992, 357(6378), 466-72.

399

75. Shintani, T.; Nomura, K.; Ichishima, E., Engineering of porcine pepsin. Alteration of SIsubstrate specificity of pepsin to those of fungal aspartic proteinases by site-directed mutagenesis.J Biol. Chem. 1997, 272 (30), 18855-61.76. Ali, A.; Bandaranayake, R. M.; Cai, Y.; King, N. M.; Kolli, M.; Mittal, S.; Murzycki, J. F.;Nalam, M. N.; Nalivaika, E. A.; Ozen, A.; Prabu-Jeyabalan, M. M.; Thayer, K.; Schiffer, C. A.,Molecular basis for drug resistance in HIV-1 protease. Viruses 2010, 2 (11), 2509-35.77. Weber, I. T.; Wu, J.; Adomat, J.; Harrison, R. W.; Kimmel, A. R.; Wondrak, E. M.; Louis,J. M., Crystallographic analysis of human immunodeficiency virus 1 protease with an analog ofthe conserved CA-p2 substrate -- interactions with frequently occurring glutamic acid residue atP2' position of substrates. Eur. J Biochem. 1997, 249 (2), 523-530.78. Tozser, J.; Bagossi, P.; Weber, I. T.; Louis, J. M.; Copeland, T. D.; Oroszlan, S., Studieson the symmetry and sequence context dependence of the HIV-1 proteinase specificity. J Biol.Chem. 1997, 272 (27), 16807-14.79. Potempa, M.; Lee, S. K.; Kurt Yilmaz, N.; Nalivaika, E. A.; Rogers, A.; Spielvogel, E.;Carter, C. W., Jr.; Schiffer, C. A.; Swanstrom, R., HIV-1 protease uses bi-specific S2/S2' subsitesto optimize cleavage of two classes of target sites. Journal ofmolecular biology 2018.80. Tozser, J., Comparative studies on retroviral proteases: substrate specificity. Viruses 2010,2 (1), 147-65.81. Miller, M.; Schneider, J.; Sathyanarayana, B. K.; Toth, M. V.; Marshall, G. R.; Clawson,L.; Selk, L.; Kent, S. B.; Wlodawer, A., Structure of complex of synthetic HIV-1 protease with asubstrate-based inhibitor at 2.3 A resolution. Science 1989, 246 (4934), 1149-52.82. Prabu-Jeyabalan, M.; Nalivaika, E.; Schiffer, C. A., How does a symmetric dimerrecognize an asymmetric substrate? A substrate complex of HIV-1 protease. Journal of molecularbiology 2000, 301 (5), 1207-20.83. Palella, F. J., Jr.; Delaney, K. M.; Moorman, A. C.; Loveless, M. 0.; Fuhrer, J.; Satten, G.A.; Aschman, D. J.; Holmberg, S. D., Declining morbidity and mortality among patients withadvanced human immunodeficiency virus infection. HIV Outpatient Study Investigators. N Engl.J Med 1998, 338 (13), 853-60.84. Paton, N. I.; Stohr, W.; Oddershede, L.; Arenas-Pinto, A.; Walker, S.; Sculpher, M.; Dunn,D. T., The protease inhibitor monotherapy versus ongoing triple therapy (PIVOT) trial: arandomised controlled trial of a protease inhibitor monotherapy strategy for long-termmanagement of human immunodeficiency virus infection. Health Technol. Assess. 2016, 20 (21),1-158.85. Nachega, J. B.; Marconi, V. C.; van Zyl, G. U.; Gardner, E. M.; Preiser, W.; Hong, S. Y.;Mills, E. J.; Gross, R., HIV treatment adherence, drug resistance, virologic failure: evolvingconcepts. Infect. Disord. Drug Targets 2011, 11 (2), 167-74.86. Umezawa, H.; Aoyagi, T.; Morishima, H.; Matsuzaki, M.; Hamada, M., Pepstatin, a newpepsin inhibitor produced by Actinomycetes. J Antibiot. 1970, 23 (5), 259-62.87. Marciniszyn, J., Jr.; Hartsuck, J. A.; Tang, J., Mode of inhibition of acid proteases bypepstatin. J Biol. Chem. 1976, 251 (22), 7088-94.88. Dreyer, G. B.; Metcalf, B. W.; Tomaszek, T. A., Jr.; Carr, T. J.; Chandler, A. C., 3rd;Hyland, L.; Fakhoury, S. A.; Magaard, V. W.; Moore, M. L.; Strickler, J. E.; et al., Inhibition ofhuman immunodeficiency virus 1 protease in vitro: rational design of substrate analogue inhibitors.Proc. Natl. Acad Sci. USA 1989, 86 (24), 9752-6.

400

89. Rich, D. H.;- Green, J.; Toth, M. V.; Marshall, G. R.; Kent, S. B., Hydroxyethylamineanalogues of the p1 7/p2 4 substrate cleavage site are tight-binding inhibitors of HIV protease. JMed. Chem. 1990, 33 (5), 1285-8.90. Boger, J.; Lohr, N. S.; Ulm, E. H.; Poe, M.; Blaine, E. H.; Fanelli, G. M.; Lin, T. Y.; Payne,L. S.; Schorn, T. W.; LaMont, B. I.; Vassil, T. C.; Stabilito, II; Veber, D. F.; Rich, D. H.; Bopari,A. S., Novel renin inhibitors containing the amino acid statine. Nature 1983, 303 (5912), 81-4.91. Roberts, N. A.; Martin, J. A.; Kinchington, D.; Broadhurst, A. V.; Craig, J. C.; Duncan, I.B.; Galpin, S. A.; Handa, B. K.; Kay, J.; Krohn, A.; et al., Rational design of peptide-based HIVproteinase inhibitors. Science 1990, 248 (4953), 358-61.92. Merry, C.; Barry, M. G.; Mulcahy, F.; Ryan, M.; Heavey, J.; Tjia, J. F.; Gibbons, S. E.;Breckenridge, A. M.; Back, D. J., Saquinavir pharmacokinetics alone and in combination withritonavir in HIV-infected patients. AIDS 1997, 11 (4), F29-33.93. Noor, M. A.; Lo, J. C.; Mulligan, K.; Schwarz, J. M.; Halvorsen, R. A.; Schambelan, M.;Grunfeld, C., Metabolic effects of indinavir in healthy HIV-seronegative men. AIDS 2001, 15 (7),F11-8.94. Romines, K. R.; Watenpaugh, K. D.; Howe, W. J.; Tomich, P. K.; Lovasz, K. D.; Morris,J. K.; Janakiraman, M. N.; Lynn, J. C.; Horng, M. M.; Chong, K. T.; et al., Structure-based designof nonpeptidic HIV protease inhibitors from a cyclooctylpyranone lead structure. J Med. Chem.1995, 38 (22), 4463-73.95. Ghosh, A. K.; Thompson, W. J.; McKee, S. P.; Duong, T. T.; Lyle, T. A.; Chen, J. C.;Darke, P. L.; Zugay, J. A.; Emini, E. A.; Schleif, W. A.; et al., 3-Tetrahydrofuran and pyranurethanes as high-affinity P2-ligands for HIV-1 protease inhibitors. J Med. Chem. 1993, 36 (2),292-4.96. Kim, E. E.; Baker, C. T.; Dwyer, M. D.; Murcko, M. A.; Rao, B. G.; Tung, R. D.; Navia,M. A., Crystal-structure of HIV-1 protease in complex with VX-478, a potent and orallybioavailable inhibitor of the enzyme. J Am. Chem. Soc. 1995, 117 (3), 1181-1182.97. Ghosh, A. K., Aspartic Acid Proteases as Therapeutic Targets. Wiley-VCH: Weinheim,Germany, 2010; p xxiii, 615 p.98. Condra, J. H.; Schleif, W. A.; Blahy, 0. M.; Gabryelski, L. J.; Graham, D. J.; Quintero, J.C.; Rhodes, A.; Robbins, H. L.; Roth, E.; Shivaprakash, M.; et al., In vivo emergence of HIV-1variants resistant to multiple protease inhibitors. Nature 1995, 374 (6522), 569-71.99. Falcoz, C.; Jenkins, J. M.; Bye, C.; Hardman, T. C.; Kenney, K. B.; Studenberg, S.; Fuder,H.; Prince, W. T., Pharmacokinetics of GW433908, a prodrug of amprenavir, in healthy malevolunteers. J Clin. Pharmacol. 2002, 42 (8), 887-98.100. Hester, E. K.; Chandler, H. V.; Sims, K. M., Fosamprenavir: drug development foradherence. Ann. Pharmacother. 2006, 40 (7-8), 1301-10.101. Sham, H. L.; Kempf, D. J.; Molla, A.; Marsh, K. C.; Kumar, G. N.; Chen, C. M.; Kati, W.;Stewart, K.; Lal, R.; Hsu, A.; Betebenner, D.; Korneyeva, M.; Vasavanonda, S.; McDonald, E.;Saldivar, A.; Wideburg, N.; Chen, X.; Niu, P.; Park, C.; Jayanti, V.; Grabowski, B.; Granneman,G. R.; Sun, E.; Japour, A. J.; Leonard, J. M.; Plattner, J. J.; Norbeck, D. W., ABT-378, a highlypotent inhibitor of the human immunodeficiency virus protease. Antimicrob. Agents Chemother.1998, 42 (12), 3218-24.102. Bold, G.; Fassler, A.; Capraro, H. G.; Cozens, R.; Klimkait, T.; Lazdins, J.; Mestan, J.;Poncioni, B.; Rosel, J.; Stover, D.; Tintelnot-Blomley, M.; Acemoglu, F.; Beck, W.; Boss, E.;

401

Eschbach, M.; Hurlimann, T.; Masso, E.; Roussel, S.; Ucci-Stoll, K.; Wyss, D.; Lang, M., Newaza-dipeptide analogues as potent and orally absorbed HIV-1 protease inhibitors: candidates forclinical development. J Med Chem. 1998, 41 (18), 3387-401.103. Turner, S. R.; Strohbach, J. W.; Tommasi, R. A.; Aristoff, P. A.; Johnson, P. D.; Skulnick,H. I.; Dolak, L. A.; Seest, E. P.; Tomich, P. K.; Bohanon, M. J.; Horng, M. M.; Lynn, J. C.; Chong,K. T.; Hinshaw, R. R.; Watenpaugh, K. D.; Janakiraman, M. N.; Thaisrivongs, S., Tipranavir(PNU-140690): a potent, orally bioavailable nonpeptidic HIV protease inhibitor of the 5,6-dihydro-4-hydroxy-2-pyrone sulfonamide class. J Med Chem. 1998, 41 (18), 3467-76.104. Poppe, S. M.; Slade, D. E.; Chong, K. T.; Hinshaw, R. R.; Pagano, P. J.; Markowitz, M.;Ho, D. D.; Mo, H.; Gorman, R. R., 3rd; Dueweke, T. J.; Thaisrivongs, S.; Tarpley, W. G., Antiviralactivity of the dihydropyrone PNU-140690, a new nonpeptidic human immunodeficiency virusprotease inhibitor. Antimicrob. Agents Chemother. 1997, 41 (5), 1058-63.105. Ghosh, A. K.; Kincaid, J. F.; Cho, W.; Walters, D. E.; Krishnan, K.; Hussain, K. A.; Koo,Y.; Cho, H.; Rudall, C.; Holland, L.; Buthod, J., Potent HIV protease inhibitors incorporating high-affinity P2-ligands and (R)-(hydroxyethylamino)sulfonamide isostere. Bioorg. Med. Chem. Lett.1998, 8 (6), 687-90.106. Ghosh, A. K.; Kincaid, J. F.; Walters, D. E.; Chen, Y.; Chaudhuri, N. C.; Thompson, W.J.; Culberson, C.; Fitzgerald, P. M.; Lee, H. Y.; McKee, S. P.; Munson, P. M.; Duong, T. T.; Darke,P. L.; Zugay, J. A.; Schleif, W. A.; Axel, M. G.; Lin, J.; Huff, J. R., Nonpeptidal P2 ligands forHIV protease inhibitors: structure-based design, synthesis, and biological evaluation. J MedChem. 1996, 39 (17), 3278-90.107. De Meyer, S.; Azijn, H.; Surleraux, D.; Jochmans, D.; Tahri, A.; Pauwels, R.; Wigerinck,P.; de Bethune, M. P., TMC 114, a novel human immunodeficiency virus type 1 protease inhibitoractive against protease inhibitor-resistant viruses, including a broad range of clinical isolates.Antimicrob. Agents Chemother. 2005, 49 (6), 2314-21.108. Hayashi, H.; Takamune, N.; Nirasawa, T.; Aoki, M.; Morishita, Y.; Das, D.; Koh, Y.;Ghosh, A. K.; Misumi, S.; Mitsuya, H., Dimerization of HIV-1 protease occurs through two stepsrelating to the mechanism of protease dimerization inhibition by darunavir. Proc. Natl. Acad Sci.USA 2014, 111 (33), 12234-9.109. Weber, I. T.; Agniswamy, J., HIV-1 protease: structural perspectives on drug resistance.Viruses 2009, 1 (3), 1110-36.110. Rick, S. W.; Topol, I. A.; Erickson, J. W.; Burt, S. K., Molecular mechanisms of resistance:free energy calculations of mutation effects on inhibitor binding to HIV-1 protease. Protein Sci1998, 7 (8), 1750-6.111. Ragland, D. A.; Whitfield, T. W.; Lee, S. K.; Swanstrom, R.; Zeldovich, K. B.; Kurt-Yilmaz, N.; Schiffer, C. A., Elucidating the interdependence of drug resistance from combinationsof mutations. J. Chem. Theory Comput. 2017, 13 (11), 5671-5682.112. Van Marck, H.; Dierynck, I.; Kraus, G.; Hallenberger, S.; Pattery, T.; Muyldermans, G.;Geeraert, L.; Borozdina, L.; Bonesteel, R.; Aston, C.; Shaw, E.; Chen, Q.; Martinez, C.; Koka, V.;Lee, J.; Chi, E.; de Bethune, M. P.; Hertogs, K., The impact of individual humanimmunodeficiency virus type 1 protease mutations on drug susceptibility is highly influenced bycomplex interactions with the background protease sequence. Journal of virology 2009, 83 (18),9512-20.

402

113. Piana, S.; Carloni, P.; Rothlisberger, U., Drug resistance in HIV-lprotease: fexibility-assisted mechanism of compensatory mutations. Protein Sci 2002, 11 (10), 2393-402.114. Kolli, M.; Stawiski, E.; Chappey, C.; Schiffer, C. A., Human immunodeficiency virus type1 protease-correlated cleavage site mutations enhance inhibitor resistance. Journal of virology2009, 83 (21), 11027-42.115. Kurt Yilmaz, N.; Swanstrom, R.; Schiffer, C. A., Improving viral protease inhibitors tocounter drug resistance. Trends Microbiol. 2016, 24 (7), 547-557.116. Liu, Z.; Huang, X.; Hu, L.; Pham, L.; Poole, K. M.; Tang, Y.; Mahon, B. P.; Tang, W.; Li,K.; Goldfarb, N. E.; Dunn, B. M.; McKenna, R.; Fanucci, G. E., Effects of hinge-region naturalpolymorphisms on human immunodeficiency virus-type 1 protease structure, dynamics, and drugpressure evolution. J Biol. Chem. 2016, 291 (43), 22741-22756.117. Martin, P.; Vickrey, J. F.; Proteasa, G.; Jimenez, Y. L.; Wawrzak, Z.; Winters, M. A.;Merigan, T. C.; Kovari, L. C., "Wide-open" 1.3 A structure of a multidrug-resistant HIV-1 proteaseas a drug target. Structure 2005, 13 (12), 1887-95.118. Agniswamy, J.; Shen, C. H.; Aniana, A.; Sayer, J. M.; Louis, J. M.; Weber, I. T., HIV-1protease with 20 mutations exhibits extreme resistance to clinical inhibitors through coordinatedstructural rearrangements. Biochemistry 2012, 51 (13), 2819-28.119. de Vera, I. M.; Smith, A. N.; Dancel, M. C.; Huang, X.; Dunn, B. M.; Fanucci, G. E.,Elucidating a relationship between conformational sampling and drug resistance in HIV-1protease. Biochemistry 2013, 52 (19), 3278-88.120. Chetty, S.; Bhakat, S.; Martin, A. J.; Soliman, M. E., Multi-drug resistance profile of PR20HIV-1 protease is attributed to distorted conformational and drug binding landscape: moleculardynamics insights. J Biomol. Struct. Dyn. 2016, 34 (1), 135-51.121. Louis, J. M.; Roche, J., Evolution under drug pressure remodels the folding free-energylandscape of mature HIV-1 protease. Journal of molecular biology 2016, 428 (13), 2780-92.122. Zhang, Y.; Chang, Y. C.; Louis, J. M.; Wang, Y. F.; Harrison, R. W.; Weber, I. T.,Structures of darunavir-resistant HIV-1 protease mutant reveal atypical binding of darunavir towide open flaps. A CS Chem. Biol. 2014, 9 (6), 1351-8.123. Aoki, M.; Hayashi, H.; Yedidi, R. S.; Martyr, C. D.; Takamatsu, Y.; Aoki-Ogata, H.;Nakamura, T.; Nakata, H.; Das, D.; Yamagata, Y.; Ghosh, A. K.; Mitsuya, H., C-5-modifiedtetrahydropyrano-tetrahydofuran-derived protease inhibitors (PIs) exert potent inhibition of thereplication of HIV-1 variants highly resistant to various PIs, including darunavir. Journal ofvirology 2015, 90 (5), 2180-94.124. DeFrancesco, H.; Dudley, J.; Coca, A., Boron chemistry: an overview. In Boron Reagentsin Synthesis, American Chemical Society: 2016; Vol. 1236, pp 1-25.125. Lennox, A. J.; Lloyd-Jones, G. C., Selection of boron reagents for Suzuki-Miyauracoupling. Chem. Soc. Rev. 2014, 43 (1), 412-43.126. Smoum, R.; Rubinstein, A.; Dembitsky, V. M.; Srebnik, M., Boron containing compoundsas protease inhibitors. Chem. Rev. 2012, 112 (7), 4156-220.127. Tomsho, J. W.; Pal, A.; Hall, D. G.; Benkovic, S. J., Ring structure and aromatic substituenteffects on the pKa of the benzoxaborole pharmacophore. ACS Med Chem. Lett. 2012, 3 (1), 48-52.128. Lorand, J. P.; Edwards, J. 0., Polyol complexes and structure of the benzeneboronate ion.J Org. Chem. 1958, 24 (6), 769-774.

403

129. Babcock, L.; Pizer, R., Dynamics of boron acid complexation reactions. Formation of 1:1boron acid-ligand complexes. Inorg. Chem. 1980, 19 (1), 56-61.130. Yan, J.; Springsteen, G.; Deeter, S.; Wang, B., The relationship among pKa, pH, andbinding constants in the interactions between boronic acids and diols-it is not as simple as itappears. Tetrahedron 2004, 60 (49), 11205-11209.131. Bromba, C.; Carrie, P.; Chui, J. K. W.; Fyles, T. M., Phenyl boronic acid complexes ofdiols and hydroxyacids. Supramol. Chem. 2009, 21 (1-2), 81-88.132. Dowlut, M.; Hall, D. G., An improved class of sugar-binding boronic acids, soluble andcapable of complexing glycosides in neutral water. J Am. Chem. Soc. 2006, 128 (13), 4226-4227.133. Bosch, L. I.; Fyles, T. M.; James, T. D., Binary and ternary phenylboronic acid complexeswith saccharides and Lewis bases. Tetrahedron 2004, 60 (49), 11175-11190.134. Diaz, D. B.; Yudin, A. K., The versatility of boron in biological target engagement. Nat.Chem. 2017, 9 (8), 731-742.135. Lagoutte, R.; Patouret, R.; Winssinger, N., Covalent inhibitors: an opportunity for rationaltarget selectivity. Curr. Opin. Chem. Biol. 2017, 39, 54-63.136. Simon, G. M.; Cravatt, B. F., Activity-based proteomics of enzyme superfamilies: serinehydrolases as a case study. J Biol. Chem. 2010, 285 (15), 11051-5.137. Fu, H.; Fang, H.; Sun, J.; Wang, H.; Liu, A.; Sun, J.; Wu, Z., Boronic acid-based enzymeinhibitors: a review of recent progress. Curr. Med. Chem. 2014, 21 (28), 3271-80.138. Zervosen, A.; Herman, R.; Kerff, F.; Herman, A.; Bouillez, A.; Prati, F.; Pratt, R. F.; Frere,J. M.; Joris, B.; Luxen, A.; Charlier, P.; Sauvage, E., Unexpected tricovalent binding mode ofboronic acids within the active site of a penicillin-binding protein. J Am. Chem. Soc. 2011, 133(28), 10839-48.139. Lienhard, G. E.; Koehler, K. A., 2-Phenylethaneboronic acid, a possible transition-stateanalog for chymotrypsin. Biochemistry 1971, 10 (13), 2477-2483.140. Matthews, D. A.; Alden, R. A.; Birktoft, J. J.; Freer, S. T.; Kraut, J., X-ray crystallographicstudy of boronic acid adducts with subtilisin BPN' (Novo). A model for the catalytic transitionstate. J. Biol. Chem. 1975, 250 (18), 7120-6.141. Pekol, T.; Daniels, J. S.; Labutti, J.; Parsons, I.; Nix, D.; Baronas, E.; Hsieh, F.; Gan, L. S.;Miwa, G., Human metabolism of the proteasome inhibitor bortezomib: identification of circulatingmetabolites. Drug Metab. Dispos. 2005, 33 (6), 771-777.142. Raedler, L. A., Ninlaro (Ixazomib): first oral proteasome inhibitor approved for thetreatment of patients with relapsed or refractory multiple myeloma. Am. Health Drug Benefits2016, 9 (Spec Feature), 102-5.143. Vogl, D. T.; Martin, T. G.; Vij, R.; Hari, P.; Mikhael, J. R.; Siegel, D.; Wu, K. L.; Delforge,M.; Gasparetto, C., Phase I/I study of the novel proteasome inhibitor delanzomib (CEP-18770)for relapsed and refractory multiple myeloma. Leuk. Lymphoma 2017, 58 (8), 1872-1879.144. Schrader, J.; Henneberg, F.; Mata, R. A.; Tittmann, K.; Schneider, T. R.; Stark, H.;Bourenkov, G.; Chari, A., The inhibition mechanism of human 20S proteasomes enables next-generation inhibitor design. Science 2016, 353 (6299), 594.145. Vanderheyden, P. M.; Fierens, F. L.; De Backer, J.; Vauquelin, G., Reversible and syntopicinteraction between angiotensin receptor antagonists on Chinese hamster ovary cells expressinghuman angiotensin II type 1 receptors. Biochem. Pharmacol. 2000, 59 (8), 927-35.

404

146. Rock, F L.; Mao, W.; Yaremhk, A.; Tunalo, M.; Crepin, T.; Zh i H.; 7hng, Y. K.;Hernandez, V.; Akama, T.; Baker, S. J.; Plattner, J. J.; Shapiro, L.; Martinis, S. A.; Benkovic, S.J.; Cusack, S.; Alley, M. R., An antifungal agent inhibits an aminoacyl-tRNA synthetase bytrapping tRNA in the editing site. Science 2007, 316 (5832), 1759-61.147. Elewski, B. E.; Aly, R.; Baldwin, S. L.; Gonzalez Soto, R. F.; Rich, P.; Weisfeld, M.; Wiltz,H.; Zane, L. T.; Pollak, R., Efficacy and safety of tavaborole topical solution, 5%, a novel boron-based antifungal agent, for the treatment of toenail onychomycosis: results from 2 randomizedphase-III studies. J Am. Acad Dermatol. 2015, 73 (1), 62-69.148. Freund, Y. R.; Akama, T.; Alley, M. R.; Antunes, J.; Dong, C.; Jarnagin, K.; Kimura, R.;Nieman, J. A.; Maples, K. R.; Plattner, J. J.; Rock, F.; Sharma, R.; Singh, R.; Sanders, V.; Zhou,Y., Boron-based phosphodiesterase inhibitors show novel binding of boron to PDE4 bimetalcenter. FEBS Lett. 2012, 586 (19), 3410-4.149. Nazarian, R.; Weinberg, J. M., AN-2728, a PDE4 inhibitor for the potential topicaltreatment of psoriasis and atopic dermatitis. Curr. Opin. Investig. Drugs 2009, 10 (11), 1236-42.150. Ajami, D.; Dube, H.; Rebek, J., Jr., Boronic acid hydrogen bonding in encapsulationcomplexes. J Am. Chem. Soc. 2011, 133 (25), 9689-91.151. Akama, T.; Dong, C.; Virtucio, C.; Sullivan, D.; Zhou, Y.; Zhang, Y. K.; Rock, F.; Freund,Y.; Liu, L.; Bu, W.; Wu, A.; Fan, X. Q.; Jarnagin, K., Linking phenotype to kinase: identificationof a novel benzoxaborole hinge-binding motif for kinase inhibition and development of high-potency rho kinase inhibitors. J. Pharmacol. Exp. Ther. 2013, 347 (3), 615-25.152. Lomax, J. E.; Eller, C. H.; Raines, R. T., Rational design and evaluation of mammalianribonuclease cytotoxins. Methods Enzymol. 2012, 502, 273-90.153. Rutkoski, T. J.; Raines, R. T., Evasion of ribonuclease inhibitor as a determinant ofribonuclease cytotoxicity. Curr. Pharm. Biotechnol. 2008, 9 (3), 185-9.154. Raines, R. T., Ribonuclease A. Chem. Rev. 1998, 98 (3), 1045-1066.155. Klink, T. A.; Woycechowsky, K. J.; Taylor, K. M.; Raines, R. T., Contribution of disulfidebonds to the conformational stability and catalytic activity of ribonuclease A. Eur. J. Biochem.2000, 267 (2), 566-72.156. Ruoppolo, M.; Vinci, F.; Klink, T. A.; Raines, R. T.; Marino, G., Contribution of individualdisulfide bonds to the oxidative folding of ribonuclease A. Biochemistry 2000, 39 (39), 12033-42.157. Kartha, G.; Bello, J.; Harker, D., Tertiary structure of ribonuclease. Nature 1967, 213(5079), 862-5.158. Zito, E.; Melo, E. P.; Yang, Y.; Wahlander, A.; Neubert, T. A.; Ron, D., Oxidative proteinfolding by an endoplasmic reticulum-localized peroxiredoxin. Mol. Cell 2010, 40 (5), 787-97.159. delCardayre, S. B.; Ribo, M.; Yokel, E. M.; Quirk, D. J.; Rutter, W. J.; Raines, R. T.,Engineering ribonuclease A: production, purification and characterization of wild-type enzymeand mutants at Gln1 1. Protein Eng. 1995, 8 (3), 261-73.160. Findlay, D.; Herries, D. G.; Mathias, A. P.; Rabin, B. R.; Ross, C. A., The active site andmechanism of action of bovine pancreatic ribonuclease. Nature 1961, 190, 781-84.161. delCardayre, S. B.; Raines, R. T., The extent to which ribonucleases cleave ribonucleicacid. Analytical biochemistry 1995, 225 (1), 176-8.162. Kelemen, B. R.; Schultz, L. W.; Sweeney, R. Y.; Raines, R. T., Excavating an active site:the nucleobase specificity of ribonuclease A. Biochemistry 2000, 39 (47), 14487-94.

405

163. Thompson, J. E.; Kutateladze, T. G.; Schuster, M. C.; Venegas, F. D.; Messmore, J. M.;Raines, R. T., Limits to catalysis by ribonuclease A. Bioorg. Chem. 1995, 23 (4), 471-481.164. Leland, P. A.; Staniszewski, K. E.; Park, C.; Kelemen, B. R.; Raines, R. T., Theribonucleolytic activity of angiogenin. Biochemistry 2002, 41 (4), 1343-50.165. Hoang, T. T.; Raines, R. T., Molecular basis for the autonomous promotion of cellproliferation by angiogenin. Nucleic Acids Res. 2017, 45 (2), 818-831.166. Notomista, E.; Catanzano, F.; Graziano, G.; Piaz, F. D.; Barone, G.; D'Alessio, G.; DiDonato, A., Onconase: an unusually stable protein. Biochemistry 2000, 39 (30), 8711-8718.167. Reitsma, S.; Slaaf, D. W.; Vink, H.; van Zandvoort, M. A.; oude Egbrink, M. G., Theendothelial glycocalyx: composition, functions, and visualization. Pflugers Arch. 2007, 454 (3),345-59.168. Sundlass, N. K.; Eller, C. H.; Cui, Q.; Raines, R. T., Contribution of electrostatics to thebinding of pancreatic-type ribonucleases to membranes. Biochemistry 2013, 52 (37), 6304-12.169. Chao, T. Y.; Lavis, L. D.; Raines, R. T., Cellular uptake of ribonuclease A relies on anionicglycans. Biochemistry 2010, 49 (50), 10666-73.170. Fuchs, S. M.; Raines, R. T., Pathway for polyarginine entry into mammalian cells.Biochemistry 2004, 43 (9), 2438-44.171. Richard, J. P.; Melikov, K.; Brooks, H.; Prevot, P.; Lebleu, B.; Chernomordik, L. V.,Cellular uptake of unconjugated TAT peptide involves clathrin-dependent endocytosis andheparan sulfate receptors. J Biol. Chem. 2005, 280 (15), 15300-6.172. Chao, T. Y.; Raines, R. T., Mechanism of ribonuclease A endocytosis: analogies to cell-penetrating peptides. Biochemistry 2011, 50 (39), 8374-82.173. Varkouhi, A. K.; Scholte, M.; Storm, G.; Haisma, H. J., Endosomal escape pathways fordelivery of biologicals. J Control Release 2011, 151 (3), 220-8.174. L6nn, P.; Kacsinta, A. D.; Cui, X.-S.; Hamil, A. S.; Kaulich, M.; Gogoi, K.; Dowdy, S. F.,Enhancing endosomal escape for intracellular delivery of macromolecular biologic therapeutics.Sci. Rep. 2016, 6, 32301.175. Lee, J. E.; Raines, R. T., Ribonucleases as novel chemotherapeutics: the ranpirnaseexample. BioDrugs 2008, 22 (1), 53-8.176. Haigis, M. C.; Kurten, E. L.; Raines, R. T., Ribonuclease inhibitor as an intracellular sentry.Nucleic Acids Res. 2003, 31 (3), 1024-32.177. Dickson, K. A.; Haigis, M. C.; Raines, R. T., Ribonuclease inhibitor: structure andfunction. Prog. Nucleic Acid Res. Mol. Biol. 2005, 80, 349-74.178. Johnson, R. J.; Lavis, L. D.; Raines, R. T., Intraspecies regulation of ribonucleolyticactivity. Biochemistry 2007, 46 (45), 13131-40.179. Lomax, J. E.; Bianchetti, C. M.; Chang, A.; Phillips, G. N., Jr.; Fox, B. G.; Raines, R. T.,Functional evolution of ribonuclease inhibitor: insights from birds and reptiles. Journal ofmolecular biology 2014, 426 (17), 3041-56.180. Kobe, B.; Deisenhofer, J., A structural basis of the interactions between leucine-richrepeats and protein ligands. Nature 1995, 374 (6518), 183-6.181. Turcotte, R. F.; Lavis, L. D.; Raines, R. T., Onconase cytotoxicity relies on the distributionof its positive charge. FEBS] 2009, 276 (14), 3846-57.

406

182. Johnson, R. J.; McCoy, J. G.; Bingman, C. A ; Phillips, G. N., Jr; Raines, R. T., Inhibitionof human pancreatic ribonuclease by the human ribonuclease inhibitor protein. Journal ofmolecular biology 2007, 368 (2), 434-49.183. Johnson, R. J.; Chao, T. Y.; Lavis, L. D.; Raines, R. T., Cytotoxic ribonucleases: thedichotomy of Coulombic forces. Biochemistry 2007, 46 (36), 10308-16.184. Lomax, J. E.; Eller, C. H.; Raines, R. T., Comparative functional analysis of ribonuclease1 homologs: molecular insights into evolving vertebrate physiology. Biochem. J 2017, 474 (13),2219-2233.185. Koczera, P.; Martin, L.; Marx, G.; Schuerholz, T., The ribonuclease A superfamily inhumans: canonical RNases as the buttress of innate immunity. Int. J Mol. Sci. 2016, 17 (8).186. Pulido, D.; Torrent, M.; Andreu, D.; Nogues, M. V.; Boix, E., Two human host defenseribonucleases against mycobacteria, the eosinophil cationic protein (RNase 3) and RNase 7.Antimicrob. Agents Chemother. 2013, 57 (8), 3797.187. Rosenberg, H. F., Eosinophil-derived neurotoxin (EDN/RNase 2) and the mouseeosinophil-associated RNases (mEars): expanding roles in promoting host defense. Int. J Mol. Sci.2015, 16 (7), 15442-55.188. Cocchi, F.; DeVico, A. L.; Lu, W.; Popovic, M.; Latinovic, 0.; Sajadi, M. M.; Redfield,R. R.; Lafferty, M. K.; Galli, M.; Garzino-Demo, A.; Gallo, R. C., Soluble factors from T cellsinhibiting X4 strains of HIV are a mixture of beta chemokines and RNases. Proc. Natl. Acad Sci.USA 2012, 109 (14), 5411-6.189. Youle, R. J.; Wu, Y. N.; Mikulski, S. M.; Shogen, K.; Hamilton, R. S.; Newton, D.;D'Alessio, G.; Gravell, M., RNase inhibition of human immunodeficiency virus infection of H9cells. Proc. Natl. Acad. Sci. USA 1994, 91 (13), 6012-6.190. Bedoya, V. I.; Boasso, A.; Hardy, A. W.; Rybak, S.; Shearer, G. M.; Rugeles, M. T.,Ribonucleases in HIV type 1 inhibition: effect of recombinant RNases on infection of primary Tcells and immune activation-induced RNase gene and protein expression. AIDS Res. Hum.Retroviruses 2006, 22 (9), 897-907.191. Swain, A. L.; Miller, M. M.; Green, J.; Rich, D. H.; Schneider, J.; Kent, S. B.; Wlodawer,A., X-ray crystallographic structure of a complex between a synthetic protease of humanimmunodeficiency virus 1 and a substrate-based hydroxyethylamine inhibitor. Proc. Natl. AcadSci. USA 1990, 87 (22), 8805-9.192. Rahuel, J.; Rasetti, V.; Maibaum, J.; Rueger, H.; Goschke, R.; Cohen, N. C.; Stutz, S.;Cumin, F.; Fuhrer, W.; Wood, J. M.; Grutter, M. G., Structure-based drug design: the discoveryof novel nonpeptide orally active inhibitors of human renin. Chem. Biol. 2000, 7 (7), 493-504.193. Fujinaga, M.; Chernaia, M. M.; Tarasova, N. I.; Mosimann, S. C.; James, M. N., Crystalstructure of human pepsin and its complex with pepstatin. Protein Sci 1995, 4 (5), 960-72.194. Fontecilla-Camps, J. C.; de Llorens, R.; le Du, M. H.; Cuchillo, C. M., Crystal structure ofribonuclease A.d(ApTpApApG) complex. Direct evidence for extended substrate recognition. JBiol. Chem. 1994, 269 (34), 21526-31.195. Brik, A.; Wong, C.-H., HIV-1 protease: mechanism and drug discovery. Org. Biomol.Chem. 2003, 1, 5-14.196. Ghosh, A.; Chapsal, B.; Weber, I.; Mitsuya, H., Design of HIV protease inhibitors targetingprotein backbone: an effective strategy for combating drug resistance. Acc. Chem. Res. 2008, 41,78-86.

407

197. Wensing, A. M. J.; van Maarseveen, N. M.; Nijhuis, M., Fifteen years of HIV proteaseinhibitors: raising the barrier to resistance. Antiviral Res. 2010, 85, 59-74.198. Altman, M. D.; Ali, A.; Reddy, G. S.; Nalam, M. N.; Anjum, S. G.; Cao, H.; Chellappan,S.; Kairys, V.; Fernandes, M. X.; Gilson, M. K.; Schiffer, C. A.; Rana, T. M.; Tidor, B., HIV-1protease inhibitors from inverse design in the substrate envelope exhibit subnanomolar binding todrug-resistant variants. J Am. Chem. Soc. 2008, 130 (19), 6099-113.199. Ghosh, A. K.; Dawson, Z. L.; Mitsuya, H., Darunavir, a conceptually new HIV-1 proteaseinhibitor for the treatment of drug-resistant HIV. Bioorg. Med Chem. 2007, 15 (24), 7576-80.200. King, N. M.; Prabu-Jeyabalan, M.; Nalivaika, E. A.; Wigerinck, P.; de Bethune, M. P.;Schiffer, C. A., Structural and thermodynamic basis for the binding of TMC 114, a next-generationhuman immunodeficiency virus type 1 protease inhibitor. Journal of virology 2004, 78 (21),12012-21.201. Hazen, R.; Harvey, R.; Ferris, R.; Craig, C.; Yates, P.; Griffin, P.; Miller, J.; Kaldor, I.;Ray, J.; Samano, V.; Furfine, E.; Spaltenstein, A.; Hale, M.; Tung, R.; St Clair, M.; Hanlon, M.;Boone, L., In vitro antiviral activity of the novel, tyrosyl-based human immunodeficiency virus(HIV) type 1 protease inhibitor brecanavir (GW640385) in combination with other antiretroviralsand against a panel of protease inhibitor-resistant HIV. Antimicrob. Agents Chemother. 2007, 51(9), 3147-54.202. Burlingham, B. T.; Widlanski, T. S., An intuitive look at the relationship of Ki and IC5 0 : amore general use for the Dixon plot. J Chem Educ 2003, 80 (2), 214-218.203. Williams, J. W.; Morrison, J. F., The kinetics of reversible tight-binding inhibition.Methods Enzymol. 1979, 63, 437-467.204. Murphy, D. J., Determination of accurate K values for tight-binding enzyme inhibitors:An in silico study of experimental error and assay design. Analytical biochemistry 2004, 327 (1),61-7.205. Velazquez-Campoy, A.; Freire, E., Isothermal titration calorimetry to determineassociation constants for high-affinity ligands. Nature protocols 2006, 1 (1), 186-91.206. Hanlon, M. H.; Porter, D. J.; Furfine, E. S.; Spaltenstein, A.; Carter, H. L.; Danger, D.;Shu, A. Y.; Kaldor, I. W.; Miller, J. F.; Samano, V. A., Inhibition of wild-type and mutant humanimmunodeficiency virus type 1 proteases by GW0385 and other arylsulfonamides. Biochemistry2004, 43 (45), 14500-7.207. Beck, Z. Q.; Hervio, L.; Dawson, P. E.; Elder, J. H.; Madison, E. L., Identification ofefficiently cleaved substrates for HIV- 1 protease using a phage display library and use in inhibitordevelopment. Virology 2000, 274 (2), 391-401.208. Wu, G., Assay Development: Fundamentals and Practices. John Wiley & Sons: Hoboken,NJ, 2010.209. Lavis, L. D.; Raines, R. T., Bright ideas for chemical biology. ACS Chem. Biol. 2008, 3,142-155.210. Lavis, L. D.; Rutkoski, T. J.; Raines, R. T., Tuning the pKa of fluorescein to optimizebinding assays. Anal. Chem. 2007, 79, 6775-6782.211. Sun, W.-C.; Gee, K. R.; Klaubert, D. H.; Haugland, R. P., Synthesis of fluorinatedfluoresceins. J Org. Chem. 1997, 62, 6469-6475.

408

212. Ingr, M.; Uhlikova, T.; Strisovsky, K.; Majerova, E.; Konvalinka, J., Kinetics of thedimerization of retroviral proteases: the "fireman's grip" and dimerization. Protein Sci. 2003, 12(10), 2173-82.213. Todd, M. J.; Semo, N.; Freire, E., The structural stability of the HIV-1 protease. Journalof molecular biology 1998, 283 (2), 475-88.214. Tie, Y.; Boross, P. I.; Wang, Y. F.; Gaddis, L.; Hussain, A. K.; Leshchenko, S.; Ghosh, A.K.; Louis, J. M.; Harrison, R. W.; Weber, I. T., High resolution crystal structures of HIV-1 proteasewith a potent non-peptide inhibitor (UIC-94017) active against multi-drug-resistant clinicalstrains. Journal of molecular biology 2004, 338 (2), 341-52.215. Gibson, D. G.; Young, L.; Chuang, R. Y.; Venter, J. C.; Hutchison, C. A., 3rd; Smith, H.0., Enzymatic assembly of DNA molecules up to serval hundred kilobases. Nat. Methods 2009, 6,343-345.216. Hui, J. 0.; Tomasselli, A. G.; Reardon, I. M.; Lull, J. M.; Brunner, D. P.; Tomich, C. S.;Heinrikson, R. L., Large scale purification and refolding of HIV- 1 protease from Escherichia coliinclusion bodies. Journal ofprotein chemistry 1993, 12 (3), 323-7.217. Artimo, P.; Jonnalagedda, M.; Arnold, K.; Baratin, D.; Csardi, G.; de Castro, E.; Duvaud,S.; Flegel, V.; Fortier, A.; Gasteiger, E.; Grosdidler, A.; Hernandez, C.; loannidis, V.; Kuznetsov,D.; Liechti, R.; Moretti, S.; Mostaguir, K.; Redaschi, N.; Rossier, G.; Xenarios, I.; Stockinger, H.,ExPASy: SIB bioinformatics resource portal. Nucleic Acids Res. 2012, 40, W597-W603.218. Prabu-Jeyabalan, M.; Nalivaika, E. A.; King, N. M.; Schiffer, C. A., Viability of a drug-resistant human immunodeficiency virus type 1 protease variant: structural insights for betterantiviral therapy. Journal of virology 2003, 77 (2), 1306-1315.219. Liu, Z.; Wang, Y.; Brunzelle, J.; Kovari, I. A.; Kovari, L. C., Nine crystal structuresdetermine the substrate envelope of the MDR HIV-1 protease. Protein J. 2011, 30 (3), 173-183.220. Tie, Y.; Boross, P. I.; Wang, Y. F.; Gaddis, L.; Liu, F.; Chen, X.; Tozser, J.; Harrison, R.W.; Weber, I. T., Molecular basis for substrate recognition and drug resistance from 1.1 to 1.6angstroms resolution crystal structures of HIV- 1 protease mutants with substrate analogs. FEBS J.2005, 272 (20), 5265-77.221. Lee, S. K.; Potempa, M.; Swanstrom, R., The choreography of HIV-1 proteolyticprocessing and virion assembly. J. Biol. Chem. 2012, 287 (49), 40867-40874.222. Altman, M. D.; Nalivaika, E. A.; Prabu-Jeyabalan, M.; Schiffer, C. A.; Tidor, B.,Computational design and experimental study of tighter binding peptides to an inactivated mutantof HIV-1 protease. Proteins 2008, 70 (3), 678-694.223. Windsor, I. W.; Raines, R. T., Fluorogenic assay for inhibitors of HIV-1 protease with sub-picomolar affinity. Sci. Rep. 2015, 5, 11286.224. Feldblum, E. S.; Arkin, I. T., Strength of a bifurcated H bond. Proc. Natl. Acad. Sci. USA2014,1]] (11), 4085-4090.225. Ridky, T. W.; Cameron, C. E.; Cameron, J.; Leis, J.; Copeland, T.; Wlodawer, A.; Weber,I. T.; Harrison, R. W., Human immunodeficiency virus, type 1 protease substrate specificity islimited by interactions between substrate amino acids bound in adjacent enzyme subsites. J Biol.Chem. 1996, 271 (9), 4709-4717.226. Ozen, A.; Haliloglu, T.; Schiffer, C. A., Dynamics of preferential substrate recognition inHIV-1 protease: redefining the substrate envelope. Journal of molecular biology 2011, 410 (4),726-744.

409

227. Velazquez-Campoy, A.; Todd, M. J.; Vega, S.; Freire, E., Catalytic efficiency and vitalityof HIV-1 proteases from African viral subtypes. Proc. Natl. Acad Sci. USA 2001, 98 (11), 6062-6067.228. Adams, P. D.; Afonine, P. V.; Bunkoczi, G.; Chen, V. B.; Davis, I. W.; Echols, N.; Headd,J. J.; Hung, L. W.; Kapral, G. J.; Grosse-Kunstleve, R. W.; McCoy, A. J.; Moriarty, N. W.; Oeffner,R.; Read, R. J.; Richardson, D. C.; Richardson, J. S.; Terwilliger, T. C.; Zwart, P. H., PHENIX: Acomprehensive Python-based system for macromolecular structure solution. Acta Crystallogr.2010, D66 (Pt 2), 213-221.229. Emsley, P.; Lohkamp, B.; Scott, W. G.; Cowtan, K., Features and development of Coot.A cta Crystallogr. 2010, D66 (Pt 4), 486-50 1.230. de Oliveira, T.; Engelbrecht, S.; van Rensburg, E. J.; Gordon, M.; Bishop, K.; zur Megede,J.; Barnett, S. W.; Cassol, S., Variability at human immunodeficiency virus type 1 subtype Cprotease cleavage sites: an indication of viral fitness? Journal of virology 2003, 77, 9422-9430.231. Knowles, J. R., On the mechanism of action of pepsin. Phil. Trans. Roy. Soc. Lond. 1970,B257, 135-146.232. Dunn, B. M., Structure and mechanism of the pepsin-like family of aspartic peptidases.Chem. Rev. 2002, 102 (12), 4431-4458.233. Wlodawer, A.; Gustchina, A., Structural and biochemical studies of retroviral proteases.Biochimica et biophysica acta 2000, 1477 (1-2), 16-34.234. Li, M.; Dimaio, F.; Zhou, D.; Gustchina, A.; Lubkowski, J.; Dauter, Z.; Baker, D.;Wlodawer, A., Crystal structure of XMRV protease differs from the structures of otherretropepsins. Nat. Struct. Mol. Biol. 2011, 18 (2), 227-229.235. Zhou, A.; Carrell, R. W.; Murphy, M. P.; Wei, Z.; Yan, Y.; Stanley, P. L. D.; Stein, P. E.;Broughton Pipkin, F.; Read, R. J., A redox switch in angiotensinogen modulates angiotensinrelease. Nature 2010, 468 (7320), 108-111.236. Green, D. W.; Aykent, S.; Gierse, J. K.; Zupec, M. E., Substrate specificity of recombinanthuman renal renin: effect of histidine in the P2 subsite on pH dependence. Biochemistry 1990, 29(12), 3126-3133.237. Tbzser, J., Comparative studies on retroviral proteases: Substrate specificity. Viruses 2010,2 (1), 147-165.238. Laco, G. S.; Schalk-Hihi, C.; Lubkowski, J.; Morris, G.; Zdanov, A.; Olson, A.; Elder, J.H.; Wlodawer, A.; Gustchina, A., Crystal structures of the inactive D30N mutant of felineimmunodeficiency virus protease complexed with a substrate and an inhibitor. Biochemistry 1997,36 (35), 10696-10708.239. Tyndall, J. D.; Nall, T.; Fairlie, D. P., Proteases universally recognize beta strands in theiractive sites. Chem. Rev. 2005, 105, 973-999.240. Dall'Acqua, W.; Carter, P., Substrate-assisted catalysis: molecular basis and biologicalsignificance. Protein Sci. 2000, 9, 1-9.241. Note3, 2018.242. Kolli, M.; Ozen, A.; Kurt-Yilmaz, N.; Schiffer, C. A., HIV-1 protease-substratecoevolution in nelfinavir resistance. Journal of virology 2014, 88 (13), 7145-7154.243. Tyndall, J. D.; Pattenden, L. K.; Reid, R. C.; Hu, S. H.; Alewood, D.; Alewood, P. F.;Walsh, T.; Fairlie, D. P.; Martin, J. L., Crystal structures of highly constrained substrate and

410

hydrolysis products bound to HIV-1 protease. Implicatinc fror the -t1Ayti-7 dxT caism.Biochemistry 2008, 47 (12), 3736-3744.244. Prabu-Jeyabalan, M.; Nalivaika, E. A.; King, N. M.; Schiffer, C. A., Structural basis forcoevolution of a human immunodeficiency virus type 1 nucleocapsid-p I cleavage site with a V82Adrug-resistant mutation in viral protease. Journal of virology 2004, 78, 12446-12454.245. Prabu-Jeyabaian, M.; Nalivaika, E. A.; Romano, K.; Schiffer, C. A., Mechanism ofsubstrate recognition by drug-resistant human immunodeficiency virus type 1 protease variantsrevealed by a novel structural intermediate. Journal of virology 2006, 80, 3607-3616.246. Bandaranayake, R. M.; Prabu-Jeyabalan, M.; Kakizawa, J.; Sugiura, W.; Schiffer, C. A.,Structural analysis of human immunodeficiency virus type 1 CRF01 _AE protease in complex withthe substrate p I-p6 . Journal of virology 2008, 82, 6762-6766.247. Ozen, A.; Lin, K. H.; Kurt Yilmaz, N.; Schiffer, C. A., Structural basis and distal effectsof Gag substrate coevolution in drug resistance to HIV-1 protease. Proc. Natl. Acad Sci. USA2014, 111 (45), 15993-15998.248. Perskie, L. L.; Street, T. 0.; Rose, G. D., Structures, basins, and energies: a deconstructionof the Protein Coil Library. Protein Sci. 2008, 17 (7), 1151-1161.249. Nemethy, G.; Printz, M. P., The y turn, a possible folded conformation of the polypeptidechain. comparison with the p turn. Macromolecules 1972, 5 (6), 755-758.250. Note2, 2018.251. Pettit, S. C.; Simsic, J.; Loeb, D. D.; Everitt, L.; Hutchison, C. A., 3rd; Swanstrom, R.,Analysis of retroviral protease cleavage sites reveals two types of cleavage sites and the structuralrequirements of the P1 amino acid. J Biol. Chem. 1991, 266 (22), 14539-14547.252. Newlander, K. A.; Callahan, J. F.; Moore, M. L.; Tomaszek, T. A.; Huffman, W. F., Anovel constrained reduced-amide inhibitor of HIV-1 protease derived from the sequentialincorporation of 7-tum mimetics into a model substrate. J Med. Chem. 1993, 36 (16), 2321-233 1.253. Moore, M. L.; Dreyer, G. B., Substrate-based inhibitors of HIV-1 protease. Perspect. DrugDiscov. Des. 1993, 1 (1), 85-108.254. Hoog, S. S.; Zhao, B.; Winborne, E.; Fisher, S.; Green, D. W.; DesJarlais, R. L.;Newlander, K. A.; Callahan, J. F.; Abdel-Meguid, S. S.; Moore, M. L.; Huffman, W. F., A checkon rational drug design: crystal structure of a complex of human immunodeficiency virus type 1protease with a novel y-turn mimetic inhibitor. J Med Chem. 1995, 38 (17), 3246-3252.255. Wilhelm, P.; Lewandowski, B.; Trapp, N.; Wennemers, H., A crystal structure of anoligoproline PPII-helix, at last. J Am. Chem. Soc. 2014, 136 (45), 15829-15832.256. Adzhubei, A. A.; Sternberg, M. J.; Makarov, A. A., Polyproline-I helix in proteins:structure and function. Journal of molecular biology 2013, 425 (12), 2100-2132.257. Horng, J. C.; Raines, R. T., Stereoelectronic effects on polyproline conformation. ProteinSci. 2006, 15 (1), 74-83.258. Singh, S. K.; Das, A., The n--*w* interaction: a rapidly emerging non-covalent interaction.Phys. Chem. Chem. Phys. 2015, 17, 9596-9612.259. Newberry, R. W.; Raines, R. T., The n-sc* interaction. Acc. Chem. Res. 2017, 50 (8),1838-1846.260. Bartlett, G. J.; Choudhary, A.; Raines, R. T.; Woolfson, D. N., n-oc* interactions inproteins. Nat. Chem. Biol. 2010, 6 (8), 615-620.

411

261. Shi, Z.; Chen, K.; Liu, Z.; Kallenbach, N. R., Conformation of the backbone in unfoldedproteins. Chem. Rev. 2006, 106 (5), 1877-1897.262. Cubellis, M. V.; Caillez, F.; Blundell, T. L.; Lovell, S. C., Properties of polyproline II, asecondary structure element implicated in protein-protein interactions. Proteins 2005, 58 (4), 880-892.263. Brown, A. M.; Zondlo, N. J., A propensity scale for type II polyproline helices (PPII):aromatic amino acids in proline-rich sequences strongly disfavor PPII due to proline-aromaticinteractions. Biochemistry 2012, 51 (25), 5041-5051.264. Windsor, I. W.; Raines, R. T., A substrate selected by phage display exhibits enhancedside-chain hydrogen bonding to HIV-1 protease. Acta Crystallogr. 2018, D74, 690-694.265. Coates, L.; Tuan, H. F.; Tomanicek, S.; Kovalevsky, A.; Mustyakimov, M.; Erskine, P.;Cooper, J., The catalytic mechanism of an aspartic proteinase explored with neutron and X-raydiffraction. J Am. Chem. Soc. 2008, 130 (23), 7235-7237.266. Weinhold, F., Natural bond orbital analysis: a critical overview of relationships toalternative bonding perspectives. J Comput. Chem. 2012, 33 (30), 2363-2379.267. Weinhold, F.; Landis, C. R., Discovering Chemistry with Natural Bond Orbitals. JohnWiley & Sons: Hoboken, NJ, 2012.268. Choudhary, A.; Fry, C. G.; Kamer, K. J.; Raines, R. T., An n--->* interaction reduces theelectrophilicity of the acceptor carbonyl group. Chem. Commun. 2013, 49, 8166-8168.269. Robertus, J. D.; Kraut, J.; Alden, R. A.; Birktoft, J., Subtilisin; a stereochemical mechanisminvolving transition-state stabilization. Biochemistry 1972, 11, 4293-4303.270. Kraut, J., Serine proteases: structure and mechanism of catalysis. Annu. Rev. Biochem.1977, 46, 331-358.271. Henderson, R.; Wright, C. S.; Hess, G. P.; Blow, D. M., a-Chymotrypsin: what can welearn about catalysis from X-ray diffraction? Cold Spring Harb. Symp. Quant. Biol. 1972, 36, 63-69.272. Bryan, P.; Pantoliano, M. W.; Quill, S. G.; Hsiao, H.-Y.; Poulos, T., Site-directedmutagenesis and the role of the oxyanion hole in subtilisin. Proc. Natl. Acad. Sci. USA 1986, 83,3743-3745.273. Henderson, R., Structure of crystalline a-chymotrypsin. IV. The structure ofindoleacryloyl-a-chymotrypsin and its relevance to the hydrolytic mechanism of the enzyme.Journal of molecular biology 1970, 54, 341-354.274. Simon, L.; Goodman, J. M., Enzyme catalysis by hydrogen bonds: The balance betweentransition state binding and substrate binding in oxyanion holes. J Org. Chem. 2010, 75 (6), 1831-1840.275. Pauling, L., Chemical achievement and hope for the future. Am. Sci. 1948, 36, 51-58.276. Jencks, W. P., Strain and conformation change in enzymatic catalysis. In Current AspectsofBiochemical Energetics, Kaplan, N. 0.; Kennedy, E. P., Eds. Academic Press: New York, NY,1966; pp 273-298.277. Albery, W. J.; Knowles, J. R., Evolution of enzyme function and the development ofcatalytic efficiency. Biochemistry 1976, 15, 5631-5640.278. Lienhard, G. E., Enzymatic catalysis and transition-state theory. Science 1973, 180, 149-154.

412

279. Pauling, L., Nature of forces between large m t4oles nf 1ilngical interest. Nature 1948,161, 707-709.280. Pauling, L., Molecular architecture and biological reactions. Chem. Eng News 1946, 24,1375-1377.281. Kirby, A. J.; Komarov, I. V.; Wothers, P. D.; Feeder, N., The most twisted amide: structureand reactions. Angew. Chem. Int. Ed. 1998, 37, 785-786.282. Mujika, J. I.; Mercero, J. A.; Lopez, Z., Water-promoted hydrolysis of a highly twistedamide: rate acceleration caused by the twist of the amide bond. J. Am. Chem. Soc. 2005, 127, 445-4453.283. Aube, J., A new twist on amide solvolysis. Angew. Chem. Int. Ed. 2012, 51, 3063-3065.284. Pauling, L., In Nature of the Chemical Bond, Cornell University Press: Ithaca, NY, 1939;pp 192-193.285. Pearl, L. H., The catalytic mechanism of aspartic proteases. FEBS Lett. 1987, 214, 8-12.286. Tozser, J.; Blaha, I.; Copeland, T. D.; Wondrak, E. M.; Oroszlan, S., Comparison of theHIV- 1 and HIV-2 proteinases using oligopeptide substrates representing cleavage sites in Gag andGag-Pol polyproteins. FEBS Lett. 1991, 281 (1-2), 77-80.287. Krzemin'ska, A.; Moliner, V.; $widerek, K., Dynamic and electrostatic effects on thereaction catalyzed by HIV-1 protease. J. Am. Chem. Soc. 2016, 138 (50), 16283-16298.288. Ribeiro, A. J. M.; Santos-Martins, D.; Russo, N.; Ramos, M. J.; Fernandes, P. A.,Enzymatic flexibility and reaction rate: A QM/MM study of HIV- 1 protease. ACS Catal. 2015, 5(9), 5617-5626.289. Garrec, J.; Sautet, P.; Fleurat-Lessard, P., Understanding the HIV-1 protease reactivity withDFT: what do we gain from recent functionals? J Phys. Chem. B 2011, 115 (26), 8545-8558.290. Carnevale, V.; Raugei, S.; Piana, S.; Carloni, P., On the nature of the reaction intermediatein the HIV-1 protease: A quantum chemical study. Comput. Phys. Commun. 2008, 179 (1-3), 120-123.291. Calixto, A. R.; Bras, N. F.; Fernandes, P. A.; Ramos, M. J., Reaction mechanism of humanrenin studied by quantum mechanics/molecular mechanics (QM/MM) calculations. ACS Catal.2014, 4 (11), 3869-3876.292. Bras, N. F.; Ramos, M. J.; Fernandes, P. A., The catalytic mechanism of mouse reninstudied with QM/MM calculations. Phys. Chem. Chem. Phys. 2012, 14 (36), 12605-12613.293. Calixto, A. R.; Ramos, M. J.; Fernandes, P. A., Influence of frozen residues on theexploration of the PES of enzyme reaction mechanisms. J. Chem. Theory Comput. 2017, 13 (11),5486-5495.294. Tomasi, J.; Mennucci, B.; Cammi, R., Quantum mechanical continuum solvation models.Chem. Rev. 2005, 105, 2999-3093.295. Gaussian, 2016.296. E. D. Glendening, J., K. Badenhoop, A. E. Reed, J. E. Carpenter, J. A. Bohmann, C. M.Morales, C. R. Landis, and F. Weinhold NBO 6.0, Theoretical Chemistry Institute, University ofWisconsin: Madison, WI, 2013.297. Wendt, M.; Weinhold, F. NBOView 1.1, Theoretical Chemistry Institute, University ofWisconsin-Madison: Madison, WI, 2001.298. Akgun, B.; Hall, D. G., Fast and tight boronate formation for click bioorthogonalconjugation. Angew. Chem. Int. Ed. 2016, 55, 3909-3913.

413

299. Fitzgerald, P. M.; McKeever, B. M.; VanMiddlesworth, J. F.; Springer, J. P.; Helmbach, J.C.; Leu, C.-T.; Herger, W. K.; Dixon, R. A.; Darke, P. L., Crystallographic analysis of a complexbetween human immunodeficiency virus type 1 protease and acetyl-pepstatin at 2.0-A resolution.J Biol. Chem. 1990, 25, 14209-14219.300. Vondrasek, J.; Wlodawer, A., Inhibitors of HIV-1 protease: a major success of structure-assisted drug design. Annu. Rev. Biophys. Biomol. Struct. 1998, 27, 249-284.301. Ripka, A. S.; Rich, D. H., Peptidomimetic design. Curr. Opin. Chem. Biol. 1998, 2 (4),441-452.302. Koh, Y.; Nakata, H.; Maeda, K.; Ogata, H.; Bilcer, G.; Devasamudram, T.; Kincaid, J. F.;Boross, P.; Wang, Y. F.; Tie, Y.; Volarath, P.; Gaddis, L.; Harrison, R. W.; Weber, I. T.; Ghosh,A. K.; Mitsuya, H., Novel bis-tetrahydrofuranylurethane-containing nonpeptidic protease inhibitor(PI) UIC-94017 (TMC 114) with potent activity against multi-PI-resistant humanimmunodeficiency virus in vitro. Antimicrob. Agents Chemother. 2003, 47 (10), 3123-3129.303. de Vera, I. M.; Blackburn, M. E.; Fanucci, G. E., Correlating conformational shift inductionwith altered inhibitor potency in a multidrug resistant HIV-1 protease variant. Biochemistry 2012,51 (40), 7813-7815.304. King, N. M.; Prabu-Jeyabalan, M.; Nalivaika, E. A.; Schiffer, C. A., Combatingsusceptibility to drug resistance: Lessons from HIV-1 protease. Chem. Biol. 2004, 11 (10), 1333-1338.305. Ghosh, A. K.; Sridhar, P. R.; Leshchenko, S.; Hussain, A. K.; Li, J.; Kovalevsky, A. Y.;Walters, D. E.; Wedekind, J. E.; Grum-Tokars, V.; Das, D.; Koh, Y.; Maeda, K.; Gatanaga, H.;Weber, I. T.; Mitsuya, H., Structure-based design of novel HIV-1 protease inhibitors to combatdrug resistance. J. Med Chem. 2006, 49 (17), 5252-61.306. Lefebvre, E.; Schiffer, C. A., Resilience to resistance of HIV-1 protease inhibitors: profileof darunavir. AIDS Rev. 2008, 10 (3), 131-142.307. Nalam, M. N.; Ali, A.; Reddy, G. S.; Cao, H.; Anjum, S. G.; Altman, M. D.; Yilmaz, N.K.; Tidor, B.; Rana, T. M.; Schiffer, C. A., Substrate envelope-designed potent HIV-1 proteaseinhibitors to avoid drug resistance. Chem. Biol. 2013, 20 (9), 1116-1124.308. Tzeli, D.; Theodorakopoulos, G.; Petsalakis, I. D.; Ajami, D.; Rebek, J., Theoretical studyof hydrogen bonding in homodimers and heterodimers of amide, boronic acid, and carboxylic acid,free and in encapsulation complexes. J. Am. Chem. Soc. 2011, 133 (42), 16977-16985.309. Hall, D. G., Boronic Acids: Preparation and Applications in Organic Synthesis, Medicineand Materials, 2nd ed Wiley-VCH: Weinheim, Germany, 2011.310. Kovalevsky, A. Y.; Tie, Y.; Liu, F.; Boross, P. I.; Wang, Y.-F.; Leshchenko, S.; Ghosh, A.K.; Harrison, R. W.; Weber, I. T., Effectiveness of nonpeptide clinical inhibitor TMC- 114 on HIV-1 protease with highly drug resistant mutations D30N, I50V, and L90M. J Med Chem. 2006, 49(4), 1379-1387.311. Perrin, C. L., Are short, low-barrier hydrogen bonds unusually strong? Acc. Chem. Res.2010, 43 (12), 1550-1557.312. Cleland, W. W.; Frey, P. A.; Gerlt, J. A., The low barrier hydrogen bond in enzymaticcatalysis. J. Biol. Chem. 1998, 273, 25529-25532.313. Bader, R. F. W., A quantum theory of molecular structure and its applications. Chem. Rev.1991, 91 (5), 893-928.

414

314. Schitt, B.; Iversen Nadrn (- V - arn P V ric T. C., On the electronicnature of low-barrier hydrogen bonds in enzymatic reactions. Proc. Nati. Acad. Sci. USA 1998, 95(22), 12799-12802.315. pKa, 2018.316. Gilli, G.; Bellucci, F.; Ferretti, V.; Bertolasi, V., Evidence for resonance-assisted hydrogenbonding from crystal-structure correlations on the enol form of the f-diketone fragment. J Am.Chem. Soc. 1989, 111, 1023-1028.317. Grosch, A. A.; van der Lubbe, S. C. C.; Fonseca, G. C., Nature of intramolecular resonanceassisted hydrogen bonding in malonaldehyde and its saturated analogue. J Phys. Chem. A 2018,122, 1813-1820.318. G6ra, R. W.; Maj, M.; Grabowski, S. J., Resonance-assisted hydrogen bonds revisited.Resonance stabilization vs. charge delocalization. Phys. Chem. Chem. Phys. 2013, 15, 2514-2522.319. Gentile, J. M.; Gentile, G. J.; Plewa, M. J., Mutagenicity of selected aniline derivatives toSalmonella following plant activation and mammalian hepatic activation. Mutat. Res. 1987, 188,185-196.320. Bomhard, E. M.; Herbold, B. A., Genotoxic activities of aniline and its metabolites andtheir relationship to the carcinogenicity of aniline in the spleen of rat. Crit. Rev. Toxicol. 2005, 35,783-835.321. Makhdoumi, P.; Limoee, M.; Ashraf, G. M.; Hossini, H., Molecular mechanism of anilineinduced spleen toxicity and neuron toxicity in experimental rat exposure: a review. Curr.Neuropharmacol. 2018, DOI: 10.2174/1570159X16666180803164238.322. Bu, W.; Akama, T.; Chanda, S.; Sullivan, D.; Ciaravino, V.; Jarnagin, K.; Freund, Y.;Sanders, V.; Chen, C.-W.; Fan, X.; Heyman, I.; Liu, L., Early rapid identification of in vivo ratmetabolites of AN6414, a novel boron-containing PDE4 inhibitor by QTRAP LC/MS/MS tosupport drug discovery. J Pharm. Biomed. Anal. 2012, 70, 344-353.323. drug, 2018.324. Surleraux, D. L.; Tahri, A.; Verschueren, W. G.; Pille, G. M.; de Kock, H. A.; Jonckers, T.H.; Peeters, A.; De Meyer, S.; Azijn, H.; Pauwels, R.; de Bethune, M. P.; King, N. M.; Prabu-Jeyabalan, M.; Schiffer, C. A.; Wigerinck, P. B., Discovery and selection of TMC 114, a nextgeneration HIV-1 protease inhibitor. J Med. Chem. 2005, 48 (6), 1813-1822.325. Greco, W. R.; Hakala, M. T., Evaluation of methods for estimating the dissociationconstant of tight binding enzyme inhibitors. J Biol. Chem. 1979, 254, 12104-12109.326. Frisch, M. J.; Trucks, G. W.; Schlegel, H. B.; G. E. Scuseria; Robb, M. A.; Cheeseman, J.R.; Scalmani, G.; Barone, V.; Petersson, G. A.; Nakatsuji, H.; Li, X.; Caricato, M.; Marenich, A.V.; Bloino, J.; Janesko, B. G.; Gomperts, R.; Mennucci, B.; Hratchian, H. P.; Ortiz, J. V.;Izmaylov, A. F.; Sonnenberg, J. L.; D. Williams-Young; Ding, F.; Lipparini, F.; Egidi, F.; Goings,J.; Peng, B.; Petrone, A.; Henderson, T.; Ranasinghe, D.; Zakrzewski, V. G.; Gao, J.; Rega, N.;Zheng, G.; Liang, W.; Hada, M.; Ehara, M.; Toyota, K.; Fukuda, R.; Hasegawa, J.; Ishida, M.;Nakajima, T.; Honda, Y.; Kitao, 0.; Nakai, H.; Vreven, T.; Throssell, K.; J. A. Montgomery, J.;Peralta, J. E.; Ogliaro, F.; Bearpark, M. J.; Heyd, J. J.; Brothers, E. N.; Kudin, K. N.; Staroverov,V. N.; Keith, T. A.; Kobayashi, R.; Normand, J.; Raghavachari, K.; Rendell, A. P.; Burant, J. C.;Iyengar, S. S.; Tomasi, J.; Cossi, M.; Millam, J. M.; Klene, M.; Adamo, C.; Cammi, R.; Ochterski,J. W.; Martin, R. L.; Morokuma, K.; Farkas, 0.; Foresman, J. B.; Fox, D. J., Gaussian 16, RevisionA.03. Gaussian, Inc.: Wallingford CT, 2016.

415

327. Molkanov, K.; Jelsch, C.; Wenger, E.; Stare, J.; Madsen, A. 0.; Kojid-Prodid, B.,Experimental evidence of a 3-centre, 2-electron covalent bond character of the central 0-H-Ofragment on the Zundel cation in crystals of Zundel nitranilate tetrahydrate. CrystEngComm 2017,19 (28), 3898-3901.328. Yedidi, R. S.; Maeda, K.; Fyvie, W. S.; Steffey, M.; Davis, D. A.; Palmer, I.; Aoki, M.;Kaufman, J. D.; Stahl, S. J.; Garimella, H.; Das, D.; Wingfield, P. T.; Ghosh, A. K.; Mitsuya, H.,P2' benzene carboxylic acid moiety is associated with decrease in cellular uptake: evaluation ofnovel nonpeptidic HIV-1 protease inhibitors containing P2 bis-tetrahydrofuran moiety.Antimicrob. Agents Chemother. 2013, 57 (10), 4920-4927.329. Ghosh, A. K.; Rao, K. V.; Nyalapatla, P. R.; Osswald, H. L.; Martyr, C. D.; Aoki, M.;Hayashi, H.; Agniswamy, J.; Wang, Y. F.; Bulut, H.; Das, D.; Weber, I. T.; Mitsuya, H., Designand development of highly potent HIV-1 protease inhibitors with a crown-like oxotricyclic coreas the P2-ligand to combat multidrug-resistant HIV variants. J Med Chem. 2017, 60 (10), 4267-4278.330. Suzuki, A., Cross-coupling reactions of organoboranes: an easy way to construct C-Cbonds (Nobel lecture). Angew. Chem. Int. Ed. 2011, 50, 6723-6737.331. Das, B. C.; Thapa, P.; Karki, R.; Schinke, C.; Das, S.; Kambhampati, S.; Banerjee, S. K.;Van Veldhuizen, P.; Verma, A.; Weiss, L. M.; Evans, T., Boron chemicals in diagnosis andtherapeutics. Future Med Chem. 2013, 5 (6), 653-676.332. Ni, N.; Wang, B., Applications of boronic acids in chemical biology and medicinalchemistry. In Boronic Acids: Preparation and Applications in Organic Synthesis, Medicine andMaterials, Wiley-VCH: Weinheim, Germany, 2011; pp 591-620.333. Hall, D. G.; Akgun, B., Boronic acids as bioorthogonal probes in site-selective labeling ofproteins. Angew. Chem. Int. Ed. 2018, 57, In Press.334. Liu, J.; Lavigne, J. J., Boronic acids in materials chemistry. In Boronic Acids: Preparationand Applications in Organic Synthesis, Medicine and Materials, 2nd ed, Hall, D. G., Ed. Wiley-VCH: Weinheim, Germany, 2011; pp 621-676.335. Brooks, W. L. A.; Sumerlin, B. S., Synthesis and applications of boronic acid-containingpolymers: from materials to medicine. Chem. Rev. 2015, 116 (3), 13 75-1397.336. Li, D.; Chen, Y.; Liu, Z., Boronate affinity materials for separation and molecularrecognition: structure, properties and applications. Chem. Soc. Rev. 2015, 44, 8097-8123.337. Severin, K., Boronic acids as building blocks for molecular nanostructures and polymericmaterials. Dalton Trans. 2009, 5254-5264.338. Trippier, P. C.; McGuigan, C., Boronic acids in medicinal chemistry: anticancer,antibacterial and antiviral applications. MedChemComm 2010, 1 (3), 183-198.339. Adams, J.; Behnke, M.; Chen, S.; Cruickshank, A. A.; Dick, L. R.; Grenier, L.; Klunder, J.M.; Ma, Y.-T.; Plamondon, L.; Stein, R. L., Potent and selective inhibitors of the proteasome:dipeptidyl boronic acids. Bioorg. Med Chem. Lett. 1998, 8, 333-338.340. Adamczyk-Wozniak, A.; Cyranfski, M. K.; Zubrowska, A.; Sporzyfnski, A.,Benzoxaboroles-old compounds with new applications. J Organomet. Chem. 2009, 694 (22),3533-3541.341. Adamczyk-Wozniak, A.; Borys, K. M.; Sporzyn'ski, A., Recent developments in thechemistry and biological applications of benzoxaboroles. Chem. Rev. 2015, 115 (11), 5224-5247.

416

342. Liu, C. T.; Tomsho, J. WV.; Bernkovic, 5. J., The unique chemistry of benzoxaboroles:current and emerging applications in biotechnology and therapeutic treatments. Bioorg. MedChem. 2014, 22 (16), 4462-4473.343. Cox, P. A.; Reid, M.; Leach, A. G.; Campbell, A. D.; King, E. J.; Lloyd-Jones, G. C., Base-catalyzed aryl-B(OH)2 protodeboronation revisited: from concerted proton-transfer to liberationof a transient arylanion. J. Am. Chem. Soc. 2017, 139 (37), 13156-13165.344. Jacobs, R. T.; Nare, B.; Wring, S. A.; Orr, M. D.; Chen, D.; Sligar, J. M.; Jenks, M. X.;Noe, R. A.; Bowling, T. S.; Mercer, L. T.; Rewerts, C.; Gaukel, E.; Owens, J.; Parham, R.;Randolph, R.; Beaudet, B.; Bacchi, C. J.; Yarlett, N.; Plattner, J. J.; Freund, Y.; Ding, C.; Akama,T.; Zhang, Y.-K.; Brun, R.; Kaiser, M.; Scandale, I.; Don, R., SCYX-7158, an orally-activebenzoxaborole for the treatment of stage 2 human African trypanosomiasis. PLoS Negl. Trop. Dis.2011, 5 (6), e1151.345. Labutti, J.; Parsons, I.; Huang, R.; Miwa, G.; Gan, L. S.; Daniels, J. S., Oxidativedeboronation of the peptide boronic acid proteasome inhibitor bortezomib: contributions fromreactive oxygen species in this novel cytochrome P450 reaction. Chem. Res. Toxicol. 2006, 19 (4),539-546.346. Giorgio, M.; Trinei, M.; Migliaccio, E.; Pelicci, P. G., Hydrogen peroxide: a metabolic by-product or a common mediator of ageing signals? Nat. Rev. Mol. Cell Biol. 2007, 8 (9), 722-728.347. Zhong, Q.; Zhang, C.; Zhang, Q.; Miele, L.; Zheng, S.; Wang, G., Boronic prodrug of 4-hydroxytamoxifen is more efficacious than tamoxifen with enhanced bioavailability independentof CYP2D6 status. BMC Cancer 2015, 15 (1), 625.348. Lippert, A. R.; Van de Bittner, G. C.; Chang, C. J., Boronate oxidation as a bioorthogonalreaction approach for studying the chemistry of hydrogen peroxide in living systems. Acc. Chem.Res. 2011, 44 (9), 793-804.349. Hoang, T. T.; Smith, T. P.; Raines, R. T., A boronic acid conjugate of angiogenin thatshows ROS-responsive neuroprotective activity. Angew. Chem. Int. Ed. 2017, 56 (10), 2619-2622.350. Springsteen, G.; Wang, B., A detailed examination of boronic acid-diol complexation.Tetrahedron 2002, 58 (26), 5291-5300.351. Hashimoto, T.; Gailvez, A. 0.; Maruoka, K., In situ assembled boronate ester assisted chiralcarboxylic acid catalyzed asymmetric trans-aziridinations. J. Am. Chem. Soc. 2013, 135, 17667-17670.352. Simmons, S.; Fratini, A.; Benin, V., A sodium salt of the dimer of boronoterephthalic acidanhydride. Acta Crystallogr. 2011, C67 (5), m123-m125.353. Zhang, L.; Cheng, J.; Carry, B.; Hou, Z., Catalytic boracarboxylation of alkynes withdiborane and carbon dioxide by an N-heterocyclic carbene copper catalyst. J Am. Chem. Soc.2012, 134 (35), 14314-14317.354. Thenard, L., Observations sur l'influence de l'eau dans la formation des acides oxigends.Ann. Chim. Phys. 1818, 9, 314-317.355. Sies, H.; Berndt, C.; Jones, D. P., Oxidative stress. Annu. Rev. Biochem. 2017, 86, 715-748.356. Ainley, A. D.; Challenger, F., CCLXXX.-Studies of the boron-carbon linkage. part I. Theoxidation and nitration of phenylboric acid. J Chem. Soc. 1930, (0), 2171-2180.

417

357. Kuivila, H. G.; Armour, A. G., Electrophilic displacement reactions. IX. Effects ofsubstituents on rates of reactions between hydrogen peroxide and benzeneboronic acid. J Am.Chem. Soc. 1957, 79 (21), 5659-5662.358. Larkin, J. D.; Markham, G. D.; Milkevitch, M.; Brooks, B. R.; Bock, C. W., Computationalinvestigation of the oxidative deboronation of boroglycine, H2N-CH2-B(OH)2, using H20 andH202. J Phys. Chem. A 2009, 113 (41), 11028-11034.359. Zou, Y. Q.; Chen, J. R.; Liu, X. P.; Lu, L. Q.; Davis, R. L.; Jorgensen, K.; Xiao, W. J.,Highly efficient aerobic oxidative hydroxylation of arylboronic acids: photoredox catalysis usingvisible light. Angew. Chem. Int. Ed. 2012, 51 (3), 784-788.360. Sikora, A.; Zielonka, J.; Lopez, M.; Joseph, J.; Kalyanaraman, B., Direct oxidation ofboronates by peroxynitrite: mechanism and implications in fluorescence imaging of peroxynitrite.Free Radic. Biol. Med. 2009, 47 (10), 1401-1407.361. Buncel, E.; Urn, I.-H., The a-effect and its modulation by solvent. Tetrahedron 2004, 60,7801-7825.362. Yamamoto, Y.; Matsumura, T.; Takao, N.; Yamagishi, H.; Takahashi, M.; Iwatsuki, S.;Ishihara, K., Fast trigonal/tetragonal interconversion followed by slow chelate-ring closure in thecomplexation of boronic acids. Inorg. Chim. A cta 2005, 358 (12), 3355-3361.363. Matsumura, T.; Iwatsuki, S.; Ishihara, K., Direct kinetic measurements for the fastinterconversion process between trigonal boronic acid and tetragonal boronate ion at lowtemperatures. Inorg. Chem. Commun. 2005, 8 (8), 713-716.364. Ni, N.; Laughlin, S.; Wang, Y.; Feng, Y.; Zheng, Y.; Wang, B., Probing the general timescale question of boronic acid binding with sugars in aqueous solution at physiological pH. Bioorg.Med Chem. 2012, 20 (9), 2957-2961.365. Kuivila, H. G., Electrophilic displacement reactions. III. Kinetics of the reaction betweenhydrogen peroxide and benzeneboronic acid. J Am. Chem. Soc. 1954, 76 (3), 870-874.366. Toyao, T.; Ueno, N.; Miyahara, K.; Matsui, Y.; Kim, T.-H.; Horiuchi, Y.; Ikeda, H.;Matsuoka, M., Visible-light, photoredox catalyzed, oxidative hydroxylation of arylboronic acidsusing a metal-organic framework containing tetrakis(carboxyphenyl)porphyrin groups. Chem.Commun. 2015, 51 (89), 16103-16106.367. Cox, P. A.; Leach, A. G.; Campbell, A. D.; Lloyd-Jones, G. C., Protodeboronation ofheteroaromatic, vinyl, and cyclopropyl boronic acids: pH-rate profiles, autocatalysis, anddisproportionation. J Am. Chem. Soc. 2016, 138 (29), 9145-9157.368. Ellis, G. A.; Palte, M. J.; Raines, R. T., Boronate-mediated biologic delivery. J Am. Chem.Soc. 2012, 134 (8), 3631-3634.369. Winterbourn, C. C.; Metodiewa, D., Reactivity of biologically important thiol compoundswith superoxide and hydrogen peroxide. Free Radic. Biol. Med 1999, 27 (3-4), 322-328.370. Collaboration, H.-C.; Ray, M.; Logan, R.; Steme, J. A.; Hernandez-Diaz, S.; Robins, J. M.;Sabin, C.; Bansi, L.; van Sighem, A.; de Wolf, F.; Costagliola, D.; Lanoy, E.; Bucher, H. C.; vonWyl, V.; Esteve, A.; Casbona, J.; del Amo, J.; Moreno, S.; Justice, A.; Goulet, J.; Lodi, S.; Phillips,A.; Seng, R.; Meyer, L.; Perez-Hoyos, S.; Garcia de Olalla, P.; Hernan, M. A., The effect ofcombined antiretroviral therapy on the overall mortality of HIV-infected individuals. AIDS 2010,24 (1), 123-37.371. Chun, T. W.; Justement, J. S.; Murray, D.; Hallahan, C. W.; Maenza, J.; Collier, A. C.;Sheth, P. M.; Kaul, R.; Ostrowski, M.; Moir, S.; Kovacs, C.; Fauci, A. S., Rebound of plasma

418

viremia fnhllwing oesstio- n nf ntiretrrvniral ther Aspte prnFrunly low leve1 s f HIV

reservoir: implications for eradication. AIDS 2010, 24 (18), 2803-8.372. Hendrickson, C. J.; Pascoe, S. J. S.; Huber, A. N.; Moolla, A.; Maskew, M.; Long, L. C.;Fox, M. P., "My future is bright...I won't die with the cause of AIDS": ten-year patient ARToutcomes and experiences in South Africa. J Int. AIDS Soc. 2018, 21 (10), e25184.373. Siliciano, R. F.; Greene, W. C., HIV latency. Cold Spring Harb. Perspect. Med 2011, 1(1), a007096.374. Kim, Y.; Anderson, J. L.; Lewin, S. R., Getting the "kill" into "shock and kill": strategiesto eliminate latent HIV. Cell Host Microbe 2018, 23 (1), 14-26.375. Vocero-Akbani, A. M.; Heyden, N. V.; Lissy, N. A.; Ratner, L.; Dowdy, S. F., KillingHIV-infected cells by transduction with an HIV protease-activated caspase-3 protein. Nat. Med.1999, 5 (1), 29-33.376. Plainkum, P.; Fuchs, S. M.; Wiyakrutta, S.; Raines, R. T., Creation of a zymogen. Nat.Struct. Biol. 2003, 10 (2), 115-9.377. Johnson, R. J.; Lin, S. R.; Raines, R. T., A ribonuclease zymogen activated by the NS3protease of the hepatitis C virus. FEBS.J 2006, 273 (23), 5457-65.378. Turcotte, R. F.; Raines, R. T., Design and characterization of an HIV-specific ribonucleasezymogen. AIDS Res. Hum. Retroviruses 2008, 24 (11), 1357-63.379. Callis, M.; Serrano, S.; Benito, A.; Laurents, D. V.; Vilanova, M.; Bruix, M.; Ribo, M.,Towards tricking a pathogen's protease into fighting infection: the 3D structure of a stablecircularly permuted onconase variant cleavedby HIV-1 protease. PLoS ONE 2013, 8 (1), e54568.380. Kanno, A.; Yamanaka, Y.; Hirano, H.; Umezawa, Y.; Ozawa, T., Cyclic luciferase for real-time sensing of caspase-3 activities in living mammals. Angew. Chem. Int. Ed. 2007, 46(40), 7595-9.381. Butler, J. S.; Mitrea, D. M.; Mitrousis, G.; Cingolani, G.; Loh, S. N., Structural andthermodynamic analysis of a conformationally strained circular permutant of barase.Biochemistry 2009, 48 (15), 3497-507.382. Evans, T. C., Jr.; Martin, D.; Kolly, R.; Panne, D.; Sun, L.; Ghosh, I.; Chen, L.; Benner, J.;Liu, X. Q.; Xu, M. Q., Protein trans-splicing and cyclization by a naturally split intein from thednaE gene of Synechocystis species PCC6803. J Biol. Chem. 2000, 275 (13), 9091-4.383. Zettler, J.; Schutz, V.; Mootz, H. D., The naturally split Npu DnaE intein exhibits anextraordinarily high rate in the protein trans-splicing reaction. FEBS Lett. 2009, 583 (5), 909-14.384. Bal, H. P.; Batra, J. K., Human pancreatic ribonuclease--deletion of the carboxyl-terminalEDST extension enhances ribonuclease activity and thermostability. Eur. J. Biochem. 1997, 245(2), 465-9.385. Pous, J.; Mallorqui-Fernandez, G.; Peracaula, R.; Terzyan, S. S.; Futami, J.; Tada, H.;Yamada, H.; Seno, M.; de Llorens, R.; Gomis-Ruth, F. X.; Coll, M., Three-dimensional structureof human RNase 1 delta N7 at 1.9 A resolution. Acta Crystallogr. 2001, D57 (Pt 4), 498-505.386. Fisher, B. M.; Ha, J. H.; Raines, R. T., Coulombic forces in protein-RNA interactions:binding and cleavage by ribonuclease A and variants at Lys7, Arg 10, and Lys66. Biochemistry1998, 37(35), 12121-32.387. Fujii, T.; Ueno, H.; Hayashi, R., Significance of the four carboxyl terminal amino acidresidues of bovine pancreatic ribonuclease A for structural folding. J Biochem. 2002, 131 (2),193-200.

419

388. Leland, P. A.; Staniszewski, K. E.; Kim, B. M.; Raines, R. T., Endowing human pancreaticribonuclease with toxicity for cancer cells. J Biol. Chem. 2001, 276 (46), 43095-102.389. Shah, N. H.; Vila-Perello, M.; Muir, T. W., Kinetic control of one-pot trans-splicingreactions by using a wild-type and designed split intein. Angew. Chem. Int. Ed. 2011, 50 (29),6511-5.390. Kelemen, B. R.; Klink, T. A.; Behlke, M. A.; Eubanks, S. R.; Leland, P. A.; Raines, R. T.,Hypersensitive substrate for ribonucleases. Nucleic A cids Res. 1999, 27 (18), 3696-701.391. Eller, C. H.; Lomax, J. E.; Raines, R. T., Bovine brain ribonuclease is the functionalhomolog of human ribonuclease 1. J Biol.Chem. 2014, 289 (38), 25996-6006.392. O'Meara, M. J.; Leaver-Fay, A.; Tyka, M. D.; Stein, A.; Houlihan, K.; DiMaio, F.; Bradley,P.; Kortemme, T.; Baker, D.; Snoeyink, J.; Kuhlman, B., Combined covalent-electrostatic modelof hydrogen bonding improves structure prediction with Rosetta. J Chem. Theory Comput. 2015,11 (2), 609-22.393. Stein, A.; Kortemme, T., Improvements to robotics-inspired conformational sampling inrosetta. PLoS ONE 2013, 8 (5), e63090.394. Gront, D.; Kulp, D. W.; Vernon, R. M.; Strauss, C. E.; Baker, D., Generalized fragmentpicking in Rosetta: design, protocols and applications. PLoS ONE 2011, 6 (8), e23294.395. Wang, C.; Bradley, P.; Baker, D., Protein-protein docking with backbone flexibility.Journal of molecular biology 2007, 373 (2), 503-19.396. Dobson, C. M., Protein folding and misfolding. Science 2003, 426, 884-890.397. Chiti, F.; Dobson, C. M., Protein misfolding, functional amyloid, and human disease. Annu.Rev. Biochem. 2006, 75, 333-366.398. Knowles, T. P.; Vendruscolo, M.; Dobson, C. M., The amyloid state and its associationwith protein misfolding diseases. Nat. Rev. Mol. Cell Biol. 2014, 15 (6), 3 84-396.399. Richardson, S. J., Cell and molecular biology of transthyretin and thyroid hormones. Int.Rev. Cytol. 2007, 258, 137-193.400. Klabunde, T.; Petrassi, H. M.; Oza, V. B.; Raman, P.; Kelly, J. W.; Sacchettini, J. C.,Rational design of potent human transthyretin amyloid disease inhibitors. Nat. Struct. Biol. 2000,7, 312-321.401. Hamilton, J. A.; Benson, M. D., Transthyretin: a review from a structural perspective. Cell.Mol. Life Sci. 2001, 58, 1491-1521.402. Hund, E., Familial amyloidotic polyneuropathy: current and emerging treatment optionsfor transthyretin-mediated amyloidosis. The application of clinical genetics 2012, 5, 37-41.403. Yang, D. T.; Joshi, G.; Cho, P. Y.; Johnson, J. A.; Murphy, R. M., Transthyretin as both asensor and a scavenger of -amyloid oligomers. Biochemistry 2013, 52, 2849-2861.404. Li, X.; Zhang, X.; Ladiwala, A. R. A.; Du, D.; Yadav, J. K.; Tessier, P. M.; Wright, P. E.;Kelly, J. W.; Buxbaum, J. N., Mechanisms of transthyretin inhibition of P-amyloid aggregation invitro. J. Neurosci. 2013, 33, 19423-19433.405. Stein, T. D.; Anders, N. J.; DeCarli, C.; Chan, S. L.; Mattson, M. P.; Johnson, J. A.,Neutralization of transthyretin reverses the neuroprotective effects of secreted amyloid precursorprotein (APP) in APPSW mice resulting in tau phosphorylation and loss of hippocampal neurons:Support for the amyloid hypothesis. J Neurosci. 2004, 24, 7707-7717.406. Celej, M. S., Protein stability induced by ligand binding correlates with changes in proteinflexibility. Protein Sci. 2003, 12, 1496-1506.

420

407. Hingorani, K. S.; Metcalf, /T. C.; Dmig, n. T.; Garm-an, S. . Powers, E T.L. M., Ligand-promoted protein folding by biased kinetic partitioning. Nat. Chem. Biol. 2017, 13,369-371.408. Martinez Molina, D.; Jafari, R.; Ignatushchenko, M.; Seki, T.; Larsson, E. A.; Dan, C.;Sreekumar, L.; Cao, Y.; Nordlund, P., Monitoring drug target engagement in cells and tissuesusing the cellular thermal shift assay. Science 2013, 341 (6141), 84-7.409. Miller, D. W.; Dill, K. A., Ligand binding to proteins: the binding landscape model. ProteinSci. 1997, 6, 2166-2179.410. O'Sullivan, C.; Tompson, F. W., LX.-Invertase: a contribution to the history of anenzyme or unorganised ferment. J Chem. Soc. 1890, 57, 834-93 1.411. Obici, L.; Merlini, G., An overview of drugs currently under investigation for the treatmentof transthyretin-related hereditary amyloidosis. Expert Opin. Investig. Drugs 2014, 23, 1239-1251.412. Nencetti, S.; Orlandini, E., TTR fibril formation inhibition: is there a SAR? Curr. MedChem. 2012, 19, 2356-2379.413. Johnson, S. M.; Wiseman, R. L.; Sekijima, Y.; Green, N. S.; Adamski-Werner, S. L.; Kelly,J. W., Native state kinetic stabilization as a strategy to ameliorate protein misfolding diseases: afocus on the transthyretin amyloidoses. Acc. Chem. Res. 2005, 38 (12), 911-921.414. Adamski-Werner, S. L.; Palaninathan, S. K.; Sacchettini, J. C.; Kelly, J. W., Diflunisalanalogs stabilize the native state of transthyretin. potent inhibition of amyloidogenesis. J. Med.Chem. 2004, 47, 355-374.415. Castano, A.; Helmke, S.; Alvarez, J.; Delisle, S.; Maurer, M. S., Diflunisal for ATTRcardiac amyloidosis. Congest. Heart Fail. 2012, 18 (6), 315-319.416. Scott, L. J., Tafamidis: a review of its use in familial amyloid polyneuropathy. Drugs 2014,74, 1371-1378.417. Bulawa, C. E.; Connelly, S.; DeVit, M.; Wang, L.; Weigel, C.; Fleming, J. A.; Packman,J.; Powers, E. T.; Wiseman, R. L.; Foss, T. R.; Wilson, I. A.; Kelly, J. W.; Labaudiniere, R.,Tafamidis, a potent and selective transthyretin kinetic stabilizer that inhibits the amyloid cascade.Proceedings of the National Academy of Sciences of the United States ofAmerica 2012, 109 (24),9629-9634.418. Sekijima, Y., Recent progress in the understanding and treatment of transthyretinamyloidosis. Journal of clinical pharmacy and therapeutics 2014, 39 (3), 225-23 3.419. Johnson, D. S.; Weerapana, E.; Cravatt, B. F., Strategies for discovering and deriskingcovalent, irreversible enzyme inhibitors. Future Med Chem. 2010, 2 (6), 949-964.420. Baillie, T. A., Targeted covalent inhibitors for drug design. Angew. Chem. Int. Ed. 2016,55, 2-17.421. Miller, R. M.; Taunton, J., Targeting protein kinases with selective and semipromiscuouscovalent inhibitors. Methods Enzymol. 2014, 548, 93-116.422. Johnson, S. M.; Connelly, S.; Fearns, C.; Powers, E. T.; Kelly, J. W., The transthyretinamyloidoses: from delineating the molecular mechanism of aggregation linked to pathology to aregulatory-agency-approved drug. Journal of molecular biology 2012, 421 (2-3), 185-203.423. Choi, S.; Connelly, S.; Reixach, N.; Wilson, I. A.; Kelly, J. W., Chemoselective smallmolecules that covalently modify one lysine in a non-enzyme protein in plasma. Nat. Chem. Biol.2010, 6 (2), 133-139.

421

424. Choi, S.; Ton Ong, D. S.; Kelly, J. W., A stilbene that binds selectively to transthyretin incells and remains dark until it undergoes a chemoselective reaction to create a bright bluefluorescent conjugate. J Am. Chem. Soc. 2010, 132, 16043-16051.425. Suh, E. H.; Liu, Y.; Connelly, S.; Genereux, J. C.; Wilson, I. A.; Kelly, J. W., Stilbenevinyl sulfonamides as fluorogenic sensors of and traceless covalent kinetic stabilizers oftransthyretin that prevent amyloidogenesis. J Am. Chem. Soc. 2013, 135 (47), 17869-17880.426. Grimster, N. P.; Connelly, S.; Baranczak, A.; Dong, J.; Krasnova, L. B.; Sharpless, K. B.;Powers, E. T.; Wilson, I. A.; Kelly, J. W., Aromatic sulfonyl fluorides covalently kineticallystabilize transthyretin to prevent amyloidogenesis while affording a fluorescent conjugate. J Am.Chem. Soc. 2013, 135 (15), 5656-5668.427. Baranczak, A.; Liu, Y.; Connelly, S.; Han Du, W.-G.; Greiner, E. R.; Genereux, J. C.;Wiseman, R. L.; Eisele, Y. S.; Bradbury, N. C.; Dong, J.; Noodleman, L.; Sharpless, K. B.; Wilson,I. A.; Encalada, S. E.; Kelly, J. W., A fluorogenic aryl fluorosulfate for intraorganellar transthyretinimaging in living cells and in Caenorhabditis elegans. J Am. Chem. Soc. 2015, 137 (23), 7404-7414.428. Peters, J. A., Interactions between boric acid derivatives and saccharides in aqueous media:structures and stabilities of resulting esters. Coord. Chem. Rev. 2014, 268, 1-22.429. Adams, J., The development of proteasome inhibitiors as anticancer drugs. Cancer Cell2003, 5, 417-421.430. Antonov, V. K.; Ivanina, T. V.; Berezin, I. V.; Martinek, K., n-Alkylboronic acids asbifunctional reversible inhibitors of c-chymotrypsin. FEBS Lett. 1970, 7, 23-25.431. Kettner, C.; Shenvi, A., Inhibition of the serine proteases leukocyte elastase, pancreaticelastase, cathepsin G, and chymotrypsin by peptide boronic acids. J Biol. Chem. 1984, 259,15106-15114.432. Wang, J.; Wu, W.; Jiang, X., Nanoscaled boron-containing delivery systems andtherapeutic agents for cancer treatment. Nanomedicine 2015, 10 (7), 1149-1163.433. Ma, R.; Shi, L., Phenylboronic acid-based glucose-responsive polymeric nanoparticles:synthesis and applications in drug delivery. Polym. Chem. 2014, 5 (5), 1503.434. Andersen, K. A.; Smith, T. P.; Lomax, J. E.; Raines, R. T., Boronic acid for the tracelessdelivery of proteins into cells. ACS Chem. Biol. 2016, 11, 319-323.435. Wu, X.; Li, Z.; Chen, X. X.; Fossey, J. S.; James, T. D.; Jiang, Y. B., Selective sensing ofsaccharides using simple boronic acids and their aggregates. Chem. Soc. Rev. 2013, 42 (20), 8032-8048.436. Whyte, G. F.; Vilar, R.; Woscholski, R., Molecular recognition with boronic acids-applications in chemical biology. J Chem. Biol. 2013, 6 (4), 161-174.437. Cal, P. M. S. D.; Frade, R. F. M.; Cordeiro, C.; Gois, P. M. P., Reversible lysinemodification on proteins by using functionalized boronic acids. Chem.-Eur. J. 2015, 21, 8182-8187.438. Ak9ay, G.; Belmonte, M. A.; Aquila, B.; Chuaqui, C.; Hird, A. W.; Lamb, M. L.; Rawlins,P. B.; Su, N.; Tentarelli, S.; Grimster, N. P.; Su, Q., Inhibition of Ml-i through covalentmodification of a noncatalytic lysine side chain. Nat. Chem. Biol. 2016, 12, 931-936.439. Bull, S. D.; Davidson, M. G.; van den Elsen, J. M. H.; Fossey, J. S.; Jenkins, A. T. A.;Jiang, Y.-B.; Kubo, Y.; Marken, F.; Sakurai, K.; Zhao, J.; James, T. D., Exploiting the reversible

422

covalent bonding of boronic acids-recognition, sensing, and assembly. Ac. Chem. Res. 2012, 46

(2), 312-326.440. Services, U. S. D. o. H. a. H., Toxicology Profile for Boron. Agency for Toxic Substancesand Disease Registry: Atlanta, GA, 2010.441. Baures, P. W.; Peterson, S. A.; Kelly, J. W., Discovering transthyretin amyloid fibrilinhibitors by limited screening. Bioorg. Med Chem. 1998, 6, 1389-1401.442. Bourgault, S.; Choi, S.; Buxbaum, J. N.; Kelly, J. W.; Price, J. L.; Reixach, N., Mechanismsof transthyretin cardiomyocyte toxicity inhibition by resveratrol analogs. Biochemical andbiophysical research communications 2011, 410 (4), 707-713.443. Johnson, S. M.; Cohen, S. M.; Wilson, I. A.; Kelly, J. W., Toward optimization of thelinker substructure common to transthyretin amyloidogenesis inhibitors using biochemical andstructural studies. J Med. Chem. 2008, 51, 6348-6358.444. Cotrina, E. Y.; Pinto, M.; Bosch, L.; Vila, M.; Blasi, D.; Quintana, J.; Centeno, N. B.;Arsequell, G.; Planas, A.; Valencia, G., Modulation of the fibrillogenesis inhibition properties oftwo transthyretin ligands by halogenation. J Med Chem. 2013, 56 (22), 9110-9121.445. Palaninathan, S. K., Nearly 200 X-ray crystal structures of transthyretin-what do they tellus about this protein and the design of drugs for TTR amyloidoses? Curr. Med Chem. 2012, 19,2324-2342.446. Oza, V. B.; Smith, C.; Raman, P.; Koepf, E. K.; Lashuel, H. A.; Petrassi, H. M.; Chiang,K. P.; Powers, E. T.; Sacchettini, J. C.; Kelly, J. W., Synthesis, structure, and activity of diclofenacanalogues as transthyretin amyloid fibril formation inhibitors. J Med Chem. 2002, 45, 321-332.447. Palaninathan, S. K.; Mohamedmohaideen, N. N.; Orlandini, E.; Ortore, G.; Nencetti, S.;Lapucci, A.; Rossello, A.; Freundlich, J. S.; Sacchettini, J. C., Novel transthyretin amyloid fibrilformation inhibitors: synthesis, biological evaluation, and X-ray structural analysis. PLoS ONE2009, 4 (7), e6290.448. Neumann, P.; Cody, V.; Wojtczak, A., Ligand binding at the transthyretin dimer-dimerinterface: structure of the transthyretin-T4Ac complex at 2.2 A resolution. Acta Crystallogr. 2005,D61, 1313-1319.449. Muzio, T.; Cody, V.; Luft, J. R.; Pangborn, W.; Wojtczak, A., Complex of rat transthyretinwith tetraiodothyroacetic acid refined at 2.1 and 1.8 A resolution. Acta Biochim. Pol. 2001, 48 (4),877-884.450. Johnson, S. M.; Connelly, S.; Wilson, I. A.; Kelly, J. W., Biochemical and structuralevaluation of highly selective 2-arylbenzoxazole-based transthyretin amyloidogenesis inhibitors.J Med Chem. 2008, 51, 260-270.451. Note l, 2017.452. Groll, M.; Berkers, C. R.; Ploegh, H. L.; Ovaa, H., Crystal structure of the boronic acid-based proteasome inhibitor bortezomib in complex with the yeast 20S proteasome. Structure 2006,14 (3), 451-456.453. Bone, R.; Frank, D.; Kettner, C. A.; Agard, D. A., Structural analysis of specificity: a-lyticprotease complexes with analogues of reaction intermediates. Biochemistry 1989, 28 (19), 7600-7609.454. Kawasaki, Y.; Chufan, E. E.; Lafont, V.; Hidaka, K.; Kiso, Y.; Mario Amzel, L.; Freire,E., How much binding affinity can be gained by filling a cavity? Chemical biology & drug design2010, 75 (2), 143-151.

423

455. Eriksson, A. E.; Baase, W. A.; Zhang, X.-J.; Heinz, D. W.; Blaber, M.; Baldwin, E. P.;Matthews, B. W., Response of a protein structure to cavity-creating mutations and its relation tothe hydrophobic effect. Science 1992, 255, 178-183.456. Voth, A. R.; Khuu, P.; Oishi, K.; Ho, P. S., Halogen bonds as orthogonal molecularinteractions to hydrogen bonds. Nat. Chem. 2009, 1, 74-79.457. Riley, K. E.; Merz, K. M., Jr., Insights into the strength and origin of halogen bonding: thehalobenzene-formaldehyde dimer. J Phys. Chem. A 2007, 111 (9), 1688-1694.458. Peters, J. A., Interactions between boric acid derivatives and saccharides in aqueous media:Structures and stabilities of resulting esters. Coord. Chem. Rev. 2015, 268, 1-22.459. Alhamadsheh, M. M.; Connelly, S.; Cho, A.; Reixach, N.; Powers, E. T.; Pan, D. W.;Wilson, I. A.; Kelly, J. W.; Graef, I. A., Potent kinetic stabilizers that prevent transthyretin-mediated cardiomyocyte proteotoxicity. Sci. Transl. Med 2011, 3, 97ra8 1.460. Zhu, C.; Wang, R.; Falck, J. R., Mild and rapid hydroxylation of aryl/heteroaryl boronicacids and boronate esters with N-oxides. Org. Lett. 2012, 14, 3494-3497.461. McCutchen, S. L.; Colon, W.; Kelly, J. W., Transthyretin mutation Leu-55-Prosignificantly alters tetramer stability and increases amyloidogenicity. Biochemistry 1993, 32 (45),12119-12127.462. Grimm, F. A.; Lehmler, H. J.; He, X.; Robertson, L. W.; Duffel, M. W., Sulfatedmetabolites of polychlorinated biphenyls are high-affinity ligands for the thyroid hormonetransport protein transthyretin. Environmental health perspectives 2013, 121 (6), 65 7-662.463. Giraldo, J.; Vivas, N. M.; Vila, E.; Badia, A., Assessing the (a)symmetry of concentration-effect curves: empirical versus mechanistic models. Pharmacology & therapeutics 2002, 95 (1),21-45.464. Miroy, G. J.; Lai, Z.; Lashuel, H. A.; Peterson, S. A.; Strang, C.; Kelly, J. W., Inhibitingtransthyretin amyloid fibril formation via protein stabilization. Proceedings of the NationalAcademy of Sciences of the United States ofAmerica 1996, 93 (26), 15051-15056.465. Otwinowski, Z.; Minor, W., Processing of X-ray diffraction data collected in oscillationmode. Methods Enzymol. 1997, 276, 307-326.

424

signature redactedraineslab.com/sites/default/files/labs/raines/pdfs/thesis_windsor.pdf · in...

Documents