© 2016 robert wilson powell iii

IMPROVING THE THERMOSTABILITY AND INCREASING THE SUBSTRATE RANGE OF OLD YELLOW ENZYME HOMOLOGS AND AMINOLEVULINIC ACID SYNTHASE

THROUGH PROTEIN ENGINEERING

ROBERT WILSON POWELL III

A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT

OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY

UNIVERSITY OF FLORIDA

“To my family and friends”

ACKNOWLEDGMENTS

I would like to acknowledge my faculty advisor Dr. Jon Stewart for his guidance. I

am very grateful to him for the opportunity to do work in his lab. I feel fortunate to have

been able to work on projects which I enjoyed so much. I am very proud of the work we

do in this lab and am grateful to have been able to contribute what I could during my

time. I would like to acknowledge the University of Florida. And, I would like to

acknowledge the NSF for funding my research.

I would like to acknowledge Dr. Bradford Sullivan for his assistance during the

OYE 1 Y196 project. I would like to acknowledge Athena Patterson for her assistance

during the OYE2.6 Y78Xsm / I113Xsm project. I would like to acknowledge Steven

Crichton for all his work during the OYE2.6 Y78Xsm / I113Xsm and OYE2.6 Y78Xsm /

I113C / F247X projects. I would like to acknowledge Dr. Filip Boratynski for all of his

work during the OYE 2.6 thermostability project. I would like to acknowledge Matthew

Burg and Dr. Steven Bruner for all of their help during the OYE 3 crystallization project. I

would like to acknowledge Steven Crichton again for all his work during the mALAS

project. Every one of these people have contributed to these projects in a meaningful

way and I am grateful to them all for it.

I would like to thank Dr. Hillary Lathrop for all her help with the Beckman and

Gilson HPLCs as well as for all her work maintaining our instrument room. I would like

to acknowledge Sarah Franz for all her help with the MSTFA derivatization protocols. I

would like to thank Louis Mouterde for all his work with the acyl-CoA separation method

development. I am grateful for their help running and maintaining the lab. I would like to

thank my family and friends for their support. I would especially like to thank my mom,

Janet Powell, for all of her support during my time here. And, I would also like to thank

Nicole Gibbons for her love and support. I am very thankful for all the help Nicole has

given me during our time here. I am very grateful to have met her and to have been able

to go on this journey with her.

TABLE OF CONTENTS page

ACKNOWLEDGMENTS .................................................................................................. 4

LIST OF TABLES .......................................................................................................... 11

LIST OF FIGURES ........................................................................................................ 12

LIST OF ABBREVIATIONS ........................................................................................... 19

ABSTRACT ................................................................................................................... 20

CHAPTER

1 PROBING POSITION Y196 IN OLD YELLOW ENZYME 1 .................................... 22

Background ............................................................................................................. 22

Isolation and Purification of OYE ...................................................................... 22 Catalytic cycle and substrates of OYE ............................................................. 23 Structure and Mechanism of OYE .................................................................... 24

Project Overview .............................................................................................. 25 Results and Discussion........................................................................................... 25

OYE 1 YI96 Site-Saturation Mutagenesis Library ............................................. 25 Crystal Structure of OYE 1 YI96C .................................................................... 27

Experimental ........................................................................................................... 28 General............................................................................................................. 28 Cloning ............................................................................................................. 29

Construction of plasmids used to make the OYE 1 Y196 library ................ 29 Construction of mutants in the OYE 1 Y196X randomized library .............. 29

Substrates ........................................................................................................ 30 2-(Hydroxymethyl)-cyclopent-2-enone (1) .................................................. 30 2-(Hydroxymethyl)-cyclohex-2-enone (2) ................................................... 31

Methyl 2-(hydroxymethyl)acrylate (3) ......................................................... 31 (S)-(+)-Carvone (7) .................................................................................... 31

(R)-(-)-Carvone (8) ..................................................................................... 32

2-Methyl-2-cyclopenten-1-one (11) ............................................................ 32

2-Methyl-2-cyclohexen-1-one (12) ............................................................. 32 Screening Assay .............................................................................................. 33 Protein Purification and Crystallogenesis of Y196C OYE 1 .............................. 33 The Data Collection and Crystal Structure of OYE 1 Y196 Mutants ................. 35

Conclusions ............................................................................................................ 36

2 IMPROVING THE PRODUCT RANGE OF OYE 1 AND OYE 2.6 THROUGH PROTEIN ENGINEERING ...................................................................................... 61

Background ............................................................................................................. 61

Mutagenesis of the East Side of the Active Site in OYE ................................... 61

Project Summary .............................................................................................. 64 Results and Discussion........................................................................................... 65

OYE 2.6 Y78Xsm / I113Xsm Randomized Library .............................................. 65 OYE 2.6 Y78Xsm / I113Xsm Presequenced Library ........................................... 68 OYE 2.6 Y78Xsm / I113C / F247X Randomized Libraries ................................. 68

Experimental ........................................................................................................... 70 General............................................................................................................. 70

Cloning ............................................................................................................. 70 Construction of plasmids used to make the OYE 2.6 ................................. 70 Construction of the mutants in the OYE 2.6 Y78Xsm / I113Xsm

randomized library .................................................................................. 70 Construction of the mutants in the OYE 2.6 Y78Xsm-I113Xsm

presequenced library .............................................................................. 73 Addressing concatomeric primer inserts .................................................... 74

Construction of mutants in the OYE 2.6 Y78Xsm / I113C / F247X randomized libraries ............................................................................... 74

Substrates ........................................................................................................ 75 Methyl 2-(hydroxymethyl)acrylate (1) ......................................................... 75

2-(Hydroxymethyl)-cyclohex-2-enone (2) ................................................... 76 2-(Hydroxymethyl)-cyclopent-2-enone (3) .................................................. 76

Screening Assay .............................................................................................. 76 Conclusions ............................................................................................................ 77

3 IMPROVING THE THERMOSTABILITY OF OYE 2.6 THROUGH PROTEIN ENGINEERING ....................................................................................................... 95

Background ............................................................................................................. 95

Improving Thermostability through Mutagenesis .............................................. 95 Project Summary .............................................................................................. 97

Results and Discussion........................................................................................... 98 Residues with High Local B-Factors ................................................................. 98 Dimer Interface Residues ............................................................................... 100

Combining Thermostabilizing Mutations ......................................................... 100 Crystallization of OYE 2.6 D141E-S388P ....................................................... 101

Experimental ......................................................................................................... 104 General........................................................................................................... 104

Cloning ........................................................................................................... 104 Construction of the plasmid used as a template for the OYE 2.6 thermal

stability libraries .................................................................................... 104 Construction of OYE 2.6 libraries ............................................................. 105

Substrates ...................................................................................................... 106

2-Methyl-2-cyclopenten-1-one (1) ............................................................ 106 Screening ................................................................................................. 107

The Protein Purification and Crystallization of OYE 2.6 Mutants .................... 109 Data Collection and Structure Solution of OYE 2.6 D141E-S388P ................ 112 B-Factor Data and Statistics ........................................................................... 113

Conclusions .......................................................................................................... 113

4 THE STRUCTURE OF Saccharomyces cerevisiae OLD YELLOW ENZYME 3 ... 135

Background ........................................................................................................... 135

Crystallization of OYE Family Members ......................................................... 135 Project Summary ............................................................................................ 136

Results and Discussion......................................................................................... 137 Crystallization of OYE 3.................................................................................. 137 Data Reduction and Structure Solution .......................................................... 139

OYE 3 W116 Site Saturation Mutagenesis ..................................................... 143 X-Ray Crystallography Studies of OYE 3 W116 Mutants and Related

Proteins ....................................................................................................... 144

Experimental ......................................................................................................... 146 General........................................................................................................... 146 Cloning ........................................................................................................... 146

Construction of plasmid used for libraries ................................................ 146 Construction of an OYE 3 W116 site-saturation mutagenesis library ....... 147

Testing of Phenol Binding to OYE 3 ............................................................... 149 GST-OYE 3 Fusion Protein Purification and Crystallogenesis ....................... 149 Native OYE 3 Protein Purification and Crystallogenesis ................................ 150

Alkene Substrate for OYE 3 ........................................................................... 154 2-(Hydroxymethyl)-cyclopent-2-enone (1) ................................................ 154

2-(Hydroxymethyl)-cyclohex-2-enone (2) ................................................. 154 (R)-Pulegone (3) ...................................................................................... 155 (S)-(+)-Carvone (4) .................................................................................. 155 (R)-(-)-Carvone (5) ................................................................................... 156

2-Methyl-2-cyclopenten-1-one (11) .......................................................... 156

2-Methyl-2-cyclohexen-1-one (12) ........................................................... 156 3-Methyl-cyclohexen-1-one (13) .............................................................. 157

3-Ethyl-cyclohexen-1-one (14) ................................................................. 157 3-Methyl-cyclopenten-1-one (15) ............................................................. 157 4-Ethyl-4-methyl-2-cyclohexen-1-one (21) ............................................... 158

4-Isoproply-4-methyl-2-cyclohexen-1-one (22) ........................................ 158 4,4-Diethyl-2-cyclohexen-1-one (23) ........................................................ 158 Spiro[5.5]undec-1-en-3-one (24) .............................................................. 159 2-Butylidenecyclohexanone (25) .............................................................. 159

Screening ....................................................................................................... 159 Conclusions .......................................................................................................... 160

5 IMPROVING THE SUBSTRATE RANGE OF AMINOLEVULINIC ACID SYNTHASE THROUGH PROTEIN ENGINEERING............................................. 190

Background ........................................................................................................... 190

Positions of Interest ........................................................................................ 191 Threonine 148 .......................................................................................... 191

Isoleucine 151 .......................................................................................... 191

Arginine 85 ............................................................................................... 192

The Glycine Loop ........................................................................................... 192 Project Overview ............................................................................................ 193

Results and Discussion......................................................................................... 195 Detecting δ-AL-pyrrole Compounds with Ehrlich’s Reagent ........................... 195 Preparation and Detection of Succinyl-CoA ................................................... 196 In Situ Succinyl-CoA Formation ...................................................................... 196

Detecting Amino Products Using PITC and MSTFA Derivatives .................... 197

mALAS R85, T148 and I151 Site-Saturation Mutagenesis Libraries .............. 199 Experimental ......................................................................................................... 199

General........................................................................................................... 199 Cloning ........................................................................................................... 200

Construction of plasmid used to make ALAS libraries ............................. 200

Construction of ALAS libraries ................................................................. 200 Preparation and Detection of Succinyl-CoA ................................................... 202

Succinyl-CoA Regeneration System .............................................................. 203

Detecting δ-AL-Pyrrole Compounds with Ehrlich’s Reagent .......................... 204 Derivatizing Amino Acids Using PITC ............................................................ 205 Detecting PITC Amino Acid Derivatives by HPLC .......................................... 206

Derivatizing Amino Acids Using MSTFA......................................................... 206 Detecting MSTFA Amino Acid Derivatives Using GC-MS .............................. 207

Conclusions .......................................................................................................... 207

APPENDIX

A LIST OF PRIMERS ............................................................................................... 223

Chapter 1 Primers ................................................................................................. 223 Chapter 2 Primers ................................................................................................. 224

Chapter 3 Primers ................................................................................................. 225 Chapter 4 Primers ................................................................................................. 226

Chapter 5 Primers ................................................................................................. 228

B MUTAGENIC PLASMIDS ..................................................................................... 232

C PLASMID SEQUENCES ....................................................................................... 233

Sequence of pET3b-OYE1 ................................................................................... 233

Sequence of pBS2 ................................................................................................ 235 Sequence of pFB1 ................................................................................................ 238 Sequence of pRP4 ................................................................................................ 241

Sequence of pGF23 .............................................................................................. 243

D GC AND HPLC METHODS .................................................................................. 246

AZW2.Meth ........................................................................................................... 246 AZW3.Meth ........................................................................................................... 246 BTS2.Meth ............................................................................................................ 247

BTS3.Meth ............................................................................................................ 247

BTS4.Meth ............................................................................................................ 248 BTS7.Meth ............................................................................................................ 248

BTS8.Meth ............................................................................................................ 249 FB1.Meth .............................................................................................................. 249 JON.Meth .............................................................................................................. 250 SEF.Meth .............................................................................................................. 250 YAP.Meth .............................................................................................................. 251

LMM.Meth ............................................................................................................. 251 RWP2.Meth .......................................................................................................... 252

E PLASMID MAPS ................................................................................................... 253

pET3b-OYE .......................................................................................................... 253 pBS2 ..................................................................................................................... 253 pFB1 ..................................................................................................................... 254

pRP4 ..................................................................................................................... 254 pGF23 ................................................................................................................... 255

LIST OF REFERENCES ............................................................................................. 256

BIOGRAPHICAL SKETCH .......................................................................................... 263

LIST OF TABLES

Table page 2-1 List of the presequenced alkene reductase libraries ........................................... 79

2-2 OYE 2.6 best variants discovered during the OYE 2.6 ISM studies ................... 80

2-3 Q scores for NNK randomized libraries. ............................................................. 80

3-1 Crystallographic data collection and refinement statistics ................................ 116

3-2 Q scores for NNK randomized libraries. ........................................................... 117

4-1 Crystallographic data collection and refinement statistics. ............................... 162

4-2 Crystallographic data collection and refinement statistics. ............................... 163

4-3 Distances between the β-carbon of each active site residue to the ligand ....... 164

5-1 Retention times of PIT-amino acid derivative standards from HPLC ................ 209

A-1 List of mutagenic primers for chapter 1. ........................................................... 223

A-2 List of mutagenic and sequencing primers for chapter 2 .................................. 224

B-1 Mutagenic plasmids used in this study. ............................................................ 232

LIST OF FIGURES

Figure page 1-1 The catalytic cycle of G6PDH investigated by Warburg and Christian ............... 38

1-2 The catalytic cycle of OYE 1 established by Massey and Vas ........................... 38

1-3 List of OYE 1 substrates from the literature ........................................................ 39

1-4 FMN Diagram displaying the FMN environment in the active site of OYE 1 ....... 50

1-5 Catalytic mechanism of OYE 1 ........................................................................... 50

1-6 OYE substrate binding modes ............................................................................ 51

1-7 OYE 1 active site diagram .................................................................................. 52

1-8 List of Baylis-Hillman substrates screened by OYE 1 Y196 library ..................... 53

1-9 List of carvone substrates screened by OYE 1 Y196 library ............................... 54

1-10 List of screening substrates screened by OYE 1 Y196 library ............................ 55

1-11 Calculations for obtaining a Qscore of a pooled plasmid mix from a NNK primer mix using data from a theoretical sequencing electropherogram ....................... 56

1-12 OYE 1 Y196 library screening results for 1 ......................................................... 57

1-17 OYE 1 Y196 library screening results for 11 ....................................................... 59

1-18 OYE 1 Y196 library screening results for 12 ....................................................... 60

1-19 Alignment of OYE 1 wt. and OYE 1 Y196C ........................................................ 60

2-1 Flipped binding mode ......................................................................................... 81

2-2 List of Chapter 2 substrates ................................................................................ 82

2-3 (S)-(+)-carvone bound in a flipped binding mode to the active site of OYE 1 W116I ................................................................................................................. 83

2-4 Mechanism of OYE 2.6 ....................................................................................... 83

2-5 Diagram of the residues in the OYE 2.6 active site ............................................ 84

2-6 Malonate bound in the active site of OYE 2.6 Y78W-I113C ............................... 84

2-7 Substrate 1 and 2 modeled into the active of OYE 2.6 Y78W and OYE 2.6 wt .. 85

2-8 Sequence alignment of OYE 2.6 with a sample containing primer inserts .......... 85

2-9 Best variants discovered during both the ISM project and this project using a small residue matrix for obtaining the (R)-5 product from substrate 2 ................ 86

2-10 OYE 2.6 Y78Xsm / I113Xsm library screening results for substrate 2 ................ 87

2-11 OYE 2.6 Y78A / I113C / F247X screening results for substrate 2 ...................... 88

2-12 OYE 2.6 Y78C / I113C / F247X screening results for substrate 2 ...................... 89

2-13 OYE 2.6 Y78G / I113C / F247X screening results for substrate 2 ...................... 90

2-14 OYE 2.6 Y78S / I113C / F247X screening results for substrate 2 ...................... 91

2-15 OYE 2.6 Y78T / I113C / F247X screening results for substrate 2....................... 92

2-16 OYE 2.6 Y78V / I113C / F247X screening results for substrate 2 ...................... 93

2-17 Calculations for obtaining a Qscore of a pooled plasmid mix from a KST primer mix using data from a theoretical sequencing electropherogram ....................... 94

3-1 The fraction of B-factors for each position over the average B-factors of all positions in the structure ................................................................................... 118

3-2 The relative B-factors of all three published OYE 2.6 wild type structures ....... 118

3-3 The positions targeted during the ISM thermostability project, their region in the protein, and their B-factors for structure 3TJL ............................................ 119

3-4 The positions targeted during the local maximum project, their region in the protein, and their B-factors for structure 3TJL .................................................. 119

3-5 The B-factor values for the positions targeted during the local maximum project ............................................................................................................... 120

3-6 The positions selected for mutagenesis during the dimer interface project ...... 120

3-7 The results for the small scale screening of the OYE 2.6 S388 library testing all 19 possible replacements with substrate 1 .................................................. 121

3-8 The results of screening the OYE 2.6 E41X NNK randomized library .............. 121

3-9 The results of screening the OYE 2.6 D141X NNK randomized library ............ 122

3-10 The results of screening the OYE 2.6 E145X NNK randomized library ............ 122

3-11 The results of screening the OYE 2.6 K330X NNK randomized library ............ 123

3-12 The results of screening the OYE 2.6 I214X NNK randomized library .............. 123

3-13 The results of screening the OYE 2.6 W244X NNK randomized library ........... 124

3-14 The results of screening the OYE 2.6 L260X NNK randomized library ............. 124

3-15 The results of screening the OYE 2.6 F307X NNK randomized library ............ 125

3-16 The results of screening the OYE 2.6 I311X NNK randomized library .............. 125

3-17 The results from the large scale screening assay............................................. 126

3-18 The results of the best mutants from all ten ISM-libraries ................................. 127

3-19 The results of the best mutants from OYE 2.6 E41X ........................................ 127

3-20 The results of the best mutants from OYE 2.6 D141X ...................................... 128

3-21 The results of the best mutants from OYE 2.6 E145X ...................................... 128

3-22 The results of the best mutants from all projects .............................................. 129

3-23 The positions targeted in both the ISM thermostability and local maximum projects ............................................................................................................. 130

3-24 The relative B-factor fractions from the OYE 2.6 D141E-S388P structure ....... 130

3-25 The relative B-factor fraction of all OYE 2.6 positions along the 3UPW structure ........................................................................................................... 131

3-26 The relative B-factor fraction of all OYE 2.6 positions along the 3TJL structure ........................................................................................................... 131

3-27 The B-factor fractions for all three published OYE 2.6 wt. structures ............... 132

3-28 The B-factor fractions of both the three OYE 2.6 wt. structures and the OYE 2.6 D141E-S388P structure .............................................................................. 132

3-29 The relative B-factor fractions of OYE 2.6 wt. in structure 3TJL ....................... 133

3-30 The relative B-factor fractions of OYE 2.6 D141E-S388P structure .................. 133

3-31 The regeneration system used to make NADPH which reduces the FMN of OYE and allows the protein to turnover ............................................................ 134

4-1 Schematic illustration of the FMN environment in the active site of OYE homologs .......................................................................................................... 165

4-2 The mechanism of OYE 3 ................................................................................ 165

4-3 Diagram of the positions in the active site of OYE homologs ........................... 166

4-4 Loop 6 in OYE homologs .................................................................................. 167

4-5 List of OYE 3 substrates and reported conversion from the literature .............. 168

4-6 First set of substrates and theoretical binding mode products .......................... 177

4-7 Second set of substrates and theoretical binding mode products..................... 178

4-8 Third set of substrates and theoretical binding mode products......................... 179

4-9 The reactions used to test phenol binding by OYE 3 ........................................ 180

4-10 Crystals and crystallization conditions for of OYE 1, OYE 2.6, and OYE 3 ...... 180

4-11 The structure of OYE 3 ..................................................................................... 181

4-12 The active site for both OYE 1 and OYE 3 with bound FMN and substrate ..... 181

4-13 The active site for both OYE 3 and OYE 3 W116V with bound FMN and p-HBA .................................................................................................................. 182

4-14 The active site for both OYE 1 and OYE 1 F296S with bound FMN and p-HBA .................................................................................................................. 182

4-15 Results from screening the OYE 3 W116 site-saturation library against substrate 1 ........................................................................................................ 183

4-20 Results from screening the OYE 3 W116 site-saturation library against substrate 11 ...................................................................................................... 185

5-1 The reaction of glycine and succinyl-CoA to make δ-AL using ALAS as a catalyst ............................................................................................................. 210

5-2 The proposed mechanism of ALAS .................................................................. 211

5-3 The active site of ALAS from R. capsulatus with glycine bound to PLP ........... 212

5-4 The active site of ALAS from R. capsulatus with succinyl-CoA ........................ 212

5-5 Reaction scheme of the derivatizing of δ-AL-pyrrole with Ehrlich’s reagent ..... 213

5-6 Reaction scheme of succinic anhydride with CoA to make succinyl-CoA ......... 213

5-7 The coupling of ALAS production of CoA from succinyl-CoA to α-Ketoglutarate Dehydrogenase production of NADH from NAD+ ....................... 214

5-8 Derivatization of amino acids using PITC ......................................................... 215

5-9 Derivatization of amino acids using MSTFA ..................................................... 215

5-10 The results for the reaction of δ-AL-pyrrole with Ehrlich’s reagent ................... 215

5-11 The results for the reaction of δ-AL-pyrrole with Ehrlich’s reagent using a plate reader ...................................................................................................... 216

5-12 The reaction of succinyl-CoA with hydroxyl amine to displace the CoA. .......... 217

5-13 HPLC results from the reaction of succinic anhydride and CoA to make succinyl-CoA..................................................................................................... 217

5-14 HPLC results from the synthesis of succinyl-CoA from succinic anhydride and CoA co-eluted with 10x CoA ...................................................................... 218

5-15 HPLC results from the reaction of succinyl-CoA with hydroxylamine ............... 219

5-16 Results of the succinyl-CoA regeneration system ............................................ 220

5-17 Results of reactions from the succinyl-CoA regeneration system using mALAS and mALAS R433K ............................................................................. 221

5-18 Results of amino acid derivatization with MSTFA ............................................. 222

D-1 AZW2.Meth....................................................................................................... 246

D-2 AZW3.Meth....................................................................................................... 246

D-3 BTS2.Meth........................................................................................................ 247

D-4 BTS3.Meth........................................................................................................ 247

D-5 BTS4.Meth........................................................................................................ 248

D-6 BTS7.Meth........................................................................................................ 248

D-7 BTS8.Meth........................................................................................................ 249

D-8 FB1.Meth .......................................................................................................... 249

D-9 JON.Meth ......................................................................................................... 250

D-10 SEF.Meth ......................................................................................................... 250

D-11 YAP.Meth ......................................................................................................... 251

D-12 LMM.Meth......................................................................................................... 251

D-13 RWP2.Meth ...................................................................................................... 252

E-1 pET3b-OYE ...................................................................................................... 253

E-2 pBS2 ................................................................................................................. 253

E-3 pFB1 ................................................................................................................. 254

E-4 pRP4 ................................................................................................................ 254

E-5 pGF23 .............................................................................................................. 255

LIST OF ABBREVIATIONS

FPLC Fast Protein Liquid Chromatography

GC-FID Gas Chromatography Flame Ignition Detector

GC-MS Gas Chromatography Mass Spectrometry

HPLC High Pressure Liquid Chromatography

RMSD Root Mean Square Deviation

Y78X Y78 represents tyrosine at position 78 in the sequence of a protein. The X represents a random set of amino acids at that position that can include any, of the 20 amino acids.

Y78XKST Y78 represents tyrosine at position 78 in the sequence of a protein. The XKST represents a mutation at that position to a set of random amino acids which were obtained from a KST codon mix: alanine, cysteine, glycine, and serine.

Y78Xsm Y78 represents tyrosine at position 78 in the sequence of a protein. The Xsm represents a mutation at that position to a set of random amino acids which include the relatively small amino acids: alanine, cysteine, glycine, serine, threonine, and valine.

Abstract of Dissertation Presented to the Graduate School of the University of Florida in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy

IMPROVING THE THERMOSTABILITY AND INCREASING THE SUBSTRATE RANGE OF OLD YELLOW ENZYME HOMOLOGS AND AMINOLEVULINIC ACID SYNTHASE

Robert Wilson Powell III

December 2016

Chair: Jon Dale Stewart Major: Chemistry

We examined the consequences of mutating the general acid at position Y196 in

OYE 1. We screened a library of OYE 1 Y196 site-saturation mutants against substrates

of interest and discovered that only the cysteine variant gave an interesting result. We

then crystalized this mutant to determine how a cysteine was acting as a general acid.

We discovered that it was definitely not properly positioned, and the identity of the

actual general acid remains unknown.

We also examined an OYE homolog, OYE 2.6. Previous protein engineering

done by our group, as well as revelations from the crystal structure, gave us new insight

into ways to target the active site for mutagenesis. By making a matrix of small amino

acids mutantations at two neighboring positions, we hoped to open up the active site

and make it more amenable to alpha substituted 6 membered rings. We found that a

third mutation on the opposite side of the active site would give further improvement.

We then set out to improve the thermostability of OYE 2.6 by directed evolution.

We chose positions within OYE 2.6 that had high B-values in the crystal structure for

mutagenesis. After successfully increasing the thermostability of OYE 2.6, we then set

out to prove our hypothesis by crystallizing the thermostable mutant and comparing the

B-values at the mutated position. We found that we had indeed lowered the B-factor

value at the position which we mutated.

We next examined OYE 3 as a biocatalyst. We made a library of OYE 3 W116

mutants and screened it against several substrates of interest. We then set out to solve

the crystal structure of OYE 3 so that we could more appropriately examine the

structure of our mutants in complex with our substrates.

Lastly, we waded in mutagenesis of a PLP-dependent enzyme, mALAS. We

made three libraries of mALAS targeting positions T148, I151, and R85. We hoped that

these positions would be malleable positions in the active site that would allow us to

expand the substrate range of mALAS.

CHAPTER 1 PROBING POSITION Y196 IN OLD YELLOW ENZYME 1

Background

Isolation and Purification of OYE

Old Yellow Enzyme (OYE) was first isolated by Warburg and Christian from

brewers’ bottom yeast (Saccharomyces carlsbergensis; subsequently re-named

Saccharomyces pastorianus) in 19321 while studying the oxidation of glucose-6-

phosphate (G-6-P) to 6-phospho-D-glucono-1,5-lactone. Glucose-6-phosphate

dehydrogenase (G6PDH) oxidized G-6-P using NADP+, which produced NADPH. OYE

was responsible for oxidizing NADPH by reducing O2 to H2O2 (Figure 1-1).2 The enzyme

was termed a ‘gelbe ferment’, or a yellow ferment. In 1938, Warburg and Has

discovered a second type of ‘gelbe ferment,’ leading them to designate the original

‘gelbe ferment’ as the ‘old yellow enzyme’ (OYE)3 and the enzyme has been known by

this name ever since. In 1955, Theorell and Åkeson purified the enzyme from the lysate

of brewers’ bottom yeast using a series of organic solvents and then crystallized the

protein using ammonium sulfate precipitation.4

Since its original purification in 1955, OYE has been the subject of significant

study, particularly by the late Professor Vincent Massey. In 1968, Massey and Matthews

discovered that after purification, oxidized OYE formed a charge transfer complex with

an unidentified “green forming compound”.5 The “green forming compound” lost affinity

for OYE when the FMN cofactor was reduced by sodium dithionite and removing this

“green forming compound” by dialysis further improved purification of the enzyme.

Massey and Abramovitz would later use phenol as a ‘green forming compound’ to bind

to the oxidized enzyme on a phenol column as the basis for a very efficient affinity

purification method for OYE.6 After binding, OYE could be eluted by in situ sodium

dithionite reduction. This affinity purification strategy significantly improved the isolation

of OYE and the same phenol columns are still widely used today for purification of OYE

and its homologs.

Several OYE isoforms exist in brewers’ bottom yeast and these complicated

efforts to obtain diffraction-quality crystals. In 1991, Massey and Saito solved this

problem by isolating the gene encoding OYE from Saccharomyces carlbergensis and

cloning it into an E. coli expression plasmid (pET3b).7 Subsequently, two additional

OYE-encoding genes were cloned from Saccharomyces cerevisiae, leading the initial

gene to be designated as OYE 1. Massey and Stott cloned and overexpressed OYE 2

in 19938 and OYE 3 was cloned and overexpressed in 1995 by Massey and Niino.9

Catalytic cycle and substrates of OYE

During their experiments to purify OYE through crystallization in 1955, Theorell

and Åkeson discovered that OYE had a flavin mononucleotide (FMN) cofactor.4 Later

studies by Massey et al. investigated the oxidative half reaction of OYE. These spectral

studies of the OYE charge transfer complex established the catalytic cycle of OYE and

have opened the way for it to be used as a biocatalyst ever since (Figure 1-2).8,10,11

While NADPH appears to be the physiological reductant for OYE, the native partner for

flavin re-oxidation has never been discovered.

Several studies have been performed with non-native flavin re-oxidation

substrates for OYE 1, some of the first involving quinones tested by Massey and Stott

during spectral studies.8 This suggested that OYE 1 could reduce electron-deficient

alkenes at the expense of NADPH, which touched off an avalanche of interest in using

this enzyme for asymmetric chemical synthesis. A vast array of alkene substrates have

been tested with OYE 1. These include many α,β-unsaturated ketones, aldehydes,10–14

nitroalkenes,15–18 and even alkynes (Figure 1-3).19

Structure and Mechanism of OYE

Though OYE 1 was first purified and crystallized from lysate by Theorell and

Åkeson in 1955, its three-dimensional structure was not determined until 1994 when

Karplus and Fox successfully obtained diffraction quality crystals of OYE 1.4,20 The

overall structure of OYE 1 features an α/β-barrel (TIM barrel) with the active site buried

in the barrel along with the FMN cofactor. Within the active site, T37, G72, Q114, and

R243 form hydrogen bonds with the FMN and lock it into place (Figure 1-4). In 1998,

Massey and Brown used site directed mutagenesis and a series kinetic experiments to

establish the role of H191 and N194.21 These amino acids stabilize the oxyanion

intermediate prior to its protonation. In the same year, Massey and Kohli also examined

the role of Y196. Site directed mutagenesis was used to prepare a phenylalanine

substitution and the kinetic properties of both it as well as the wild type protein were

compared. Interestingly, the Y196F mutation had little effect on ligand binding; however,

the oxidative half reduction was nearly 6 orders of magnitude slower than the wild-

type.11 This study helped establish tyrosine as the general acid in the catalytic

mechanism of OYE 1 (Figure 1-5).

Our group has investigated the potential of OYE 1 as a biocatalyst.13,22,23 Much of

our work has been focused on preparing and testing various OYE 1 mutants for their

potential to accept synthetically interesting substrates. Amino acids mentioned above

are essential for FMN or substrate binding and / or catalysis and for this reason, were

not subjected to mutagenesis (Figure 1-7). On the other hand, we found that W116

accepted a wide variety of substitutions, some of which allowed for alternative (“flipped”)

substrate binding modes that led to opposite stereopreference (Figure 1-6).22–25

Project Overview

While Y196 was shown to be critical for the oxidation half-reaction and

associated with oxyanion protonation (Figure 1-5), we hypothesized that other residues

with even greater acidity might yield more efficient OYE 1 variants. We therefore

explored site-saturation mutagenesis of Y196 in OYE 1. A randomized library variants at

this position was created and screened against two alkene substrates (11 and 12 shown

in Figure 1-10). After finding catalytically active members in the library, plasmids were

sequenced and the final variant not already present was added. The complete Y196

site-saturation mutagenesis library was then screened against a broader panel of

substrates (Figures 1-8 through 1-10). These efforts revealed that the Y196C mutant

was catalytically active against a subset of the substrate collection. This was

reminiscent of the Y193C variant of Pichia stipitis OYE 2.6, a fascinating, if inconsistent,

mutant discovered in a parallel project by Adam Walton. To determine how a cysteine at

position 196 of OYE 1 could lead to substrate protonation, we determined the crystal

structure of this variant. We hoped that the information from this structure might shed

light on the analogous Y193C mutant of P. stipitis OYE 2.6 that has stubbornly resisted

all efforts at crystallization.

Results and Discussion

OYE 1 YI96 Site-Saturation Mutagenesis Library

We chose to prepare the site-saturation library en masse, deferring DNA

sequencing until it became clear that one or more active variants were actually present.

To this end, a randomized library of OYE 1 Y196 was made using a pair of primers with

a NNK mix at the codon for Y196 (N = any base, K = G / T). This base doping scheme

encompasses 32 different codons that includes all 20 amino acids plus one stop codon.

Using a variation of a cloning method reported by Zheng et al. that our group developed

for cloning NNK randomized libraries,26,27 PCR was performed and a pooled plasmid

sample obtained from transformants was sequenced. Using a method for evaluating

libraries developed in our lab by Sullivan and Walton,26 the plasmid mix was evaluated

for codon degeneracy. Sequencing of this pooled sample revealed a plasmid mix a

codon mix that gave a Q score of 0.85 (Figure 1-11), which implied that all 19 mutants

could be obtained with subsequent transformation into an E. coli overexpression strain.

A randomized library of 95 individual OYE 1 Y196 mutants (plus the wild-type control)

was assembled from transformants derived from the pooled plasmid sample. Each

variant was screened against 2-methyl-2-cyclohexen-1-one (substrate 12) to detect

catalytic activity. Plasmid DNAs from hits in this initial screen were sequenced to

determine the codon at position 196. The best variant was Y196C. This was fascinating

since an analogous mutant from P. stipitis OYE 2.6 (Y193C) also proved to be the

optimal substitution for the catalytic Tyr side-chain.28

These initial results were sufficiently promising to justify sequencing the

randomized library to determine whether all of the position 196 variants were actually

present and identify any that needed to be added to complete the set. These efforts

revealed 18 / 19 mutants, with only OYE 1 Y196W being absent. The original Q score

predicted that all 19 variants were present in the plasmid mix used to transform the E.

coli overexpression strain. Our obtaining 18 mutants was therefore very encouraging,

further underscoring the value and accuracy of our simple library quality control

determination. The missing OYE 1 Y196W mutant was made by standard methods,

then and the complete library was then used to screen other substrates (Figures 1-12

through 1-18). Unfortunately, none of the Y196 variants improved conversion for either

the Baylis-Hillman adducts (1 – 3) or either carvone enantiomer (7 and 8), with no

conversion for the former and minimal conversion for the latter. Also, no mutant gave

any activity against substrate 11, a screening substrate containing a 5-membered ring.

As expected from our initial screening efforts, we observed minor conversion for

substrate 12. The best variant (Y196C) gave racemic reduction product from 12 (wild-

type gives >98% ee (R)), although the conversion was only 20%.

Crystal Structure of OYE 1 YI96C

Given the difference in spatial locations between the acidic protons of a Tyr and

Cys residue, it was not clear how the Y196C mutant retained catalytic activity. We

therefore crystallized this OYE 1 variant in the hopes that the structure would provide

insight. 4-Hydroxybenzaldehyde (p-HBA) is an OYE inhibitor that binds strongly and can

provide information on the active site environment. We therefore soaked the mutant

crystals with p-HBA.

The OYE 1 Y196C mutant structure was near identical to the OYE 1 wt. structure

(1OYB) with a RMSD of 0.29 Å. The active site aligned nearly completely as well, with

the only difference being the Y196C mutation. Surprisingly, the structure revealed that

cysteine was not properly placed to participate directly in the alkene reduction step, as

its thiol moiety was directed away from the active site (Figure 1-19). In fact, the β-

carbon of the cysteine was 0.6 Å further than the tyrosine to the C2 of the p-HBA and

the sulfur was 5.0 Å further than the oxygen of the tyrosine. The thiol did not form a

disulfide bridge with a neighboring residue nor did it have an alternate oxidation state

like sulfenic (R-SOH), sulfinic (R-SO2H), or sulfonic acid (R-SO3H). This implies that a

general acid other than the Cys side-chain at position 196 acts as the general acid. The

actual proton source remains unknown, although it is intriguing that alkene 12 was

reduced to racemic product, consistent with solvent protonation after dissociation of the

enol(ate). On the other hand, this does not explain why the structurally analogous

carvone enantiomers were not accepted by the Y196C variant. While OYE is able to

tolerate the cysteine substitution, this change seriously impairs its function.

Experimental

General

Restriction endonucleases, Phusion Hot Start II High-Fidelity DNA Polymerase

and T4 DNA ligase were purchased from New England Biolabs. Primers were obtained

from Integrated DNA Technologies (IDT). All other reagents were obtained from

commercial suppliers and used as received. Plasmids were purified on small scales by

Wizard® minicolumns (Promega Life Sciences) and on large scales using CsCl density

gradient ultracentrifugation.29 DNA sequencing was carried out by the University of

Florida ICBR using capillary fluorescence methods using standard protocols. LB

medium contained 5 g/L Bacto-Yeast Extract, 10 g/L Bacto-Tryptone and 10 g/L NaCl.

ZY medium contained 5 g/L Bacto-Yeast Extract and 10 g/L Bacto-Tryptone. 50x5052

contained 25% glycerol, 2.5% glucose, and 10% a-lactose monohydrate. NPS x20

contained 66 g/L (NH4)2SO4, 136 g/L KH2PO4, and 142 g/L Na2HPO4.

Cloning

Construction of plasmids used to make the OYE 1 Y196 library

The plasmid used as a template to make the OYE 1 Y196 library was pET3b-

OYE (Appendix E Figure E-1), a pET3b derivative containing the gene for OYE 1.7

pET3b-OYE was originally a gift from Dr. Betty Jo Brown (University of Michigan).

Construction of mutants in the OYE 1 Y196X randomized library

All PCR samples were purified using Wizard® Plus SV Gel PCR Clean up kits by

Promega according to the manufacturer’s instructions. Samples were then incubated

overnight with two 0.5 μL aliquots of 20 U/μL DpnI at 37°C to remove the parent

template. The first portion of DpnI was added immediately after PCR clean up and the

second was added after 4 hours of digestion. After DpnI digestion, samples were

purified using Wizard® Plus SV Gel PCR Clean up kits by Promega.

Digested PCR samples were used to transform ElectroTen-Blue®

electrocompetent cells (ETB) using a Gene Pulser® from BioRad. Electroporation was

carried out with 4 μL of PCR sample and 50 μL of ETB cells using 2.5 kV.

Electroporated samples were incubated with 600 μL of SOC media at 37°C for 1 h.

Cells were then plated onto LB-amp agar plates and grown at 37°C for 36 h. The best

results were obtained from ETB daughter cells grown from the commercial stock on the

same day as the transformation would take place. Granddaughter cells provided fewer

transformants. Transformed cells were then pooled by rinsing the plate with a minimal

volume of LB and scraping with a rubber policeman. Plasmid DNA was extracted and

purified using Wizard® Plus minipreps DNA purification system by Promega according

to the manufacturer’s instructions. Purified, pooled plasmids were sequenced by ICBR

using Sanger sequencing. Raw electropherograms (chromat files) obtained from Sanger

sequencing were analyzed to estimate the samples degeneracy. Degeneracy could be

gauged for samples using a NNK primer mix (Figure 1-11) by the method developed by

Sullivan and Walton.26 This gave a Q-score of 0.85, suggesting that all position 196

variants were present in the library. A 4 μL aliquot of the purified, pooled plasmid was

used to transform overexpression strain E. coli BL21 (DE3) Gold cells using

electroporation (2.5 kV). Three transformants per possible codon would be required to

obtain a good representation for each possible codon in a pooled plasmid sample. For

this reason, 95 randomly-chosen colonies were used to seed 600 μL of LB-amp in a 96

well plate. Well H12 was reserved for the wild-type control. The plate was shaken

overnight at 37ºC to reach saturation. The library was stored by mixing 120 μL of each

culture with 30 μL of sterile 80% glycerol in a fresh 96 well plate. This gave a final

glycerol concentration of 15%, allowing the plate to be stored indefinitely at -80ºC.

Substrates

A list of substrates and products is shown if Figure 1-8 through 1-10.

2-(Hydroxymethyl)-cyclopent-2-enone (1)

2-(Hydroxymethyl)-cyclopent-2-enone was prepared in our lab by Bradford

Sullivan28 using the method developed by Kar and Argade.30 2-(Hydroxymethyl)-

cyclopent-2-enone can be detected during screening by GC-FID using a Beta Dex 225

column (0.25 mm × 30 m). The temperature program used began with an initial

temperature of 140°C for 10 min, followed by an increase at 20°C/min to a temperature

of 180°C at which the program remained for 5 min (GC method is listed as AZW2.Meth

in Appendix D). 2-(Hydroxymethyl)-cyclopent-2-enone eluted near 13.1 min. The

reduced products (S)- and (R)-6 eluted near 11.4 and 10.2 min, respectively.

2-(Hydroxymethyl)-cyclohex-2-enone (2)

2-(Hydroxymethyl)-cyclohex-2-enone was prepared in our lab by Bradford

Sullivan28 using the method developed by Rezgui and El Gaied.31 2-(Hydroxymethyl)-

cyclohex-2-enone was detected during screening by GC-FID using a Beta Dex 225

column (0.25 mm × 30 m). The temperature program used began with an initial

in Appendix D). 2-(Hydroxymethyl)-cyclohex-2-enone eluted near 13.1 min. The

reduced products (S)- and (R)-7 eluted near 10.2 and 10.8 min, respectively.

Methyl 2-(hydroxymethyl)acrylate (3)

Methyl 2-(hydroxymethyl)acrylate was prepared in our lab by Bradford Sullivan28

using the method developed by Turki et al.32 Methyl 2-(hydroxymethyl)acrylate was

detected during screening by GC-FID using a Beta Dex 225 column (0.25 mm × 30 m).

The temperature program began with an initial temperature of 100°C for 12 min,

followed by an increase at 20°C/min to a temperature of 180°C at which the program

remained for 5 min (GC method is listed as AZW3.Met in Appendix D). Methyl 2-

(hydroxymethyl)acrylate eluted near 11.8 min. The reduced products (S)- and (R)-7

eluted near 10.7 and 11.3 min, respectively.

(S)-(+)-Carvone (7)

(S)-(+)-Carvone was purchased from Sigma Aldrich and it can be detected by

GC-MS using a DB-17 column (0.25 mm × 30 m). The temperature program used

began with an initial temperature of 60°C for 2 min, followed by an increase at 10°C/min

to a temperature of 195°C at which the program remained for 10 min (GC method is

listed as JON.Meth in Appendix D). (S)-(+)-Carvone eluted near 12.5 min. A mixture of

reduced product isomers, (+)-Dihydrocarvone (Acros) was used as a standard to assign

the peaks for both cis- and trans-9 (11.7 and 11.3 min, respectively).

(R)-(-)-Carvone (8)

(R)-(-)-Carvone was purchased from Sigma Aldrich and it can be detected during

screening by GC-MS using a DB-17 column (0.25 mm × 30 m). The temperature

program began with an initial temperature of 60°C for 2 min, followed by an increase at

10°C/min to a temperature of 195°C at which the program remained for 10 min (GC

method is listed as JON.Meth in Appendix D). (R)-(-)-Carvone eluted near 12.5 min. A

mixture reduced product isomers, (+)-Dihydrocarvone (Acros) was used as a standard

to assign the peaks for both cis- and trans-10 (11.7. and 11.3 min, respectively).

2-Methyl-2-cyclopenten-1-one (11)

2-Methyl-2-cyclopenten-1-one was purchased from Sigma Aldrich. 2-methyl-2-

cyclopenten-1-one was detected during screening by GC-MS using a DB-17 column

(0.25 mm × 30 m). The temperature program began with an initial temperature of 60°C

for 2 min, followed by an increase at 10°C/min to a temperature of 195°C at which the

program remained for 10 min (GC method is listed as JON.Meth in Appendix D). 2-

methyl-cyclopenten-1-one eluted near 7.0 min, and the reduced product 13 eluted near

5.2 min.

2-Methyl-2-cyclohexen-1-one (12)

2-Methyl-2-cyclohexen-1-one was purchased from Sigma Aldrich. 2-methyl-2-

cyclohexen-1-one was detected during screening by GC-MS using a DB-17 column

(0.25 mm × 30 m). The temperature program began with an initial temperature of 60°C

for 2 min, followed by an increase at 10°C/min to a temperature of 195°C at which the

program remained for 10 min (GC method is listed as JON.Meth in Appendix D). 2-

methyl-2-cyclohexen-1-one eluted near 8.5 min and the reduced product 14 eluted near

7.2 min.

Screening Assay

E. coli BL21 (DE3) Gold cells harboring plasmids containing OYE 1 Y196

mutants of interest were grown in a 96 well plate containing 600 μL of LB-amp. Cells

were grown at 37°C with 250 rpm of agitation overnight. The saturated cultures were

then used to inoculate a larger 2 mL square bottom 96 well plate. This larger square

bottom plate contained 600 μL of an auto-induction media. The auto-induction medium

contained a mix of ZY media, 50x5052, 20x NPS, and 200 μg/mL ampicillin.33 Cells

were induced in an aeration case developed at the University of Florida by the machine

shop in the Department of Chemistry. Induction occurred at 37°C with 350 rpm of

agitation overnight. The increased agitation is required for induction to occur. Induced

cells were then separated from the auto-induction medium by centrifugation. The

supernatant was removed and the induced, pelleted cells were resuspended in 600 μL

of reaction mixture, which contained 50 mM KPi, 100 mM glucose and 15 mM alkene

substrate, pH 7.0. Reactions were shaken at 250 rpm overnight at room temperature

before quenching by adding 500 μL of ethyl acetate. The organic phase was separated

by centrifugation and analyzed by GC.

Protein Purification and Crystallogenesis of Y196C OYE 1

The Y196C OYE 1 mutant was purified by the same methods used previously for

wild-type and mutant OYE 1,34 which is a modification of the procedure originally

developed by Massey.6 E. coli BL21 (DE3) Gold cells harboring pET3b-OYE 1 Y196C

were grown at 37°C in a 4 L New Brunswick Scientific M19 fermenter containing LB-

amp. Cells were grown in the fermenter with 600 rpm of agitation for 2 h to achieve mid

log phase. Protein overproduction was induced by adding IPTG and glucose to final

concentrations of 0.4 mM and 100 mM, respectively. The culture was grown at 30°C

with 600 rpm of agitation for 4 h. The culture was then chilled at 4°C for 30 min before

centrifugation at 5,000 × g. The wet cell pellet was then resuspended in 100 mM Tris-Cl

buffer containing 10 μM phenylmethane sulfonyl fluoride (PMSF) at pH 8.0. Cells were

then lysed under 12,000 psi with the aid of a French press. Cell extract was centrifuged

at 18,000 × g for 1 h to remove insoluble debris. Nucleotides were precipitated by

adding protamine sulfate to a final concentration of 1 mg/mL and stirring at 4°C for 20

min. The supernatant was separated by centrifugation at 18,000 × g for 20 min. Protein

was precipitated out of solution by adding solid ammonium sulfate in 5 portions every 5

min to achieve a final concentration of 78% saturation. The protein precipitate was then

separated by centrifugation at 18,000 × g for 1 h.

Purification of OYE 1 by an N-(4-hydroxybenzoyl) aminohexyl agarose affinity

column requires that the active site be emptied of any bound ligand that would interfere

with binding to the phenol moiety of the column matrix. This was accomplished by

successive buffer exchanges during dialysis. The ammonium sulfate pellet obtained

from the salt cut was resuspended in 100 mM Tris-Cl, 100 mM (NH4)2SO4, 10 μM

PMSF, pH 8.0. This was dialyzed against 1 L of 100 mM Tris-Cl, 100 mM (NH4)2SO4, 10

μM PMSF, pH 8.0 overnight at 4ºC. The sample was then dialyzed against 1 L of the

same buffer containing 10 mM sodium dithionite for 2 h at 4ºC. After 2 h, the buffer was

exchanged for a fresh 1 L of buffer containing 10 mM sodium dithionite and dialysis

continued for 2 h. The sample was then transferred to a fresh 1 L of buffer without

dithionite and dialyzed for 2 h, after which, the buffer was exchanged with a final 1 L of

fresh buffer and dialyzed overnight. The final sample was then centrifuged at 18,000 × g

for 30 min to remove any insoluble debris accumulated during dialysis.

Dialyzed protein samples were loaded onto the affinity column in 10 mL portions.

The affinity column was equilibrated with 100 mM Tris-Cl, 100 mM (NH4)2SO4, 10 μM

PMSF, pH 8.0. Binding of OYE 1 turned the column green. After washing with 30 mL of

starting buffer, the desired protein was eluted by washing with 100 mM Tris-Cl, 100 mM

(NH4)2SO4, 10 μM PMSF, 4 mM sodium dithionite, pH 8.0. OYE 1 Y196C was then

further purified by gel filtration with a Superdex 200 column (Pharmacia) using 50 mM

Tris-Cl, 50 mM NaCl, pH 7.5. Pooled fractions containing the desired protein were then

concentrated by ultrafiltration using an Amicon centrifugation tube to a final

concentration of 20 mg/mL. Protein concentration was determined by absorbance at

280 nm using an extinction coefficient (ε) and molecular weight (MW) estimated by

protparam (OYE 1 Y196C had an ε of 70,820 M-1cm-1 with a MW of 44,954 Da).35

Crystals were grown using the published conditions discovered by Fox and

Karplus.20 Wells contained 6 μL of 20 mg/mL protein in 50 mM Tris-Cl, 50 mM NaCl, pH

7.5 and used hanging drop vapor diffusion. The crystallization solution contained 35%

(v/v) PEG 400, 100 mM Na HEPES, 200 mM MgCl2, pH 8.3. The best crystals obtained

were obtained after 10 days at 6°C. After crystallization, crystals were mounted in

appropriate loops and soaked with p-HBA before being flash cooled in liquid nitrogen

and sent for data collection. No cryoprotectant was used.

The Data Collection and Crystal Structure of OYE 1 Y196 Mutants

The best crystals diffracted to a maximum usable resolution of 1.36Å using the

X6A beamline at Brookhaven National Laboratory. The unit cell measured was 141.1

141.1 42.8 Å 90 90 90 and the crystals belonged to space group P 43 21 2. The

asymmetric unit contained 1 molecule with a solvent content of 48.05% and a Matthew’s

coefficient of 2.37 Å3/Dal.36

Reflection data were processed using the iMOSFLM program from the CCP4

program suite to a resolution of 1.36Å.37 Phases were obtained using the Phaser-MR

utility of the PHENIX program suite by molecular replacement using a modification of S.

pastorianus OYE 1 (PDB code 1OYB) as the search model.38 All ligands and water

molecules were removed prior to molecular replacement. Inspection of the model

showed one OYE 1 chain present in the asymmetric unit. The best solution for the

space group was determined to be P 43 21 2. The initially calculated 2Fo-Fc and Fo-Fc

maps showed electron density patterns that could be easily identified as the FMN

cofactor. The FMN cofactor was modeled into the structure. Initial refinement using the

xyz coordinates, B-factors, real-space, and occupancies refinement strategy features in

PHENIX refine as well as continued cycles of model building with the aid of the structure

validation tools in COOT produced a model with an Rfree of 0.1824.39 Continual

iterations of using the structural validation tools in COOT and PHENIX.refine produced

a model with an Rfree of 0.1698. At this point the p-HBA was modeled into the active site

and subsequent rounds of model building with COOT and refinement produced an Rfree

of 0.1554.

Conclusions

The Y196 mutants of OYE 1 showed no improvement of product range for the

enzyme. However, the observation that Y196 could be successfully replaced by Cys is

intriguing since the crystal structure of the mutant with bound p-HBA showed that the

cysteine residue was not appropriately positioned to act as a general acid. Our current

hypothesis is that solvent supplies the proton to the enol(ate) formed by enone

reduction, possibly after dissociation from the active site.

These OYE 1 structural data may also provide guidance for the analogous

Y193C mutant of P. stipitis OYE 2.6. The latter could not be crystallized because the

protein could not be purified with reproducible properties. Like OYE 1, Cys was the sole

functional replacement for Tyr in P. stipitis OYE 2.6. Whether the structure and catalytic

mechanism of this mutant resembles its OYE 1 counterpart remains to be determined.

Figure 1-1. The catalytic cycle of G6PDH investigated by Warburg and Christian.1,28

Figure 1-2. The catalytic cycle of OYE 1 established by Massey and Vas.10,40

Stott et al. (1993)8

Vaz et al. (1995)10

Figure 1-3. List of OYE 1 substrates from the literature.

Vaz et al. (1995)10

Figure 1-3. (Continued).

Vaz et al. (1995)10

Kohli et al. (1998)11

Meah et al. (2000)15

Meah et al. (2001)16

Williams et al. (2004)17

Swiderska et al. (October 2006)12

Swiderska et al. (December 2006)18

Mueller et al. (March 2007) 19 Bougioukou et al. (2008)13

Mueller et al. (September 2007)41

Hall et al. (2008)42

Padhi et al. (2009)23

Winkler et al. (2010)43

Stueckler et al. (September 2010)14

Stueckler et al. (October 2010)44

Brenna et al. (June 2011)45

Brenna et al. (July 2011)46

Brenna et al. (December 2011)47

Brenna et al. (2012)48

Durchschein et al. (2012)49

Tasnadi et al. (March 2012)50

Tasnadi et al. (June 2012)51

Brenna et al. (January 2014)53

Turrini et al. (2015)55

Figure 1-4. FMN Diagram displaying the FMN (gold) environment in the active site of OYE 1. OYE 1 (green) uses hydrogen bonding partners to lock FMN in place within the active site (T37, G72, Q114, and R243). OYE 1 also uses the hydrophobic bonding partners beneath to FMN shown with blue circles (P35, L36, and I351) to lock it into position.

Figure 1-5. Catalytic mechanism of OYE 1. Mechanism for the reduction of a bound α-unsaturated carbonyl substrate (black) by a reduced FMN (gold) in the active site of OYE 1 (green). OYE 1 uses positions H191 and N194 as hydrogen bonding partners to lock the carbonyl into position.

Figure 1-6. OYE substrate binding modes. In both binding modes, cyclohexenone docks in the active site above and parallel to the plane of the reduced FMN. The carbonyl oxygen forms hydrogen bonds with the residues N194 and H191. The hydride from N5 is transferred to the electron deficient β-carbon. Y196 acts as a general acid to protonate the resulting enolate (see figure 1-5) at the α-carbon. The “flipped” conformation requires a shift in the angle of hydride and proton transfer and is sterically crowded by the presence of Trp116. The stereochemistry of each binding mode product is indicated with the protic hydrogen (green) and the hydride hydrogen (gold) below each scheme. For a prochiral substrate the two binding modes determine product stereochemistry.28

Figure 1-7. OYE 1 active site diagram. This diagram shows the areas of the OYE 1

active site. The orange section contains the “east side” of the active site which will be discussed more in chapter 2. The magenta section contains other active site positions which form a gate with the positions on loop 6. Loop 6 is the mobile part of this gate and opens up to allow NADPH to enter the active site and reduce oxidized FMN. The green section includes position on loop 6 which will be of discussed more in chapter 4. The Blue segment contains the “west side” of the active site which will be discussed more in Chapter 2. The yellow segment contains positions that interact with the bound substrate (N194 and H191) and position Y196 which acts as a general acid in OYE 1 catalysis.

Figure 1-8. List of Baylis-Hillman substrates screened by OYE 1 Y196 library.

Figure 1-9. List of carvone substrates screened by OYE 1 Y196 library.

Figure 1-10. List of screening substrates screened by OYE 1 Y196 library.

Figure 1-11. Calculations for obtaining a Qscore of a pooled plasmid mix from a NNK primer mix using data from a theoretical sequencing electropherogram. This figure shows a theoretical electropherogram with theoretical peak heights. a) The fraction of each peak height at a position over all peak heights at that position is shown on the left side of each peak. Peaks correspond to: blue/thymine, orange/guanine, green/adenine, purple/cytosine, and the dashed black represents perfect degeneracy. b) The peak fraction of each base is used to estimate the amount of codons containing that base at that position. c) The sum of those estimates are used to obtain a Q value (QN or QK) for that position. d) The sum of the weighted QN and QK values is used to calculate the Qscore for the pooled mix. The Qscore approaches perfect degeneracy (equal amounts of bases at each position) as it approaches 1.0.26

Figure 1-12. OYE 1 Y196 library screening results for 1. No conversion was observed for any mutants. OYE wt. results are consistent with previous findings.22

Figure 1-15. OYE 1 Y196 library screening results for 7. OYE wt. results for (S)-(+)-carvone are consistent with previous findings.24

Figure 1-16. OYE 1 Y196 library screening results for 8. OYE wt. results are consistent with previous findings.24

Figure 1-17. OYE 1 Y196 library screening results for 11. The stereochemistry of products for substrate 11 was not evaluated during screening.

Figure 1-18. OYE 1 Y196 library screening results for 12. The stereochemistry of products for substrate 12 was not evaluated during screening. The stereochemistry of OYE 1 Y196C was evaluated after initial screening by GC-FID using a Beta Dex 225 column (0.25 mm x 30 m) using AZW2.Meth (Appendix D). Peaks were never assigned.

Figure 1-19. Alignment of OYE 1 wt. and OYE 1 Y196C. This figure shows OYE 1 wt. (green) aligned to and OYE 1 Y196C (orange) (PDB ID 1OYB) with FMN cofactor (yellow). Both structures have p-HBA bound in the active site.20

CHAPTER 2

IMPROVING THE PRODUCT RANGE OF OYE 1 AND OYE 2.6 THROUGH PROTEIN ENGINEERING

Background

Mutagenesis of the East Side of the Active Site in OYE

Given the potential of alkene reductases for making products with high

enantiomeric excess, we were eager to explore this class of enzymes for use as a

biocatalysts. Long before we waded into an expansive mutagenesis projects which are

frequent in this group now, we wanted to probe a set of alkene reductases against a

series of substrates. To this end, our lab made an alkene reductase library with 16

alkene reductases.56 This collection of alkene reductases was assembled by Despina

Bougioukou and was screened against a series of substrates of interest.56 Table 2-1

lists all the fully sequenced alkene reductase libraries made by the Stewart group.

One enzyme in that library that was of interest to our group was Old Yellow

Enzyme (OYE 1) from Saccharomyces pastorianus. Using the crystal structure of OYE

1 solved by Fox and Karplus,20 our group identified a position in the active site that we

believed might affect the orientation of substrate binding and could influence the type of

products we could obtain. Position W116 is located on the east side of the OYE 1 active

site and the tryptophan at this position is in close proximity to groups that extend off the

alpha carbon of any bound carbonyl substrate. We believed that substitution of this

bulky tryptophan for a different residue may allow substrates to bind in an alternate

“flipped binding mode”. Figure 2-1 shows the flipped binding mode in the OYE 1 active

site. Binding in this mode would give the alternate enantiomer at the alpha carbon

following catalysis. To pursue this idea, in 2009 Despina Bougioukou and Santosh

Padhi used site saturation mutagenesis at position W116 of OYE 1 to discover new

mutants that would allow flipped binding.23 Our group was very interested in a set of

Baylis-Hillman substrates (substrates 1-3) as well as a pair of carvone substrates

(substrates 7 & 8) (Figure 2-2). Bougioukou and Padhi screened the carvone substrates

against the OYE 1 W116 mutants. The blind screening done with these mutants

identified functional mutations including OYE 1 W116F, W116I, W116L, W116M and

W116Y. It was also discovered that OYE 1 W116I would flip (S)-(+)-carvone and give

the trans product, trans-9 (diasteriomeric excess (de) = 88% in favor of trans and

conversion was 98%). OYE 1 wild type (wt) however, gave the cis product (cis-9) for

(S)-(+)-carvone (de = 93% in favor of cis and conversion was 48%). These findings

were significant enough to warrant making a full degenerate library containing all 20

residues substituted at position W116. In 2011, Adam Walton and coworkers screened

the Baylis-Hillman substrates against this OYE 1 W116 library.22 They found that some

of these mutants would allow flipped binding for 2-(hydroxymethyl)-cyclopent-2-enone

(substrate 1) and methyl 2-(hydroxymethyl)acrylate (substrate 3). In 2013, Yuri Pompeu

and Bradford Sullivan screened the carvone substrates against the completed OYE 1

W116 mutants.24 They identified new mutants that could provide the alternate trans

product (trans-9) for (S)-(+)-carvone (OYE 1 W116A, W116C, W116E, W116G, W116I,

W116M, W116N, W116Q, W116S, W116T, and W116V) as well as the alternate cis

product (cis-10) for (R)-(-)-carvone OYE 1 W116A, and W116V). Furthermore, the

crystallographic studies done by Pompeu and Sullivan showed (S)-(+)-carvone bound in

the flipped binding mode within the active site of the OYE 1 W116I mutant. Figure 2-3

shows the active site of OYE 1 W116I with (S)-(+)-carvone bound.24 And as such, this

work provided firm evidence for the idea that position W116 played a role in

discriminating substrates during binding and that mutating this position would make the

active site more amenable to a flipped binding mode which could alter the

stereochemistry of the products.

Another OYE homolog that our group investigated was OYE 2.6 from Pichia

stipitis. OYE 2.6 has an isoleucine at position I113 which is the analogous position to

W116 in OYE 1. What is fascinating about OYE 2.6 is that it gives the same cis product

(cis-9) as OYE 1 wt for (S)-(+)-carvone. This is noteworthy because when these two

homologs have the same residue at an analogous position they give different products.

Because of this, our group began focusing on OYE 2.6 as a biocataylst.34,57,58 In one of

our most ambitious mutagenesis projects to date, our group extensively mutated several

positions in the OYE 2.6 active site to explore its potential for engineered

biocatalysis.34,57,58 The approach our group used during this project was a technique

developed by Manfred Reetz called iterative saturation mutagenesis (ISM).59 ISM is a

progressive protein engineering strategy that targets several positions of interest over a

series of mutagenic rounds. The best mutant identified for each position is then used as

an “anchor” for a second round of mutagenesis where a different position is

randomized. Successive rounds are carried out with the best mutant at each position

being added to the next round of randomization. The goal of this strategy is to hone in

on multi-mutated variants with exceptional properties. Targeting positions around the

OYE 2.6 active site, our lab made several 1st, 2nd, and 3rd generation libraries.57 Figure

2-4 has a diagram showing all the residues in the OYE 2.6 active site. The ISM

experiments were concluded after variants were discovered that would allow the flipped

binding mode and give the (R)- product with high enantiomeric excess (ee) for two of

the three Baylis-Hillman substrates. These variants worked for both substrate 3 and

substrate 1.57 A 2nd generation library uncovered an OYE 2.6 Y78W / F247A double

mutant that gave the best results for the 2-(hydroxymethyl)-cyclohex-2-enone (substrate

2). Table 2-2 summarizes the best variants of OYE 1 and OYE 2.6 for obtaining de and

ee for carvone and Baylis-Hillman substrates. The best result however was a

conversion 43% and an ee of 37% (S), which is not ideal since the product of interest is

the R enantiomer ((R)-5). As such, a good solution for obtaining the (R)-5 product from

2 was not discovered during the ISM project.

Project Summary

Though an enzyme that would exclusively make the (R)-5 product from 2 was

never discovered, we did find a set of promising positions that gave us a good idea on

how to move forward. Since OYE 1 wt only gives 10% conversion for 2 we decided to

focus our efforts on engineering OYE 2.6 which gives nearly 100% conversion. We

believed that the answer would be found at the east side of the active site. Molecular

modeling shows that the alternate flipped binding mode of 2 would crash into the east

side residues in OYE 2.6. Figure 2-7 shows OYE 2.6 with a 6-membered ring modeled

into the active site. The best double mutant combination identified during the ISM

project with a pair of east side mutants, was OYE 2.6 Y78W / I113C. This double

mutant includes two positions that are located on the east side of the active site and

their residues extend into the space where any substrate that attempts a flipped binding

to OYE 2.6 would occupy. Figure 2-6 shows the crystal structure of OYE 2.6 Y78W /

I113C with malonate bound into active site. The best triple mutant discovered for

obtaining the (R)-5 product from 2 was OYE 2.6 Y78W / I113C / F247A which includes

the two mutants at positions on the east side of the OYE 2.6 active site of interest as

well as one position on the west side.57 Given the results of these variants, we wanted

to further investigate these three positions.

The main aim of this project was to find a combination of mutations that would

improve the yield for the (R)-5 product for OYE 2.6 and in doing so, hopefully discover

new biocatalysts that could be used on other future substrates. In this project we

examine what would happen if we compacted the large obstructive steric bulk on the

east side of the active site in OYE 2.6. We began by making a randomized matrix library

of the two positions on the east side of the active site that had the most notable effect

on enantiomeric excess during the ISM experiments, positions Y78 and I113. These

positions would be replaced with a pair of smaller residues; alanine, cysteine, glycine,

serine, threonine, and valine. We then assembled a library with presequenced double

mutants at those positions. Since the best double and triple mutant variants contained

mutations at position F247, that was another position we wanted to target during this

project. Since we believe that position Y78 and I113 interact with each other, it was

important to find the right combinations at these two positions before looking at the

F247 position. After anchoring off the best double mutants discovered during the

mutagenesis of the east side of active site, a set of 2nd round libraries were made

targeting the F247 position located on the west side of the active site.

OYE 2.6 Y78Xsm / I113Xsm Randomized Library

Initial cloning simultaneously targeted two positions located on the east side of

the OYE 2.6 active site, a tyrosine at position Y78 and an isoleucine at positon I113.

The aim of this strategy was to replace these two residues with a pair of small amino

acids which would take up less space and allow the alternate flipped binding mode for 2

which would in turn produce more (R)-5 product. The smaller pair of residues would

include a combination of either alanine, cysteine, glycine, serine, threonine or valine.

Given the substantial amount of cloning that had already been done on this enzyme

(fourteen 1st generation, nine 2nd generation, and two 3rd generation libraries)28,57 it was

preferable to survey the active site with as minimal effort as possible. To minimize the

laborious task of cloning a matrix of double mutants, a randomized cloning approach

was chosen to assemble the first library. A KST random primer mix contains an equal

number of codons for alanine, cysteine, glycine, and serine. Anchoring off a pBS2

vector, a pET derivative with GST-OYE 2.6 fusion protein (Appendix E, Figure E-2), and

primers using a KST mix at position Y78 would provide a mix of mutants containing

many of the single mutations needed. Anchoring off of this random pBS2-OYE 2.6

Y78XKST plasmid mix, primers using a KST mix at position I113 provided a random mix

containing up to 16 of the double mutations of interest. Anchoring off OYE 2.6 Y78T,

and OYE 2.6 Y78V with OYE 2.6 I113XKST primers as well as anchoring off OYE 2.6

I113T and OYE 2.6 I113V with OYE 2.6 Y78XKST primers provided us with four random

mixes containing up to 16 more double mutations of interest. The final four threonine-

valine double mutants had to be made individually. In this way the number of PCR

reactions, and the subsequent cloning steps, was reduced. Mutants obtained from both

the pooled KST randomized PCR cloning, and the completed set of valine and

threonine double mutants were used to assemble a random library containing small

residues at both the Y78 and the I113 position.

The three Baylis-Hillman substrates were used to screen the mutants in the OYE

2.6 Y78Xsm / I113Xsm randomized library. The positions that gave the most notable

results were then sequenced to determine the mutations present. By sequencing the

mutants after screening, we save time and effort by not sequencing numerous less

successful mutants. The majority of successful mutants were double mutants that were

successfully cloned. However, in a few cases promising positions were revealed to

contain concatomeric repeats of primer inserts in the sequence at the Y78 position.

Figure 2-8 shows the alignment of OYE 2.6 sequence with a sample containing portions

of primer inserts in the sequence. No concatomeric repeats of the primer was observed

in samples in which the I113 was targeted for cloning.

Positions in the OYE 2.6 Y78Xsm / I113Xsm randomized library that produced the

(R)-5 product were sequenced. Sequencing revealed that a cysteine mutation at

position I113 was present in most of the successful mutants. These substrates had

been extensively screened against an OYE 2.6 I113C single mutant and an OYE 2.6

Y78W / I113C double mutant in previous work.57 In fact, during the ISM experiments

anchoring off OYE 2.6 Y78W in which position I113 was randomized, it was revealed

that OYE 2.6 Y78W / I113C was the best double mutant in that library. OYE 2.6 I113C

and OYE 2.6 Y78W / I113C gave 100% and 98% conversion and 81% and 60% ee (S)

for the (S)-5 product of 2 respectively (Figure 2-9). However, the OYE 2.6 Y78Xsm /

I113C mutants had performed better than the single OYE 2.6 I113 and the OYE 2.6 Y78

/ I113 double mutant variants from the ISM experiments. Since these experiments,

which included a small amino acid mutation at position Y78, provided improved results

for obtaining the (R)-5 product from 2, we decided to fully explore the potential of double

mutants at these two positions. Thus, a complete library was assembled containing a

full matrix of all 36 double mutants.

OYE 2.6 Y78Xsm / I113Xsm Presequenced Library

Given the promising results of some of the mutants in the OYE 2.6 Y78Xsm /

I113Xsm randomized library, efforts were made to make all the small amino acid double

mutants of OYE 2.6 at positions Y78 and I113. Though the best results obtained from

screening the OYE 2.6 Y78Xsm / I113Xsm randomized library all contained the I113C

mutation, it was a concern that if we solely focused on the OYE 2.6 I113C double

mutants we might miss variants with remarkable results. Therefore, all 36 double

mutants would be made and screened. Substrate 2 was screened against the mutants

of the completed OYE 2.6 Y78Xsm / I113Xsm presequenced. Most of the mutants present

provided excellent conversion for 2. Figure 2-10 shows the results of screening the OYE

2.6 Y78Xsm / I113Xsm presequenced library against 2. However, the only set of mutants

that gave any (R)-5 product were the I113C mutants. Every OYE 2.6 I113C mutant gave

at least some (R)-5 product. Though no mutation at position Y78 stood out as the clear

choice for obtaining better yields of (R)-5, I113C was clearly the best substitution at that

position. At this point we felt there was not much more that could be done with solely

looking at these two positions. Moving forward, we chose to target a position that

worked well in combination with other double mutants made on the east side of the

active site, position F247.

OYE 2.6 Y78Xsm / I113C / F247X Randomized Libraries

We then investigated the west side of the OYE 2.6 active site, anchoring off the

best the double mutants discovered while screening the presequenced OYE 2.6 Y78Xsm

/ I113Xsm library. Previous mutagenesis projects57 revealed that a set of OYE 2.6 Y78W

/ I113C / F247A and F247H triple mutants would give good results for 2. Also, double

mutants that included a F247 mutation were in fact the best double mutants discovered

in the ISM project for obtaining (R)-5. Anchoring off of OYE 2.6 Y78A, Y78C, Y78G,

Y78S, Y78T, and Y78V / I113C, a set of 6 libraries were made in which position F247

was randomized with a set of primers containing a NNK mix of bases at the codon for

F247. A NNK mix contains 32 codons which include at least one codon for every

residue. This is the approach our group used during the ISM experiments to make

mutagenic libraries. Pooled samples of transformants were sequenced to access their

degeneracy using an evaluation method developed in our group during the ISM

experiments.26 Samples with sufficient degeneracy were further cloned into expression

strains and assembled into a randomized library. Table 2-3 contains all the Q scores

and the estimated number of amino acids obtainable from transformation with that

plasmid mix. Substrate 2 was used to screen the six OYE 2.6 Y78Xsm / I113C / F247X

libraries (Figure 2-11 through 2-16 shows the results of triple mutant screenings).

Unfortunately, few of the triple mutant variants gave any significant improvement over

the double-mutant anchor used to make the library. The six best results were

sequenced and not surprisingly four were the double mutant controls. The only two

triple mutants discovered during sequencing were OYE 2.6 Y78C / I113C / F247H and

F247W. With near 100% conversion and racemic ee, these mutations are the best

results discovered during this project and best variants of OYE 2.6 for obtaining (R)-5

from substrate 2 (Figure 2-9).