expression of a lipase in prokaryote and eukaryote - diva portal

62
Expression of a lipase in prokaryote and eukaryote host systems allowing engineering Marianne Wittrup Larsen Doctoral Thesis Royal Institute of Technology School of Biotechnology Stockholm 2009

Upload: others

Post on 09-Feb-2022

9 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

Expression of a lipase in prokaryote and eukaryote host systems allowing engineering

Marianne Wittrup Larsen

Doctoral Thesis

Royal Institute of Technology

School of Biotechnology

Stockholm 2009

Page 2: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

© Marianne Wittrup Larsen 2009 Royal Institute of Technology School of Biotechnology AlbaNova University Center SE-106 91 Stockholm Sweden ISBN 978-91-7415-420-7 TRITA-BIO Report 2009:14 ISSN 1654-2312 Printed in Stockholm, September 2009 Universitetsservice US-AB Drottning Kristinas väg 53 B 100 44 Stockholm Cover illustrations: Escherichia coli (left) and Pichia pastoris (right), size indication 10 m. Photographs by Kaj Kauko.

Page 3: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

ABSTRACT

Pseudozyma (Candida) antarctica lipase B (PalB) was expressed in Escherichia coli facilitating protein engineering. The lack of glycosylation was evaluated for a deeper understanding of the difficulties in expressing PalB in E. coli. Different systems were tested: periplasmic expression in Rosetta (DE3), cytosolic expression in Rosetta-gami 2(DE3), Origami 2(DE3), and co-expression of groES and groEL. Periplasmic expression resulted 5.2 mg/L active PalB at 16 �C in shake flasks. This expression level was improved by using the EnBase technology, enabling fed-batch cultivation in 24-deep well scale. The feed rate was titrated with the addition of -amylase, which slowly releases glucose as energy source. Different media were evaluated where the EnBase mineral salt medium resulted in 7.0 mg/L of active PalB.

Protein secreted directly into the media was obtained using the consti-tutive glyceraldehyde-3-phosphate dehydrogenase (GAP) promoter for screening and production of PalB in P. pastoris. A protease sensitive fusion protein CBM-PalB (cellulose-binding module) was used as a model sys-tem. When optimised, the expression system resulted in 46 mg/L lipase in 72 hours in shake flask, 37 mg/L lipase in 28 hours in 96-deep-well plate format, and 2.9 g PalB per 10 L bioreactor cultivation.

The E. coli expression system was used to express a small focused library of PalB variants, designed to prevent water from entering the active site through a hypothesised tunnel. Screening of the library was performed with a developed assay, allowing for simultaneous detection of both trans-acylation and hydrolytic activity. From the library a mutant S47L, in which the inner part of the tunnel was blocked, was found to catalyse trans-acylation of vinyl butyrate in 20 mM butanol 14 times faster than hydrolysis. Water tunnels, assisting water in reaching the active sites, were furthermore found by molecular modelling in many hydrolases. Molecular modelling showed a specific water tunnel in PalB. This was supported by experimental data, where the double mutant Q46A S47L catalysed trans-acylation faster than hydrolysis compared to the wild type PalB.

Page 4: Expression of a lipase in prokaryote and eukaryote - DiVA Portal
Page 5: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

“Denmark seen from a foreign land looks but like a grain of sand. Denmark as we Danes conceive it is so big you won’t believe it” -Piet Hein-

Page 6: Expression of a lipase in prokaryote and eukaryote - DiVA Portal
Page 7: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

LIST OF ARTICLES

This thesis is based on the following articles which are referred to by their roman numerals.

I Larsen, M. W., Bornscheuer, U. T. and Hult, K. (2008) Expression of Candida antarctica lipase B in Pichia pastoris and various Escherichia coli systems. Protein Expression Purif 62:90-97.

II Neubauer, A., Golson, R., Ukkonen, K., Krause, M., Tegel, H.,

Ottosson, J., Larsen, M. W., Hult, K., Neubauer, P., Vasala, A. (2009) Controlling nutrient release in cell cultivation. Genet Eng Biotechnol N 29:50-51.

III Larsen, M. W., Enfors, S-O., Jahic, M. and Hult, K. (2009) Screening

and production of Pseudozyma (Candida) antarctica lipase B in Pichia pastoris using the GAP promoter as alternative to the AOX promoter expression system. Manuscript

IV Larsen, M. W., Zielinska, D. F., Martinelle, M., Hildalgo, A., Jensen,

L. J., Bornscheuer, U. T. and Hult, K. (2009) Suppression of water as a nucleophile in Pseudozyma (Candida) antarctica lipase B catalysis. Manuscript

V Gruber, C. C. *, Larsen, M. W. *, Martinelle, M., Kroutil, W. and

Hult, K. (2009) A tunnel provides the active site of a lipase with substrate water. Manuscript

*These authors contributed equally to this work

Page 8: Expression of a lipase in prokaryote and eukaryote - DiVA Portal
Page 9: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

TABLE OF CONTENTS

1 INTRODUCTION 1

1.1 Protein 1 1.1.1 Posttranslational modifications 2

1.2 PalB (CALB) 2 1.2.1 PalB as biocatalyst 2 1.2.2 Aim of the present investigation 4

2 HETEROLOGOUS PROTEIN EXPRESSION SYSTEMS 5

2.1 Prokaryotic expression 5 2.1.1 Gram-positive bacteria 6 2.1.2 Gram-negative bacteria 7 2.1.3 Optimising protein expression in E. coli 10

2.2 Eukaryotic expression 13 2.2.1 Filamentous fungi 13 2.2.2 Yeast 14 2.2.3 Optimising protein expression in P. pastoris 18

2.3 Choosing a heterologous expression system 19 2.3.1 Target protein 20 2.3.2 Functional goal/regulatory issues 21 2.3.3 Up-scaling/down-stream processing 23

3 DNA TECHNIQUES FOR ENZYME ENGINEERING 24

3.1 Choosing an engineering strategy 24

3.2 Examples of DNA techniques for enzyme engineering 26 3.2.1 Point mutation 26 3.2.2 Saturation mutagenesis 30 3.2.3 Directed gene evolution 30

Page 10: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

3.2.4 Permutation 37 3.2.5 Random mutagenesis 39

3.3 Final remarks on DNA techniques for enzyme engineering 39

4 SUBSTRATE TUNNELS IN ENZYMES 41

5 PRESENT INVESTIGATION IN A WIDER PERSPECTIVE 44

ACKNOWLEDGEMENTS 45

REFERENCES 47

Page 11: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

1

1 INTRODUCTION

1.1 Protein

Microorganisms are small chemical factories that can be used in house-holds and industry – and has been used so for centuries for the pro-duction of non-proteins and proteins products. Non-protein products are a major industry, with examples as ethanol and citric acid. Protein pro-ducts include enzymes for the detergent industry such as proteases and lipases.

Today obtaining a gene, “adapting” it and expressing it – is in theory a standard procedure, although it is in some cases very labour intense. DNA is no longer a mystery and the number of sequenced genomes increases exponentially. This naturally raises the question, what do all the genes code for?

Proteins and the understanding of their complex structure/function relationship are key issues, not only in the hunt for better more suitable medicines, but also in biocatalysis, where the demand for improved and/or specialised enzymes is immense.

Figure 1-1. The central dogma: transcription of DNA translated into protein

The genetic code contains only four nucleotides which are read in triples as codons, see Figure 1-1. Each codon corresponds to an amino acid. There are 20 naturally occurring amino acids, which fascinatingly give rise to almost unlimited possibilities for variation. The understanding of small variations of amino acids and their consequences for protein structure and function is an important research field not only for medical purposes but also for the more basic understanding of biocatalysis as a step in the direction of tailor-made enzymes.

Page 12: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

2

1.1.1 Posttranslational modifications

In all eukaryote systems posttranslational modification is a mean of adding crucial information and functions to the protein. In higher eukaryotes it is a blueprint for recognition, since non-recognised proteins are considered potentially harmful and are quickly degraded. Several different forms of posttranslational modifications exist, such as methylation, glycosylation as well as the cleaving of the leader signal sequences after the arrival to the destination compartment. In the present work, glycosylation has been considered.

1.2 PalB (CALB)

In the quest for new efficient biocatalysts, researchers at Novo Nordisk (Bagsværd, DK) came across the yeast Candida antarctica, isolated from lake sediment in Lake Vanda, Vitoria Land in Antarctica (Patkar et al. 1993). They found that the yeast secreted two different lipases and named them A and B, according to their separation after ion exchange chromatogra-phy. The sequence and crystal structure of lipase B from Candida antarctica (CALB) was determined the following year (Uppenberg et al. 1994). Since then CALB has become a commercially available product, sold under the name Novozyme 435.

Shortly thereafter the anamorphic yeast genus Candida was reorganised according to their morphological and physiological properties. Candida antarctica with a few other Candida species, were reclassified as Pseudozyma, based on an analysis of partial sequences of 26s ribosomal DNA (Boekhout 2009, personal communication). Pseudozyma antarctica was found unable to assimilate ethanol nor did it grow in a vitamin free medium. In summary, Candida antarctica is now called Pseudozyma antarctica (Boekhout 1995), and hence in the following text the name Pseudozyma antarctica lipase B (PalB) will be used instead of CALB.

1.2.1 PalB as biocatalyst

PalB is one of the world’s most widely used and studied enzymes, as indicated by the number of scientific publications containing PalB or the commercial name Novozyme 435. The interest and use of PalB in biocata-lysis has increased steadily the last 19 years as seen from Figure 1-2.

Page 13: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

Introduction

3

Publications per year

0

50

100

150

200

250

1990

1991

1992

1993

1994

1995

1996

1997

1998

1999

2000

2001

2002

2003

2004

2005

2006

2007

2008

2009

0

500

1000

1500

2000

2500

C. antarctica lipase

Novozym 435

Figure 1-2. Publications containing “Candida antarctica lipase” or “Novozyme 435” per year (SciFinder Aug 2009)

The selectivity and activity combined with stability in organic solvents are some of the key features that account for the popularity of PalB. The lipase has been used in such different applications as resolution of amines and secondary alcohols, production of biodiesel and polymer synthesis (Anderson et al. 1998).

The natural reaction of PalB is to hydrolyze esters. The reaction mecha-nism of PalB passes through an acyl-enzyme intermediate which will be attacked by water to achieve hydrolysis. However, PalB is also able to use other substrates for example, alcohols as nucleophiles, resulting in an acyl transfer, Figure 1-3. In biocatalysis many reactions are preformed in a water-limited environment to suppress hydrolysis. To limit water in reaction media has practical and economical drawbacks. It would there-fore be of great interest to decrease the preference of the lipase towards water as a nucleophile.

Page 14: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

4

Figure 1-3. Reaction pathway of PalB

1.2.2 Aim of the present investigation

In this thesis both prokaryote and eukaryote expression systems were investigated, aiming at a system suitable for screening of Pseudozyma antarctica lipase B variant libraries. This was applied on a semi-rationally designed enzyme library of PalB, in order to find candidates with a signi-ficantly decreased ratio between hydrolysis and acyl transfer in a water environment.

Page 15: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

5

2 HETEROLOGOUS PROTEIN EXPRESSION

SYSTEMS

Microorganisms are used to produce proteins on-demand, but there is always a price to pay. The protein may be correctly folded and high yield, but the expression system is complex, making a new protein (variant) time consuming. Or the system may be easily changed and fine tuned, but results in only small amounts of protein.

If the primary goal is to achieve high amounts of protein, it is sensible to spend time developing an optimal expression system, which might be more labour intense to set up; for example a eukaryotic system. Eukaryotic expression systems are more tedious to manipulate and are generally avoided in directed evolution of enzymes. They do however offer the pos-sibility of fast and easy screening processes, as the recombinant protein can be easily secreted into the surrounding environment. On the other hand if the primary goal is to make libraries of protein variants, a fast and easily manipulated system is desirable, for example a prokaryote expres-sion system. The well characterised Escherichia coli host system offers a wide range of possibilities for genetic manipulation, expression regulation and optimisation. Overall it is therefore important to know as much as possible about the protein, as well as the needs, before setting up an ex-pression system.

In the following text a short introduction to the various host systems will be described.

2.1 Prokaryotic expression

Prokaryotic hosts are usually the starting point in heterologous protein expression since they are easy to manipulate, have a short generation time, and are well studied systems. Traditionally, prokaryotes can be classified as Gram-positive or Gram-negative depending on whether they stain in a Gram test or not. This difference originates from different physiologies of the bacteria. Gram-positive bacteria lack an outer membrane in contrast to Gram-negative bacteria, which have both an outer and inner membrane.

Page 16: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

6

On the other hand a Gram-positive bacterium has a thick cell wall (pepti-doglycan layer). This is important to keep in mind when targeting ex-pressed protein to the periplasmic space – or to the extracellular environ-ment. Prokaryotes can generally not perform the posttranslational modifi-cations found in many eukaryotes.

2.1.1 Gram-positive bacteria

The Gram-positive Bacillus subtilis is the most commonly used Bacilli strain for recombinant protein expression and is likely the second most used prokaryote organism after E. coli. B. subtilis is a non-pathogenic soil micro-organism Generally Regarded As Safe (GRAS) and it shows a non-biased codon usage (Li et al. 2004). One of the reasons for the wide use of B. subtilis is that the bacterium is able to secrete proteins directly into media and does not produce lipopolysaccharides (a common by-product of E.coli), which can cause degenerative disorders in animals and humans (Yin et al. 2007). Furthermore, B. subtilis can be cultivated to very high cell densities in simple medium and its growth is well characterised. One of the more unwanted properties of B. subtilis is secretion of high amounts of proteases and that it occasionally shows plasmid instability. B. subtilis accepts a broad spectrum of bacteriophages and plasmids (Yin et al. 2007), but the transformation process seems more troublesome (Terpe 2006).

Another Gram-positive soil bacterium is Corynebacterium glutamicum, cur-rently used in industrial scale for the production of the amino acids L-glu-tamic acid and L-lysine (Kalinowski et al. 2003), and classified as GRAS. C. glutamicum constitute a promising heterologous protein expression system and has the benefit of very low proteolytic activity. The bacterium is able to efficiently secrete proteins into the media and has the advantage of a broad acceptance of heterologous signals, e.g. Gram-negative promoters. Moreover, it will not form spores such as B. subtilis does (Srivastava and Deb 2005). A surface display system in C. glutamicum using porin proteins as anchor proteins was recently reported. Porins are naturally found in C. glutamicum as cell wall associated proteins involved in the uptake and passage of hydrophilic small molecules. As a model passenger protein, -amylase from Streptococcus bovis 148 was fused C terminally to porin and was found actively displayed. The results however indicate that the length of porin is slightly short (6.3 nm) in comparison to the outer layer of C. glutamicum (5-8 nm) which might affect the accessibility of the displayed

Page 17: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

2. Heterologous protein expression systems

7

amylase. A linker region between the porin and the amylase might circumvent this issue (Tateno et al. 2009). Surface display in a Gram-positive host appears attractive due to its thick and rigid cell wall.

2.1.2 Gram-negative bacteria

The Gram-negative E. coli has long been the work horse for protein expression – and is still the first host to try. There are several reasons for that: the simplicity of E. coli, fast growth, inexpensive media, well charac-terised high cell density cultivation, and well known genetics. The com-mercially available E. coli expression systems include various cell strains as well as an extensive molecular toolbox for modification: vectors, promo-ters, tags, and the possibility to co-express other proteins or tRNAs. There are however some major drawbacks in E. coli protein expression: the lack of a secretion mechanism and the limited ability to deal with disulfide bridges. A further disadvantage with E. coli is the lack of posttranslational modifications, which is a general drawback with prokaryotes host systems.

Several different promoters for protein expression in E. coli exist. Most of them are either chemical or temperature inducible (reviewed in Makrides 1996), but constitutive promoters can be found (Löfdahl et al. 1983). Most proteins are expressed by induction with either isopropyl -D-thiogalactopyranoside (IPTG) as in the commercial pET system, or by L-arabinose as in the commercial pBAD system (Sørensen and Mortensen 2005).

Protein expression in the pET system is based on the T7 promoter, which linked to the lac operator constitutes a strong and tightly regulated promoter. The T7lac promoter was used in both Papers I and II. The T7 promoter is know to be slightly leaky, but even the tighter regulated T7lac promoter gave rise to a small background expression. During the evaluation of expressed active PalB in Paper I, a low background hydrolysis was found in the uninduced samples, with tributyrin as a sub-strate. The T7lac promoter is based on the bacteriophage T7 promoter, which is not recognised by the E. coli RNA polymerase. Thus no back-ground expression occurs during the cloning work, meaning that any problems encountered here are not caused by the protein of interest. Expression is possible in a host strain carrying a genomic gene for T7 RNA polymerase - (DE3) type strain, which in addition carry the lac

Page 18: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

8

Figure 2-1. Recom

binant protein expression in the pET system.

The T7 promoter on the pET vector is regulated by the lac

operator, followed by the A

TG start codon, a M

ultiple Cloning Site for insertion of Your Favourite G

ene and a T7 terminator signal.

Resistance is plasmid dependant and varies in the pET system

. The plasm

id furthermore contains an origin of replication for propa-

gation in E. coli. (A

dapted from Sørensen and M

ortensen 2005).

Page 19: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

2. Heterologous protein expression systems

9

repressor, lacUV5 promoter, and T7 RNA polymerase. The lac repressor regulates the transcription of the T7 RNA polymerase, which upon induc-tion with IPTG is released whereby the T7 RNA polymerase is transcri-bed. The T7 RNA polymerase then transcribes the T7 promoter and ex-pression of recombinant protein takes place, see Figure 2-1 (reviewed in Sørensen and Mortensen 2005; Terpe 2006).

Recombinant proteins expressed in E. coli are normally expressed in the cytosolic space, but they can be directed to the periplasmic space. Secre-tion into media is limited in Gram-negative bacteria such as E. coli. In Paper I active PalB was expressed to appear either in the cytosol or in the periplasmic space. The yield of active enzyme was significantly higher in the periplasmic space. Normally expression in the cytoplasm is preferred due to the high production yields, but the formation of disulfide bridges is problematic. Directing the recombinant protein to the periplasm facilitates the formation of disulfide bonds, as the environment is less reducing than in the cytoplasm. Foldases such as the Dsb system, which are actively involved in the formation of disulfide bonds, are found in the periplasmic space (Rietsch & Beckwith, 1998). Furthermore, there are fewer host proteins in the periplasmic space facilitating the following protein purify-cation. Expressed proteins are directed to the periplasm by N-terminal fusion to a leader sequence. In Papers I and II the PelB was used, but several others are found, such as OmpA, OmpC, OmpF, OmpT, PhoA, PhoE, SpA, Tat, MalE and LamB (Blight et al. 1994; Mergulhao et al. 2005; Terpe 2006). Proteolysis in the periplasmic space caused by envelope proteases is one of the problems encountered when directing expressed proteins to that compartment (Jonasson et al. 2002). Other obstacles might be insufficient translocation through the inner membrane – a complex and not completely understood process.

E. coli normally does not secrete proteins into the extracellular environ-ment, but recombinant proteins can be secreted. There are two types of secretion mechanisms in E coli. In type I, the protein is secreted in a one step process across the inner and outer membranes via the -haemolysin pathway of E. coli. The type II mechanism is a two step process, guiding the protein across the membranes one at a time, which is a more complex situation (Mergulhao et al. 2005). Although it sounds quite attractive to secrete proteins in E. coli, the process is complicated and limited, which is likely why it is not more widely used today.

Page 20: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

10

It is also possible to display recombinant proteins on the cell surface of E. coli, which can be useful in for example whole cell biocatalysis. One example is the display of PalB on the cell surface of E. coli, using PgsA anchor protein whereby an activity of 4.6 U/g (dry cell weight) was ob-tained (Narita et al. 2006). As a comparison the commercial Novozyme preparation marketed by Sigma-Aldrich has an activity of more than 10 000 U/g. Another example is the autodisplay system developed on the basis of the AIDA-I autotransporter protein to which a passenger protein is attached using a linker followed by a signal peptide. The signal peptide guides the transport of the precursor protein across the inner membrane. In the periplasm the precursor folds as a -barrel within the outer mem-brane and the passenger protein is transported to the outer surface (Jose 2006)

2.1.3 Optimising protein expression in E. coli

There is a variety of ways to increase the expressed protein yield, assist protein maturation, and generally stabilise the protein. One approach is to slow down the transcription process. Decreasing the temperature is the most common way of slowing down the cells (decreasing the specific growth rate) and thus slowing down the transcription rate. This does not affect the translation rate equally much and therefore gives the cells more time to correctly process the proteins. Another way of giving the cells more time to process protein is to use a weaker promoter whereby there will be fewer transcripts per cell to take care of, or by reducing the con-centration of inducer, which theoretically should result in the same situ-ation.

Codon bias, the preference of one specific codon over other synonym codons, is known in both eukaryotes and prokaryotes (Makrides 1996). In addition, organisms tend to have their own codon preference and in some cases even own usage. Non-universal codon usage is found for example in certain Mycoplasma bateria, which use the UGA codon to incorporate tryptophan rather than a stop codon as in the universal genetic code (Inamine et al. 1990). Another example is the yeast Candida rugosa, which reads CUG, the universal genetic code for leucine, as a serine (Alberghina and Lotti 1997). During heterologous protein expression a different codon preference than E. coli means that in a translation process a shortage of certain tRNAs can arise and thus hamper the translation process slightly.

Page 21: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

2. Heterologous protein expression systems

11

There are two ways around that; co-expression of rare tRNAs or exchang-ing less frequently used codons with codons used more commonly in E. coli. The simplest solution, to supply rare tRNAs, was applied in Paper I, were PalB was expressed in Rosetta (DE3) and co-expressed with six rare tRNAs (tRNAAGG, tRNAAGA, tRNAAUA, tRNACUA, tRNACCC and tRNAGGA). In the investigation it was found that codon bias was not the only hurdle in expressing active PalB in E. coli.

Molecular chaperones assist in the assembly of the amino acid chain to a correctly folded protein, thereby minimising the formation of inclusion bodies. It is believed that the co-expression of molecular chaperones will increase the yield of correctly folded active protein, but the data so far is inconclusive (Makrides 1996). Chaperones act in concert so co-expression of just one of them might not give a beneficial result. PalB was in Paper I co-expressed with the molecular chaparones groES-groEL, which were found to have a beneficial effect on the yield of active PalB at 37 °C. This effect was however overruled by other systems at lower temperatures.

E. coli as an expression host suffers from the inability to form disulfide bridges found in many eukaryotic proteins. The reducing environment of the cytoplasmic space prevents formation of disulfide bonds needed for the formation of active protein. There are two common approaches to solve this problem in vivo, one being to direct the protein to the periplas-mic space where a group of highly oxidizing proteins catalyses the for-mation of disulfide bonds (Gasser et al. 2008). Another possibility is to express the protein in a genetically mutated E. coli strain, where the genes encoding thioredoxin reductase (trxB) and glutathione oxido-reductase (gor) are inactivated, enabling the formation of disulfide bridges (Bessette et al. 1999). Both of these approaches were applied during the expression of PalB, which contains three disulfide bridges (Paper I). Little difference between expression in the periplasm (providing rare tRNA as well) and expression in the cytosol (gor trxB strain) was found in the yield of active protein at 37 °C and 25 °C. However, at 16 °C expression in the peri-plasmic space was found to be remarkably better.

In many cases the recombinant protein expression in E. coli results in insoluble inclusion bodies. Inclusion bodies facilitate protein recovery and protect the protein from degradation. However, protein refolding is an extra step in protein production and there is no guaranty that it will be possible to obtain active protein in the end. One way to enhance protein

Page 22: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

12

solubility is the addition of N-terminal solubility tags. Ideally a solubility tag can also be used as an affinity tag. This is not always the case, and it might be necessary to add both solubility and affinity tags to a protein. Examples of common affinity tag are His6, FLAG and GST (Waugh 2005). Two of the most commonly used solubility tags are MBP and NusA, where NusA cannot be used as affinity tag.

Recent studies found that the expression of active PalB is closely associated with correct disulfide bond formation. Enhanced solubility by fusion tags did not result in a correspondingly increased yield of active PalB, suggesting that folding was not the only problem when the protein was expressed in BL21(DE3). Expression in Origami B(DE3) improved the activity significantly, implying that disulfide bond formation is a limi-ting factor (Xu et al. 2008).

Different media compositions will affect the cell growth and hence the protein expression. Richer media will generally result in a higher cell den-sity, which might not result in a higher yield of expressed protein. A clear example of that is given in Paper II, where three different media, super-broth medium, complex EnBase medium (CM) and EnBase mineral salt medium (MSM), were evaluated for the expression of active PalB in the periplamic space. The EnBase (BioSilta, Oulu, FI) technology is based on a slow nutrient release, tuneable from cultivation to cultivation by varying the concentration of -amylase responsible for the glucose release. The idea is to “mimic” bioreactor cultivation in 24-deep-well format, in a two step process, going from a fed-batch phase where -amylase is responsible for the feed addition to a batch phase where additional media is supplied and protein expression induced. This worked very well and resulted in a very high cell density. Interestingly, the highest cell density obtained with CM medium did not give a corresponding high yield of active enzyme. Instead the MSM medium resulted in a two times higher yield than achieved in shake flask cultivations (Paper I).

The combination of fed-batch and a booster in small scale is very attractive in the process of enzyme engineering, where enough protein is obtained for a first characterisation by minimal effort. Another very attrac-tive method of protein expression is to use auto-induction media, which is based on the function of the lac operon under growth. Auto-induction medium can be minimal or of a more complex nature. The key is that the

Page 23: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

2. Heterologous protein expression systems

13

medium contains three different carbon sources: glucose, glycerol, and lactose.

Glucose is the preferred carbon source during the initial growth phase where it represses the utilisation of alternative carbon sources and their metabolism. As a result, the lac operon is repressed as well, maintaining protein expression at a low level. Depletion of glucose shifts the cells towards using lactose and glycerol.

Lactose results in a production of allolactose from a promiscusous reac-tion of -galactosidase, and allolactose acts as the physiological inducer of the lac operon. Lactose can be used as a carbon and energy source by itself, but it is metabolised into glucose and galactose, where galactose is a dead end – it can not be metabolised further by BL21(DE3) cells. More importantly, lactose will induce the cells, driving all cell activities towards protein expression.

Glycerol supports growth as well as glucose, but does not prevent induction by lactose and it is therefore used as a carbon and energy source in the media (Studier 2005).

2.2 Eukaryotic expression

Proteins have been expressed in quite a diverse range of eukaryotes such as mammalian cells, insect cells, filamentous fungi and yeasts. These cells offer different positive and negative aspects, which should be carefully considered before choosing an expression system. Generally mammalian cells are used for the production of pharmaceuticals, whereas especially filamentous fungi are mainly used in large industrial processes.

2.2.1 Filamentous fungi

Filamentous fungi accounts for nearly half of the production of fermen-tation derived industrial enzymes, a global marked estimated to be worth almost 5 billion US$ in 2009 (Lubertozzi and Keasling 2009). The derived enzymes are used in a wide rage of industrial processes such as detergents (proteases, lipases) but also such as for example the production of soy sauce, bean curd seasoning, and vinegar (Machida et al. 2008). The tool-box for genetic modification of moulds is far from complete, which makes

Page 24: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

14

protein expression in these organisms slightly more complex. Moulds can exist in a perfect stage and an imperfect stage, meaning that they repro-duce both sexually and asexually, although in many cases the perfect (diploid) state is still unknown.

The metabolic products from filamentous fungi are classified as either primary or secondary. Primary metabolites from Aspergillus niger include the commercial production of citric and gluconic acid (Lubertozzi and Keasling 2009). Secondary metabolites secreted into the surrounding media as a kind of “chemical warfare” originally led to the discovery of penicillin. However, not all secondary metabolites are as useful as peni-cillin. Aflatoxin A, produced by Aspergillus flavus, is one of the most potent carcinogenic substances known today. The GRAS status of the filamentus fungi A. niger and Aspergillus oryzae make them attractive hosts for protein expression. Aspergillus nidulans is used for basic genetic research as a toxigenic model organism (Lubertozzi and Keasling 2009). A. oryzae is used for the commercial production of a lipase for laundry detergent (Christensen et al. 1988). A. oryzae is able to secrete vast amounts of protein, further enhanced by a solid-state cultivation in which ~50 g -amylase from 1 kg of wheat bran can be produced (Machida et al. 2008).

2.2.2 Yeast

Several different yeast strains have been used for heterologous protein ex-pression, where some of the most commonly used are Pichia pastoris and Saccharomyces cerevisiae. Other yeasts are emerging as promising recombinant protein expression hosts, as for example Yarrowia lipolytica.

Y. lipolytica (formerly Candida, Endomycopsis or Saccharomycopsis lipolytica) is able to grow on a range of carbon sources such as glucose, alcohols, acetatee, alkanes, fatty acids, and oils (Barth and Gaillardin 1997). Y. lipolytica has distinguished capacity to secrete several proteins into the sur-rounding environment, for example proteases, lipases and phosphatases, RNase, and an esterase (reviewed in Barth and Gaillardin, 1997). Y. lipolytica can grow as yeast cells, form hyphae and pseudohyphae depen-ding of growth conditions (Madzak et al. 2004). With the aim of develop-ing a high-throughput eukaryotic expression system, the extracellular lipase Lip2p was used as a model enzyme in Y. lipolytica. The yeast was suc-cessfully cultivated and screened in 96-microtiter plate format and was furthermore found easy to transform. Overall Y. lipolytica is a promising

Page 25: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

2. Heterologous protein expression systems

15

host for recombinant protein expression (Bordes et al. 2007). Few studies deal with glycosylation pattern in Y. lipolytica so far, but it appears to glycosylate in a “light” way and to be generally closer to the mammalian type of glycosylation than S. cerevisiae (Madzak et al. 2004).

An overwhelming wealth of information is found on “Bakers yeast” or S. cerevisiae, which is probably the best characterised eukaryote system today. The fermentation abilities of S. cerevisiae have assisted the making of bread and alcoholic beverages for centuries, and the organism is therefore GRAS. Recombinant protein genes can be maintained in S. cerevisiae with a 2 m based plasmid in high copy number or integrated into the genome in low copy form (Sudbery 1996). The -mating factor from S. cerevisiae is used as a secretion signal in both S. cerevisiae as well as in other yeasts. As a recombinant expression system, S. cerevisiae suffers from a low secretory capacity (Cereghino and Cregg 1999) and a tendency to hyper-glycosylate expressed proteins (Gellissen and Hollenberg 1997). Nonetheless, S. cerevi-siae was expression host for the first commercialised recombinant hepatitis B vaccine (Valenzuela et al. 1982). Functional expression of a fungal laccase has also been obtained in S. cerevisiae. Furthermore, the laccase was subjected to directed evolution and enzyme variants screened with S. cerevisiae as expression host. The availability of a well known expression system with stable plasmids made S. cerevisiae an optimal host in this case (Bulter et al. 2003).

P. pastoris cells stay haploid unless forced diploid by nitrogen limitation. Both stages can be maintained stable by suitable medium. Spores can be formed by shifting the diploid cells to a nitrogen limited medium. In contrast to S. cervisiae, P. pastoris does not ferment well and has a preference for respiratory growth (Cereghino and Cregg 2000). P. pastoris was origin-ally selected for its ability to utilise methanol as sole carbon source and was intended as a producer of single-cell protein to high protein animal feed. During the 1970s the oil crises caused a dramatic increase in the cost of methane and at the same time the price of soybeans, an alternative source of animal feed, fell. Production of single-cell protein from methan-ol was never favourable. Alcohol oxidase is responsible for the methanol utilisation, and the enzyme constitutes 35% of the total amount of cellular protein, when grown on methanol as sole carbon source. The protein is not found when P. pastoris is cultivated on ethanol or glucose as sole carbon source. This strong up-regulation made the promoter regulating

Page 26: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

16

alcohol oxidase gene suitable for inducible recombinant protein expres-sion, and in 1985 P. pastoris was transformed and a histidine selectable marker HIS4 isolated (Cregg et al. 1985). Today P. pastoris can be trans-formed in different ways by electroporation, lithium chloride, or by spheroplasting (Gietz RD and RA. 2001). Transformation efficiency is much lower than in for example E. coli, and this is one of the drawbacks when using P. pastoris as host for recombinant protein expression.

The P. pastoris expression system is patented and is sold under license. Different plasmids, carrying different promoters and different selections markers, are therefore available through one company. All cloning and mutation work takes place in a normal E. coli cloning strain. Transforma-tion of “your favourite gene” (YFG) into P. pastoris is a stepwise process, starting with a shuttle vector, able to propagate in E. coli but unable to ex-press in E. coli. The shuttle vectors carry in addition to a prokaryote origin of replication also in most cases two different selectable markers for pro- and eukaryote selection. When YFG is cloned into the chosen P. pastoris vector and the sequence has been confirmed correct, the plasmid is linearised using a unique restriction site and transformed into P. pastoris. The linearisation is done to stimulate the occurrence of a crossover. Inside the cell a homolog crossover event takes place, whereby the gene of interest is incorporated into the genome. The pGAP plasmid is cut in the GAP gene region in a unique restriction site not found in YFG. The crossover event can take place several times, which is a natural event during transformation into P. pastoris, occurring in 1-10% of all clones. Head-to-tail multi-copy inserts occur from this crossover event, which will affect the protein expression yield as the promoter will be present in more copies.

The pGAP vector was used in Paper III, where the PalB gene copy number was determined by RT-PCR in order to evaluate the protein expression level in comparison to the “traditional” pPIC9 plasmid. The pPIC9 vector is based on the strongly inducible AOX gene promoter. Transformation of this plasmid is more complex, as the AOX gene pro-moter in reality consists of two genes: the AOX1 gene product accounts for the majority of alcohol oxidase activity in P. pastoris while the AOX2 gene product will support a much slower growth on methanol than the AOX1 gene product. This allows for selection of different phenotypes: Mut+ (methanol utilisation plus) or MutS (methanol utilisation slow). The phenotype MutS origins from a partial deletion of the AOX1 gene. Special

Page 27: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

2. Heterologous protein expression systems

17

P. pastoris strains exist where the AOX1 gene is partially deleted (KM71) resulting in MutS transformants. Another interesting strain is MC100-3, where both the AOX1 gene and the AOX2 gene are deleted, resulting in an inability to utilize methanol as carbon source, but where methanol still can be used as inducer. It is important to characterise the phenotype before starting to express recombinant protein, in order not to drown the cells in methanol.

Integration of transformed DNA into the P. pastoris genome eliminates the risk of losing YFG, with the drawback that it is tedious to retrieve the gene back again from the genome if so wanted. An episomal expression vector for P. pastoris was successfully used to screen a genome library with the aim of setting up a high-throughput screening system. Yeast plasmid DNA was isolated back in sufficient concentrations for performing muta-genic PCR, in for example enzyme evolution (Lee et al. 2005).

P. pastoris is able to perform many of the eukaryotic posttranslational modifications, such as disulfide bond formation, proteolytic processing, and glycosylation. Glycosylation in eukaryotes is generally a species speci-fic process and differs greatly even within yeasts. P. pastoris has a different glycosylation pattern than mammalian cells and will generally not hyper-glycosylate proteins, although it is seen in some cases (Cereghino and Cregg 2000).

For recombinant protein expression in yeast constitutive or inducible promoters can be employed. The inducible AOX promoter was the first described and most widely used, but a range of other promoters exists today (Cregg et al. 1985). The constitutive GAP promoter is based on the regulation of glyceraldehyde-3-phosphate dehydrogenase. The GAP pro-moter was found to vary in strength depending on the carbon source used, where cultivation with glucose gives a higher yield than the AOX1 promoter grown on methanol (Waterham et al. 1997).

The constitutive GAP promoter was investigated with the aim of finding an expression system that would secrete high amounts of active PalB in both 96-well format and in bioreactor cultivations (Paper III). Previously the inducible AOX1 promoter had been used to express CBM-CALB successfully (Jahic et al. 2002; Rotticci-Mulder et al. 2001). P. pastoris is sensitive towards methanol concentrations higher than 3.65 g/L (Zhang et al. 2000), and in the AOX1 system, methanol is used as both inducer and carbon source, demanding a tight regulation of methanol in the media

Page 28: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

18

during cultivation. In 96-well plate format it is difficult to obtain a high degree of regulation and furthermore evaporation has to be considered as a critical factor.

Recombinant proteins expressed in P. pastoris can be produced intra-cellularly or secreted to the surrounding environment using a leader se-quence such as the -mating factor from S. cerevisiae. Secretion into the media has been done very efficiently, as exemplified in Paper III. Moreover, secretion has the advantage of not having to break the hard cell wall of yeasts to retrieve the product. In addition fewer proteins are found in the media whereby recovery of the protein is facilitated.

2.2.3 Optimising protein expression in P. pastoris

Protein expression in P. pastoris can be optimised in several ways, where logically the initial one is to choose a promoter. After cloning and trans-formation, selecting a highly producing colony can be done either by small scale expression or directly on plate by e.g. dot-blot. Multi-copy inserts can also be selected for, easiest with the pGAP vector, where the concentra-tion of selecting agent (zeocin) is gradually raised, selecting for colonies with more copies of the resistance gene (sh ble). Further optimisation will depend on which promoter - and to some extent which strain chosen, e.g. methanol inducible, deficient in one or both AOX regulating genes. To fully optimise the expression, bioreactor cultivations are needed where parameters such as oxygen level, pH, temperature and fed rate can be controlled.

Using the GAP promoter, the carbon source can be varied and will give rise to different yields of active protein as seen in Paper III. In laboratory scale, shake flask cultivations can only be optimised to a certain degree. Besides, it seldom matters whether the absolute optimal yield is obtained. Nonetheless, it is worthwhile to follow a eukaryotic culture in order to monitor how the cells are doing by measuring for example the optical density, OD600, and the pH. If the optical density of the cells is not increa-sing with time, the carbon source might be depleted or the pH might be too low. During the cultivation the pH naturally drops which is usually not compensated for in shake flask cultivations, but if the pH drops too low (as was seen in Paper III), it will decrease the growth and possibly be harmful to the expressed protein.

Page 29: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

2. Heterologous protein expression systems

19

Proteases are generally not wanted during recombinant protein expres-sion and any unwanted degradation should be kept under control. In Paper III a fusion protein with a protease susceptible linker region was used, making it possible to visualise the degree of degradation as both the fusion partner - cellulose binding protein - and PalB are visible on a strained SDS-gel. Furthermore, the activity of PalB could be measured independently of the presence or absence of the fusion partner. The pro-tease deficient P. pastoris strain SMD1168H was used in Paper III to mini-mise proteolysis. This strain suffers from the genetic changes and has a lower viability, slower growth rate and is difficult to transform (Cereghino and Cregg 2000). Proteolytic degradation can also be minimised in the process by lowering the pH of the media and the cultivation temperature as showed in a study where PalB was produced using the AOX1 promoter (Jahic et al. 2003).

2.3 Choosing a heterologous expression system

From an ethical and economical perspective there is a strong interest in avoiding higher eukaryotes (such as insect cells, mammalian cells, and whole animal) in the production of pharmaceuticals and antibodies. This is one of the driving forces in the understanding of protein expression in prokaryotes, where a fuller understanding would allow for tailored expres-sion systems and also minimise the “trial and error” approach.

Prokaryotes, especially E. coli, have the advantage of being fast growing, easy manipulated and very well characterised. Lower eukaryotes such as yeast, have the advantage of being able to perform posttranslational modi-fications and secrete into the media. An overview is given in Table 2-1.

Page 30: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

20

Table 2-1. Protein expression in various hosts.

Organism Growth Spores Manipulation Secre-tion

Glycosy-lation

E. coli Fast No Easy – well

known system Difficult/No

No

B. subtilis Fast Yes Medium Yes No

C. glutamicum Fast No Medium – not yet

established Yes No

P. pastoris Medium Only

under force

Easy Yes Yes

S. cerevisiae Medium Yes Easy Yes Yes

- highly

Y. lipolytica Medium Medium – not yet established

Yes - highly

Yes

A. niger/ A. nidulans/ A. oryzae

Slow Yes Medium/hard – not yet established

Yes - highly

Yes

2.3.1 Target protein

The impact of the target protein on the host must be considered. Is the target protein toxic to the cells? In that case, the expression must be tightly regulated and induced at the right time, knowing that this will likely kill the cells. However it is possible to over-regulate an expression system, with little or no expressed protein as a result.

What other properties does the target protein have? Does the target protein originate from an organism with different codon usage compared to the expression host? Is there a need for advanced folding and process-ing of the target protein, or will the protein need posttranslational modifi-cations to retain the active mature conformation?

Page 31: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

2. Heterologous protein expression systems

21

Posttranslational modifications and the importance of a native glycolsyl-ation pattern can make the choice of a host expression system rather straight forward or at least exclude prokaryotes as expression hosts.

However, it is in general a good idea to “keep it simple” and start with a standard E. coli expression system such as BL21(DE3) or equivalent and then gradually build up the complexity in the system if the first try did not result in active protein. An overview of advantages and disadvantages in host systems are given in Table 2-2.

2.3.2 Functional goal/regulatory issues

Especially for the production of pharmaceuticals there are strong regu-latory issues to consider. In E. coli the lipopolysaccharide by-products are unwanted, although they can be removed in the down-stream processing. In addition, the use of the selective antibiotic ampicilin, commonly used in laboratory scale, is unwanted in the production of pharmaceuticals. Induc-tion with IPTG is expensive in large industrial scale, and other means of induction should be considered.

On the other hand, recombinant protein expression in E. coli is easy, fast, inexpensive, and not based on any higher animal organism. In bio-catalysis it is generally a good choice of host system, however most large scale industrial process producing enzymes for use in biocatalysis are not produced in E coli. Eukaryote expression systems give very high yields and the product secreted directly in the medium, which is very attractive when producing a “perfect” enzyme not intended for evolution (as it is tedious to retrieve the genomically integrated gene). In yeast the inducible AOX1 promoter might not be wanted for large scale production since methanol is difficult to handle in a large amounts as well as being based on petro-leum.

Page 32: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

22

Table 2-2. Host organisms

Organism Advantages Disadvantages

E. coli Well known & developed Fast & inexpensive cultivation Many plasmids, tags etc. available Easy manipulation

Formation of lipopoly-saccharides

No posttranslational modifications

B. subtilis No lipopolysaccharides Transformed with many different bacteriophages & plasmids

Secretes well into medium

Harder than E. coli to transform

Form spores No posttranslational modifications

C. glutamium No lipopolysaccharides Accept heterologous signals including Gram-negative promoters

No posttranslational modifications

Not yet fully established

P. pastoris Does not hyperglycosylate Posttranslational modifications Does not ferment well High cell density cultivations Commercial kits

Non mammalian glycosylation pattern

Low transformation efficiency

S. cervisiae Well known Plasmids can be used

Hyperglycosylate Non mammalian glycosylation pattern

Low expression yield

Y. lipolytica High secretion of several proteins

Hyperglycosylate Non mammalian glycosylation pattern

A. niger/ A. nidulans/ A. oryzae

Secretes well Potentially very high producer Posttranslational modifications Easy cultivation Sexual reproduction possible (A. nidulans)

Toolbox for genetic manipulation still limited (e.g. plasmids)

Strain instability Spore formation

Page 33: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

2. Heterologous protein expression systems

23

2.3.3 Up-scaling/down-stream processing

Before starting to clone the process outline has to be clear, and the use of any additional affinity tags or leader sequences should be considered here. This means considering the whole process; intra or extracellular expres-sion? Will the purification strategy be based on affinity tags? These issues do to some extent depend on the scale of the process; a small lab scale with only little constraints or a large scale where for examples volumes and costs have to be considered.

In the present investigation an E. coli expression system was set up as fast growing and easily modified system in order to screen a library. The yield of active expressed protein (~5 mg/L, Paper I) was sufficient for screening purposes with the chosen substrates. However in some applica-tions a higher yield of active protein is advantageous due to much slower reactions. Higher yield of active PalB was obtained with the EnBase system (~8 mg/L, Paper II) in a 24 well plate format. To facilitate a screening procedure in 96-well format, expression directly into the medi-um was desirable. It was therefore relevant to set up an P. pastoris expres-sion system were both higher yields (~35 mg/L, Paper III) and facilitated isolation of product could be obtained. The P. pastoris system furthermore has the advantage of being directly scalable and ready for biorector cultiva-tions, whereby the total yield of active PalB will suffice for even demand-ing biocatalysis reactions (~10 g per bioreactor batch in 10 L reactor, Paper III).

Page 34: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

24

3 DNA TECHNIQUES FOR ENZYME

ENGINEERING

Evolution of enzymes is a constantly ongoing process although the progress in nature is slow seen with a human time perspective. Natural selection or “survival of the fittest” was first introduced as a hypothesis for the biological evolution of species by Charles Darwin in “On the Origin of Species” in 1859. Looking in a smaller scale it applies to en-zymes as well: it seems as there is a connection between the organism, its natural environment, the feeding energy source, and the enzymes found in the organism. Organisms living in a certain environment have developed enzyme systems to degrade and use the available “food” that might not be found in other environments. Said in other words, the specificity of the enzymes varies among species but a clue might be given when looking at the natural environment.

How the evolution of the enzymes came upon is an interesting topic in itself. Some enzymes share similar folds but catalyse different reaction types (divergent evolution). Others catalyse similar reaction types but do not share protein folds (convergent evolution). With the broader under-standing of enzyme evolution comes the desire to master the proteins and guide the evolution - or speed it up quite a bit in order to catalyse what-ever reaction might be desired. The driving force for directed enzyme evo-lution is not only a basic understanding of enzyme catalysis, there are also strong economical and environmental advantages in using enzymes as biocatalysts in many industrial processes. The enzymes generally need to be adapted or tailored to the process by for example increasing the enantioselectivity, improving thermostability, or broadening the substrate preference.

In the following chapter a few techniques used for enzyme engineering will be discussed.

3.1 Choosing an engineering strategy

Some years ago researchers were creating very large libraries and a lot of effort went into automation of colony picking, incubation hotels, and screening facilities. The libraries were generally first generations and where more or less randomised. This approach was a very important step in the

Page 35: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

3. DNA techniques for enzyme engineering

25

understanding of enzyme evolution. Today, increasingly more research is focused towards directed enzyme evolution and several generations of libraries, thereby gradually improving the enzyme for the desired catalysis under given circumstances. In a recent review strategies for finding better enzymes were discussed in more details (Kazlauskas and Bornscheuer 2009).

When choosing an engineering strategy, three major issues have to be considered:

A) How much is known about the enzyme in question?

If a good protein structure is available molecular modelling can provide substantial information about the active site and enzyme-substrate interaction as well as give information about movements, which might contribute to the overall stability. This opens up for a rational approach to engineering the enzyme.

B) What if little is known about the enzyme?

In the absence of a protein structure, homology structures might provide some guidelines about the protein fold and active site structure. If a good homology structure can be obtained, it is possible to apply a rational strategy. However, if no structural information of significant value is available, a random strategy must be the first step in engineering your favourite enzyme. It is possible that one round of random mutagenesis will identify some target sites, which can then be subjected to a rational engineering strategy.

C) Is there a good expression host system available?

This issue was discussed in Section 2.3.

Finally, it is also possible to cut some corners and just order a complete gene, designed to the specific preference, synthesised, and physically deliv-ered within two weeks. The delivery time and cost for this is developing fast in favour of the costumer, and for “difficult” genes e.g., containing repetitive base-par combinations or with a high G/C content, this is a good opportunity.

Page 36: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

26

In conclusion, choosing an enzyme engineering strategy is to some extent like buying a house. One can make a list of features and pre-ferences, but in reality other factors such as available space, time, money, and people to do the work, will affect the decision.

3.2 Examples of DNA techniques for enzyme engineering

There is an overwhelming number of approaches to modify DNA with the aim to improve enzyme properties. Hence, the discussion below will be limited to techniques involving only one gene.

3.2.1 Point mutation

Minor changes as e.g. point mutations in the amino acid sequence can have a huge impact on the enzyme’s properties. One or two amino acid substitutions may look like a straightforward example, but it is far from simple. Which one of the few hundreds amino acids in the protein sequence should be substituted and into which one of the remaining 19 amino acids? The number of combinations in a protein makes “guessing” look like trying to find a needle in a haystack. It takes a deeper understanding of the structure and function of the enzyme to build a hypothesis around a point mutation that changes the enzyme’s properties. Despite the complex task of understanding the enzyme structure/function relationship, many examples of such changes can be found in the literature. One example of changed enzyme properties in Pseudozyma antarctica lipase B (PalB) was reported, where both increased and decreased enantioselectivity was found for the selected substrates in various mutants. In that investigation the mutants: Trp104His, Thr42Val, Ser47Ala and the double mutant Thr42Val Ser47Ala were used (Rotticci et al. 2001).

Changing the enzyme function was not the goal in Paper I, where a serine was introduced instead of an asparagine on position 74 in PalB to disrupt the glycosylation sequon from recognition in P. pastoris. This substitution did not affect the yield of actively expressed PalB in P. pastoris, whereby the glycosylation – or lack of such - was shown not to influence the processing and folding of PalB in P. pastoris. This evaluation was conducted in P. pastoris to eliminate glycosylation as a hurdle in the less

Page 37: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

3. DNA techniques for enzyme engineering

27

trivial production of PalB in E. coli. Hereafter both wild-type PalB and the Asn74Ser mutant were expressed in E. coli to rule out any effect of the Asn74Ser point mutation on the protein expression level in E. coli. No impact of the point mutation was found on the yield of expressed protein. However, the specific activity of PalB was slightly affected. The specific activity of the Asn74Ser mutant was determined to 290 s-1, which was slightly lower than that of wild-type PalB, determined to 440 s-1 when expressed in E. coli (Paper I).

There are different ways of introducing a point mutation in a sequence, where the commercially available QuikChange (Stratagene 2009) and the overlap extension PCR (An et al. 2005; Ho et al. 1989) are two of the most widely used.

The overlap extension PCR (OE-PCR) technique (Ho et al. 1989), Figure 3-1, differs from the QuikChange in several ways. Mutagenic primers in both directions are designed to overlap each other and are used in two parallel reactions to introduce the mutation. In one reaction, the forward primer (a) binds (upstream) Your Favourite Gene (YFG) and as reverse the mutagenic primer (b) is used. In a parallel reaction the forward mutagenic primer (c) is used together with the reverse primer (d) binding downstream YFG. The second step in OE-PCR is recombination and amplifycation of the full-length gene by mixing the products from the two reactions (AB and CD) and amplifying it using the outer primers (a) and (d). The final product is then a mutated gene, ready for further cloning as for instance digestion with restrictions enzymes, product purification, ligation into plasmid, and finally transformation into cells. The method is more labour demanding than the QuikChange Site Directed Mutagenesis method (described below), but it circumvents the digestion of parental DNA with Dpn I, and it leaves more room for method optimisation and control. The reaction products can be visualised on agarose gels in all steps of the process, and the OE-PCR method was found very reliable in the present investigation.

Page 38: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

28

Figure 3-1. Overlap extension PCR (OE-PCR) (adapted from Ho et al., 1989)

The QuikChange Site-Directed Mutagenesis is also a commonly applied technique for site-directed mutation (Stratagene 2009). The procedure for introducing point mutations, differs from the OE-PCR in several ways as seen in the outline in Figure 3-2. Mutagenic primers are designed with a high melting temperature Tm to ensure a strong hybridisation to the tem-plate (parental) DNA. After amplification in a thermo-cycler (step 1), the methylated parental DNA is digested with Dpn I, leaving only the newly synthesised and non-methylated DNA (step 2). In the third and final step the product is transformed into E. coli in which the nicks in the DNA will be repaired. This is in principle a simple one-day procedure, however two drawbacks were encountered with this method. The method specifications include designing primers of 25 to 45 base pairs with a Tm 78 °C. Pri-mers with such a high Tm were found difficult to design without favouring the generation of secondary structures within the primers. Secondary structures within primers were in several cases a challenge hard to avoid due to the sequence of the gene and the site chosen for mutation. Highly repetitive regions or a high GC content in the DNA sequence can make it difficult to meet the demands on the primers within the kit. The second

Page 39: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

3. DNA techniques for enzyme engineering

29

weak point was found in the Dpn I digestion. If this step is not 100% complete, the parental DNA will transform better than the newly synthe-sised mutated DNA, affording a high fraction of wild-type clones. The big advantage of the method and the kit is that it is a ready-mix, contains clear specifications. and a helpful technical assistance.

Figure 3-2. QuikChange site-directed mutagenesis method, used in Papers I, IV and V. Sense DNA strand shown in light grey and antisense DNA strand in dark grey. After transformation of mutated double-stranded DNA into the E. coli strain XL10-Gold, the cells joins the nicks and the mutated gene product can be expressed directly (figure adapted from Stratagene 2009).

The QuikChange method was used to introduce point mutations in Papers I, IV and V. In Papers IV and V: a single mutant Gln46Ala was

Thermal cycles

Step 3: Transformation

Step 2: Dpn I digestion of template DNA

Step 1: Mutant Strand Synthesis (Thermal Cycling)

Perform thermal cycling to:1) Denature input DNA2) Anneal mutagenic

primers3) Extend primers

Digest methylatedand hemimethylatedDNA with Dpn I

Transform mutatedssDNA into XL10-Gold ultracompetent cells with ligate nicks

Me

Me

Me

Me

Me

Me

Thermal cycles

Step 3: Transformation

Step 2: Dpn I digestion of template DNA

Step 1: Mutant Strand Synthesis (Thermal Cycling)

Perform thermal cycling to:1) Denature input DNA2) Anneal mutagenic

primers3) Extend primers

Digest methylatedand hemimethylatedDNA with Dpn I

Transform mutatedssDNA into XL10-Gold ultracompetent cells with ligate nicks

MeMe

MeMe

Me

Me

MeMe

MeMe

Page 40: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

30

made as well as a double mutant Gln46Ala Ser47Leu. Surprisingly no protein was recovered when changing the glutamine 46 into a leucine (data not shown). The nature of these mutations (Gln46Ala and Gln46Ala Ser47Leu) will be discussed in Chapter 4. Due to lack of protein by the Gln46Leu mutant a Gln46Ala mutant was created. Thus, given more time it would be interesting to look into other amino acid substitutions on this position. One possibility would be to perform saturation mutagenesis on this site.

3.2.2 Saturation mutagenesis

Saturation mutagenesis in is itself not a technique, it rather reflects the number of possible substitutions that can be introduced. The consequence of this is that the design of the degenerated primers will regulate the number of possible substitutions. Completely random and saturated muta-genesis on a position uses a NNN mixture of all four nucleotides (A, T, C and G) in equal concentrations. This will result in a gene library represen-ting all 64 codons in the genetic code, including three terminal codons. Since there are only 20 amino acids, the genetic library will be much larger than the resulting protein library. Furthermore the codons for amino acids are not evenly distributed, e.g. there is only one codon for methonine but four codons for alanine, Table 3-2. This codon bias can be minimised by using alternative degenerate codons such as NN(G/T), which will reduce the number of possible gene variants but still include all amino acids. Other possibilities for limitations are NAN (polar amino acids) or NTN (nonpolar amino acids) (West and Hecht 1995) or (A/G/C)AN which will exclude two stop codons. In the present investigation the number of possible codons was even further limited, see section 3.2.3.

3.2.3 Directed gene evolution

Directed evolution is in a recent review described as a fitness climbing, improving the protein one amino acid at the time. This can be visualised as a gorgonian (Iciligorgia schrammi – a soft coral) or like the crown of a tree, growing from one point branching widely all the time. Each branch represents a point of evolutionary pressure where the only productive way is climbing up towards improved enzyme “fitness”. The basic idea is that there are several ways to improve an enzyme; some mutations might not

Page 41: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

3. DNA techniques for enzyme engineering

31

give a high improvement but together with other mutations the enzyme will improve overall (Tracewell and Arnold 2009).

To stay within the fitness metaphor, it matters where you start climbing. Improvements in enzyme fitness might be found far from the active site, but are more likely to be found in close proximity to the active site. This was stated and proven with the Pseudomonas fluorescens esterase, where muta-tions close to the active site was found to increase the enantioselectivity more than previously found with random mutagenesis in the complete enzyme (Park et al. 2005). This was followed up by the evaluation of published improvements of enzymes, in terms of the individual mutation’s distance to the active site of the specific enzyme. Again the overall conclusion was that it is in general better to be closer to the active site, although improvement further away are found as well (Morley and Kazlauskas 2005).

One very interesting approach to enzyme engineering is the Combina-torial Active-site Saturation Test (CAST) which was originally developed to broaden the substrate acceptance of Pseudomonas aeruginosa lipase (PAL). From the crystal structure five sites were chosen around the active site in order to broaden the substrate acceptance of PAL. Within each site, two amino acids close to one another in the protein secondary structure were chosen for saturation mutagenesis. Structural guidelines for choosing close amino acids so that a possible synergistic effect from side chains inter-actions could take place, were for an amino acid in a loop (n + 1), in sheet (n + 2), in 310 helix (n+3) or in a helix (n+4). Each library was screened individually, and mutants with a broader substrate acceptance were found (Reetz et al. 2005). In a follow-up investigation the best mutants from the first libraries were combined and a mutant was found to catalyse the hydrolysis of bulky esters 145 times faster than the wild-type PAL and showed in addition an improved enantioselctivity (Reetz et al. 2006a). The concept was further developed into iterative CASTing were a few target sites were subjected to saturation mutagenesis, resulting in separate small libraries. These small libraries were then screened indivi-dually and the overall best mutant identified. The best mutant was then used as template for another round of saturation mutagenesis on all sites, except for the site already “optimised”. This concept has successfully been used to increase protein thermostability of a Bacillus subtilis lipase (LipA) (Reetz et al. 2006b).

Page 42: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

32

Figure 3-3. Schematic presentation of the iterative CASTing approach, in the case of four sites A-D with 4! = 24 possibilities (adapted from Reetz and Sanchis, 2008)

Iterative CASTing was used for the construction and analysis of a fitness landscape for the evolution of Aspergillus niger epoxide hydrolase. A superior mutant was found after a CASTing process with 5 target sites. This gives 5! = 120 possible ways to accumulate mutations (Figure 3-3). Reetz and Sanchis interestingly found that there were several ways to get a good mutant and that the mutations had synergistic effects that accumu-lated. Hence none of the introduced changes in the best variant were significantly better or worse than the others – they were all needed (Reetz and Sanchis 2008). This is in agreement with the ideas of fitness climbing presented by Tracewell and Arnold as previously discussed.

The idea of directed libraries was deployed in Paper IV, where three amino acid positions surrounding a hypothesised water tunnel were chosen as good candidates for substitution. The three sites were chosen based on the crystal structure of PalB, as to limit the water access to the active site trough the tunnel. To prevent water from entering the tunnel, substitution with hydrophobic amino acids were necessary. Since hydro-philic amino acids were not needed, the library could be even further limited in size by excluding hydrophilic amino acids and terminal signals. The theoretical size of the library was thereby reduced from 8000 variants (with 20 possible amino acids) to 343 (with only 7 possible amino acids), Table 3-1.

Page 43: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

3. DNA techniques for enzyme engineering

33

Table 3-1. The possible number of enzyme variants when mutating one to three sites with saturation mutagenesis (20 amino acids) and a limited selection (7 amino acids)

Number Number of possible amino acidsof sites 20 7

1 20 7 2 400 49 3 8000 343

The obvious benefit from these limitations was a drastic reduction of

the size of the library. It was also more focused, making the following screening less tedious. Designing the focused library was not trivial. The following amino acids were allowed on all three target positions in the library: alanine, valine, leucine, isoleucine, methionine, proline and threo-nine. Threonine was not desired, but in order to avoid threonine, either isoleucine and methionine, or alanine and proline would have been lost as well, see Table 3-2. In order to get this mixture of amino acids (Ala, Val, Leu, Ile, Met, Pro and Thr), degenerate primers were designed with a non-linear distribution of bases.

The degenerate position was defined by VYK, with V(G=40.0%, A=40.0%, C=20.0%), Y(T=80.2%, C=19.8%) and K(G=49.8%, T=50.2%). On the first position G, A, and C were possible, on the second position only T and C were allowed and on the third position G and T were allowed (1st, 2nd and 3rd in Table 3-2, respectively). This gave the desired distribution among amino acids highlighted in blue in Table 3-2.

The mixture of nucleotides were optimized to produce specified amino acid sub-populations using an algorithm developed by (Jensen et al. 1998). A number of clones were sequenced after mutagenesis, and the distri-bution among the found mutations was in very good agreement with the designed mixture (Table 1 in Paper V).

Several different methods were evaluated with the aim of creating the library screened in Paper IV. There are many different methods available for setting up a small library. Even though they in principal are general methods, it was less trivial to adapt them to PalB. It is likely that the nature of some genes, such as highly repetitive and/or GC rich regions, disfavour the implementation of a specific method or makes the fine tuning of the method challenging. This was the case when the MUPREC method was adapted to PalB with the aim of creating small libraries.

Page 44: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

34

Table 3-2. The universal genetic code. The amino acids encoded by the degenerated primers used in the library in Paper IV are highlighted in blue.

Position in code 1st 2nd 3rd

A U G C

Lys Ile Arg Thr A Asn Ile Ser Thr U Lys Met Arg Thr G

A

Asn Ile Ser Thr C

STOP Leu STOP Ser A Tyr Phe Cys Ser U

STOP Leu Trp Ser G

U

Tyr Phe Cys Ser C

Glu Val Gly Ala A Asp Val Gly Ala U Glu Val Gly Ala G

G

Asp Val Gly Ala C

Gln Leu Arg Pro A His Leu Arg Pro U Gln Leu Arg Pro G

C

His Leu Arg Pro C

The Multiplex-PCR-based Recombination (MUPREC) method was introduced as a high-fidelity method for directed evolution and has the advantage that it does not require digestion of parental DNA. Bacillus subtilis lipase A was subjected to directed evolution using MUPREC, whereby the enantioselectivity was improved (Eggert et al. 2005). The method can be used in two ways, as a one-step or as a two-step procedure where the two-step procedure leaves room for optimisation Figure 3-4. The principle behind the method is that Your Favorite Gene (YFG) is available in two different plasmids, in this case pET and pGAP and requires a two-step PCR procedure. The first PCR reaction contains a for-ward primer (a) to the pET plasmid binding just outside YFG plus for-ward and reverse mutagenic primers to all mutation sites. This results in gene fragments of various lengths. In the second PCR step YFG from the pGAP plasmid is added to the first PCR product along with a reverse outer primer (b) to pGAP. In this second PCR step recombination occurs, and a mutated full length of YFG is obtained.

Page 45: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

3. DNA techniques for enzyme engineering

35

Figure 3-4. Schematic presentation of the MUPREC method. Primer pairs carrying point mutations are used in mixtures of both sense and anti-sense primers in a multiplex-PCR to amplify gene fragments. These gene fragments are recombined in a second PCR. The distribution among formed gene fragments during the multiplex-PCR can be directly monitored during the two step procedure. In the 2-step protocol, the first step is multi-plex PCR which is followed by a recombination PCR in the second step. The 1-step protocol will generate fragments and megaprimers in the early cycles and fragment assembly and recombination in the late cycles. (Adapted from Eggert et al., 2005).

PalBPalB gene (pET22)gene (pET22)

a

b

c

a

PalBPalB gene (pET22)gene (pET22)

a

PalBPalB gene (gene (pGAPpGAP))

b

1-st

ep P

CR

2-st

ep P

CR

Multiplex-PCR

Recombination PCR

PalBPalB gene (gene (pGAPpGAP))

PalBPalB gene (pET22)gene (pET22)

a

b

c

a

PalBPalB gene (pET22)gene (pET22)

a

PalBPalB gene (gene (pGAPpGAP))

b

1-st

ep P

CR

2-st

ep P

CR

Multiplex-PCR

Recombination PCR

PalBPalB gene (gene (pGAPpGAP))

Page 46: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

36

In the one-step PCR procedure all of the components are mixed at once and the resulting products are the same as in the two-step procedure. Although the one-step PCR procedure is convenient, it provides little information about the distribution among the individual amplified gene fragments. One important issue with the method is that the first point of mutation downstream of the forward outer primer to plasmid (A) is likely to be overrepresented in the final library mixture. The concentration of the added mutagenesis primers must therefore be titrated to give a rela-tively even distribution among the amplified gene fragments. Furthermore, it is necessary to gel purify the final product to remove the whole plasmid templates as well as clone the mutated gene into a plasmid again. This approach gives a high degree of control over the process, but was unfor-tunately unsuccessful when applied to PalB.

Similar to the well known QuikChange method, the QuikChange Multi Site method takes advantage of the Dpn I digestion of the methylated parental template DNA, whereby only newly synthesized mutant DNA is preserved (Stratagene 2006). In the present investigation several attempts to apply the QuikChange Multi Site Mutagenesis kit on PalB failed, likely due to formation of secondary structures within the primers as well as primer displacement by the polymerase. Therefore alternative methods were evaluated.

An attractive method was developed for changing several codons in the Mycoplasma pneumoniae in one reaction (Hames et al. 2005). M. pneumoniae has a non-universal codon usage by which the UGA codon encodes tryptophan rather than a stop codon as in the universal code. This is a severe problem when expressing M. pneumoniae proteins in heterologous host systems such as E. coli. Hames and co-workers therefore developed the Multiple-Mutation Reaction (MMR), which can introduce up to nine site-directed mutations in a single experiment. The principle of the method is that degenerate primers are designed to hybridize stronger to the tem-plate than the external primers. The mutagenic primers must furtherrmore be phosphorylated at their 5’-end so that the thermostable DNA ligase is able to ligate these to the 3’-end of the upstream extended primers. To prevent degradation of extended primers the employed DNA polymerase must not show any 5’ 3’ exonuclease activity. The resulting mutated gene can then be cloned for further use. See Figure 3-5.

In the present investigation it was undesirable to have an additional clo-ning step, which the MMR method would cause. The polymerase was

Page 47: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

3. DNA techniques for enzyme engineering

37

therefore allowed to continue round the plasmid, and the resulting method was a combination of the MMR and QuikChange Multi Site, outlined in Figure 3-6. In this method only mutagenic primers and no external primers are employed, whereby the polymerase is set to amplify the whole plasmid.

Figure 3-5. Strategy used in the Multiple-Mutation Reaction (MMR) for the mutagenesis and amplification of a gene. All mutagenic primers used are phos-phorylated at the 5’-end. (Adapted from Hames et al., 2005).

The method outlined in Figure 3-6 was successfully adapted and used in Paper IV after a system optimisation. There are some possible drawbacks with this rather direct approach. The final product was transformed direct-ly into an expression host Rosetta (DE3), which already carry one plasmid coding for rare tRNAs. This might result in decreased transformation efficiency due to an overall high metabolic burden. In addition the Dpn I digestion is a crucial step in the method – an incomplete digestion will contaminate the library with template DNA (in this case wild-type PalB), which was seen during the investigation leading to Paper IV.

3.2.4 Permutation

A completely different approach to mutations was employed to Pseudozyma antarctica lipase B (PalB) by Lutz and co-workers. They hypothesised that an internal relocation of PalB’s N and C terminal to a close proximity of the active site could increase the flexibility of the structure and hence the probability of a broader substrate acceptance. To test the hypothesis PalB was subjected to circular permutation. First the PalB gene was amplified adding the same restriction site to both the 5’- and 3’-ends of the gene, whereby compatible ends for ligation were obtained. This construct was intralinked resulting in a circular gene corresponding to PalB joined N- to C-terminals with a six amino acid long linker. Having a circular PalB gene, random partial digestion was applied whereby linear genes were achieved.

Page 48: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

38

These were in turn ligated to the expression vector and transformed into the Pichia pastoris expression host for screening and characterisation. Indeed variants with improved catalysis towards selected substrates were found. Even more interestingly, it was discovered that increased flexibility in close proximity to the active site residues seems to have a negative effect on catalysis. On the other hand, regions contributing to the topology of the active side - without taking part in the catalysis - can be relaxed with a beneficial effect (Qian and Lutz 2005).

Figure 3-6. Principle mutagenesis method used to in Paper IV. Sense DNA strand shown in light grey and antisense DNA strand in dark grey. After trans-formation of mutated single stranded DNA into the E. coli strain Rosetta (DE3), the cells fills in the complementary stand, and the mutated gene product can be expressed directly (figure adapted from Stratagene 2006).

Me

Me

Me

Me

MeMe

Thermal cycles

Step 3: Transformation

Step 2: Dpn I digestion of template DNA

Step 1: Mutant Strand Synthesis (Thermal Cycling)

(Predominant productfrom Step 1)

Perform thermal cycling to:1) Denature input DNA2) Anneal mutagenic

primers (all primers bind to same strand)

3) Extend primers and ligate nicks with thermostable ligase

Digest methylatedand hemimethylatedDNA with Dpn I

Transform mutatedssDNA into competentcells

MeMe Me

Me P

P

P

P

Me

Me

Me

Me

MeMe

Thermal cycles

Step 3: Transformation

Step 2: Dpn I digestion of template DNA

Step 1: Mutant Strand Synthesis (Thermal Cycling)

(Predominant productfrom Step 1)

Perform thermal cycling to:1) Denature input DNA2) Anneal mutagenic

primers (all primers bind to same strand)

3) Extend primers and ligate nicks with thermostable ligase

Digest methylatedand hemimethylatedDNA with Dpn I

Transform mutatedssDNA into competentcells

MeMe Me

Me P

P

P

P

P

P

P

P

Page 49: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

3. DNA techniques for enzyme engineering

39

3.2.5 Random mutagenesis

Introduction of random mutations in a gene is easily achieved using error-prone PCR (EP-PCR). It is also the most commonly used method to introduce random mutations. The principle of EP-PCR is that PCR based synthesis tends to produce random sequence errors, especially when using a DNA polymerase without proofreading abilities like Taq. The sponta-neous error rate in Taq is too low for EP-PCR and is therefore enhanced in different ways. Commonly used approaches to enhance the error rate include changing the composition of the PCR buffer (e.g., adding magne-sium chloride, manganese chloride or rising the pH), or by changing the ratio of the added dNTPs creating a biased mixture of the four nucleotides by mixing the dNTPs in concentrations differing from 10 up to 1000 times in between the four. In addition high amounts of DNA polymerase can be used, as well as long extension time or a low annealing temperature (Cadwell and Joyce 1992; Ling and Robinson 1997). The exact reaction conditions can be titrated to a desired mutagenesis load where the optimal load will be dependent on the target protein. Drummond and co-authors showed that a low mutagenesis rate results in many functional variants, but few unique ones, while a high mutagenic load resulted in few func-tional variants, but mostly unique variants (Drummond et al. 2005). Muta-tions introduced with EP-PCR tends to be biased, where transitions A:T -> G:C and G:C -> A:T are more common than transversions (A:T -> T:A, A:T -> C:G, G:C -> T:A or G:C -> C:G). This is important to keep in mind during the evaluation of the mutagenesis (Rasila et al. 2009).

3.3 Final remarks on DNA techniques for enzyme engineering

In the present chapter a brief introduction to DNA techniques using one gene as a template has been given. The engineering strategy chosen in the present investigation (section 3.1) was based on the fact that there are several good crystal structures of PalB available. Since there is plenty of in-formation available on PalB, a random approach was not considered. A feasible expression system for PalB was not available and was therefore developed (Papers I, II and III). Based on molecular modelling a few target sites were chosen for point mutations, resulting in a semi-rational

Page 50: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

40

approach for the creation of enzyme variants (Paper IV). The enzyme library was screened for good variants and position 47 was found impor-tant for the water tunnel. Encouraged by these results further molecular modelling concluded that also position 46 was a good target site. The best mutant from the screening was then subjected to site-directed mutagenesis on position 46 and characterised (Papers IV and V).

When choosing a host system for the recombinant protein expression, the conclusion from Chapter 2 was that it is important to know as much as possible about the protein of interest in order to facilitate protein ex-pression. This conclusion is true for choosing a DNA technique for en-zyme engineering as well. With more information available, a more direc-ted/rational approach can be used.

Page 51: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

41

4 SUBSTRATE TUNNELS IN ENZYMES

Enzymes are able to catalyze chemical reactions in a specific manner. Some of these reactions can also be performed by organic synthesis, but this may require protecting groups to guide the reaction in the desired direction. A similar “protecting” approach is used by some enzymes, but instead of adding protecting groups, they shelter the compound itself in a intramolecular tunnel or a channel. In the following a “tunnel” will be used to describe the passage of a compound from one point to another within the protein structure.

A highly advanced infrastructure is found in carbamoyl phosphate synthetase in which two intramolecular tunnel segments connect three active sites. Together the tunnels cover a distance of nearly 100 Å. The complex pathway is used for the tunnelling of ammonia. Hydrolysis of glutamine results in ammonia, which travels the first half of the tunnel to react with bicarbonate. The produced carbamate travels through the second half of the tunnel where it is phosphorylated by an ATP, and the ultimate product carbamoyl phosphate is formed (Weeks et al. 2006).

Lipases act on fat and oils, whereby the active site will be buried in a highly hydrophobic surface/solution. For PalB and other lipases, the subs-trate will enter the active site, and formation of the acyl enzyme will take place (Scheme 1 in Paper V and Figure 1-3). A nucleophile is required for the next step to take place (hydrolysis or transacylation). Water is normally the nucleophile for PalB, but it is unlikely that water passes trough the hydrophobic solution, which the substrate constitutes. Therefore alterna-tive routes for water to enter the active site were considered. Previously, channels on the protein surface providing the active site with water has been reported for two lipases. In these two lipases, Rhizomucor miehei and Humicola (Thermomyces) lanuginosa, channels for water transport, not surroun-ded by the protein backbone, have been identified, thereby allowing water passage to the active site (Norin et al. 1994). The idea of a special path for water supply to the active site of a lipase was recently further developed for the Thermomyces lanuginosa lipase (Santini et al. 2009). T. lanuginosa lipase displays interfacial activation regulated by the movements of a lid. For this lipase molecular dynamics simulations revealed a polar channel connecting the active-site with the outside solvent. The system is suggested to be regulated by a “valve” in the form of a tyrosine which by interacting with the catalytic triad could regulate the water content of the active site.

Page 52: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

42

Figure 4-1. Space-fill model of PalB. Substrate entering the active site is shown in yellow. The entrance to the hypothesised water channel is shown in the middle of the picture, guarded by the Gln46 shown in dark red and with Ser47on the inside of the tunnel – barely visible in bright green. The glycosylation site is seen at the top of the picture. Picture based on pdb-file 1lbs (Uppenberg et al. 1995).

Likewise, a hydrophilic path in PalB was found during molecular modelling, passing from the active site through the enzyme structure and out on the side of the enzyme near the glycosylation site of PalB (Figure 4-1). Based on this modelling the sites 42, 47 and 106 were chosen as likely candidates for point mutations. A small library was designed with the aim to prohibit water from entering the active site through a hypothesised tunnel. In order to screen the library an assay was developed (Paper IV) suitable for screening in 96-microtilter plate format as well as for normal cuvettes. The assay was based on the reaction pathway of PalB, Figure 1-3, where the first step is the formation of an acyl enzyme. For the chosen reaction conditions this step was found not rate limiting (Paper IV) as the addition of nucleophile in the form of butanol resulted in a higher total rate. In the presence of two competing nucleophiles, the next step in the reaction pathway, the deacylation, will result in a distribution among the transacylation reaction and the hydrolysis.

substrateglycosylation

S47

Q46

substrateglycosylation

S47

Q46

Page 53: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

4. Substrate tunnels in enzymes

43

A soluble substrate system was used to rule out any interface effects or limitations. The assay allowed for both the transacylation and the hydro-lysis activity to be detected simultaneously (Figure 1, Paper V). The effect of substitution of amino acids on position 42, 47, and 106 within the hydrophilic water path was investigated in Paper IV,where a small semi-rational library was screened in 96-well plate format. A PalB variant, Ser47Leu, selected from the screening was found to catalyse the trans-acylation reaction 14 times faster than the hydrolysis reaction (Paper IV).

Encouraged by these results of mutating position 47, extensive mole-cular modelling was performed, resulting in a confirmation of the hypo-thesis – there is tunnel through which water can enter the active site without competing with incoming substrates and leaving products (Paper V). In an extension of the work water tunnels and channels were defined in hundreds of hydrolases (Paper V). From these modelling results, position 46 was found to be a good candidate for amino acid substitution at the entrance of the water tunnel. The double mutant Gln46Leu Ser47Leu was prepared, but failed to express any active protein. Hence the double mutant Gln46Ala Ser47Leu was prepared. This PalB variant showed both a strong increase in transacylation activity as well as inhibition of hydrolysis (Papers IV and V).

In the present investigation a complete blockage of the tunnel was not achieved and the task is far from trivial. In the example above with carbamoyl phosphate synthetase, attempts to block the tunnels for the passage of ammonia and carbamate failed and resulted in a conformational change, whereby ammonia could escape (Weeks et al. 2006).

It is likely that is necessary to substitute several amino acids around and within the water tunnel, whereby a synergistic effect can be achieved and possibly block the tunnel to a higher extent.

Page 54: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

44

5 PRESENT INVESTIGATION IN A WIDER PERSPECTIVE

Although yeasts are among the best studied microbes, our knowledge of these is still rather limited. They have nevertheless proved beyond doubt that they have an extraordinary potential, and some of the most important industrial enzymes used today do indeed originate from yeast. A deeper understanding of the yeasts might improve the understanding of the useful enzymes they produce; the enzyme substrate preference is likely linked to the natural environment in which the organism producing the enzyme lives. A particularly interesting yeast is the Pseudozyma antarctica known for the production of lipase.

Substantial research and knowledge about Pseudozyma antarctica lipase B (PalB) exits today. PalB can be used as a model system to learn more about protein-structure-function relationships, based on the know-how we have today. Moreover, PalB is a very important industrial enzyme, and its improvement of stability and selectivity is of great interest.

In summary, in the present investigation two different expression systems for PalB were developed. Directed evolution of PalB has been problematic due to the lack of a feasible and easily manipulated expression system. The E. coli system presented here could be used for directed evolution of PalB. For the production and facilitated enzyme recovery a constitutive P. pastoris expression system was developed using glycerol as carbon source. The constitutive expression system in P. pastoris was easily up-scaled from 96-deep-well plate format to bioreactor cultivations, and thus it bypasses the issue with the use of methanol.

The driving force for the development of these expression systems was to design and screen a small library of PalB variants. The library was designed semi-rationally, aiming at increased transacylation over hydrolysis activity. This was based on a hypothesised water tunnel in PalB supplying the active site with water. The experimental results together with the molecular modelling results supports the existence of such a water tunnel in PalB. This opens up for more directed evolution of PalB in order to close the water tunnel in a more efficient manner, whereby it would be possible to obtain an even higher degree of transacylation in a water environment.

Page 55: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

45

ACKNOWLEDGEMENTS

In science people become experts on a lot of things. “Expert” is of cause is a very subjective concept, but a famous Dane, Niels Bohr, defined it quite adequately as: “An expert is a person who has made all the mistakes that can be made in a very narrow field”.

Professor Karl Hult for accepting me as a student – twice. It takes a special talent to shift my interests from the colourful world of micro fungi to the less colourful, but ever so interesting world of protein (and DNA). Which you managed to do without my notice

I have indeed learned a lot, but mostly I am thankful for the opportunity to work with such wonderful people, many of them are now among my best friends!

Professor Uwe Bornscheuer for welcoming me in your lab on several occasions, for always being so positive, enthusiastic, efficient, and for finding time. Thanks to a great group - I really enjoyed the time spend in your lab!

Dr. Mats Martinelle for supervision – especially for helping to sorting out “the true colours” in the complex assay development.

Co-authors from whom I learned a lot, especially Bato who taught me how to cultivate in bioreactor and took all the nightshifts of the “cell-sitting”.

Dr. Linda Fransson for the colourful picture of PalB in the thesis.

Kaj Kauko and Anders Hamberg for the nice cover illustration.

Diploma students during the years - thank you for surviving my supervision

Past and present members of the Biocat group, especially; Linda, my counter balance in all aspects of life & science, for all support, fun & dis-cussions; Jenny for introducing me to Swedish culture (Kräftsskiva, Lidingöloppet, etc.), and for all the fun; Joke, for sharing my Alien years in Sweden and for being blond and continental (we never get it quite right do we?); Cissi, for keeping me company during many running events – as well as late nights. Distance is relative, but never far; Malin for always being positive, and for sharing my point of view on so many things; Hamberg

Page 56: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

46

for all tech support in the lab and for (mis)understanding my Swedish; Karim – beers will change the world ; Magnus for top professional advice on exercise injury minimisation/recovery.

Roommates for science discussions as well as many conversations about the weather

Lotta: For organising my life – I’m sure the world would stop spinning if you told it to. For many hours of hard work(out) and for helping me with “whatever” “whenever” I needed it. Ela: The lab is simply no fun without you!

The whole Glyco science group and former Wood biotechnology group with whom I both shared and learned many things, especially Stuart, Kristina, Emma, TC, Åsa, Ulla, Alex, Anders W and Gustav.

…And to the rest of Floor 2 for the friendly atmosphere.

Neo, for science, fun and staying online.

Tak til hele min familie - nære som fjerne, men specielt til min mor og far for at I altid har bakket mig op. Kombinerer man vand, pH målinger, GC analyser og morfars kemiske virksomhed – så havner man i biokemi!

This work would never have started – nor finished - without Claus, who supported me along the very long way (“work harder not smarter” or??). For keeping track of all the little photons and electrons flying/beaming around the world, for all your cutting edge visions about the future and for simply being the best Microsoft wizard in town.… But mostly for making me smile and for always getting the most fun out of any given situation

Till världens bästa barn, Oscar och Vicky: denna bok handlar om BASILUSKER, hur dom lever, hur dom mår och vad dom gör. Dom gör förresten inte så mycket skoj – i alla fall inte när jag tittar på dom

And finally to ALL the little ”Yeast bacteria” who struggled SO hard with protein expression – may you all rest in peace…

Page 57: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

47

REFERENCES

Alberghina, L. and Lotti, M. 1997. The evolution of a non universal codon as detected in Candida rugosa lipase. J. Mol. Catal. B: Enzym. 3, 37-41.

An, Y., Ji, J., Wu, W., Lv, A., Huang, R. and Wei, Y. 2005. A rapid and effi-cient method for multiple-site mutagenesis with a modified overlap extension PCR. Appl. Microbiol. Biotechnol. 68, 774-778.

Anderson, E.M., Larsson, K.M. and Kirk, O. 1998. One biocatalyst - many applications: the use of Candida antarctica B-lipase in organic synthesis. Biocatal. Biotransform. 16, 181 - 204.

Barth, G. and Gaillardin, C. 1997. Physiology and genetics of the dimor-phic fungus Yarrowia lipolytica. FEMS Microbiol. Rev. 19, 219-237.

Bessette, P.H., Aslund, F., Beckwith, J. and Georgiou, G. 1999. Efficient folding of proteins with multiple disulfide bonds in the Escherichia coli cytoplasm. Proc. Natl. Acad. Sci. U. S. A. 96, 13703-13708.

Blight, M.A., Chervaux, C. and Holland, I.B. 1994. Protein secretion path-ways in Escherichia coli. Curr. Opin. Biotechnol. 5, 468-474.

Boekhout, T. 1995. Psedozyma Bandoni emend. Boekhout, a genus for yeast-like anamorphs of Ustilaginales. J. Gen. Appl. Microbiol. 41, 359-366.

Bordes, F., Fudalej, F., Dossat, V., Nicaud, J.-M. and Marty, A. 2007. A new recombinant protein expression system for high-throughput screening in the yeast Yarrowia lipolytica. J. Microb. Methods 70, 493-502.

Bulter, T., Alcalde, M., Sieber, V., Meinhold, P., Schlachtbauer, C. and Arnold, F.H. 2003. Functional Expression of a Fungal Laccase in Saccharomyces cerevisiae by Directed Evolution. Appl. Environ. Microbiol. 69, 987-995.

Cadwell, R.C. and Joyce, G.F. 1992. Randomization of genes by PCR mutagenesis. Genome Res. 2, 28-33.

Cereghino, G.P.L. and Cregg, J.M. 1999. Applications of yeast in biotech-nology: protein production and genetic analysis. Curr. Opin. Biotechnol. 10, 422-427.

Cereghino, J.L. and Cregg, J.M. 2000. Heterologous protein expression in the methylotrophic yeast Pichia pastoris. FEMS Microbiol. Rev. 24, 45-66.

Page 58: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

48

Christensen, T., Woeldike, H., Boel, E., Mortensen, S.B., Hjortshoej, K., Thim, L. and Hansen, M.T. 1988. High level expression of recom-binant genes in Aspergillus Oryzae. Bio/Technology 6, 1419 - 1422.

Cregg, J.M., Barringer, K.J., Hessler, A.Y. and Madden, K.R. 1985. Pichia pastoris as a host system for transformations. Mol. Cell. Biol. 5, 3376-3385.

Drummond, D.A., Iverson, B.L., Georgiou, G. and Arnold, F.H. 2005. Why high-error-rate random mutagenesis libraries are enriched in functional and improved proteins. J. Mol. Biol. 350, 806-816.

Eggert, T., Funke, S.A., Rao, N.M., Acharya, P., Krumm, H., Reetz, M.T. and Jaeger, K.-E. 2005. Multiplex-PCR-based recombination as a novel high-fidelity method for directed evolution. ChemBioChem 6, 1062-1067.

Gasser, B., Saloheimo, M., Rinas, U., Dragosits, M., Rodriguez-Carmona, E., Baumann, K., Giuliani, M., et al. 2008. Protein folding and con-formational stress in microbial cells producing recombinant proteins: a host comparative overview. Microb. Cell Fact. 7, 11.

Gellissen, G. and Hollenberg, C.P. 1997. Application of yeasts in gene expression studies: a comparison of Saccharomyces cerevisiae, Hansenula polymorpha and Kluyveromyces lactis- a review. Gene 190, 87-97.

Gietz RD and RA., W. 2001. Genetic transformation of yeast. Biotechniques 30, 816-20.

Hames, C., Halbedel, S., Schilling, O. and Stulke, J. 2005. Multiple-mutation reaction: a method for simultaneous introduction of multiple mutations into the glpK gene of Mycoplasma pneumoniae. Appl. Environ. Microbiol. 71, 4097-4100.

Ho, S.N., Hunt, H.D., Horton, R.M., Pullen, J.K. and Pease, L.R. 1989. Site-directed mutagenesis by overlap extension using the polymerase chain reaction. Gene 77, 51-59.

Inamine, J.M., Ho, K.C., Loechel, S. and Hu, P.C. 1990. Evidence that UGA is read as a tryptophan codon rather than as a stop codon by Mycoplasma pneumoniae, Mycoplasma genitalium, and Mycoplasma galli-septicum. J. Bacteriol. 172, 504-506.

Jahic, M., Gustavsson, M., Jansen, A.-K., Martinelle, M. and Enfors, S.-O. 2003. Analysis and control of proteolysis of a fusion protein in Pichia pastoris fed-batch processes. J. Biotechnol. 102, 45-53.

Page 59: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

References

49

Jahic, M., Rotticci-Mulder, J., Martinelle, M., Hult, K. and Enfors, S.-O. 2002. Modeling of growth and energy metabolism of Pichia pastoris producing a fusion protein. Bioprocess Biosyst. Eng. 24, 385-393.

Jensen, L.J., Andersen, K.V., Svendsen, A. and Kretzschmar, T. 1998. Scoring functions for computational algorithms applicable to the design of spiked oligonucleotides. Nucl. Acids Res. 26, 697-702.

Jonasson, P., Liljeqvist, S., Nygren, P.-Å. and Ståhl, S. 2002. Genetic design for facilitated production and recovery of recombinant proteins in Escherichia coli. Biotechnol. Appl. Biochem. 35, 91-105.

Jose, J. 2006. Autodisplay: efficient bacterial surface display of recombinant proteins. Appl. Microbiol. Biotechnol. 69, 607-614.

Kalinowski, J., Bathe, B., Bartels, D., Bischoff, N., Bott, M., Burkovski, A., Dusch, N., et al. 2003. The complete Corynebacterium glutamicum ATCC 13032 genome sequence and its impact on the production of -aspartate-derived amino acids and vitamins. J. Biotechnol. 104, 5-25.

Kazlauskas, R.J. and Bornscheuer, U.T. 2009. Finding better protein engineering strategies. Nat Chem Biol 5, 526-529.

Lee, C.C., Williams, T.G., Wong, D.W.S. and Robertson, G.H. 2005. An episomal expression vector for screening mutant gene libraries in Pichia pastoris. Plasmid 54, 80-85.

Li, W., Zhou, X. and Lu, P. 2004. Bottlenecks in the expression and secretion of heterologous proteins in Bacillus subtilis. Res. in Microbiol. 155, 605-610.

Ling, M.M. and Robinson, B.H. 1997. Approaches to DNA mutagenesis: An overview. Anal. Biochem. 254, 157-178.

Lubertozzi, D. and Keasling, J.D. 2009. Developing Aspergillus as a host for heterologous expression. Biotechnol. Adv. 27, 53-75.

Löfdahl, S., Guss, B., Uhlén, M., Philipson, L. and Lindberg, M. 1983. Gene for staphylococcal protein A. Proc. Natl. Acad. Sci. U. S. A. 80, 697-701.

Machida, M., Yamada, O. and Gomi, K. 2008. Genomics of Aspergillus oryzae: Learning from the History of Koji Mold and Exploration of Its Future. DNA Res. 15, 173-183.

Page 60: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

50

Madzak, C., Gaillardin, C. and Beckerich, J.-M. 2004. Heterologous protein expression and secretion in the non-conventional yeast Yarrowia lipolytica: a review. J. Biotechnol. 109, 63-81.

Makrides, S.C. 1996. Strategies for achieving high-level expression of genes in Escherichia coli. Microbiol. Rev. 60, 512 - 538.

Mergulhao, F.J.M., Summers, D.K. and Monteiro, G.A. 2005. Recombi-nant protein secretion in Escherichia coli. Biotechnol. Adv. 23, 177-202.

Morley, K.L. and Kazlauskas, R.J. 2005. Improving enzyme properties: when are closer mutations better? Trends Biotechnol. 23, 231-237.

Narita, J., Okano, K., Tateno, T., Tanino, T., Sewaki, T., Sung, M.-H., Fukuda, H., et al. 2006. Display of active enzymes on the cell surface of Escherichia coli using PgsA anchor protein and their application to bioconversion. Appl. Microbiol. Biotechnol. 70, 564-572.

Park, S., Morley, K.L., Horsman, G.P., Holmquist, M., Hult, K. and Kaz-lauskas, R.J. 2005. Focusing mutations into the P. fluorescens esterase binding site increases enantioselectivity more effectively than distant mutations. Chem. Biol. 12, 45-54.

Patkar, S.A., Björking, F., Zundel, M., Schulein, M., Svendsen, A., Heldt-Hansen, H.P. and Gormsen, E. 1993. Purification of two lipases from Candida antarctica and their inhibition by various inhibitors. Indian J. Chem., Sect. B 32, 76-80.

Qian, Z. and Lutz, S. 2005. Improving the Catalytic Activity of Candida antarctica Lipase B by Circular Permutation. J. Am. Chem. Soc. 127, 13466-13467.

Rasila, T.S., Pajunen, M.I. and Savilahti, H. 2009. Critical evaluation of random mutagenesis by error-prone polymerase chain reaction protocols, Escherichia coli mutator strain, and hydroxylamine treatment. Anal. Biochem. 388, 71-80.

Reetz, M.T., Bocola, M., Carballeira, J.D., Zha, D. and Vogel, A. 2005. Expanding the range of substrate acceptance of enzymes: Combina-torial Active-Site Saturation Test. Angew. Chem., Int. Ed. 44, 4192-4196.

Reetz, M.T., Carballeira, J.D., Peyralans, J., Höbenreich, H., Maichele, A. and Vogel, A. 2006a. Expanding the Substrate Scope of Enzymes: Combining Mutations Obtained by CASTing. Chem.--Eur. J. 12, 6031-6038.

Page 61: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

References

51

Reetz, M.T., Carballeira, J.D. and Vogel, A. 2006b. Iterative Saturation Mutagenesis on the Basis of B Factors as a Strategy for Increasing Protein Thermostability. Angewandte Chemie International Edition 45, 7745-7751.

Reetz, M.T. and Sanchis, J. 2008. Constructing and analyzing the fitness landscape of an experimental evolutionary process. ChemBioChem 9, 2260-2267.

Rotticci-Mulder, J.C., Gustavsson, M., Holmquist, M., Hult, K. and Martinelle, M. 2001. Expression in Pichia pastoris of Candida antarctica lipase B and lipase B fused to a cellulose-binding domain. Protein Expression Purif. 1, 386-392.

Rotticci, D., Rotticci-Mulder, J.C., Denman, S., Norin, T. and Hult, K. 2001. Improved enantioselectivity of a lipase by rational protein engineering. ChemBioChem 2, 766-770.

Santini, S., Crowet, J.M., Thomas, A., Paquot, M., Vandenbol, M., Thonart, P., Wathelet, J.P., et al. 2009. Study of Thermomyces lanuginosa lipase in the presence of tributyrylglycerol and water. Biophys. J. 96, 4814-4825.

Srivastava, P. and Deb, J.K. 2005. Gene expression systems in coryne-bacteria. Protein Expression Purif. 40, 221-229.

Stratagene. 2006. QuikChange multi site-directed mutagenesis kit. Rev. A.

Stratagene. 2009. QuikChange site-directed mutagenesis kit. Revision B.01.

Studier, F.W. 2005. Protein production by auto-induction in high-density shaking cultures. Protein Expression Purif. 41, 207-234.

Sudbery, P.E. 1996. The expression of recombinant proteins in yeasts. Curr. Opin. Biotechnol. 7, 517-524.

Sørensen, H.P. and Mortensen, K.K. 2005. Advanced genetic strategies for recombinant expression in Escherichia coli. J. Biotechnol. 115, 113 - 128.

Tateno, T., Hatada, K., Tanaka, T., Fukuda, H. and Kondo, A. 2009. Development of novel cell surface display in Corynebacterium glutamicum using porin. Appl. Microbiol. Biotechnol.

Terpe, K. 2006. Overview of bacterial expression systems for heterologous protein production: from molecular and biochemical fundamentals to commercial systems. Appl. Microbiol. Biotechnol. 72, 211-222.

Page 62: Expression of a lipase in prokaryote and eukaryote - DiVA Portal

52

Tracewell, C.A. and Arnold, F.H. 2009. Directed enzyme evolution: climbing fitness peaks one amino acid at a time. Curr. Opin. Chem. Biol. 13, 3-9.

Uppenberg, J., Hansen, M.T., Patkar, S. and Jones, T.A. 1994. The sequence, crystal structure determination and refinement of two crystal forms of lipase B from Candida antarctica. Structure 2, 293-308.

Uppenberg, J., Öhrner, N., Norin, M., Hult, K., Kleywegt, G.J., Patkar, S., Waagen, V., et al. 1995. Crystallographic and molecular-modeling studies of lipase B from Candida antarctica reveal a stereospecificity pocket for secondary alcohols. Biochemistry 34, 16838-16851.

Valenzuela, P., Medina, A., Rutter, W.J., Ammerer, G. and Hall, B.D. 1982. Synthesis and assembly of hepatitis B virus surface antigen particles in yeast. Nature 298, 347-350.

Waterham, H.R., Digan, M.E., Koutz, P.J., Lair, S.V. and Cregg, J.M. 1997. Isolation of the Pichia pastoris glyceraldehyde-3-phosphate dehydro-genase gene and regulation and use of its promoter. Gene 186, 37-44.

Waugh, D.S. 2005. Making the most of affinity tags. Trends Biotechnol. 23, 316-320.

Weeks, A., Lund, L. and Raushel, F.M. 2006. Tunneling of intermediates in enzyme-catalyzed reactions. Curr. Opin. Chem. Biol. 10, 465-472.

West, M.W. and Hecht, M.H. 1995. Binary patterning of polar and nonpolar amino acids in the sequences and structures of native proteins. Protein Sci. 4, 2032-2039.

Xu, Y., Yasin, A., Tang, R., Scharer, J., Moo-Young, M. and Chou, C. 2008. Heterologous expression of lipase in Escherichia coli is limited by folding and disulfide bond formation. Appl. Microbiol. Biotechnol. 81, 79-87.

Yin, J., Li, G., Ren, X. and Herrler, G. 2007. Select what you need: A comparative evaluation of the advantages and limitations of frequently used expression systems for foreign genes. J. Biotechnol. 127, 335-347.

Zhang, W., Bevins, M.A., Plantz, B.A., Smith, L.A. and Meagher, M.M. 2000. Modeling Pichia pastoris growth on methanol and optimizing the production of a recombinant protein, the heavy-chain fragment C of botulinum neurotoxin, serotype A. Biotechnol. Bioeng. 70, 1-8.