computational analysis and manipulation of the metabolic

Computational analysis and manipulation of the metabolic network of drug

producing microorganisms

INAUGURALDISSERTATION

zur Erlangung des Doktorgrades

der Fakultät für Chemie und Pharmazie

der Albert-Ludwigs-Universität Freiburg im Breisgau

Vorgelegt von

Dennis Klementz

aus Wiesbaden-Dotzheim

Freiburg 2017

Dekan: Prof. Dr. Manfred Jung

Vorsitzender des Promotionsausschusses: Prof. Dr. Stefan Weber

Referent: Prof. Dr. Stefan Günther

Korreferent: Prof. Dr. Andreas Bechthold

Drittprüfer: Prof. Dr. Oliver Einsle

Datum der Promotion: 26.02.2018

“What we observe is not nature itself, but nature exposed to our method of questioning.”

Werner Heisenberg

Index

Index

1 Abstract ......................................................................................................................... 1

2 Introduction ................................................................................................................... 3

2.1 Streptomycetes, an important source of natural drugs .................................................... 3

2.2 Nucleocidin production in Streptomyces calvus ............................................................... 6

2.3 Griseorhodin A, a telomerase inhibitor from a marine Streptomyces strain .................... 9

2.4 Genome-scale metabolic modeling and Flux Balance Analysis ....................................... 11

2.5 ‘Omics’ data and genome-scale metabolic models ......................................................... 14

2.6 Enhanced secondary metabolite production and genome-scale metabolic models ...... 16

3 Materials ....................................................................................................................... 19

3.1 Chemicals ......................................................................................................................... 19

3.2 Equipment ....................................................................................................................... 21

3.3 Consumables .................................................................................................................... 23

3.4 Strains .............................................................................................................................. 24

3.5 Programs & Services ........................................................................................................ 25

3.6 Cultivation media ............................................................................................................. 26

3.6.1 TSB (Tryptone Soy Broth) ......................................................................................... 26

3.6.2 MS (Mannitol Soy) .................................................................................................... 26

3.6.3 SG+ (Soytone Glucose) .............................................................................................. 26

3.6.4 Minor Element Solution ........................................................................................... 27

3.6.5 Buffer ........................................................................................................................ 27

3.6.6 Minimal medium ...................................................................................................... 27

Index

4 Methods ....................................................................................................................... 28

4.1 StreptomeDB ................................................................................................................... 28

4.1.1 Data collection .......................................................................................................... 28

4.1.2 Genomes, taxonomy and gene clusters ................................................................... 28

4.1.3 Generation of the phylogenetic tree ....................................................................... 28

4.2 AdpA in Streptomyces asterosporus ............................................................................... 30

4.2.1 Detection of transposons ......................................................................................... 30

4.2.2 Detection of adpA binding sites ............................................................................... 30

4.2.3 Prediction of adpA regulon ...................................................................................... 30

4.2.4 Potential disrupted genes ........................................................................................ 30

4.2.5 G+C content and skew.............................................................................................. 31

4.2.6 Dotplot ...................................................................................................................... 31

4.3 Genome-scale metabolic modeling ................................................................................. 32

4.3.1 Model reconstruction ............................................................................................... 32

4.3.2 Analysis of essential elements ................................................................................. 32

4.3.3 Correlation plot and area under the curve (AUC) .................................................... 32

4.3.4 Knockout candidate identification ........................................................................... 33

4.3.5 Integration of transcriptomic data ........................................................................... 34

4.3.6 Generation of functional subsets ............................................................................. 35

4.3.7 Minimal medium prediction and validation ............................................................. 35

4.4 Strain cultivation .............................................................................................................. 36

4.4.1 Preculture ................................................................................................................. 36

4.4.2 Permanent culture ................................................................................................... 36

4.4.3 Agar cultures ............................................................................................................ 36

Index

4.4.4 Liquid culture in Erlenmeyer flasks .......................................................................... 36

4.4.5 Dry weight measurement ......................................................................................... 36

4.4.6 Sample preparation for HPLC-MS ............................................................................ 37

4.4.7 HPLC measurement .................................................................................................. 37

5 Results .......................................................................................................................... 38

5.1 StreptomeDB update ....................................................................................................... 38

5.2 Data Overview ................................................................................................................. 38

5.3 Phylogenetic tree ............................................................................................................. 43

5.3.1 NMR and MS prediction ........................................................................................... 45

5.3.2 Scaffold browser ....................................................................................................... 47

5.4 bldA function in Streptomyces calvus ............................................................................. 49

5.4.1 Metabolic model ...................................................................................................... 49

5.4.2 Gene expression ....................................................................................................... 50

5.4.3 Protein production ................................................................................................... 61

5.5 Role of adpA function in Streptomyces asterosporus ..................................................... 67

5.5.1 Genome Overview .................................................................................................... 67

5.5.2 Putative disrupted genes ......................................................................................... 70

5.5.3 Proteomic analysis ................................................................................................... 73

5.6 Enhancement of Griseorhodin A production in Streptomyces avermitilis SUKA22 ........ 75

5.6.1 Metabolic model ...................................................................................................... 75

5.6.2 Minimal-medium validation experiment ................................................................. 76

5.6.3 Knockout candidate identification ........................................................................... 79

5.6.4 Overview of all knockout candidates ..................................................................... 112

5.7 Experimental results ...................................................................................................... 114

Index

6 Discussion & Conclusion .............................................................................................. 118

6.1 StreptomeDB ................................................................................................................. 118

6.2 The effect of bldA in Streptomyces calvus..................................................................... 121

6.3 adpA regulon in Streptomyces asterosporus ................................................................. 125

6.4 Enhancing Griseorhodin A expression in Streptomyces avermitilis SUKA22 ................ 126

6.4.1 Minimal medium validation of the model of S. avermitilis SUKA22 ...................... 126

6.4.2 Knockout candidate identification ......................................................................... 127

6.4.3 Conclusion & Future perspective ........................................................................... 133

7 References .................................................................................................................. 134

8 Licenses ....................................................................................................................... 147

9 Acknowledgments ....................................................................................................... 148

10 Publications ................................................................................................................ 149

11 Conference contributions ............................................................................................ 150

Index

Abbreviations

% percent

°C degree celcius

A adenine

abs. absolute

ACN acetonitrile

ADP adenosine diphosphate

AMP adenosine monophosphate

ATP adenosine triphosphate

AUC area under the curve

bp base pairs

C cysteine

CDS coding sequence

CoA coenzyme A

COG cluster of orthologus groups

∆ deleted gene

DNA desoxiribonucleic acid

EC enzyme class

EtOH ethanol

e.g. exempli gratia

ENA european nucleotide archive

et al. et alii

FAD flavin adenine dinucleotide

FADH flavin adenine dinucleotide (semiquinone)

FADH2 flavin adenine dinucleotide (hydroquinone)

FBA flux balance analysis

fc fold change

G guanine

g gram

H+ proton

h hour

HCl hydrochloric acid

HPLC high performance liquid chromatography

HSCoA coenzyme A

kDa kilo Dalton

KEGG Kyoto encyclopedia of genes and genomes

m mass

m/z mass per charge

MB megabyte

mbp mega base pairs

Index

MeOH methanol

min minute

ml milliliter

mm millimeter

mmol millimole

mol mole

MS mass spectrometry

MS Medium mannitol soy medium

NAD nicotinamide adenine dinucleotide (oxidized)

NADH nicotinamide adenine dinucleotide (reduced)

NADPH nicotinamide adenine dinucleotide phosphate (oxidized)

NADPH nicotinamide adenine dinucleotide phosphate (reduced)

NMR nuclear magnetic resonance spectroscopy

NRPS non-ribosomal peptide synthetase

nt nucleotide substitutions per side

PAPS 3'-phosphoadenosine-5'-phosphosulfate PAP adenosine 3',5'-bisphosphate pH potential of hydrogen

PKBC polyketide backbone chain

PKS polyketide synthase

Ppi pyrophosphate

RNA ribonucleic acid

rpm revolutions per minute

rRNA ribosomal ribonucleic acid

s seconds

SAH S-adenosly homocysteine

SAM S-adenosyl methionine

SBML systems biology markup language

T thymine

tRNA transfer ribonucleic acid

TSB tryptone soy broth

U uracil

V volt

Index

List of figures

Figure 1 Lifecycle of Streptomycetes on solid media ....................................................................... 3

Figure 2 Albert Schatz (left) and Selman Waksman (right) discoverers of Streptomycin................ 4

Figure 3 Bioactive compounds from Streptomyces species ............................................................ 4

Figure 4 Fluorinated natural compounds ......................................................................................... 6

Figure 5 Proposed effects of bldA (25) ............................................................................................. 6

Figure 6 Pointmutation of bldA A ..................................................................................................... 7

Figure 7 Annimycin ........................................................................................................................... 8

Figure 8 Biosynthesis of Rubromycines ........................................................................................... 9

Figure 9 Overview of the important principles and steps in FBA .................................................. 12

Figure 10 E-Flux Method ................................................................................................................ 15

Figure 11 Paclitaxel......................................................................................................................... 16

Figure 12 Genome of S. avermitilis wildtype compared to genome of S. avermitilis SUKA22 ...... 17

Figure 13 Flowchart phylogenetic tree generation ........................................................................ 29

Figure 14 Flowchart knockout candidate identification ................................................................ 34

Figure 15 Coverage of papers reviewed for StreptomeDB ............................................................ 39

Figure 16 Distribution of biological activities among annotated compounds in StreptomeDB .... 40

Figure 17 Distribution of synthesis pathways among annotated compounds in StreptomeDB ... 41

Figure 18 Molecular weight distribution of StreptomeDB compared to DrugBank ...................... 42

Figure 19 Vizualization of the phylogenetic tree ........................................................................... 44

Figure 20 Visualization of predicted mass fragmentation products .............................................. 45

Figure 21 Visualization of NMR data .............................................................................................. 46

Figure 22 Scaffold browser of StreptomeDB ................................................................................. 47

Figure 23 Overview of gene transcription levels............................................................................ 51

Figure 24 Gene regulation in COGs ................................................................................................ 52

Figure 25 Overview of transcriptomic and proteomic data ........................................................... 54

Figure 26 Fold change density in subsystems ................................................................................ 56

Figure 27 Overview of genes in predicted subsystems .................................................................. 57

Figure 28 Correlation curves of Biomass vs Secondary Metabolite production ............................ 58

Index

Figure 29 Glyoxylate and Dicarboxylate metabolism from KEGG .................................................. 60

Figure 30 Hydroxypyruvate isomerase reaction overview from KEGG.......................................... 60

Figure 31 Overview of proteins related to Primary Metabolism organized by predicted functional

subsets ............................................................................................................................................ 66

Figure 32 Overview of Streptomyces asterosporus genome properties ....................................... 67

Figure 33 Dotplot of Nucleocidin cluster in S. calvus and S. asterosporus .................................... 69

Figure 34 Phylogenetic tree based on 16S rRNA............................................................................ 70

Figure 35 Proteomic results and adpA regulon .............................................................................. 73

Figure 36 Griseorhodin A synthesis including co-factors ............................................................... 75

Figure 37 pyrB in pyrimidine metabolism ...................................................................................... 84

Figure 38 pyrB in propanoate metabolism .................................................................................... 86

Figure 39 kbl in glycine, serine and threonine metabolism ........................................................... 90

Figure 40 pta in retinol metabolism ............................................................................................... 92

Figure 41 pta in pyruvate metabolism ........................................................................................... 94

Figure 42 pta in propanoate metabolism....................................................................................... 95

Figure 43 fadE4 in fatty acid degradation ...................................................................................... 97

Figure 44 fadE4 in valine, leucine and isoleucine degradation ...................................................... 99

Figure 45 rocA in arginine and proline metabolism ..................................................................... 107

Figure 46 plcA in ether lipid metabolism ..................................................................................... 109

Figure 47 rpe in pentose phosphate pathway ............................................................................. 111

Figure 48 Overview of the simulated effect of all knockout candidates with relative biomass

production values ......................................................................................................................... 112

Figure 49 Overview of the simulated effect of all knockout candidates with absolute biomass

production values ......................................................................................................................... 112

Figure 50 Differing morphology of the different Griseorhodin A producing SUKA22 mutants ... 114

Figure 51 Dry weights of all mutants and wildtype after 10 days of cultivation in TSB liquid

medium ........................................................................................................................................ 115

Figure 52 Varying Griseorhodin production of the different SUKA22 mutants ........................... 116

Figure 53 Model based data integration and interpretation ....................................................... 122

Index

List of tables

Table 1 HPLC gradient pattern ....................................................................................................... 37

Table 2 Content overview of StreptomeDB in 2013 and 2015 ...................................................... 38

Table 3 Number of rRNA sequences for each filtering step........................................................... 43

Table 4 Number of scaffolds per scaffold-level ............................................................................. 48

Table 5 Predicted functional subsets ............................................................................................. 55

Table 6 Key values from correlation plots of bldA effect on energy distribution .......................... 59

Table 7 Overproduced proteins with active bldA in Streptomyces calvus ..................................... 61

Table 8 Predicted rRNA genes in S. asterosporus .......................................................................... 68

Table 9 Mobile genetic elements in S. asterosporus predicted by ISfinder ................................. 70

Table 10 Comparison of correlation simulation of Griseorhodin A and biomass production in

wildtype and SUKA22 ..................................................................................................................... 80

Table 11 Knockout candidates competing for NADPH ................................................................... 81

Table 12 Knockout candidates competing for Malonyl-CoA. ......................................................... 83

Table 13 Knockout candidates competing for Acetyl-CoA. ............................................................ 87

Table 14 Knockout candidates competing for FAD ........................................................................ 96

Table 15 Knockout candidates competing for NAD ..................................................................... 101

Table 16 Summary of all knockout candidates ............................................................................ 113

Abstract

1

1 Abstract

Streptomycetes are an important source of biogenic drugs. The production of these compounds

is often associated with complex changes within these organisms. In addition to morphological

differentiation, such as the formation of aerial hyphae and spore chains, the cells globally

reorganize their metabolic machinery. This phenomenon is known as metabolic switch. Despite

these extensive metabolic changes, however, the yield in the production of these natural

substances is typically very low.

Genome-scale metabolic models are comprehensive mathematical networks of reactions and

metabolites occurring within an organism. The generation of such a model requires extensive

amounts of background information. Comprehensive scientific databases, such as StreptomeDB

presented in this work, are a valuable tool for the creation of these models. Simulations based

on such models can grant profound insights of the interrelationships of complex metabolic

changes and the underlying regulatory context. This knowledge can in turn be used to efficiently

develop rational metabolic engineering strategies.

In the course of this work, we used a genome-scale metabolic model of Streptomyces calvus to

interpret the effect of the pleiotropic regulator bldA, a key player in the metabolic switch of this

strain (1). For that, we gathered proteomic and transcriptomic data of the naturally bldA

deficient wildtype strain and a mutant with restored bldA function. With the help of the

genome-scale metabolic model, we could organize the data in a functional context and predict

their regulatory effect on the metabolic machinery. The herein gained insights on the natural

regulation of metabolic flux between primary and secondary metabolism were used to develop

new methods for the generation of metabolic engineering strategies. Contrary to common

methods, the aim of this approach is not to directly increase the secondary metabolite

production rate, but to improve the ratio of produced secondary metabolite per produced

biomass by decreasing the efficiency of the competing primary metabolism. This strategy was

used with a genome-scale metabolic model of the already genome minimized host strain

Streptomyces avermitilis SUKA22 to identify knockout candidates that have the potential to

redirect the flux of metabolites and energy in favor of the production of Griseorhodin A, a

Abstract

2

promising telomerase and viral reverse transcriptase inhibitor (2, 3). First semi quantitative

experiments indicate, that growth and production rate are influenced by these knockouts.

Introduction

3

2 Introduction

2.1 Streptomycetes, an important source of natural drugs

Streptomycetes are a group of ubiquitous gram-positive soil bacteria that can be found from the

bottom of the sea to even burning coal seam (4, 5). Their unusual lifecycle on solid media led to

a false classification as fungi when they were discovered. After forming vegetative hyphae that

reach inside the medium, they start forming aerial hyphae when aging. Finally, these areal

hyphae differentiate into spore chains. (6, 7).

Figure 1: Lifecycle of Streptomycetes on solid media. Starting with a single spore, a vegetative mycelium is formed, followed by

aerial hyphae in later phases of culture growth. These aerial hyphae finally differentiate into spore chains. Modified from (7) and

(6).

Introduction

4

Researchers invested huge efforts in exploring this genus since Albert Schatz from the group of

Selman Waksman could isolate Streptomycin from a sample of Streptomyces griseus in 1943,

which was at that time the first effective

antibiotic against Tuberculosis (8). Due

to the exceptional diversity of secondary

metabolite production in

Streptomycetes, this interest has led to

the discovery of an immense number of

natural compounds, representing about

60% of all drugs in clinical use (9).

Blockbuster drugs as the antibiotics Tetracyclin, Daptomycine and Chloramphenicol, the anti-

parasitic Avermectin, the immunosuppressant Rapamycin or the lipase inhibitor Lipstatin are

just a few well known examples from the plethora of bioactive compounds produced exclusively

by Streptomycetes (10–15).

Figure 3: Bioactive compounds from Streptomyces species: Lipstatin (1) Tatracyclin (2) and Daptomycin (3)

Figure 2: Albert Schatz (left) and Selman Waksman (right) discoverers of Streptomycin. (http://sebsnjaes250.rutgers.edu/)

Introduction

5

The important role of these products for human society was most recently honored in 2015 by

presenting the Nobel-prize in medicine to William C. Campbell and Satoshi Ōmura for the

discovery of Avermectines in Streptomyces avermitilis (16). Even though being examined for

nearly 80 years, these bacteria are still a source of novel discoveries and potential new drug

substances (17). Throughout these decades, researchers had various interests and demands in

their studies. In the first half of the 20th century, infectious diseases were a common cause of

death, so the search for new antibiotics was a typical goal in working with Streptomycetes. This

interest changed over the time from infectious to cardiovascular diseases and lately anti-cancer

drugs. Nevertheless, the broad interest in compounds produced by these bacteria also lead to a

huge number of molecules that were isolated but did not show activity to the interest of their

age. These compounds are a rich source of potential new drug substances whose activities have

to be elucidated yet. A comprehensive collection of these compounds can be found in

StreptomeDB1. With more than 4.000 substances, it is currently the largest collection for this

genus and holds a broad spectrum of different background information (Chapter 6.1). Even

though there are already thousands of products known, their number is still increasing rapidly

with emerging methods (17, 18).

1 http://pharmazeutische-bioinformatik.de/streptomedb/

http://pharmazeutische-bioinformatik.de/streptomedb/

Introduction

6

2.2 Nucleocidin production in Streptomyces calvus

An interesting example is the re-discovery of Nucleocidin production in Streptomyces calvus.

This strain shows a for Streptomycetes unusual deficiency of sporulation and is therefore often

described as ‘bald’ (19). The American Cynamid company was the first to report Nucleocidin

production in S. calvus in 1956 (20). Nucleocidin shows antimicrobial and anti-trypanosomal

activity but is above all special amongst

natural compounds for being

fluorinated (20). Only five fluorinated

natural compounds are known so far

(21, 22). The biosynthesis of Nucleocidin

is not yet fully understood and still a

topic of interest for researchers around

the world (23). The latest results from

Bartholomé et al. showed by radioactive

labeling, that the fluorination does not

take place at an early stage of sugar

formation (24). However, after the first reports of Nucleocidin, it was not possible to establish

its production again for over 60 years (1).

Streptomycetes have a very

complex regulatory system that

is controlling morphological

differentiation and production

of secondary metabolites. This is

one of the reasons why usually

only a small number of the

secondary metabolite clusters of

a Streptomyces strain is

expressed under laboratory

Figure 4: Fluorinated natural compounds: Nucleocidin (1), Fluoroacetone (2), Fluoroacetate (3); Fluorothreonine (4), Fluorooleicacid (5) and Fluorocitrate (6)

Figure 5: Proposed effects of bldA (25)

Introduction

7

conditions. These regulation mechanisms can be divided in cluster specific and pleiotropic

regulation. The latter usually also interacts with the former and a whole network of other

regulators. One of the best studied pleiotropic regulators in Streptomycetes is a gene called

bldA. It is encoding for a t-RNA that is supplying TTA or UUA codons with Leucine (25). This

codon is very rare in Streptomycetes which are known for their high G+C-content. Genes

including such a codon can only be translated if bldA is expressed. This happens commonly in

the static phase of the culture, when aerial hyphae are formed and sporulation is induced (26).

Besides translational regulation, bldA is also reported to play a role in transcriptional regulation

and post translational modification as shown in Figure 5: Proposed effects of bldA. In 2013,

Kalan et al. could show that Streptomyces calvus was the victim of a momentous point mutation

in its bldA gene. This mutation leads to a misfolding of the corresponding t-RNA and thus to a

complete loss of function (1).

Figure 6: Point mutation of bldA (A) bldA sequence from different Streptomyces strains; (B) Secondary structure of mutated

tRNA; (C) Secondary structure of active tRNA; (D) top/bottom: S. calvus with restored bldA function, middle: S. calvus wildtype

with disrupted bldA (1)

Introduction

8

By a complementation experiment, it could be shown, that like reported in Streptomyces

coelicolor, bldA plays also an important role in differentiation and secondary metabolite

production of S. calvus. With a functional bldA

gene, S. calvus showed morphological

differentiation to aerial hyphae and sporulation.

Additionally to Nucleocidin, it started to produce

Annimycin, a polyketide that has no antimicrobial

effect but inhibits sporulation in other

Actinomycetales.

Another example for bod strain is S. asterosporus, which is a close relative of S. calvus. Its

genome is also containing the rare Nucleocidin gene cluster. It has several similar phenotypical

features, such as the inability to form spores or aerial hyphae. However, this incapacity is not

caused by a mutation of bldA but is probably related to adpA, a regulatory gene that belongs to

the same network as bldA. According to Higo et al., adpA is forming a feedback loop with bldA

(26). Mutations in either of these genes caused a bald phenotype in Streptomyces griseus and

suppressed the production of Streptomycin. In Streptomyces coelicolor, these genes also control

the morphological differentiation but do not influence the production of secondary metabolites

(27).

Figure 7: Annimycin

Introduction

9

2.3 Griseorhodin A, a telomerase inhibitor from a marine

Streptomyces strain

Griseorhodin A is a heavily oxidized compound that belongs to the group of structurally related

Rubromycines, which have been known for their vivid red colors since the 60‘s, when

Brockmann isolated a red pigment from Streptomyces collinus (28). Griseorhodin A, like other

Rubromycines, has an aplanar structure caused by their characteristic spiroketal moiety that is

formed by a FAD dependent enzyme. It was first isolated from the marine Streptomyces sp. JP65

(5).

Figure 8: Biosynthesis of Rubromycines (5)

This moiety plays a crucial role in their activity against human telomerase and retroviral reverse

transcriptase (3). Telomerase inhibition is a highly interesting biological property. This enzyme is

a unique ribonucleoprotein that counteracts the small DNA loss that appears during regular

DNA replication. This is done by maintaining the length of dispensable DNA caps at the end of

chromosomes. Telomerase is highly expressed in cancer cells but nearly absent in neighboring

Introduction

10

tissue (29). Although extensive efforts towards the total synthesis of Rubromycines have been

undertaken, the available synthetic routes are still limited and ineffective (30).

Introduction

11

2.4 Genome-scale metabolic modeling and Flux Balance Analysis

Metabolic models are sets of chemical reactions and according reactants that can occur in an

organism. A genome-scale metabolic model is based on the information that can be extracted

from an organism’s genome. Genes and their products are predicted and functionally annotated

to chemical reactions with the help of curated databases such as KEGG or BioCyc (31, 32).

Although this already results in a set of hundreds of reactions and reactants, these drafts still

contain gaps that impede simulations. Additional reactions, such as free diffusion or

spontaneous reactions have to be added and the results of the automated annotations have to

be carefully checked (33). In the past, the reconstruction of a genome-scale metabolic model

was a laborious task that included a lot of manual curation and correction (34). With strongly

increasing number of available sequenced genomes, the quality of automated annotation

databases is rapidly improving (35). Since genome-scale metabolic models are gaining

popularity, there are also automated gap-filling algorithms available (36–38). The latest release

of the RAST server and modelSEED is capable of generating directly usable models within few

hours. Once purged from essential gaps and errors, a model can be used for simulations and is

usually iteratively optimized with experimental data (33)

One of the most popular simulations that can be performed with such a model is constraint-

based Flux Balance Analysis (FBA). Metabolic models are usually stored in lists of

stoichiometrical reactions and reactants with varying amounts of additional information. For

FBA this data has to be translated to a mathematical matrix (S) of m rows (number of reactions)

and n columns (number of reactants). Each field of this matrix contains a value that displays if a

reaction consumes or produces a certain reactant. Consumed substrates are described by

negative numbers. The unit of these numbers is usually mmol per gram dry weight per hour

[mmol/g*h]. A vector (v) contains all reaction fluxes of the network. The upper and lower

boundary of the turnover rate of a reaction is defined in the model. A negative lower boundary

value herby defines a reversible reaction. To be able to simplify the differential reaction

equations to linear ones, the network has to be assumed to be in steady state S x v = 0. With

linear programming, it is possible to find a solution for this equation network where a particular

Introduction

12

objective function has an optimal value, typically biomass generation, energy equivalents or

secondary metabolites.

Figure 9: Overview of the important principles and steps in FBA (39)

This approach is a powerful tool to gain insights in complex metabolic contexts without the

need for detailed knowledge of enzyme kinetics for every reaction (39). Lewis et al. could prove

based on the model organism E. coli that results from FBA, which are typically the

stoichiometrically most efficient solutions with minimized total flux, are consistent with results

from omic analysis for growth optimization. Edwards and Palsson studied the influence of gene

deletions in E.coli on its metabolic capabilities with a genome scale metabolic model. Their

research grants a deeper understanding of essential metabolism in general. They could show,

that a cell can still maintain growth, even if stripped from central metabolic genes except from

glycolysis, tricarboxylic acid cycle and pentose phosphate pathway under defined conditions

(40). Furthermore, FBA simulations use little computational power and can be performed in a

few milliseconds, even on older machines.

Introduction

13

Despite these huge benefits, metabolic simulations have some limitations. Though simulations

assume a steady state, it is hard to predict the effect of changing concentrations. Moreover,

regulatory mechanisms are typically not taken into account, resulting in inaccurate quantitative

predictions.

Introduction

14

2.5 ‘Omics’ data and genome-scale metabolic models

The analysis of ‘omics’-data and the prediction of resulting effects on a cell is still a challenging

task. Extensive changes in a cells metabolic machinery, such as morphological differentiation are

controlled by complex regulatory systems which are often not yet fully understood or even not

known so far (41).

Interpreting metabolomic data with the help of genome-scale metabolic models seems to be

fairly straightforward and is performed successfully, but they also offer the possibility to

incorporate other omics data (42, 43). Transcriptomic and proteomic data grant insights on the

regulation of the metabolic machinery of an organism on different levels, but are commonly not

correlating very well due to posttranslational modifications (44). This makes them particularly

difficult to interpret in context with simulations of conversion rates of the reactions that are

catalyzed by them (45).

However, there are several methods available to approach this problem. Experimental results

can be used to constrain the model and create a specific model for a certain phenotype or

disease. Such a model can be used to organize the gathered data or to compare predicted with

measured results in order to find differences that help to improve the model and provide a

deeper insight in underlying biological processes (46, 47).

Colijn et al. have successfully shown, that Flux Balance Analysis is a powerful tool to predict the

effect of different gene expression levels on the metabolic capabilities of an organism.

Constraints, the possible ranges of turnover rates of a certain reaction in the simulations, are

adjusted according to fold changes of annotated genes. They used this ‘valve’-approach to

analyze transcriptome profiles of M. tuberculosis after being exposed to several inhibitors of its

metabolism.

Introduction

15

Figure 10: E-Flux Method: Gene expression data is used to adjust the constraints of corresponding reactions in the metabolic

network. (left) Gene expression data (Green: lower expression; Red: higher expression); (middle) Metabolic network: Thinner

pipes indicate smaller constraints of reactions; (right) Predicted flux rates (48)

They could successfully identify seven out of eight known inhibitors of mycolic acid pathway and

some putative new drug substances. The resulting predictions can provide a precise

understanding of the effect of differentially expressed genes on the metabolism as a whole (48).

Introduction

16

2.6 Enhanced secondary metabolite production and genome-

scale metabolic models

Although yields of over 90% per unit of energy (e.g. glucose) are possible for the

biotechnological production of simple small molecules such as technical alcohols that come

from primary metabolism, the efficient

production of more complex molecules like

Griseorhodin A is much more complicated (5,

49). Often, the desired products can only be

isolated in low amounts and under tremendous

expenditure. A famous example is Paclitaxel

(50). One gram of this important anti-cancer

drug has to be extracted of the bark of twelve

taxus brevifolia trees, followed by about nine weeks of chemical refinement. A total chemical

synthesis of this product has been established in 1994, but is compared to the conventional

extraction still not lucrative (51). Laborious production and purifications like this are the reason

why total chemical synthesis is still often the preferred way, even though pharmaceutical and

chemical industry are making huge efforts to become independent from finite fossil resources.

Starting with unguided mutagenesis and selection, the development of modern genetic

methods has led to a rapid progress in the last 30 years (52). Though secondary metabolites are

often only produced under special conditions, a common method is the isolation of the

responsive gene cluster followed by the heterologous expression in a better controllable host

strain (2). The most commonly used organisms are Escheria coli and Saccharomyces cerevisiae,

but also Streptomycetes are excellent hosts for heterologous expression of secondary

metabolites. They are able to produce an enormous variety of complex compounds from

renewable resources and stand out from other popular expression systems inter alia by their

ability to express very large gene cluster or by their efficient export systems (53). The group of

Prof. H. Ikeda has developed a genome-optimized mutant of the industrially used strain

Figure 11: Paclitaxel

Introduction

17

Streptomyces avermitilis. These so called SUKA strains have proven superior to many natural

hosts in terms of secondary metabolite production (2).

Figure 12: Genome of S. avermitilis wildtype compared to genome of S. avermitilis SUKA22. white arrows: OriC; black arrows:

rRNA genes; black bar: core region containing essential genes; letters: cutting pattern of endonuclease, modified regions are in

grey. Modified from (2) Copyright (2001) National Academy of Sciences.

By deleting about 1.7 mbp from its chromosomes arms, they could remove several secondary

gene clusters, competing for resources supplied by primary metabolism that are needed for the

generation of heterologous expressed compounds.

Other common methods that are applied in order to increase the outcome of fermentations are

overexpression of the desired products gene cluster or its transcriptional regulator, export

mechanisms, genes that are responsible for supply of precursors or knockouts of degrading

reactions (52).

Cofactors also play an important role in the production of secondary metabolites. They

transport precursors or provide for example the reductive or oxidative energy that is necessary

for enzymatic reactions. Experiments on increasing the supply of cofactors such as NADPH have

shown that influencing the redox balance of a cell can lead to a disproportional loss of

metabolic efficiency or even cell death. There are several emerging techniques available for the

optimization of cofactor supply but the understanding of the effects in the complex and strongly

interlinked metabolism of a cell regulations systems preventing the cell from redox imbalance

damage is still little (54).

The selection of a host for heterologous expression is a common step towards an enhanced

production of a certain secondary metabolite, but is often not guided by the evaluation of its

metabolic capabilities but by the availability and selection of strains that are available in the

researchers’ lab. Flux Balance Analysis can be used to identify the putatively most efficient host

for a certain secondary metabolite. Zakrewski et al. offered with their software MultiMetEval

Introduction

18

that already included a dataset of 38 genome scale metabolic models of Actinomycetales and 15

secondary metabolite synthesis pathways a powerful tool for this problem. Based on Pareto

font calculation, they could show that the correlation between primary and secondary

metabolism can differ significantly between different species (55).

In the last years, metabolic models have been utilized, often in combination with omics data, to

develop and enhance overproduction strategies. Genome-scale simulations offer the

opportunity to rationally guide metabolic engineering. Not only manipulation candidates can be

predicted but also the cellular metabolic response to the predicted manipulations (56). This

offers in-depth insights in the underlying effect of wanted and unwanted phenotypes and helps

to delimit the number of time consuming and expensive lab experiments. Dang et al. could

recently demonstrate the power of predictions based on a genome-scale metabolic model by

enhancing the rapamycin production in S. hygrosopicus. By in silico studies of the effect of

decreased and increased flux values on the product of the ratio of weighted and dimensionless

specific growth rate and specific rapamycin production rate they could identify pfk as knockout

and dahP as well as rapK as overexpression candidates. These manipulations lead to an

increased Rapamycin production rate of 142.3% compared to the wildtype strain (57).

Materials

19

3 Materials

Unless otherwise stated, the location of the supplier is in Germany.

3.1 Chemicals

Substance Item

number

Supplier

Acetonitrile HN44.2 Carl Roth GmbH , Karlsruhe

Agar-Agar 5210.1 Carl Roth GmbH , Karlsruhe

Ammonium sulfate 3746.2 Carl Roth GmbH, Karlsruhe

Antifoam Y30 Y30 A5758 Sigma-Aldrich Chemie GmbH,

Steinheim

Apramycin sulfate salt A2024 Sigma-Aldrich Chemie GmbH,

Steinheim

Bacto maltextract 291650 Becton, Dickinson & Co, Sparks USA

Calcium carbonate P013.1 Carl Roth GmbH , Karlsruhe

Calcium chloride CN93.1 Carl Roth GmbH, Karlsruhe

Calibration buffer pH 4.01 HI7004 Hanna Instruments, Woonsocket

USA

Calibration buffer pH 6.86 HI7006 Hanna Instruments, Woonsocket

USA

CASO bullion X938.2 Carl Roth GmbH , Karlsruhe

EDTA SLBC7327 Sigma-Aldrich Chemie GmbH,

Steinheim

Erythromycin 4166.1 Carl Roth GmbH, Karlsruhe

Ethanol 99,9% P076.1 Carl Roth GmbH, Karlsruhe

Ethyl acetate 6784.3 Carl Roth GmbH , Karlsruhe

Formic acid 4724.1 Carl Roth GmbH , Karlsruhe

D(+)-Glucose HN06.2 Carl Roth GmbH, Karlsruhe

Materials

20

Hydrochloric acid 37% 9277.1 Carl Roth GmbH , Karlsruhe

Iron(II) sulfate heptahydrate 3722.1 Carl Roth GmbH, Karlsruhe

Isopropanol AE73.1 Carl Roth GmbH , Karlsruhe

Dipotassium phosphate P749.1 Carl Roth GmbH, Karlsruhe

Kanamycin sulfate T832.1 Carl Roth GmbH, Karlsruhe

Lab Lemco meatextract LP0029 Oxoid Ltd, Basingstoke UK

L-Lysin hydrochloride 1700.2 Carl Roth GmbH, Karlsruhe

Magnesium chloride hexahydrate HN03.1 Carl Roth GmbH, Karlsruhe

Magnesium sulfate heptahydrate T888.1 Carl Roth GmbH, Karlsruhe

Manganese(II) chloride tetrahydrate 0276.1 Carl Roth GmbH, Karlsruhe

D(-)-Mannit 4175.1 Carl Roth GmbH, Karlsruhe

2-Mercaptoethanol 4227.3 Carl Roth GmbH , Karlsruhe

Methanol AE71.2 Carl Roth GmbH , Karlsruhe

Nalidixic acid CN32.1 Carl Roth GmbH, Karlsruhe

Natrium chloride HN00.1 Carl Roth GmbH, Karlsruhe

Mononatrium hydrogenphosphate

dihydrate

T879.1 Carl Roth GmbH, Karlsruhe

RNAseZAP SCBC9516V Sigma-Aldrich Chemie GmbH,

Steinheim

D(+)-Saccharose 4621.1 Carl Roth GmbH, Karlsruhe

Sodium hydroxide 40% 4347.1 Carl Roth GmbH , Karlsruhe

Sucrose 4621.1 Carl Roth GmbH , Karlsruhe

Soy flour 2004246 W.Schoenenberger, Magstadt

Soluble starch 4701.1 Carl Roth GmbH, Karlsruhe

L-Tryptophan 1739.1 Carl Roth GmbH, Karlsruhe

Zinc sulfate heptahydrate K301.1 Carl Roth GmbH, Karlsruhe

Materials

21

3.2 Equipment

Device Model Supplier

Bioanalyzer G2938L Agilent Technologies, Waldbronn

Bioreactor Bioflo/CelliGen 115 New Brunswick, Eppendorf AG,

Hamburg

Bunsen burner Fireboy plus Integra Biosciences, Fernwald

Centrifuge for centrifuge

tubes

5810R Eppendorf AG, Hamburg

Centrifuge for reaction

vessels

5415R Eppendorf AG, Hamburg

Column holder Guard Column Holder REF

718966

Marcherey-Nagel GmbH & Co. KG,

Düren

DO probe InPro 6800 Mettler Toledo GmbH, Gießen

Erlenmeyer flasks 250ml, with baffle Duran Group, Mainz

Erlenmeyer flasks 500 ml, with baffle Duran Group, Mainz

HPLC 1290 Serie Agilent Technologies, Waldbronn

HPLC column EC 150/2 Nucleodur

100-5 C18 ec REF

760008.20


Düren

Hybridization system Hybridisation System 4 Roche Diagnostics GmbH,

Mannheim

Incubation chamber Galaxy 48R New Brunswick, Eppendorf AG,

Hamburg

Mass spectrometer 6460Triple Quadrupol Agilent Technologies, Waldbronn

Mass spectrometer 6520 Accurate-Mass Q-TOF

LC/MS

Agilent Technologies, Waldbronn

Microarray Scanner G2505C Agilent Technologies, Waldbronn

Nanodrop ND-1000 PeqLab, Erlangen

Materials

22

pH probe 405.DPAS-SC-K8S/225 Mettler Toledo GmbH, Gießen

Pre-column EC 4/2 Nucleodur 100-5

C18 ec REF 761932.20


Düren

Shaking incubator Innova 44 New Brunswick, Eppendorf AG,

Hamburg

SpeedVac 5301 Eppendorf AG, Hamburg

Sterile bench MSC 1.8 Thermo electron LED GmbH,

Langenselbold

Thermomixer compact/comfort Eppendorf AG, Hamburg

Ultrasound clipper Covaris S-Series KBiosciences, Hoddeston UK

Vortex VV3 VWR, Darmstadt

Water filter unit Millipore Billerica, USA

Water cooling FL601 Julabo GmbH, Seelbach

Materials

23

3.3 Consumables

Type Description Supplier

Centrifuge tubes 15 ml N459.1 Carl Roth GmbH, Karlsruhe

Centrifuge tubes 50 ml X063.1 Carl Roth GmbH, Karlsruhe

Disposable syringe 20 ml EP98.1 Carl Roth GmbH, Karlsruhe

Disposable syringe 5 ml EP96.1 Carl Roth GmbH, Karlsruhe

Glass vials 2 ml, clr, WrtOn Agilent Technologies, Waldbronn

Petri dishes ALA5.1 Carl Roth GmbH, Karlsruhe

Pipette tips Standard Universal Carl Roth GmbH, Karlsruhe

Reaction vessel 1.5 ml CH76.1 Carl Roth GmbH, Karlsruhe

Reaction vessel 2 ml CK06.1 Carl Roth GmbH, Karlsruhe

Semi-micro cuvette XK26.1 Carl Roth GmbH, Karlsruhe

Sterile filter 0.22 µm P666.1 Carl Roth GmbH, Karlsruhe

Vial Cap 9 mm blue screw Agilent Technologies, Waldbronn

Vial Insert 250 μl Agilent Technologies, Waldbronn

Materials

24

3.4 Strains

Used strains Sequence available Model

Streptomyces sp. Tü6071

Streptomyces albus J1074

Streptomyces avermitilis K139

Streptomyces avermitilis SUKA22 ()

Strains conjugated with pMP31 Sequence available Model

Streptomyces albus J1074+ pMP31 X

Streptomyces sp. Tü6071 + pMP31 X X

Streptomyces avermitilis K139 + pMP31 X

Streptomyces avermitilis SUKA22 + pMP31 X

Knockout variants of S. avermitilis SUKA22 Sequence available Model

Streptomyces avermitilis SUKA22 Δpta X ()

Streptomyces avermitilis SUKA22 Δkbl X ()

Streptomyces avermitilis SUKA22 ΔfadE4 X ()

Streptomyces avermitilis SUKA22 ΔrocA X ()

Streptomyces avermitilis SUKA22 ΔplcA X ()

Streptomyces avermitilis SUKA22 ΔpyrB X ()

Streptomyces avermitilis SUKA22 Δpta + pMP31 X

Streptomyces avermitilis SUKA22 Δkbl + pMP31 X

Streptomyces avermitilis SUKA22 ΔfadE4 + pMP31 X

Streptomyces avermitilis SUKA22 ΔrocA + pMP31 X

Streptomyces avermitilis SUKA22 ΔplcA + pMP31 X

Streptomyces avermitilis SUKA22 ΔpyrB + pMP31 X

Materials

25

3.5 Programs & Services

Name Version

antiSMASH 3.0

BioCommand 3.30

Bioconductor 3.5

Biopython 1.70

BLAST 2.6.0

Chemdraw 16

ChemStation for LC Systems Rev. B. 03.02 [341]

COBRApy 0.8.2

COBRAtoolbox 2.0.5

DEVA 1.2

FAME 21.10.2013

Gurobi linear solver 7.5

ISfinder 2017.10.03

MassHunter Qualitative Analysis Rev. B.04.00

MatLab R2013a

Mauve 2.4.0

MEGA 6

MultiMetEval 14.12.2012

Origin Pro 9.0

Python 2.7.5

R 3.3.3

RNAmmer 1.2

Materials

26

3.6 Cultivation media

All media were used as liquid and solid media. The solid Media included 1% Agar-Agar.

3.6.1 TSB (Tryptone Soy Broth)

Ingredient Content

CASO-Bullion 30 g

Tab water ad 1000 ml

3.6.2 MS (Mannitol Soy)

Ingredient Content

D(-)-Mannit 20 g

Soy flour 20 g

Magnesium chloride hexahydrate 2 g


3.6.3 SG+ (Soytone Glucose)

Ingredient Content

Glucose 20 g

Soytone 10 g

Calcium carbonat 2 g

L-Valine 1 g

Cobalt chloride 2.34 mg


Materials

27

3.6.4 Minor Element Solution

Ingredient Content

Zinc sulfate heptahydrate 1 g

Iron(II) sulfate heptahydrate 1 g

Manganese(II) chloride tetrahydrate 1 g

Calcium chloride 1 g

Purified water ad 1000 ml

3.6.5 Buffer

Ingredient Content

Dipotassium phosphate 8.7 g

Mononatrium hydrogenphosphate dihydrate 7.8 g


3.6.6 Minimal medium

Ingredient Content

L-Lysine hydrochloride 0.286 g

L-Tryptophan 4.714 g

Ammonium sulfate 2 g

Magnesium sulfate heptahydrate 0.6 g

Carbon source 5 g

Minor Element Soulution 1 ml

Buffer 150 ml


Methods

28

4 Methods

4.1 StreptomeDB

4.1.1 Data collection

The Data was automatically extracted from literature and manually curated according to the

protocol presented in the first StreptomeDB publication (58). Additionally, we put a special

focus on compounds described in the most recent publicly available full texts. This includes

hundreds of compounds that cannot yet be found in popular databases such as PubChem or

Zinc (59).

4.1.2 Genomes, taxonomy and gene clusters

If available, the organisms have been assigned to their taxonomic ID and freely accessible

complete genome sequences from GenBank (60, 61). Compounds were linked to their gene

clusters in DoBISCUT and MiBIG (62, 63).

4.1.3 Generation of the phylogenetic tree

26,201 16S rRNA sequences were collected from ArB silva and ENA (61, 64). They were matched

to the strains represented in StreptomeDB. Sequences with low quality or an untypical length

were omitted. For strains where several sequences were available (in some cases over 150 from

different sources), a consensus sequence was calculated. For this purpose, all sequences

available for one strain were compared with each by BLAST analysis. The sequences with the

most hits were used to calculate a consensus sequence with the dump_consensus function of

Biopython (65). Subsequently, all sequences were aligned with ClustalW (66). The resulting

alignment was analyzed with the MEGA software package using maximum likehood and

maximum parsimony algorithms with 250 bootstrap replications (67). Sub-cluster within 0.07 nt

substitutions per side of the same ancestor node were summarized using DendroPy (68).

Visualization and final editing was performed with the ETE Toolkit (69).

Methods

29

Figure 13: Flowchart phylogenetic tree generation

Methods

30

4.2 AdpA in Streptomyces asterosporus

4.2.1 Detection of transposons

Transposons were detected with the services from ISfinder (70). The resulting Blast analysis was

filtered by length and quality. Only matches with more than 95% sequence identity and a p-

value below 0.005 were considered.

4.2.2 Detection of adpA binding sites

According to Yao et al., the sequence of the adpA binding site is 5’–TGGCSNGWWY-3’ (S is G or

C, W is A or T, Y is T or C, and N is any nucleotide) (71). The genome of S. asterosporus was

screened with the regular expression “TGGC[GC][GCAT]G[AT][AT][TC]” on sense and antisense

strand.

4.2.3 Prediction of adpA regulon

Genes potentially interacting with adpA were extracted from STRING database (72). In addition

to genes that have been found to interact directly with adpA in S. griseus and S. coelicolor, this

also includes the genes of the next level of interaction. This includes all genes that are in direct

relation with those that interact directly with adpA. If these genes are present in the genome of

S. calvus, they were included in the circos diagram (Figure 25).

4.2.4 Potential disrupted genes

In order to find genes that have been disrupted by transposons, the area around the transposon

sequences have been scanned. For this purpose, gene segments of 2500 bp were extracted

before and after the corresponding transposon and joined together without the transposon

sequence. The newly created sequences were blasted against UniProt/SwissProt for hits that

were not found in the original annotation (73).

Methods

31

4.2.5 G+C content and skew

The average G+C content and G+C skew were calculated in chunks of 5000 bp with the following

formulas:

G+C content: 100% ∙𝐺+𝐶

𝐺+𝐶+𝐴+𝑇

G+C skew: 𝐺−𝐶

𝐺+𝐶

4.2.6 Dotplot

The dotplot of the Nucleocidin clusters form S. asterosporus and S. calvus were computed with

the services of YASS with default options (74). The gene clusters were extracted according to

antiSMASH annotations.

Methods

32

4.3 Genome-scale metabolic modeling

For the simulation of the model, several functions from COBRApy were used together with the

Gurobi linear solver2 (75).

4.3.1 Model reconstruction

The models were reconstructed with the services provided by RAST and ModelSEED followed by

manual curation (76, 77). RAST automated annotation was performed with options adapted to

the results of the respective sequencing project with the latest FIGfams available. Errors and

gaps that could not be fixed automatically or were caused by the gap-filling algorithm used

during the automated reconstruction were identified manually or by validation experiments

that will be described later.

4.3.2 Analysis of essential elements

Essential reactions were detected by using the single_reaction_deletion method from COBRApy.

Reactions that lead to a flux value of 0.0 mmol/(g*h) for the objective function were considered

essential for this objective function.

Essential genes were detected by using the single_gene_deletion method from COBRApy. Genes

that lead to a flux value of 0.0 mmol/(g*h) for the objective function were considered essential

for this objective function.

Essential metabolites were detected by stepwise knocking the transport reactions from the

boundary condition out. Metabolites that are transported by reactions whose knockout resulted

in a flux value of 0.0 mmol/(g*h) for the objective function were considered to be essential. For

further analysis, these reactions were enabled again. This approach results in the minimal set of

boundary reactions that is necessary for the objective function.

4.3.3 Correlation plot and area under the curve (AUC)

The correlation curve was calculated for two objective functions in dependence of each other.

First, the optimal flux value of secondary metabolite production was determined. Next, the

2 www.gurobi.com

Methods

33

constraints of the final reaction of secondary metabolite production were both set to this value

to force a fixed turnover rate. Then the model was iteratively optimized on growth rate, while

the fixed turnover rate of secondary metabolite production was stepwise reduced to zero. The

area under the resulting curve was calculated according to Simpson’s rule (78). The relative AUC

was calculated in relation to the area stretched between the two maximal values.

4.3.4 Knockout candidate identification

Essential genes were identified by single gene deletion simulation. For this, the constraints of all

reactions annotated to a certain gene are set to zero. If the following optimization on biomass

production resulted in a value equal or close to zero, the respective gene was considered to be

essential. These genes were no longer taken into account.

Knockout simulations were evaluated by their influence on the generation of biomass and

correlation of biomass and secondary metabolite production. For this, the area under the curve

(AUC) of a correlation plot was used. Knockouts resulting in a significantly reduced production

rate of the secondary metabolite were also omitted.

Reactions from primary metabolism that are directly competing for resources with secondary

metabolite production were identified by comparing their flux values during the different steps

of the correlation plot generation. Reactions showing a decreased flux value while secondary

metabolite production is increased were considered to be competing for resources. The found

candidates were classified according to the precursors needed for secondary metabolite

synthesis. In addition, two candidates were determined on the basis of single knockout

correlation analysis only, independent of precursors. Finally, all knockout candidates were

screened for suitability. The more exclusively reactions are annotated to the knockout candidate

the better.

Methods

34

Figure 14: Flowchart knockout candidate identification

4.3.5 Integration of transcriptomic data

In order to integrate gene expression data into the model, a Flux Balance Analysis was carried

out first. The production of biomass was optimized without any experimental data. The

resulting flux rates of all reactions have been used as references. For every gene that was

significantly regulated and annotated in the model, the reference value of all corresponding

reactions was multiplied with the corresponding fold change. The boundary values of these

reactions were changed accordingly. For reactions with more than one annotated gene, the

average of fold change of all annotated genes was used. The resulting model was compared to

the reference model with the help of correlation plots as described above.

Methods

35

4.3.6 Generation of functional subsets

Gene expression data was analyzed with the help of the metabolic model. The model was

individually optimized for the production of Nucleocidin, Annimycin and biomass. For each

simulation, a subset of genes was compiled, based on whether their reactions showed a flux

rate other than zero. Additional subsets were created by subtracting reactions involved in the

generation of biomass from the reaction sets needed for secondary metabolite production. This

resulted in a set of genes from primary metabolism that are required exclusively for the

production of precursors of secondary metabolites.

4.3.7 Minimal medium prediction and validation

The composition of the minimum medium should not consist of more than two amino acids, a

carbon source and a defined salt composition. Tryptophan and lysine were chosen as amino

acids, starch, glucose, sucrose, and mannitol were tested as carbon sources. The salt solution

and the ratio between nitrogen to carbon sources were chosen according to the minimum

medium composition from Hopwood et al. (79). The simulation of the media was carried out by

manipulating the exchange reactions between defined model space and boundary condition.

The constraints of reactions transporting substances that were not necessary to grow the strains

in vitro were set to zero. In case no growth could be simulated, the model was revised

accordingly. The flux rates of lysine and tryptophan uptake in the simulation of cell growth were

used to determine the optimal ratio of the two amino acids for the minimal medium.

Methods

36

4.4 Strain cultivation

4.4.1 Preculture

Inoculation was done with a cropped 1 ml pipet tips that was pricked in frozen sucrose

permanent culture. The tip was then ejected into the prepared medium. All strains were

cultivated for 48 h in 50 ml Tryptone Soy Both (TSB) in 250 ml Erlenmeyer flasks with baffle at

28 °C and 180 rpm in a shaking incubator.

4.4.2 Permanent culture

The respective culture was cultivated for 48 h in 50 ml TSB at 28 °C and 180 rpm in 250 ml

Erlenmeyer flasks with baffle in a shaking incubator. The cells were transferred into 50 ml

centrifuge tubes and centrifuged for 10 min at 5000 rpm. The supernatant was discarded. The

pellet was redispersed in 10 ml of 20% sucrose solution, transferred to a 15 ml centrifuge tube

and stored at -20 °C.

4.4.3 Agar cultures

The sterile petri dishes were poured with freshly autoclaved medium under a sterile bench and

stored there until they solidified. Depending on the antibiotic used, it was added to the medium

before autoclaving, or the plate was subsequently treated with a sterile filtered solution. The

cultures were applied to the agar by dilution smear or with a Drigalski spatula. Plates were

cultivated at 28 °C and stored at 6 °C for up to three months.

4.4.4 Liquid culture in Erlenmeyer flasks

Strains were cultivated in 500 ml Erlenmeyer flasks with baffle and a stainless steel spiral.

200 ml cultivation medium was inoculated with 1 ml pre-culture. They were incubated at 28 °C

and 180 rpm in a shaking incubator.

4.4.5 Dry weight measurement

3 ml (2 ml +1 ml) of the bacterial culture were centrifuged for 10 minutes at maximal speed in a

tared 2 ml reaction flask. The supernatant was discarded. Pellets were tree times washed with

1 ml of purified water. Remaining pellets were dried in a SpeedVac overnight at 30 °C.

Methods

37

4.4.6 Sample preparation for HPLC-MS

1 ml of frozen bacterial culture was thawed, mixed with a standard solution and acidified with

150 µl of 0.5 M HCl. Cells were mechanically broken with 300 mg of glass beads in a ball mill at

room temperature. The resulting solution was three times extracted with 0.5 ml of ethyl

acetate. The organic phase was collected and dried overnight in a SpeedVac at room

temperature.

Dried samples were either stored at -20 °C or dissolved in MeOH/H2O (50/50), if necessary in an

ultrasound bath. The resulting solution was centrifuged for 10 minutes at maximal speed before

it was transferred to an injection vial with insert.

4.4.7 HPLC measurement

HPLC measurement was performed on an Agilent 1100series with a Nucleodur 100-5 C18 RP

(150 mm x 2 mm) column from Marcherey-Nagel, including a pre-column (4 mm x 2 mm) of the

same material. A gradient from 90:10 H2O/ACN (0.1% formic acid) to 02:98 H2O/ACN (0.1%

formic acid) was used with a flowrate of 0.2 ml/min.

Table 1: HPLC gradient pattern

Time [min] Gradient [H2O/ACN]

00 90:10

03 90:10

23 02:98

26 02:98

28 90:10

35 90:10

MS measurement was performed with an Agilent 6520 Accurate Mass Q-TOF LC/MS in positive

mode with a collision energy of 10, 20 and 40 V. Additionally, an UV Diode Array Detector was

uses at wave lengths of 235, 245, 310, 366 and 488 nm.

Results

38

5 Results

5.1 StreptomeDB update

StreptomeDB is currently the largest collection of metabolites reported to be produced in

Streptomycetes (58). With its advanced features, it allows for navigation through this vast

amount of structures in many scientific contexts. In 2015, we introduced this comprehensive

update(80).

5.2 Data Overview

Table 2: Content overview of StreptomeDB in 2013 and 2015

Release 2013 2015

No. of Compounds 2,444 4,040

No. of Organisms 1,985 2,584

No. of Synthesis Pathways 9 12

No. of Activities 579 905

No. of References 4,544 6,717

The update of StreptomeDB significantly increased its content. The number of included

compounds was nearly doubled to about 4,000 structures. The number auf annotated host

organisms, different synthesis pathways, and biological activities could be supplemented

accordingly. Additionally, further background information and several new features were

implemented:

- Interactive phylogenetic tree

- Scaffold-based navigation and search system

- Gene cluster information

Results

39

- Predicted MS data

- Predicted NMR data

- Physicochemical search options

This new data includes 4,485 different scaffolds, 6,717 compound-organism relationships, 1,945

MS-spectra in positive mode with 10, 20, and 40 eV, 3,989 NMR-spectra, 251 gene clusters, and

390 complete genomes. For sake of computational time, the MS-spectra prediction was limited

to compounds with 30 or less heavy atoms.

Figure 15: Coverage of papers reviewed for StreptomeDB

Even though Streptomycetes are examined since the beginning of the 20th century, Figure 15

shows that the number of publications concerning this genus is still increasing every year by

Results

40

more than 15 publications per year in average. For the update, we focused especially on the

most recent publications until October 2014.

Figure 16: Distribution of biological activities among annotated compounds in StreptomeDB

In total, 905 different activities could be annotated to 2,011 compounds. These activities were

extracted as they were reported in the corresponding publications. This includes very specific

activities such as the inhibition of a certain enzyme to very unspecific activities like toxicity.

Some of the activities, such as ‘toxic’ and ‘cytotoxic’ could be summarized but others such as the

inhibition of a certain enzyme could not be subsumed under a more general description. The

most common activity of the reported compounds in StreptomeDB is antibiosis with 666

substances. 160 of them were reported to have anticancer activity, 96 to be antifungal and 70

to be cytotoxic. All other activities have less than 10 representatives.

49,67%

33,21%

7,92%

4,83%

3,48%0,46% 0,43%

others - 49,67%

antibiotic - 33,21%

anticancer - 7,92%

antifungal - 4,83%

cytotoxic - 3,48%

antiviral - 0,46%

antiparasitic - 0,43%

Results

41

Figure 17: Distribution of synthesis pathways among annotated compounds in StreptomeDB

690 compounds could be annotated to 12 different synthesis pathways. These pathways were

only included if they were reported in the corresponding publications. They also include hybrid

pathways such as, NRPS-PKS hybrid cluster, etc. With more than two-thirds of all compounds

with reported synthesis pathway, polyketides are by far the biggest group. With only about 13%,

nonribosomal peptides are the second most common compound class. Terpenes, which are

actually a typical compound class produces by plants, are on third place. This supports the

reputation of Streptomycetes as remarkable efficient producers of this type of structures (81).

69,08%

13,41%

9,99%

4,65%

1,50%1,37%

PKS - 69,08%

NRPS - 13,41%

terpene - 9,99%

hybrid PKS/NRPS - 4,65%

shikimate - 1,50%

others - 1,37%

Results

42

Figure 18: Molecular weight distribution of StreptomeDB compared to DrugBank (82)

The distribution of the molecular weight of StreptomeDB's substances differs from that of the

DrugBank. While most of the compounds in DrugBank have a molecular weight between

100 and 500 g/mol, this distribution in StreptomeDB is shifted in the direction of the high-

molecular-weight substances, with a particularly high proportion of those over 1,000 g/mol.

0

5

10

15

20

25

30

<100 100-200 200-300 300-400 400-500 500-600 600-700 700-800 800-900 900-1000 >1000

Nu

mb

er o

f m

ole

cule

s [%

]

Molecular weight [g/mol]

Molecular weight distribution: StreptomeDB vs. DrugBank

DrugBank StreptomeDB

Results

43

5.3 Phylogenetic tree

A new feature of StreptomeDB is a comprehensive phylogenetic tree based on 16S rRNA

sequences. In total, 26,201 unique rRNA sequences were available on the two largest databases

for 16S rRNA: ArbSilva and ENA (64, 83). This also includes results from metagenomics analysis,

old ones with meanwhile changed designation and sequences from rather doubtful origin and

quality. After mapping the sequences to the strains reported in StreptomeDB, their number

could be reduced to 12,807. This still included sequences with low quality or wrong annotation.

After thorough filtering for coverage and length, the dataset was reduced to about 5% of its

initial size. Considering the up to six 16S rRNA (40) sequences present in a Streptomyces

genome, and in some cases several similar sequencing attempts for one sequence, which

cannot be distinguished in quality or length, a single consensus sequence was generated for

each strain. For a better overview and because of the relatively short evolutionary distance,

more than 1200 sub strains are represented by 319 parent stains, e.g. clicking on ‘Streptomyces

griseus’ leads to 128 compounds produced either by S. griseus itself or one of its 58 sub-strains,

such as S. griseus or S. griseus var. psychrophilus.

Table 3: Number of rRNA sequences for each filtering step

“High Quality” Streptomyces 16S rRNA

sequences from ArbSilva and ENA

26,201 (270 MB)

Sequences from Organisms in StreptomeDB 12,807 (133 MB)

Sequences with correct length

(1400-1600 bp)

1,240 (4.5 MB)

Consensus sequences 319

Results

44

Figure 19: (left) Visualization of the phylogenetic tree; (top right) compound coverage of the phylogenetic tree; (bottom right)

organism coverage of the phylogenetic tree

Any list of compounds can be visualized in the phylogenetic tree. Strains that are able to

produce one of the compounds in the list are highlighted in green. This provides the possibility

to visualize the distribution of a certain scaffold, chemotype, compound, bioactivity or synthetic

route in an evolutionary context.

Clicking on a single strain leads to a list of all known compounds produced by it and its sub-

strains. Closely related organisms are marked by different background colors. On the right of the

tree is a minimap for easier navigation that includes al features of the original tree. Though the

tree only comprises 319 parent organisms, it covers about two-thirds of all compounds in

Results

45

StreptomeDB. For a more detailed analysis, the whole tree is also available for download in

Newick-format.

5.3.1 NMR and MS prediction

Figure 20: Visualization of predicted mass fragmentation products

MS fragments were predicted with CFM-ID (84). The predicted MS-spectra are visualized in the

compounds cards of the corresponding compounds. The five most common masses are shown

with the predicted structure for each fragmentation energy level.

Results

46

NMR spectra were calculated with the NMR predictor

tool as implemented in cxcalc3. Predicted NMR spectra

are visualized as a plain list of signals including

multiplicity and chemical shift. 1H as well as 13C signals

were included.

3 www.chemaxon.com

Figure 21: Visualization of NMR data

Results

47

5.3.2 Scaffold browser

Figure 22: Screenshot of scaffold browser of StreptomeDB

The scaffolds were generated by means of their scaffold framework by Xavier Lucas (85). The

scaffold navigation system was implemented as a clickable map of scaffold pictures. The

scaffolds are organized in levels, where each level represents the number of rings or ring

systems that are attached by aliphatic bridges or heteroatoms. It is possible to choose a number

of scaffolds of interest and jump to different scaffold levels. The number of compounds

including the chosen scaffolds is calculated live and a list of these compounds can be opened

any time.

Results

48

Table 4: Number of scaffolds per scaffold-level

Lvl 0 1 2 3 4 5 6 7 8 9

No. of

scaffolds

1,032 732 672 559 469 314 213 137 96 91

Results

49

5.4 bldA function in Streptomyces calvus

As shown in chapter 2.2, bldA plays an important role in the morphological differentiation and

induction of secondary metabolism. To study the effect of bldA, Stefanie Hackl complemented a

functional bldA gene in Streptomyces calvus. As a dualomic approach, gene expression and

proteomic data were collected for both strains in late stationary phase after 48h of cultivation

at 28 °C in SG+ liquid medium. The main aim was to analyze the pleiotropic effect of bldA on the

transcriptional and posttranslational stage of expression. The manuscript of this project is

currently in preparation.

5.4.1 Metabolic model

The genome-scale metabolic model was generated from a whole genome sequence of

Streptomyces calvus with the services provided by ModelSeed followed by careful manual

revision (33). It consists of 1,659 reactions and 1,589 metabolites, which are annotated to

1,533 genes from primary metabolism and was generated in SBML format. Due to its automated

reconstruction, it includes mainly genes form primary metabolism. Genes belonging to rare

pathways, like Annimycin and Nucleocidin synthesis, were included manually.

A reaction encoding the synthesis of Nucleocidin was reconstructed based on the work of Zhu et

al.:

Adenosine + Fluoride + PAPS + NAD + amino acid → PAP + α-Ketoacid + NADH + Nucleocidin

(86).

Though the biosynthesis of this compound is not yet fully resolved, the fluorination mechanism

was assumed to be a SN-type reaction where a proton and two electrons are abstracted NAD

dependently. The amino acid used by the aminotransferase is also not yet identified. Therefore,

the reaction was designed for all amino acids that are available in the model and whose

respective products can be processed without adding additional reactions.

Results

50

A reaction encoding the biosynthesis of Annimycin was reconstructed based on the work of

Kalan et al.:

Succinyl-CoA + Glycin + 5 NADPH + 2 ATP + Propionyl-CoA + Metylmalonyl-CoA + 4 Malonyl-CoA

→ 4 H2O + 5 NADP + 6 CO2 + HSCoA + 2 AMP + 2 PPi + Annimycin

Both synthesis pathways were reconstructed as single reactions and added to the model,

including an exchange reacting to boundary condition.

5.4.2 Gene expression

The transcription of 6,862 genes in S. calvus could be measured by next generation sequencing.

The data of the strain with the complemented bldA gene had a fivefold less read coverage and

strong forward strand bias that made rather strong normalization necessary. This, together with

the absence of replicates made a significance test based filtering not suitable. Alternatively, the

data was filtered by fold change and with the help of predicted subsets from the genome-scale

metabolic model.

Results

51

Figure 23: Overview of gene transcription levels (87)

An active bldA gene does not only affect the 114 genes including a TTA codon. Under its

influence, the expression of more than 19% of all genes in the genome is more than fourfold

changed. 10% (669 genes) are up- and 9% (617) are downregulated. This underlines its role as

pleiotropic regulator.

For a more detailed analysis, the genes that were stronger than fourfold regulated were

clustered in orthologous groups (COG).

≥4 fold up10%

≥4 fold down9%

no effect81%

Results

52

Figure 24: Gene regulation in COGs (87)

The groups “cell cycle control”, “cell wall, membrane and envelope biogenesis”, “signal

transduction”, and “replication and repair” are predominantly upregulated. It is probable that

they are related to the incipient sporulation.

From most groups from primary metabolism, the majority of genes is downregulated. This

effect can be observed most strongly for carbohydrate metabolism, presumably as a reaction to

the decreasing sugar concentration in the nutrient medium after 48 h. In addition, amino acid

production, energy production and conversion, lipid metabolism, and nucleotide metabolism

are mainly downregulated. Surprisingly, in this overview a bigger portion of genes from

0

20

40

60

80

100

120

Gen

enu

mb

ero

fC

OG

Cat

ego

ries

Upregulated bldA Downregulated bldA

Results

53

secondary metabolism is downregulated. However, this only includes 33 genes and the majority

of secondary metabolite genes is not represented in this analysis. All 12 genes from Annimycin

and all 36 genes from Nucleocidin cluster are strongly up and all 20 genes from WS9326A

cluster are strongly down regulated. This is also consistent with the phenotypical observations.

Besides Nucleocidin, Annimycin, and WS9326A cluster, no other secondary metabolite gene

cluster seems to be explicitly regulated by the presence of bldA. It can also be seen, that the

presence of a TTA codon does not necessarily lead to an increased transcription when bldA is

expressed. Nucleocidin cluster contain several TTA codons, but Annimycin cluster does not

contain any. They show the highest density in the subtelomeric regions, but are also present in

the whole core region of the genome.

Results

54

Figure 25: Overview of transcriptomic and proteomic data: (A) secondary gene cluster predicted by antiSMASH; (B) location of

TTA codons; (C) gene expression levels, more than two fold upregulated genes with active bldA are highlighted in blue, more than

two fold downregulated ones in orange; (D) overproduced proteins, (red) primary metabolism, (blue) transcriptional and

translational regulators, (yellow) peptidases; the links in the meddle lead from Annimycin (blue), Nucleocidin (orange) and

WS9326A (grey) cluster to found overproduced proteins that are needed for their production

Results

55

The genome-scale metabolic model can be optimized on the production of a certain metabolite

or growth. Reactions that do not participate in the outcome of the objective function show no

flux rate in this simulations. This yielded subsets of reactions or more specifically genes that are

necessary for most efficient production of said metabolite. Though most precursors for growth

and secondary metabolite production are derived from primary metabolism, there are big

intersections. Still some reactions from primary metabolism are exclusively needed for the

production of precursors for secondary metabolism.

Table 5: Predicted functional subsets

Model Average fold change

All (primary metabolism) 1533 -0.49

Annimycin 458 -0.51

Annimycin(ex) 199 -0.37

Nucleocidin 523 -0.40

Nucleocidin(ex) 205 -0.33

Growth 762 -0.33

Results

56

Nucleocidin production, for which in total 523 reactions are needed for maximal production

possible in this network, can utilize

205 reactions that are not needed for

optimal growth. Annimycin

production can be supported by

458 reactions of which 199 are not

strictly needed for growth. The

762 reactions annotated to biomass

generation are in this case not the

minimal set of essential reactions, but

the reactions that allow for optimal

growth speed. The average fold

change of genes in these subgroups

does not significantly differ from the average of all genes that are annotated in the model. Also

the density of different fold changes within the subgroups is similar normal distributed. The fold

changes in all subgroups range from -10 to 10. Surprisingly, the genes that are exclusively

needed for the generation of Nucleocidin and Annimycin have the highest ratio of genes that

are not differentially regulated with functional bldA.

Figure 26: Fold change density in subsystems

Results

57

Figure 27: Overview of genes in predicted subsystems: (top left) all upregulated; (top right) all more than two fold upregulated;

(bottom left) all downregulated; (bottom right) all more than two fold down regulated

The ratio of genes in the different subsets did not significantly differ when filtered for up and

down regulated genes, no matter if they were additionally filtered by fold change or not. These

Results

58

results elucidate, that even though Nucleocidin and Annimycin production are induced under

the influence of bldA, there is no indication for a metabolic switch in primary metabolism on a

transcriptional level that would support a redirection of energy towards secondary metabolite

production.

The plots in Figure 28 were generated from the metabolic model considering the effect of bldA

on gene expression. The curves marked as ‘wildtype’ were generated without any constraints,

except removing reactions catalyzed by genes that did not show any expression.

Figure 28: Correlation curves of biomass vs. secondary metabolite production (top) Annimycin; (bottom) Nucleocidin;. (left)

"wildtype" unconstrained simulations; (right) Fold changes under bldA expression included

Results

59

For the simulations of the influence of bldA, the fold changes of strongly regulated genes

(|log2(fc)| > 2) were used to adjust the boundary values of reactions corresponding to the

calculated flux rates from the wildtype simulation. This was done for the correlation of biomass

and Nucleocidin as well as for biomass and Annimycin production. In both cases, the

correlations plots show a strongly decreased maximal biomass production rate, but also an

increased relative area under the curve (AUC) compared to the unconstrained wildtype

simulation. These plots have been calculated according to chapter 0.

Table 6: Key values from correlation plots of bldA effect on energy distribution. All units except AUC are in [mmol/(g*h)]

Max Biomass Min Biomass Max production AUC [%]

WT Annimycin 330.9 0.0 750.0 85.2

bldA Annimycin 15.3 0.0 750.0 99.6

WT Nucleocidin 330.9 0.0 1000.0 92.1

bldA Nucleocidin 15.3 0.0 1000.0 98.5

In both cases, the inclusion of gene expression data lead to a strong decrease of growth rate

without impairing the maximal production value of the respective secondary metabolite. The

relative AUC values are in both cases increased under the influence of bldA.

A thorough analysis showed that the reactions rxn00102 (KEGG reaction R10092), rxn00898

(KEGG reaction R01209), rxn03437 (KEGG reaction R05070) and rxn01015 (KEGG reaction

R01394) were mainly responsible for this result. Rxn01015 had by far the strongest effect and

can be annotated to a single essential gene: SCLAV_01143, a hydroxypyruvate isomerase (EC

5.3.1.22). It is catalyzing the reaction from hydroxypyruvate to 2-hydroxy-3-oxopropanoate.

Results

60

Figure 29: Glyoxylate and Dicarboxylate metabolism from KEGG. Reaction catalyzed by hydroxypyruvate isomerase is marked.

Figure 30: Hydroxypyruvate isomerase reaction overview from KEGG (31)

By splitting up the condensed biomass reaction in the model, it could be shown that this

reaction plays an important role in the precursor supply of folate metabolism, which is needed

for the synthesis of RNA and DNA (88).

Results

61

5.4.3 Protein production

Proteomic data was limited to the intracellular fraction within a pH range of 4 to 7 and protein

masses between 6.5 and 112.0 kDa. The proteins were collected under the same conditions as

the transcriptomic data. 34 protein spots on a 2D gel could be identified to be overproduced

under the influence of bldA.

Table 7: Overproduced proteins with active bldA in Streptomyces calvus (complemented from (87))

Spot

number

S. calvus

Amino

acids

Predicted

function

Homologue,

origin

Identity

[%]

Accession Fold[a]

change

mRNA

Subsystem

Central carbon metabolism

9 SCALV05263

507 MULTISPECIES: glucose-6-phosphate dehydrogenase

Streptomyces 98/99

WP_030820913.1

1.0 A, N, G

17 SCALV06768

331 glyceraldehyde-3-phosphate dehydrogenase

S. longwoodensis

92/96 KUN40924.1|

3.6 A, N

21 SCALV00992

281 inositol monophosphatase

S. ghanaensis 87/93 WP_004981222.1

0.9 -

22 SCALV05543

228 phosphoglycerate mutase

Streptomyces sp. MMG1533

95/97 WP_053752811.1

0.6 -

Aamino acid metabolism

5 SCALV02968

617 dihydroxy-acid dehydratase

S. aureofaciens

97/98 WP_052843504.1

0.5 N, G

http://www.ncbi.nlm.nih.gov/protein/664291360?report=genbank&log$=protalign&blast_rank=1&RID=SYCRVB3P016

http://www.ncbi.nlm.nih.gov/protein/664291360?report=genbank&log$=protalign&blast_rank=1&RID=SYCRVB3P016

http://www.ncbi.nlm.nih.gov/protein/973871314?report=genbank&log$=protalign&blast_rank=1&RID=SYEPTPAR016

http://www.ncbi.nlm.nih.gov/protein/973871314?report=genbank&log$=protalign&blast_rank=1&RID=SYEPTPAR016

http://www.ncbi.nlm.nih.gov/protein/491122782?report=genbank&log$=protalign&blast_rank=1&RID=SYFMVGP3013

http://www.ncbi.nlm.nih.gov/protein/491122782?report=genbank&log$=protalign&blast_rank=1&RID=SYFMVGP3013

http://www.ncbi.nlm.nih.gov/protein/926425752?report=genbank&log$=protalign&blast_rank=1&RID=SYH307F7016

http://www.ncbi.nlm.nih.gov/protein/926425752?report=genbank&log$=protalign&blast_rank=1&RID=SYH307F7016

http://www.ncbi.nlm.nih.gov/protein/919512219?report=genbank&log$=protalign&blast_rank=1&RID=SYBE76W2016

http://www.ncbi.nlm.nih.gov/protein/919512219?report=genbank&log$=protalign&blast_rank=1&RID=SYBE76W2016

Results

62

Spot

number

S. calvus

Amino

acids

Predicted

function

Homologue,

origin

Identity

[%]

Accession Fold[a]

change

mRNA

Subsystem

11 SCALV_04972

469 type I glutamate--ammonia ligase

Streptomyces sp. NRRL S-813

97/99 WP_030169313.1

1.7 A, N, G

12 SCALV02565

483 serine hydroxymethyltransferase

S. leeuwenhoekii

97/99 WP_029381239.1

1.6 N, G

11 SCALV01791

444 aspartate aminotransferase family protein


98/99 WP_030866013.1

0.3 N

Other primary metabolismus

3 SCALV03268

790 alkaline phosphatase

S. atroolivaceus

83/89 WP_033303075.1

0.6 N, G

6 SCALV04354

647 acetyl/propionyl-CoA carboxylase subuit alpha

Streptomyces sp. CC71

91/94 WP_062139544.1

0.5 N, G

10 SCALV01922

543 1-pyrroline-5-carboxylate dehydrogenase

Streptomyces sp. NRRL B-24085

93/96 WP_053849348.1

0.2 A, N, G

14 SCALV00196

340 alcohol dehydrogenase

S. rochei 94/95 ALV82389.1|

0.5 A, N, G

15 SCALV02589

294 succinate-CoA ligase subunit alpha

S. toyocaensis 99/99 WP_037932967.1

1.2 A, N

18 SCALV03337

334 2-oxoisovalerate


99/99 WP_030856943.1

0.9 N, G

http://www.ncbi.nlm.nih.gov/protein/663128439?report=genbank&log$=protalign&blast_rank=1&RID=SYD75N6J016

http://www.ncbi.nlm.nih.gov/protein/663128439?report=genbank&log$=protalign&blast_rank=1&RID=SYD75N6J016

http://www.ncbi.nlm.nih.gov/protein/657295069?report=genbank&log$=protalign&blast_rank=1&RID=SYDG0TME013

http://www.ncbi.nlm.nih.gov/protein/657295069?report=genbank&log$=protalign&blast_rank=1&RID=SYDG0TME013

http://www.ncbi.nlm.nih.gov/protein/664337823?report=genbank&log$=protalign&blast_rank=1&RID=SYX1V774014

http://www.ncbi.nlm.nih.gov/protein/664337823?report=genbank&log$=protalign&blast_rank=1&RID=SYX1V774014

http://www.ncbi.nlm.nih.gov/protein/702901602?report=genbank&log$=protalign&blast_rank=2&RID=SYAYCA2S013

http://www.ncbi.nlm.nih.gov/protein/702901602?report=genbank&log$=protalign&blast_rank=2&RID=SYAYCA2S013

http://www.ncbi.nlm.nih.gov/protein/1011219810?report=genbank&log$=protalign&blast_rank=1&RID=SYBNE10M013

http://www.ncbi.nlm.nih.gov/protein/1011219810?report=genbank&log$=protalign&blast_rank=1&RID=SYBNE10M013

http://www.ncbi.nlm.nih.gov/protein/927897087?report=genbank&log$=protalign&blast_rank=1&RID=SYCZCN88013

http://www.ncbi.nlm.nih.gov/protein/927897087?report=genbank&log$=protalign&blast_rank=1&RID=SYCZCN88013

http://www.ncbi.nlm.nih.gov/protein/971533854?report=genbank&log$=protalign&blast_rank=1&RID=SYE106KD013

http://www.ncbi.nlm.nih.gov/protein/971533854?report=genbank&log$=protalign&blast_rank=1&RID=SYE106KD013

http://www.ncbi.nlm.nih.gov/protein/740083561?report=genbank&log$=protalign&blast_rank=1&RID=SYE7XS2D016

http://www.ncbi.nlm.nih.gov/protein/740083561?report=genbank&log$=protalign&blast_rank=1&RID=SYE7XS2D016

http://www.ncbi.nlm.nih.gov/protein/664328704?report=genbank&log$=protalign&blast_rank=1&RID=SYF10H90013

http://www.ncbi.nlm.nih.gov/protein/664328704?report=genbank&log$=protalign&blast_rank=1&RID=SYF10H90013

Results

63

Spot

number

S. calvus

Amino

acids

Predicted

function

Homologue,

origin

Identity

[%]

Accession Fold[a]

change

mRNA

Subsystem

dehydrogenase

23 SCALV02628

374 guanosine monophosphate reductase


0.6 N

26 SCALV05618

286 3-hydroxybutyryl-CoA dehydrogenase

Streptomyces sp. NRRL WC-3626

98/98 WP_030225233.1

0.6 A, N

27 SCALV00211

879 bifunctional acetaldehyde-CoA/alcohol dehydrogenase

Streptomyces sp. CCM_MD2014

95/97 WP_061441054.1

0.6 A, N, G

30 SCALV03565

355 phosphoribosylformylglycinamidine cyclo-ligase


97/98 WP_030872125.1

2.9 -

32 SCALV02925

331 delta-aminolevulinic acid dehydratase

S. griseoflavus 97/98 WP_004927902.1

1.2 N, G

33 SCALV02590

392 succinate-CoA ligase subunit beta


1.3 A, N

Proteases

1 SCALV04494

1100 serine protease


0.03 -

http://www.ncbi.nlm.nih.gov/protein/740090226?report=genbank&log$=protalign&blast_rank=1&RID=SYRFUE46013

http://www.ncbi.nlm.nih.gov/protein/740090226?report=genbank&log$=protalign&blast_rank=1&RID=SYRFUE46013

http://www.ncbi.nlm.nih.gov/protein/663187089?report=genbank&log$=protalign&blast_rank=1&RID=SYSDVPBK013

http://www.ncbi.nlm.nih.gov/protein/663187089?report=genbank&log$=protalign&blast_rank=1&RID=SYSDVPBK013

http://www.ncbi.nlm.nih.gov/protein/1002996782?report=genbank&log$=protalign&blast_rank=1&RID=SYST25AP013

http://www.ncbi.nlm.nih.gov/protein/1002996782?report=genbank&log$=protalign&blast_rank=1&RID=SYST25AP013

http://www.ncbi.nlm.nih.gov/protein/664344026?report=genbank&log$=protalign&blast_rank=1&RID=SYUDVB8P013

http://www.ncbi.nlm.nih.gov/protein/664344026?report=genbank&log$=protalign&blast_rank=1&RID=SYUDVB8P013

http://www.ncbi.nlm.nih.gov/protein/491066271?report=genbank&log$=protalign&blast_rank=1&RID=SYUYMYEK01R

http://www.ncbi.nlm.nih.gov/protein/491066271?report=genbank&log$=protalign&blast_rank=1&RID=SYUYMYEK01R

http://www.ncbi.nlm.nih.gov/protein/740083564?report=genbank&log$=protalign&blast_rank=1&RID=SYV4Y0GR015

http://www.ncbi.nlm.nih.gov/protein/740083564?report=genbank&log$=protalign&blast_rank=1&RID=SYV4Y0GR015

http://www.ncbi.nlm.nih.gov/protein/491069689?report=genbank&log$=protalign&blast_rank=1&RID=SYAAUNNS016

http://www.ncbi.nlm.nih.gov/protein/491069689?report=genbank&log$=protalign&blast_rank=1&RID=SYAAUNNS016

Results

64

Spot

number

S. calvus

Amino

acids

Predicted

function

Homologue,

origin

Identity

[%]

Accession Fold[a]

change

mRNA

Subsystem

2 SCALV04598

1085 peptidase S41

S. ghanaensis 94/96 WP_004988295.1

2.0 -

16 SCALV00605

476 peptidase S8 S. viridosporus 88/92 WP_016823609.1

1.3 -

31 SCALV04617

360 peptidase M4


8.6 -

Secondary Metabolism

25 SCALV00122

273 5'-methylthioadenosine phosphorylase

Kitasatospora griseola

70/77 WP_043915700.1

151.4 N

Transcription

20 SCALV02194

468 Crp/Fnr family transcriptional regulator

Streptomyces sp. XY152

97/98 WP_053637103.1

3.6 -

28 SCALV01290

429 AraC family transcriptional regulator

S. albulus ZPM 58/69 AKA07479.1

6.9 -

Translation

4 SCALV01772

566 proline--tRNA ligase

Streptomyces sp. NRRL WC-3626

93/97 WP_030216781.1

1.2 -

http://www.ncbi.nlm.nih.gov/protein/491129867?report=genbank&log$=protalign&blast_rank=1&RID=SYANJW50016

http://www.ncbi.nlm.nih.gov/protein/491129867?report=genbank&log$=protalign&blast_rank=1&RID=SYANJW50016

http://www.ncbi.nlm.nih.gov/protein/515243942?report=genbank&log$=protalign&blast_rank=1&RID=SYEDBPFT013

http://www.ncbi.nlm.nih.gov/protein/515243942?report=genbank&log$=protalign&blast_rank=1&RID=SYEDBPFT013

http://www.ncbi.nlm.nih.gov/protein/491069907?report=genbank&log$=protalign&blast_rank=1&RID=SYURNCA8013

http://www.ncbi.nlm.nih.gov/protein/491069907?report=genbank&log$=protalign&blast_rank=1&RID=SYURNCA8013

http://www.ncbi.nlm.nih.gov/protein/763031550?report=genbank&log$=protalign&blast_rank=2&RID=SYS1P24Y013

http://www.ncbi.nlm.nih.gov/protein/763031550?report=genbank&log$=protalign&blast_rank=2&RID=SYS1P24Y013

http://www.ncbi.nlm.nih.gov/protein/926301409?report=genbank&log$=protalign&blast_rank=1&RID=SYFDZ4D6016

http://www.ncbi.nlm.nih.gov/protein/926301409?report=genbank&log$=protalign&blast_rank=1&RID=SYFDZ4D6016

http://www.ncbi.nlm.nih.gov/protein/785761691?report=genbank&log$=protalign&blast_rank=5&RID=SYT6WMCH013

http://www.ncbi.nlm.nih.gov/protein/785761691?report=genbank&log$=protalign&blast_rank=5&RID=SYT6WMCH013

http://www.ncbi.nlm.nih.gov/protein/663178189?report=genbank&log$=protalign&blast_rank=1&RID=SYB7107V016

http://www.ncbi.nlm.nih.gov/protein/663178189?report=genbank&log$=protalign&blast_rank=1&RID=SYB7107V016

Results

65

Spot

number

S. calvus

Amino

acids

Predicted

function

Homologue,

origin

Identity

[%]

Accession Fold[a]

change

mRNA

Subsystem

7 SCALV05690

658 MULTISPECIES: threonine--tRNA ligase

Streptomyces 98/99 WP_031022786.1

0.8 -

8 SCALV01726

739 polyribonucleotide nucleotidyltransferase

S. davawensis 99/99 WP_015657482.1

1.6 -

29 SCALV02704

139 MULTISPECIES: 50S ribosomal protein L23


1.1 -

Unknown function

13 SCALV04369

324 Lipoprotein S. afghaniensis 71/78 WP_037669879.1

0.5 -

19 SCALV02874

306 MULTISPECIES: hypothetical protein


4.2 -

24 SCALV02068

312 cellulose-binding protein

S. toyocaensis 99/100 WP_037935996.1

1.4 -

18 of the 33 overproduced proteins also show an increased transcription level while bldA is

expressed. One of the proteins belongs to Nucleocidin synthesis cluster. Three of the

overproduced proteins that are allocated to primary metabolism are not included in one of the

predicted subgroups from model simulations. The role of bldA regulon in posttranslational

modification is confirmed by the overproduced peptidases from S8, S41 and M4 family.

http://www.ncbi.nlm.nih.gov/protein/664504114?report=genbank&log$=protalign&blast_rank=1&RID=SYC7RB86013

http://www.ncbi.nlm.nih.gov/protein/664504114?report=genbank&log$=protalign&blast_rank=1&RID=SYC7RB86013

http://www.ncbi.nlm.nih.gov/protein/505470711?report=genbank&log$=protalign&blast_rank=1&RID=SYCF7K43013

http://www.ncbi.nlm.nih.gov/protein/505470711?report=genbank&log$=protalign&blast_rank=1&RID=SYCF7K43013

http://www.ncbi.nlm.nih.gov/protein/491126095?report=genbank&log$=protalign&blast_rank=1&RID=SYTFA518016

http://www.ncbi.nlm.nih.gov/protein/491126095?report=genbank&log$=protalign&blast_rank=1&RID=SYTFA518016

http://www.ncbi.nlm.nih.gov/protein/739818161?report=genbank&log$=protalign&blast_rank=2&RID=SYDRJZM8013

http://www.ncbi.nlm.nih.gov/protein/739818161?report=genbank&log$=protalign&blast_rank=2&RID=SYDRJZM8013

http://www.ncbi.nlm.nih.gov/protein/490072113?report=genbank&log$=protalign&blast_rank=1&RID=SYF755FR016

http://www.ncbi.nlm.nih.gov/protein/490072113?report=genbank&log$=protalign&blast_rank=1&RID=SYF755FR016

http://www.ncbi.nlm.nih.gov/protein/740086622?report=genbank&log$=protalign&blast_rank=1&RID=SYRUBWM5013

http://www.ncbi.nlm.nih.gov/protein/740086622?report=genbank&log$=protalign&blast_rank=1&RID=SYRUBWM5013

Results

66

Figure 31: Overview of proteins related to primary metabolism organized by predicted functional subsets

Analyzing the overproduced proteins from primary metabolism shows a different picture as

results from gene expression. All of them can be allocated to the production of Nucleocidin.

17% are exclusively need for Nucleocidin production. 61% are intersecting with Biomass

production, 50% are intersecting with Annimycin production, and about 50% of Annimycin and

Biomass production are intersecting themselves. This includes proteins that catalyze central

reactions from carbohydrate and amino acid metabolism, but also rather unspecific enzymes

such as alcohol dehydrogenases. This indicates in contrast to results from transcriptomic

analysis, that under the influence of bldA the protein machinery is optimized towards the

production of Nucleocidin. Manipulating the constraints of reactions catalyzed by the

overproduced proteins did not lead to significant outcome.

Results

67

5.5 Role of adpA function in Streptomyces asterosporus

5.5.1 Genome Overview

Streptomyces asterosporus has, like Streptomyces calvus, a bald phenotype. Songya Zhang

discovered that a mobile genetic element disrupted the promotor site of adpA. This lead to a

complete loss of function. By a complementation experiment, he could restore morphological

differentiation and sporulation in this strain. The genome of Streptomyces asterosporus was

sequenced by Songya Zhang and is publicly available at GenBank (accession number: CP022310).

The manuscript of this project is currently in preparation.

Figure 32: Overview of Streptomyces asterosporus genome properties: (A) Secondary Metabolite gene clusters predicted by

antiSMASH in green, rRNA genes (5S, 16S, 23S) in red; (B) G+C content in 5000 bp chunks; (C) G+C-skew in 5000 bp chunks; (D)

transposons

Streptomyces asterosporus

7,766,581 bp

Results

68

The genome of Streptomyces asterosporus has a size of 7,766,581 bp and a for Streptomycetes

typical high G+C content of 72.5%. It varies between about 60% and 80%. As expected, the G+C

skew inverts around the predicted origin of replication. AntiSMASH analysis revealed

28 potential secondary metabolite gene clusters that are mainly located in the subtelomeric

arm regions of the linear chromosome, including WS9326A, Annimycin, and the rare

Nucleocidin cluster. ISfinder detected 40 mobile genetic elements all over the chromosome.

Table 8: Predicted rRNA genes in S. asterosporus (89)

Type Start Stop Strand

16S 4059127 4060640 +

16S 6203585 6205098 +

16S 4702858 4704371 +

5S 4064178 4064293 +

5S 6208652 6208767 +

5S 4707896 4708011 +

23S 4060971 4064091 +

23S 6205425 6208545 +

23S 4704689 4707809 +

RNAmmer analysis detected three rRNA clusters at about 4.1, 4.7, and 6.2 mbp, including 5S,

16S and 23S rRNA. The number of rRNAs is varying in Streptomyces strains between one and

seven (89).

Results

69

Figure 33: Dotplot of Nucleocidin cluster in S. calvus and S. asterosporus (74)

Comparison of the Nucleocidin gene cluster in S. calvus and S. asterosporus showed a high

similarity of over 99%. However, Nucleocidin production could not be detected in S.

asterosporus, even with functional adpA gene.

Results

70

Figure 34: Phylogenetic tree based on 16S rRNA

Phylogenetic analysis of 16S rRNA showed a close relationship between S. asterosporus, S. albus

and S. coelicolor. The resolution of this approach was not big enough to show a difference

between S. asterosporus and S. calvus.

5.5.2 Putative disrupted genes

Table 9: Mobile genetic elements in S. asterosporus predicted by ISfinder (70)

ID Type Start Stop Frame Strand Lenght

fig|285570.5.peg.1032 CDS 1235325 1236185 3 + 861

fig|285570.5.peg.1126 CDS 1339922 1340884 2 + 963

fig|285570.5.peg.1195 CDS 1416513 1417373 3 + 861

fig|285570.5.peg.1601 CDS 1956404 1956829 2 + 426

fig|285570.5.peg.1602 CDS 1956913 1957314 1 + 402

fig|285570.5.peg.2155 CDS 2611844 2612863 2 + 1020

fig|285570.5.peg.2200 CDS 2655416 2655961 2 + 546

fig|285570.5.peg.2361 CDS 2833284 2832322 -3 - 963

fig|285570.5.peg.2395 CDS 2867522 2868484 2 + 963

fig|285570.5.peg.2586 CDS 3093068 3092208 -2 - 861

Results

71


fig|285570.5.peg.2718 CDS 3235851 3236678 3 + 828

fig|285570.5.peg.2947 CDS 3470482 3471443 1 + 963

fig|285570.5.peg.2964 CDS 3495093 3496055 3 + 963

fig|285570.5.peg.3306 CDS 3870372 3869471 -3 - 903

fig|285570.5.peg.3307 CDS 3870653 3871672 2 + 1020

fig|285570.5.peg.3595 CDS 4181286 4180933 -3 - 354

fig|285570.5.peg.3596 CDS 4181765 4181382 -2 - 384

fig|285570.5.peg.3871 CDS 4465045 4465905 1 + 861

fig|285570.5.peg.4010 CDS 4625855 4624995 -2 - 861

fig|285570.5.peg.4276 CDS 4942488 4943332 3 + 846

fig|285570.5.peg.4391 CDS 5076438 5075594 -3 - 846

fig|285570.5.peg.4864 CDS 5645697 5645903 3 + 207

fig|285570.5.peg.4866 CDS 5646854 5646420 -2 - 435

fig|285570.5.peg.4867 CDS 5647225 5646872 -1 - 354

fig|285570.5.peg.5060 CDS 5859265 5860227 1 + 963

fig|285570.5.peg.5078 CDS 5879758 5879357 -1 - 402

fig|285570.5.peg.5079 CDS 5880267 5879842 -3 - 426

fig|285570.5.peg.5482 CDS 6319305 6320165 3 + 861

fig|285570.5.peg.5496 CDS 6333650 6334612 2 + 963

fig|285570.5.peg.5566 CDS 6413825 6412998 -2 - 828

Results

72


fig|285570.5.peg.5568 CDS 6414172 6415134 1 + 963

fig|285570.5.peg.5736 CDS 6592778 6593740 2 + 963

fig|285570.5.peg.5967 CDS 6874932 6874384 -3 - 549

fig|285570.5.peg.5970 CDS 6876516 6877471 3 + 957

fig|285570.5.peg.6011 CDS 6923918 6923058 -2 - 861

fig|285570.5.peg.6133 CDS 7050344 7049853 -2 - 492

fig|285570.5.peg.6260 CDS 7195642 7195241 -1 - 402

fig|285570.5.peg.6261 CDS 7196151 7195726 -3 - 426

fig|285570.5.peg.6272 CDS 7212914 7212159 -2 - 756

fig|285570.5.peg.6684 CDS 7757712 7758642 3 + 927

The Analysis of the bordering regions of the transposons showed, that they did not split any

genes. Only three genes are located in such close neighborhood that their promoter region may

have been disturbed: The transcriptional regulator YdeE, a Tagatose-6-phosphat kinase and a

zinc carboxypeptidase.

Results

73

5.5.3 Proteomic analysis

Figure 35: Proteomic results and adpA regulon. Regulatory genes that are known to interact with adpA in S. coelicolor, S. griseus

and S. avermitilis are labeled on the outside (big: direct interactors; small: secondary interactors); (A) coverage of proteomic data

(B) predicted adpA binding site on sense (blue) and antisense strand (red)according to(CITE); (C) Proteomic results, statistically

significant results are colored in blue and red; (D) predicted secondary metabolite clusters from antiSMASH; (E) G+C-content in

5000 bp chunks; bands in the middle show interaction in the adpA regulon. Red lines start from direct interactors of adpA, grey

ones from secondary interactors

Genome analysis revealed more than 800 potential adpA binding sites all over the chromosome.

A detailed list can be found in the supplementary data. Proteomic analysis covered nearly 20%

Results

74

(1260) of all genes, with about 50 proteins, which were expressed differentially under the

influence of a complemented adpA gene. Many of the interactors of adpA reported from S.

griseus and S. coelicolor care also present in S. asterosporus. The manuscript of this project is

currently in preparation.

Results

75

5.6 Enhancement of Griseorhodin A production in Streptomyces

avermitilis SUKA22

5.6.1 Metabolic model

The genome-scale metabolic model was generated from the whole genome sequence of

Streptomyces avermitilis MA-4680 (GenBank ID: 1040) with the services provided by ModelSeed

followed by careful manual revision (33). It includes 1,721 metabolites and 1,635 reactions,

which are annotated to 1,221 genes from primary metabolism. According to the regions

removed during the creation of the SUKA22 strain, reactions catalyzed by the products of genes

from these regions were deleted.

Figure 36: Griseorhodin A synthesis

Results

76

Griseorhodin A synthesis was reconstructed according to the work of Yunt et al. (3). It was

subdivided in five steps in order to be able to analyze the ratio of all produced intermediates

that could be detected experimentally so far. These steps are summarized in the metabolic

model as follows:

1: Acetyl-CoA + 12 Malonyl-CoA Polyketide backbone chain (PKBC) +12 HSCoA + 11 CO2

2: PKBC + 2 NADPH Intermediate 1 + 6 H2O + 2 NADP

3: Intermediate 1 + SAM + 4 O2 + 4 NADPH +2 NAD Collinone + SAH + 4 H2O + 4 NADP + 2

NADH

4: Collinone + 3 FADH + 3 O2 -> Intermediate 2 + 3 FAD + CO2 + 3 H2O

5: Intermediate 2 + FADH + O2 + 2 NADPH -> Griseorhodin A + FAD + CO2 + 2 NADP + H2O

Additionally, an exchange reaction for Griseorhodin A to the boundary condition was added.

5.6.2 Minimal-medium validation experiment

For model validation, a minimal medium composition was generated in silico and tested in vitro.

The formula should consist of one carbon source, two amino acids and a defined inorganic salt

composition. For metal ions are mostly used as co-substrates in transport reactions or as

coordination center in proteins, they are not properly represented in the model. Therefore, an

established salt solution from Hopwood et al. was used (79). Since it has, according to Barton et

al., the highest synthesis cost, tryptophan was chosen as one of the amino acids (90). The

second one used was Lysine. Model simulations predicted a ratio of 16.5/1 (Tryptophan/Lysine)

of optimal growth. Sven Enderle as part of his diploma thesis tested glucose, mannitol, sucrose

and starch as possible carbon sources on solid medium. S. avermitilis, S. avermitilis SUKA22 and

S. avermitilis SUKA22 pMP31 grew well on all different media. All strains grew more densely on

mannitol and starch. Glucose medium showed a yellow discoloration, presumably cause by

products of Maillard reaction that are known to have a negative impact on bacterial growth.

Adding sterile filtered glucose after autoclaving lead to an increased growth rate.

Results

77

SUKA22 pMP31 did not show any red color indicating Griseorhodin A production. The strongest

coloring was visible on mannitol medium. In contrast to all other cultures, SUKA22 pMP31 did

not visibly sporulate on mannitol medium. Since no expression of Griseorhodin A was

detectable on sucrose medium, it was considered unsuitable and was not used for further

experiments. All of the following pictures of reaction equations were extracted from KEGG (31).

Minimal medium simulations revealed several error in the model indicated by false essential

nutrients.

Trehalose

Trehalose or mycose is a disaccharide formed of two glucose units linked by an α,α-1,1-

glucoside bond. Detailed analysis of the model revealed that trehalose was the only source of D-

glucose-1-phosphate. Though growing experiments with glucose as only carbon source were

successful, the according transmutation reaction from D-glucose-6-phosphate to D-glucose-1-

phosphate was added:

Spermidine

Spermidine is derived from putrescine and 4-aminobutanal. For both precursors can be

produced in the model, the subsequent reactions has been added:

Results

78

D-Arabinose

Arabinose is an aldopentose. L-arabinose is common in polysaccharides of vegetable origin such

as pectin. D-Arabinose can be found in polysaccharides produced by bacteria such as bacillus

tuberculosis’ surface polysaccharide. In the model, it is needed for the first step of the linear

synthesis pathway of core-oligosaccharide-lipid A. They are highly diverse, even within the

strains of the same species, and are typically found in gram-negative bacteria. Since the

experimental data show that D-arabinose is not an essential metabolite, its conversion from

ribulose was added to the model.

Thiamin

Thiamin or vitamin B1 is involved as a coenzyme in the catabolism of sugars in all organisms.

Experimental results show that it is not an essential metabolite for S. avermitilis. His presence is

only an on/off criterion for the simulation of cell growth, and his synthesis has virtually no

influence on the production rate. Therefore, the thiamine synthesis was not reconstructed and

the actually incorrect uptake of thiamine in the simulations was ignored.

Meso-2,6-Diaminopimelate

Diaminopimelate is a precursor of lysine and a component of peptidoglycan. In bacteria, it is a

product of aspartate metabolism. The model contained a gap between L-Aspartate-4-

semialdehyde and Tetrahydrodipicolinate. Additionally, the conversion from

Tetrahydrodipicolat to N-Succinyl-L-2-amino-6-oxopimelate was restricted in the wrong

direction.

Results

79

Ocdca

Octadecanoic or stearic acid is a common fatty acid in every lifeform. This false essential

nutrient revealed several gaps in the fatty acid metabolism, starting with the impotence to

transfer Acetyl and Malonyl from Coenzyme A to Acyl Carrier Protein. Since the intermediate

reactions do not seem to play a role, the synthesis of Octadecanoic acid from Acetyl-CoA and

Malonyl-CoA was modelled in a single reaction.


The following data is the result of the knockout candidate analysis. It is sorted by genes

annotated to reactions competing for certain Griseorhodin A precursors, e.g. all genes

annotated to reactions competing for Malonyl-CoA. Additionally, there is a set of genes were

only predicted by the AUC value of their knockout simulation and have a strong effect on

biomass, but not on secondary metabolite production. The tables show all detected possible

candidates. The ones that were tested in vitro were chosen based on the resulting AUC and

their experimental feasibility. Candidates that had a significantly negative influence on the

production of Griseorhodin A where omitted. The overview tables are followed by detailed

information about the corresponding reactions and additional information from the

Streptomyces avermitilis genome project (91). Roman Makitrynskyy from the group of Andreas

Results

80

Bechthold performed the proposed knockouts and cultivated the strains on solid medium. All of

the following reaction equations and pathway maps were extracted from KEGG (31). To the best

of our knowledge, the herein predicted knockouts have not been proposed for the

overproduction of secondary metabolites.

Production simulation in wildtype and engineered organism:

Table 10: Comparison of correlation simulation of Griseorhodin A and biomass production in wildtype and SUKA22. All units are

in [mmol/(g*h)] except area under the curve (AUC) which was calculated according to Chapter 4.3.3.

Model max prod. AUC (abs.) max biomass min biomass AUC [%]

S. avermitilis 76.98 17739.64 233.48 224.17 98.70

S. avermitilis SUKA22 76.97 14952.84 207.05 176.86 93.83

Even though S. avermitilis SUKA22 is known to be faster growing and a more efficient

heterologous producer of secondary metabolites than its wildtype, simulations imply that it is

exactly the other way around. The maximal flux rate of biomass production in the wildtype

model is about 13% higher, and the production of Griseorhodin A has a lower effect on it,

indicated by the higher AUC value. The experimental data on S. avermitilis SUKA22's speed of

growth in comparison to the wild type shows, that this prediction is wrong (2).

Results

81

NADPH:

Table 11: Knockout candidates competing for NADPH. All units are in [mmol/(g*h)] except area under the curve (AUC) which was

calculated according to Chapter 4.3.3.

Gene max prod. AUC (abs.) max biom. min biom. AUC [%]

peg.4449 76.97 14952.84 207.05 176.86 93.83

peg.6180 76.97 14952.84 207.05 176.86 93.83

peg.4943 76.97 14952.84 207.05 176.86 93.83

peg.6156 76.97 14952.84 207.05 176.86 93.83

peg.2513 76.97 14952.84 207.05 176.86 93.83

peg.4370 76.97 13671.15 198.23 157.04 94.66

peg.3495 76.97 13671.15 198.23 157.04 94.66

peg.338 76.97 14952.84 207.05 176.86 93.83

peg.5776 76.97 14952.84 207.05 176.86 93.83

peg.1152 76.97 14952.84 207.05 176.86 93.83

peg.3189 76.97 14952.84 207.05 176.86 93.83

peg.413 76.97 14952.84 207.05 176.86 93.83

peg.1763 76.97 14952.84 207.05 176.86 93.83

peg.4644 76.97 14952.84 207.05 176.86 93.83

peg.4645 76.97 14952.84 207.05 176.86 93.83

NADPH is the most common reactive power equivalent in biosynthesis. It is required for eight

reduction steps in the synthesis of Griseorhodin A. Removing the reaction related to the genes

peg.4370 and peg.3495 leads to an increased AUC value. That both modifications lead to the

Results

82

same value shows, that they are either annotated to the same reaction, or that they are part of

a linear series of reactions.

Unfortunately, none of the genes catalyzing a reaction competing for NADPH was suitable for a

knockout. They were either catalyzing unspecific reactions that were also annotated to several

other gene products or catalyzing essential reactions.

Results

83

Malonyl-CoA:

Table 12: Knockout candidates competing for Malonyl-CoA. All units are in [mmol/(g*h)] except area under the curve (AUC)

which was calculated according to Chapter 4.3.3. Candidates tested in vitro are labeled in bold.


peg.5340 76.97 14952.84 207.05 176.86 93.83

peg.5879 76.97 14690.05 202.42 176.86 94.29

Knockout candidate:

pyrB "Aspartate carbamoyltransferase (EC 2.1.3.2)":

start: 8190239

stop: 8191216

frame: 2

strand: +

length: 978 bp

function: Aspartate carbamoyltransferase (EC 2.1.3.2)

subsystem: De Novo Pyrimidine Synthesis

db_xref: GO:0004070

Malonyl-CoA is the main building block of the carbon backbone in polyketide as well as in fatty

acid synthesis.

In the metabolic model, this gene is annotated to three different functions:

Results

84

Carbamoyl-phosphate: L-aspartate carbamoyltransferase (EC 2.1:3.2)

Figure 37: pyrB in pyrimidine metabolism

The formation of N-carbamoyl-aspartate is an important reaction in one of two possible

pathways for the generation of pyrimidines. The gene catalyzing this reaction is reported to be

essential in other bacterial strains (92).

Results

85

(S)-2-Methyl-3-oxopropanoyl-CoA: pyruvate carboxyltransferase (EC 2.1:3.1)

Malonyl-CoA: pyruvate carboxytransferase (EC 2.1:3.1)

Simulations showed that the conversion of Malonyl-CoA to Acetyl-CoA catalyzed by Malonyl-

CoA: pyruvate carboxytransferase is a promising target for a knockout experiment. This is the

counter-reaction of Acetyl-CoA carboxylation, a vital precursor reaction of fatty acid and

polyketide metabolism. Acetyl-CoA is a typical starter unit of polyketide synthesis. Both,

Malonyl-CoA and Acetyl-CoA are needed for Griseorhodin A synthesis, but since twelve times

more Malonyl-CoA is required, it is likely that rather the supply of this precursor is a bottleneck.

Results

86

Figure 38: pyrB in propanoate metabolism

Results

87

Acetyl-CoA:

Table 13: Knockout candidates competing for Acetyl-CoA. All units except AUC are in [mmol/(g*h)]. Candidates tested in vitro are

labeled in bold.


peg.1274 76.97 14952.84 207.05 176.86 93.83

peg.439 76.97 14485.20 201.10 173.59 93.58

peg.4580 76.97 14952.84 207.05 176.86 93.83

peg.3638 76.97 14952.84 207.05 176.86 93.83

peg.1273 76.97 14952.84 207.05 176.86 93.83

peg.838 76.97 14952.84 207.05 176.86 93.83

peg.2202 76.97 14952.84 207.05 176.86 93.83

peg.4705 76.97 14952.84 207.05 176.86 93.83

peg.4300 76.97 14952.84 207.05 176.86 93.83

peg.882 76.97 14952.84 207.05 176.86 93.83

peg.1139 76.97 14952.84 207.05 176.86 93.83

peg.5777 76.97 14952.84 207.05 176.86 93.83

peg.1695 76.97 14946.68 205.99 176.86 94.27

peg.2754 76.97 14952.84 207.05 176.86 93.83

peg.1203 76.97 14952.84 207.05 176.86 93.83

peg.1567 76.97 14952.84 207.05 176.86 93.83

Results

88

Knockout candidates:

kbl - "Aspartate carbamoyltransferase (EC 2.1.3.2)":

start: 1996960

stop: 1995767

frame: -1

strand: -

length: 1194 bp

function: 2-amino-3-ketobutyrate coenzyme A ligase (EC 2.3.1.29)

subsystem: Glycine Biosynthesis; Glycine and Serine Utilization

db_xref: GO:0008890

translation:"MFDSVRDDLRTTLDEIRTAGLHKPERVIGTPQSATVSVTAGGRPGEVLNFCANNYLGLADHPE

VIAAAHEALDRWGYGMASVRFICGTQEVHKELERRLSAFLGQEDTILYSSCFDANGGVFETLLGAEDAVISDA

LNHASIIDGIRLSKARRFRYANRDMADLERQLKEASGARRRLIVTDGVFSMDGYVAPLREICDLADRYDAMV

MVDDSHAVGFVGPGGRGTPELHGVMDRVDIITGTLGKALGGASGGYVAARAEIVALLRQRSRPYLFSNTLAP

VIAAASLKVLDLLESADDLRVRLAENTALFRSRMTEEGFDILPGDHAIAPVMIGDAAVAGRLAELLLERGVYVIG

FSYPVVPQGQARIRVQLSAAHSTDDVNRAVDAFVSARAELEA"

Results

89

Acetyl-CoA: glycine C-acetyltransferase (EC 2.3.1.29)

Glycine C-acetyltransferase is converting glycine and Acetyl-CoA, which is the starting molecule

of Griseorhodin A synthesis, to L-2-Amino-3-oxobutanoic acid. This is an early step from glycine

towards the synthesis of several other biogenic amino acids, such as Threonine.

Results

90

Figure 39: kbl in glycine, serine and threonine metabolism

pta - "Phosphate acetyltransferase (EC 2.3.1.8)":

start: 3467325

stop: 3469415

frame: 3

strand: +

length: 2091 bp

Results

91

function: Phosphate acetyltransferase (EC 2.3.1.8)

subsystem: Fermentations: Lactate; Fermentations: Mixed acid; Pyruvate metabolism II: acetyl-

CoA, acetogenesis from pyruvate

db_xref: GO:0008959

Pta is the second knockout candidate related to Acetyl-CoA. The model shows a slight increase

in the AUC value after removing all reactions annotated to this gene. It is annotated to four

different reactions in the model that can be organized in two groups.

Palmitoyl-CoA retinol O-acyltransferase (EC 2.3.1.76)

Acyl-CoA retinol O-acyltransferase (EC 2.3.1.76)

Results

92

Figure 40: pta in retinol metabolism

Different families of Retinol O-acyltransferases are found very widespread in bacteria, but also

in eukaryotes and viruses. They play a role in Vitamin A storage or mobilization, and generation

of different chromophores in eukaryotes. It is suspected that they participate in the maturation

of lipoproteins in viral capsids. In bacteria, structurally related proteins are responsible for

hydrolyzations of peptidoglycan bounds that are necessary for reorganization of the cell wall

during vegetative growth (93).

Results

93

Acetyl-CoA phosphate acetyltransferase (EC 2.3.1.8)

Results

94

Figure 41: pta in pyruvate metabolism

Results

95

Propanoyl-CoA phosphate propanoyltransferase (EC 2.3.1.8)

Figure 42: pta in propanoate metabolism

The generation of ATP from acetyl or propionyl phosphate are the last steps in Lactic acid

fermentation. Even though these reactions are generating ATP, they are also consuming

precursors that are needed for Griseorhodin A production.

Results

96

FAD:

Table 14: Knockout candidates competing for FAD. All units except AUC are in [mmol/(g*h)]. Candidates tested in vitro are

labeled in bold.


peg.3800 76.97 14952.84 207.05 176.86 93.83

peg.4242 76.97 14952.17 206.87 176.86 93.90

peg.3804 76.97 14952.84 207.05 176.86 93.83

peg.406 76.97 14952.17 206.87 176.86 93.90

Knockout candidate:

fadE4 - "Isovaleryl-CoA dehydrogenase (EC 1.3.99.10)"

start: 6391645

stop: 6390485

frame: -1

strand: -

length: 1161 bp

function: Isovaleryl-CoA dehydrogenase (EC 1.3.99.10) new: (EC 1.3.8.4)

subsystem: HMG CoA Synthesis; Leucine Degradation and HMG-CoA Metabolism

db_xref: GO:0008470

FAD is a common electron donator in eukaryotic and prokaryotic metabolism. Unlike NAD it is

also able to transfer single electrons. It is the central cofactor in the generation of spiroketal

moiety of Griseorhodin A. This knockout candidate is annotated to two FAD dependent

reactions in the model.

Results

97

Butanoyl-CoA oxygen 2-oxidoreductase (EC 1.3.3.6)

Figure 43: fadE4 in fatty acid degradation

Results

98

3-methylbutanoyl-CoA (acceptor) 2,3-oxidoreductase(EC 1.3.8.4)

Results

99

Figure 44: fadE4 in valine, leucine and isoleucine degradation

Although these reactions only differ in one methyl group, they can be assigned to two different

metabolic pathways. Butanoyl oxidation is part beta-oxidation in fatty acid. In the simulations,

however, the effect of knocking out the other reaction is predominate. It is an early stage of

Results

100

valine, leucine and isoleucine biosynthesis, and besides FAD it is also utilizing precursors of

terpene biosynthesis.

Results

101

NAD:

Table 15: Knockout candidates competing for NAD. All units except AUC are in [mmol/(g*h)]. Candidates tested in vitro are

labeled in bold.


peg.3710 76.97 14952.84 207.05 176.86 93.83

peg.4030 76.97 14952.84 207.05 176.86 93.83

peg.6425 76.97 14857.18 205.71 175.77 93.83

peg.756 76.97 14952.84 207.05 176.86 93.83

peg.863 76.97 14952.84 207.05 176.86 93.83

peg.2309 76.9 14952.84 207.05 176.86 93.83

peg.5635 76.97 14947.39 207.05 176.80 93.79

peg.5636 76.97 14952.84 207.05 176.86 93.83

peg.349 76.97 14952.84 207.05 176.86 93.83

peg.608 76.97 14952.84 207.05 176.86 93.83

peg.2765 76.97 14952.84 207.05 176.86 93.83

peg.3382 76.97 14952.84 207.05 176.86 93.83

peg.3669 76.97 14952.84 207.05 176.86 93.83

peg.1641 76.97 14952.84 207.05 176.86 93.83

peg.5296 76.96 13381.25 186.34 157.04 93.31

peg.5527 76.97 14952.84 207.05 176.86 93.83

peg.4839 76.97 14952.84 207.05 176.86 93.83

peg.3495 76.96 13671.75 198.23 157.04 89.62

Results

102


peg.3979 76.97 14952.84 207.05 176.86 93.83

peg.5358 76.97 14952.84 207.05 176.86 93.83

peg.5605 76.97 14952.84 207.05 176.86 93.83

peg.2313 76.97 14952.43 206.99 176.86 93,85

peg.2311 76.97 14952.84 207.05 176.86 93.83

peg.440 76.96 14485.20 201.10 173.59 93.59

peg.449 76.97 14952.84 207.05 176.86 93.83

peg.4858 76.97 14952.84 207.05 176.86 93.83

peg.4199 76.97 14952.84 207.05 176.86 93.83

peg.6156 76.97 14952.84 207.05 176.86 93.83

peg.5514 76.97 14952.84 207.05 176.86 93.83

peg.1846 76.97 14952.84 207.05 176.86 93.83

peg.433 76.97 14952.84 207.05 176.86 93.83

peg.2185 76.97 14952.84 207.05 176.86 93.83

peg.4840 76.97 14952.84 207.05 176.86 93.83

peg.434 76.97 14952.84 207.05 176.86 93.83

peg.590 76.97 14546.08 196.70 176.86 96.08

peg.823 76.97 14952.84 207.05 176.86 93.83

peg.3041 76.97 14952.84 207.05 176.86 93.83

peg.931 76.97 14952.84 207.05 176.86 93.83

Results

103


peg.1780 76.97 14952.84 207.05 176.86 93.83

peg.6006 76.97 14952.84 207.05 176.86 93.83

peg.3593 76.97 14952.84 207.05 176.86 93.83

peg.4176 76.97 14952.84 207.05 176.86 93.83

peg.659 76.97 14952.84 207.05 176.86 93.83

peg.1601 76.97 14952.84 207.05 176.86 93.83

peg.222 76.97 14952.84 207.05 176.86 93.83

peg.1541 76.97 14952.84 207.05 176.86 93.83

peg.5901 76.97 14952.84 207.05 176.86 93.83

peg.4370 76.96 13671.15 198.23 157.04 94.66

peg.6176 76.97 14952.84 207.05 176.86 93.83

peg.1837 76.97 14952.84 207.05 176.86 93.83

peg.2063 76.97 14952.84 207.05 176.86 93.83

peg.2219 76.97 14952.84 207.05 176.86 93.83

peg.346 76.96 14857.18 205.71 175.77 93.85

peg.318 76.97 14952.84 207.05 176.86 93.83

peg.1201 76.97 14952.84 207.05 176.86 93.83

peg.5964 76.97 14952.84 207.05 176.86 93.83

peg.1446 76.97 14952.84 207.05 176.86 93.83

peg.5966 76.97 14952.84 207.05 176.86 93.83

Results

104


peg.6194 76.97 14952.84 207.05 176.86 93.83

peg.2688 76.97 14952.84 207.05 176.86 93.83

peg.4305 76.97 14952.84 207.05 176.86 93.83

peg.2289 76.97 14952.84 207.05 176.86 93.83

peg.2726 76.97 14952.84 207.05 176.86 93.83

peg.154 76.97 14952.84 207.05 176.86 93.83

peg.6282 76.97 14546.08 196.67 176.86 96.09

peg.1119 76.97 14952.84 207.05 176.86 93.83

peg.777 76.97 14952.84 207.05 176.86 93.83

peg.1595 76.97 14952.84 207.05 176.86 93.83

peg.6454 76.97 14952.84 207.05 176.86 93.83

peg.2237 76.97 14952.84 207.05 176.86 93.83

peg.5149 76.97 14952.84 207.05 176.86 93.83

peg.6202 76.97 14952.84 207.05 176.86 93.83

peg.4096 76.97 14857.18 205.71 175.77 93.83

peg.4095 76.97 14857.18 205.71 175.77 93.83

peg.4094 76.97 14857.18 205.71 175.77 93.83

peg.2322 76.96 13659.88 185.68 166.91 95.59

peg.5223 76.97 14952.84 207.05 176.86 93.83

peg.665 76.97 14952.84 207.05 176.86 93.83

Results

105


peg.662 76.97 14952.84 207.05 176.86 93.83

peg.1583 76.97 14952.84 207.05 176.86 93.83

peg.906 76.97 14857.18 205.71 175.77 93.83

peg.907 76.97 14857.18 205.71 175.77 93.83

peg.1588 76.96 13721.31 198.96 157.62 89.61

peg.905 76.97 14857.18 205.71 175.77 93.83

peg.1465 76.97 14952.84 207.05 176.86 93.83

peg.3231 76.97 14546.08 196.70 176.86 96.08

peg.2220 76.97 14952.84 207.05 176.86 93.83

peg.1904 76.97 14952.84 207.05 176.86 93.83

peg.3970 76.97 14952.84 207.05 176.86 93.83

peg.3955 76.97 14952.84 207.05 176.86 93.83

peg.3953 76.97 14952.84 207.05 176.86 93.83

peg.4312 76.97 14952.75 207.00 176.86 93.85

Knockout candidate:

rocA - "Delta-1-pyrroline-5-carboxylate dehydrogenase (EC 1.5.1.12)"

start :3341188

stop: 3339557

frame: -1

Results

106

strand: -

length: 1632 bp

function: Delta-1-pyrroline-5-carboxylate dehydrogenase (EC 1.5.1.12) new: (EC 1.2.1.88)

subsystem: Arginine and Ornithine Degradation; Proline, 4-hydroxyproline uptake and

utilization

db_xref: GO:0003842

In contrast to NADP, NAD usually serves as oxidation agent in physiological reactions. Although

a large number of possible knockout candidates have been predicted, only few show effect or

are feasible. This knockout candidate is annotated to two reactions that can both be linked to

the same enzyme type:

(S)1-pyrroline-5-carboxylate NAD oxidoreductase (EC 1.2.1.88)

L-Glutamate-5-semialdehyde NAD oxidoreductase (EC 1.2.1.88)

Results

107

Figure 45: rocA in arginine and proline metabolism

This enzyme is part of the proline degradation pathway. It catalyzes the irreversible generation

of Glutamine from 1-pyrolidine-5-carboxylate, but can also generate L-Glutamate-5-

semialdehyde. L-Glutamate-5-semialdehyde is in a spontaneous equilibrium with 1-pyrolidine-5-

carboxylate and can be utilized as a precursor for the generation of Ornithine.

Results

108

Single gene knockout analysis

With this approach, the effect of each single gene knockout that is possible to simulate with the

model has been tested. The following results are the ones with the strongest impact on biomass

production or AUC without negatively influencing the secondary metabolite production.

Knockouts of essential genes were omitted.

Knockout candidates:

gene: peg.532, max production: 76.96, AUC: 13381.25 (93.31%) max biomass: 186.34, min

production: 157.04

plcA - "Phospholipase C"

start: 2096878

stop: 2094827

frame: -1

strand: -

length: 2052 bp

function: Phospholipase C

subsystem: -

db_xref:""

The reaction catalyzed by this enzyme is not utilizing any precursors that are needed for the

synthesis of Griseorhodin A. However, it has in simulations a strong impact on the generation of

biomass. In the model, it is linked to a single reaction:

Results

109

Plasmenylethanolamine-ethanolamine-phosphohydrolase (EC 3.1.4.3)

Figure 46: plcA in ether lipid metabolism

Results

110

This is a late step in the generation of ether lipids. Besides being a part of the cell membrane,

ether lipids are also known to play a role as antioxidants.

gene: peg.5897, max production: 76.97, AUC: 14468.30(96.15%), max biomass: 195.51, min

production: 176.86

rpe - "Ribulose-phosphate 3-epimerase (EC 5.1.3.1)"

start: 8213111

stop: 8213797

frame: 2

strand: +

length: 687 bp

function: Ribulose-phosphate 3-epimerase (EC 5.1.3.1)

subsystem: Pentose phosphate pathway; Riboflavin synthesis cluster

db_xref: "GO:0004750"

This knockout had the strongest positive effect on the AUC of the correlation of primary and

secondary metabolism in both, The SUKA22 and the wildtype model. It is linked to only two

reactions in the model:

R-Acetoin racemase (EC 5.1.2.4)

Results

111

D-Ribulose-5-phosphate-3-epimerase (EC 5.1.3.1)

The observed effect is mainly caused by the removal of the reaction from D-xylose phosphate to

D-Ribulose phosphate. This is an early step towards the generation of DNA and RNA, but also

histidine metabolism. It is important to keep in mind that the model is considering the energy

needed for transcription and translation only generalized in the biomass equation but not

specifically in the synthesis of Griseorhodin A.

Figure 47: rpe in pentose phosphate pathway

Results

112

5.6.4 Overview of all knockout candidates

150

160

170

180

190

200

210

0 20 40 60 80 100

Bio

mas

s p

rod

uct

ion

[m

mo

l/(g

*h)]

Griseorhodin A production [%]

wildtype

pyrB (Malonyl-CoA)

kbl (Acetyl-CoA)

pta (Acetyl-CoA)

fadE4 (FADH)

rocA (NAD)

plcA (-)

rpe (-)

Figure 49: Overview of the simulated effect of all knockout candidates with absolute biomass production values

Figure 48: Overview of the simulated effect of all knockout candidates with relative biomass production values

75

80

85

90

95

100

0 20 40 60 80 100

Bio

mas

s p

rod

uct

ion

[%

]

Griseorhodin A production [%]

wildtype

pyrB (Malonyl-CoA)

kbl (Acetyl-CoA)

pta (Acetyl-CoA)

fadE4 (FADH)

rocA (NAD)

plcA (-)

rpe (-)

Results

113

In comparison, the effects of the seven knockouts differ greatly from each other. Apart from

their influence on the maximum production of Griseorhodin A, they all lead to a different

maximum biomass production rate. The fadE4 (peg.4242) knockout simulation shows the

slightest variation from the simulation of the unmodified SUKA22 strain. Despite a small

decrease of the initial maximal biomass production, the course of the curves are nearly

identical. This still leads to a little higher AUC value in the percentage comparison. The most

promising result achieves the rpe (peg.5897) mutant. With the second strongest influence on

maximum biomass production, it reaches by far the highest AUC value. The lowest AUC value is

determined by the rocA (peg.1588) mutant, although it can still be associated to a reaction that

is competing for precursors. Its curve form indicates that by this modification the two functions

become linearly dependent on each other. Especially interesting are the impacts of kbl

(peg.439) and pyrB (peg.5879) as they lead to correlation curves that cut the curve of the

wildtype. This implies that depending on the actual ratio of biomass and Griseorhodin A

production, their effect can be both,

Table 16: Summary of all knockout candidates. All units except AUC are in [mmol/(g*h)]


WT 76,97 14952.84 207,05 176,86 93.83

peg.439 (kbl) 76.97 14485.20 201,10 173,59 93.58

peg.5879 (pyrB) 76.97 14690.05 202,42 176,86 94.29

peg.1695 (pta) 76.97 14946.68 205,99 176,86 94.27

peg.4242 (fadE4) 76.97 14952.17 206,87 176,86 93.90

peg.1588 (rocA) 76.96 13721.31 198.96 157.62 89.61

peg.532 (plcA) 76.96 13381.25 186.34 157.04 93.31

peg.5897 (rpe) 76.97 14468.30 195.51 176.86 96.15

Results

114

5.7 Experimental results

Dr. Roman Makitrynskyy was able to successfully execute all knockouts and cultivate the

different mutants. As expected, they are all capable of growth and production of Griseorhodin

A.

Figure 50: Differing morphology of the different Griseorhodin A producing SUKA22 mutants

All mutants grow and produce Griseorhodin A on agar plates. The ∆pyrB mutant develops only

few spores chains and aerial hyphae and has a bald and wrinkled surface. The other strains

produce a dark aerial mycelium and spore chains.

Results

115

Figure 51: Dry weights of all mutants and wildtype after 10 days of cultivation in TSB liquid medium

Surprisingly, the ∆pta mutant is growing faster and the ∆kbl mutant as fast as the wild type.

∆pyrB shows an unexpectedly low growth rate and starts to sporulate on agar plates only after

several weeks. ∆fadE4, ∆rocA and ∆rpe behave as predicted. ∆plcA grows faster than expected

but still slower than the wild type.

pta kbl pyrB fadE4 rocA WT rpe plcA0,0000

0,2000

0,4000

0,6000

0,8000

1,0000

1,2000

1,4000

1,6000

1,8000D

ry w

eigh

t [g

/l]

Results

116

Figure 52: Varying Griseorhodin production of the different SUKA22 mutants (top left) Griseorhodin A producing mutant on agar

plates after two days; (top right)Griseorhodin A producing mutant on agar plates after three days ; (bottom) Overview of the

strains location on the plates

After two days, all strains show at least a Griseorhodin A coloring as strong as in the wildtype.

Due to the hydrophobic properties if Griseorhodin A, the cultures are clearly outlined.

Therefore, only their color contrast can determine how well they produce compared to the

wildtype strain. ∆rocA and ∆plcA show by far the darkest red, followed by ∆kbl. ∆pta, ∆rpe and

∆fadE4 appear to be slightly darker than the wildtype, but with tone very close to each other.

After three days, all strains show a sharply limited red color.∆rocA, ∆kbl and ∆plcA still appear

darker. The other strains are alike. Due to its lipophilic properties, Griseorhodin A does not

Results

117

seem to diffuse into the medium and thus clearly delimits the edge of the colonies. ∆pyrB is not

growing fast enough for a comparison after this time.

Discussion & Conclusion

118

6 Discussion & Conclusion

6.1 StreptomeDB

The presented update of StreptomeDB includes, besides the enlargement of the dataset, several

new features that provide easier access to huge amount of knowledge about Streptomycetes

hidden in literature. These new features allow for browsing this data in a chemical, biological,

and evolutionary context.

The new update includes mainly the publications since the first release of StreptomeDB in 2013.

As Figure 15 shows, we could cover nearly all publications available at PubMed and we ensured

high quality of the data by extensive manual curation in addition to the automated screening of

full texts.

The rich diversity of secondary metabolites produced by Streptomycetes is also represented in

the manifold biological activities we could extract from literature. About 40% of all substances

are associated with antibiotic activity. This result was foreseeable for various reasons. Since this

genus is well known for producing a broad variety of antibiotics, it seems straightforward to test

new substances for this activity, especially since these tests are fast and inexpensive. This

probably leads to the report of rather weak activities. Until the end of the sixties, the search for

new antibiotics was a very popular topic, but was then slowly superseded by the search for anti-

cancer drugs. This can also be seen in the high proportion of anti-cancer and cytotoxic activity of

compounds in this dataset. In contrast to antibiotic (33.2%), antiviral (0.4%), or antifungal (4.8%)

compounds, it is unlikely that soil living bacteria have an evolutionary advantage in producing

substances that have a specific effect against human cancer cells. Due to extensive screening for

this activity, suitable substances have been found, even if this may not be their main biological

function.

Almost half of the reported activities cannot easily be assigned to a general group. This includes

very exceptional activities such as effects on the rotting speed of sea wheat or very specific ones

such as the inhibition of a certain enzyme. Since there is no comprehensive ontology of


119

secondary metabolites activities yet, it could be an interesting enrichment for the community

and a project for future StreptomeDB updates.

With nearly 70%, the majority of substances for which a synthesis pathway is reported are

polyketides. This is also the generally best studied group of secondary metabolites (94). Since

only a small part of the substances in the StreptomeDB are assigned to a specific synthetic

pathway, it is difficult to say if this result is representative. It is possible that polyketide

synthases are actually the largest and most diverse group of secondary metabolite clusters in

Streptomycetes. However, it is also possible that a large part of the substances could not yet be

assigned to other synthetic pathways, since the methods for their identification are not yet so

well developed.

The phylogenetic tree makes it possible to bring the data of StreptomeDB into a biological or

evolutionary context. It allows easy access to data of closely related strains or strains with gene

clusters of structures with similar chemotypes. The analysis of the publicly available data

showed, that despite major efforts, the quality of many published sequences is still doubtful and

analysis requires strict filtering. Even though, 16S rRNA sequences are the by far most popular

tool to determine evolutionary distance, it still has some drawbacks. For example, the

discriminatory power within one genus can be rather poor (95). Nevertheless, this approach was

the best way to generate a dataset as comprehensive as possible for StreptomeDB.

Although the phylogenetic tree represents only a little more than 300 strains, it covers the

majority of the included substances. Due to decreasing sequencing costs and the resulting

rapidly increasing numbers of published genomes, we hope to be able to further expand this

feature (35).

Due to the newly implemented predicted NMR and MS data, it is now possible to efficiently

support screening for new compounds. They allow for identifying structurally similar substances

or substructures at an early stage, and protect against investing a lot of effort in the rediscovery

of already known structures (96).


120

In conclusion, StreptomeDB is a comprehensive source of information about Streptomycetes

and the compounds produced by them (97). It differs from other popular databases of natural

products such as KNAPsACK or NORINE by the extensive networking of the contained data (98,

99). Thanks to the versatile new access possibilities, it offers an ideal starting point and vast

background information for a wide variety of scientific projects, from virtual screening

campaigns to the development of bioengineering strategies.


121

6.2 The effect of bldA in Streptomyces calvus

As described in chapter 2.2 bldA is an important pleiotropic regulator that is inter alia

responsible for the induction of the sporulation and induction of secondary metabolism of

several Streptomyces strains (25, 100, 101). This differentiation is accompanied by global

changes of the organism’s metabolic machinery and is commonly referred to as “metabolic

switch” (102). The underlying regulation mechanisms are however far more intricate than just

allowing for the transcription of genes that include the TTA codon that is mediated by the tRNA

encoded by bldA. Several of those genes encode for transcriptional regulators, translational

regulators, or enzymes influencing the conservation or posttranslational modification of

proteins. Understanding these complex regulatory networks is a rich source for bioengineering

approaches and an important step in the development of new strategies in the fight against the

spreading of antibiotic-resistant pathogens (103).

The presented experiment compares Streptomyces calvus in batch liquid cultures in a late state.

One of the strains is expressing a miss folding Leu-tRNA mutant as shown in Figure 6. This ‘bald’

mutant is not capable of differentiating an aerial mycelium or spores. The only secondary

metabolite that could be detected in this culture was the nonribosomal peptide WS9326A. The

culture with the complemented functional bldA gene could reestablish an aerial mycelium and

spore production. WS9326A production was significantly reduced, but additionally the

secondary metabolites Annimycin and Nucleocidin could be detected.

Next generation sequencing and 2D-gelelectrophoresis have been used to gather detailed

information from theses cultures.

To support the analysis of comprehensive effects of bldA, we constructed a genome scale

metabolic model of S. calvus. Such models are powerful tools for the recognition of global

coherences that are otherwise difficult to interpret from mere omic data (104).

A major challenge in the generation of the metabolic model was the reconstruction of

Nucleocidin synthesis, since only little is known about it yet. Especially the fluorination is still a

black box, although research has been carried out on it for years (24, 86). The reactions used in


122

the model are the most common reaction types that might support the synthesis of Nucleocidin

from Adenosine and were already present in the model. Even if the actual synthesis of

Nucleocidin may differ in detail, it is likely that a large part of the required metabolites and co-

factors already agree with the simulation.

In order to study the effects of bldA on the distribution of energy between growth and

secondary metabolism, we have chosen two approaches: on the one hand, we have generated

expectation values with the model, which we have compared with the experimental data. On

the other hand, we have used the metabolic model to predict the effect of the differentially

expressed genes on the entire metabolic system.

Figure 53: Model based data integration and interpretation

Lewis et al. were able to show that the models can precisely predict which metabolic pathways

are actually preferred by microorganism (105). Hence, it is possible to determine a set of genes

that is specific for the production of a certain metabolite. If metabolism is actually regulated

globally in response to the production of this substance, these genes should show significant

regulation.

Genomics

Transcriptomics

Proteomics

Metabolomics

Phenotype

Metabolic model


123

The expression of genes in the predicted groups of primary metabolism presented in Figure 25

and Figure 26 does not support this hypothesis. There are multiple possible reasons for this. Due

to several strong normalization steps that were necessary during the analysis of the gene

expression data and to the lack of replicates, it was not possible to filter the data by statistical

significance. The normal distribution of fold changes within the groups also indicates that

further filtering steps are necessary. Alternatively, it cannot be ruled out that there is actually

no specific regulation towards the production of secondary metabolites. Lastly, it is possible that

the model is not adapted specifically enough to the environmental conditions during expression

of bldA and is including pathways that are unspecific for the simulated phenotype.

In contrast, the proteomics data supports the hypothesis. Apart from three, all detected

proteins of primary metabolism could be allocated to the production of Nucleocidin. This also

includes some proteins that can also be allocated to growth and Annimycin production. 10% of

the proteins are exclusively needed for Nucleocidin production.

The used method to interpret transcriptomic data with the help of a genome-scale metabolic

model is based on two assumptions: The amount of available mRNA controls the amount of

available proteins. The amount of proteins, in return, controls the turnover rates of the

associated reactions. This approach is of course a strong simplification and contains a few

systematical biases. As already shown in this study, protein levels do not necessarily correlate

with gene expression levels. Gene regulation can have different reasons. Despite guiding

metabolite fluxes to a certain pathway, it can also be a reaction to abundance or shortage of a

certain metabolite and does not necessarily have to lead to higher turnover rates.

Additionally, while simulations with experimentally determined endpoints such as biomass

production or sugar uptake are a precise and powerful tool to understand the difficult to

measure fluxes of metabolites within an organism, extrapolations are more difficult to evaluate.

Typically, the simulated results are more efficient than reality. Nevertheless, van Berlo et al.

could demonstrate that especially for drastically regulated genes, these assumptions and the

resulting simulated results are correlating very well with experimental data (106). The method

used in this project uses expression levels as linear adjustments to the constraints of annotated


124

reactions. Colijn et al. successfully applied this approach for analysis of the metabolic network

of Mycobacterium tuberculosis under the influence of potential new therapeutics (107).

Due to the lack of experimental data such as growth rate, nutrient uptake or gas exchange, the

predictions are based on reference values from an unrestricted simulation with no experimental

background and should only be considered quantitatively.

Interestingly, this analysis revealed that the regulation of a single essential gene with a global

effect. It is necessary for the generation of folic acid metabolism and therefor controls the

synthesis of DNA and RNA.

In summary, we can observe the influence of bldA at all levels from transcription to phenotype.

In transcription, we can observe the regulation of secondary metabolite gene clusters and

changes in orthologous clusters such as carbohydrate metabolism that indicate an adaption to

changing environmental conditions. Even though we could not find indications for a global

reorganization of primary metabolism in favor of secondary metabolite production, we could

find a pattern in the overexpressed proteins from primary metabolism. Together with the weak

correlation of transcriptomic and proteomic data and the detected specific proteases that play a

role in the modifications of proteins, these results emphasize the important role of post

translational modifications guided by bldA. Especially in context with the predicted general

downregulation of DNA and RNA synthesis and possible starvation conditions after 48 hours of

cultivation, the conservation and specific degradation of proteins gains importance in the

reorganization of the metabolic machinery during morphological differentiation and secondary

metabolite production.


125

6.3 adpA regulon in Streptomyces asterosporus

Streptomyces calvus and Streptomyces asterosporus are very close relatives. For example, both

strains contain the rare Nucleocidin gene cluster that has a sequence similarity of over 98%.

It is surprising, that the genomic analysis performed by Songya Zhang indicates, that their bald

phenotype is caused by completely different mutation mechanisms within the same regulative

network. While a point mutation in S. calvus has led to a wrong folding and thus to the loss of

function of a tRNA, a transposon in S. asterosporus has disrupted the promoter site of a

downstream regulation factor. Nevertheless, the phenotypic result is in both strains the same.

The presented results confirm the high quality of genome sequencing. The G+C skew shown in

Figure 32 has a typical pattern with an inversion at the origin of replications. The uniform course

indicates that there were no irregular shifts of bigger sequence blocks during the assembly.

Detailed analysis of the neighboring regions of all transposons in the genome yielded only two

genes in close proximity, but no complete gene disruption.

Genomic analysis revealed more than 500 potential adpA binding sites. This underlines the

global regulatory role of adpA and is consistent with the observed function of adpA in

Streptomyces griseus (108).

Figure 35 demonstrates the high quality of the proteomic profiling of the wildtype strain and a

mutant with restored adpA function. With very high coverage and significance, this dataset

together with the detailed analysis of the adpA binding sites will facilitate the development of

an in-depth understanding of this complex regulon.


126

6.4 Enhancing Griseorhodin A expression in Streptomyces

avermitilis SUKA22

As shown in chapter 2.2, 5.4 and 5.5, Streptomycetes have a very complex secondary

metabolism and regulatory network (109). This makes them an interesting resource for new

compounds but also hard to handle as producers of natural compounds. Usually the titers of

produced compounds are very low and sometimes the production is only inducible under very

specific conditions (110). Therefore, researcher are working on genomical minimized strains in

order to increase the production efficiency, reduce complexity of regulatory networks and

minimize the number of unwanted side products. The strain S. avermitilis SUKA22 is an excellent

example of a minimized host strain and that has already proven superior to the natural hosts of

several biogenetic products (2). A minimized host is also a huge benefit for genome-scale

metabolic simulations due to the reduced complexity of the organism. Especially synthesis

pathways for secondary metabolites are strongly affected by regulation mechanisms that can

influence the complete metabolic network. These effects are rarely well described yet and are

difficult to integrate into the simulations. Fortunately, the group of Prof. Dr. Haruo Ikeda has

stripped S. avermitilis SUKA22 of most of its secondary metabolite gene clusters. The popularity

of Streptomyces avermitilis as research topic and as industrial producer is an additional benefit,

since the generation of genome-scale metabolic model depends heavily on published

background information. The available cosmid database is also a great advantage for the

realization of the experimental parts of this project (91).

6.4.1 Minimal medium validation of the model of S. avermitilis SUKA22

Minimal media are ideal for collecting experimental data in order to validate and optimize

metabolic models (111). Their exact composition allows for eliminating a large portion of

uncertainty. The validation approach presented herein was able to efficiently identify several

gaps and errors in the model and thus strongly increase its significance.


127

Most of the detected errors were the results of the gap-filling algorithm, which could not close

gaps bigger than two reactions and therefore simply added the required products of the missing

reactions as a nutrient.

The in vitro growth rates on different carbon sources are usually lower and differ more from

each other than in the simulations (112). Since the model does not take enzyme kinetics and

regulatory effects into account, the use of the different carbon sources is probably displayed

unrealistically efficient. Therefore, they have to be compared experimentally.

Although minimal media usually do not achieve as good production results as full media, they

are a good starting point to develop rational strategies with the help of metabolic models to

further improve conventional production media, e.g. by adding certain amino acids. Song et al.

could successfully predict an optimal medium composition for the production of succinic acid in

Mannheimia succiniciproducens with FBA. It did not only increase the production rate by 15%,

but could also reduce the amount of unwanted by-products by 30% (113).

Especially in this project, which aims to reduce the efficiency of certain metabolic pathways,

minimal media are an excellent medium for iteratively validating the generated models.


Compared to chemical synthesis, biological production of secondary metabolites has an

important efficiency deficiency: the generation of biomass. Most of the energy provided by the

cultivation medium is not used for the production of the desired metabolite but for cell growth.

Secondary metabolites, as their name suggests, are only produced in secondary

importance/priority. As it can be seen in chapter 5.4 and in the results from Nieselt et al.,

bacteria strains have developed a way to reduce cell growth in order to support the production

of secondary metabolites by a so called metabolic switch, for example in a starvation situation

where resources are anyway rare and need to be used efficiently (102).

In this project, we have identified knockout candidates in robust pathways that lead to a

stoichiometrically less efficient synthesis route of certain metabolites. From the assumption that


128

the stoichiometrical or resource wise simplest pathway is also the most efficient way in

turnover rates, we can predict that a stoichiometrically less efficient pathway is also slower.

One of the biggest strengths of Flux Balance Analysis (FBA) is the possibility to simulate

metabolic processes within an organism without detailed knowledge of enzyme kinetics. These

predictions are surprisingly accurate as long as you interpolate between experimentally

measured endpoints like the amount of produced biomass and nutrient uptake. Lewis et al.

could demonstrate that E.coli in exponential growth phase actually behaves as predicted by a

genome-scale metabolic model. Thereby, it does not only upregulate genes from the

stoichiometrically most efficient metabolic pathways, but also downregulates ones from

alternative routes (105).

Flux Balance Analysis can only predict the optimal value of a single reaction within the reaction

network. To predict the correlation of biomass production and secondary metabolism, we have

applied an approximation of a multi-objective optimization. The resulting relative solution space

indicates how strongly the two reactions are competing for metabolites. A similar approach is

utilized by OptKnock and MultiMetEval, whereas the underlying idea is differing fundamentally

(55, 114). OptKnock enables the development of strategies how to couple the production of a

desired metabolite to the production of biomass in order to make it a necessary byproduct. The

correlation plot is thereby used to calculate the Pareto-Front of the two reactions, which is the

most efficient production ratio. This is a powerful method for increasing the yield of biogenic

products from primary metabolism. However, the application possibilities for products of

secondary metabolism are limited to optimization of the generation of certain precursors (114).

Zakrzewski et al. used their tool MultiMetEval to compare the production capability of

metabolic networks of different host strains for different secondary metabolites based on a

correlation plot. Besides the comparison of the resulting Pareto-Fronts, they also concluded that

a bigger plateau of these plots indicates a smaller burden on the production of biomass if the

secondary metabolite production is increased (55). Additionally they demonstrated that the

quality of automatically generated models that have barely been revised was already sufficient

to make usable predictions in 2012. Since then, the accuracy of automated reconstruction


129

pipelines has significantly increased up to the most recent version of modelSEED that has been

released in 2017 but is not published yet (115).

As mentioned before, we did not focus on increasing of the production of a certain precursor

but on decreasing the efficiency of biomass production in general. For the evaluation of the

proposed knockout mutants, we used an approach that is based on the MultiMetEval method.

However, we used a relative scale for the generation of biomass to be able to better compare

the different results. The size and steepness of the plateau of the plot was compared based on

the resulting AUC value, on the assumption that a lower burden on the production of biomass

per produced unit of secondary metabolite will naturally lead to a higher production of

secondary metabolite in relation to produced biomass.

From all possible knockout mutants, the ∆rpe mutant obtains the highest AUC value and is not

related to the consumption of a certain metabolite that is needed for the generation of

Griseorhodin A. As the reaction predicted to be the driving force behind the reallocation of

resources triggered by bldA, this reactions play a role in the generation of precursors needed for

the synthesis of DNA and RNA. The preliminary experimental results indicate that this mutation

actually obtains the predicted effect. Even though this mutant is slower growing than the

wildtype, it shows a significantly darker coloring on agar plate after two days of cultivation.

Additionally, knockout candidates have been predicted, that have a strong negative effect on

the generation of biomass but not on the Griseorhodin A production. The resulting AUC values

were hereby only of secondary importance. This analysis lead to the prediction of plcA as a

knockout target, which has the strongest impact on the generation of biomass without

decreasing Griseorhodin A production in simulations. This gene is related to a late step in the

generation of ether lipids, which are a part of bacterial membranes. The most interesting

question herein was if reducing the production of a single component of the biomass would

actually lead to the predicted general reduction of growing efficiency. The biomass equation in

the metabolic model is only considering one composition of typical cell components. Therefore,

the decrease of a single component always leads to a general decrease of biomass generation in

simulations, even though e.g. a decrease of peptidoglycan production would only lead to a


130

thinner cell wall. The experimental results indicate that the ratio of certain cell components is

actually regulated. As the ∆rpe mutant, the ∆plcA mutant is showing a decreased growing

speed, even though it is not the slowest growing mutant, and a darker coloring than the

unmodified SUKA22 strain.

Even though the preliminary experimental data is difficult to interpret and not quantitatively

exact, it is clearly indicating a successful change of the ratio for produced biomass and

secondary metabolite in favor of Griseorhodin A.

The methods shown in chapter 0 were used to identify reactions that are competing the most

for a certain precursors that is needed for the synthesis of Griseorhodin A. By disrupting

reactions in robust pathways, this method forces the utilization of less efficient alternative

synthesis routes. This does not necessarily change the production rate of a certain metabolite

but shall decrease the efficiency of primary metabolism to utilize it. As in the previous approach,

knockout candidates that had a negative effect on Griseorhodin A production were omitted.

The screening for knockout candidates utilizing NADPH did unfortunately not yield a suitable

candidate. The reactions were either essential, annotated to multiple genes or also negatively

influencing the Griseorhodin A production. In fatty acid synthesis, NADPH is known to be a

constraining precursor and is therefore a prominent target in the development of

overproduction strategies, especially in the biofuel community (116). Even though the

consumption of NADPH is usually lower in the mechanistically very similar polyketide synthesis,

it is probably still a limiting precursor and valuable target. As most of the NADPH dependent

reactions are annotated very unspecifically to several genes, it is probably possible to further

refine these assignments and to identify a candidate after all.

The knockout candidate pyrB is annotated to two different types of reactions in the model. One

that is competing for Malonyl-CoA and one that is playing an important role in the pyrimidine

synthesis pathway. These reactions are catalyzed by related but different carbamoylases. Even

though the knockout of the related reaction did indicate neither their essentiality, nor a strong

impact on the biomass production, the morphology and growing rate of the knockout mutant in

vitro is strongly differing from the other strains. For pyrB was reported to be essential in


131

Helicobacter pylori, it is interesting that Streptomyces avermitilis SUKA22 is able to maintain

growth without a functional pyrB gene (92). However, this knockout did probably not disrupt

the predicted reaction and this experiment should be considered for a repetition after a more

specialized screening for the actual gene catalyzing the reaction of interest.

Kbl is catalyzing a reaction that can utilize Acetyl-CoA and Glycine for the generation of

precursors of Threonine. This reaction is reversible and might also contribute to the production

of Acetyl-CoA under the right conditions, for example in a Glycine deficient medium that

contains a high concentration of Threonine. The preliminary experimental results match the

computational predictions and indicate that this reaction is rather consuming Acetyl-CoA in a

complex medium. The ∆kbl mutant showed a growing speed that is indistinguishable from the

unmodified SUKA22 strain, but showed a significantly darker coloring after two days of

cultivation on the agar plate.

Pta is annotated to two classes of reactions in the model. One class is associated to the storage

and mobilization of vitamin A and plays only a minor role in the model simulations. The other

one is a late step in lactic acid fermentation, which leads to the generation of ATP for Acetyl-

CoA. This process is most likely very dependent on environmental conditions. Lactic acid

fermentation is an alternative anaerobic pathway for energy generation, which is less efficient

than cellular respiration and is hence only activated under anaerobic conditions. This is a typical

example for regulation processes that are not sufficiently represented in genome-scale

metabolic models. Surprisingly, this modification leads to an increased biomass production,

whose underlying process could not be elucidated yet and is not correlating to the model

predictions. The experiments do neither indicate an increased Griseorhodin A production.

However, it is possible that this mutant can only show its potential under anaerobic conditions.

FadE4 is catalyzing a reaction that is consuming FAD for the generation of a precursor for the

generation of Valine, Leucine and Isoleucine. As described for the ∆kbl mutant, it is possible that

the effect of this knockout can either be amplified, or reversed by a corresponding media

composition. The measured growth rate is reduced and corresponds to the expectations. The

production rate of Griseorhodin seems to be slightly increased after two days in comparison to


132

the unmodified SUKA22 strain. Another remarkable feature of this mutant is, that the knocked

out reaction is positioned at a metabolic bifurcation, where 3-methylbutanoyl-CoA is either

used for amino acid, or terpene backbone synthesis. Although it only shows limited effect on

the production of polyketides, this might be an efficient mutant for the production of terpenes.

RocA is involved in the conversion of Proline, Glutamate, and Arginine. It is catalyzing a

reversible reaction that can either produce NADPH or consume NAD. Therefore, its role might

again be dependent on the media composition. The growing speed of the mutant corresponds

to the in silico predictions. Even though it has the lowest AUC value of all knockout candidates,

preliminary results indicate that this is a potent overproducer of Griseorhodin A.

Figure 48 shows the predicted relative, and Figure 49 shows the absolute correlation of growth

and Griseorhodin A production in SUKA22 and all knockout mutants in comparison. Although all

three methods used could successfully predict overproducers, it is difficult to assess their

potential with the help of the AUC value, apart from the candidate directly determined by the

AUC value. Since, as mentioned above, the effects of many of the candidates are likely to be

strongly influenced by environmental conditions, it can be assumed that the quality and

resolution of the predictions can still be further improved, taking into account the

corresponding experimental data.


133

6.4.3 Conclusion & Future perspective

The methods presented here represent a novel tool for the development of metabolic

engineering strategies that aim on the optimization of secondary metabolite production. While

common methods focus on increasing the secondary metabolite production or the production

of precursors needed for their synthesis, we focus on decreasing the efficiency of primary

metabolism and its ability to efficiently utilize those precursors. These different approaches are

likely to complement each other very well. We could predict several promising modifications,

which already proved their potential in preliminary in vitro experiments that will soon be

complemented by detailed quantitative measurements. One of the knockouts is using an

approach that we could also observe in the regulation of the metabolic switch in Streptomyces

calvus by bldA. It limits the production of DNA and RNA, leading to a global downregulation of

the metabolism, which seems to affect secondary metabolite clusters less severely.

Most of the knockout mutants show great potential and versatility. They are not specifically

improving the production of a certain secondary metabolite, but specifically decreasing the

efficiency of primary metabolism or the ability of primary metabolism to utilize a certain

precursor. Though current results indicate that the effect of the modifications is guided by

environmental conditions, we are planning to further explore these properties using the

gathered knowledge to improve our model and thereby gain a deep understanding of the

mechanism guiding metabolic flux.

References

134

7 References

1. Kalan,L., Gessner,A., Thaker,M.N., Waglechner,N., Zhu,X., Szawiola,A., Bechthold,A.,

Wright,G.D. and Zechel,D.L. (2013) A cryptic polyene biosynthetic gene cluster in

Streptomyces calvus is expressed upon complementation with a functional bldA gene.

Chem. Biol., 20, 1214–1224.

2. Komatsu,M., Komatsu,K., Koiwai,H., Yamada,Y., Kozone,I., Izumikawa,M., Hashimoto,J.,

Takagi,M., Omura,S., Shin-ya,K., et al. (2013) Engineered Streptomyces avermitilis host for

heterologous expression of biosynthetic gene cluster for secondary metabolites. ACS Synth.

Biol., 2, 384–96.

3. Yunt,Z., Reinhardt,K., Li,A., Engeser,M., Dahse,H.-M., Gutschow,M., Bruhn,T., Bringmann,G.

and Piel,J. (2009) Cleavage of four carbon−carbon bonds during biosynthesis of the

Griseorhodin A spiroketal pharmacophore. J. Am. Chem. Soc., 131, 2297–2305.

4. Wang,X., Elshahawi,S.I., Cai,W., Zhang,Y., Ponomareva,L. V., Chen,X., Copley,G.C., Hower,J.C.,

Zhan,C.-G., Parkin,S., et al. (2017) Bi- and tetracyclic spirotetronates from the coal mine fire

isolate Streptomyces sp. LC-6-2. J. Nat. Prod., 80, 1141–1149.

5. Li,A., Piel,J., Hopwood,D.., Rawlings,B.., Piel,J., Hertweck,C., Shipley,P.., Hunt,D..,

Newman,M.., Moore,B.., et al. (2002) A gene cluster from a marine Streptomyces encoding

the biosynthesis of the aromatic spiroketal polyketide Griseorhodin A. Chem. Biol., 9, 1017–

1026.

6. Schlimpert,S., Flärdh,K. and Buttner,M. (2016) Fluorescence time-lapse imaging of the

complete S. venezuelae life cycle using a microfluidic device. J. Vis. Exp., 10.3791/53863.

7. McGREGOR,J.F. (1954) Nuclear division and the life cycle in a Streptomyces sp. J. Gen.

Microbiol., 11, 52–6.

8. Zetterström,R. (2007) Selman A. Waksman (1888–1973) Nobel Prize in 1952 for the discovery

of streptomycin, the first antibiotic effective against tuberculosis*. Acta Paediatr., 96, 317–

319.

References

135

9. Procópio,R.E. de L., Silva,I.R. da, Martins,M.K., Azevedo,J.L. de and Araújo,J.M. de (2012)

Antibiotics produced by Streptomyces. Braz. J. Infect. Dis., 16, 466–71.

10. Sladden,M.J. and Hutchinson,P.E. (2017) The BLISTER study: possible overestimation of

tetracycline efficacy. Lancet, 390, 735.

11. Belley,A., Lalonde-Seguin,D., Arhin,F.F. and Moeck,G. (2017) Comparative

pharmacodynamics of single-dose Oritavancin and daily high-dose Daptomycin regimens

against Vancomycin-resistant Enterococcus faecium isolates in an in vitro PK/PD model of

infection. Antimicrob. Agents Chemother., 10.1128/AAC.01265-17.

12. Giricz,Z., Varga,Z. V., Koncsos,G., Nagy,C.T., Görbe,A., Mentzer,R.M., Gottlieb,R.A. and

Ferdinandy,P. (2017) Autophagosome formation is required for cardioprotection by

chloramphenicol. Life Sci., 186, 11–16.

13. Kose,L.P., Gulçin,İ., Özdemir,H., Atasever,A., Alwasel,S.H. and Supuran,C.T. (2015) The

effects of some avermectins on bovine carbonic anhydrase enzyme. J. Enzyme Inhib. Med.

Chem., 10.3109/14756366.2015.1064406.

14. Krenzien,F., ElKhal,A., Quante,M., Biefer,H.R.C., Hirofumi,U., Gabardi,S. and Tullius,S.G.

(2015) A rationale for age-adapted immunosuppression in organ transplantation.

Transplantation, 10.1097/TP.0000000000000842.

15. Bai,T., Zhang,D., Lin,S., Long,Q., Wang,Y., Ou,H., Kang,Q., Deng,Z., Liu,W. and Tao,M. (2014)

Operon for biosynthesis of lipstatin, the Beta-lactone inhibitor of human pancreatic lipase.

Appl. Environ. Microbiol., 80, 7473–83.

16. Chen,W.-J. (2016) Honoring antiparasitics: The 2015 Nobel Prize in Physiology or Medicine.

Biomed. J., 39, 93–97.

17. Landwehr,W., Wolf,C. and Wink,J. (2016) Actinobacteria and MyxobacteriatTwo of the most

important bacterial resources for novel antibiotics. In Current topics in microbiology and

immunology.Vol. 398, pp. 273–302.

References

136

18. Ikeda,H. (2017) Natural products discovery from micro-organisms in the post-genome era †.

Biosci. Biotechnol. Biochem., 81, 13–22.

19. Chater,K.F. (2013) Curing baldness activates antibiotic production. Chem. Biol., 20, 1199–

1200.

20. Thomas,S.O., Singleton,V.L., Lowery,J.A., Sharpe,R.W., Pruess,L.M., Porter,J.N., Mowat,J.H.

and Bohonos,N. Nucleocidin, a new antibiotic with activity against Trypanosomes. Antibiot.

Annu., 1956–1957, 716–21.

21. Chan,K.K.J. and O’Hagan,D. (2012) The rare fluorinated natural products and

biotechnological prospects for fluorine enzymology. In Methods in enzymology.Vol. 516,

pp. 219–235.

22. Murphy,C.D., Schaffrath,C. and O’Hagan,D. (2003) Fluorinated natural products: the

biosynthesis of fluoroacetate and 4-fluorothreonine in Streptomyces cattleya.

Chemosphere, 52, 455–461.

23. Zhu,X.M., Hackl,S., Thaker,M.N., Kalan,L., Weber,C., Urgast,D.S., Krupp,E.M., Brewer,A.,

Vanner,S., Szawiola,A., et al. (2015) Biosynthesis of the fluorinated natural product

Nucleocidin in Streptomyces calvus Is dependent on the bldA-specified Leu-tRNA(UUA)

molecule. Chembiochem, 10.1002/cbic.201500402.

24. Bartholomé,A., Janso,J.E., Reilly,U., O’Hagan,D., Inagaki,K., Urgast,D.S., Krupp,E.M.,

Brewer,A., Vanner,S., Szawiola,A., et al. (2017) Fluorometabolite biosynthesis: isotopically

labelled glycerol incorporations into the antibiotic nucleocidin in Streptomyces calvus. Org.

Biomol. Chem., 15, 61–64.

25. Hackl,S. and Bechthold,A. (2015) The Gene bldA , a regulator of morphological

differentiation and antibiotic production in Streptomyces. Arch. Pharm. (Weinheim)., 348,

455–462.

References

137

26. Higo,A., Horinouchi,S. and Ohnishi,Y. (2011) Strict regulation of morphological

differentiation and secondary metabolism by a positive feedback loop between two global

regulators AdpA and BldA in Streptomyces griseus. Mol. Microbiol., 81, 1607–22.

27. Takano,E., Tao,M., Long,F., Bibb,M.J., Wang,L., Li,W., Buttner,M.J., Bibb,M.J., Deng,Z.X. and

Chater,K.F. (2003) A rare leucine codon in adpA is implicated in the morphological defect of

bldA mutants of Streptomyces coelicolor. Mol. Microbiol., 50, 475–86.

28. Brockmann,H., Lenk,W., Schwantje,G. and Zeeck,A. (1966) The structure of rubromycin.

Tetrahedron Lett., 30, 3525–30.

29. Ivancich,M., Schrank,Z., Wojdyla,L., Leviskas,B., Kuckovic,A., Sanjali,A. and Puri,N. (2017)

Treating cancer by targeting telomeres and telomerase. Antioxidants, 6, 15.

30. Atkinson,D.J. and Brimble,M.A. (2015) Isolation, biological activity, biosynthesis and

synthetic studies towards the rubromycin family of natural products. Nat. Prod. Rep., 32,

811–40.

31. Kanehisa,M., Furumichi,M., Tanabe,M., Sato,Y. and Morishima,K. (2017) KEGG: new

perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res., 45, D353–

D361.

32. Latendresse,M., Paley,S. and Karp,P.D. (2012) Browsing metabolic and regulatory networks

with BioCyc. Methods Mol. Biol., 804, 197–216.

33. Henry,C.S., DeJongh,M., Best,A.A., Frybarger,P.M., Linsay,B. and Stevens,R.L. (2010) High-

throughput generation, optimization and analysis of genome-scale metabolic models. Nat.

Biotechnol., 28, 977–82.

34. Durot,M., Bourguignon,P.-Y. and Schachter,V. (2009) Genome-scale models of bacterial

metabolism: reconstruction and applications. FEMS Microbiol. Rev., 33, 164–90.

35. Land,M., Hauser,L., Jun,S.-R., Nookaew,I., Leuze,M.R., Ahn,T.-H., Karpinets,T., Lund,O.,

Kora,G., Wassenaar,T., et al. (2015) Insights from 20 years of bacterial genome sequencing.

Funct. Integr. Genomics, 15, 141–61.

References

138

36. Benedict,M.N., Mundy,M.B., Henry,C.S., Chia,N. and Price,N.D. (2014) Likelihood-based gene

annotations for gap filling and quality assessment in genome-scale metabolic models. PLoS

Comput. Biol., 10, e1003882.

37. Hosseini,Z. and Marashi,S.-A. (2017) Discovering missing reactions of metabolic networks by

using gene co-expression data. Sci. Rep., 7, 41774.

38. Cuevas,D.A., Edirisinghe,J., Henry,C.S., Overbeek,R., O’Connell,T.G. and Edwards,R.A. (2016)

From DNA to FBA: How to build your own genome-scale metabolic model. Front.

Microbiol., 7, 907.

39. Raman,K. and Chandra,N. (2009) Flux balance analysis of biological systems: applications

and challenges. Brief. Bioinform., 10, 435–449.

40. Edwards,J.S. and Palsson,B.O. (2000) The Escherichia coli MG1655 in silico metabolic

genotype: its definition, characteristics, and capabilities. Proc. Natl. Acad. Sci. U. S. A., 97,

5528–33.

41. Lovén,J., Orlando,D.A., Sigova,A.A., Lin,C.Y., Rahl,P.B., Burge,C.B., Levens,D.L., Lee,T.I. and

Young,R.A. (2012) Revisiting global gene expression analysis. Cell, 151, 476–482.

42. Montezano,D., Meek,L., Gupta,R., Bermudez,L.E. and Bermudez,J.C.M. (2015) Flux balance

analysis with objective function defined by proteomics data-metabolism of mycobacterium

tuberculosis exposed to mefloquine. PLoS One, 10, e0134014.

43. Toya,Y. and Shimizu,H. (2013) Flux analysis and metabolomics for systematic metabolic

engineering of microorganisms. Biotechnol. Adv., 31, 818–26.

44. Vogel,C. and Marcotte,E.M. (2012) Insights into the regulation of protein abundance from

proteomic and transcriptomic analyses. Nat. Rev. Genet., 13, 227–32.

45. Song,H.-S., Reifman,J. and Wallqvist,A. (2014) Prediction of metabolic flux distribution from

gene expression data based on the flux minimization principle. PLoS One, 9, e112524.

References

139

46. Hyduke,D.R., Lewis,N.E. and Palsson,B.Ø. (2013) Analysis of omics data with genome-scale

models of metabolism. Mol. Biosyst., 9, 167–74.

47. Lerman,J.A., Hyduke,D.R., Latif,H., Portnoy,V.A., Lewis,N.E., Orth,J.D., Schrimpe-

Rutledge,A.C., Smith,R.D., Adkins,J.N., Zengler,K., et al. (2012) In silico method for

modelling metabolism and gene product expression at genome scale. Nat. Commun., 3,

929.

48. Colijn,C., Brandes,A., Zucker,J., Lun,D.S., Weiner,B., Farhat,M.R., Cheng,T.-Y., Moody,D.B.,

Murray,M. and Galagan,J.E. (2009) Interpreting expression data with metabolic flux

models: predicting Mycobacterium tuberculosis mycolic acid production. PLoS Comput.

Biol., 5, e1000489.

49. Kremer,F., Blank,L.M., Jones,P.R. and Akhtar,M.K. (2015) A comparison of the microbial

production and combustion characteristics of three alcohol biofuels: Ethanol, 1-Butanol,

and 1-Octanol. Front. Bioeng. Biotechnol., 3, 112.

50. Khanna,C., Rosenberg,M. and Vail,D.M. (2015) A review of Paclitaxel and novel formulations

including those suitable for use in dogs. J. Vet. Intern. Med., 29, 1006–1012.

51. Meng,Z., Lv,Q., Lu,J., Yao,H., Lv,X., Jiang,F., Lu,A. and Zhang,G. (2016) Prodrug strategies for

paclitaxel. Int. J. Mol. Sci., 17, 796.

52. Tamano,K. (2014) Enhancing microbial metabolite and enzyme production: current

strategies and challenges. Front. Microbiol., 5, 718.

53. Nah,H.-J., Pyeon,H.-R., Kang,S.-H., Choi,S.-S. and Kim,E.-S. (2017) Cloning and heterologous

expression of a large-sized natural product biosynthetic gene cluster in Streptomyces

species. Front. Microbiol., 8, 394.

54. Chen,X., Li,S. and Liu,L. (2014) Engineering redox balance through cofactor systems. Trends

Biotechnol., 32, 337–343.

References

140

55. Zakrzewski,P., Medema,M.H., Gevorgyan,A., Kierzek,A.M., Breitling,R. and Takano,E. (2012)

MultiMetEval: comparative and multi-objective analysis of genome-scale metabolic

models. PLoS One, 7, e51511.

56. Yoshikawa,K., Toya,Y. and Shimizu,H. (2017) Metabolic engineering of Synechocystis sp. PCC

6803 for enhanced ethanol production based on flux balance analysis. Bioprocess Biosyst.

Eng., 40, 791–796.

57. Dang,L., Liu,J., Wang,C., Liu,H. and Wen,J. (2017) Enhancement of rapamycin production by

metabolic engineering in Streptomyces hygroscopicus based on genome-scale metabolic

model. J. Ind. Microbiol. Biotechnol., 44, 259–270.

58. Lucas,X., Senger,C., Erxleben,A., Grüning,B.A., Döring,K., Mosch,J., Flemming,S. and

Günther,S. (2013) StreptomeDB: a resource for natural compounds isolated from

Streptomyces species. Nucleic Acids Res., 41, D1130-6.

59. Lucas,X., Grüning,B.A., Bleher,S. and Günther,S. (2015) The purchasable chemical space: a

detailed picture. J. Chem. Inf. Model., 55, 915–24.

60. Sayers,E.W., Barrett,T., Benson,D.A., Bryant,S.H., Canese,K., Chetvernin,V., Church,D.M.,

DiCuccio,M., Edgar,R., Federhen,S., et al. (2009) Database resources of the National Center

for Biotechnology Information. Nucleic Acids Res., 37, D5-15.

61. Benson,D.A., Cavanaugh,M., Clark,K., Karsch-Mizrachi,I., Lipman,D.J., Ostell,J. and

Sayers,E.W. (2013) GenBank. Nucleic Acids Res., 41, D36-42.

62. Ichikawa,N., Sasagawa,M., Yamamoto,M., Komaki,H., Yoshida,Y., Yamazaki,S. and Fujita,N.

(2013) DoBISCUIT: a database of secondary metabolite biosynthetic gene clusters. Nucleic

Acids Res., 41, D408-14.

63. Medema,M.H., Kottmann,R., Yilmaz,P., Cummings,M., Biggins,J.B., Blin,K., de Bruijn,I.,

Chooi,Y.H., Claesen,J., Coates,R.C., et al. (2015) Minimum Information about a Biosynthetic

Gene cluster. Nat. Chem. Biol., 11, 625–631.

References

141

64. Quast,C., Pruesse,E., Yilmaz,P., Gerken,J., Schweer,T., Yarza,P., Peplies,J. and Glöckner,F.O.

(2013) The SILVA ribosomal RNA gene database project: improved data processing and

web-based tools. Nucleic Acids Res., 41, D590-6.

65. Cock,P.J.A., Antao,T., Chang,J.T., Chapman,B.A., Cox,C.J., Dalke,A., Friedberg,I., Hamelryck,T.,

Kauff,F., Wilczynski,B., et al. (2009) Biopython: freely available Python tools for

computational molecular biology and bioinformatics. Bioinformatics, 25, 1422–3.

66. McWilliam,H., Li,W., Uludag,M., Squizzato,S., Park,Y.M., Buso,N., Cowley,A.P. and Lopez,R.

(2013) Analysis tool web services from the EMBL-EBI. Nucleic Acids Res., 41, W597-600.

67. Tamura,K., Stecher,G., Peterson,D., Filipski,A. and Kumar,S. (2013) MEGA6: Molecular

Evolutionary Genetics Analysis version 6.0. Mol. Biol. Evol., 30, 2725–9.

68. Sukumaran,J. and Holder,M.T. (2010) DendroPy: a Python library for phylogenetic

computing. Bioinformatics, 26, 1569–71.

69. Huerta-Cepas,J., Dopazo,J. and Gabaldón,T. (2010) ETE: a python Environment for Tree

Exploration. BMC Bioinformatics, 11, 24.

70. Siguier,P., Perochon,J., Lestrade,L., Mahillon,J. and Chandler,M. (2006) ISfinder: the

reference centre for bacterial insertion sequences. Nucleic Acids Res., 34, D32-6.

71. Yao,M.D., Ohtsuka,J., Nagata,K., Miyazono,K.-I., Zhi,Y., Ohnishi,Y. and Tanokura,M. (2013)

Complex structure of the DNA-binding domain of AdpA, the global transcription factor in

Streptomyces griseus, and a target duplex DNA reveals the structural basis of its tolerant

DNA sequence specificity. J. Biol. Chem., 288, 31019–29.

72. Szklarczyk,D., Morris,J.H., Cook,H., Kuhn,M., Wyder,S., Simonovic,M., Santos,A.,

Doncheva,N.T., Roth,A., Bork,P., et al. (2017) The STRING database in 2017: quality-

controlled protein-protein association networks, made broadly accessible. Nucleic Acids

Res., 45, D362–D368.

73. The UniProt Consortium (2017) UniProt: the universal protein knowledgebase. Nucleic Acids

Res., 45, D158–D169.

References

142

74. Noé,L. and Kucherov,G. (2005) YASS: enhancing the sensitivity of DNA similarity search.

Nucleic Acids Res., 33, W540-3.

75. Ebrahim,A., Lerman,J.A., Palsson,B.O. and Hyduke,D.R. (2013) COBRApy: COnstraints-Based

Reconstruction and Analysis for Python. BMC Syst. Biol., 7, 74.

76. Aziz,R.K., Bartels,D., Best,A.A., DeJongh,M., Disz,T., Edwards,R.A., Formsma,K., Gerdes,S.,

Glass,E.M., Kubal,M., et al. (2008) The RAST Server: Rapid Annotations using Subsystems

Technology. BMC Genomics, 9, 75.

77. Fritzemeier,C.J., Hartleb,D., Szappanos,B., Papp,B. and Lercher,M.J. (2017) Erroneous

energy-generating cycles in published genome scale metabolic networks: Identification and

removal. PLOS Comput. Biol., 13, e1005494.

78. Zhu,S. and Chen,G. (2016) Explicit numerical solutions of a microbial survival model under

nonisothermal conditions. Food Sci. Nutr., 4, 284–289.

79. Hopwood,D.A., Bibb,M.J., Chater,K.F., Kieser,T., Bruton,C.J., Kieser,H.M., Lydiate,D.J.,

Smith,C.P., Ward,J.M. and Schrempf,H. (1986) Genetic manipulation of Streptomyces: A

laboratory manual Headington Hill Hall.

80. Klementz,D., Döring,K., Lucas,X., Telukunta,K.K., Erxleben,A., Deubel,D., Erber,A.,

Santillana,I., Thomas,O.S., Bechthold,A., et al. (2016) StreptomeDB 2.0—an extended

resource of natural products produced by streptomycetes. Nucleic Acids Res., 44, D509–

D514.

81. Yamada,Y., Kuzuyama,T., Komatsu,M., Shin-Ya,K., Omura,S., Cane,D.E. and Ikeda,H. (2015)

Terpene synthases are widely distributed in bacteria. Proc. Natl. Acad. Sci. U. S. A., 112,

857–62.

82. Law,V., Knox,C., Djoumbou,Y., Jewison,T., Guo,A.C., Liu,Y., Maciejewski,A., Arndt,D.,

Wilson,M., Neveu,V., et al. (2014) DrugBank 4.0: shedding new light on drug metabolism.

Nucleic Acids Res., 42, D1091–D1097.

References

143

83. Cochrane,G., Alako,B., Amid,C., Bower,L., Cerdeño-Tárraga,A., Cleland,I., Gibson,R.,

Goodgame,N., Jang,M., Kay,S., et al. (2013) Facing growth in the European Nucleotide

Archive. Nucleic Acids Res., 41, D30-5.

84. Allen,F., Greiner,R. and Wishart,D. (2014) Competitive fragmentation modeling of ESI-

MS/MS spectra for putative metabolite identification. Metabolomics, 11, 98–110.

85. Langdon,S.R., Brown,N. and Blagg,J. (2011) Scaffold diversity of exemplified medicinal

chemistry space. J. Chem. Inf. Model., 51, 2174–85.

86. Zhu,X.M., Hackl,S., Thaker,M.N., Kalan,L., Weber,C., Urgast,D.S., Krupp,E.M., Brewer,A.,

Vanner,S., Szawiola,A., et al. (2015) Biosynthesis of the fluorinated natural product

Nucleocidin in Streptomyces calvus is dependent on the bldA -specified Leu-tRNA UUA

molecule. ChemBioChem, 16, 2498–2506.

87. Hackl,S. (2017) Untersuchungen zur Biosynthese des nichtribosomalen Peptides WS9326A

und zum Einfluss von BldA auf das Transkriptom und Proteom von Streptomyces calvus.

88. de Windt,F.E. and van der Drift,C. (1980) Purification and some properties of

hydroxypyruvate isomerase of Bacillus fastidiosus. Biochim. Biophys. Acta, 613, 556–62.

89. Lagesen,K., Hallin,P., Rødland,E.A., Staerfeldt,H.-H., Rognes,T. and Ussery,D.W. (2007)

RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res., 35,

3100–8.

90. Barton,M.D., Delneri,D., Oliver,S.G., Rattray,M. and Bergman,C.M. (2010) Evolutionary

systems biology of amino acid biosynthetic cost in yeast. PLoS One, 5, e11935.

91. Ikeda,H., Ishikawa,J., Hanamoto,A., Shinose,M., Kikuchi,H., Shiba,T., Sakaki,Y. and Hattori,M.

(2003) Complete genome sequence and comparative analysis of the industrial

microorganism. Nat. Biotechnol., 21, 526–31.

92. Burns,B.P., Hazell,S.L., Mendz,G.L., Kolesnikow,T., Tillet,D. and Neilan,B.A. (2000) The

Helicobacter pylori pyrB gene encoding aspartate carbamoyltransferase is essential for

bacterial survival. Arch. Biochem. Biophys., 380, 78–84.

References

144

93. Anantharaman,V. and Aravind,L. (2003) Evolutionary history, structural features and

biochemical diversity of the NlpC/P60 superfamily of enzymes. Genome Biol., 4, R11.

94. Cimermancic,P., Medema,M.H., Claesen,J., Kurita,K., Wieland Brown,L.C., Mavrommatis,K.,

Pati,A., Godfrey,P.A., Koehrsen,M., Clardy,J., et al. (2014) Insights into secondary

metabolism from a global analysis of prokaryotic biosynthetic gene clusters. Cell, 158, 412–

421.

95. Janda,J.M. and Abbott,S.L. (2007) 16S rRNA gene sequencing for bacterial identification in

the diagnostic laboratory: pluses, perils, and pitfalls. J. Clin. Microbiol., 45, 2761–4.

96. Tulp,M. and Bohlin,L. (2005) Rediscovery of known natural compounds: Nuisance or

goldmine? Bioorg. Med. Chem., 13, 5274–5282.

97. Weber,T. and Kim,H.U. (2016) The secondary metabolite bioinformatics portal:

Computational tools to facilitate synthetic biology of secondary metabolite production.

10.1016/j.synbio.2015.12.002.

98. Nakamura,Y., Mochamad Afendi,F., Kawsar Parvin,A., Ono,N., Tanaka,K., Hirai Morita,A.,

Sato,T., Sugiura,T., Altaf-Ul-Amin,M. and Kanaya,S. (2013) KNApSAcK Metabolite activity

database for retrieving the relationships between metabolites and biological activities.

Plant Cell Physiol., 55, e7–e7.

99. Caboche,S., Pupin,M., Leclère,V., Fontaine,A., Jacques,P. and Kucherov,G. (2008) NORINE: a

database of nonribosomal peptides. Nucleic Acids Res., 36, D326-31.

100. Chandra,G. and Chater,K.F. (2008) Evolutionary flux of potentially bldA-dependent

Streptomyces genes containing the rare leucine codon TTA. Antonie Van Leeuwenhoek, 94,

111–26.

101. Higo,A., Horinouchi,S. and Ohnishi,Y. (2011) Strict regulation of morphological

differentiation and secondary metabolism by a positive feedback loop between two global

regulators AdpA and BldA in Streptomyces griseus. Mol. Microbiol., 81, 1607–1622.

References

145

102. Nieselt,K., Battke,F., Herbig,A., Bruheim,P., Wentzel,A., Jakobsen,Ø.M., Sletta,H.,

Alam,M.T., Merlo,M.E., Moore,J., et al. (2010) The dynamic architecture of the metabolic

switch in Streptomyces coelicolor. BMC Genomics, 11, 10.

103. Chater,K.F. (2016) Recent advances in understanding Streptomyces. F1000Research, 5,

2795.

104. Kim,M.K. and Lun,D.S. (2014) Methods for integration of transcriptomic data in genome-

scale metabolic models. Comput. Struct. Biotechnol. J., 11, 59–65.

105. Lewis,N.E., Hixson,K.K., Conrad,T.M., Lerman,J.A., Charusanti,P., Polpitiya,A.D., Adkins,J.N.,

Schramm,G., Purvine,S.O., Lopez-Ferrer,D., et al. (2010) Omic data from evolved E. coli are

consistent with computed optimal growth from genome-scale models. Mol. Syst. Biol., 6,

10.1038/msb.2010.47.

106. van Berlo,R.J.P., de Ridder,D., Daran,J.-M., Daran-Lapujade,P.A.S., Teusink,B. and

Reinders,M.J.T. (2011) Predicting metabolic fluxes using gene expression differences as

constraints. IEEE/ACM Trans. Comput. Biol. Bioinforma., 8, 206–216.

107. Colijn,C., Brandes,A., Zucker,J., Lun,D.S., Weiner,B., Farhat,M.R., Cheng,T.-Y., Moody,D.B.,

Murray,M. and Galagan,J.E. (2009) Interpreting expression data with metabolic flux

models: predicting Mycobacterium tuberculosis mycolic acid production. PLoS Comput.

Biol., 5, e1000489.

108. Higo,A., Hara,H., Horinouchi,S. and Ohnishi,Y. (2012) Genome-wide distribution of adpA, a

global regulator for secondary metabolism and morphological differentiation in

Streptomyces, revealed the extent and complexity of the adpA regulatory network. DNA

Res., 19, 259–274.

109. Castro-Melchor,M., Charaniya,S., Karypis,G., Takano,E. and Hu,W.-S. (2010) Genome-wide

inference of regulatory networks in Streptomyces coelicolor. BMC Genomics, 11, 578.

References

146

110. Azman,A.-S., Othman,I., Velu,S.S., Chan,K.-G. and Lee,L.-H. (2015) Mangrove rare

actinobacteria: taxonomy, natural compound, and discovery of bioactivity. Front.

Microbiol., 6, 856.

111. Weaver,D.S., Keseler,I.M., Mackie,A., Paulsen,I.T. and Karp,P.D. (2014) A genome-scale

metabolic flux model of Escherichia coli K-12 derived from the EcoCyc database. BMC Syst.

Biol., 8, 79.

112. Liao,Y.-C., Huang,T.-W., Chen,F.-C., Charusanti,P., Hong,J.S.J., Chang,H.-Y., Tsai,S.-F.,

Palsson,B.O. and Hsiung,C.A. (2011) An experimentally validated genome-scale metabolic

reconstruction of Klebsiella pneumoniae MGH 78578, iYL1228. J. Bacteriol., 193, 1710–7.

113. Song,H., Kim,T.Y., Choi,B.-K., Choi,S.J., Nielsen,L.K., Chang,H.N. and Lee,S.Y. (2008)

Development of chemically defined medium for Mannheimia succiniciproducens based on

its genome sequence. Appl. Microbiol. Biotechnol., 79, 263–272.

114. Burgard,A.P., Pharkya,P. and Maranas,C.D. (2003) Optknock: A bilevel programming

framework for identifying gene knockout strategies for microbial strain optimization.

Biotechnol. Bioeng., 84, 647–657.

115. Newman,J.D., Marshall,J., Chang,M., Nowroozi,F., Paradise,E., Pitera,D., Newman,K.L. and

Keasling,J.D. (2006) High-level production of amorpha-4,11-diene in a two-phase

partitioning bioreactor of metabolically engineered Escherichia coli. Biotechnol. Bioeng.,

95, 684–91.

116. Sheng,J. and Feng,X. (2015) Metabolic engineering of yeast to produce fatty acid-derived

biofuels: bottlenecks and solutions. Front. Microbiol., 6, 554.

Licenses

147

8 Licenses

The use of the following pictures required a license:

Figure 5: License Number: 4245921450913 (John Wiley and Sons)

Figure 6: License Number: 4245930393513 (Elsevier)

Figure 8: License Number: 4245940231732 (Elsevier)

Figure 9: License Number: 4245940535526 (Oxford University Press)

Acknowledgments

148

9 Acknowledgments

I would like to thank everyone without whom it would not have been possible to complete this

dissertation. I would particularly like to thank the following:

Prof. Dr. Stefan Günther for giving me the chance to do a PhD in his group, the opportunity to

work on these exciting projects, and the freedoms and development possibilities he gave to me.

Prof. Dr. Andreas Bechthold for being my co-supervisor, and him and his group for their support

and their expertise that they shared with me.

Prof. Dr. Oliver Einsle for agreeing to be my third examiner.

Dr. Robin Teufel and his group for their support and for giving me the opportunity to work with

the Griseorhodin A gene cluster.

Prof. Dr. Bernd Kammerer and his group for their support, the chance to work on their HPLC

machines and all the coffees.

Prof. Dr. Haruo Ikeda for giving me the chance to work with his outstanding genome minimized

host strain.

Dr. Kersten Döring, Dr. Stefanie Hackl, Dr. Roman Makitrynskyy, and Songya Zhang for our

rewarding collaborations.

My group for the awesome time we spent together.

Everyone who helped me with proofreading and especially Viola Steiert who endured me while I

was writing.

Finally, my parents who always supported me, even though I sometimes forgot to call them

back.

Publications

149

10 Publications

Kiske C†, Erxleben A†, Lucas X, Willmann L, Klementz D, Günther S, Römer W, Kammerer B; „Metabolic pathway monitoring of phenalinolactone biosynthesis from Streptomyces sp. Tü6071 by liquid chromatography/mass spectrometry coupling“– Rapid Communications in Mass Spectrometry, 2014, Vol. 28(13): 1459-67

Klementz D†, Döring K†, Lucas X, Telukunta KK, Erxleben A, Deubel D, Erber A, Santillana I, Thomas OS, Bechthold A, Günther S. „StreptomeDB 2.0—an extended resource of natural products produced by streptomycetes“- Nucleic Acids Research, 2016, Vol. 44: D509-14

Zierep PF, Padilla N, Yonchev DG, Telukunta KK, Klementz D, Günther S; „SeMPI: a genome-based secondary metabolite prediction and identification web server“- Nucleic Acids Research, 2017, Vol. 45(W1): W64-71

Hackl S†, Klementz D†, Simon D, BiniossekM, TokovenkoB, LuzhetskyyA, Koch HG, Oleschuk R, Zechel DL, Günther S, Bechthold A; “BldA, a trigger of antibiotic production in Streptomyces calvus, investigated through combined transcriptomic, proteomic and metabolic analysis”

(in preperation)

Zhang S, Klementz D, Günther S, Bechthold A; “The complete genome sequence of Streptomyces asterosporus DSM 41452, a high producer of the neurokinin A antagonist WS9326A”

(in preperation)

Zhang S, Klementz D, Wang, M, Zhu J,Dumit VI, Günther S, Bechthold A; “Comparative Proteomic Analysis of Streptomyces asterosporus DSM 41452 reveals the adpA regulon in a native non-sporulating Streptomyces species”

(in preperation)

Klementz D†, Makitrynskyy R†, Bechthold A, Gunther S; “Novel Methods for the enhancement of Moenomycin production in Streptomyces avermitilis SUKA22”

(in preperation)

†Contributed equally to this work

Conference contributions

150

11 Conference contributions

Talks

“Metabolic modeling of Griseorhodin A production in Streptomycetes”- International VAAM-Workshop “Biology of Bacteria producing natural Products” 28-30 September 2016, Freiburg

“Simulating and optimizing metabolic flux in Streptomycetes for production of natural products“- RTG-Symposium "Unique cofactor-dependent enzymes in microbes” 12-13 October 2017, Freiburg

Posters

Klementz D, Döring K, Lucas X, Gunther S; “StreptomeDB 2.0: an update“- Directing Biosynthesis IV, 25-27 March 2015, John Innes Centre, Norwich (UK)

Klementz D, Döring K, Lucas X, Telukunta KK, Thomas OS, Gunther S; “The StreptomeDB 2.0 – Knowledge database of secondary metabolites produced by streptomycetes”- Day of Science 2015, Albert-Ludwigs-University Freiburg (posterprize)

Klementz D, Günther S; “Genome-scale metabolic modeling of rubromycin production in a genome-reduced Streptomyces avermitilis strain” RTG Symposium ”Structural Biology ” 15-16 October 2015, Freiburg

Klementz D, Enderle S, Günther S; “Metabolic modeling of Griseorhodin A production in Streptomycetes”- International VAAM-Workshop 28-30 September 2016, Freiburg

Klementz D, Makitrynskyy R, Enderle S, Buhl N, Bechthold A, Gunther S; “Metabolic modeling of griseorhodin A production in Streptomyces avermitilis SUKA22“- Directing Biosynthesis V, 22-24 March 2017, Warwick (UK)

Zhang S, Klementz D, Wang, M, Zhu J,Dumit VI, Günther S, Bechthold A; “Complete genome sequencing and comparative Proteomic Analysis of Streptomyces asterosporus DSM 41452 reveals the adpA regulon in a native non-sporulating Streptomyces species” International VAAM-Workshop 27-29 September 2017, Tübingen

computational analysis and manipulation of the metabolic

Documents