complexbiologicalnetworks 2 online

Upload: ismael-sanchez

Post on 03-Jun-2018

221 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    1/40

    Elhanan Borenstein

    Spring 2010

    Complex (Biological) Networks

    Some slides are based on slides from courses given by Roded Sharan and Tomer Shlomi

    Analyzing Metabolic Networks

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    2/40

    Metabolism

    Metabolism is the process involved in the

    maintenance of life. It is comprised of a vast

    repertoire of enzymatic reactions and transport

    processes used to convert thousands of organic

    compounds into the various molecules

    necessary to support cellular life

    Schilling et al. 2000

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    3/40

    Why study metabolism? (II)

    Its the essence of life(and maybe its origins)

    Tremendous importance in Medicine

    Inborn errors of metabolism cause acute symptoms Metabolic diseases (obesity, diabetes) are on the rise

    (and are major sources of morbidity and mortality)

    Metabolic enzymes becoming viable drug targets

    Bioengineering applications

    Design strains for production of biological products

    Generation of bio-fuels

    The best understood of all cellular networks

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    4/40

    Metabolites & Biochemical Reactions

    Metabolite: an organic substance Sugars (e.g., glucose, galactose, lactose)

    Carbohydrates (e.g., glycogen, glucan)

    Amino-acids (e.g., histidine, proline, methionine)

    Nucleotides (e.g., cytosine, guanine) Lipids

    Chemical energy carriers (e.g., ATP, NADH)

    Atoms (e.g., oxygen, hydrogen)

    Biochemical reaction: the process in whichone or more substrate molecules are

    converted (usually with the help of an

    enzyme) to produce molecules

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    5/40

    Pathways

    Ouzonis, Karp, Genome Res. 10, 568 (2000)

    EcoCyc describes 131 pathways

    Pathways vary in length from asingle step to 16 steps (ave 5.4)

    But ... no precise biological definition and

    partitioning of the metabolic networkinto pathways is somehow arbitrary

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    6/40

    From Pathways to a Network

    http://www.genome.jp/kegg/pathway/map/map01100.html

    http://www.genome.jp/kegg/pathway/map/map01100.htmlhttp://www.genome.jp/kegg/pathway/map/map01100.html
  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    7/40

    Models

    ofMetabolism

    (and Metabolic Networks)

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    8/40

    Metabolic Network Models

    required data/accuracy /complexityabstraction/scale

    Topological analysis Degree distribution

    Motifs

    Modularity Reverse ecology

    Conventional models Boolean models

    Discrete models

    Bayesian models Kinetic models Dynamic system

    (differential eqs)

    Requires unknowndata constants and

    concentrations

    Constraint-based CB-Models

    Flux Balance Analysis

    Extreme Pathways

    Growth/KO effects

    ApproximateKinetic models

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    9/40

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    10/40

    Reconstructing Metabolic Networks

    Fructose + Glucose => Sucrose

    A

    B

    C D

    EIncomplete data

    Noise

    Large-scale

    Simple directed

    graphs

    Simple Representation

    Nodes=compounds

    Edges=reactions

    Topology based

    Static

    SucroseGlucose

    Fructose

    Describing the chemical reactions in the cell and thecompounds being consumed and produced

    atgaaaaccgtcgttt

    ttgcctaccacgatat

    gggatgcctcggtatg

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    11/40

    Metabolic Network (E.Coli)

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    12/40

    Environment

    & EcologySystem Topology

    & Structure

    Environments from Networks

    Can the structure/topologyof metabolic networks be used toobtain insights into the ecology in which species evolved/prevail?

    inference

    (Borenstein, et al.PNAS, 2008)

    Reverse Ecologyof

    Metabolic Environments

    d

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    13/40

    Seed Sets

    &Metabolic Environments

    3

    4

    8

    7

    9

    6

    2

    1

    3

    0

    5

    Env

    ironment

    set of exogenously

    acquired compounds

    (seed set)

    proxy for the environment(operational definition)

    Seed set:a minimal subset of the compounds that cannot be synthesized from

    other compounds and whose existence permits the synthesis of all other

    compounds in the network.(Borenstein, et al.PNAS, 2008)

    d if i d d

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    14/40

    2

    3

    4

    8

    7

    9

    6

    12

    11

    14

    15

    1

    1310

    5

    Identifying Seed Compounds:

    A Simple Synthetic Example

    Id if i S d C d

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    15/40

    2

    3

    4

    8

    7

    9

    6

    12

    11

    14

    15

    1

    1310

    5

    Identifying Seed Compounds:

    Strongly Connected Components (SCC)

    K j l i h

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    16/40

    2

    3

    4

    8

    79

    6

    12

    11

    14

    15

    1

    13105

    Kosarajusalgorithm

    for SCC Decomposition

    Given a graph G:

    1. Run a Depth-First Search (DFS) on G to compute finishing

    times f[v] for each node v

    2. Calculate the transposed network G (the network G with

    the direction of every edge reversed)

    3. Run DFS on G, traversing the nodes in decreasingorder

    of f[v]

    Each tree in the DFS forest created

    by the second DFS run forms a

    separate SCC

    Id if i S d C d

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    17/40

    2

    3

    4

    8

    7

    9

    6

    12

    11

    14

    15

    1

    1310

    5

    Identifying Seed Compounds:

    Strongly Connected Components (SCC)

    SCCs are equivalent sets(seed-wise)

    Id tif i S d C d

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    18/40

    2

    3

    4

    8

    7

    9

    6

    12

    11

    14

    15

    1

    1310

    5

    Identifying Seed Compounds:

    Strongly Connected Components (SCC)

    Directed Acyclic Graph (DAG)

    Id tif i S d C d

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    19/40

    2

    3

    4

    8

    7

    9

    6

    12

    11

    14

    15

    1

    1310

    5

    Identifying Seed Compounds:

    Source Components

    Candidate seeds are members ofsource components

    Id tif i S d C d

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    20/40

    2

    3

    4

    8

    7

    9

    6

    12

    11

    14

    15

    1

    1310

    5

    Identifying Seed Compounds:

    Candidate Seeds

    Id tif i S d C d

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    21/40

    2

    3

    4

    8

    7

    9

    6

    12

    11

    14

    15

    1

    1310

    5

    Identifying Seed Compounds:

    Seed Confidence Level

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    22/40

    Metabolic Network with Seeds

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    23/40

    Multi-Species Large-ScaleSeed Dataset

    Oxygen

    L-Glutamate

    Sulfate

    Leucine

    Sucrose

    Glycerol

    Methanol

    Thymidine

    B. aphidicola - - -

    S. pneumoniae - - -

    R. typhi - -

    S. aureus - -

    M. genitalium

    2264 compounds

    478s

    pecies

    accuracy 79%

    precision 95%

    recall 67%

    478species (networks); >2200compounds

    Seed compounds for each species

    Large-scale dataset

    of predicted metabolic

    environments

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    24/40

    Applications of Reverse Ecology

    Reconstructing ecology-based phylogeny

    Predicting ancestral environments

    Identifying evolutionary dynamics of networks

    Predicting species interaction

    Analyzing genetic vs. environmental robustness Quantifying ecological strategies

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    25/40

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    26/40

    Constraint-Based Modeling

    Living systems obey physical and chemical laws

    These can be used to constrain the space of

    possible behaviors of the network

    How often have I said to you that when

    you have eliminated the impossible,

    whatever remains, however improbable,

    must be the truth?Sherlock Holmes (A Study of Scarlet)

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    27/40

    Evolution Under Constraints

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    28/40

    1 Glucose + 1 ATP 1 Glucose-6-Phosphate + 1 ADP

    Reaction Stoichiometry

    Stoichiometry - the quantitative relationships

    of the reactants and products in reactions

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    29/40

    Stoichiometric Matrix

    S

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    30/40

    Stoichiometric Matrix and Fluxes

    vSdt

    md

    m: metabolite concentrations vector (mol/mg)

    S: stoichiometric matrix

    v: reaction rates vector

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    31/40

    Kinetic parameters

    ),( kmfSvSdt

    md

    Reaction rate equation

    Requires knowledge of m, f and k!

    A set of Ordinary Differential Equations (ODE)

    A Full Model? Not Really

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    32/40

    Constraint-Based Modeling

    Assumes a quasi steady-state!

    No changes in metabolite concentrations

    Metabolite production and consumption rates are equal

    No need for info on metabolite concentrations,

    reaction rate functions, or kinetic parameters

    0 vSdt

    md

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    33/40

    Constraint-Based Modeling

    In most cases, Sis underdetermined:a subspace of Rn(possible flux distributions)

    0

    0

    0

    Sv=0

    Thermodynamic constraints:

    a convex cone vi> 0

    Capacity constraints:

    a bounded convex cone vi< vmax

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    34/40

    Flux Balance Analysis

    But this still leaves a space of solutions

    How can we identify plausible solutions withinthis space?

    Optimize for maximum growth rate !!

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    35/40

    Flux Balance Analysis

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    36/40

    Flux Balance Analysis

    How do we

    solve this?

    Linear Programming

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    37/40

    Linear Programming (LP)

    Assume the following constraints: 0

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    38/40

    Application of CBM & FBA

    Predict metabolic fluxes on various media

    Predict growth rate

    Predict gene knockout lethality

    Characterize solution space

    Many more

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    39/40

    Available CBM Metabolic Models

    Bernhard Palsson

    UCSD

  • 8/12/2019 ComplexBiologicalNetworks 2 Online

    40/40