creativecommons/licenses/by-sa/2.0

77
http:// creativecommons.org/ licenses/by-sa/2.0/

Upload: keiran

Post on 08-Jan-2016

24 views

Category:

Documents


0 download

DESCRIPTION

http://creativecommons.org/licenses/by-sa/2.0/. The whys and hows of mathematical models for biological networks, with a view to pitfalls and limitations. Prof:Rui Alves [email protected] 973702406 Dept Ciencies Mediques Basiques, 1st Floor, Room 1.08. Organization of the talk. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: creativecommons/licenses/by-sa/2.0

http://creativecommons.org/licenses/by-sa/2.0/

Page 2: creativecommons/licenses/by-sa/2.0

Prof:Rui [email protected]

973702406Dept Ciencies Mediques Basiques,

1st Floor, Room 1.08

Page 3: creativecommons/licenses/by-sa/2.0

From networks to physiological behavior

Graphical network representations

Types of problems

Typical bottlenecks and assumptions in model building

Page 4: creativecommons/licenses/by-sa/2.0

What happens?

Probably a very sick mutant?

Page 5: creativecommons/licenses/by-sa/2.0

Maybe resolving ambiguity in representation is enough to predict

behavior?

X0 X1 X2 X3

X0

X1

X2

X3

t0 t1 t2 t3

Page 6: creativecommons/licenses/by-sa/2.0

X0 X1 X2 X3

Page 7: creativecommons/licenses/by-sa/2.0

Build mathematical models!!!!

Page 8: creativecommons/licenses/by-sa/2.0

From networks to physiological behavior

Graphical network representations

Types of problems

Typical bottlenecks and assumptions in model building

Page 9: creativecommons/licenses/by-sa/2.0

A B

What does this mean?

Possibilities:

AB

Function

BA

Function

AB

Function

A B

Function

BA

Function

Page 10: creativecommons/licenses/by-sa/2.0

A B

C

Full arrow represents a flux between A and B

Dashed arrow represents modulation of a flux

+

Dashed arrow with a plus sign represents positive modulation of a flux

-

Dashed arrow with a minus sign represents negative modulation of a flux

A and B – Dependent Variables

(Change over time)

C – Independent variable

(constant value)

Page 11: creativecommons/licenses/by-sa/2.0

A B

C

Stoichiometric information needs to be included

Dashed arrow represents modulation of a flux

+

Dashed arrow with a plus sign represents positive modulation of a flux

Dashed arrow with a minus sign represents negative modulation of a flux

23 D+

Reversible Reaction

Page 12: creativecommons/licenses/by-sa/2.0

B

C

Stoichiometric information needs to be included

Dashed arrow represents modulation of a flux

+

Dashed arrow with a plus sign represents positive modulation of a flux

Dashed arrow with a minus sign represents negative modulation of a flux

2 A3 D

Page 13: creativecommons/licenses/by-sa/2.0

C

Having too many names or names that are closely related my complicate interpretation and set up

of the model.

Therefore, using a structured nomenclature is important for book keeping

Let us call Xi to variable i

AB

DX3

X1X2

X4

Page 14: creativecommons/licenses/by-sa/2.0

X2

X3

+2 X13 X4

C

AB

DX3

X1X2

X4

Page 15: creativecommons/licenses/by-sa/2.0

X2X0

Production Reaction

Sink Reaction

Page 16: creativecommons/licenses/by-sa/2.0

X2X0

Organel

Cell

Compartmental models are important, both because compartments exist in the cell and because even in the absence of compartments reaction media are

not always homogeneous

Page 17: creativecommons/licenses/by-sa/2.0

Whatever representation is used be sure to be consistent and to know exactly what the different elements of a representation mean.

Page 18: creativecommons/licenses/by-sa/2.0

/dA dt

/ ,dA

dA dt A f A Cdt

/

dAdA dt

dt/dA

dA dt Adt

A B

C

+

Page 19: creativecommons/licenses/by-sa/2.0

A B

C

+

A or C

Flux1 2k A k CLinear 1 2

1 2 3 4

k A k C

K K A K C K AC

Saturating

4 41 2

44 41 2 3 4

k A k C

K K A K C K AC

Sigmoid

Page 20: creativecommons/licenses/by-sa/2.0

From networks to physiological behavior

Network representations

Types of problems

Typical bottlenecks and assumptions in model building

Page 21: creativecommons/licenses/by-sa/2.0

That depends on the question!!!!

It also depends upon the system for which you ask the question!!!!

Page 22: creativecommons/licenses/by-sa/2.0

The big one: How does a cell work???

What answers are being given?

Genome sequenced and annotated

Map onto cellular circuits chart

Create stoichiometric

model.

dXS v

dt0 .S v

stoichiometric matrix

rate vector

Usually solved for steady state

Page 23: creativecommons/licenses/by-sa/2.0
Page 24: creativecommons/licenses/by-sa/2.0

1. Assume that cells are growing at steady state with some optimal conversion of input material (flux b1) into biomass (A,B,C)

2. Assume linear kinetics for each rate equations

3. Use (linear) optimization methods to find a solution for the distribution of fluxes that allows the cell to fulfill 1.

Page 25: creativecommons/licenses/by-sa/2.0

Accurately predicting a decent fraction of knock out mutants that are lethal in S.cerevisiae and H. pylori.

Proc Natl Acad Sci U S A. 100: 13134-13139; J Bacteriol. 184: 4582-4593.

Fail to predict all mutants Does not account for transient behavior Does not account for dynamic regulationWhole cell modeling is far from being able

to answer the big question; not enough info is available to build the models.

(?)

Page 26: creativecommons/licenses/by-sa/2.0

Well, let us be modest: How does a simple cell work???

What is a simple cell? A cell that is much simpler than what we normally think

of as a cell Red Blood cell; lambda phage Mathematical models using dynamic equations have been created

to study these types of cells. (e.g. Ni & Savageau or Arkin ) A regular cell that we represent in a simplified maner

E-cell project represents the E. coli cell using linear kinetics.

. . ( , )dX

S v S f p Xdt

Page 27: creativecommons/licenses/by-sa/2.0

Savageau & Ni, 1992 JBC, JTB

Page 28: creativecommons/licenses/by-sa/2.0
Page 29: creativecommons/licenses/by-sa/2.0

Model was used to assess how complete our understading of red blood cell metabolism is.

How was this done? Using the notion that model robustness can be

used to identify ill defined parts of the model Using the notion that biological systems

should have stable steady states

Page 30: creativecommons/licenses/by-sa/2.0

Robustness is the notion that the dynamic behavior of a system is fairly insensitive to spurious fluctuations in parameter values

Parameter (T, kinetic parameters)

Ste

ad

y st

ate

va

lue

Page 31: creativecommons/licenses/by-sa/2.0

Because if biological systems were not robust, we would not be alive, given that fluctuations happen all the time.

Parameter (T, kinetic parameters)

Ste

ad

y st

ate

va

lue

Page 32: creativecommons/licenses/by-sa/2.0

Stability of a steady state is the notion that after spurious fluctuations in parameter values, the system will return to the original steady state it was in

t

X

Page 33: creativecommons/licenses/by-sa/2.0

Again, because if biological systems were not at stable steady states, we would not be alive, given that fluctuations happen all the time.

Page 34: creativecommons/licenses/by-sa/2.0

Found that the steady state was unstable Identified regulatory interactions that

stabilized the steady state Latter confirmed experimentally

Identified parts of the model that have high sensitivy Incomplete understanding of the system

Page 35: creativecommons/licenses/by-sa/2.0

Well, yes there are. There is a fair amount of modularity in

cells Organeles, Pathways, Circuits, etc.

Therefore, if one is interested in specific parts of cellular function and response, one can isolate the modules responsible for that function or response How does the specific part of a cell

responsible for a given function works???

Page 36: creativecommons/licenses/by-sa/2.0

• How does the specific part of a cell responsible for a given function works???

How does it work qualitatively Network reconstruction (RA)

P1 P2

P…

Pn

M1

M2M…

Mn

Page 37: creativecommons/licenses/by-sa/2.0

• How does the specific part of a cell responsible for a given function works???

How does it work qualitatively Network reconstructionP1

P2

P… Pn

M1M2

M…

Mn

Page 38: creativecommons/licenses/by-sa/2.0

FeSC biogenesis is a pathway that is conserved over evolution

Proteins involved in the pathway are identified

How these proteins act together to form a pathway is unknown; the reaction topology and the regulatory topology is unknown

How do these proteins work together?

Page 39: creativecommons/licenses/by-sa/2.0

Create all possible topologies

Scan all possible behaviors using simulation

Compare qualitative dynamic behavior of the different topologies to experimental results

Eliminate topological alternatives that do not reproduce experimental results

Page 40: creativecommons/licenses/by-sa/2.0

•Alves et. al. 2004 Proteins 57:481

•Vilella et. al. 2004 Comp. Func. Genomics 5:328

•Alves et. al. 2004 Proteins 56:354

•Alves & Sorribas 2007 BMC Systems Biology 1:10

Prediction Verified?

Grx5 modulates Nfs1 and Scaffold activity/Interactions

Detected interaction with scaffolds

Arh1-Yah1 act on S or ST Yes [PNAS 97:1050; JBC 276:1503]

Arh1-Yah1 interaction same as in mammals

No reported experiment

Yfh1 acts on S, T, or ST Yes [Science 305:242; EMBO Rep 4:906; JBC 281:12227; FEBS Lett

557:215]

Yfh1 storage of Fe not important for its role in biogenesis

Yes [EMBO Rep 5:1096]

Nfs1 acts in S, not necessarily in R No reported experiment

Chaperones act on Folding, Stability

Yes for Folding [JBC 281:7801]

•Alves et al. 2008 Curr. Bioinformatics in press

Page 41: creativecommons/licenses/by-sa/2.0

• How does the specific part of a cell responsible for a given function works???

How does it work quantitatively Parameter estimation when network is

known (PM)P1

P2

P… Pn

M1M2

M…

Mn

Page 42: creativecommons/licenses/by-sa/2.0

• If you know the topology and/or mechanism, then one can ask how does a system act under specific circumstances

• To answer such a question we often need numerical values for the parameters of the system so that simulations can be ran

• Numerical values for parameters can be estimated from experimental data

Page 43: creativecommons/licenses/by-sa/2.0

• Based on gene expression data, what are the parameter values that create a best fit of the model to the observed experimental results?

Page 44: creativecommons/licenses/by-sa/2.0

Collect experimental data

Create a mathematical model

Use optimization/fitting methods to estimate the parameters of the model in such a way that a minimum discrepancy exists betrween model predictions and observed data.

Page 45: creativecommons/licenses/by-sa/2.0
Page 46: creativecommons/licenses/by-sa/2.0

Hell, No!!!!

Modularity begs the question: Are there design principles that

explain why cell use specific modules for specific functions? [AS; RA;AS]

Page 47: creativecommons/licenses/by-sa/2.0

X0 X1

_

+

X2 X3

X4

X0 X1

_

+

X2 X3

X4

__

Overall feedback

Cascade feedback

Page 48: creativecommons/licenses/by-sa/2.0

Create mathematical models for the alternative networks

Compare the behavior of the models with respect to relevant functional criteria

Decide according to those criteria which model performs best

Page 49: creativecommons/licenses/by-sa/2.0

TimeSpurious stimulation

[C]Overall

Cascade

Proper stimulus

Overall

Cascade

[C]

StimulusOverall

Cascade

Alves & Savageau, 2000, Biophys. J.

Page 50: creativecommons/licenses/by-sa/2.0

From networks to physiological behavior

Graphical network representations

Types of problems

Typical bottlenecks and assumptions in model building

Page 51: creativecommons/licenses/by-sa/2.0

• Is your system the whole cell? If so, how detailed do you want to make your

model to be?

Page 52: creativecommons/licenses/by-sa/2.0

• Is your system the whole cell? If so, how detailed do you want to make your

model to be?• If your system is not the whole cell, is it a

pathway or circuit? How do you define pathway? What will you include in your model?

Include cofactors, elementary steps?

Include all reactions? Not all are present in a

given organism

Page 53: creativecommons/licenses/by-sa/2.0

• No magic bullet exists to define your system. Read the literature, learn about your system,

guesstimate the important inputs and bound your system as a module: Simplify as much as you can but not more than that

Page 54: creativecommons/licenses/by-sa/2.0

• Should we include all details known about the system?

• What can we simplify?

• Again, no general answer for this. Read the literature, learn about your system,

guesstimate the important inputs and bound your system as a module

Page 55: creativecommons/licenses/by-sa/2.0

What is the form of f(p,X)?

. . ( , )dX

S v S f p Xdt

Page 56: creativecommons/licenses/by-sa/2.0

Individual steps of all processes are mass action

The kinetics of a process may be complicated

( , ) 1 jn

m i i ji j

f p X k X

X1 X2

P1

X1

P1

X1P1 X2P1

X2

P1

Page 57: creativecommons/licenses/by-sa/2.0

We may end up with a model that is larger than it has to be:

2 Variables 4 Variables

Use rational enzyme kinetics to reduce model dimension: e.g. HMM kinetics

( , )mV X

f p XK X

X1 X2

P1

X1

P1

X1P1 X2P1

X2

P1

Page 58: creativecommons/licenses/by-sa/2.0

Allow for dimensional reduction of models while often still being accurate

Page 59: creativecommons/licenses/by-sa/2.0

Form is mechanism dependent Michaelis-Menten, Hill, Theorel-chance, etc.

Assumes that E<<<S and/or very different time scales for the individual processes E.g. in signal transduction, In some cases time scale simplification is incorrect

E S

Page 60: creativecommons/licenses/by-sa/2.0

Usually we do not know the individual mechanistic steps of processes

Therefore, using rational enzyme kinetics is not justified

However, one can use approximate formalisms Power Law, Saturating cooperative formalism,

etc…

Page 61: creativecommons/licenses/by-sa/2.0

Form is always the same (if Taylor based) Automated equation building from graphical

representation

Parameters are fairly easy to estimate

Page 62: creativecommons/licenses/by-sa/2.0

One needs to choose the appropriate formalism for the specific situation E.g. if a process saturates, one may use a piece-

wise power law or a SC formalism equation

X1 X2

P1

X1

v

Power Law

Piece Wise Power Law

Saturating Cooperative formalism

Lineal

Piece wise

Page 63: creativecommons/licenses/by-sa/2.0

If Taylor based, they are absolutely accurate only at the operating point of the approximation However, they may have a range of sufficient

accuracy of several orders of magnitude about the operating point

Page 64: creativecommons/licenses/by-sa/2.0

How do we analyze this? If closed form solutions are available, analysis may be

made independent of parameter values

Closed form solutions are almost never available!!!! Lineal approximations allways have close form solutions Power law, other transformations may also have closed

form steady state solutions

. . ( , )dX

S v S f p Xdt

Page 65: creativecommons/licenses/by-sa/2.0

If parameter values are available, then solutions can be numerically calculated (PM)

Numerical solutions allows us to predict the behavior of a specific system

. . ( , )dX

S v S f p Xdt

Page 66: creativecommons/licenses/by-sa/2.0

10 13 111 1 0 3 1 1/ g g hdX dt X X X

11 222 1 1 2 2/ h hdX dt X X

22 33 343 2 2 3 3 4/ h h hdX dt X X X

X0 X1

_

+

X2 X3

X4

Constant

Protein using X3

Page 67: creativecommons/licenses/by-sa/2.0

• Steady state response

• Long term or homeostatic systemic behavior of the network

Page 68: creativecommons/licenses/by-sa/2.0

10 13 111 1 0 3 1 1/ 0g g hdX dt X X X

11 222 1 1 2 2/ 0h hdX dt X X

22 33 343 2 2 3 3 4/ 0h h hdX dt X X X

Page 69: creativecommons/licenses/by-sa/2.0

• Sensitivity of the system to perturbations in parameters or conditions in the medium

• Stability of the homeostatic behavior of the system

• For both, you only need to know how to do derivatives!!!!

Page 70: creativecommons/licenses/by-sa/2.0

• Transient response

•Transient of adaptive systemic behavior of the network

Page 71: creativecommons/licenses/by-sa/2.0

10 13 111 1 0 3 1 1/ g g hdX dt X X X

11 222 1 1 2 2/ h hdX dt X X

22 33 343 2 2 3 3 4/ h h hdX dt X X X

Solve numerically

Page 72: creativecommons/licenses/by-sa/2.0

In and of itself a model is a model. It needs to be contrasted to reality

If when contrasted to reality, model predictions are verified, the model is validated; otherwise it is back to the drawing board

Models are never valid under all conditions

Page 73: creativecommons/licenses/by-sa/2.0

• All molecular species are present in discrete ammounts within a cell

• If one assumes that sufficiently large ammounts are present, it is OK to treat species as concentrations/densities, thus simplifying calculations => Deterministic ODE models

Page 74: creativecommons/licenses/by-sa/2.0

• No answer is always right for this question

• However, if small number of particles is involved in the process, assumption breaks down

• How to solve this problem? ‾ Either use statistical master equation or

stochastic differential equations

Page 75: creativecommons/licenses/by-sa/2.0

• If one assumes that all cellular compartments are well mixed in a time scale faster than the processes of interest, it is OK to use ODE models, either deterministic or stochastic

Page 76: creativecommons/licenses/by-sa/2.0

• There are all sorts of compartments and gradients within a cell

• Often, the gradients are important for the response one is studying

• How to solve this problem? ‾ Either use compartmental models (still ordinary

differential equations) or create models using partial differential equations.

‾ Effectivelly, PDEs are solved using compartments.

Page 77: creativecommons/licenses/by-sa/2.0

From networks to physiological behavior

Network representations

Types of problems

Typical bottlenecks and assumptions in model building