vl netzwerke, ws 2007/08 edda klipp 1 max planck institute molecular genetics humboldt university...

28
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism and Signaling Edda Klipp Humboldt University Berlin Lecture 3 / WS 2007/08 Random Networks: Scale-free Networks

Upload: arron-julian-jackson

Post on 16-Dec-2015

214 views

Category:

Documents


2 download

TRANSCRIPT

VL Netzwerke, WS 2007/08 Edda Klipp 1

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Networks in Metabolism and Signaling

Edda Klipp Humboldt University Berlin

Lecture 3 / WS 2007/08Random Networks: Scale-free

Networks

VL Netzwerke, WS 2007/08 Edda Klipp 2

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Random Networks: Scale-free Networks

Most “natural” networks do nothave a typical degree value,they are free of a characteristic“scale”: they are scale-free.

Their degree distribution followsa Power law: P(k) ~ k-

Heterogeneity: hubs with highconnectivity, many nodes with low connectivity.

VL Netzwerke, WS 2007/08 Edda Klipp 3

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Random Networks: Scale-free Networks

The probability that a node is highly connected is statistically more significant than in a random graph, the network’s properties often being determined by a relatively small number of highly connected nodes that are known as hubs.

Barabási–Albert model of a scale-free network: Growth: Start with small number of nodes (m0)At each time point add a new node with m ≤ m0 links to the network

Preferential attachment:Probability for a new edge is

where kI is the degree of node I and J is the index denoting the sum over network nodes.

The network that is generated by this growth process has a power-law degree distribution that is characterized by the degree exponent = 3.

jj

ii k

kk

VL Netzwerke, WS 2007/08 Edda Klipp 4

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Evolution of Scale-free Networks

N

jj

ii

i

k

kmkm

t

k

1

mtkN

jj 2

1

After t time steps network with N = t + m0 nodes and m t edges.

Continuum theory: ki as continuous real variable, change proportional to (ki)

21

2

ii

ii

t

tmtk

t

k

t

k

32

03

2

02

2

2

2

0

2

2

2

2

1

1

kmkPt

tmk

tm

k

ktkPkP

tmk

tm

k

tmtP

tmtP

k

tmtPktkP

i

i

i

ii

~

jj

ii k

kk

VL Netzwerke, WS 2007/08 Edda Klipp 5

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Evolution of Scale-free NetworksAfter t time steps network with N = t + m0 nodes and m t edges.

Continuum theory: ki as continuous real variable, change proportional to (ki)

322 kmkPt ~

jj

ii k

kk

Degree distribution independent of time tAnd independent of network size N

But proportional to m2

VL Netzwerke, WS 2007/08 Edda Klipp 6

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Evolution of Scale-free NetworksAfter t time steps network with N = t + m0 nodes and m t edges.

Master equation approach: probability p(k,ti,t) that a node i introduced at time ti has degreek at time t.

21

12

for2

2

1for12

1

1

211

2

11

kkk

mmkP

mkm

mkkPk

k

kP

ttkpt

kP

ttkpt

kttkp

t

kttkp

iti

t

iii

,,lim

,,,,,,Main idea:

Dorogovtsev&Mendes-2001

VL Netzwerke, WS 2007/08 Edda Klipp 7

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Scale-free Networks: Degree Distribution

VL Netzwerke, WS 2007/08 Edda Klipp 8

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Scale-free Networks: Clustering coefficient

No inherent clustering coefficent C(k)

VL Netzwerke, WS 2007/08 Edda Klipp 9

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Scale-free Networks: Average Path Length

lk

1 l

lkN ~

Shortest path length l: distance between two vertices u and v with unit length edges

Fully connected network:

Rough estimation for random network:

Average number of nearest neighbors: <k>

vertices are at distance l or closer

total number of vertices

k

Nl

ln

ln~

VL Netzwerke, WS 2007/08 Edda Klipp 10

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Scale-free Networks: Average Path Length

N

Nl

CBNAl

lnln

ln~

ln

Is obtained by fitting

1

12

1 zz

zNlln

ln

VL Netzwerke, WS 2007/08 Edda Klipp 11

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Scale-free Networks: Error and Attack Tolerance

Question: consider arbitrary connected graph of N nodes and assume that a p fraction of edges have been removed. What is the probability of the resulting graph being still connected?Usually: existence of a threshold probability pc.

More severe: removal of nodes

VL Netzwerke, WS 2007/08 Edda Klipp 12

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Scale-free Networks: Error and Attack Tolerance

Node RemovalThe relative size S (a),(b) and average path length l (c),(d) of the largest cluster in an initially connected network when a fraction f of the nodes are removed. (a),(c) Erdös-Renyi random network with N=10 000 and <k>=4; (b),(d) scale-free network generated by the Barabasi-Albert model with N=10 000 and <k>=4. , random node removal; º, preferential removal of the most connected nodes.

VL Netzwerke, WS 2007/08 Edda Klipp 13

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Random Networks: Scale-free Networks

Difference to classical random networks:

Growth of the network – given number of nodes

Preferential attachment – equal probability for all edges

Examples:

References in www: connections frequently to existing hubs

Metabolism: many molecules are involved in only a few (1,2) reactions, others (like ATP or water) in many

Wagner/Fell: highly connected molecules are evolutionary “old”

VL Netzwerke, WS 2007/08 Edda Klipp 14

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Properties of Scale-Free Networks

Small-world – short paths between arbitrary points

Robustness – Topological robustness

VL Netzwerke, WS 2007/08 Edda Klipp 15

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Watts & Strogatz ModelProblem: real networks have short average path length and greater clustering coefficients than classical random graphs

Construction: Initially, a regular one dimensional lattice with periodical boundary conditions is present. Each of L vertices has z ≥ 4 nearest neighbors. Then one takes all the edges of the lattice in turn and with probability p rewires to randomly chosen vertices. In such a way, a number of far connections appears. Obviously, when p is small, the situation has to be close to the original regular lattice. For large enough p, the network is similar to the classical random graph.

VL Netzwerke, WS 2007/08 Edda Klipp 16

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Watts-Strogatz-Model

By definition of Watts and Strogatz, the smallworld networks are those with “small” average shortest path lengths and “large” clustering coecients.

<k> remains constant During rewiring

VL Netzwerke, WS 2007/08 Edda Klipp 17

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Random Networks: Hierarchical Networks

To account for the coexistence of modularity, local clustering and scale-free topology in many real systems it has to be assumed that clusters combine in an iterative manner, generating a hierarchical network. The starting point of this construction is a small cluster of four densely linked nodes.Next, three replicas of this module are generated and the three externalnodes of the replicated clustersconnected to the central node ofthe old cluster, which produces alarge 16-node module. Threereplicas of this 16-node moduleare then generated and the 16peripheral nodes connected tothe central node of the oldmodule, which produces a newmodule of 64 nodes….

VL Netzwerke, WS 2007/08 Edda Klipp 18

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Random Networks: Hierarchical Networks

Clustering coefficent scales with the degree of the nodes

VL Netzwerke, WS 2007/08 Edda Klipp 19

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

VL Netzwerke, WS 2007/08 Edda Klipp 20

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

VL Netzwerke, WS 2007/08 Edda Klipp 21

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Application to Metabolic Networks

Jeong H et al, 2000, Nature

VL Netzwerke, WS 2007/08 Edda Klipp 22

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Metabolic Networks: Degree Distributions

VL Netzwerke, WS 2007/08 Edda Klipp 23

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Metabolic Networks: Hubs

VL Netzwerke, WS 2007/08 Edda Klipp 24

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Metabolic Networks: Pathway Lengths

E.coli 43 different organisms

VL Netzwerke, WS 2007/08 Edda Klipp 25

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Metabolic Networks: Robustness

Hubs removed first

Random removal

M=60 – 8% of substrates

VL Netzwerke, WS 2007/08 Edda Klipp 26

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Protein-Protein-Interaction NetworksYeast proteomea) Map of protein-protein

interactions Largest cluster: 78% of all

proteinsRed – lethalGreen – non-lethalOrange – slow-growthYellow – unknown

Cut-off: kc=20

b) Connectivity distribution

c) Fraction of essential proteins with

k linksRandom removal – no effectHubs removal – lethal

VL Netzwerke, WS 2007/08 Edda Klipp 27

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Protein-Protein-Interaction Networks

Random removal – no effectHubs removal – lethal

93% of proteins have 5 or less links only 21% of them are essential

0.7% of proteins have more than 15 links 62% of them are lethal

VL Netzwerke, WS 2007/08 Edda Klipp 28

Max Planck Institute Molecular Genetics

Humboldt University BerlinTheoretical Biophysics

Protein-Protein-Interaction Networks