title title

33
Robustness, clustering & evolutionary conservation Stefan Wuchty Stefan Wuchty Center of Network Research Center of Network Research Department of Physics Department of Physics University of Notre Dame University of Notre Dame

Upload: amara

Post on 13-Jan-2016

24 views

Category:

Documents


0 download

DESCRIPTION

title title. Robustness, clustering & evolutionary conservation. Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame. New York Times. Complex systems. Made of many non-identical elements connected by diverse interactions. NETWORK. Bio-Map. GENOME. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: title title

Robustness, clustering &

evolutionary conservation

Robustness, clustering &

evolutionary conservation

Stefan WuchtyStefan WuchtyCenter of Network ResearchCenter of Network Research

Department of PhysicsDepartment of PhysicsUniversity of Notre DameUniversity of Notre Dame

Page 2: title title

Complex systems

Made of many non-identical elements connected by diverse interactions.

NETWORK

Page 3: title title

protein-gene

interactions

protein-protein

interactions

PROTEOME

GENOME

Citrate Cycle

METABOLISM

Bio-chemical reactions

Page 4: title title

protein-protein

interactions

PROTEOME

Page 5: title title

Yeast protein networkNodes: proteins

Links: physical interactions (binding)

P. Uetz, et al. Nature, 2000; Ito et al., PNAS, 2001; …

Page 6: title title

Topology of the protein network

)exp()(~)( 00

k

kkkkkP

H. Jeong, S.P. Mason, A.-L. Barabasi & Z.N. Oltvai, Nature, 2001

Page 7: title title

RobustnessComplex systems maintain their basic functions even under errors and failures

(cell mutations; Internet router breakdowns)

node failure

fc

0 1Fraction of removed nodes, f

1

S

Page 8: title title

Robustness of scale-free networks

1

S

0 1ffc

Attacks

3 : fc=1

(R. Cohen et. al., PRL, 2000)

Failures

Topological error tolerance

R. Albert et.al. Nature, 2000

Page 9: title title

Yeast protein network- lethality and topological position -

Highly connected proteins are more essential (lethal)...

H. Jeong et al., Nature, 2001

Page 10: title title

Modules in biological systems

Metabolic networks Protein networks

E. Ravasz et al., Science, 2002

Page 11: title title

Can we identify the modules?

),min(

),(),(

jiT kk

jiJjiO J(i,j): # of nodes both i and j link to; +1 if there is a direct (i,j) link

Page 12: title title

Metabolism: E. Ravasz et al., Science, 2002Protein interactions: Rives and Galitski, PNAS, 2003 Spirin and Mirny, PNAS, 2003

Page 13: title title

Open questionsDoes the application of standart clusteringalgorithms reflect real modules well?

Since e.g. one protein can be part of more than one protein complex overlapping clustering algorithms should give better results.

Page 14: title title

Motifs Small subnetworks that appear in real world networks

significantly more often than in random graphs.

(Milo et al., Science, 2002; Conant and Wagner, Nature Gen., 2003, Shen-Orr et al., Nature Gen., 2002, Milo et al, Science, 2004)

Page 15: title title

From the particular to the universal

A.-L- Barabasi & Z. Oltvai, Science, 2002

Page 16: title title

Topology and Evolution

Page 17: title title

Topology and Evolution

S. Wuchty, Z. Oltvai & A.-L. Barabasi, Nature Genetics, 2003

Page 18: title title

S. Wuchty, Genome Res., 2004

Topology and evolution

- General distribution of orthologs:E = N(o)/N(p)

- degree-dependent distribution of orthologsek = Nk(o)/Nk

Orthologous Excess Retention:ERk = ek/E

Page 19: title title
Page 20: title title

Clustering in protein interaction networks

Goldberg and Roth, PNAS, 2003

high clustering = high quality of interaction

|))(||,)(min(|

|)()(| |)(||)(|

|)(||)(|log

wNvN

wNvNi

vwwN

N

iwN

vNN

i

vNC

Page 21: title title

Does that also hold for evolutionary conservation?

Page 22: title title

Protein-protein interaction data are highly flawed:

90% false positives, 50% false negatives

Von Mering et al., Nature, 2002

How stable are these results?

Page 23: title title

Something else?

Eisen et al., PNAS, 1998

Page 24: title title

Open question

?

Wuchty et al., submitted, 2004

?

Page 25: title title

Plasmodium falciparum• Eukaryotic organism• Malaria parasite• Genome size 23 MB, 14 chromosomes• 5300 genes (estimated, Hall et al., Nature 2002,

Gardner et al., Nature, 2002) • No protein interaction data available• Co-expression data available (Bozdech et al.,

PloS, 2003, LeRoch et al., Science, 2003)• 868 orthologs with Yeast (InParanoid, Remm et

al. J. Mol. Biol., 2001)

Page 26: title title

Plasmodium falciparum

Page 27: title title

Plasmodium falciparum

Page 28: title title

Inferred protein interaction network

in P. falciparum• 667 nodes, 3,564 weighted interactions

• Clustering

i

iii aeQ )( 2 i

iii aeQ )( 2

- Iteratively pruning edges starting with the least weighted link- Quality of clusters is assessed by their modularity

i

iii aeQ )( 2

until a maximum is reached.

Page 29: title title

All edges shown with Cvw > 1.Colorcode red: Cvw > 4, yellow: Cvw > 3, green: Cvw > 2, blue: Cvw > 1

Page 30: title title

What does that mean?

Validation of results?

Co-expression patternsBozdech et al. PLoS, 2003

Page 31: title title

replication

exo/protesome

DN

A processing

translation

RN

A processing

ribososome

Wuchty, Barabasi, Ferdig and Adams, in preperation

Page 32: title title

What‘s next?

• Uncovering evolutionary cores of interactions in other organisms.

• Application of a Maximum Set Cover Algorithm to predict protein interactions (Huang, Kaanan, Wuchty, Izaguirre and Cheng, submitted) to unfold the interactome using the evolutionary cores and experimentally derived interactions.