into the unknown: using systems biology approaches in ... · pdf filerev. biochem. 81:379-405....

78
Into the unknown: Using systems biology approaches in exploring new functions of plant receptor kinases Waltraud Schulze systembiologie.uni-hohenheim.de

Upload: lytuyen

Post on 06-Feb-2018

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Into the unknown: Using systems biology approaches

in exploring new functions of plant receptor kinases

Waltraud Schulze

systembiologie.uni-hohenheim.de

Page 2: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Plants are important

Plants can adapt to almost all climates (latitude / altitude)

Economic value: Food, Feed, Aesthetic and Recreational

Influence on word climate

We would like to understand more about how plants function and how they adapt to a changing environment on the molecular level.

Image source: www.wikipedia.com

Image source: www.kawakarpo.de

Light

Heat

Cold

Nutrients Water /

Drought

Pathogens

CO2

Page 3: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Plant model: Challenging anatomy and biochemistry

Cell wall: harsh extraction methods necessary with mechanical grinding.

Large central vacuole: low protein yield, acidic, secondary metabolites

Secondary metabolites: hydrophobic and interfere with protein purification

In green tissue: Rubisco as highest abundant protein can mask detection of low-abundant proteins

Image source: www.wikipedia.com

Page 4: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Signaling pathways and mutants

From: Bensimon A, Heck AJ, & Aebersold R (2012) Mass spectrometry-based

proteomics and network biology. Annu. Rev. Biochem. 81:379-405.

“One gene” resulting in “one

phenotype” relationship was

used to study signaling

pathways.

“Forward genetics”: Mutant

phenotype is used to find

underlying gene function.

Brassinosteroid

pathway

Ethylene

pathway

Plant hormone signaling pathways

Page 5: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Gene functions and phenotypes in Arabidopsis

Lloyd J & Meinke D (2012) A comprehensive dataset of genes with a loss-of-

function mutant phenotype in Arabidopsis. Plant Physiol. 158(3):1115-1129.

Arabidopsis

genome

published „kock-out“ line

collections

available

About 27416 gene models in

Arabidopsis.

Up to now, 2400 genes in

Arabidopsis are associated with

a loss-of-function phenotype.

Today, mainly “reverse-genetic”

approaches are used to study

gene functions.

Page 6: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Towards a „network biology“

From: Bensimon A, Heck AJ, & Aebersold R (2012) Mass spectrometry-based

proteomics and network biology. Annu. Rev. Biochem. 81:379-405.

Loss of protein function can

be buffered by cellular

network, leading to subtle

phenotypes

Understanding protein

function within the network

requires:

Parts list (measure

proteins in the cell)

Protein-Protein

interactions, Protein

signaling interactions

Page 7: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Network robustness and dynamics through „Proteoforms“

From: Smith LM et al. (2013) Proteoform: a single term describing protein

complexity. Nature Methods 10:186-187.

Functional complexity is

not achieved by many

genes, but rather by a

huge number of protein

variations (Proteoforms).

Proteoforms can result

from combinations of:

SNPs

Splice variants

Posttranslational

modifications

Image source: www.thermo.com

Page 8: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Importance of posttranslational modifications

From: Seet BT, Dikic I, Zhou MM, & Pawson T (2006) Reading protein

modifications with interaction domains. Nat. Rev. Mol. Cell Biol. 7(7):473-483.

Posttranslational modifications

serve multiple functions in

protein-protein interaction (PIN)

networks and protein signaling

networks (PSN).

Modifications and specific

protein domains often form

interaction sites of two proteins.

Page 9: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Kinases and phosphatases

Phosphorylation is one of the most widespread

posttranslational modifications (PTM).

Almost all cellular processes are regulated by one or more

phosphorylation or dephosphorylation events.

Phosphorylation can be activating or inactivating

We still do not fully understand the functional context for

most of the plant kinases.

Image source: www.thermo.com

Page 10: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Arabidopsis kinome

Monika Zulawski, PhD-student; Gunnar Schulze MSc-student

Page 11: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Carbon (sucrose)

Membrane proteins and nutrient signaling

Nitrogen (NO3, NH4)

Recognition,

Uptake across

membrane

Recognition,

Signal transduction

Regulation of transcription,

Regulation of protein activity

Modification-Dependent

Protein-Protein

Interactions

Dynamics of

Phosphorylation

Protein Complexes and

Membrane Microdomains

Page 12: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Quantitative proteomics strategies

Schulze WX, Usadel B (2010) Annual Review of Plant Biology 61: 491-516 Kierszniowska S, et al (2009) Proteomics 9: 1916-1924

Engelsberger WR, et al (2006) Plant Methods 2(14): 1-11 Arsova B, et al (2012) Trends in Plant Sciences 17: 102-112

Page 13: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

LVSWYDNEWGYSSR

Retention time [minutes]

60.8 61.0 61.2 61.4 61.6 61.8

MS

pe

ak

are

a [

Th

*co

un

ts]

0

2e+4

4e+4

6e+4

8e+4

MS full scan

m/z [Th]

880 882 884 886 888 890 892 894

Ion

in

ten

sit

y

0

2e+5

4e+5

6e+5

8e+5

Retention time [minutes]

60.8 61.0 61.2 61.4 61.6 61.8

MS

pe

ak

are

a [

Th

*co

un

ts]

0

2e+4

4e+4

6e+4

8e+4

H

L

1.34

1.52

1.43

1.38

1.19

1.21

1.13

1.12

1.06

0.98

1.07

1.231.26

H L

average of ratios: 1.22 ratio of XICs: 1.20 A B

C

MH2+: 881.40

m/z [Th]

200 400 600 800 1000 1200 1400 1600 1800

inte

ns

ity

0

500

1000

1500

2000

MH2+: 891.37

inte

ns

ity

0

500

1000

1500

2000

y3

y4

y5 y6

y10

y8

y9

b6 b9 b11

y12

y11

y12++

y3y4

y5 y6

y8

y9y10

b11y11y12

D MS2 fragment spectrum

HL

H

L

Extracted ion chromatogram Extracted ion chromatogram

891.37

881.40

15N-Metabolic Labeling: Data Extraction Workflow

Schulze WX, Usadel B (2010) Annual Review of Plant Biology 61: 491-516

Labeled and unlabeled peptide form co-elute in reversed-phase liquid chromatography.

Mass shift is dependent on the amino acid sequence.

Identification of the peptide is necessary before the peptide pairs of labeled and unlabeled form can be quantified.

Engelsberger WR, et al (2006) Plant Methods 2(14): 1-11

Arsova B, et al (2012) Trends in Plant Sciences 17: 102-112 Kierszniowska S, et al (2009) Proteomics 9: 1916-1924

Miancimu, 6054m, unclimbed

Page 14: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Peptide Correlation Profiling: Data extraction workflow

Sample 1

ion

in

ten

sit

y

0

1e+7

2e+7

3e+7

4e+7

Sample 2

Retention time [minutes]0 10 20 30 40 50 60 70 80 90

ion

in

ten

sit

y

0

1e+7

2e+7

3e+7

4e+7

Correlation of retention times

Retention time sample 1 [minutes]0 20 40 60 80

Re

ten

tio

n t

ime

sa

mp

le 2

[m

inu

tes

]

0

20

40

60

80

Sample 1

ion

in

ten

sit

y

0

1000

2000

3000

4000

5000

DLAVEGNEIIR M2H+: 614.83021

peak volume: 36627 Th*counts*s

Sample 1

m/z [Th]200 400 600 800 1000 1200

ion

in

ten

sit

y

0

500

1000

1500

2000

Extracted ion chromatogramMS2 fragment spectrum

y3a2

b2b3 b5

y6

y7

y8

y9 b10

Sample 2

Retention time [minutes]

24.0 24.5 25.0 25.5 26.0 26.5 27.0

ion

in

ten

sit

y

0

1000

2000

3000

4000

5000

peak volume: 13748 Th*counts*s

Sample 1

m/z [Th]615 616

ion

in

ten

sit

y

0

5e+4

1e+5

2e+5

2e+5

Sample 2

m/z [Th]615 616

M2H+: 614.83021 M2H+: 614.83021

MS full scan

RT: 25.39 min RT: 25.55 min

Base peak chromatogram

A B

C D

E

Schulze WX, Usadel B (2010) Annual Review of Plant Biology 61: 491-516

Peptide Sample 1 Sample 2

A 12 24

B 22

C 15

... ... ...

9

30

Retention time

Retention time peak alignment increases qunatitative

coverage in label-free approaches.

Arsova B, et al (2012) Molecular and Cellular Proteomcs 11: 619-628

Kawagebo, 6740m, unclimbed

Page 15: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

P P

Trypsin

Digest

Plasma membranes

right side out Brij 58 treated vesicles

inside-out -

-

LC-MS/MS 0.01%

Brij58

Flip

Seedling liquid culture

Experimental setup: Nutrient starvation and resupply

Niittylä T, et al (2007) Molecular and Cellular Proteomics 6: 1711-1726

starved 5 minutes 3 minutes 10 minutes 30 minutes

Full medium starvation Re-supply of nutrient

sucrose

NO3/NH4

Engelsberger WR, Schulze WX (2011) The Plant Journal 69: 978-995

Biological Experiment

Sample Preparation

Identification

Quantitation

Page 16: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

K-means clustering of phosphorylation time profiles.

Nutrient-dependent phosphorylation response classes

Engelsberger WR, Schulze WX (2012) The Plant Journal 69: 978-995

Page 17: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Protein functions represented in phosphorylation responses

cell organizatioin

signaling

transcription factors

protein degradation/synthesis

GPI-anchored plasma membrane

glycolysis & metabolism

Early responses (up to 5 min) involves receptor kinases, cell organization, transcription factors, GPI-anchored membrane proteins.

Late responses (after 5 min) involve protein degradation and synthesis, second messenger signaling and central metabolism, such as enzymes.

over-represented:

• transport p- and v-ATPases

• transport aquaporins

• cell organization

• unknowns

• TFs: MADS

over-represented:

• signaling (RLK, kinases)

• phosphatases

• protein synthesis

•transport other

• lipid metabolsim (sterols)

over-represented:

• protein degradation

• protein synthesis

• signaling (calcium, G-protein)

• transport p- and v-ATPases

over-represented:

• TFs: SBP, PWWP

•transport aquaporins

• RNA processing

• lipid metabolism steroids

• protein degradation

transport

over-represented:

• protein synthesis

• central metabolism

• hormone metabolism

• vesicle transport

• transport p- and v-ATPases

hormone metabolism

starved 5 minutes 3 minutes 10 minutes 30 minutes

Legend:

Engelsberger WR, Schulze WX (2012) The Plant Journal 69: 978-995

Niittylä T, et al (2007) Molecular and Cellular Proteomics 6: 1711-1726

Page 18: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Experimental:

Functional analysis

of candidate

proteins

Carbon (sucrose)

Membrane proteins and nutrient signaling

Nitrogen (NO3, NH4) Recognition,

Uptake across

membrane

Recognition,

Signal transduction

Regulation of

transcription,

Regulation of

protein activity

Phosphorylation-Dependent

Protein-Protein

Interactions

Dynamics of

Phosphorylation

Protein Complexes and

Membrane Microdomains

Computational:

Analysis of

phosphorylation

networks

Page 19: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

PhosPhAt – Arabidopsis protein phosphorylation site

database and phosphorylation site predictor

Durek P, et al (2010) Nucleic Acids Research 38: D828-D834

Heazlewood JL, et al (2007) Nucleic Acids Research 36: D1015-D1021

http://phosphat.mpimp-golm.mpg.de

Joshi H, et al (2010) Plant Physiology 155: 259-270

Page 20: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Kinase target search in PhosPhAt

Monika Zulawski, PhD-student Zulawski M, et al (2013) Nucleic Acids Research 41: D1176-1184

Page 21: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Integrated phosphorylation site predictor

Heazlewood JL, et al (2008) Nucleic Acids Research 36: D1015-D1021

Plant specific p-site predictor:

Based on support-vector machine.

Feature Vector evaluates information

on local sequence of amino acids,

amino acid chemical and physical

properties as well as structural

disorder indices.

Page 22: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Arabidopsis natural variation

1001 genome project

http://www.1001genomes.org/

Detlef Weigel, Tübingen

Different natural variants of Arabidopsis have been characterized with different

phenotypes.

Sequencing efforts of these accessions provides a basis for new system-wide

analyses to address evolutionary context of adaptation to environments.

Page 23: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Gain and loss of phosphorylation sites by SNPs

Loss of a psite by a non-synonymous Single Nucleotide Polymorphism (SNP)

Genotype 1 of Arabidopsis: T C T Ser

Genotype 2 of Arabidopsis: G C T Ala loss of psite

A – Adenin

C - Cytosin

T – Thymin

G – Guanin

Adaptation to different

environments?

Which proteins are affected by gain/loss of psites?

Are particular protein functions over-represented in

the proteins affected by gain/loss of psites?

Riano-Pachon DM, et al (2010) BMC Genomics 11: 411

Page 24: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Mapping of nsSNPs to phosphorylation sites

156034 nsSNPs

7178 experimental

psites

86 loss sites

(86 proteins)

Riano-Pachon DM, et al (2010) BMC Genomics 11: 411

75,396 hc

predicted psites

1114 loss sites

1148 gain sites

Dataset

Number of

non -

redundant

SNPs in this

study

Number of

non -

redundant

SNPs

mapping onto

cDNAs

Number of

non -

redundant

SNPs

mapping onto

CDS

Number of non -

redundant

SNPs causing

at least one

non -

synonymous

substitution

Number of

non -

redundant

SNPs always

causing

synonymous

substitutions

Nordborg [4] 20667 9251 8023 4047 3975

Clark [5] 637522 263718 227497 109709 117788

Ossowski [6] 860154 220984 174559 84400 90159

TOTAL 1247284 382770 315039 156034 159004

Dataset

Number of

non -

redundant

SNPs in this

study

Number of

non -

redundant

SNPs

mapping onto

cDNAs

Number of

non -

redundant

SNPs

mapping onto

CDS

Number of non -

redundant

SNPs causing

at least one

non -

synonymous

substitution

Number of

non -

redundant

SNPs always

causing

synonymous

substitutions

Nordborg 20667 9251 8023 4047 3975

Clark 637522 263718 227497 109709 117788

Ossowski 860154 220984 174559 84400 90159

TOTAL 1247284 382770 315039 156034 159004

Page 25: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Arabidopsis proteins with a loss of psite by nsSNP AGI Protein position Status Accession(substitution) TAIR7 function

AT1G01550.1 337 (pS)C24(A); Got-7(A); Bur-

0(G)BPS1 (BYPASS 1)

AT1G02850.1 402 (pS)

Bay-0(A); Br-0(A); Fei-

0(A); Nfa-8(A); Tamm-

2(A); Ts-1(A); Tsu-1(A);

Van-0(A)

Glycosyl hydrolase family 1 protein

AT1G13410.1 216 (pT)

Br-0(N); C24(N); Cvi-

0(N); Est-1(N); Ler-1(N);

Nfa-8(N); Rrs-10(N);

Sha(N)

Binding

AT1G19410.1 6 (pT) Bur-0(S) Similar to F-box family protein

AT1G22060.1 261 (pS) Sha(N) Similar to F-box family protein

AT1G22170.1 56 (pY) Bur-0(F)Phosphoglycerate/bisphosphoglycerate

mutase family protein

AT1G24250.1 45 (pT) Bur-0(A); Sha(A)Paired amphipathic helix repeat-containing

protein

AT1G33530.1 2 (pT) Tamm-2(A); Tsu-1(A) F-box family protein

AT1G44800.1 341 (pS)

Bur-0(N); Est-1(Y); Fei-

0(Y); Got-7(Y); Ler-1(Y);

Nfa-8(Y); Rrs-10(Y); Rrs-

7(Y); Tamm-2(Y); Ts-

1(Y); Van-0(Y)

Nodulin MtN21 family protein

AT1G52920.1 37 (pS)

Bur-0(P); C24(P); Fei-

0(P); Lov-5(P); Nfa-8(P);

Rrs-7(P); Tamm-2(P); Ts-

1(P); Tsu-1(P); Van-0(P)

Catalytic

AT1G62330.1 500 (pS)Est-1(G); Rrs-10(G); Tsu-

1(N)Similar to unknown protein

AT1G63870.1 856 (pT) Sha(A)Disease resistance protein (TIR-NBS-LRR

class), putative

AT1G63900.1 278 (pT) Bur-0(A); Tsu-1(A)Zinc finger (C3HC4-type RING finger)

family protein

AT1G64580.1 539 (pS) Ler-1(P); Tsu-1(P)Pentatricopeptide (PPR) repeat-containing

protein

AT1G65060.1 155 (pS) C24(T)4CL3 (4-coumarate:CoA ligase 3); 4-

coumarate-CoA ligase

AT1G65260.1 173 (pT) Bay-0(S); Tsu-1(S)PTAC4 (PLASTID

TRANSCRIPTIONALLY ACTIVE4)

AT1G68760.1 100 (pS)Bur-0(T); Cvi-0(T); Fei-

0(T); Ts-1(T)

ATNUDT1 (Arabidopsis thaliana Nudix

hydrolase homolog 1); dihydroneopterin

triphosphate pyrophosphohydrolase/

hydrolase

AT1G71710.1 306 (pS) C24(C)Inositol polyphosphate 5-phosphatase,

putative

AT1G78360.1 197 (pS) Bay-0(C)

ATGSTU21 (Arabidopsis thaliana

Glutathione S-transferase (class tau) 21);

glutathione transferase

AT1G79850.1 4 (pS)Bay-0(T); Bor-4(T); Est-

1(T); Rrs-7(T)

RPS17 (ribosomal protein S17); structural

constituent of ribosome

AT2G02160.1 552 (pT) C24(M); Tamm-2(M) Zinc finger (CCCH-type) family protein

AT2G03110.1 97 (pT) Bur-0(K); Tsu-1(K) Nucleic acid binding

AT2G20710.1 301 (pY)

Bay-0(C); Bor-4(C); Br-

0(C); Bur-0(C); C24(C);

Fei-0(C); Ler-1(C); Lov-

5(C); Nfa-8(C); Rrs-

10(C); Rrs-7(C); Sha(C);

Tamm-2(C); Ts-1(C); Tsu-

1(C); Van-0(C)

Pentatricopeptide (PPR) repeat-containing

protein

AT2G23360.1 690 (pT) Fei-0(M) Transport protein-related

AT2G27120.1 1000 (pS)

Ag-0(R); CIBC-5(R); Gy-

0(R); HR-10(R); NFA-

10(R); NFA-8(R); Pro-

0(R); Sq-1(R); Tamm-

2(R); Tamm-27(R); Ull2-

5(R); Var2-1(R); Var2-

6(R)

POL2B/TIL2 (TILTED2); DNA-directed

DNA polymerase

AT2G27535.1 8 (pT) Tsu-1(A) Ribosomal protein L10A family protein

AT2G28650.1 284 (pT) Bur-0(K); Lz-0(K)ATEXO70H8 (exocyst subunit EXO70

family protein H8); protein binding

AT2G34780.1 491 (pS) Bur-0(L); Tsu-1(L)EMB1611/MEE22 (EMBRYO DEFECTIVE

1611, maternal effect embryo arrest 22)

AT2G34930.1 371 (pS)Bur-0(A); Lov-5(A);

Sha(A); Tamm-2(A)Disease resistance family protein

AT2G35880.1 118 (pS) Est-1(A) Unknown function

AT2G37340.1 266 (pS)Bur-0(P); Cvi-0(P); Got-

7(P); Rrs-7(P)

RSZ33 (Arginine/serine-rich Zinc knuckle-

containing protein 33); nucleic acid binding /

nucleotide binding / zinc ion binding

AT2G46810.1 7 (pS) Got-7(N)Basic helix-loop-helix (bHLH) family

proteinAT3G10820.1 131 (pS) Tamm-2(C) Transcription elongation factor-related

AT3G13250.1 76 (pS) Bor-4(Y)Similar to AT hook motif-containing protein-

related

AT3G14350.1 363 (pT) Tsu-1(M)Leucine-rich repeat transmembrane protein

kinase, putative

AT3G16430.1 194 (pS) Tsu-1(I) Jacalin lectin family protein

AT3G18180.1 67 (pS) Bor-4(R); Fei-0(R) Unknown function

AT3G24513.1 23 (pS) Fei-0(A)Encodes a defensin-like (DEFL) family

protein.

AT3G26260.1 780 (pS)

Bor-4(T); Br-0(T); Bur-

0(T); Lov-5(T); Nfa-8(T);

Rrs-7(T)

Ulp1 protease family protein

AT3G26730.1 430 (pS) Van-0(L)Zinc finger (C3HC4-type RING finger)

family protein

AT3G29075.1 179 (pS) Tsu-1(A) Glycine-rich protein

AT3G29310.1 327 (pS) C24(I); Ts-1(I) Calmodulin-binding protein-related

AT3G44770.1 103 (pY)

Br-0(F); Bur-0(F); Cvi-

0(F); Est-1(F); Rrs-7(F);

Tamm-2(F); Tsu-1(F);

Van-0(F)

Unknown function

AT3G44960.1 91 (pT) Est-1(N); Tsu-1(N) Unknown function

AT3G48740.1 276 (pT) Est-1(I) Nodulin MtN3 family protein

AT3G49610.1 239 (pT) Bur-0(S) DNA binding

AT3G54090.1 44 (pS) Sha(G) PfkB-type carbohydrate kinase family protein

AT3G58390.1 65 (pY) Tsu-1(D)Eukaryotic release factor 1 family protein /

eRF1 family protein

AT4G02300.1 282 (pT)Bur-0(A); C24(A); Cvi-

0(A)Pectinesterase family protein

AT4G02860.1 142 (pS)Bur-0(N); Ts-1(N); Tsu-

1(N)Catalytic

AT4G02910.1 66 (pT) Tsu-1(P) Unknown function

Receptor proteins are overrepresented

Riano-Pachon DM, et al (2010) BMC Genomics 11: 411

Page 26: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Functional over- and underrepresentation in proteins

with phosphorylation sites affected by nsSNPs

overrepresented underrepresented

Receptor proteins and other proteins involved in signaling and stress response are

over-represented in proteins affected by gain or loss of phosphorylation sites.

Regulation in metabolism is more conserved.

Riano-Pachon DM, et al (2010) BMC Genomics 11: 411

Page 27: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Nitrogen (NO3, NH4)

Goal: Time-resolved phosphorylation network

metabolic enzymes

kinases

transcription

factors

vesicle

transport

cytoplasmic

kinases

raft proteins

transporters

active inactive

protein degradation

protein synthesis other

signaling

proteins

?

?

?

?

? ?

?

? ?

?

Aim: Reconstruct phosphorylation/signaling network from time course data.

Evaluation of different methods for network reconstruction.

Study network properties (degree, path lengths etc.).

Validation of results

Analysis of network properties in selected k.o. mutants / site-directed mutants.

Mapping to protein-protein interactions to network.

Carbon (sucrose)

Page 28: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Mapping phosphoproteins to protein-protein

interaction network

Phosphoproteins particularly map to

hub-proteins with high degree.

Average degree of phosphoproteins is

higher than that of non-phosphoproteins.

http://bioinfo.esalq.usp.br/atpin/atpin.pl

Size represents the

degree of each node

Cyan nodes represent

phosphoproteins

Page 29: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

*

Degree distribution of modified proteins

Proteins with phosphorylation and acetylation sites have highest

degree in protein-protein interaction networks.

Proteins with N-glycosylation, are less central in PINs.

GuangYou Duan; PhD-student

ph

os

ph

ory

lati

on

a

ce

tyla

tio

n

gly

co

syla

tio

n

Arabidopsis yeast human

* * *

* *

* * *

*

* *

*

* *

* *

*

de

gre

e

de

gre

e

de

gre

e

Page 30: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Protein-protein interactions and subcellular location

Subcellular locations of proteins are

used to assess best method for

network reconstruction.

Subcellular location can be used for

weighting predicted interactions.

Partial correlation method gave the

best results in our dataset.

Page 31: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Reconstructed network topology

Initiation layer: Plasma membrane proteins

Processing layer: highest overlap among

treatments

Effector layer: cytosolic proteins, nuclear

proteins. MAPK-motifs

Page 32: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

SIRK1

SWEET11

aquaporin

Time-resolved phosphorylation network

GuangYou Duan; PhD-student

Reconstructed network using Graphical Gaussian

Models (GGM).

Network neighbors: of a receptor kinase: sucrose

transporters; aquaporins; phosphatase 2C.

Page 33: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Experimental:

Functional analysis

of candidate

proteins

Carbon (sucrose)

Membrane proteins and nutrient signaling

Nitrogen (NO3, NH4) Recognition,

Uptake across

membrane

Recognition,

Signal transduction

Regulation of

transcription,

Regulation of

protein activity

Phosphorylation-Dependent

Protein-Protein

Interactions

Dynamics of

Phosphorylation

Protein Complexes and

Membrane Microdomains

Computational:

Analysis of

phosphorylation

networks

Page 34: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

SIRK1: Sucrose-Induced Receptor Kinase

Xuna Wu, PhD-thesis

15

20

25

30

35

40

Re

lati

ve

Ex

ore

ss

ion

(4

0-ΔCt)

AtSIR…

**

**

** **

A

B C sirk1 Col-0

SIRK1 (3436bp) Intron Exon

5’ ATG TGA 3’

LP

RP

LBb1.3

T-DNA

F1

R1

F2

R2

Page 35: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

sirk1 T-DNA

insertional

mutant

Functional characterization using phosphorylation site

mutants

Xuna Wu, PhD-thesis

CaMV35s SIRK1 GFP

CaMV35s SIRK1S744A GFP

CaMV35s SIRK1S744D GFP

Transformation of sirk1 T-DNA insertional mutant with:

SIRK cDNA

SIRK1 with mutated Ser-744 to Asp-744 (D);

phosphorylation mimic mutant

SIRK1 with mutated Ser-744 to Ala-744 (A);

phosphorylation null mutant

Page 36: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

SIRK1 kinase activity

sucrose treatment

0 + 0 + 0 + 0 +

Lu

min

escen

ce (

co

un

ts)

0

500

1000

1500

2000 EVC

35S::SIRK1

35S::SIRK1S744A

35S::SIRK1S744D

aa

b

d

bbc

cd

d

SIRK1 sucrose-dependent kinase activity

Xuna Wu, PhD-thesis

Luciferase-based kinase-activity assay shows sucrose-dependent

quench of luminescence when expressing SIRK1 in sirk1

background.

SIRK1 is a protein kinase

SIRK1 activity is increased by sucrose

Kinase GLO® kit

Substrate: myelin

basic protein

Page 37: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Microsome preparation

SIRK1-GFP SIRK1S744D-GFP SIRK1S744A-GFP

GFP-IP

(Chromotek)

3min

suc

0 suc

LC-MS/MS

GFP (control)

3min

suc

0 suc 3min

suc

0 suc 3min

suc

0 suc

Data analysis:

MaxQuant

cRacker

SIRK1 interaction partners

Xuna Wu, PhD-thesis

Page 38: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

7

2

14

SIRK1

sucrose starved

SIRK1

3min sucrose

SIRK1S744D

3min sucrose

SIRK1S744A

3min sucrose

16

0

36

1

2

2

2

3 3

16

75

47

63

1

3

Over-representation of

aquaporins:

PIP2E , PIP2B, PIP2A,

PIP1A, PIP1D PIP3A

Remorins

SIRK1 interaction partners

Over-representation of:

Clathrins, coatomers

Common interaction partners with plasma membrane location include:

Over-representation of aquaporins (6 isoforms)

Stronger interactions with S744D:

Clathrins and coatomers involved in vesicle trafficking

Xuna Wu, PhD-thesis

Page 39: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Functional analysis using kinase mutants

Xuna Wu, PhD-thesis

Full nutrition Sucrose

Starvation

SIRK1

SIRK1S744D

SIRK1S744A

Col-0

sirk1 Mass spectrometry Sucrose

Re-supply

3 minutes

14 days 2 days Protein extraction

Membrane preparation

TiO2

Page 40: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Altered phosphorylation patterns in kinase mutant sirk1

Xuna Wu, PhD-thesis

common phosphopeptides

transport

cell wall / vesicle trafficking

signaling

protein synthesis / degradation

DNA / RNA binding

other

not assigned

reduced in sirk1

Col-0 sirk1

n=51

reduced in

sirk1

10%

8% 8%

17%

6%

35%

16% 16%

22%

4% 32%

6%

8% 12%

Several transport proteins show reduced sucrose-induced

phosphorylation in sirk1 mutant.

Commonly identified

Page 41: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Altered phosphorylation particularly of transport proteins

Xuna Wu, PhD-thesis

Function Locus Phosphopeptides Bin Log2 fold (sp1

vs WT)

Description/Na

me Classification

Niittylä et

al. 2007

Tra

nsp

ort

re

late

d

AT3G58730 GI(pS)INAAR 34.1.1.3 -1.11 VHA-D

ATPase

AT4G30190 GLDIETPSHYTV 34.1.2 -0.5 AHA2 *

AT3G01390 ME(pS)NRGQGSIQQLLAAEVE

AQHIVNAAR 34.1 -0.65 VMA10

AT5G57110 TSLLK(pS)SPGR 34.21 -0.69 ACA8 *

AT1G57990 QTTAEGSANPEPDQIL (pS)PR 34.10 -1.08 ATPUP18 purine

transporter

AT5G27150 GFVPFVPG(pS)PTER 34.14 -0.72 NHX1 cation

transporter

AT2G37170 SLG(pS)FR(pS)AANV 34.19.1 -0.83 PIP2B

aquaporin

*

AT4G35100 ALG(pS)FR(pS)NATN 34.19.1 -2.26 PIP3

AT5G60660 ALGSFG(pS)FG(pS)FR 34.19.1 -1.33 PIP2;4/PIP2F

AT1G75220 (pS)FRDDNEEAR 34.2 -1.78 unkown Sugar

transporter

AT3G48740 LGTVS(pS)PEPISVVR 33.99 -0.91 ATSWEET 11 Sucrose

transporter *

Sig

na

lin

g

AT3G02880 LIEEVSHSSG(pS)PNPVSD 30.2.3 -0.54 LRR-RLK receptor kinase *

AT2G43130 (pS)DDDERGEEYLFK 30.5 -0.5 ARA4 G-protein

Does SIRK1 directly modulate transport proteins?

Page 42: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Validation: Protoplast swelling assay

Xuna Wu, PhD-thesis

Measurement of Water flux density based on

the volume change of protoplast per unit

surface area Volume flow density jv=ΔV/ (Δt*S0)

Low osmolarity High osmolarity

500 mM Mannitol

(ES-M500)

350 mM Mannitol

(ES-M350)

470mM Mannitol/

30 mM Sucrose

(ES470M_30S)

320mM Mannitol/

30 mM Sucrose

(ES320M_30S)

=

= Osmolarity

High osmolarity Low osmolarity

Sucrose-starved

seedlings Protoplast Isolation

(500 mM mannitol)

Protoplast treatment

with 30 mM sucrose Protoplast swelling observation

Suc Man

Video camera

Protoplast swelling assay with wild

type, sirk1 mutant and transformants

(SIRK1, SIRK1A, SIRK1D)

Sucrose or mannitol as solute

Page 43: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Protoplast swelling assay

Osmolyte

man suc man suc man suc man suc man suc

Wa

ter

flu

x d

en

sit

y [

j V

att

oL

s-1

µm

-2]

0

20

40

60

80

SIRK1sirk1Col-0

p<0.001

p=0.002

p=0.021

SIRK1S744A SIRK1S744D

p<0.001 p<0.001

SIRK1 modulates water flux in sucrose-dependent manner

Xuna Wu, PhD-thesis

Sucrose-induced swelling is significantly reduced in sirk1

Sucrose-induced swelling is restored in SIRK1-GFP-line

No difference between lines is observed for mannitol

treatment

Page 44: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

SIRK1 S744 is involved in receptor internalization

Xuna Wu, PhD-thesis

Sucrose induces SIRK1 internalization

within 15 minutes

Mutation S744A displays reduced

SIRK1 internalization

Mutation S744D results in sucrose-

independent SIRK1 internalization.

Page 45: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Functional model for SIRK1

Xuna Wu, PhD-thesis

sucrose starvation

sucrose resupply 3min

Cytosolic targets

SWEET11 aquaporin

aquaporin

SIRK1 AT3G02880

clathrin remorin

sucrose resupply 10min

S744A S744D

SIRK1 phosphorylation declines

SIRK1 dimerization becomes less

Phosphorylation activity of targets goes down

Phosphorylated SIRK1 forms are internalized

SIRK1

Page 46: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Summary

Dynamic analysis of protein modification allows definition of edges in

signaling networks.

Network model suggests relevant sub-networks relevant under particular

biological context (cue-signal response compendiums).

Kinase mutants and phosphorylation site mutants are important tools for

characterization of regulatory interactions in protein signaling networks.

For rapid adaptations to changing external conditions, direct regulations of

membrane transporters by membrane bound kinases become evident.

Tagchagpuri 6430m

First ascent

Page 47: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Slava Braginets (Programmer)

Sylwia Kierszniowska (PhD, PostDoc)

Heidi Pertl-Obermeyer (PostDoc)

Witold Szymanski (PhD)

Karen Vester (Bsc)

XuNa Wu (PhD, PostDoc)

Kerstin Zander (TA)

Monika Zulawski (PhD)

Wolfgang Engelsberger (PhD)

John Rohloff (BSc)

Robert Schmidt (Programmer)

Margitta Schümann (TA)

Alois Schweighofer (PostDoc)

Dmitri Schmidt (Bsc, MSc)

Natalie Arndt (Dipl)

Borjana Arsova (PostDoc)

Vivian Schüler (Bsc)

Gunnar Schulze (Msc)

Henrik Zauber (Dipl, PhD)

Signaling Proteomics Group

"All true wisdom is only to be found far from the dwellings of men, in the great solitudes; and it

can only be attained through suffering."

(Igjagarjuk, Greenland)

Collaborators:

Dirk Walther, MPIMP

Staffan Persson, MPIMP

Stefanie Hartmann, UP

Page 48: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics
Page 49: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

LVSWYDNEWGYSSR

Retention time [minutes]

60.8 61.0 61.2 61.4 61.6 61.8

MS

pe

ak

are

a [

Th

*co

un

ts]

0

2e+4

4e+4

6e+4

8e+4

MS full scan

m/z [Th]

880 882 884 886 888 890 892 894

Ion

in

ten

sit

y

0

2e+5

4e+5

6e+5

8e+5

Retention time [minutes]

60.8 61.0 61.2 61.4 61.6 61.8

MS

pe

ak

are

a [

Th

*co

un

ts]

0

2e+4

4e+4

6e+4

8e+4

H

L

1.34

1.52

1.43

1.38

1.19

1.21

1.13

1.12

1.06

0.98

1.07

1.231.26

H L

average of ratios: 1.22 ratio of XICs: 1.20 A B

C

MH2+: 881.40

m/z [Th]

200 400 600 800 1000 1200 1400 1600 1800

inte

ns

ity

0

500

1000

1500

2000

MH2+: 891.37

inte

ns

ity

0

500

1000

1500

2000

y3

y4

y5 y6

y10

y8

y9

b6 b9 b11

y12

y11

y12++

y3y4

y5 y6

y8

y9y10

b11y11y12

D MS2 fragment spectrum

HL

H

L

Extracted ion chromatogram Extracted ion chromatogram

891.37

881.40

15N-Metabolic labeling: Data extraction workflow

Schulze WX, Usadel B (2010) Annual Review of Plant Biology 61: 491-516

Labeled and unlabeled peptide form co-elute in reversed-phase liquid chromatography.

Mass shift is dependent on the amino acid sequence.

Identification of the peptide is necessary before the peptide pairs of labeled and unlabeled form can be quantified.

Engelsberger WR, et al (2006) Plant Methods 2(14): 1-11

Arsova B, et al (2012) Trends in Plant Sciences 17: 102-112 Kierszniowska S, et al (2009) Proteomics 9: 1916-1924

Page 50: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Peptide Correlation Profiling: Data extraction workflow

Sample 1

ion

in

ten

sit

y

0

1e+7

2e+7

3e+7

4e+7

Sample 2

Retention time [minutes]0 10 20 30 40 50 60 70 80 90

ion

in

ten

sit

y

0

1e+7

2e+7

3e+7

4e+7

Correlation of retention times

Retention time sample 1 [minutes]0 20 40 60 80

Re

ten

tio

n t

ime

sa

mp

le 2

[m

inu

tes

]

0

20

40

60

80

Sample 1

ion

in

ten

sit

y

0

1000

2000

3000

4000

5000

DLAVEGNEIIR M2H+: 614.83021

peak volume: 36627 Th*counts*s

Sample 1

m/z [Th]200 400 600 800 1000 1200

ion

in

ten

sit

y

0

500

1000

1500

2000

Extracted ion chromatogramMS2 fragment spectrum

y3a2

b2b3 b5

y6

y7

y8

y9 b10

Sample 2

Retention time [minutes]

24.0 24.5 25.0 25.5 26.0 26.5 27.0

ion

in

ten

sit

y

0

1000

2000

3000

4000

5000

peak volume: 13748 Th*counts*s

Sample 1

m/z [Th]615 616

ion

in

ten

sit

y

0

5e+4

1e+5

2e+5

2e+5

Sample 2

m/z [Th]615 616

M2H+: 614.83021 M2H+: 614.83021

MS full scan

RT: 25.39 min RT: 25.55 min

Base peak chromatogram

A B

C D

E

Schulze WX, Usadel B (2010) Annual Review of Plant Biology 61: 491-516

Peptide Sample 1 Sample 2

A 12 24

B 22

C 15

... ... ...

9

30

Retention time

Retention time peak alignment increases qunatitative

coverage in label-free approaches.

Arsova B, et al (2012) Molecular and Cellular Proteomcs 11: 619-628

Page 51: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Summary

Dynamic analysis of protein modification allows definition of edges in

signaling networks.

Network model suggests relevant sub-networks relevant under particular

biological context (cue-signal response compendium).

Kinase mutants and phosphorylation site mutants are important tools for

characterization of regulatory interactions in protein signaling networks.

For rapid adaptations to changing external conditions, direct regulations of

membrane transporters by membrane bound kinases become evident.

To be done:

Integration of sirk1 data set into the dynamic network.

Targeted analysis of phosphorylation stoichiometry in CSR compendiums

Tagchagpuri 6430m

First ascent

Page 52: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Plans and Ideas

Research:

Protein-Protein Interaction Networks and Protein Signaling Networks: Use

quantitative data for true modeling

PINs and PSNs in natural variation (e.g. Arabidopsis accessions)

Quantitation of phosphorylation stoichiometry

Small molecule–Protein Interactions / protein-lipid interaction

Dynamics of membrane microdomains in context of external perturbations

Teaching:

Large-scale high-throughput methods

Experimental design and data analysis in quantitative proteomics

„Big data“ and networks (PINs, PSNs, CSRs)

Primary metabolism and nutrient signaling in plants

Plant signaling networks

Page 53: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Proteome space accessible for quantitation

From: Bensimon A, Heck AJ, & Aebersold R (2012) Mass spectrometry-based

proteomics and network biology. Annu. Rev. Biochem. 81:379-405.

Instrumentation has and will be improved:

Scan speed.

Mass accuracy.

Sensitivity.

Selectivity.

Analysis of accurate and full proteome coverage become feasible

Page 54: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Towards a „network biology“

From: Bensimon A, Heck AJ, & Aebersold R (2012) Mass spectrometry-based

proteomics and network biology. Annu. Rev. Biochem. 81:379-405.

Networks consist of nodes and

connecting edges.

Protein-protein interaction

networks (PIN): undirected,

direct physical interactions

Protein-signaling networks

(PSN): directed, not equal to

PINs.

Measured by:

AP-MS

Y2H

cell biology

Measurement ?:

transient

often, edgeds need

to be inferred

Page 55: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Towards a „network biology“

From: Bensimon A, Heck AJ, & Aebersold R (2012) Mass spectrometry-based

proteomics and network biology. Annu. Rev. Biochem. 81:379-405.

Data sets for “network

biology” need to fulfill these

criteria:

Complete datasets (all

nodes and edges need to

be measurable)

Reproducible data sets

Quantitative data

Reasonable throughput

Page 56: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Towards a „network biology“

From: Bensimon A, Heck AJ, & Aebersold R (2012) Mass spectrometry-based

proteomics and network biology. Annu. Rev. Biochem. 81:379-405.

Page 57: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Towards a „network biology“

Lloyd J & Meinke D (2012) A comprehensive dataset of genes with a loss-of-

function mutant phenotype in Arabidopsis. Plant Physiol. 158(3):1115-1129.

Essential genes are enriched for RNA and

protein synthesis and protein modification,

but deficient in transcription factors

Genes inducing morphological

phenotypes are enriched in transcriptional

regulation and signaling functions.

Essential genes are more likely to be

unique (no functional redundancy)

Page 58: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Quantitative proteomics workflow

raw data files from mass spectrometer

extracted MS/MS peak

list

validation of ID

Quantitation based on

MS or MS/MS scans

Sequence ID in

database search

validation of

quantification

Peptide average (good spectra for ID, good spectra

for quant)

Protein average (unique peptides per protein)

Statistical analysis,

report

Raw data are being processed to allow spectra to be assigned to peptide sequences.

quantitative information is extracted based on MS or MS/MS spectra.

Spectra with assigned peptide sequences are linked to quantitative information.

Identification and quantitation usually is validated by a manual verification step.

Averaged peptide and protein ratios are then subjected to statistic analysis and final evaluation.

Schulze WX Usadel B (2010) Annual Reviews of Plant Biology 61: 491-516

Page 59: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

MASCP Aggregator Portal

Joshi H, et al (2010) Plant Physiology 155: 259-270

Integration of different proteomic resources in a single portal:

AtPeptide

SUBA

TAIR

PhosPhAt

PPDB

http://gator.masc-proteomics.org/

Page 60: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Reproducibility of experimental phosphorylation sites

Most phosphorylation sites and most phosphoproteins

have been identified only once (in one experiment).

Only 2% of the phosphorylation sites have been

identified by more than two different studies.

Durek P, et al (2010) Nucleic Acids Research 38: D828-D834

Page 61: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Arabidopsis kinome and gene duplications

Monika Zulawski, PhD-student

Page 62: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Gene duplications in different kinase families

Monika Zulawski, PhD-student

Page 63: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Functional diversification upon gene duplication

Monika Zulawski, PhD-student

+ phylogeny - phylogeny

Page 64: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Mass spectrometry based proteomics

Separation by 2D-gel

select and isolate spots

Complex peptide mixture

separate by liquid

chromatography

Identify proteins by

peptide mass fingerprint

Identify proteins by tandem

mass spectrometry

Protein mixture

Page 65: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

cRacker as automated and standardized data analysis tool

Zauber H, Schulze WX (2012) Journal of Proteome Research, 11 (11): 5548-5555

cRacker produces graphic output and statistics:

ANOVA, ttest: presentation as volcano plots

K-means or hirarchical clustering

Over-representation test

Henrik Zauber; PhD-thesis

Page 66: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Progression of phosphorylation from membrane to cytosol

Proteins with maximal phosphorylation in early time points are enriched in

membrane proteins.

Proteins with maximal phosphorylation in later time points are enriched in soluble

proteins.

Distribution of membrane proteins to response groups

number of membrane spans in proteins

single TM more than 4 TM soluble

ab

un

dan

ce f

racti

on

0.0

0.2

0.6

0.8

1.0

untreated

03min

05min

10min

30min

* * *

Engelsberger WR, Schulze WX (2011) The Plant Journal 69: 978-995

P

Intracellular protein

Kinase domain

Plasma

membrane

Membrane

bound kinases

P

Transporter

P

Intermediate

protein P

S744

Page 67: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Multi-

Single

Single-Multi Multi-Multi Single-Single Single-

No

No-

Single

Multi-No No-Multi

Kinase 0.651 0.058 0.421 0.375 0.373 1 1 1

Phosphatase 1 1 1 0.016 1 0.521 1 1

Transport 0.433 1 1 0.442 0.698 0.297 0.360 0.634

Transcription 0.415 1 1 0.081 1 1 1 1

Metabolic 0.346 0.222 0.614 0.551 0.751 0.531 0.706 1

p-Values of Fisher Exact Test for representation of biological functions in nodes of phosphorylation site network. Directionality from

time information

Functional properites of phosphoprotein nodes

Kinases are over-represented in single-multi nodes.

Transcription factors and phosphatases are over-represented in single-

single nodes.

GuangYou Duan; PhD-student

Page 68: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Mass-spectrometry based quantitative proteomics T

iss

ue

P

rote

in

Pe

pti

de

s

Sp

ec

tru

m

m/z

Da

ta

m/z m/z

*

* *

* *

*

*

*

m/z

* * *

*

*

*

* *

*

m/z

m/z m/z m/z

Metabolic

Labeling

Chemical Labeling

Proteins Peptides

Synthetic

Standard

Peptides

Label-free Quantitation

MS

MS2

MS MS MS MS MS

Schulze WX, Usadel B (2010) Annual Reviews of Plant Biology 61: 491-516

Page 69: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Mass-spectrometry Based Quantitative Proteomics T

iss

ue

P

rote

in

Pe

pti

de

s

Sp

ec

tru

m

m/z

Da

ta

m/z m/z

*

* *

* *

*

*

*

m/z

* * *

*

*

*

* *

*

m/z

m/z m/z m/z

Metabolic

Labeling

Chemical Labeling

Proteins Peptides

Synthetic

Standard

Peptides

Label-free Quantitation

MS

MS2

MS MS MS MS MS

Schulze WX, Usadel B (2010) Annual Reviews of Plant Biology 61: 491-516

Page 70: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

inserted peptide based on m/z and time

real peptide

Reproducible LC required for label-free quantitation

Quantitative comparison can only be

done between the same peptide

sequence/charge state/modification.

Averaging per protein only after

standardization of peptide data.

Page 71: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Nano-LC-MS/MS Setup

C18-column (75 m),

8 m tip

flow rate: 250 nL/min

nano-HPLC LTQ-Orbitrap mass spectrometer

Protein mixture

Page 72: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Protein identification

by mass spectrometry

V I/L

D G D

AAVIGDTIGDPLK

Total ion chromatogram

MS/MS fragment spectrum

MS full scan

T I/L G

L P D D G I

Peak List

Database

Search Peptide sequence

for Quantification

for Identification

Genome sequencing efforts

Predicted protein sequences

Page 73: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Protein inference problem in peptide-based proteomics

From: Qeli E & Ahrens CH (2010) PeptideClassifier for protein inference

and targeted quantitative proteomics. Nature Biotech. 28(7):7647-7650.

Peptide sequences often cannot be assigned to a single protein only.

Peptides „specific“ to a protein: Proteotypic peptides.

Often not proteins, but „protein groups“ are identified by mass spectrometry.

Limitation to current „network“ biology approaches

Page 74: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Proteome coverage for quantitation

10-11 10-10 10-9 10-8

quantified

protein concentration

(mol/gFW)

Biggest challenge to current „network“ biology approaches

Schulze WX, Usadel B (2010) Annual Review of Plant Biology 61: 491-516

From all the proteins in the sample only a subset is being identified in LC-MS/MS experiments.

Out of the identified proteins another subset is suitable for quantitation.

Usually the more high abundant proteins are covered by identification and quantification.

Fractionation is necessary for in-depth analysis.

Page 75: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Missing value problem

Difficulty for statistical analysis and biological interpretation

Stochastic component in classic dda LC-MS/MS leads to missing values.

in different samples, different sets of proteins are identified / quantified.

Solutions:

„Better“ instrumentation.

Targeted analysis of protein subsets.

1000s proteins,

Incomplete data

„Discovery“

„Quantitation“

100s proteins

Complete data

Page 76: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

15N metabolic labeling

Chemical labeling

Label free approaches

Application of Different Quantitative Methods

Targeted approaches using standard peptides

Precision Coverage

+++

Dynamic

range

+

++

+++

++

+

++

+++

+++

++

++

++

Different experimental setups require different choice of quantitative method.

Depending on the biological question, the precision, coverage, and dynamic range must be considered.

Arsova B, et al (2012) Trends in Plant Sciences, 17: 102-112

Arsova B, et al (2012) Molecular and Cellular Proteomcs 11: 619-628

Page 77: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

cRacker as automated and standardized data analysis tool

Zauber H, Schulze WX (2012) Journal of Proteome Research, 11 (11): 5548-5555

cRacker can process quantitative data from various quantitation softwares (MaxQuant, MSQuant, Progenesis, custom definitions).

Processing of labeled and unlabeled data.

Different possibilites for normalization and standadization.

Henrik Zauber; PhD-thesis

http://cracker.mpimp-golm.mpg.de

Page 78: Into the unknown: Using systems biology approaches in ... · PDF fileRev. Biochem. 81:379-405. “One gene ... Heck AJ, & Aebersold R (2012) Mass spectrometry-based proteomics

Towards a „network biology“

From: Bensimon A, Heck AJ, & Aebersold R (2012) Mass spectrometry-based

proteomics and network biology. Annu. Rev. Biochem. 81:379-405.

Networks consist of nodes and

connecting edges.

Protein-protein interaction networks:

undirected, direct physical interactions

Protein-signaling networks: directed,

not equal to PINs.

Measured by:

AP-MS

Y2H

cell biology

Measurement ?:

transient

often, edgeds need

to be inferred