mass spectrometry applications for comparative proteomics

241
Mass Spectrometry Applications for Comparative Proteomics and Peptidomic Discovery by Robert Stewart Cunningham A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy (Chemistry) at the UNIVERSITY OF WISCONSIN-MADISON 2012 Date of final oral examination: 10/4/12 The dissertation is approved by the following members of the Final Oral Committee: Lingjun Li, Professor, Chemistry/Pharmacy Albee Messing, Professor, Comparative Biosciences Lloyd Smith, Professor, Chemistry Warren Heideman, Professor, Pharmacy Tim Bugni, Assistant Professor, Pharmacy

Upload: others

Post on 26-May-2022

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Mass Spectrometry Applications for Comparative Proteomics

Mass Spectrometry Applications for Comparative Proteomics and Peptidomic

Discovery

by

Robert Stewart Cunningham

A dissertation submitted in partial fulfillment of

the requirements for the degree of

Doctor of Philosophy

(Chemistry)

at the

UNIVERSITY OF WISCONSIN-MADISON

2012

Date of final oral examination 10412

The dissertation is approved by the following members of the Final Oral Committee

Lingjun Li Professor ChemistryPharmacy

Albee Messing Professor Comparative Biosciences

Lloyd Smith Professor Chemistry

Warren Heideman Professor Pharmacy

Tim Bugni Assistant Professor Pharmacy

i

Acknowledgements

I would like to acknowledge the support and guidance from professors colleagues

and friends at the University of Wisconsin-Madison who are indispensable to this thesis

First I would like to express my deep gratitude to my advisor Prof Lingjun Li for

allowing me the freedom to chase scientific endeavors all while offering her constant

guidance assistance and support through my PhD study Her constant energy and

enthusiasm in research have led by example in performing research and inspired me to

make the most of the time given to me Dr Li encouraged me to take on challenging

projects apply for awards travel and present my research to the larger scientific

community None of my work would be achieved without her and I want to thank Dr Li

for her support during these years

I would also like to thank the members of my committee Dr Lingjun Li Dr

Albee Messing Dr Lloyd Smith Dr Warren Heideman and Dr Tim Bugni I truly

appreciate the willingness of these professors to take time out of their busy schedules to

serve as members of my committee

I have benefited greatly from previous members of the Li Lab In particular I

would like to thank Dr James Dowell Dr Xin Wei Dr Robert Sturm and Dr Limei

Hui for their patient and valuable suggestions in my research and also teaching me

valuable experimental skills how to perform general shotgun proteomics and how to use

several instruments Specifically I would like to thank Daniel Wellner who has worked

with me on numerous projects over the past 2 years and has been a constant in my

research life I also want to thank my wonderful current colleagues Jingxin Wang Tyler

ii Greer Chris Lietz Chenxi Jia Dustin Frost Di Ma Hui (Vivian) Ye Nicole Woodards

and Claire Schmerberg for their collaboration in many challenging research projects and

fruitful discussions on various research areas There are too many people to thank each

one individually but every member of the Li lab has in some way contributed to my

learning experience Beyond research work their friendship also made my life here in

Madison much more enjoyable

I would also like to thank our collaborators Dr Albee Messing Dr Warren

Heideman Dr Xin Sun and Dr James Dowell It is my great pleasure to have the

opportunities to work with these amazing people and gain precious experience I have

learned so much from them and their achievements in the field have inspired me to strive

to do the best I could

Furthermore I would like to thank Gary Girdaukas and Dr Cameron Scarlett at

School of Pharmacy for the access of the MALDI-FTMS and Bruker amaZon ion trap

instruments

In particular I wish to thank my family my Mom and Step-Dad for raising me

and my Dad for always being there for me They all supported me in my decision to

pursue science and specifically a career in chemistry I would like to thank my Sister

who grew up with me and always led by example in academics Most importantly I

would like to thank my wife Na Liu for her constant support She has inspired and

helped me finish my PhD and always encouraged me to be the best I could be To them

I dedicate this thesis

iii

Table of Contents

Page

________________________________________________________________________

Acknowledgements i

Table of Contents iii

Abstract iv

Chapter 1 Introduction brief background and research summary 1

Chapter 2 Mass spectrometry-based proteomics and peptidomics for

biomarker discovery and the current state of the field 15

Chapter 3 Protein changes in immunodepleted cerebrospinal fluid from

transgenic mouse models of Alexander disease detected

using mass spectrometry 73

Chapter 4 Genomic and proteomic profiling of rat adapted scrapie 110

Chapter 5 Investigation of the differences in the phosphoproteome

between starved vs glucose fed Saccharomyces cerevisiae 139

Chapter 6 Use of electron transfer dissociation for neuropeptide

sequencing and identification 166

Chapter 7 Investigation and reduction of sub-microgram peptide loss

using molecular weight cut-off fractionation prior to

mass spectrometric analysis 187

Chapter 8 Conclusions and future directions 206

Appendix 1 Protocols for sample preparation for mass spectrometry

based proteomics and peptidomics 217

Appendix 2 Publications and presentations 233

_______________________________________________________________________

iv

Mass Spectrometry Applications for Comparative Proteomics and

Peptidomic Discovery

Robert Stewart Cunningham

Under the supervision of Professor Lingjun Li

At the University of Wisconsin-Madison

Abstract

In this thesis multiple biological samples from various diseases models or

treatments are investigated using shotgun proteomics and improved methods are

developed to enable extended characterization and detection of neuropeptides In general

this thesis aims to expand upon the rapidly evolving field of mass spectrometry (MS)-

based proteomics and peptidomics by primarily enhancing small scale sample analysis

A review of the current status and progress in the field of biomarker discovery in

peptidomics and proteomics is presented To this rapidly expanding body of literature

our critical review offers new insights into MS-based biomarker studies investigating

numerous biological samples methods for post-translational modifications quantitative

proteomics and biomarker validation Methods are developed and presented including

immunodepletion for small volume cerebrospinal fluid (CSF) samples for comparison of

the CSF proteomes between an Alexander disease transgenic mouse model with

overexpression of the glial fibrillary acidic protein and a control animal This thesis also

covers the application of the small scale immunodepletion of CSF for comparative

proteomic analysis of a novel rat adapted scrapie (RAS) model for prion disease and

v

compares the RAS CSF proteome to control rat CSF using MS Large scale

phosphoproteomics of starved vs glucose fed yeast is presented to better understand the

phosphoproteome changes that occur during glucose feeding Method development for

neuropeptide analysis is expanded upon using electron transfer dissociation (ETD)

fragmentation to successfully sequence for the first time the crustacean hyperglycemic

hormone precursor-related peptide (CPRP) from the blue crab Callinectes sapidus In

addition a method for ETD sequencing of sulfonated neuropeptides using a magnesium

salt adduct in an ion trap mass spectrometer is reported This thesis also reports on a

method for sub-microg peptide isolation when using a molecular weight cut-off filtration

device to improve sample recovery by over 2 orders of magnitude All the protocols used

throughout the work are provided in an easy to use step-by-step format in the Appendix

Collectively this body of work extends the capabilities of mass spectrometry as a

bioanalytical tool for shotgun proteomics and expands upon methods for neuropeptide

discovery and analysis

1

Chapter 1

Introduction Brief Background and Research Summary

2

Abstract

Mass spectrometry based comparative proteomics and improved methods for

neuropeptide discovery have been reported in this thesis A very brief introduction of crustacean

neuropeptidomics is covered in this chapter followed by Chapter 2 which covers in great detail

comparative proteomics using mass spectrometry with an emphasis on biomarker discovery

Chapter 3 reports a novel immunodepletion technique used for comparative proteomics between

glial fibrillary acidic protein (GFAP) overexpressor and control mouse cerebrospinal fluid (CSF)

Chapter 4 involves rat CSF comparative proteomics between rat adapted scrapie and control

animals Chapter 5 details comparisons of phosphoproteomes in Saccharomyces cerevisiae

(Bakerrsquos yeast) between starved vs glucose fed conditions Chapter 6 investigates the use of

electron transfer dissociation (ETD) for kDa sized neuropeptides and neuropeptides with tyrosine

sulfonation Chapter 7 presents a molecular weight cut-off (MWCO) protocol to allow sub-microg

peptides to be recovered Chapter 8 provides a conclusion to this thesis and outlines future

directions for certain projects

3

Background

Mass spectrometry (MS) requires gas phase ions for experimental measurement and

intact large nonvolatile biomolecules proved difficult to be analyzed by electron ionization or

chemical ionization until the invention of two soft ionization techniques matrix-assisted laser

desorptionionization (MALDI)1 and electrospray ionization (ESI)

2 ESI and MALDI are the

two most common soft ionization techniques for mass spectrometry Once ionized molecules

such as peptides or proteins can be separated by their mass to charge ratios (mz) using various

mass analyzers manipulating electric andor magnetic fields ESI and MALDI mass

spectrometric techniques have become central analytical methods in biological sciences because

they are suitable for nonvolatile thermally labile analytes in femtomole quantities3 ESI allows

the coupling of high pressure liquid chromatography and the constant flow of solvent is

electrically charged at the outlet to produce a Taylor cone4 During desolvation the Raleigh

limit is reached and a coulombic explosion occurs commonly producing multiply charged ions

A modification of ESI is nanoLC-ESI which operates at nLmin flow rates to analyze low sample

amounts such as proteins or peptides in biological samples5 NanoLC-ESI uses less solvent as

the source because of the reduced flow rate of standard LCMS with an ESI source and nanoLC-

ESI is the predominate inlet for shotgun proteomic ESI MS experiments6 Conversely MALDI

can use sub-microL volume of sample and be co-crystallized with an ultraviolet absorbing organic

matrix that is obliterated using a laser at appropriate wavelength to generate gas phase ions

Alternatively MALDI has the unique capability to work with tissue samples and ionize in the

solid state instead of liquid like ESI

4

Mass analyzers require an operating pressure between 10-4

-10-10

Torr to allow proper ion

transfer and separationisolation of the ion based upon its mz Numerous mass analyzers are

currently available and each have their own strengths and weaknesses as shown in Figure 1 The

biomolecules are separated by the mass analyzers and detected without fragmentation which is

termed MS analysis Following MS detection tandem mass spectrometry (MSMS) of the

original precursor ion can be performed to provide additional structural information such as a

ladder sequence of amino acids for peptides Numerous fragmentation techniques are available

for proteomics such as collision induced dissociation (CID) electron transfer dissociation (ETD)

or high energy collision induced dissociation (HCD) Each of these fragmentation techniques

have their own benefits in proteomics experiments and are discussed in depth in Chapter 2 The

background and current status for comparative proteomics with specific emphasis on biomarker

analysis are covered in Chapter 2

Neuropeptidomic Method Development in the Crustacean Model System

Utilizing Mass Spectrometry

Chapter 6 and 7 focus on sample preparation and ETD fragmentation techniques to

characterize neuropeptides in the neuroendocrine organs in the crustacean nervous system

Neuropeptides are short chains of amino acids which are a diverse class of cell-cell signaling

molecules in the nervous system Neuropeptides have been investigated for being involved in

numerous physiological processes such as memory7 learning

8 depression

9 pain

10 reward

11

reproduction12

sleep-wake cycles13

homeostasis14

and feeding15-17

Figure 2 depicts how

neuropeptides are synthesized as pre-prohormones in the rough endoplasmic reticulum and

5

packaged in the Golgi apparatus After being packaged these pre-prohormones are processed

into bioactive peptides within the vesicle which is occurring during vesicular transport down an

axon and released into the synaptic cleft Secreted neuropeptides stimulate the post synaptic

neurons by interacting with G-protein coupled receptors at the chemical synapse

The crustacean model nervous system is well-defined neural network which has been

used in neurobiology and neuroendocrine (neurosecretory) research and thus lends itself for

studying neuromodulation18-22

Figure 3 shows the locations of several neuroendocrine organs in

the crustacean Callinectes sapidus including the sinus gland (SG) which is studied in Chapter 6

The Lingjun Li lab has focused on neuropeptide discovery and function in crustacean

neuroendocrine organs using mass spectrometry23-25

The work presented in Chapters 6 and 7

expand on sample preparation and analytical tools to further investigate the neuropeptidome

Research Overview

Comparative Proteomics of Biological Samples

Chapter 2 provides a thorough review of comparative proteomics and biomarker analysis

using mass spectrometry The scientific community has shown great interest in the field of mass

spectrometry-based proteomics and peptidomics for its applications in biology Proteomics

technologies have evolved to generate large datasets of proteins or peptides involved in various

biological and disease progression processes producing testable hypotheses for complex

biological questions This chapter provides an introduction and insight into relevant topics in

proteomics and peptidomics including biological material selection sample preparation

separation techniques peptide fragmentation post-translational modifications quantification

6

bioinformatics and biomarker discovery and validation In addition current literature and

remaining challenges and emerging technologies for proteomics and peptidomics are discussed

Chapter 3 investigates comparative proteomics between a GFAP overexpressor mouse

model and a control mouses cerebrospinal fluid (CSF) CSF is a low protein content biological

fluid with dynamic range spanning at least nine orders of magnitude in protein content and is in

direct contact with the brain but consist of very abundant proteins similar to serum which require

removal A modified IgY-14 immunodepletion treatment is presented to remove abundant

proteins such as albumin to enhance analysis of the low volumes of CSF that are obtainable

from mice These GFAP overexpressor mice are models for Alexander disease (AxD) a severe

leukodystrophy in humans From the CSF of control and transgenic mice we present the

identification of 266 proteins with relative quantification of 106 proteins Biological and

technical triplicates are performed to address animal variability as well as reproducibility in mass

spectrometric analysis Relative quantitation is performed using distributive normalized spectral

abundance factor (dNSAF) spectral counting analysis A panel of biomarker proteins with

significant changes in the CSF of GFAP transgenic mice are identified with validation from

ELISA and microarray data demonstrating the utility of our methodology and providing

interesting targets for future investigations on the molecular and pathological aspects of AxD

Chapter 4 explores the CSF proteomics of a novel prion model called rat adapted scrapie

(RAS) A similar IgY immunodepletion technique is presented as in Chapter 3 and is similarly

used to reduce the abundant proteins in CSF CSF from control and RAS (biological N=5

technical replicates N=3) were digested and separated using one dimensional reversed-phase

nanoLC separation In total 512 proteins 167 non-redundant protein groups and 1032 unique

peptides are identified with a 1 FDR Comparative analysis was done using dNSAF spectral

7

counting and 21 proteins were significantly up or down-regulated The proteins are compared to

the 1048 differentially regulated genes and additionally compared to previously published

proteins showing changes consistent with other prion animal models Of particular interest is

RAS specific proteins like RNAseT2 which is not up-regulated in mouse prion disease but is

designated as upregulated in both the genomic and proteomics data for RAS

Chapter 5 explores comparative proteomics between starved and glucose fed

Saccharomyces cerevisiae (bakers yeast) Yeast depends on nutrients such as glucose to

survive and need to cycle between states of growth and quiescence Previous work by the

Heideman lab investigated the transcriptional response to fresh glucose in yeast26

Kinases such

as protein kinase A (PKA) and target of rapamycin kinase (TOR) are involved in the glucose

response so we described a large scale phosphoproteomic MS based study in this chapter

Yeast cell extract was digested and phosphopeptides were enriched by immobilized metal

affinity chromatography (IMAC) followed by two dimensional high pH-low pH reversed phase

(RP)-RP separation The low pH separation was infused directly into an ion trap mass

spectrometer with neutral loss triggered ETDfragmentation Specifically the CID fragmentation

can cause a neutral loss of 98z from the H3PO4 and can produce an incomplete fragmentation

pattern and lead to ambiguous identification so when the neutral loss is detected ETD MSMS

fragmentation is performed The neutral loss triggered ETD fragmentation is included in this

study to improve phosphopeptide identifications In total 477 phosphopeptides are identified

with 06 FDR and 669 phosphopeptides identified with 54 FDR Motif analysis of PKA and

phosphosite validation are performed as well

8

The future of comparative proteomics investigating small sample amounts or PTMs is

promising Further advances in enrichment separations science mass spectrometry

analyzersdetectors and bioinformatics will continue to create more powerful tools that enable

digging deeper in proteomics with both small (mouse CSF) and large (cell extracts) sample

amounts

Methods for Neuropeptide Analysis Using ETD fragmentation and Sample

Preparation

Chapter 6 investigates the utility of ETD fragmentation to sequence endogenous large

neuropeptides and labile tyrosine sulfation in the crustacean Callinectes sapidus The sinus

gland (SG) of the crustacean is a well-defined neuroendocrine site that produces numerous

hemolymph-borne agents including the most complex class of endocrine signaling moleculesmdash

neuropeptides One such peptide is the crustacean hyperglycemic hormone (CHH) precursor-

related peptide (CPRPs) which was not previously sequenced Electron transfer dissociation

(ETD) is used for sequencing of intact CPRPs due to their large size and high charge state In

addition ETD is used to sequence a sulfated peptide cholecystokinin (CCK)-like peptide in the

lobster Homarus americanus using a salt adduct Collectively this chapter presents two

examples of the utility of ETD in sequencing neuropeptides with large molecular weight or with

labile modifications

Chapter 7 reports a protocol on improving recovery of minute quantities of peptides by

adding methanol and a salt modifier (NaCl) to molecular weight cut-off membrane-based

centrifugal filters (MWCO) to enrich sub-microgram peptide quantities In some biological

9

fluids such as CSF the endogenous peptide content is very low and using pure water to perform

the MWCO separation produces too much sample loss Using a neuropeptide standard

bradykinin sample loss is reduced over two orders of magnitude with and without undigested

protein present The presence of bovine serum albumin (BSA) undigested protein and the

bradykinin standard lends evidence that sample loss occurs because of the MWCO and not the

presence of the protein Additionally a BSA tryptic digestion is presented where twenty-seven

tryptic peptides are identified from MALDI mass spectra after enriching with methanol while

only two tryptic peptides are identified after the standard MWCO protocol The strategy

presented in Chapter 7 greatly enhances recovery from MWCO separation for sub-microg peptide

samples

10

References

1 Tanaka K Waki H Ido Y Akita S Yoshida Y Yoshida T Matsuo T Protein and polymer analyses up to mz 100 000 by laser ionization time-of-flight mass spectrometry Rapid Communications in Mass Spectrometry 1988 2 (8) 151-153

2 Fenn J B Mann M Meng C K Wong S F Whitehouse C M Electrospray ionization for mass spectrometry of large biomolecules Science 1989 246 (4926) 64-71

3 Domon B Aebersold R Mass spectrometry and protein analysis Science 2006 312 (5771) 212-7

4 Whitehouse C M Dreyer R N Yamashita M Fenn J B Electrospray interface for liquid chromatographs and mass spectrometers Anal Chem 1985 57 (3) 675-9

5 Wilm M Mann M Analytical properties of the nanoelectrospray ion source Anal Chem 1996 68 (1) 1-8

6 Karas M Bahr U Dulcks T Nano-electrospray ionization mass spectrometry addressing analytical problems beyond routine Fresenius J Anal Chem 2000 366 (6-7) 669-76

7 Gulpinar M Yegen B The physiology of learning and memory Role of peptides and stress Curr Protein Pept Sci 2004 5 (6) 457-473

8 Ogren S O Kuteeva E Elvander-Tottie E Hokfelt T Neuropeptides in learning and memory processes with focus on galanin In Eur J Pharmacol 2010 Vol 626 pp 9-17

9 Strand F Neuropeptides general characteristics and neuropharmaceutical potential in treating CNS disorders Prog Drug Res 2003 61 1-37

10 Li W Chang M Peng Y-L Gao Y-H Zhang J-N Han R-W Wang R Neuropeptide S produces antinociceptive effects at the supraspinal level in mice Regul Pept 2009 156 (1-3) 90-95

11 Wu C-H Tao P-L Huang E Y-K Distribution of neuropeptide FF (NPFF) receptors in correlation with morphine-induced reward in the rat brain Peptides 2010 31 (7) 1374-1382

12 Tsutsui K Bentley G E Kriegsfeld L J Osugi T Seong J Y Vaudry H Discovery and evolutionary history of gonadotrophin-inhibitory hormone and kisspeptin new key neuropeptides controlling reproduction J Neuroendocrinol 2010 22 (7) 716-727

13 Piper D C Upton N Smith M I Hunter A J The novel brain neuropeptide orexin-A modulates the sleep-wake cycle of rats Eur J Neurosci 2000 12 (2) 726-730

14 Tsuneki H Wada T Sasaoka T Role of orexin in the regulation of glucose homeostasis - Tsuneki - 2009 - Acta Physiologica - Wiley Online Library Acta Physiol 2010

15 Kageyama H Takenoya F Shiba K Shioda S Neuronal circuits involving ghrelin in the hypothalamus-mediated regulation of feeding Neuropeptides 2010 44 (2) 133-138

16 Sweedler J V Li L Rubakhin S S Alexeeva V Dembrow N C Dowling O Jing J Weiss K R Vilim F S Identification and characterization of the feeding circuit-activating peptides a novel neuropeptide family of aplysia J Neurosci 2002 22 (17) 7797-7808

11

17 Jensen J Regulatory peptides and control of food intake in non-mammalian vertebrates Comp Biochem Physiol Part A Mol Integr Physiol 2001 128 (3) 469-477

18 Billimoria C P Li L Marder E Profiling of neuropeptides released at the stomatogastric ganglion of the crab Cancer borealis with mass spectrometry J Neurochem 2005 95 (1) 191-199

19 Cape S S Rehm K J Ma M Marder E Li L Mass spectral comparison of the neuropeptide complement of the stomatogastric ganglion and brain in the adult and embryonic lobster Homarus americanus J Neurochem 2008 105 (3) 690-702

20 Cruz Bermuacutedez N D Fu Q Kutz Naber K K Christie A E Li L Marder E Mass

spectrometric characterization and physiological actions of GAHKNYLRFamide a novel FMRFamide‐like peptide from crabs of the genus Cancer J Neurochem 2006 97 (3) 784-799

21 Grashow R Brookings T Marder E Reliable neuromodulation from circuits with variable underlying structure Proc Natl Acad Sci USA 2009 106 (28) 11742-11746

22 Ma M Szabo T M Jia C Marder E Li L Mass spectrometric characterization and physiological actions of novel crustacean C-type allatostatins Peptides 2009 30 (9) 1660-1668

23 Chen R Hui L Cape S S Wang J Li L Comparative Neuropeptidomic Analysis of Food Intake via a Multi-faceted Mass Spectrometric Approach ACS Chem Neurosci 2010 1 (3) 204-214

24 Li L Sweedler J V Peptides in the brain mass spectrometry-based measurement approaches and challenges Annu Rev Anal Chem 2008 1 451-483

25 Ma M Wang J Chen R Li L Expanding the crustacean neuropeptidome using a multifaceted mass spectrometric approach J Proteome Res 2009 8 (5) 2426-2437

26 Slattery M G Heideman W Coordinated regulation of growth genes in Saccharomyces cerevisiae Cell Cycle 2007 6 (10) 1210-9

12

Figure 1 Capabilities and performances of certain mass analyzers Check marks indicate

availability check marks in parentheses indicate optional + ++ and +++ indicate possible or

moderate goodhigh and excellentvery high respectively Adapted with permission from

reference 3

13

Figure 2 Synthesis processing and release of neuropeptides (a) Cartoon showing two

interacting neurons (b) The synthesis of neuropeptides in the neuronal organelles and their

transport down the axon to the presynaptic terminal is depicted (c) Shows neuropeptide release

and interaction with the dendrite of the post-synaptic cell (Adapted from PhD thesis by Dr

Stephanie Cape)

14

Figure 3 Schematic showing location of the neuronal tissues being analyzed in the MS studies

of neuropeptides in Callinectes sapidus The SGs (sinus glands located in the eyestalks of the

crab) and the POs (pericardial organs located in the chamber surrounding the heart) release

neurohormones into the hemolymph (crab circulatory fluid) The brain is near the STNS

(stomatogastric nervous system neural network that controls the motion of the gut and foregut)

which has direct connections to the STG (stomatogastric ganglion) The STG is located in an

artery so it is continually in contact with hemolymph (Adapted from PhD thesis by Dr Robert

Sturm)

15

Chapter 2

Mass Spectrometry-based Proteomics and Peptidomics for Biomarker

Discovery and the Current State of the Field

Adapted from ldquoMass spectrometry-based proteomics and peptidomics for systems biology and

biomarker discoveryrdquo Cunningham R Ma D Li L Frontiers in Biology 2012 7(4) 313-35

16

Abstract

The scientific community has shown great interest in the field of mass spectrometry-based

proteomics and peptidomics for its applications in biology Proteomics technologies have

evolved to produce large datasets of proteins or peptides involved in various biological and

disease progression processes producing testable hypothesis for complex biological questions

This review provides an introduction and insight to relevant topics in proteomics and

peptidomics including biological material selection sample preparation separation techniques

peptide fragmentation post-translation modifications quantification bioinformatics and

biomarker discovery and validation In addition current literature and remaining challenges and

emerging technologies for proteomics and peptidomics are presented

17

Introduction

The field of proteomics has seen a huge expansion in the last two decades Multiple factors have

contributed to the rapid expansion of this field including the ever evolving mass spectrometry

instrumentation new sample preparation methods genomic sequencing of numerous model

organisms allowing database searching of proteomes improved quantitation capabilities and

availability of bioinformatic tools The ability to investigate the proteomes of numerous

biological samples and the ability to generate future hypothesis driven experiments makes

proteomics and biomarker studies exceedingly popular in biological studies today In addition

the advances in post-translational modification (PTM) analysis and quantification ability further

enhance the utility of mass spectrometry (MS)-based proteomics A subset of proteomics

research is devoted to profiling and quantifying neurologically related proteins and endogenous

peptides which has progressed rapidly in the past decade This review provides a general

overview as outlined in Figure 1 of proteomics technology including methodological and

conceptual improvements with a focus on recent studies and neurological biomarker studies

Biological Material Selection

The choice of biological matrix is an important first step in any proteomics analysis The

ease of sample collection (eg urine plasma or saliva) versus usefulness or localization of

sample (eg specific tissue or proximity fluid) needs to be evaluated early on in a study design

Plasma derived by centrifugation of blood to remove whole cells is a very popular

choice in proteomics due to the high protein content (~65 mg ml1) and the ubiquitous nature of

blood in the body and the ability to obtain large sample amounts or various time points without

the need to sacrifice the animal or to perform invasive techniques Plasma is centrifuged

18

immediately after sample collection unlike serum where coagulation needs to occur first To

obtain plasma blood is collected in a tube with an anticoagulant added (ETDA heparin or

citrate) and centrifuged but previous reports have shown variable results when heparin has been

used as an anticoagulant2 Human Proteome Organization (HUPO) specifically recommends the

anticoagulants EDTA or citrate to treat plasma3 4

One of the primary concerns with plasma is

degradation of the protein content via endogenous proteases found in the sample5 One way to

address this problem is the use of protease inhibitors In addition freezethaw cycles need to be

minimized to prevent protein degradation and variability6 7

Plasma proteomics has seen

extensive coordinated efforts to start assessing the diagnostic needs using plasma8 HUPO also

has established a public human database for plasma and serum proteomics from 35 collaborating

labratories9 Large dynamic range studies have been performed on plasma with a starting sample

amount of 2625 microl (1575 mg) resulting in 3654 proteins identified with a sub 5 false

discovery rate10

The large dynamic range spanning across eleven orders of magnitude as visualized in

Figure 2 is one of the biggest obstacles in plasma proteomics Figure 2 also shows that as lower

abundance proteins are investigated the origins of those identified proteins are more diverse than

the most abundant proteins Recent mining of the plasma proteome showed an ability to search

for disease biomarker applications across seven orders of magnitude In addition the tissue of

origin for the identified plasma proteins were identified and its origin was more diverse as the

protein concentration decreased11

Plasma has been used as a source for biomarker studies such

as colorectal cancer12 13

cardiovascular disease14

and abdominal aortic aneurysm15

Even

though the blood brain barrier prevents direct blood to brain interaction neurological disorders

such as Alzheimerrsquos disease (AD) have had their proteomes studied using plasma16

19

An alternative sample derived from blood is serum which is plasma allowed to coagulate

instead of adding anti-coagulates The time for coagulation is usually 30 minutes and during that

time significant and random degradation from endogenous proteases can occur The additional

variability caused from the coagulation process can change the concentration of multiple

potentially valuable biomarkers As biodiversity between samples or organisms is a challenging

endeavor additional sample variability due to serum generation may be undesirable but serum is

still currently being used for biomarker disease studies17

Serum has been used to compare the

proteome differences in neurological diseases such as AD Parkinsonrsquos disease and amyotrophic

lateral sclerosis and a review can be found elsewhere discussing the subject18

Cerebrospinal fluid (CSF) has a long history as a surrogate biopsy of brain or spinal cord

in evaluating diseases of the central nervous system and has been used for studies in neurological

disorders due to being a rich source of neuro-related proteins and peptides19

The protein

composition of the most abundant proteins in CSF is well defined and numerous studies exist to

broaden the proteins identified20-22

CSF has an exceedingly low protein content (~04 μgμL)

which is ~100 times lower than serum or plasma and over 60 of the total protein content in

CSF consists of a single protein albumin23-25

In addition the variable concentrations of proteins

span up to twelve orders of magnitude further complicating analysis and masking biologically

relevant proteins to any given study26

One of the highest number of identified proteins is from

Schutzer et al with 2630 non-redundant proteins from 14 mL of pooled human CSF This study

involved the removal of highly abundant proteins by performing IgY-14 immunodepletion

followed by two dimensional (2D) liquid chromatography (LC) separation27

Studies have also

been performed to characterize individual biomarkers or complex patterns of biomarkers in

various diseases in the CSF28 29

One potential pitfall of CSF proteomic analysis is

20

contamination from blood which can be identified by counting red blood cells present or

examining surrogate markers from blood contamination other than hemoglobin such as

peroxiredoxin catalase and carbonic anhydrase30

A proof of principle CSF peptidomics study

identified numerous endogenous peptides associated with the central nervous system which can

be used as a bank for neurological disorder studies31

Numerous recent reports highlighted the

utility of CSF analysis for biomarker studies in AD32 33

medulloblastoma34

both post-mortem

and ante-mortem35

Cellular lysates offer the distinct advantage to work with a cell line yeast or bacteria

with large amounts of proteins available for analysis36 37

with Saccharomyces cerevisiae being

the most common cell lysate38 39

Other cell lines are also used including HeLa40

and E coli41

The ability to obtain milligrams of proteins easily to scale up experiments without animal

sacrifice offers a clear advantage in biological sample selection Current literature supports

cellular lysate as a valued and sought after source of proteins for large scale proteomics

experiments because of the ability to assess treatments conditions and testable hypotheses42-44

Cellular lysate from rat B104 neuroblastoma cell line was used as an in vitro model for cerebral

ischemia and showed abundance changes in multiple proteins involved in various neurological

disorders45

Other Sources of Biological Samples

Urine

The urine proteome appears to be another attractive reservoir for biomarker discovery

due to the relatively low complexity compared with the plasma proteome and the noninvasive

collection of urine Urine is often considered as an ideal source to identify biomarkers for renal

21

diseases due to the fact that in healthy adults approximately 70 of the urine proteome originate

from the kidney and the urinary tract 46

thus the use of urine to identify neurological disorders is

neglected However strong evidence have shown that proteins that are associated with

neurodegenerative diseases can be excreted in the urine47-49

indicating the application of urine

proteomics could be a useful approach to the discovery of biomarkers and development of

diagnostic assays for neurodegenerative diseases However the current view of urine proteome

is still limited by factors such as sample preparation techniques and sensitivity of the mass

spectrometers There has been a tremendous drive to increase the coverage of urine proteome

In a recent study Court et al compared and evaluated several different sample preparation

methods with the objective of developing a standardized robust and scalable protocol that could

be used in biomarkers development by shotgun proteomics50

In another study Marimuthu et al

reported the largest catalog of proteins in urine identified in a single study to date The

proteomic analysis of urine samples pooled from healthy individuals was conducted by using

high-resolution Fourier transform mass spectrometry A total of 1823 proteins were identified

of which 671 proteins have not been previously reported in urine 51

Saliva

For diagnosis purposes saliva collection has the advantage of being an easy and non-

invasive technique The recent studies on saliva proteins that are critically involved in AD and

Parkinsonrsquos diseases suggested that saliva could be a potentially important sample source to

identify biomarkers for neurodegenerative diseases Bermejo-Pareja et al reported the level of

salivary Aβ42 in patients with mild AD was noticeably increased compared to a group of

controls 52

In another study Devic et al identified two of the most important Parkinsons

22

disease related proteinsmdash α-synuclein (α-Syn) and DJ-1 in human saliva 53

They observed that

salivary α-Syn levels tended to decrease while DJ-1 levels tended to increase in Parkinsons

disease The published results from this study also suggest that α-Syn might correlate with the

severity of motor symptoms in Parkinsons disease Due in part to recent advancements in MS-

based proteomics has provided promising results in utilizing saliva to explore biomarkers for

both local and systemic diseases 54 55

the further profiling of saliva proteome will provide

valuable biomarker discovery source for neurodegenerative diseases

Tissue

Compared to body fluids such as plasma serum and urine where the proteomic analysis

is complicated by the wide dynamic range of protein concentration the analysis of tissue

homogenates using the well-established and conventional proteomic analysis techniques has the

advantage of reduced dynamic range However the homogenization and extraction process may

suffer from the caveat that spatial information is lost which would be inadequate for the

detection of biomarkers whose localization and distribution play important roles in disease

development and progression Matrix-assisted laser desorptionionization (MALDI) imaging

mass spectrometry (IMS) is a method that allows the investigation of a wide range of molecules

including proteins peptides lipids drugs and metabolites directly in thin slices of tissue 56-59

Because this technology allows for identification and simultaneous localization of biomolecules

of interests in tissue sections linking the spatial expression of molecules to histopathology

MALDI-IMS has been utilized as a powerful tool for the discovery of new cancer biomarker

candidates as well as other clinical applications60 61

The utilization of MALDI-IMS for human

or animal brain tissue to identify or map the distribution of molecules related to

neurodegenerative diseases were also recently reported62 63

23

Secretome

There has been an increasing interest in the study of proteins secreted by various cells

(the secretomes) from tissue-proximal fluids or conditioned media as a potential source of

biomarkers Cell secretomes mainly comprise proteins that are secreted or are shed from the cell

surface and these proteins can play important role in both physiological processes (eg cell

signaling communication and migration) and pathological processes including tumor

angiogenesis differentiation invasion and metastasis In particular the study of cancer cell

secretomes by MS based proteomics has offered new opportunities for cancer biomarker

discovery as tumor proteins may be secreted or shed into the bloodstream and could be used as

noninvasive biomarkers The latest advances and challenges of sample preparation sample

concentration and separation techniques used specifically for secretome analysis and its clinical

applications in the discovery of disease specific biomarkers have been comprehensively

reviewed64 65

Here we only highlight the proteomic profiling of neural cells secretome that has

been applied to neurosciences for a better understanding of the roles secreted proteins play in

response to brain injury and neurological diseases The LC-MS shotgun identification of

proteins released by astrocytes has been recently reported66-68

In these studies the changes

observed in the astrocyte secretomes induced by inflammatory cytokines or cholinergic

stimulation were investigated6667

Alternatively our group performed 2D-LC separation and

included cytoplasmic protein extract from astrocytes as a control to identify cytoplasmic protein

contaminants which are not actively secreted from cells68

Sample Preparation

24

Proteomic analysis and biomarker discovery research in biological samples such as body

fluids tissues and cells are often hampered by the vast complexity and large dynamic range of

the proteins Because disease identifying biomarkers are more likely to be low-abundance

proteins it is imperative to remove the high-abundance proteins or apply enrichment techniques

to allow detection and better coverage of the low-abundance proteins for MS analysis Several

strategies including depletion and protein equalizer approach have been used during sample

preparation to reduce sample complexity69 70

and the latest advances of these methods have been

reviewed by Selvaraju et al 71

Alternatively the complexity of biological samples can be

reduced by capturing a specific subproteome that may have the biological information of interest

The latter strategy is especially useful in the biomarker discovery where the changes in the

proteome are not solely reflected through the concentration level of specific proteins but also

through changes in the post-translational modifications (PTMs) Here we will mainly discuss

the enrichment of phosphoproteinpeptides glycoproteinpeptides and sample preparation for

peptidomics and membrane proteins

Phosphoproteomics

Phosphorylation can act as a molecular switch on a protein by turning it on or off within

the cell It is thought that up to 30 of the proteins can be phosphorylated72

and it plays

significant roles in such biological processes as the cell cycle and signal transduction73

Currently tens of thousands of phosphorylation sites can be proposed using analytical methods

available today74 75

The amino acids that are targeted for phosphorylation studies are serine

threonine and tyrosine with the abundance of detection decreasing typically in that order Other

25

amino acids have been reported to be phosphorylated but traditional phosphoproteomics

experiments ignore these rare events76

In a typical large-scale phosphoproteomics experiment the sample size is usually in

milligram amounts to account for the low stoichiometry of phosphorylated proteins The large

amount of protein is then digested typically with trypsin but alternatively experiments have

been performed with Lys-C digestion to produce large enzymatic peptides The larger peptides

produced from Lys-C render higher charged peptides during electrospray ionization (ESI) and

allow improved electron-based fragmentation to determine specific sites of phosphorylation77

From the pool of peptides phosphopeptides must be enriched otherwise they will be masked by

the vast number and higher ionization efficiency of non-phosphorylated peptides The two most

common enrichment techniques are immobilized metal ion affinity chromatography (IMAC) and

metal oxide affinity chromatography (MOAC) TiO2 being the most common oxide used for this

purpose A recent study reported that phosphorylation of neuronal intermediate filament proteins

in neurofibrillary tangles are involved in Alzheimerrsquos disease78

Glycoproteomics

Protein glycosylation is one of the most common and complicated forms of PTM Types

of protein glycosylation in eukaryotes are categorized as either N-linked where glycans are

attached to asparagine residues in a consensus sequence N-X-ST (X can be any amino acid

except proline) via an N-acetylglucosamine (N-GlcNAc) residue or the O-glycosylation where

the glycans are attached to serine or threonine Glycosylation plays a fundamental role in

numerous biological processes and aberrant alterations in protein glycosylation are associated

with neurodegenerative disease states such as Creutzfeld-Jakob Disease (CJD) and AD79 80

26

Due to the low abundance of glycosylated forms of proteins compared to non-glycosylated

proteins it is essential to enrich glycoproteins or glycopeptides in complex biological samples

prior to MS analysis Two of the most common enrichment methods used in glycoproteomics are

lectin affinity chromatography (LAC) and hydrazide chemistry The detailed methodologies of

LAC hydrazide chemistry and other enrichment methods in glycoproteomics have been

extensively reviewed in the past81 82

In particular LAC is of great interest in studies of

glycosylation alterations as markers of AD and other neurodegenerative diseases due to its recent

applications in brain glycoproteomics83

Our group has utilized multi-lectin affinity

chromatography containing concanavalin A (ConA) and wheat germ agglutinin (WGA) to enrich

N-linked glycoproteins in control and prion-infected mouse plasma84

This method enabled us to

identify a low-abundance glycoprotein serum amyloid P-component (SAP) PNGase F digestion

and Western blotting validation confirmed that the glycosylated form of SAP was significantly

elevated in mice with early prion infection and it could be potentially used as a diagnostic

biomarker for prion diseases

Membrane proteins

Membrane proteins play an indispensable role in maintaining cellular integrity of their

structure and perform many important functions including signaling transduction intercellular

communication vesicle trafficking ion transport and protein translocationintegration85

However due to being relatively insoluble in water and low abundance it is challenging to

analyze membrane proteins by traditional MS-based proteomics approaches Numerous efforts

have been made to improve the solubility and enrichment of membrane proteins during sample

preparation Several comprehensive studies recently covered the commonly used technologies in

27

membrane proteomics and different strategies that circumvent technical issues specific to the

membrane 86-90

Recently Sun et al reported using 1-butyl-3-methyl imidazolium

tetrafluoroborate (BMIM BF4) an ionic liquid (IL) as a sample preparation buffer for the

analysis of integral membrane proteins (IMPs) by microcolumn reversed phase liquid

chromatography (μRPLC)-electrospray ionization tandem mass spectrometry (ESI-MSMS)

The authors compared BMIM BF4 to the other commonly used solvents such as sodium dodecyl

sulfate methanol Rapigest and urea but they found that the number of identified IMPs from rat

brain extracted by ILs was significantly increased The improved identifications could be due to

the fact that BMIM BF4 has higher thermal stability and thus offered higher solubilizing ability

for IMPs which provided better compatibility for tryptic digestion than traditionally used solvent

systems38

In addition to characterization of membrane proteome the investigation of PTMs on

membrane proteins is equally important for characterization of disease markers and drug

treatment targets Phosphorylations and glycosylations are the two most important PTMs for

membrane proteins In many membrane protein receptors the cytoplasmic domains can be

phosphorylated reversibly and function as signal transducers whereas the receptor activities of

the extracellular domains are mediated via N-linked glycosylation Wiśniewski et al provides an

informative summary on recent advances in proteomic technology for the identification and

characterization of these modifications91

Our group has pioneered the development of detergent

assisted lectin affinity chromatography (DALAC) for the enrichment of hydrophobic

glycoproteins using mouse brain extract92

We compared the binding efficiency of lectin affinity

chromatography in the presence of four commonly used detergents and determined that under

certain concentrations detergents can minimize the nonspecific bindings and facilitate the

elution of hydrophobic glycoproteins In summary NP-40 was suggested as the most suitable

28

detergent for DALAC due to the higher membrane protein recovery glycoprotein recovery and

membranous glycoprotein identifications compared to other detergents tested In a different

study on mouse brain membrane proteome Zhang et al reported an optimized protocol using

electrostatic repulsion hydrophilic interaction chromatography (ERLIC) for the simultaneous

enrichment of glyco- and phosphopeptides from mouse brain membrane protein preparation93

Using this protocol they successfully identified 544 unique glycoproteins and 922 glycosylation

sites which were significantly higher than those using the hydrazide chemistry method

Additionally a total of 383 phosphoproteins and 915 phosphorylation sites were identified

suggesting that the ERLIC separation has the potential for simultaneous analysis of both glyco-

and phosphoproteomes

Peptidomics

Peptidomics can be loosely defined as the study of the low molecular weight fraction of

proteins encompassing biologically active endogenous peptides protein fragments from

endogenous protein degradation products or other small proteins such as cytokines and signaling

peptides Studies can involve endogenous peptides94

peptidomic profiling33

and de novo

sequencing of peptides95 96

Neuropeptidomics focuses on biologically active short segments of

peptides and have been investigated in numerous species including Rattus97 98

Mus musculus99

100 Bovine taurus

101 Japanese quail diencephalon

102 and invertebrates

103-106 The isolation of

peptides is typically performed through molecular weight cut-offs from either biofluids such as

CSF plasma or tissue extracts If the protein and peptide content is high such as for tissue or cell

lysates protein precipitation can be done via high organic solvents and the resulting supernatant

can be analyzed for extracted peptides where extraction solvent and conditions could have a

29

significant effect on what endogenous peptides are extracted from tissue107

A comparative

peptidomic study of human cell lines highlights the utility of finding peptide signatures as

potential biomarkers108

A thorough review of endogenous peptides and neuropeptides is beyond

the scope of this review and an excellent review on this topic is available elsewhere109

Fractionation and Separation

The mass spectrometer has a limited duty cycle and data dependent analysis can only

scan a limited number of mz peaks at any given time In addition significant ion suppression

can occur if there is a difference in concentration between co-eluting peptides or if too many

peptides co-elute Therefore one of the biggest challenges in biomarker discovery is the

complexity of the sample and the presence of high-abundance proteins in body fluids such as

CSF serum and plasma In addition to the removal of the most abundant proteins by

immunodepletion the reduction of the complexity of the sample by further fractionation is

indispensable to facilitate the characterization of unidentified biomarkers from the low

abundance proteins Traditionally used techniques for complex protein analysis include gel

based fractionation methods such as two-dimensional gel electrophoresis (2D-GE) and its

variation two-dimensional differential gel electrophoresis (2D-DIGE) or non-gel based such as

one- or multidimensional liquid chromatography (LC) and microscale separation techniques

such as capillary electrophoresis (CE)

2D-GE MS has been widely used as a powerful tool to separate proteins and identify

differentially expressed proteins ever since 2D gels were coupled to mass spectrometry In 2D-

GE MS thousands of proteins can be separated on a single gel according to pI and molecular

weight Individual protein spots that show differences in abundance between different samples

30

can then be excised from the gel digested into peptides and analyzed by MALDI MS or by

liquid chromatography tandem mass spectrometry (LC-MSMS) for protein identification The

introduction of 2D-DIGE adds a quantitative strategy to gel electrophoresis by enabling multiple

protein extracts to be separated on the same 2D gel thus providing comparative analysis of

proteomes in complex samples In 2D-DIGE protein extracts from two different conditions and

an internal standard can be labeled with fluorescent dyes for example Cy3 Cy5 and Cy2

respectively prior to two-dimensional gel electrophoresis Compared to traditional 2D-GE 2D-

DIGE provides the clear advantage of overcoming the inter-gel variation problem 110

Proteomic

profiling of CSF by 2D-GE and 2D-DIGE has led to the identification of putative biomarkers in

multiple neurological disorders For example Brechlin et al reported an optimized 2-DIGE

protocol profiled CSF from 36 CJD patients The applicability of their approach was proven by

the detection of known CJD biomarkers such as 14-3-3 protein neuron-specific enolase lactate

dehydrogenase and other proteins that are potentially relevant to CJD 111

In another study to

identify novel CSF biomarkers for multiple sclerosis CSF from 112 multiple sclerosis patients

and control individuals were analyzed by 2D-GE MS for comparative proteomics Ten potential

multiple sclerosis biomarkers were selected for validation by immunoassay 112

These

methodologies sample preparation techniques and applications of 2D-DIGE in

neuroproteomics were reviewed by Diez et al113

Although 2D gel provides excellent resolving

power and capability to visualize abundance changes there are some limitations to the method

For example gel based separation is not suitable for low abundance proteins extremely basic or

acidic proteins very small or large proteins and hydrophobic proteins114 115

Complementary to gel-based approaches shotgun proteomics coupled to LC have

become increasingly popular in proteomic research because they are reproducible highly

31

automated and capable of detecting low abundance proteins Furthermore another advantage of

LC-MS shotgun proteomics is the suitability for isotope labeling for protein quantification which

is reviewed in a later section In shotgun proteomics a protein mixture is digested and resulting

peptides are separated by LC prior to tandem MS fragmentation to identify different proteins by

peptide sequencing The most common separation for shotgun proteomics peptidomics or top-

down proteomics experiments use low-pH reversed phase (RP) C18 C8 or C4 columns RPLC

is well established which provides high resolution desalts the sample which can interfere with

ionization and the mobile phase is compatible with ESI Nanoscale C18 columns allow for

separation and introduction of sub microgram samples If larger amounts of sample are

available two dimensional separations are usually preferred to greatly enhance the coverage of

the investigated proteome which will be discussed in depth later It is preferable to have an

orthogonal separation method and since RP separates via hydrophobicity strong cation exchange

(SCX) was the original choice due to its separation by charge MudPIT (multidimensional

protein identification technology) usually refers to the use of SCX as the first phase of separation

and is a well-established platform116

SCX has the advantage over RP separation technologies to

effectively remove interfering detergents from the sample SCX separation is not based solely

off charge and hydrophobicity contributes to elution therefore a small amount of organic

modifier usually 10-15 is added to lessen the hydrophobicity effects117

The addition of

organic modifiers needs to be minimized otherwise binding to the C18 trap cartridge or C18

column will be reduced if performed on-line SCX can be used for PTMs and offers specific

applications for proteomic studies and an excellent current review is offered on this subject

elsewhere118

An alternative MudPIT separation scheme employing high pH RPLC as the first

phase of separation and low pH RPLC in the second dimension (RP-RP) has been successfully

32

applied to the proteomic analysis of complex biological samples119 120

The advantage of using

RP as the first dimension is the higher resolution for separation and better compatibility with

down-stream MS detection by eliminating salt Song et al reported a phosphoproteome analysis

based on this 2D RP-RP coupling scheme121

Hydrophilic interaction chromatography (HILIC) employs distinct separation modality

where the retention of peptides is increased with increasing polarity122

The loading of sample is

done by high organic and eluted by increasing the percentage of the aqueous phase or polarity of

the mobile phase opposite from RPLC thus establishing orthogonality of the two separation

modes123

HILIC has quickly become a very useful method and is actively used for proteomic

experiments124

for increased sensitivity125

phosphoproteomics126

glycoproteins127

and

quantification studies128

An alternative and modification to HILIC is ERLIC which adds an

additional mode of separation by electrostatic attraction An earlier study using ERLIC

demonstrated the ability to separate phosphopeptides from non-phosphorylated peptides at

pH=2129

A recent study looking into changes in the phosphoproteome of Marekrsquos Disease

applied ERLIC to chicken embryonic fibroblast lysate identifying only 13 phosphopeptides

out of all the identified peptides Due to the lack of isolation of phosphopeptides from ERLIC

the investigators performed immobilized metal affinity chromatography (IMAC) enrichment on

the fractions increasing identification of phosphopeptides over 50 fold130

A comparative study

of ERLIC to HILIC and SCX following TiO2 phospho-enrichment reported that

SCXgtERLICgtHILIC for phosphopeptide identifications126

Recent developments in instrumentation to combine LC with ion mobility spectrometry

(IMS) and MS (LC-IMS-MS) offered more advantages than conventional LC due to the rapid

high-resolution separations of analytes based on their charge mass and shape as reflected by

33

mobility in a given buffer gas The mobility of an ion in a buffer gas is determined by the ionrsquos

charge and its collision cross-section with the buffer gas The methodologies of IMS separations

and the application of LC-IMS-MS for the proteomics analysis of complex systems including

human plasma have been reviewed by Clemmerrsquos group131-133

They proposed a method that

employs intrinsic amino acid size parameters to obtain ion mobility predictions which can be

used to rank candidate peptide ion assignments and significantly improve peptide identification

134

Although 2D gel and LC are routinely used as separation techniques in MS-based

proteomics capillary electrophoresis (CE) has received increasing attention as a promising

alternative due to the fast and high-resolution separation it offers CE has a wide variety of

operation modes among which capillary zone electrophoresis (CZE) and capillary isoelectric

focusing (CIEF) have the greatest potential applications in MS-based proteomics thus will be

highlighted here CZE separates analytes by their charge-to-size ratios in buffers under a high

electrical field and is often used as the final dimension prior to MS analysis while the separation

feature of CIEF is based on isoelectric point and this technique is more suitable to be used as the

first dimension separation Detailed description of different CEndashMS interfaces sample

preconcentration and capillary coating to minimize analyte adsorption could be found in several

reviews135-141

CE technique is complementary to conventional LC in that it is suitable for the

analysis of polar and chargeable compounds Dovichirsquos group conducted proteomic analysis of

the secreted protein fraction of Mycobacterium marinum which has intermediate protein

complexity142

The tryptic digests were either analyzed by UPLC-ESI-MSMS in triplicates or

prefactionated by RPLC followed by CZE-ESI-MSMS It was demonstrated that the two

methods identified similar numbers of peptides and proteins within similar analysis times

34

However CZE-ESI-MSMS analysis of the prefractionated sample tended to identify more

peptides that are basic and have lower mz values than those identified by UPLC-ESI-MSMS

This analysis also presented the largest number of protein identifications by using CE-MSMS

suggesting the effectiveness of prefractionation of complex samples by LC method prior to CZE-

ESI-MSMS The use of CIEF as the first dimension of separation provides both sample

concentration and excellent resolving power The combination of CIEF and RPLC separation

has been applied to the proteomic analyses where the amount of protein sample is limited and

cannot meet the requirement of minimal load amount for 2D LC-MSMS143 144

So far CE-MS

has been widely applied to the proteomic analysis of various biological samples such as urine145

146 CSF

147 blood

148 frozen tissues

149 and the formalin-fixed and paraffin-embedded (FFPE)

tissue samples150

The recent CEndashMS applications to clinical proteomics have been summarized

in several reviews135 151 152

Protein Quantification

In 2D gel electrophoresis the quantitative analysis of protein mixtures is performed on

the gel by comparing the intensity of the protein stain The development of 2D-DIGE eliminated

the gel-to-gel variation and greatly improved the quantitative capability and reliability of 2D gel

methodology110

However the accuracy of 2D gel based protein quantification suffers from the

limitations that a seemingly single gel spot often contains multiple proteins and the difficulty of

detecting proteins with extreme molecular weights and pI values as well as highly hydrophobic

proteins such as membrane proteins Therefore non-gel based shotgun proteomics technology is

more suitable for accurate and large-scale protein identification and quantification in complex

samples Briefly the quantification in non-gel based shotgun proteomics can be categorized into

35

two major approaches stable isotope labeling-based and label-free methods The common

strategies for quantitative proteomic analysis are reviewed and summarized in Table 1

Isotope labeling methods

Because stable isotope-labeled peptides have the same chemical properties as their

unlabeled counterparts the two peptides within a mixture should exhibit identical behaviors in

MS ionization The mass difference introduced by isotope labeling enables the detection of a

pair of two distinct peptide masses by MS within the mixture and allowing for the measurement

of the relative abundance differences between two peptides Depending on how isotopes are

incorporated into the protein or peptide these labeling methods can be divided into two groups

In vitro chemical derivatization techniques which incorporate a label or tag into the peptide or

protein during sample preparation metabolic labeling techniques which introduce the isotope

label directly into the organism via isotope-enriched nutrients from food or media

1 In vitro derivatization techniques

There are multiple methods to introduce heavy isotopes into proteins or peptides in vitro

The commonly used strategies include 18

O 16

O enzymatic labeling Isotope-Coded Affinity Tag

(ICAT) Tandem Mass Tags (TMTs) and Isobaric Tags for Relative and Absolute Quantification

(iTRAQ) The 18

O labeling method enzymatically cleaves the peptide bond with trypsin in the

presence of 18

O-enriched H2O and introduces 4-Da mass shift in the tryptic peptides153

The

advantages of this method include 18

O-enriched water is extremely stable tryptic peptides will

be labeled with the same mass shift secondary reactions inherent to other chemical labeling can

be avoided Conversely widespread use of 18

O-labeling has been hindered due to the difficulty

of attaining complete 18

O incorporation and the lack of robustness154 155

Currently ICAT

36

TMTs and iTRAQ methods are extensively used in quantitative proteomics In ICAT cysteine

residues are specifically derivatized with a reagent containing either zero or eight deuterium

atoms as well as a biotin group for affinity purification of cysteine-containing peptides156 157

The advantage of ICAT is that the affinity purification via biotin moiety can facilitate the

detection of low-abundance cysteine-containing peptides In addition the mass difference

introduced by labeling increases mass spectral complexity with quantification from the different

precursor masses done by MS and peptide identification being achieved through tandem MS

(MSMS) This added complexity from different peptide masses was addressed by using isobaric

labeling methods such as TMTs and iTRAQ 158 159

where the same peptides in different samples

are isobaric after tagging and appear as single mz in MS scans thus enhancing the peptide limit

of detection and reducing the MS scan complexity Isobaric labeling reagents are composed of a

primary amine reactive group and an isotopic reporter group linked by an isotopic balancer group

for the normalization of the total mass of the tags The reporter group serves for quantification

purpose since it is cleaved during collision-induced dissociation (CID) to yield a characteristic

isotope-encoded fragment Moreover isobaric labeling methods allow the comparison of

multiple samples within a single experiment Recently a 6-plex version of TMTs was

reported160

and iTRAQ enables up to eight samples to be labeled and relatively quantified in a

single experiment161

8-plex iTRAQ reagents have been used for the comparison of complicated

biological samples such as CSF in the studies of neurodegenerative diseases 162

Recently our

group developed a novel N N-dimethyl leucine (DiLeu) 4-plex isobaric tandem mass (MS2)

tagging reagents with high quantitation efficacy DiLeu has the advantage of synthetic simplicity

and greatly reduced synthesis cost compared to TMTs and iTRAQ163

Xiang et al demonstrated

that DiLeu produced comparable iTRAQ ability for protein sequence coverage (~43) and

37

quantitation accuracy (lt15) for tryptically digested proteins More importantly DiLeu

reagents could promote enhanced fragmentation of labeled peptides thus allowing more

confident peptide and protein identifications

2 In Vivo Metabolic Labeling

Metabolic processes can also be employed for the incorporation of stable-isotope labels

into the proteins or organisms by enriching culture media or food with light or heavy versions of

isotope labels (2H

13C

15N) The advantage of in vivo labeling is that metabolic labeling does

not suffer from incomplete labeling which is an inherent drawback for in vitro derivatization

techniques In addition metabolic labeling occurs from the start of the experiment and proteins

with light or heavy labels are simultaneously extracted thus reducing the error and variability of

quantification introduced during sample preparation The most widely used strategy for

metabolic labeling is known as stable-isotope labeling of amino acids in cell culture (SILAC)

which was introduced by Mann and co-workers164 165

In SILAC one cell population is grown

in normal or light media while the other is grown in heavy media enriched with a heavy

isotope-encoded (typically 13

C or 15

N) amino acid such as arginine or leucine Cells from the

two populations are then combined proteins are extracted digested and analyzed by MS The

relative protein expression differences are then determined from the extracted ion

chromatograms from both the light and heavy peptide forms SILAC has been shown to be a

powerful tool for the study of intracellular signal transduction In addition this technique has

recently been applied to the quantitative analysis of phosphotyrosine (pTyr) proteomes to

characterize pTyr-dependent signaling pathways166 167

38

Labe-free quantification

Although various isotope labeling methods have provided powerful tools for quantitative

proteomics several limitations of these approaches are noted Labeling increases the cost and

complexity of sample preparation introduces potential errors during the labeling reaction It also

requires a higher sample concentration and complicates data processing and interpretation In

addition so far only TMTs and iTRAQ allow the comparison of multiple (up to eight) samples

simultaneously The comparison of more than eight samples in a single experiment cannot be

achieved by isotope labeling In order to address these concerns there has been significant

interest in the development of label-free quantitative approaches Current label-free

quantification methods for MS-based proteomics were developed based on the observation that

the chromatographic peak area of a peptide168 169

or frequency of MSMS spectra170

correlating

to the protein or peptide concentration Therefore the two most common label-free

quantification approaches are conducted by comparing (i) area under the curve (AUC) of any

given peptides171 172

or (ii) by frequency measurements of MSMS spectra assigned to a protein

commonly referred to as spectral counting173

Several recent reviews provided detailed and

comprehensive knowledge comparing label-free methods with labeling methods data processing

and commercially available software for label-free quantitative proteomics174-177

Dissociation Techniques

The vast majority of proteomic experiments have proteins or peptides being identified by

two critical pieces of data obtained from the mass spectrometer The first is the precursor ion

identified by its mz which is informative to the mass of the peptide being analyzed The second

is the use of tandem mass spectrometry to fragment or dissociate the precursor ion and record the

39

generated fragment ion pattern to discern the amino acid sequence The three most popular

dissociation or fragmentation techniques for peptides are CID electron-transfer dissociation

(ETD) and high-energy collision dissociation (HCD) A recent study on the human plasma

proteome demonstrated that combined fragmentation techniques enhance coverage by providing

complementary information for identifications CID enabled the greatest number of protein

identifications while HCD identified an additional 25 proteins and ETD contributed an

additional 13 protein identifications178

ETDECD

Electron capture dissociation (ECD) 179

preceded ETD but ECD was developed for use

in a Penning trap for Fourier transform ion cyclotron resonance (FTICR) mass spectrometers

ECD requires the ion of interest to be in contact with near-thermal electrons and for the electron

capture event to occur on the millisecond time scale but the time scale is inadequate for electron

trapping in Paul traps or quadrupoles in the majority of mass spectrometers180

ETD involves a

radical anion like fluoranthene with low electron affinity to be transferred to peptide cation

which results in more uniform cleavage along the peptide backbone The cation accepts an

electron and the newly formed odd-electron protonated peptide undergoes fragmentation by

cleavage of the N-Cα bond which results in fragmentation ions consisting of c- and zbull-type

product ions The uniformed cleavage results in reduced sequence discrimination to labile bonds

such as PTMs and also provides improved sequencing for larger peptides compared to CID181

The realization that larger peptides produced better MSMS quality spectra compared to CID led

to a decision tree analysis strategy where peptide charge states and size determined whether the

precursor peptide would be fragmented with CID or ETD182

One of the main benefits of

ETDECD is the ability to sequence peptides with labile PTMs such as phosphorylation77 183

40

sulfation184

glycosylation185

ubiquitination186

and histone modifications187

ETD also has the

benefit of providing better sequence information on larger neuropeptides when compared to

CID188

However a thorough analysis suggested that CID still yielded more peptideprotein

identifications than ETD in large scale proteoimcs189

HCD

High energy collision dissociation (HCD)190

is an emerging fragmentation technique that

offers improved detection of small reporter ions from iTRAQ-based studies191 192

HCD is

performed at a higher energy in a collision cell instead of an ion trap like CID thus HCD does

not suffer from the low-mass cutoff limitation Furthermore HCD offers enhanced

fragmentation efficiency assisting in MSMS spectra interpretation and protein identification193

A major drawback for HCD is that the spectral acquisition times are up to two-fold longer due to

increased ion requirement for Fourier transform detection in the orbitrap194

HCD has been

reported to increase phosphopeptide identifications over CID74

but in a different study CID was

reported to offer more phosphopeptide identifications over HCD194

Work has also been done to

transfer the decision tree analysis for HCD which basically switches CID with HCD claiming

better quality data determined by higher Mascot scores with more peptide identifications195

MSE

Data dependent acquisition (DDA) is the most commonly used ion selection process in

mass spectrometers for proteomic experiments An alternative process which does not have ion

selection nor switch between MS and MSMS modes is termed MSE MS

E is a data independent

mode and does not require precursor ions of a significant intensity to be selected for MSMS

analysis196

A data independent mode decouples the mass spectrometer choosing which

precursor ions to fragment and when the ions are fragmented MSE works by a low or high

41

energy scan and no ion isolation is occurring The low energy scan is where the precursor ion is

not fragmented and the high energy scan allows fragmentation The resulting mix of precursor

and fragmentation ions is then detected simultaneously197

The data will then need to be

deconvoluted using a proprietary time-aligned algorithm that is discussed elsewhere198

The

continuous data independent acquisition allows multiple MSMS spectra to be collected during

the natural analyte peak broadening observed in chromatography which provides more data

points for AUC label-free quantification In addition lower abundance peptides can be

sequenced as more MSMS spectra are collected throughout the elution of an LC peak allowing

better signal averaging for smaller analyte peak of interest during coelution and reducing

sampling bias in typical DDA experiments where only more abundant peaks can be selected for

fragmentation

A comparison of spiked internal protein standards into a complex protein digest provided

evidence that MSE was comparable to DDA analysis in LC-MS

199 MS

E has been used for label

free proteomics of immunodepleted serum in large scale proteomics samples200

In addition

MSE was performed for the characterization of human cerebellum and primary visual cortex

proteomes Hundreds of proteins were identified including many previously reported in

neurological disorders201

MSE is quickly becoming a versatile data acquisition method recently

used in such studies as cancer cells202

schizophrenia203

and pituitary proteome discovery204

The usefulness of MSE as an unbiased data acquisition method is being assimilated into multiple

proteomics studies including studies involving neurological disorders

Data Analysis

42

One of the major bottlenecks in non-targeted proteomic experiments is how to handle the

enormous amount of data obtained Database searches biostatistical analysis de novo

sequencing PTM validation all have their place and multiple available platforms are available

If the organism being studied has had its genome sequenced databases can be created

with a list of proteins in the FASTA format to be used in database searching There are

numerous database searching algorithms for sequence identification of MSMS data including

Mascot205

Sequest206

Xtandem207

OMSSA208

and PEAKS209

These searching algorithms are

performed by matching MSMS spectra and precursor mass to sequences found within proteins

How well the actual spectra match the theoretical spectra determines a score which is unique to

the searching algorithm and usually can be extrapolated to the probability of a random hit

Recently a database has been developed for PTM analysis by the use of the program SIMS210

Specifically for phosphopeptides Ascorersquos algorithm scans the MSMS data to determine the

likelihood of correct phosphosite identification from the presence of site identifying product

ions211

If the organism that is being analyzed has not had its genome sequenced and no (or very

limited) FASTA database is available a homology search can be performed using SPIDER212

available with PEAKS software Alternatively individual MSMS spectrum can be de novo

sequenced but software is available to perform automated de novo sequencing of numerous

spectra (PEAKS208

DeNovoX and PepSeq)

For large-scale protein identifications the false discovery rate (FDR) must be established

by the searching algorithm and that is accomplished by re-searching the data with a false

database created by reversing or scrambling the amino acid sequence of the original database

used for the protein search Any hits from the false database will contribute to the FDR and this

value can be adjusted usually around 1 An additional layer of confidence in the obtained data

43

can be achieved in shotgun proteomics experiments by removing all the proteins that are

identified by only one peptide

Once a set of confident proteins or peptides have been generated from database

searching bioinformatic analysis or biostatistical analysis is needed Numerous software

packages are available for different purposes FLEXIQuant is an example for absolute

quantitation of isotopically labeled protein or peptides of interest213

FDR analysis of

phosphopeptides or other specific PTMs can be adjusted with such software as Scaffold

providing data consisting only of a specific modification214

Bioinformatic tools such as

Scaffold or ProteoIQ also include gene ontology (GO) analysis which can classify identified

proteins by three categories cellular component molecular function or biological process

Custom bioinformatics programs can also be developed and are often useful in various proteomic

studies including biomarker discovery in neurological diseases215

More detailed review of

bioinformatics in peptidomics216

and proteomics217

can be found elsewhere

Validation of Biomarkers by Targeted Proteomics

The validation of putative biomarkers identified by MS-based proteomic analysis is often

required to provide orthogonal analysis to rule out a false positive by MS and providing

additional evidence for the biomarker candidate(s) from the study for future potential clinical

assays At present antibody-based assays such as Western blotting ELISA and

immunochemistry are the most widely used methods for biomarker validation Although accurate

and well established these methods rely on protein specific antibodies for the measurement of

the putative biomarker and could be difficult for large-scale validation of all or even a subset of a

long list of putative protein biomarkers typically obtained by MS-based comparative proteomic

44

analysis Large scale validation is impractical due to the cost for each antibody the labor to

develop a publishable Western blot or ELISA and the antibody availability for certain proteins

As an alternative strategy quantitative assays based on multiple-reaction monitoring (MRM) MS

using a triple quadrupole mass spectrometer have been employed in biomarker verification

MRM is the most common use of MSMS for absolute quantitation It is a hypothesis

driven experiment where the peptide of interest and its subsequent fragmentation pattern must be

known prior to the quantitative MRM experiments MRM involves selecting a specific mz (first

quadrupole) to be isolated for fragmentation (second quadrupole) followed by one or more of

the most intense fragment ions (third quadrupole) being monitored The ability to quantitate and

thus validate the proteins or peptides as potential biomarkers is achieved by performing MRM on

isotopically labeled reference peptide for targeted peptideprotein of interest The main obstacle

for quantification of peptides is interference and ion suppression effects from co-eluting

substances Since the isotopically labeled and native peptide will co-elute the same interference

and ion suppression will occur for both peptides and thus correcting these interfering effects

Peptides need to be systematically chosen for a highly sensitive and reproducible MRM

experiment to ensure proper validation of putative biomarkers Peptides require certain intrinsic

properties which include an mz within the practical mass detection range for the instrument and

high ionization efficiency If the desired peptide to be quantified is derived from a digestion

then peptides that have detectable incomplete digestion or missed cleavage site can be a major

source of variability Peptides with a methionine and to a lesser extent tryptophan are

traditionally removed from consideration from MRM quantitative experiments due to the

variable nature of the oxidation that can occur In addition if chromatographic separation is

performed the retention behavior of the peptide must be well behaved with little tailing effects

45

eluting late causing broadening of the peak and even irreversible binding to the column As an

example hydrophilic peptides being eluted off a C18 column may exhibit the previously

described concerns and a different chromatographic separation will need to be explored for

improved limits of detection quantitation and validation To determine consistent peptide

detection or usefulness of certain peptides databases such as Proteomics Database218

PRIDE219

PeptideAtlas220

have been developed to compile proteomic data repositories from initial

discovery experiments

After the peptide is selected for analysis the proper MRM transitions need to be selected

to optimize the sensitivity and selectivity of the experiment It is common for investigators to

select two or three of the most intense transitions for the proposed experiment It is imperative

that the same instrument is used for the determination of transition ions as different mass

spectrometers may have a bias towards different fragment ions

MRM experiments are still highly popular experiments for hypothesis directed

experiments221

biomarker analysis222

and validation223

Validation of putative biomarkers is

increasingly becoming a necessary step when performing large scale non-hypothesis driven

proteomics experiments The traditional validation techniques of ELISA Western blotting and

immunohistochemistry are still used but MRM experiments are becoming an attractive

alternative for validation of putative biomarkers due to its enhanced throughput and specificity

Current work is still being performed to both expand the linear dynamic range224

and

sensitivity225

of MRM A recent endeavor to increase the sensitivity for MRM experiments was

accomplished by ldquoPulsed MRMrdquo via the use of an ion funnel trap to enhance confinement and

accumulation of ions The authors claimed an increase by 5-fold for peak amplitude and a 2-3

fold reduction in chemical background225

46

Remaining Challenges and Emerging Technologies

Large sample numbers for mass spectrometry analysis

Multiple conventional studies in proteomics have been performed on a single or a few

biological samples As bio-variability can be exceedingly high the need for larger sample sizes

is currently being investigated Prentice et al used a starting point of 3200 patient samples

from the Womenrsquos Health Institute (WHI) to probe the plasma proteome using MS for

biomarkers The study did not test the 3200 patient samples by MS because even a simple one

hour one dimensional RP analysis on a mass spectrometer would take months of instrument time

for uninterrupted analysis Instead the authors pooled 100 samples together to bring the total

number of pooled samples to 32 To provide relevant plasma biomarkers the samples were then

subjected to immunodepletion 2-D protein separation (96 fractions total) and then 1-D RPLC of

tryptic peptide separation on-line interface to a mass spectrometer The large sample cohorts

help address bio-variability that can be a concern from small sample size proteomic experiments

and provide ample sample amounts to investigate the low abundance proteins226

Hemoglobin-derived neuropeptides and non-classical neuropeptides

Neuropeptides such as neuropeptide Y and enkephalin are short chains of amino acids

that are secreted from a range of neuronal cells that signal nearby cells In contrast non-classical

neuropeptides are termed as neuropeptides or ldquomicroproteinsrdquo which are derived from

intracellular protein fragments and synthesized from the cytosol227

MS was recently used to

determine that hemopressins which are hemoglobin-derived peptides are upregulated in Cpefatfat

mice brains Gelman et al designed an MS experiment to compare hemoglobin-derived

47

peptides comparing the brain blood and heart peptidome in mice The authors provided data

that specific hemoglobin peptides were produced in the brain and were not produced in the

blood Certain alpha and beta hemoglobin peptides were also up regulated in the brain for

Cpefatfat

mice and bind to CB1 cannabinoid receptors228

As discussed earlier in the review

peptidomics and specifically neuropeptidomics are popular fields of study utilizing MS and non-

classical neuropeptides is an exciting emerging area of research that could further expand the

diversity of cell-cell signaling molecules

Ultrasensitive mass spectrometry for single cell analysis

In addition to large scale analysis MS-based proteomics and peptidomics are making

progress into ultrasensitive single cell analysis The most successful MS-based techniques for

single cell analysis was performed with MALDI and studies that have been performed on

relatively large neurons are reviewed elsewhere229

The ultrasensitive MS analysis is currently

directed towards single cell analysis of smaller cells including cancer cells The first challenge

in single cell analysis is the isolation and further sample preparation to yield relevant data

Collection and isolation of a cell type can be accomplished using antibodies for fluorescence

activated cell sorting (FACS) and immune magnetic separation FACS works by flow cytometry

sorting cells by a laser that excites a fluorescent tag that is attached to an antibody Immune

magnetic separation allows separation by antibodies with magnetic properties such as

Dynabeads230

One exciting study combining FACS and MS termed mass cytometry This

technology works by infusing a droplet into an inductively coupled plasma mass spectrometer

(ICP-MS) containing a single cell bound to antibodies chelated to transition elements allowing a

quantifying response between single cells231

Clearly the future of single cell analysis for

48

biomarker analysis and proteomics is encouraging and has the potential to be an emerging field

in MS-based proteomics and peptidomics

Laserspray ionization (LSI)

Laserspray ionization (LSI) is an exciting new method to produce multiply charged mass

spectra from MALDI that is nearly identical to ESI232-234

Recently it has been reported that LSI

can be performed in lieu of matrix to produce a total solvent-free analysis234

The benefits of

being able to generate multiply charged peptides without any solvent may offer advantages

including MS analysis of insoluble membrane proteins or hydrophobic peptides avoidance of

chemical reactions while in solvents a reduction of sample loss due to liquid sample preparation

and ability to avoid diffusion effects from tissue imaging studies234

The multiply charged peptide and protein ions produced by LSI expand the mass range

for tissue imaging analysis More importantly the multiply-charged peptide ions are amenable

for electron-based fragmentation methods such as ETD or ECD which can be employed in

conjunction with tissue imaging experiments to yield in situ sequencing and identification of

peptides of interest235

Paper spray ionization

Paper spray (PS) is an ambient ionization method which was first reported using

chromatography paper allowing detection of metabolites from dried blood spots The original

method used a cut out piece of paper with a voltage clipped on the back while applying 10 microL of

methanolH2O236

Improvements have been made to this technology to enhance analysis

efficiency with a new solvent 91 dichloromethaneisopropanol (vv) and the use of silica paper

49

over chromatography paper237

Interesting applications or modifications have been made to PS

including direct analysis of biological tissue238

and leaf spray for direct analysis of plant

materials239

but both detect metabolites instead of proteins or peptides Paper spray ionization

was previously shown to enable detection of cytochrome c and bradykinin [2-9] standards in a

proof of principle study240

Clearly the utility of PS analysis in proteomics and peptidomics is

yet to be explored

niECD

New fragmentation techniques have been investigated for their utility in proteomics and

peptidomics including a recently reported negative-ion electron capture dissociation (niECD)

Acidic peptides which usually contain PTMs such as phosphorylation or sulfonation are often

difficult to be detected as multiply charged peptides in the positive ion mode As discussed

earlier multiply charged peptides are required for ECDETD fragmentation The fragmentation

of niECD is accomplished by a multiply negatively charged peptide adding an electron The

resulting fragmentation of multiply sulfated and phosphorylated peptide and protein standards

showed no sulfate loss and preserved phosphorylation site The resulting fragmentation pattern

from niECD was also improved in the peptide anions and provides a new strategy for de novo

sequencing with PTM localization241

Conclusions and Perspectives

Proteomics methodologies have produced large datasets of proteins involved in various

biological and disease progression processes Numerous mass spectrometry-based proteomics

and peptidomics tools have been developed and are continuously being improved in both

50

chromatographic or electrophoretic separation and MS hardware and software However several

important issues that remain to be addressed rely on further technical advances in proteomics

analysis When large proteomes consisting of thousands of proteins are analyzed and quantified

dynamic range is still limited with more abundant proteins being preferentially detected

Development and optimization of chemical tagging reagents that target specific protein classes

maybe necessary to help enrich important signaling proteins and assess cellular and molecular

heterogeneity of the proteome and peptidome Furthermore a significant bottleneck in

usefulness of proteomics research is the ability to validate the results and provide clear

significant biological relevance to the results The idea of P4 medicine242 243

is an attractive

concept where the four Prsquos stand for predictive preventive personalized and participatory

Proteomics is one of the critical ldquoomicsrdquo fields and has led to the development of enabling

innovative strategies to P4 medicine244

A goal of P4 medicine is to assess both early disease

detection and disease progression in a person A simplified example of how proteomics fits into

P4 medicine is that certain brain-specific proteins could be used for diagnosis with

presymptomatic prion disease244

The concept of proteomic experiments providing an individual

biomarker is becoming more obsolete with the revised vision being a biomolecular barcode that

could potentially be ldquoscannedrdquo or be a fingerprint for a specific disease or early onset to that

disease being closer to reality An excellent review on what biomarker analysis can do for true

patients is available245

Proteomics can also generate new hypothesis that can be tested by classical biochemical

approaches If a disease has an unknown pathogenesis proteomics is a good starting point to try

to assemble putative markers that can lead to further hypothesis for evaluation If a particular

protein or PTM is associated with a disease state either qualitatively or quantitatively potential

51

treatments could target that protein of interest or investigators could monitor that protein or

PTM during potential treatments of the disease Proteomics has expanded greatly over the last

few decades with the goal of providing revealing insights to some of the most complex

biological problems currently facing the scientific community

Acknowledgements

Preparation of this manuscript was supported in part by the University of Wisconsin Graduate

School Wisconsin Alzheimerrsquos Disease Research Center Pilot Grant and a Department of

Defense Pilot Award LL acknowledges an H I Romnes Faculty Fellowship

52

Figure 1 A summary of general workflows of biomarker discovery pipeline by MS-based

proteomic approaches

Biological sample (CSF blood urine saliva cell

lysate tissue homogenates secreted proteins etc)

Protein extraction Sample pretreatment

2D-GE2D-DIGE MS 1D or 2D LC-MSMS

MALDI-IMS

Identification of

differentially

expressed proteins

Protein identification

Potential biomarkers

Biomarker validation

- Antibody based immunoassays

- MRM

Quantitative analysis

- Isotope labeling

- Label free

Identification and

localization of

differentially expressed

biomolecules

Intact tissue

Sample preparation Slice frozen tissues

thaw-mounted on plate

Apply Matrix

53

Figure 2 Tissue expression and dynamic range of the human plasma proteome A pie chart

representing the tissue of origin for the high abundance proteins shows that the majority of

proteins come from the liver (A) Conversely the lower abundance plasma proteins have a much

more diverse tissue of origin (B and C) The large dynamic range of plasma proteins is presented

and the proteins can be grouped into three categories (classical plasma proteins tissue leakage

products interleukinscytokines) (D) Adapted from Zhang et al11

and Schiess et al246

with

permission

54

55

Table 1 A summary of the common strategies applied to MS-based quantitative proteomic

analysis

Gel based Stable isotope labeling Label free

2D-GE

2D-DIGE 110

In vitro derivatization

18O

16O

153

ICAT 156

TMT 159

iTRAQ 158

Formaldehyde 247

ICPL 248

In vivo metabolic labeling

14N

15N

249

SILAC 164

AUC measurement 169 172

Spectral counting 173

AUC Area Under Curve ICAT Isotope-Coded Affinity Tag TMT Tandem Mass Tags iTRAQ Isobaric Tags for

Relative and Absolute Quantification ICPL Isotope Coded Protein Labeling SILAC Stable Isotope Labeling by

Amino Acids in Cell Culture Isobaric Tags for Relative and Absolute Quantification (iTRAQ)

56

References

1 Liu T Qian W J Mottaz H M Gritsenko M A Norbeck A D Moore R J

Purvine S O Camp D G 2nd Smith R D Evaluation of multiprotein

immunoaffinity subtraction for plasma proteomics and candidate biomarker discovery

using mass spectrometry Mol Cell Proteomics 2006 5 (11) 2167-74

2 Holten-Andersen M N Murphy G Nielsen H J Pedersen A N Christensen I J

Hoyer-Hansen G Brunner N Stephens R W Quantitation of TIMP-1 in plasma of healthy

blood donors and patients with advanced cancer Br J Cancer 1999 80 (3-4) 495-503

3 Tammen H Schulte I Hess R Menzel C Kellmann M Mohring T Schulz-

Knappe P Peptidomic analysis of human blood specimens comparison between plasma

specimens and serum by differential peptide display Proteomics 2005 5 (13) 3414-22

4 Rai A J Gelfand C A Haywood B C Warunek D J Yi J Schuchard M D

Mehigh R J Cockrill S L Scott G B Tammen H Schulz-Knappe P Speicher D W

Vitzthum F Haab B B Siest G Chan D W HUPO Plasma Proteome Project specimen

collection and handling towards the standardization of parameters for plasma proteome samples

Proteomics 2005 5 (13) 3262-77

5 Lippi G Guidi G C Mattiuzzi C Plebani M Preanalytical variability the dark side

of the moon in laboratory testing Clin Chem Lab Med 2006 44 (4) 358-65

6 Holten-Andersen M N Schrohl A S Brunner N Nielsen H J Hogdall C K

Hogdall E V Evaluation of sample handling in relation to levels of tissue inhibitor of

metalloproteinases-1 measured in blood by immunoassay Int J Biol Markers 2003 18 (3) 170-

6

7 Ytting H Christensen I J Thiel S Jensenius J C Svendsen M N Nielsen L

Lottenburger T Nielsen H J Biological variation in circulating levels of mannan-binding

lectin (MBL) and MBL-associated serine protease-2 and the influence of age gender and

physical exercise Scand J Immunol 2007 66 (4) 458-64

8 Hanash S Building a foundation for the human proteome the role of the Human

Proteome Organization J Proteome Res 2004 3 (2) 197-9

9 Omenn G S States D J Adamski M Blackwell T W Menon R Hermjakob H

Apweiler R Haab B B Simpson R J Eddes J S Kapp E A Moritz R L Chan D W

Rai A J Admon A Aebersold R Eng J Hancock W S Hefta S A Meyer H Paik Y

K Yoo J S Ping P Pounds J Adkins J Qian X Wang R Wasinger V Wu C Y

Zhao X Zeng R Archakov A Tsugita A Beer I Pandey A Pisano M Andrews P

Tammen H Speicher D W Hanash S M Overview of the HUPO Plasma Proteome Project

results from the pilot phase with 35 collaborating laboratories and multiple analytical groups

generating a core dataset of 3020 proteins and a publicly-available database Proteomics 2005 5

(13) 3226-45

10 Liu T Qian W J Gritsenko M A Xiao W Moldawer L L Kaushal A Monroe

M E Varnum S M Moore R J Purvine S O Maier R V Davis R W Tompkins R

G Camp D G 2nd Smith R D High dynamic range characterization of the trauma patient

plasma proteome Mol Cell Proteomics 2006 5 (10) 1899-913

11 Zhang Q Faca V Hanash S Mining the plasma proteome for disease applications

across seven logs of protein abundance J Proteome Res 2011 10 (1) 46-50

12 Matsubara J Honda K Ono M Sekine S Tanaka Y Kobayashi M Jung G

Sakuma T Nakamori S Sata N Nagai H Ioka T Okusaka T Kosuge T Tsuchida A

57

Shimahara M Yasunami Y Chiba T Yamada T Identification of adipophilin as a potential

plasma biomarker for colorectal cancer using label-free quantitative mass spectrometry and

protein microarray Cancer Epidemiol Biomarkers Prev 2011 20 (10) 2195-203

13 Murakoshi Y Honda K Sasazuki S Ono M Negishi A Matsubara J Sakuma

T Kuwabara H Nakamori S Sata N Nagai H Ioka T Okusaka T Kosuge T

Shimahara M Yasunami Y Ino Y Tsuchida A Aoki T Tsugane S Yamada T Plasma

biomarker discovery and validation for colorectal cancer by quantitative shotgun mass

spectrometry and protein microarray Cancer Sci 2011 102 (3) 630-8

14 Addona T A Shi X Keshishian H Mani D R Burgess M Gillette M A

Clauser K R Shen D Lewis G D Farrell L A Fifer M A Sabatine M S Gerszten R

E Carr S A A pipeline that integrates the discovery and verification of plasma protein

biomarkers reveals candidate markers for cardiovascular disease Nat Biotechnol 2011 29 (7)

635-43

15 Acosta-Martin A E Panchaud A Chwastyniak M Dupont A Juthier F Gautier

C Jude B Amouyel P Goodlett D R Pinet F Quantitative mass spectrometry analysis

using PAcIFIC for the identification of plasma diagnostic biomarkers for abdominal aortic

aneurysm PLoS One 2011 6 (12) e28698

16 Ray S Britschgi M Herbert C Takeda-Uchimura Y Boxer A Blennow K

Friedman L F Galasko D R Jutel M Karydas A Kaye J A Leszek J Miller B L

Minthon L Quinn J F Rabinovici G D Robinson W H Sabbagh M N So Y T

Sparks D L Tabaton M Tinklenberg J Yesavage J A Tibshirani R Wyss-Coray T

Classification and prediction of clinical Alzheimers diagnosis based on plasma signaling

proteins Nat Med 2007 13 (11) 1359-62

17 Lopez M F Kuppusamy R Sarracino D A Prakash A Athanas M Krastins B

Rezai T Sutton J N Peterman S Nicolaides K Mass spectrometric discovery and selective

reaction monitoring (SRM) of putative protein biomarker candidates in first trimester Trisomy 21

maternal serum J Proteome Res 2011 10 (1) 133-42

18 Sheta E A Appel S H Goldknopf I L 2D gel blood serum biomarkers reveal

differential clinical proteomics of the neurodegenerative diseases Expert Rev Proteomics 2006

3 (1) 45-62

19 Zhang J Goodlett D R Montine T J Proteomic biomarker discovery in

cerebrospinal fluid for neurodegenerative diseases J Alzheimers Dis 2005 8 (4) 377-86

20 Yuan X Desiderio D M Proteomics analysis of prefractionated human lumbar

cerebrospinal fluid Proteomics 2005 5 (2) 541-50

21 Wenner B R Lovell M A Lynn B C Proteomic analysis of human ventricular

cerebrospinal fluid from neurologically normal elderly subjects using two-dimensional LC-

MSMS J Proteome Res 2004 3 (1) 97-103

22 Maccarrone G Milfay D Birg I Rosenhagen M Holsboer F Grimm R Bailey

J Zolotarjova N Turck C W Mining the human cerebrospinal fluid proteome by

immunodepletion and shotgun mass spectrometry Electrophoresis 2004 25 (14) 2402-12

23 Yuan X Desiderio D M Proteomics analysis of human cerebrospinal fluid J

Chromatogr B Analyt Technol Biomed Life Sci 2005 815 (1-2) 179-89

24 Wong M Schlaggar B L Buller R S Storch G A Landt M Cerebrospinal fluid

protein concentration in pediatric patients defining clinically relevant reference values Arch

Pediatr Adolesc Med 2000 154 (8) 827-31

58

25 Roche S Gabelle A Lehmann S Clinical proteomics of the cerebrospinal fluid

Towards the discovery of new biomarkers Proteomics Clin Appl 2008 2 (3) 428-36

26 Rozek W Ricardo-Dukelow M Holloway S Gendelman H E Wojna V

Melendez L M Ciborowski P Cerebrospinal fluid proteomic profiling of HIV-1-infected

patients with cognitive impairment J Proteome Res 2007 6 (11) 4189-99

27 Schutzer S E Liu T Natelson B H Angel T E Schepmoes A A Purvine S O

Hixson K K Lipton M S Camp D G Coyle P K Smith R D Bergquist J

Establishing the proteome of normal human cerebrospinal fluid PLoS One 2010 5 (6) e10980

28 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C

Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome

Res 2008 7 (1) 386-99

29 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from

patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma

biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 2010 878 (22) 2003-12

30 You J S Gelfanova V Knierman M D Witzmann F A Wang M Hale J E The

impact of blood contamination on the proteome of cerebrospinal fluid Proteomics 2005 5 (1)

290-6

31 Yuan X Desiderio D M Human cerebrospinal fluid peptidomics J Mass Spectrom

2005 40 (2) 176-81

32 Ringman J M Schulman H Becker C Jones T Bai Y Immermann F Cole G

Sokolow S Gylys K Geschwind D H Cummings J L Wan H I Proteomic changes in

cerebrospinal fluid of presymptomatic and affected persons carrying familial Alzheimer disease

mutations Arch Neurol 2012 69 (1) 96-104

33 Jahn H Wittke S Zurbig P Raedler T J Arlt S Kellmann M Mullen W

Eichenlaub M Mischak H Wiedemann K Peptide fingerprinting of Alzheimers disease in

cerebrospinal fluid identification and prospective evaluation of new synaptic biomarkers PLoS

One 2011 6 (10) e26540

34 Rajagopal M U Hathout Y MacDonald T J Kieran M W Gururangan S

Blaney S M Phillips P Packer R Gordish-Dressman H Rood B R Proteomic profiling

of cerebrospinal fluid identifies prostaglandin D2 synthase as a putative biomarker for pediatric

medulloblastoma A pediatric brain tumor consortium study Proteomics 2011 11 (5) 935-43

35 Giron P Dayon L Turck N Hoogland C Sanchez J C Quantitative analysis of

human cerebrospinal fluid proteins using a combination of cysteine tagging and amine-reactive

isobaric labeling J Proteome Res 2011 10 (1) 249-58

36 Ting L Rad R Gygi S P Haas W MS3 eliminates ratio distortion in isobaric

multiplexed quantitative proteomics Nat Methods 2011 8 (11) 937-40

37 Michalski A Damoc E Hauschild J P Lange O Wieghaus A Makarov A

Nagaraj N Cox J Mann M Horning S Mass spectrometry-based proteomics using Q

Exactive a high-performance benchtop quadrupole Orbitrap mass spectrometer Mol Cell

Proteomics 2011 10 (9) M111 011015

38 Spirin V Shpunt A Seebacher J Gentzel M Shevchenko A Gygi S Sunyaev

S Assigning spectrum-specific P-values to protein identifications by mass spectrometry

Bioinformatics 2011 27 (8) 1128-34

39 Kellie J F Catherman A D Durbin K R Tran J C Tipton J D Norris J L

Witkowski C E Thomas P M Kelleher N L Robust Analysis of the Yeast Proteome under

59

50 kDa by Molecular-Mass-Based Fractionation and Top-Down Mass Spectrometry Anal Chem

2012 84 (1) 209-15

40 Wilhelm M Kirchner M Steen J A Steen H mz5 Space- and Time-efficient

Storage of Mass Spectrometry Data Sets Mol Cell Proteomics 2012 11 (1) O111 011379

41 Zhou F Sikorski T W Ficarro S B Webber J T Marto J A Online nanoflow

reversed phase-strong anion exchange-reversed phase liquid chromatography-tandem mass

spectrometry platform for efficient and in-depth proteome sequence analysis of complex

organisms Anal Chem 2011 83 (18) 6996-7005

42 Winter D Steen H Optimization of cell lysis and protein digestion protocols for the

analysis of HeLa S3 cells by LC-MSMS Proteomics 2011 11 (24) 4726-30

43 Shteynberg D Deutsch E W Lam H Eng J K Sun Z Tasman N Mendoza L

Moritz R L Aebersold R Nesvizhskii A I iProphet multi-level integrative analysis of

shotgun proteomic data improves peptide and protein identification rates and error estimates

Mol Cell Proteomics 2011 10 (12) M111 007690

44 Kellie J F Catherman A D Durbin K R Tran J C Tipton J D Norris J L

Witkowski C E Thomas P M Kelleher N L Robust Analysis of the Yeast Proteome under

50 kDa by Molecular-Mass-Based Fractionation and Top-Down Mass Spectrometry Anal Chem

84 (1) 209-15

45 Datta A Park J E Li X Zhang H Ho Z S Heese K Lim S K Tam J P Sze

S K Phenotyping of an in vitro model of ischemic penumbra by iTRAQ-based shotgun

quantitative proteomics J Proteome Res 9 (1) 472-84

46 Decramer S Gonzalez de Peredo A Breuil B Mischak H Monsarrat B Bascands

J L Schanstra J P Urine in clinical proteomics Mol Cell Proteomics 2008 7 (10) 1850-62

47 De La Monte S M Wands J R The AD7c-NTP neuronal thread protein biomarker for

detecting Alzheimers disease J Alzheimers Dis 2001 3 (3) 345-353

48 Van Dorsselaer A Carapito C Delalande F Schaeffer-Reiss C Thierse D Diemer

H McNair D S Krewski D Cashman N R Detection of prion protein in urine-derived

injectable fertility products by a targeted proteomic approach PLoS One 2011 6 (3) e17815

49 Kuwabara Y Mine K Katayama A Inagawa T Akira S Takeshita T Proteomic

analyses of recombinant human follicle-stimulating hormone and urinary-derived gonadotropin

preparations J Reprod Med 2009 54 (8) 459-66

50 Court M Selevsek N Matondo M Allory Y Garin J Masselon C D Domon B

Toward a standardized urine proteome analysis methodology Proteomics 2011 11 (6) 1160-71

51 Marimuthu A OMeally R N Chaerkady R Subbannayya Y Nanjappa V Kumar

P Kelkar D S Pinto S M Sharma R Renuse S Goel R Christopher R Delanghe B

Cole R N Harsha H C Pandey A A comprehensive map of the human urinary proteome J

Proteome Res 2011 10 (6) 2734-43

52 Bermejo-Pareja F Antequera D Vargas T Molina J A Carro E Saliva levels of

Abeta1-42 as potential biomarker of Alzheimers disease a pilot study BMC Neurol 2010 10

108

53 Devic I Hwang H Edgar J S Izutsu K Presland R Pan C Goodlett D R

Wang Y Armaly J Tumas V Zabetian C P Leverenz J B Shi M Zhang J Salivary

alpha-synuclein and DJ-1 potential biomarkers for Parkinsons disease Brain 2011 134 (Pt 7)

e178

60

54 Al-Tarawneh S K Border M B Dibble C F Bencharit S Defining salivary

biomarkers using mass spectrometry-based proteomics a systematic review OMICS 2011 15

(6) 353-61

55 Castagnola M Cabras T Vitali A Sanna M T Messana I Biotechnological

implications of the salivary proteome Trends Biotechnol 2011 29 (8) 409-18

56 Caprioli R M Farmer T B Gile J Molecular imaging of biological samples

localization of peptides and proteins using MALDI-TOF MS Anal Chem 1997 69 (23) 4751-

60

57 Stoeckli M Chaurand P Hallahan D E Caprioli R M Imaging mass spectrometry

a new technology for the analysis of protein expression in mammalian tissues Nat Med 2001 7

(4) 493-6

58 Chaurand P Schwartz S A Caprioli R M Imaging mass spectrometry a new tool to

investigate the spatial organization of peptides and proteins in mammalian tissue sections Curr

Opin Chem Biol 2002 6 (5) 676-81

59 Chaurand P Stoeckli M Caprioli R M Direct profiling of proteins in biological

tissue sections by MALDI mass spectrometry Anal Chem 1999 71 (23) 5263-70

60 Seeley E H Caprioli R M MALDI imaging mass spectrometry of human tissue

method challenges and clinical perspectives Trends Biotechnol 2011 29 (3) 136-43

61 Cazares L H Troyer D A Wang B Drake R R Semmes O J MALDI tissue

imaging from biomarker discovery to clinical applications Anal Bioanal Chem 2011 401 (1)

17-27

62 Stauber J Lemaire R Franck J Bonnel D Croix D Day R Wisztorski M

Fournier I Salzet M MALDI imaging of formalin-fixed paraffin-embedded tissues

application to model animals of Parkinson disease for biomarker hunting J Proteome Res 2008

7 (3) 969-78

63 Yuki D Sugiura Y Zaima N Akatsu H Hashizume Y Yamamoto T Fujiwara

M Sugiyama K Setou M Hydroxylated and non-hydroxylated sulfatide are distinctly

distributed in the human cerebral cortex Neuroscience 2011 193 44-53

64 Makridakis M Vlahou A Secretome proteomics for discovery of cancer biomarkers J

Proteomics 2010 73 (12) 2291-305

65 Dowling P Clynes M Conditioned media from cell lines a complementary model to

clinical specimens for the discovery of disease-specific biomarkers Proteomics 2011 11 (4)

794-804

66 Moore N H Costa L G Shaffer S A Goodlett D R Guizzetti M Shotgun

proteomics implicates extracellular matrix proteins and protease systems in neuronal

development induced by astrocyte cholinergic stimulation J Neurochem 2009 108 (4) 891-908

67 Keene S D Greco T M Parastatidis I Lee S H Hughes E G Balice-Gordon R

J Speicher D W Ischiropoulos H Mass spectrometric and computational analysis of

cytokine-induced alterations in the astrocyte secretome Proteomics 2009 9 (3) 768-82

68 Dowell J A Johnson J A Li L Identification of astrocyte secreted proteins with a

combination of shotgun proteomics and bioinformatics J Proteome Res 2009 8 (8) 4135-43

69 Ahmed F E Sample preparation and fractionation for proteome analysis and cancer

biomarker discovery by mass spectrometry J Sep Sci 2009 32 (5-6) 771-98

70 Righetti P G Fasoli E Boschetti E Combinatorial peptide ligand libraries the

conquest of the hidden proteome advances at great strides Electrophoresis 2011 32 (9) 960-6

61

71 Selvaraju S El Rassi Z Liquid-phase-based separation systems for depletion

prefractionation and enrichment of proteins in biological fluids and matrices for in-depth

proteomics analysis - An update covering the period 2008-2011 Electrophoresis 2011 33 (1)

74-88

72 Kalume D E Molina H Pandey A Tackling the phosphoproteome tools and

strategies Curr Opin Chem Biol 2003 7 (1) 64-9

73 Cohen P The regulation of protein function by multisite phosphorylation--a 25 year

update Trends Biochem Sci 2000 25 (12) 596-601

74 Nagaraj N DSouza R C Cox J Olsen J V Mann M Feasibility of large-scale

phosphoproteomics with higher energy collisional dissociation fragmentation J Proteome Res

2010 9 (12) 6786-94

75 Olsen J V Vermeulen M Santamaria A Kumar C Miller M L Jensen L J

Gnad F Cox J Jensen T S Nigg E A Brunak S Mann M Quantitative

phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis Sci

Signal 2010 3 (104) ra3

76 Wagner P D Vu N D Histidine to aspartate phosphotransferase activity of nm23

proteins phosphorylation of aldolase C on Asp-319 Biochem J 2000 346 Pt 3 623-30

77 Chi A Huttenhower C Geer L Y Coon J J Syka J E Bai D L Shabanowitz

J Burke D J Troyanskaya O G Hunt D F Analysis of phosphorylation sites on proteins

from Saccharomyces cerevisiae by electron transfer dissociation (ETD) mass spectrometry Proc

Natl Acad Sci U S A 2007 104 (7) 2193-8

78 Rudrabhatla P Jaffe H Pant H C Direct evidence of phosphorylated neuronal

intermediate filament proteins in neurofibrillary tangles (NFTs) phosphoproteomics of

Alzheimers NFTs FASEB J 25 (11) 3896-905

79 Saez-Valero J Fodero L R Sjogren M Andreasen N Amici S Gallai V

Vanderstichele H Vanmechelen E Parnetti L Blennow K Small D H Glycosylation of

acetylcholinesterase and butyrylcholinesterase changes as a function of the duration of

Alzheimers disease J Neurosci Res 2003 72 (4) 520-6

80 Silveyra M X Cuadrado-Corrales N Marcos A Barquero M S Rabano A

Calero M Saez-Valero J Altered glycosylation of acetylcholinesterase in Creutzfeldt-Jakob

disease J Neurochem 2006 96 (1) 97-104

81 Wei X Li L Comparative glycoproteomics approaches and applications Brief Funct

Genomic Proteomic 2009 8 (2) 104-13

82 Tian Y Zhang H Glycoproteomics and clinical applications Proteomics Clin Appl

2010 4 (2) 124-32

83 Butterfield D A Owen J B Lectin-affinity chromatography brain glycoproteomics

and Alzheimer disease insights into protein alterations consistent with the pathology and

progression of this dementing disorder Proteomics Clin Appl 2011 5 (1-2) 50-6

84 Wei X Herbst A Ma D Aiken J Li L A quantitative proteomic approach to prion

disease biomarker research delving into the glycoproteome J Proteome Res 2010 10 (6) 2687-

702

85 Rucevic M Hixson D Josic D Mammalian plasma membrane proteins as potential

biomarkers and drug targets Electrophoresis 2011 32 (13) 1549-64

86 Gilmore J M Washburn M P Advances in shotgun proteomics and the analysis of

membrane proteomes J Proteomics 2010 73 (11) 2078-91

62

87 Weekes M P Antrobus R Lill J R Duncan L M Hor S Lehner P J

Comparative analysis of techniques to purify plasma membrane proteins J Biomol Tech 2010

21 (3) 108-15

88 Helbig A O Heck A J Slijper M Exploring the membrane proteome--challenges

and analytical strategies J Proteomics 2010 73 (5) 868-78

89 Griffin N M Schnitzer J E Overcoming key technological challenges in using mass

spectrometry for mapping cell surfaces in tissues Mol Cell Proteomics 2011 10 (2) R110

000935

90 Groen A J Lilley K S Proteomics of total membranes and subcellular membranes

Expert Rev Proteomics 2010 7 (6) 867-78

91 Wisniewski J R Tools for phospho- and glycoproteomics of plasma membranes Amino

Acids 2011 41 (2) 223-33

92 Wei X Dulberger C Li L Characterization of murine brain membrane glycoproteins

by detergent assisted lectin affinity chromatography Anal Chem 2010 82 (15) 6329-33

93 Zhang H Guo T Li X Datta A Park J E Yang J Lim S K Tam J P Sze S

K Simultaneous characterization of glyco- and phosphoproteomes of mouse brain membrane

proteome with electrostatic repulsion hydrophilic interaction chromatography Mol Cell

Proteomics 2011 9 (4) 635-47

94 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 2011 10 (9) 4219-29

95 Ma M Chen R Ge Y He H Marshall A G Li L Combining bottom-up and top-

down mass spectrometric strategies for de novo sequencing of the crustacean hyperglycemic

hormone from Cancer borealis Anal Chem 2009 81 (1) 240-7

96 Chen R Ma M Hui L Zhang J Li L Measurement of neuropeptides in crustacean

hemolymph via MALDI mass spectrometry J Am Soc Mass Spectrom 2009 20 (4) 708-18

97 Wei H Nolkrantz K Parkin M C Chisolm C N OCallaghan J P Kennedy R

T Identification and quantification of neuropeptides in brain tissue by capillary liquid

chromatography coupled off-line to MALDI-TOF and MALDI-TOFTOF-MS Anal Chem 2006

78 (13) 4342-51

98 Dowell J A Heyden W V Li L Rat neuropeptidomics by LC-MSMS and MALDI-

FTMS Enhanced dissection and extraction techniques coupled with 2D RP-RP HPLC J

Proteome Res 2006 5 (12) 3368-75

99 Che F Y Fricker L D Quantitative peptidomics of mouse pituitary comparison of

different stable isotopic tags J Mass Spectrom 2005 40 (2) 238-49

100 Fricker L D Analysis of mouse brain peptides using mass spectrometry-based

peptidomics implications for novel functions ranging from non-classical neuropeptides to

microproteins Mol Biosyst 2010 6 (8) 1355-65

101 Colgrave M L Xi L Lehnert S A Flatscher-Bader T Wadensten H Nilsson A

Andren P E Wijffels G Neuropeptide profiling of the bovine hypothalamus thermal

stabilization is an effective tool in inhibiting post-mortem degradation Proteomics 2011 11 (7)

1264-76

102 Scholz B Alm H Mattsson A Nilsson A Kultima K Savitski M M Falth M

Skold K Brunstrom B Andren P E Dencker L Neuropeptidomic analysis of the

embryonic Japanese quail diencephalon BMC Dev Biol 2010 10 30

63

103 Hummon A B Amare A Sweedler J V Discovering new invertebrate neuropeptides

using mass spectrometry Mass Spectrom Rev 2006 25 (1) 77-98

104 Vilim F S Sasaki K Rybak J Alexeeva V Cropper E C Jing J Orekhova I

V Brezina V Price D Romanova E V Rubakhin S S Hatcher N Sweedler J V

Weiss K R Distinct mechanisms produce functionally complementary actions of neuropeptides

that are structurally related but derived from different precursors J Neurosci 2010 30 (1) 131-

47

105 Fu Q Li L De novo sequencing of neuropeptides using reductive isotopic methylation

and investigation of ESI QTOF MSMS fragmentation pattern of neuropeptides with N-terminal

dimethylation Anal Chem 2005 77 (23) 7783-95

106 Chen R Jiang X Conaway M C Mohtashemi I Hui L Viner R Li L Mass

spectral analysis of neuropeptide expression and distribution in the nervous system of the lobster

Homarus americanus J Proteome Res 2010 9 (2) 818-32

107 Altelaar A F Mohammed S Brans M A Adan R A Heck A J Improved

identification of endogenous peptides from murine nervous tissue by multiplexed peptide

extraction methods and multiplexed mass spectrometric analysis J Proteome Res 2009 8 (2)

870-6

108 Gelman J S Sironi J Castro L M Ferro E S Fricker L D Peptidomic analysis

of human cell lines J Proteome Res 2011 10 (4) 1583-92

109 Li L Sweedler J V Peptides in the brain mass spectrometry-based measurement

approaches and challenges Annu Rev Anal Chem (Palo Alto Calif) 2008 1 451-83

110 Marouga R David S Hawkins E The development of the DIGE system 2D

fluorescence difference gel analysis technology Anal Bioanal Chem 2005 382 (3) 669-78

111 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S

Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-

dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of

Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66

112 Ottervald J Franzen B Nilsson K Andersson L I Khademi M Eriksson B

Kjellstrom S Marko-Varga G Vegvari A Harris R A Laurell T Miliotis T

Matusevicius D Salter H Ferm M Olsson T Multiple sclerosis Identification and clinical

evaluation of novel CSF biomarkers J Proteomics 2010 73 (6) 1117-32

113 Diez R Herbstreith M Osorio C Alzate O 2-D Fluorescence Difference Gel

Electrophoresis (DIGE) in Neuroproteomics 2010

114 Lilley K S Razzaq A Dupree P Two-dimensional gel electrophoresis recent

advances in sample preparation detection and quantitation Curr Opin Chem Biol 2002 6 (1)

46-50

115 Oh-Ishi M Maeda T Separation techniques for high-molecular-mass proteins J

Chromatogr B Analyt Technol Biomed Life Sci 2002 771 (1-2) 49-66

116 Washburn M P Wolters D Yates J R 3rd Large-scale analysis of the yeast

proteome by multidimensional protein identification technology Nat Biotechnol 2001 19 (3)

242-7

117 Burke T W Mant C T Black J A Hodges R S Strong cation-exchange high-

performance liquid chromatography of peptides Effect of non-specific hydrophobic interactions

and linearization of peptide retention behaviour J Chromatogr 1989 476 377-89

118 Edelmann M J Strong cation exchange chromatography in analysis of posttranslational

modifications innovations and perspectives J Biomed Biotechnol 2011 2011 936508

64

119 Francois I Sandra K Sandra P Comprehensive liquid chromatography fundamental

aspects and practical considerations--a review Anal Chim Acta 2009 641 (1-2) 14-31

120 Guiochon G Marchetti N Mriziq K Shalliker R A Implementations of two-

dimensional liquid chromatography J Chromatogr A 2008 1189 (1-2) 109-68

121 Song C Ye M Han G Jiang X Wang F Yu Z Chen R Zou H Reversed-

phase-reversed-phase liquid chromatography approach with high orthogonality for

multidimensional separation of phosphopeptides Anal Chem 2010 82 (1) 53-6

122 Alpert A J Hydrophilic-interaction chromatography for the separation of peptides

nucleic acids and other polar compounds J Chromatogr 1990 499 177-96

123 Gilar M Olivova P Daly A E Gebler J C Orthogonality of separation in two-

dimensional liquid chromatography Anal Chem 2005 77 (19) 6426-34

124 Di Palma S Stange D van de Wetering M Clevers H Heck A J Mohammed S

Highly sensitive proteome analysis of FACS-sorted adult colon stem cells J Proteome Res 2011

10 (8) 3814-9

125 Di Palma S Boersema P J Heck A J Mohammed S Zwitterionic hydrophilic

interaction liquid chromatography (ZIC-HILIC and ZIC-cHILIC) provide high resolution

separation and increase sensitivity in proteome analysis Anal Chem 2011 83 (9) 3440-7

126 Zarei M Sprenger A Metzger F Gretzmeier C Dengjel J Comparison of ERLIC-

TiO2 HILIC-TiO2 and SCX-TiO2 for global phosphoproteomics approaches J Proteome Res

2011 10 (8) 3474-83

127 Neue K Mormann M Peter-Katalinic J Pohlentz G Elucidation of glycoprotein

structures by unspecific proteolysis and direct nanoESI mass spectrometric analysis of ZIC-

HILIC-enriched glycopeptides J Proteome Res 2011 10 (5) 2248-60

128 Ow S Y Salim M Noirel J Evans C Wright P C Minimising iTRAQ ratio

compression through understanding LC-MS elution dependence and high-resolution HILIC

fractionation Proteomics 2011 11 (11) 2341-6

129 Alpert A J Electrostatic repulsion hydrophilic interaction chromatography for isocratic

separation of charged solutes and selective isolation of phosphopeptides Anal Chem 2008 80

(1) 62-76

130 Chien K Y Liu H C Goshe M B Development and application of a

phosphoproteomic method using electrostatic repulsion-hydrophilic interaction chromatography

(ERLIC) IMAC and LC-MSMS analysis to study Mareks Disease Virus infection J Proteome

Res 2011 10 (9) 4041-53

131 Liu X Plasencia M Ragg S Valentine S J Clemmer D E Development of high

throughput dispersive LC-ion mobility-TOFMS techniques for analysing the human plasma

proteome Brief Funct Genomic Proteomic 2004 3 (2) 177-86

132 Valentine S J Liu X Plasencia M D Hilderbrand A E Kurulugama R T

Koeniger S L Clemmer D E Developing liquid chromatography ion mobility mass

spectometry techniques Expert Rev Proteomics 2005 2 (4) 553-65

133 Valentine S J Plasencia M D Liu X Krishnan M Naylor S Udseth H R

Smith R D Clemmer D E Toward plasma proteome profiling with ion mobility-mass

spectrometry J Proteome Res 2006 5 (11) 2977-84

134 Valentine S J Ewing M A Dilger J M Glover M S Geromanos S Hughes C

Clemmer D E Using ion mobility data to improve peptide identification intrinsic amino acid

size parameters J Proteome Res 2011 10 (5) 2318-29

65

135 Ahmed F E The role of capillary electrophoresis-mass spectrometry to proteome

analysis and biomarker discovery J Chromatogr B Analyt Technol Biomed Life Sci 2009 877

(22) 1963-81

136 Fonslow B R Yates J R 3rd Capillary electrophoresis applied to proteomic analysis

J Sep Sci 2009 32 (8) 1175-88

137 Haselberg R de Jong G J Somsen G W Capillary electrophoresis-mass

spectrometry for the analysis of intact proteins J Chromatogr A 2007 1159 (1-2) 81-109

138 Haselberg R de Jong G J Somsen G W Capillary electrophoresis-mass

spectrometry for the analysis of intact proteins 2007-2010 Electrophoresis 2011 32 (1) 66-82

139 Huck C W Bakry R Huber L A Bonn G K Progress in capillary electrophoresis

coupled to matrix-assisted laser desorptionionization - time of flight mass spectrometry

Electrophoresis 2006 27 (11) 2063-74

140 Klampfl C W CE with MS detection a rapidly developing hyphenated technique

Electrophoresis 2009 30 Suppl 1 S83-91

141 Simpson D C Smith R D Combining capillary electrophoresis with mass

spectrometry for applications in proteomics Electrophoresis 2005 26 (7-8) 1291-305

142 Li Y Champion M M Sun L Champion P A Wojcik R Dovichi N J Capillary

zone electrophoresis-electrospray ionization-tandem mass spectrometry as an alternative

proteomics platform to ultraperformance liquid chromatography-electrospray ionization-tandem

mass spectrometry for samples of intermediate complexity Anal Chem 2012 84 (3) 1617-22

143 Dai L Li C Shedden K A Lee C J Quoc H Simeone D M Lubman D M

Quantitative proteomic profiling studies of pancreatic cancer stem cells J Proteome Res 2010 9

(7) 3394-402

144 Chen J Balgley B M DeVoe D L Lee C S Capillary isoelectric focusing-based

multidimensional concentrationseparation platform for proteome analysis Anal Chem 2003 75

(13) 3145-52

145 Mischak H Delles C Klein J Schanstra J P Urinary proteomics based on capillary

electrophoresis-coupled mass spectrometry in kidney disease discovery and validation of

biomarkers and clinical application Adv Chronic Kidney Dis 2010 17 (6) 493-506

146 Albalat A Mischak H Mullen W Urine proteomics in clinical applications

technologies principal considerations and clinical implementation Prilozi 2011 32 (1) 13-44

147 Zuberovic A Hanrieder J Hellman U Bergquist J Wetterhall M Proteome

profiling of human cerebrospinal fluid exploring the potential of capillary electrophoresis with

surface modified capillaries for analysis of complex biological samples Eur J Mass Spectrom

(Chichester Eng) 2008 14 (4) 249-60

148 Johannesson N Olsson L Backstrom D Wetterhall M Danielsson R Bergquist

J Screening for biomarkers in plasma from patients with gangrenous and phlegmonous

appendicitis using CE and CEC in combination with MS Electrophoresis 2007 28 (9) 1435-43

149 Guo T Lee C S Wang W DeVoe D L Balgley B M Capillary separations

enabling tissue proteomics-based biomarker discovery Electrophoresis 2006 27 (18) 3523-32

150 Hwang S I Thumar J Lundgren D H Rezaul K Mayya V Wu L Eng J

Wright M E Han D K Direct cancer tissue proteomics a method to identify candidate

cancer biomarkers from formalin-fixed paraffin-embedded archival tissues Oncogene 2007 26

(1) 65-76

66

151 Desiderio C Rossetti D V Iavarone F Messana I Castagnola M Capillary

electrophoresis--mass spectrometry recent trends in clinical proteomics J Pharm Biomed Anal

2010 53 (5) 1161-9

152 Albalat A Mischak H Mullen W Clinical application of urinary

proteomicspeptidomics Expert Rev Proteomics 2011 8 (5) 615-29

153 Ye X Luke B Andresson T Blonder J 18O stable isotope labeling in MS-based

proteomics Brief Funct Genomic Proteomic 2009 8 (2) 136-44

154 Staes A Demol H Van Damme J Martens L Vandekerckhove J Gevaert K

Global differential non-gel proteomics by quantitative and stable labeling of tryptic peptides with

oxygen-18 J Proteome Res 2004 3 (4) 786-91

155 Jorge I Navarro P Martinez-Acedo P Nunez E Serrano H Alfranca A

Redondo J M Vazquez J Statistical model to analyze quantitative proteomics data obtained

by 18O16O labeling and linear ion trap mass spectrometry application to the study of vascular

endothelial growth factor-induced angiogenesis in endothelial cells Mol Cell Proteomics 2009

8 (5) 1130-49

156 Gygi S P Rist B Gerber S A Turecek F Gelb M H Aebersold R Quantitative

analysis of complex protein mixtures using isotope-coded affinity tags Nat Biotechnol 1999 17

(10) 994-9

157 Haqqani A S Kelly J F Stanimirovic D B Quantitative protein profiling by mass

spectrometry using isotope-coded affinity tags Methods Mol Biol 2008 439 225-40

158 Ross P L Huang Y N Marchese J N Williamson B Parker K Hattan S

Khainovski N Pillai S Dey S Daniels S Purkayastha S Juhasz P Martin S Bartlet-

Jones M He F Jacobson A Pappin D J Multiplexed protein quantitation in

Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents Mol Cell Proteomics

2004 3 (12) 1154-69

159 Thompson A Schafer J Kuhn K Kienle S Schwarz J Schmidt G Neumann T

Johnstone R Mohammed A K Hamon C Tandem mass tags a novel quantification strategy

for comparative analysis of complex protein mixtures by MSMS Anal Chem 2003 75 (8)

1895-904

160 Dayon L Hainard A Licker V Turck N Kuhn K Hochstrasser D F Burkhard

P R Sanchez J C Relative quantification of proteins in human cerebrospinal fluids by

MSMS using 6-plex isobaric tags Anal Chem 2008 80 (8) 2921-31

161 DAscenzo M Choe L Lee K H iTRAQPak an R based analysis and visualization

package for 8-plex isobaric protein expression data Brief Funct Genomic Proteomic 2008 7 (2)

127-35

162 Choe L DAscenzo M Relkin N R Pappin D Ross P Williamson B Guertin

S Pribil P Lee K H 8-plex quantitation of changes in cerebrospinal fluid protein expression

in subjects undergoing intravenous immunoglobulin treatment for Alzheimers disease

Proteomics 2007 7 (20) 3651-60

163 Xiang F Ye H Chen R Fu Q Li L NN-dimethyl leucines as novel isobaric

tandem mass tags for quantitative proteomics and peptidomics Anal Chem 2010 82 (7) 2817-

25

164 Ong S E Blagoev B Kratchmarova I Kristensen D B Steen H Pandey A

Mann M Stable isotope labeling by amino acids in cell culture SILAC as a simple and

accurate approach to expression proteomics Mol Cell Proteomics 2002 1 (5) 376-86

67

165 Ong S E Kratchmarova I Mann M Properties of 13C-substituted arginine in stable

isotope labeling by amino acids in cell culture (SILAC) J Proteome Res 2003 2 (2) 173-81

166 Pimienta G Chaerkady R Pandey A SILAC for global phosphoproteomic analysis

Methods Mol Biol 2009 527 107-16 x

167 Zhang G Neubert T A Use of stable isotope labeling by amino acids in cell culture

(SILAC) for phosphotyrosine protein identification and quantitation Methods Mol Biol 2009

527 79-92 xi

168 Bondarenko P V Chelius D Shaler T A Identification and relative quantitation of

protein mixtures by enzymatic digestion followed by capillary reversed-phase liquid

chromatography-tandem mass spectrometry Anal Chem 2002 74 (18) 4741-9

169 Chelius D Bondarenko P V Quantitative profiling of proteins in complex mixtures

using liquid chromatography and mass spectrometry J Proteome Res 2002 1 (4) 317-23

170 Liu H Sadygov R G Yates J R 3rd A model for random sampling and estimation

of relative protein abundance in shotgun proteomics Anal Chem 2004 76 (14) 4193-201

171 Wang W Zhou H Lin H Roy S Shaler T A Hill L R Norton S Kumar P

Anderle M Becker C H Quantification of proteins and metabolites by mass spectrometry

without isotopic labeling or spiked standards Anal Chem 2003 75 (18) 4818-26

172 Silva J C Denny R Dorschel C A Gorenstein M Kass I J Li G Z McKenna

T Nold M J Richardson K Young P Geromanos S Quantitative proteomic analysis by

accurate mass retention time pairs Anal Chem 2005 77 (7) 2187-200

173 Zybailov B Coleman M K Florens L Washburn M P Correlation of relative

abundance ratios derived from peptide ion chromatograms and spectrum counting for

quantitative proteomic analysis using stable isotope labeling Anal Chem 2005 77 (19) 6218-

24

174 Filiou M D Martins-de-Souza D Guest P C Bahn S Turck C W To label or not

to label Applications of quantitative proteomics in neuroscience research Proteomics 2012

175 Neilson K A Ali N A Muralidharan S Mirzaei M Mariani M Assadourian G

Lee A van Sluyter S C Haynes P A Less label more free approaches in label-free

quantitative mass spectrometry Proteomics 2011 11 (4) 535-53

176 Zhu W Smith J W Huang C M Mass spectrometry-based label-free quantitative

proteomics J Biomed Biotechnol 2010 2010 840518

177 Xie F Liu T Qian W J Petyuk V A Smith R D Liquid chromatography-mass

spectrometry-based quantitative proteomics J Biol Chem 2012 286 (29) 25443-9

178 Shen Y Tolic N Xie F Zhao R Purvine S O Schepmoes A A Moore R J

Anderson G A Smith R D Effectiveness of CID HCD and ETD with FT MSMS for

degradomic-peptidomic analysis comparison of peptide identification methods J Proteome Res

2011 10 (9) 3929-43

179 Zubarev R A Kelleher N L McLafferty F W Electron Capture Dissociation of

Multiply Charged Protein Cations A Nonergodic Process Journal of the American Chemical

Society 1998 120 (13) 3265-3266

180 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and

protein sequence analysis by electron transfer dissociation mass spectrometry Proc Natl Acad

Sci U S A 2004 101 (26) 9528-33

181 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-

site identity and position on electron-transfer dissociation of polypeptide cations J Am Chem Soc

2007 129 (40) 12232-43

68

182 Swaney D L McAlister G C Coon J J Decision tree-driven tandem mass

spectrometry for shotgun proteomics Nat Methods 2008 5 (11) 959-64

183 Zhou W Ross M M Tessitore A Ornstein D Vanmeter A Liotta L A

Petricoin E F 3rd An initial characterization of the serum phosphoproteome J Proteome Res

2009 8 (12) 5523-31

184 Liu H Hakansson K Electron capture dissociation of tyrosine O-sulfated peptides

complexed with divalent metal cations Anal Chem 2006 78 (21) 7570-6

185 Snovida S I Bodnar E D Viner R Saba J Perreault H A simple cellulose

column procedure for selective enrichment of glycopeptides and characterization by nano LC

coupled with electron-transfer and high-energy collisional-dissociation tandem mass

spectrometry Carbohydr Res 2010 345 (6) 792-801

186 Sobott F Watt S J Smith J Edelmann M J Kramer H B Kessler B M

Comparison of CID versus ETD based MSMS fragmentation for the analysis of protein

ubiquitination J Am Soc Mass Spectrom 2009 20 (9) 1652-9

187 Eliuk S M Maltby D Panning B Burlingame A L High resolution electron

transfer dissociation studies of unfractionated intact histones from murine embryonic stem cells

using on-line capillary LC separation determination of abundant histone isoforms and post-

translational modifications Mol Cell Proteomics 2010 9 (5) 824-37

188 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 10 (9) 4219-29

189 Molina H Matthiesen R Kandasamy K Pandey A Comprehensive comparison of

collision induced dissociation and electron transfer dissociation Anal Chem 2008 80 (13)

4825-35

190 Olsen J V Macek B Lange O Makarov A Horning S Mann M Higher-energy

C-trap dissociation for peptide modification analysis Nat Methods 2007 4 (9) 709-12

191 McAlister G C Phanstiel D Wenger C D Lee M V Coon J J Analysis of

tandem mass spectra by FTMS for improved large-scale proteomics with superior protein

quantification Anal Chem 2010 82 (1) 316-22

192 Dayon L Pasquarello C Hoogland C Sanchez J C Scherl A Combining low-

and high-energy tandem mass spectra for optimized peptide quantification with isobaric tags J

Proteomics 2010 73 (4) 769-77

193 Second T P Blethrow J D Schwartz J C Merrihew G E MacCoss M J

Swaney D L Russell J D Coon J J Zabrouskov V Dual-pressure linear ion trap mass

spectrometer improving the analysis of complex protein mixtures Anal Chem 2009 81 (18)

7757-65

194 Jedrychowski M P Huttlin E L Haas W Sowa M E Rad R Gygi S P

Evaluation of HCD- and CID-type fragmentation within their respective detection platforms for

murine phosphoproteomics Mol Cell Proteomics 2011 10 (12) M111 009910

195 Frese C K Altelaar A F Hennrich M L Nolting D Zeller M Griep-Raming J

Heck A J Mohammed S Improved peptide identification by targeted fragmentation using

CID HCD and ETD on an LTQ-Orbitrap Velos J Proteome Res 2011 10 (5) 2377-88

196 Chakraborty A B Berger S J Gebler J C Use of an integrated MS--multiplexed

MSMS data acquisition strategy for high-coverage peptide mapping studies Rapid Commun

Mass Spectrom 2007 21 (5) 730-44

69

197 Ramos A A Yang H Rosen L E Yao X Tandem parallel fragmentation of

peptides for mass spectrometry Anal Chem 2006 78 (18) 6391-7

198 Barbara J E Castro-Perez J M High-resolution chromatographytime-of-flight MSE

with in silico data mining is an information-rich approach to reactive metabolite screening Rapid

Commun Mass Spectrom 2011 25 (20) 3029-40

199 Geromanos S J Vissers J P Silva J C Dorschel C A Li G Z Gorenstein M

V Bateman R H Langridge J I The detection correlation and comparison of peptide

precursor and product ions from data independent LC-MS with data dependant LC-MSMS

Proteomics 2009 9 (6) 1683-95

200 Koutroukides T A Guest P C Leweke F M Bailey D M Rahmoune H Bahn

S Martins-de-Souza D Characterization of the human serum depletome by label-free shotgun

proteomics J Sep Sci 34 (13) 1621-6

201 Martins-de-Souza D Guest P C Guest F L Bauder C Rahmoune H Pietsch S

Roeber S Kretzschmar H Mann D Baborie A Bahn S Characterization of the human

primary visual cortex and cerebellum proteomes using shotgun mass spectrometry-data-

independent analyses Proteomics 12 (3) 500-4

202 Scatena R Bottoni P Pontoglio A Giardina B Revisiting the Warburg effect in

cancer cells with proteomics The emergence of new approaches to diagnosis prognosis and

therapy Proteomics Clin Appl 4 (2) 143-58

203 Herberth M Koethe D Cheng T M Krzyszton N D Schoeffmann S Guest P

C Rahmoune H Harris L W Kranaster L Leweke F M Bahn S Impaired glycolytic

response in peripheral blood mononuclear cells of first-onset antipsychotic-naive schizophrenia

patients Mol Psychiatry 16 (8) 848-59

204 Krishnamurthy D Levin Y Harris L W Umrania Y Bahn S Guest P C

Analysis of the human pituitary proteome by data independent label-free liquid chromatography

tandem mass spectrometry Proteomics 11 (3) 495-500

205 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

206 Eng J K McCormack A L Yates Iii J R An approach to correlate tandem mass

spectral data of peptides with amino acid sequences in a protein database Journal of the

American Society for Mass Spectrometry 1994 5 (11) 976-989

207 Craig R Beavis R C TANDEM matching proteins with tandem mass spectra

Bioinformatics 2004 20 (9) 1466-7

208 Geer L Y Markey S P Kowalak J A Wagner L Xu M Maynard D M Yang

X Shi W Bryant S H Open mass spectrometry search algorithm J Proteome Res 2004 3

(5) 958-64

209 Zhang J Xin L Shan B Chen W Xie M Yuen D Zhang W Zhang Z Lajoie

G A Ma B PEAKS DB De Novo sequencing assisted database search for sensitive and

accurate peptide identification Mol Cell Proteomics 2011

210 Liu J Erassov A Halina P Canete M Nguyen D V Chung C Cagney G

Ignatchenko A Fong V Emili A Sequential interval motif search unrestricted database

surveys of global MSMS data sets for detection of putative post-translational modifications

Anal Chem 2008 80 (20) 7846-54

70

211 Beausoleil S A Villen J Gerber S A Rush J Gygi S P A probability-based

approach for high-throughput protein phosphorylation analysis and site localization Nat

Biotechnol 2006 24 (10) 1285-92

212 Han Y Ma B Zhang K SPIDER software for protein identification from sequence

tags with de novo sequencing error J Bioinform Comput Biol 2005 3 (3) 697-716

213 Singh S Springer M Steen J Kirschner M W Steen H FLEXIQuant a novel tool

for the absolute quantification of proteins and the simultaneous identification and quantification

of potentially modified peptides J Proteome Res 2009 8 (5) 2201-10

214 Searle B C Scaffold a bioinformatic tool for validating MSMS-based proteomic

studies Proteomics 10 (6) 1265-9

215 Herbst A McIlwain S Schmidt J J Aiken J M Page C D Li L Prion disease

diagnosis by proteomic profiling J Proteome Res 2009 8 (2) 1030-6

216 Menschaert G Vandekerckhove T T Baggerman G Schoofs L Luyten W Van

Criekinge W Peptidomics coming of age a review of contributions from a bioinformatics

angle J Proteome Res 2010 9 (5) 2051-61

217 Kumar C Mann M Bioinformatics analysis of mass spectrometry-based proteomics

data sets FEBS Lett 2009 583 (11) 1703-12

218 Craig R Cortens J P Beavis R C Open source system for analyzing validating and

storing protein identification data J Proteome Res 2004 3 (6) 1234-42

219 Jones P Cote R G Cho S Y Klie S Martens L Quinn A F Thorneycroft D

Hermjakob H PRIDE new developments and new datasets Nucleic Acids Res 2008 36

(Database issue) D878-83

220 Deutsch E W Lam H Aebersold R PeptideAtlas a resource for target selection for

emerging targeted proteomics workflows EMBO Rep 2008 9 (5) 429-34

221 Miliotis T Ali L Palm J E Lundqvist A J Ahnoff M Andersson T B

Hilgendorf C Development of a highly sensitive method using liquid chromatography-multiple

reaction monitoring to quantify membrane P-glycoprotein in biological matrices and relationship

to transport function Drug Metab Dispos 2011 39 (12) 2440-9

222 Xiang Y Koomen J M Evaluation of Direct Infusion-Multiple Reaction Monitoring

Mass Spectrometry for Quantification of Heat Shock Proteins Anal Chem 2012

223 Ossola R Schiess R Picotti P Rinner O Reiter L Aebersold R Biomarker

validation in blood specimens by selected reaction monitoring mass spectrometry of N-

glycosites Methods Mol Biol 2011 728 179-94

224 Liu H Lam L Dasgupta P K Expanding the linear dynamic range for multiple

reaction monitoring in quantitative liquid chromatography-tandem mass spectrometry utilizing

natural isotopologue transitions Talanta 2011 87 307-10

225 Belov M E Prasad S Prior D C Danielson W F 3rd Weitz K Ibrahim Y M

Smith R D Pulsed multiple reaction monitoring approach to enhancing sensitivity of a tandem

quadrupole mass spectrometer Anal Chem 2011 83 (6) 2162-71

226 Prentice R L Paczesny S Aragaki A Amon L M Chen L Pitteri S J

McIntosh M Wang P Buson Busald T Hsia J Jackson R D Rossouw J E Manson J

E Johnson K Eaton C Hanash S M Novel proteins associated with risk for coronary heart

disease or stroke among postmenopausal women identified by in-depth plasma proteome

profiling Genome Med 2 (7) 48

227 Gelman J S Fricker L D Hemopressin and other bioactive peptides from cytosolic

proteins are these non-classical neuropeptides AAPS J 2010 12 (3) 279-89

71

228 Gelman J S Sironi J Castro L M Ferro E S Fricker L D Hemopressins and

other hemoglobin-derived peptides in mouse brain comparison between brain blood and heart

peptidome and regulation in Cpefatfat mice J Neurochem 2010 113 (4) 871-80

229 Li L Garden R W Sweedler J V Single-cell MALDI a new tool for direct peptide

profiling Trends Biotechnol 2000 18 (4) 151-60

230 Altelaar A M Heck A J Trends in ultrasensitive proteomics Curr Opin Chem Biol

231 Bandura D R Baranov V I Ornatsky O I Antonov A Kinach R Lou X

Pavlov S Vorobiev S Dick J E Tanner S D Mass cytometry technique for real time

single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass

spectrometry Anal Chem 2009 81 (16) 6813-22

232 Trimpin S Inutan E D Herath T N McEwen C N Laserspray ionization a new

atmospheric pressure MALDI method for producing highly charged gas-phase ions of peptides

and proteins directly from solid solutions Mol Cell Proteomics 2010 9 (2) 362-7

233 McEwen C N Larsen B S Trimpin S Laserspray ionization on a commercial

atmospheric pressure-MALDI mass spectrometer ion source selecting singly or multiply

charged ions Anal Chem 2010 82 (12) 4998-5001

234 Wang B Lietz C B Inutan E D Leach S M Trimpin S Producing highly

charged ions without solvent using laserspray ionization a total solvent-free analysis approach at

atmospheric pressure Anal Chem 2011 83 (11) 4076-84

235 Inutan E D Richards A L Wager-Miller J Mackie K McEwen C N Trimpin

S Laserspray ionization a new method for protein analysis directly from tissue at atmospheric

pressure with ultrahigh mass resolution and electron transfer dissociation Mol Cell Proteomics

2010 10 (2) M110 000760

236 Wang H Liu J Cooks R G Ouyang Z Paper spray for direct analysis of complex

mixtures using mass spectrometry Angew Chem Int Ed Engl 49 (5) 877-80

237 Zhang Z Xu W Manicke N E Cooks R G Ouyang Z Silica coated paper

substrate for paper-spray analysis of therapeutic drugs in dried blood spots Anal Chem 84 (2)

931-8

238 Wang H Manicke N E Yang Q Zheng L Shi R Cooks R G Ouyang Z

Direct analysis of biological tissue by paper spray mass spectrometry Anal Chem 83 (4) 1197-

201

239 Liu J Wang H Cooks R G Ouyang Z Leaf spray direct chemical analysis of plant

material and living plants by mass spectrometry Anal Chem 83 (20) 7608-13

240 Liu J Wang H Manicke N E Lin J M Cooks R G Ouyang Z Development

characterization and application of paper spray ionization Anal Chem 82 (6) 2463-71

241 Yoo H J Wang N Zhuang S Song H Hakansson K Negative-ion electron

capture dissociation radical-driven fragmentation of charge-increased gaseous peptide anions J

Am Chem Soc 2011 133 (42) 16790-3

242 Tian Q Price N D Hood L Systems cancer medicine towards realization of

predictive preventive personalized and participatory (P4) medicine J Intern Med 271 (2) 111-

21

243 Hood L Friend S H Predictive personalized preventive participatory (P4) cancer

medicine Nat Rev Clin Oncol 2011 8 (3) 184-7

244 Tian Q Price N D Hood L Systems cancer medicine towards realization of

predictive preventive personalized and participatory (P4) medicine J Intern Med 2012 271

(2) 111-21

72

245 Belda-Iniesta C de Castro J Perona R Translational proteomics what can you do for

true patients J Proteome Res 2011 10 (1) 101-4

246 Schiess R Wollscheid B Aebersold R Targeted proteomic strategy for clinical

biomarker discovery Mol Oncol 2009 3 (1) 33-44

247 Hsu J L Huang S Y Chow N H Chen S H Stable-isotope dimethyl labeling for

quantitative proteomics Anal Chem 2003 75 (24) 6843-52

248 Schmidt A Kellermann J Lottspeich F A novel strategy for quantitative proteomics

using isotope-coded protein labels Proteomics 2005 5 (1) 4-15

249 Wang Y K Ma Z Quinn D F Fu E W Inverse 15

N-metabolic labelingmass

spectrometry for comparative proteomics and rapid identification of protein markerstargets

Rapid Commun Mass Spectrom 2002 16 (14) 1389-97

73

Chapter 3

Protein changes in immunodepleted cerebrospinal fluid from transgenic

mouse models of Alexander disease detected using mass spectrometry

Adapted from ldquoProtein changes in immunodepleted cerebrospinal fluid from transgenic mouse

models of Alexander disease detected using mass spectrometryrdquo Cunningham R Jany P

Messing A Li L Submitted

74

ABSTRACT

Cerebrospinal fluid (CSF) is a low protein content biological fluid with dynamic range

spanning at least nine orders of magnitude in protein content and is in direct contact with the

brain A modified IgY-14 immunodepletion treatment was performed to enhance analysis of the

low volumes of CSF that are obtainable from mice As a model system in which to test this

approach we utilized transgenic mice that over-express the intermediate filament glial fibrillary

acidic protein (GFAP) These mice are models for Alexander disease (AxD) a severe

leukodystrophy in humans From the CSF of control and transgenic mice we report the

identification of 266 proteins with relative quantification of 106 proteins Biological and

technical triplicates were performed to address animal variability as well as reproducibility in

mass spectrometric analysis Relative quantitation was performed using distributive normalized

spectral abundance factor (dNSAF) spectral counting analysis A panel of biomarker proteins

with significant changes in the CSF of GFAP transgenic mice has been identified with validation

from ELISA and microarray data demonstrating the utility of our methodology and providing

interesting targets for future investigations on the molecular and pathological aspects of AxD

75

INTRODUCTION

Alexander disease (AxD) is a fatal neurogenetic disorder caused by heterozygous point

mutations in the coding region of GFAP (encoding glial fibrillary acidic protein)1 The hallmark

diagnostic feature of this disease is the accumulation of astrocytic cytoplasmic inclusions known

as Rosenthal fibers containing GFAP Hsp27 αB-crystallin and other components2-5

Although

several potential treatment strategies6-8

are under investigation clinical trial design is hampered

by the absence of a standardized clinical scoring system or means to quantify lesions in MRI

that could serve to monitor severity and progression of disease One solution to this problem

would be the identification of biomarkers in readily sampled body fluids as indirect indicators of

disease

Cerebrospinal fluid (CSF) has a long history as a surrogate biopsy site for brain or spinal

cord in evaluating diseases of the central nervous system The protein composition of CSF is

well defined at least for the most abundant species of proteins and numerous studies exist that

characterize individual biomarkers or complex patterns of biomarkers in various diseases9 10

GFAP itself is present in CSF (albeit at much lower levels than in brain parenchyma) and in one

study of three Alexander disease patients its levels were markedly increased11

Whether an

increase in CSF GFAP will be a consistent finding in Alexander disease or whether other useful

biomarkers for this disease could be identified through an unbiased analysis of the CSF

proteome is not yet known

The rarity of Alexander disease makes analysis of human samples difficult However

mouse models exist that replicate key features of the disease such as formation of Rosenthal

fibers Unfortunately mouse CSF poses particular problems for proteomic studies and there is

76

an urgent need for technical improvements for dealing with this fluid For instance collection

from an adult mouse typically yields ~10 μL of CSF often with some contamination by blood12

To further complicate analysis CSF has an exceedingly low protein content (~04 μgμL) with

over 60 of the total protein content consisting of a single protein albumin13 14

A number of

techniques have been developed to remove albumin from biological samples including Cibacron

Blue15

IgG immunodepletion16

and IgY immunodepletion17-19

IgY which is avian in origin

offers reduced non-specific binding and increased avidity when compared to IgG antibodies from

rabbits goats and mice20-23

One widely used IgY cocktail is IgY-14 which contains fourteen

specific antibodies specific for albumin IgG transferrin fibrinogen α1-antitrypsin IgA IgM

α2-macroglobulin haptoglobin apolipoproteins A-I A-II and B complement C3 and α1-acid

glycoprotein Since existing protocols for IgY-14 depletion are optimized for use with large

volumes of serum new protocols must be developed to permit its use with the low volumes of a

low protein fluid represented by mouse CSF

Various improvements have also taken place in the field of proteomic analysis that could

facilitate analysis of mouse CSF Data dependent tandem mass spectrometry followed by

quantification of proteins is used in standard shotgun proteomics24-29

Several methods now exist

for introducing quantitation into mass spectrometry including stable isotope labeling30-32

isobaric tandem mass tags33 34

and spectral counting35 36

Spectral counting which is a

frequency measurement that uses MSMS counts of identified peptides as the metric to enable

protein quantitation is attractive because it is label-free and requires no additional sample

preparation Finally recent advances in spectral counting has produced a data refinement

strategy termed normalized spectral abundance factor (NSAF)37 38

and further developed into

distributive NSAF (dNSAF) to address issues with peptides shared by multiple proteins39

77

To identify potential biomarkers in AxD we report a novel scaled-down version of IgY

antibody depletion strategy to reduce the complexity and remove high abundance proteins in

mouse CSF samples The generated spectral counts data were then subjected to dNSAF natural

log data transformation and t-test analysis to determine which proteins differ in abundance when

comparing GFAP transgenics and controls with multiple biological and technical replicates

EXPERIMENTAL DETAILS

Chemicals

Acetonitrile (HPLC grade) urea (gt99) sodium chloride (997) and ammonium

bicarbonate (certified) were purchased from Fisher Scientific (Fair Lawn NJ) Deionized water

(182 MΩcm) was prepared with a Milli-Q Millipore system (Billerica MA) Optima LCMS

grade acetonitrile and water were purchased from Fisher Scientific (Fair Lawn NJ) DL-

Dithiothreitol (DTT) and sequencing grade modified trypsin were purchased from Promega

(Madison WI) Formic acid (ge98) was obtained from Fluka (Buchs Switzerland)

Iodoacetamide (IAA) tris(hydroxymethyl)aminomethane (Tris ge999 ) ammonium formate

(ge99995) glycine (ge985) and IgY-14 antibodies were purchased from Sigma-Aldrich

(Saint Louis MO)

Mice

Transgenic mice over-expressing the human GFAP gene (Tg737 line) were maintained

as hemizygotes in the FVBN background and genotyped via PCR on DNA prepared from tail

samples as described previously40

The mice were housed on a 14-10 light-dark cycle with ad

libitum access to food and water All procedures were conducted using protocols approved by

the UW-Madison IACUC

78

CSF collection

CSF was collected from mice as described previously12

Briefly mice were anesthetized

with avertin (400-600 mgkg ip) A midline sagittal incision was made over the dorsal aspect

of the hindbrain and three muscle layers carefully peeled back to expose the cisterna magna The

membrane covering the cisterna magna was pierced with a 30 gauge needle and CSF was

collected immediately using a flexible plastic pipette Approximately ten microliters of CSF was

collected per animal All samples used for MS analysis showed no visible contamination of

blood

Enzyme-linked immunosorbent assay (ELISA)

A sandwich ELISA was used to quantify GFAP8 Briefly a microtiter plate was coated

with a cocktail of monoclonal antibodies (Covance SMI-26R) Plates were blocked with 5

milk before addition of sample or standards diluted in PBS with Triton-X and BSA A rabbit

polyclonal antibody (DAKO Z334) was used to detect the GFAP followed by a peroxidase

conjugated anti-rabbit IgG antibody (Sigma A6154) secondary antibody The peroxidase activity

was detected with SuperSignal ELISA Pico Chemiluminescent Substrate (PIERCE) and

quantified with a GloRunner Microplate Luminometer Values below the biological limit of

detection (16ngL) were given the value 16ngL before multiplying by the dilution factor

Immunodepletion of abundant proteins

Currently there are no commercial immunodepletion products available for use with CSF

and to address the low protein content of this fluid (~04 microgmicroL) Therefore 100 microL of

purchased IgY-14 resin was coupled with a Pierce Spin Cup using a paper filter from Thermo

Scientific (Waltham MA) CSF samples from either transgenics or controls were first pooled to

100 microL and then added to 100 microL of 20 mM Tris-HCl 300 mM pH 74 (2x dilution buffer) and

79

allowed to mix with the custom IgY-14 depletion spin column on an end-over-end rotor for 30

minutes at 4oC The sample was then centrifuged at 04 rcf for 45 seconds with an Eppendorf

Centrifuge 5415D (Hamburg Germany) The antibodies were then washed with 50 microL 1x

dilution buffer vortexed for 5 minutes centrifuged at 04 rcf for 45 seconds and the flow through

was collected for tryptic digestion The antibodies were then stripped of the bound proteins with

four 025 mL washes of 01 M glycine pH 25 neutralized with two 025 mL washes of 01 M

Tris-HCl pH 80 and washed once with 025mL of 1x dilution buffer The immunodepletion

protocol was repeated for each transgenic or control pooled 100 microL sample (N=3)

Preparation of tryptic digests

The immunodepleted pooled mouse CSF samples (200 microL total volume) were

concentrated to 10 microL using a Thermo Scientific Savant SpeedVac (SVC100 Waltham MA)

To each sample 8 microL of 133 M urea and 1 microL of 050 M DTT were added and allowed to

incubate at 37oC for one hour After incubation 27 microL of 055 M IAA was added for

carboxymethylation and the sample was allowed to incubate for 15 minutes in the dark To

quench the IAA 1 microL of 050 M DTT was added and allowed to react for 10 minutes To

perform trypsin digestion 70 microL of 50 mM NH4HCO3 was then added along with 025 microg

trypsin which had previously been dissolved in 50 mM acetic acid at a concentration of 05

microgmicroL Digestion was performed overnight at 37oC and quenched by addition of 25 microL of 10

formic acid The tryptic peptides were then subjected to solid phase extraction using a Varian

Omix Tip C18 100 microL (Palo Alto CA) Peptides were eluted with 50 ACN in 01 formic

acid concentrated and reconstituted in 30 microL H2O in 01 formic acid

RP nanoLC separation

80

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN Injections consisted of 5 microL of prepared CSF tryptic peptides onto an Agilent

Technologies Zorbax 300 SB-C18 5 microm 5x03 mm trap cartridge (Santa Clara CA) at a flow

rate of 5 microLmin for 5 minutes at 95 A 5 B Separation was performed on a Waters 3 microm

Atlantis dC18 75 microm x 150 mm (Milford MA) using a gradient from 5 to 45 mobile phase B

at 250 nLmin over 90 minutes at room temperature Emitter tips were pulled from 75 microm inner

diameter 360 microm outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using

an in-house model P-2000 laser puller (Sutter Instrument Co Novato CA)

Mass spectrometry data acquisitions

An ion trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with an on-line nanospray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Eksigent nanoLC software (Dublin CA Version 30 Beta Build 080715) to MS acquisition for

all experiments Smart parameter setting (SPS) was set to 700 mz compound stability and trap

drive level was set at 100 Optimization of the nanospray source resulted in dry gas

temperature of 125oC dry gas flow of 40 Lmin capillary voltage of -1300 V end plate offset of

-500 V MSMS fragmentation amplitude of 10V and SmartFragmentation set at 30-300

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms one spectral average rolling average 2 acquisition

81

range of 300-1500 mz and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS

generation the ICC target was set to 300000 maximum accumulation time 5000 ms two

spectral averages acquisition range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required for spectral counting to prevent

loss of spectra Identification of peptides were performed using Mascot41

(Version 23 Matrix

Science London UK) Database searching was performed against SwissProt mus musculus

(house mouse) database (version 575) False positive analyses42

were calculated using an

automatic decoy option of Mascot Results from the Mascot results were reported using

Proteinscape 21 and technical replicates were combined and reported as a protein compilation

using ProteinExtractor (Bruker Daltonics Bremen Germany)

Mascot search parameters were as follows Allowed missed cleavages 2 enzyme

trypsin variable modifications carboxymethylation (C) and oxidation (M) peptide tolerance

plusmn12 Da maximum number of 13

C 1 MSMS tolerance plusmn05 Da instrument type ESI-Trap

Reported proteins required a minimum of one peptide with a Mascot score ge300 with bold red

characterization Spectral counts were determined from the number of MSMS spectra identified

from accepted proteins A bold red peptide combines a bold peptide which represents the first

query result from a submitted MSMS spectrum with the red peptide which indicates the top

peptide for the identified protein Requiring one bold red peptide assists in removal of

homologous redundant proteins and further improves protein results In addition requiring one

82

peptide to be identified by a score gt300 removes the ability for proteins to be identified by

multiple low Mascot scoring peptides

Each immunodepleted biological replicate had technical triplicates performed and the

technical triplicates were summed together by ProteinExtractor Peptide spectral counts were

then summed for each protein and subjected to dNSAF analysis Details for this method can be

found elsewhere37 39

but briefly peptide spectral counts are summed per protein (SpC) based on

unique peptides and a weighted distribution of any shared peptides with homologous proteins

ProteinScape removed 83 homologous proteins found in the current study to bring the total

number of proteins identified to 266 but some non-unique homologous peptides which are

shared by multiple proteins are still present in the resulting 266 remaining proteins To address

these non-unique homologous peptides distributive spectral counting was performed as

described elsewhere39

The dSpC is divided by the proteinrsquos length (L) and then divided by the

summation of the dSpCL from all proteins in the experiment (N) to produce each proteinrsquos

specific dNSAF value

N

i

i

kk

LdSpC

LdSpCdNSAF

1

)(

)()(

The resulting data were then transformed by taking the natural log of the dNSAF value The

means for each protein were calculated using the ln(dNSAF) from N=3 biological replicates and

the whole data set was subjected to the Shapiro-Wilk test for normalityGaussian distribution

performed on the software PAST (Version 198 University of Oslo Norway Osla) The

Shapiro-Wilk test was repeated multiple times to determine a non-zero value for zero spectral

83

counts A non-zero value is required to alleviate the errors of dividing by zero which was

experimentally determined to be 043 The Gaussian data were then subjected to the t-test to

identify statistically significant changes in protein expression

RESULTS AND DISCUSSION

General workflow

Individual CSF samples were manually inspected and samples were only selected that

showed no visual blood contamination Preliminary experiments showed that the maximum

degree of blood contamination estimated from counts of red blood cells in the CSF that was not

visible to the eye was less than 005 by total mass Approximately 10 individual mouse CSF

samples were pooled to achieve the desired 100 μL volume for a single biological replicate The

CSF samples were IgY immunodepleted followed by digestion with trypsin The resulting

digested samples were then de-salted concentrated reconstituted in 30 μL of 01 formic acid

and 5 μL was loaded onto a reversed phase nanoflow HPLC column subjected to a 90 minute

gradient and infused into the amaZon ETD ion trap mass spectrometer The workflow for

mouse CSF analysis is shown in Figure 1 The remainder of the desalted sample was saved for

technical replicates

Immunodepletion for CSF

Currently there are no immunodepletion techniques specifically designed for CSF

Nonetheless the protein profiles between CSF and serum are similar enough to use currently

available immunodepletion techniques designed for serum as a starting point The smallest

commercially available IgY-14 immunodepletion kit is for 15 μL of serum which is equivalent in

protein content to ~15 mL of CSF Therefore for 100 μL CSF one fifteenth of the total IgY-14

84

beads would be used for the equivalent immunodepletion which is 66 μL of proprietary bead

slurry The potential for irreversible binding of abundant proteins to their respective IgY

antibody even after an extra stripping wash and low amounts of total beads made using 66 μL

of IgY-14 bead slurry unrealistic In practice almost doubling the amount of IgY-14 beads (100

μL) used was not sufficient and resulted in albumin and transferrin peptides to be observed in

high abundance (data not shown) The most important protein to immunodeplete is albumin and

it has been reported to be a greater percentage of total CSF protein content (~60) than serum

(~49) in humans14

The difference in albumin percentage supports the results that proprietary

blends of immunodepletion beads for high abundance proteins such as albumin cannot be

scaled down on a strict protein scale and further modifications to the serum immunodepletion

protocol need to be made

Since IgY-14 beads were developed for use with serum all of its protocols need to be

taken into account to modify the protocol for CSF Serum samples should be diluted fifty times

before mixing with the IgY-14 beads but CSF protein concentration is at least one hundred times

lower than serum Therefore CSF is below half the recommended diluted protein concentration

for IgY immunodepletion Consequently multiple steps have been devised to address this

limitation First the binding time between the proteins targeted for removal from the CSF and

IgY-14 beads was doubled during the end-over-end rotor depletion step from the recommended

15 minutes to 30 minutes Second to prevent non-specific protein binding to the antibodies the

CSF samples were diluted with an equal amount (100 μL) of 2X dilution buffer The dilution

buffer of NaCl and Tris-HCl was needed to assist in reducing non-specific binding of proteins to

the 14 antibodies and ensuring the sample is held at physiological pH In addition to these

modifications a doubling of the starting IgY-14 bead slurry to 200 μL provided the desired

85

results Overall this modified protocol results in effective depletion of CSF abundant proteins

using only one-fifth of the antibodies provided by the smallest commercially available platform

Data Analysis

Spectral counting technique for relative quantitation provides numerous benefits for the

study of mouse CSF due to its label-free advantage In contrast isotopic labeling method often

involves additional sample processing that could cause sample loss which is highly undesirable

for low protein content and low volume samples Labeling methods also require a mixing of two

sets of isotopically labeled samples which would effectively increase the sample complexity and

reduce the amount of sample that can be loaded onto the nanoLC column by half In addition

more than two sets of samples can be compared by label-free methods The use of label-free

spectral counting method does not lead to an increase in sample complexity or interference in

quantitation from peptides in the mz window selected for tandem MS Using spectral counting

for relative quantitation however is dependent on reproducible HPLC separation and careful

mass spectrometry operation to minimize technical variability during the study To address

concerns of analytical reliability and run to run deviations base peak chromatograms from two

transgenic IgY-14 immunodepleted biological replicates including two technical replicates of

each were shown to be highly reproducible (Figure 2)

Each biological sample was analyzed in triplicate with the same protocols on the amaZon

ETD with three control and three transgenic samples From the three technical replicates for

each biological replicate the spectral counts of the peptides for the proteins identified were

summed The results from these mouse CSF biological triplicates are shown in Figure 3A for

GFAP overexpressor and Figure 3B for control The summation of spectral counts for each

biological replicate was performed to remove the inherent bias related to data dependent analysis

86

for protein identification One concern in grouping technical replicates is a potential loss of

information regarding analytical variability Figure 4 provides a graphical representation of

variability of technical replicates illustrating the standard deviation of technical replicates with

error bars Figure 4 shows that a statistically changed protein creatine kinase M (A) and an

unchanged protein kininogen-1 (B) have similar variability within (technical replicates) and

between samples (biological replicates) for each protein In addition Figure 4B illustrates that

even with the variability of kininogen-1 the resulting mean shown by the dashed line of control

and transgenic samples were almost equal whereas Figure 4A shows significantly different

expression level of creatine kinase M Performing replicate analysis of each biological sample

(n=3) helps correct for systematic errors deriving from sample handling and pooling of samples

helps reduce random error during the CSF sample collection process

Protein Identification and Spectral Counting Analysis

The data for dNSAF analysis like any mass spectrometry proteomics experiment

requires multiple layers of verification to ensure reliable data Our initial protein identifications

were subjected to a database search using a decoy database from Mascot which resulted in an

average false positive rate below 1 for all the experimental data collected Representative

MSMS spectra between control and GFAP transgenic samples are illustrated in Figure 5

Overall 266 proteins were identified in a combination of control and transgenic samples

(Supplemental Table 1) A total of 349 proteins were initially identified but 83 proteins were

isoforms of previously identified proteins and automatically excluded by ProteinExtractor The

next level of quality control was to only include ln(dNSAF) values from proteins identified by 2

or more unique peptides having a Mascot score of ge300 and observed in two out of three

biological replicates These selection parameters resulted in 106 proteins remaining after

87

dNSAF analysis (Supplemental Table 2) The resulting spectral counts were then subjected to

dSpC in order to account and correct for the systematic error of peptides shared by multiple

proteins (Supplemental Table 3)

It is inevitable in large scale and complex proteomics experiments that some proteins will

be seen in some samples and not others In addition when controls were compared to transgenic

samples as shown in Figure 3C 127 proteins were unique to either control or GFAP transgenic

mice and 139 proteins were seen in both samples From dNSAF equation if the spectral count

is zero the numerator is zero and the value will not be normalized between runs In order to

circumvent the zero spectral counts in dNSAF analysis zero spectral counts must be replaced by

an experimentally determined non-zero value determined to be 043 The 043 spectral counts

for zero spectral counts was calculated by serial Shapiro-Wilk tests to determine the lowest value

(043) for zero spectral counts and maintain a Gaussian distribution for all data sets The 043

value for zero spectral counts in the current study was higher than the 016 reported value for

zero spectral counts in the original NSAF spectral counting study37

Our study may have a

higher zero spectral count value than the previous study because the spectral counting data were

an addition of three technical replicates and three times 016 is close to 043 The normalized

Gaussian data were then subjected to t-test analysis and P-values below 005 are considered as

statistically significant and are presented in Table 1 The proteins with significant up or down

regulation from Table 1 can be further evaluated as how close significant proteins were to a p-

value of 005 such as ganglioside GM2 activator serine protease inhibitor A3N and collagen

alpha-2(I) chain having t-test scores of 0045 0047 and 0043 respectively Proteins exhibiting

a P-value close to 005 were more likely to be highly variable proteins or have smaller fold

changes between control and transgenic samples and thus provide less biological relevancy to

88

future studies The modest change in antithrobin-III which is reduced by 14-fold in transgenic

is included due a low pooled standard deviation in spectral counts

Spectral counting has been analyzed with fold changes derived directly from the average

spectral counts from the technical replicates and then the average of the three biological

replicates We decided to perform additional analysis using fold changes to dig deeper into

proteins that statistically support the null hypothesis from the dNSAF analysis To only pull out

highly confident protein identifications we used the same strict cut-off of two unique peptides

identified per protein as in dNSAF analysis We only accepted proteins with greater than three-

fold change in total spectral counts listed in Table 2 Two proteins contactin-1 (cntn1) and

cathepsin B (CB) illustrate the potential biomarkers missed by dNSAF analysis Cntn1 had zero

spectral count in the transgenic sample and had an average spectral count of 41 in control

samples The lack of any spectral counts in one biological control for cntn1 resulted in a large

standard deviation in the ln(dNSAF) means for the control which resulted in the t-test supporting

the null hypothesis Another example is CB which was detected by numerous spectral counts in

every GFAP transgenic biological replicate with an average spectral count of 97 (Table 2) The

presence of CB in one biological control sample (23 average spectral counts) resulted in a high

standard deviation in the mean of the control samples These examples exhibit a limitation of

dNSAF analysis which could cause a loss of potentially useful information

Previously Identified Proteins with Expression Changes

Previously three proteins have been described as increased in CSF from individual(s)

suffering from AxD In one study a single patientrsquos CSF was found to have elevated levels of

αβ-crystallin and HSP2744

In a second study three patients were reported to have elevated

levels of GFAP with concentrations from 4760 ngL to 30000 ngL (compared to lt175 ngL for

89

controls)11

GFAP was detected in our current study however the other two proteins were not

detected One possibility is that αB-crystallin and Hsp-27 levels in mouse CSF are too low for

detection by MS analysis In addition while the transgenic mice display the hallmark

pathological feature of AxD in the form of Rosenthal fibers they do not have any evident

leukodystrophy and thus may not display the full range of changes in CSF as might be found in

human patients

Creatine Kinase M

Creatine kinase M (M-CK) is a 43 kDa protein whose role is to reversibly catalyze

phosphate transfer between ATP and energy storage compounds M-CK has been primarily

found in muscle tissue and for humans creatine kinase M is diagnostically analyzed in the blood

for myocardial infarction (heart attack) In addition M-CK is also found in Purkinje neurons of

the cerebellum45 46

A related protein creatine kinase B (B-CK) also exhibited an apparent 21

fold increase in transgenic CSF over control but this difference was not statistically different

B-CK concentration is known to be elevated in CSF following head trauma47

or cerebral

infarction48

but decreased in astrocytes in individuals affected by multiple sclerosis49

Cathepsin

The data showed multiple cathepsins were up regulated in the CSF of transgenic mice

when compared to control mice The up regulated cathepsins were S L1 and B isoforms which

are all cysteine proteases Cathepsin S (CS) was never observed in control samples but

observed with an average of 73 spectral counts per analysis Cathepsin L1 (CL1) was up

regulated by 94 times in transgenic mice These two cathepsins exhibited significant changes

using dNSAF and t-test statistics as shown in Table 1 and Cathepsin B (CB) showed a 42 fold

increase in transgenic CSF as shown in Table 2

90

Cathepsins regulate apoptosis in cells50

which is the major mechanism for elimination of

cells deemed by the organism to be dangerous damaged or expendable CL and CB are

redundant in the system as thiol proteases and inhibition of CB by CA-047 causes a diminished

apoptosis response in multiple cell lines51

Intriguingly increased levels of CB or CL are

correlated with poor prognosis for cancer patients and shorter disease-free intervals It is

believed that these proteases degrade the extracellular membrane which allows tumor cells to

invade adjacent tissue and metastasize52

With regards to AxD the up regulation of these

cathepsins may be indicative of the bodyrsquos natural defense and response to Rosenthal fibers

Thus stimulation of these cathepsins may provide a further protective stress response but the

positive correlation between these proteases and cancer highlights the multiple roles of these

proteins in pathological response Alternatively it has been shown that increased CB is involved

with the tumor necrosis factor α (TNFα) induced apoptosis cascade53

The activation of the

TNFα could produce oligodendrocyte toxicity54

with the expression of TNFα being elevated in

tissue samples from mouse models and AxD patients55

The potential for a positive or a negative

effect in increasing or decreasing cathepsins warrant future research with cathepsins and AxD

Contactin-1

Contactin-1 (Cntn1) is a 133 kDa cell surface protein that is highly glycosylated and

belongs to a family of immunoglobulin domain-containing cell adhesion molecues56

Table 2

shows that Cntn1 is down regulated in transgenic mice since spectral counts were only observed

in control mice Cntn1 is expressed in neurons and oligodendrocytes and higher levels were

observed during brain development57

In addition Cntn1 leads to activation of Notch1 which

mediates differentiation and maturation of oligodendrocyte precursor cells (OPC) Although the

mechanism leading to a reduction of Cntn1 in CSF is not known it may reflect a change in

91

astrocyte interactions with either neurons or oligodendrocytes to alter their expression of this

protein

Validation of putative biomarkers and MS proteomics data using ELISA and RNA

microarray data

To further validate the relative protein expression data obtained via MS-based spectral

counting techniques orthogonal immunological and molecular biological approaches have been

examined As GFAP was shown to be elevated in the CSF of transgenic animals we used a

well-defined GFAP ELISA protocol to test its CSF concentration CSF from 8 week old male

mice was collected from both transgenic and control animals Five samples of transgenic CSF

was prepared by pooling four to eight animals for the GFAP ELISA In the case of controls

each sample represents a single animal GFAP concentrations observed by both the MS and

ELISA showed significant increases of GFAP in the CSF when comparing transgenic to control

animals

Another validation of MS spectral counts is observed in a microarray analysis performed

on transgenic mouse olfactory bulb tissue 55

In this paper nine of the proteins found by MS

showed similar gene expression in the microarray (Table 3) It is not surprising that all the genes

observed in the microarray are not the same as the proteins observed by MS analysis Gene

expression and protein synthesis and expression are not always correlated but the similarities

and overlapping trends observed with these two assays are encouraging As shown in Table 3

gene expression analysis also revealed the up regulation of several cathepsin proteins GFAP

and down regulation of contactin-1 in CSF collected from transgenic mice corroborating the

MS-based proteomics results

92

CONCLUSIONS

In this study we have produced a panel of proteins with significant up or down regulation

in the CSF of transgenic GFAP overexpressor mice This mouse model for AxD was consistent

with the previous studies showing elevation of GFAP in CSF The development of a modified

IgY-14 immunodepletion technique for low amounts of CSF was presented This improved

protocol is useful for future investigations to deal with the unique challenges of mouse CSF

analysis Modified proteomics protocols were employed to profile mouse CSF with biological

and technical triplicates addressing the variability and providing quantitation with dNSAF

spectral counting Validation of the MS-based proteomics data were performed using both

ELISA and RNA microarray data to provide further confidence in the changes in the putative

protein biomarkers This study presents three classes of interesting targets for future study in

AxD including CK-MCK-B cathepsin S L1 and B isoforms and contactin-1

93

REFERENCES

1 Brenner M Johnson A B Boespflug-Tanguy O Rodriguez D Goldman J E

Messing A Mutations in GFAP encoding glial fibrillary acidic protein are associated with

Alexander disease Nat Genet 2001 27 (1) 117-20

2 Herndon R M Rubinstein L J Freeman J M Mathieson G Light and electron

microscopic observations on Rosenthal fibers in Alexanders disease and in multiple sclerosis J

Neuropathol Exp Neurol 1970 29 (4) 524-51

3 Alexander W S Progressive fibrinoid degeneration of fibrillary astrocytes associated

with mental retardation in a hydrocephalic infant Brain 1949 72 (3) 373-81 3 pl

4 Iwaki T Kume-Iwaki A Liem R K Goldman J E Alpha B-crystallin is expressed

in non-lenticular tissues and accumulates in Alexanders disease brain Cell 1989 57 (1) 71-8

5 Head M W Goldman J E Small heat shock proteins the cytoskeleton and inclusion

body formation Neuropathol Appl Neurobiol 2000 26 (4) 304-12

6 Messing A Daniels C M Hagemann T L Strategies for treatment in alexander

disease Neurotherapeutics 7 (4) 507-15

7 Tang G Yue Z Talloczy Z Hagemann T Cho W Messing A Sulzer D L

Goldman J E Autophagy induced by Alexander disease-mutant GFAP accumulation is

regulated by p38MAPK and mTOR signaling pathways Hum Mol Genet 2008 17 (11) 1540-

55

8 Hagemann T L Boelens W C Wawrousek E F Messing A Suppression of GFAP

toxicity by alphaB-crystallin in mouse models of Alexander disease Hum Mol Genet 2009 18

(7) 1190-9

9 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C

Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome

Res 2008 7 (1) 386-99

10 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from

patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma

biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 878 (22) 2003-12

11 Kyllerman M Rosengren L Wiklund L M Holmberg E Increased levels of GFAP

in the cerebrospinal fluid in three subtypes of genetically confirmed Alexander disease

Neuropediatrics 2005 36 (5) 319-23

12 DeMattos R B Bales K R Parsadanian M ODell M A Foss E M Paul S M

Holtzman D M Plaque-associated disruption of CSF and plasma amyloid-beta (Abeta)

equilibrium in a mouse model of Alzheimers disease J Neurochem 2002 81 (2) 229-36

13 Wong M Schlaggar B L Buller R S Storch G A Landt M Cerebrospinal fluid

protein concentration in pediatric patients defining clinically relevant reference values Arch

Pediatr Adolesc Med 2000 154 (8) 827-31

14 Roche S Gabelle A Lehmann S Clinical proteomics of the cerebrospinal fluid

Towards the discovery of new biomarkers PROTEOMICS ndash Clinical Applications 2008 2 (3)

428-436

15 Li C Lee K H Affinity depletion of albumin from human cerebrospinal fluid using

Cibacron-blue-3G-A-derivatized photopatterned copolymer in a microfluidic device Anal

Biochem 2004 333 (2) 381-8

94

16 Maccarrone G Milfay D Birg I Rosenhagen M Holsboer F Grimm R Bailey

J Zolotarjova N Turck C W Mining the human cerebrospinal fluid proteome by

immunodepletion and shotgun mass spectrometry ELECTROPHORESIS 2004 25 (14) 2402-

2412

17 Huang L Harvie G Feitelson J S Gramatikoff K Herold D A Allen D L

Amunngama R Hagler R A Pisano M R Zhang W W Fang X Immunoaffinity

separation of plasma proteins by IgY microbeads meeting the needs of proteomic sample

preparation and analysis Proteomics 2005 5 (13) 3314-28

18 Rajic A Stehmann C Autelitano D J Vrkic A K Hosking C G Rice G E Ilag

L L Protein depletion using IgY from chickens immunised with human protein cocktails Prep

Biochem Biotechnol 2009 39 (3) 221-47

19 Huang L Fang X Immunoaffinity fractionation of plasma proteins by chicken IgY

antibodies Methods Mol Biol 2008 425 41-51

20 Greunke K Braren I Alpers I Blank S Sodenkamp J Bredehorst R Spillner E

Recombinant IgY for improvement of immunoglobulin-based analytical applications Clin

Biochem 2008 41 (14-15) 1237-44

21 Xiao Y Gao X Taratula O Treado S Urbas A Holbrook R D Cavicchi R E

Avedisian C T Mitra S Savla R Wagner P D Srivastava S He H Anti-HER2 IgY

antibody-functionalized single-walled carbon nanotubes for detection and selective destruction

of breast cancer cells BMC Cancer 2009 9 351

22 Liu T Qian W J Mottaz H M Gritsenko M A Norbeck A D Moore R J

Purvine S O Camp D G 2nd Smith R D Evaluation of multiprotein immunoaffinity

subtraction for plasma proteomics and candidate biomarker discovery using mass spectrometry

Mol Cell Proteomics 2006 5 (11) 2167-74

23 Hinerfeld D Innamorati D Pirro J Tam S W SerumPlasma depletion with

chicken immunoglobulin Y antibodies for proteomic analysis from multiple Mammalian species

J Biomol Tech 2004 15 (3) 184-90

24 Metz T O Qian W J Jacobs J M Gritsenko M A Moore R J Polpitiya A D

Monroe M E Camp D G 2nd Mueller P W Smith R D Application of proteomics in

the discovery of candidate protein biomarkers in a diabetes autoantibody standardization

program sample subset J Proteome Res 2008 7 (2) 698-707

25 Ru Q C Zhu L A Silberman J Shriver C D Label-free semiquantitative peptide

feature profiling of human breast cancer and breast disease sera via two-dimensional liquid

chromatography-mass spectrometry Mol Cell Proteomics 2006 5 (6) 1095-104

26 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S

Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-

dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of

Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66

27 Rao P V Reddy A P Lu X Dasari S Krishnaprasad A Biggs E Roberts C T

Nagalla S R Proteomic identification of salivary biomarkers of type-2 diabetes J Proteome

Res 2009 8 (1) 239-45

28 Yu K H Barry C G Austin D Busch C M Sangar V Rustgi A K Blair I A

Stable isotope dilution multidimensional liquid chromatography-tandem mass spectrometry for

pancreatic cancer serum biomarker discovery J Proteome Res 2009 8 (3) 1565-76

29 Aebersold R Mann M Mass spectrometry-based proteomics Nature 2003 422

(6928) 198-207

95

30 Ong S E Blagoev B Kratchmarova I Kristensen D B Steen H Pandey A

Mann M Stable isotope labeling by amino acids in cell culture SILAC as a simple and

accurate approach to expression proteomics Mol Cell Proteomics 2002 1 (5) 376-86

31 Hsu J L Huang S Y Chow N H Chen S H Stable-isotope dimethyl labeling for

quantitative proteomics Anal Chem 2003 75 (24) 6843-52

32 Liu H Sadygov R G Yates J R 3rd A model for random sampling and estimation

of relative protein abundance in shotgun proteomics Anal Chem 2004 76 (14) 4193-201

33 Xiang F Ye H Chen R Fu Q Li L NN-dimethyl leucines as novel isobaric

tandem mass tags for quantitative proteomics and peptidomics Anal Chem 82 (7) 2817-25

34 Ross P L Huang Y N Marchese J N Williamson B Parker K Hattan S

Khainovski N Pillai S Dey S Daniels S Purkayastha S Juhasz P Martin S Bartlet-

Jones M He F Jacobson A Pappin D J Multiplexed protein quantitation in

Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents Mol Cell Proteomics

2004 3 (12) 1154-69

35 Zybailov B Coleman M K Florens L Washburn M P Correlation of relative

abundance ratios derived from peptide ion chromatograms and spectrum counting for

quantitative proteomic analysis using stable isotope labeling Anal Chem 2005 77 (19) 6218-

24

36 Old W M Meyer-Arendt K Aveline-Wolf L Pierce K G Mendoza A Sevinsky

J R Resing K A Ahn N G Comparison of label-free methods for quantifying human

proteins by shotgun proteomics Mol Cell Proteomics 2005 4 (10) 1487-502

37 Zybailov B Mosley A L Sardiu M E Coleman M K Florens L Washburn M

P Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae J

Proteome Res 2006 5 (9) 2339-47

38 Mosley A L Florens L Wen Z Washburn M P A label free quantitative

proteomic analysis of the Saccharomyces cerevisiae nucleus J Proteomics 2009 72 (1) 110-20

39 Zhang Y Wen Z Washburn M P Florens L Refinements to label free proteome

quantitation how to deal with peptides shared by multiple proteins Anal Chem 82 (6) 2272-81

40 Messing A Head M W Galles K Galbreath E J Goldman J E Brenner M

Fatal encephalopathy with astrocyte inclusions in GFAP transgenic mice Am J Pathol 1998

152 (2) 391-8

41 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

42 Elias J E Gygi S P Target-decoy search strategy for increased confidence in large-

scale protein identifications by mass spectrometry Nat Methods 2007 4 (3) 207-14

43 You J S Gelfanova V Knierman M D Witzmann F A Wang M Hale J E The

impact of blood contamination on the proteome of cerebrospinal fluid Proteomics 2005 5 (1)

290-6

44 Takanashi J Sugita K Tanabe Y Niimi H Adolescent case of Alexander disease

MR imaging and MR spectroscopy Pediatr Neurol 1998 18 (1) 67-70

45 Hemmer W Wallimann T Functional Aspects of Creatine Kinase in Brain

Developmental Neuroscience 1993 15 (3-5) 249-260

46 Hemmer W Zanolla E Furter-Graves E M Eppenberger H M Wallimann T

Creatine kinase isoenzymes in chicken cerebellum specific localization of brain-type creatine

96

kinase in Bergmann glial cells and muscle-type creatine kinase in Purkinje neurons Eur J

Neurosci 1994 6 (4) 538-49

47 Nordby H K Tveit B Ruud I Creatine kinase and lactate dehydrogenase in the

cerebrospinal fluid in patients with head injuries Acta Neurochirurgica 1975 32 (3) 209-217

48 Bell R D Alexander G M Nguyen T Albin M S Quantification of cerebral

infarct size by creatine kinase BB isoenzyme Stroke 1986 17 (2) 254-60

49 Steen C Wilczak N Hoogduin J M Koch M De Keyser J Reduced Creatine

Kinase B Activity in Multiple Sclerosis Normal Appearing White Matter PLoS ONE 5 (5)

e10811

50 Chwieralski C Welte T Buumlhling F Cathepsin-regulated apoptosis Apoptosis 2006

11 (2) 143-149

51 Droga-Mazovec G Bojic L Petelin A Ivanova S Romih R Repnik U Salvesen

G S Stoka V Turk V Turk B Cysteine cathepsins trigger caspase-dependent cell death

through cleavage of bid and antiapoptotic Bcl-2 homologues J Biol Chem 2008 283 (27)

19140-50

52 Duffy M J Proteases as prognostic markers in cancer Clin Cancer Res 1996 2 (4)

613-8

53 Guicciardi M E Deussing J Miyoshi H Bronk S F Svingen P A Peters C

Kaufmann S H Gores G J Cathepsin B contributes to TNF-alpha-mediated hepatocyte

apoptosis by promoting mitochondrial release of cytochrome c J Clin Invest 2000 106 (9)

1127-37

54 Butt A M Jenkins H G Morphological changes in oligodendrocytes in the intact

mouse optic nerve following intravitreal injection of tumour necrosis factor J Neuroimmunol

1994 51 (1) 27-33

55 Hagemann T L Gaeta S A Smith M A Johnson D A Johnson J A Messing

A Gene expression analysis in mice with elevated glial fibrillary acidic protein and Rosenthal

fibers reveals a stress response followed by glial activation and neuronal dysfunction Hum Mol

Genet 2005 14 (16) 2443-58

56 Falk J Bonnon C Girault J A Faivre-Sarrailh C F3contactin a neuronal cell

adhesion molecule implicated in axogenesis and myelination Biol Cell 2002 94 (6) 327-34

57 Eckerich C Zapf S Ulbricht U Muumlller S Fillbrandt R Westphal M Lamszus

K Contactin is expressed in human astrocytic gliomas and mediates repulsive effects Glia

2006 53 (1) 1-12

97

Table 1 Statistically changed proteins between transgenic and control mouse CSF using

dNSAF analysis

Accession Protein Pa SC

b Fold

Changec

Control

dSpCd

Transgenic

dSpCd

KCRM_MOUSE Creatine kinase M-type 0034 425 30 182 541

HEXB_MOUSE Βeta-hexosaminidase subunit β 0012 183 uarr 0 59

CMGA_MOUSE Chromogranin-A 000078 153 darr 44 0

ANT3_MOUSE Antithrombin-III 0017 176 -14 67 47

SAP3_MOUSE Ganglioside GM2 activator 0045 415 darr 28 0

SPA3N_MOUSE Serine protease inhibitor A3N 0047 170 42 10 42

CO1A2_MOUSE Collagen alpha-2(I) chain 0043 56 darr 19 0

BLVRB_MOUSE Flavin reductase 00054 204 uarr 0 12

CATS_MOUSE Cathepsin S 00032 232 uarr 0 73

GFAP_MOUSE Glial fibrillary acidic protein 0021 107 uarr 0 21

RET4_MOUSE Retinol-binding protein 4 0040 189 darr 13 0

CBPE_MOUSE Carboxypeptidase E 0043 139 darr 11 0

CATL1_MOUSE Cathepsin L1 0015 87 94 02 19

The statistics are performed using the t-test from the ln(dNSAF) Gaussian data

a P p-value of the t-test where the null hypothesis states that there was no change in expression between

control and transgenic GFAP overexpressor b SC percentage of sequence coverage of the listed protein from

sequenced tryptic peptides c Fold Change positive values are indicative of a fold change for transgenic CSF

negative values are fold changes for control CSF (ie down-regulated in transgenic CSF) uarr indicates the protein

was only detected in transgenic CSF and darr indicates the protein was only detected in control CSF d dSpC

distributive spectral counts which represent the average spectral counts observed per run analysis on the mass

spectrometer and corrected using distributive analysis for peptides shared by more than one protein

98

Table 2 Proteins showing greater than three-fold changes with at least two unique

peptides identified for each protein

Accession Protein SC ()a Fold

Change b

Control

dSpC c

Transgenic

dSpC c

MUG1_MOUSE Murinoglobulin-1 precursor 147 42 155 37

CO4B_MOUSE Complement C4-B 113 54 22 118

PRDX6_MOUSE Peroxiredoxin-6 576 46 14 64

CNTN1_MOUSE Contactin-1 65 darr 41 0

CATB_MOUSE Cathepsin B 263 42 23 97

CAH3_MOUSE Carbonic anhydrase 3 396 76 11 84

APOH_MOUSE Beta-2-glycoprotein 1 128 44 44 1

CSF1R_MOUSE Macrophage colony-stimulating

factor 1 receptor

80 44 14 62

PGK1_MOUSE Phosphoglycerate kinase 1 355 36 17 61

NDKB_MOUSE Nucleoside diphosphate kinase B 408 34 13 44

FHL1_MOUSE

Four and a half LIM domains

protein 1 243 39 13 51

NELL2_MOUSE

Protein kinase C-binding protein

NELL2 45 -43 13 03

MDHM_MOUSE

Malate dehydrogenase

mitochondrial 385 41 12 49

CSF1R_MOUSE Macrophage colony-stimulating

factor 1 receptor

80 44 14 62

a SC percentage of sequence coverage of the listed protein from sequenced tryptic peptides b Fold

Change positive values are indicative of a fold change for transgenic CSF negative values are fold changes for

control CSF and darr indicates the protein was only detected in control CSF c dSpC distributive spectral counts

which represent the average spectral counts observed per run analysis on the mass spectrometer and corrected using

distributive analysis for peptides shared by more than one protein

99

Table 3 Validation of changes in proteins revealed by MS-based spectral counting

consistent with previously published microarray data

Consistent changes in RNA and proteomic data

uarr regulated in transgenic darr regulated in transgenic

Cathepsin S Contactin-1

Cathepsin B Carboxypeptidase E

Cathepsin L1

Peroxiredoxin-6

Complement C4-B

Glial fibrillary acidic protein

Serine protease inhibitor A3N

Note Validation of putative biomarkers from the current proteomics dataset by previously

published RNA microarray data55

Both up and down regulated proteins were consistent with the

RNA microarray data

_

100

___________________________________________

SUPPLEMENTAL INFORMATION (Available upon request)

Table S1 Compilation list of proteins identified from all the control and transgenic biological

replicates

Table S2 Distributive spectral counting calculations performed for proteins observed to share

identified peptides

Table S3 Proteins with dNSAF spectral counting performed including t-test analysis for a

comparison between transgenic and control CSF

101

FIGURE LEGENDS

Figure 1 The general workflow indicating the major steps involved in sample collection sample

processing mass spectrometric data acquisition and analysis of mouse CSF samples

Figure 2 Assessment of run to run variability of the base peak chromatograms within and

between two biological and technical replicates The peak profile and intensity scale is

consistent between the four chromatograms The four panels show two biological replicates (Tg

4 and Tg5) with two technical replicates for each biological sample

Figure 3 A Venn-diagram of the biological triplicates (1 2 and 3) of transgenic mouse

CSF showing 78 proteins identified in all three experiments B Venn-diagram of the biological

triplicates (1 2 and 3) of control mouse CSF having 61 proteins identified by all three

replicates C The overlap between control and transgenic CSF proteomic analysis showing 139

proteins identified by both groups and 73 and 54 uniquely identified by respective groups

Figure 4 Assessment of technical replicate variability between biological replicates The error

bars in both A and B are the standard deviation derived from the technical triplicates for each

biological replicate Panel A shows creatine kinase M having more or equal variability in the

biological triplicates than each technical triplicate The means of the biological triplicates are

illustrated by the dashed line Panel B represents kininogen-1 which is unchanged between

control (Ctl) and transgenic (Tg) samples The biological variability coupled with the technical

replicates provides a barely noticeable difference in the pooled mean between control and

102

transgenic spectral counts The difference in means is contrasted with the three fold change

observed from creatine kinase M (A)

Figure 5 MSMS profile of the tryptic peptide LSVEALNSLTGEFK from creatine kinase M

(A) The top MSMS is from a control sample with an elution at 463 minutes (B) The bottom

MSMS is from the GFAP transgenic sample with an elution at 46 minutes These tandem MS

spectra show instrument reliability and consistent fragmentation patterns which are necessary for

spectral counting analysis

Figure 6 Validation of MS spectral counts using a GFAP ELISA GFAP content (ngL)

measured within mouse CSF from both transgenic and control animals The data represents the

average with standard deviation (n=5 each sample represents pooling of 1 to 8 animals) The

statistics are performed using a student t-test plt00001

103

Figure 1

104

Figure 2

105

Figure3

106

Figure 4

107

Figure 5

108

Figure 6

Ctl Tg

100

1000

10000

100000

Mouse CSF Sample

GF

AP

(n

gL

)

109

Table of Contents Summary

Cerebrospinal fluid (CSF) was obtained from transgenic mouse models of Alexander disease as

well as healthy controls We immunodepleted pooled CSF from mice by a modified IgY-14

protocol Shotgun proteomics utilizing trypsin digestion 1-D nanoRP separation CID tandem

mass spectrometry analysis Mascot database searching and relative quantitation via distributive

normalized spectral abundance factor resulted in the identification of 266 proteins and 27

putative biomarkers

110

Chapter 4

Genomic and proteomic profiling of rat adapted scrapie

Adapted from ldquoGenomic and proteomic profiling of rat adapted scrapierdquo Herbst A

Cunningham R Wang J Wellner D Ma D Li L Aiken J In preparation

111

Abstract

A novel model of prion disease using rats called Rat Adapted Scrapie (RAS) was

developed with the genomics from brain and proteomics from cerebrospinal fluid (CSF) profiled

The CSF proteome from control and RAS (biological N=5 technical replicates N=3) were

digested and separated using one dimensional reversed-phase nanoLC coupled to data-

dependent tandem MS acquisition on an ion trap mass spectrometer In total 512 proteins 167

non-redundant protein groups and 1032 unique peptides were identified with a 1 false

discovery rate (FDR) Of the proteins identified 21 were statistically up-regulated (plt005) and

7 statistically down-regulated in the RAS CSF We also identified 1048 genes which were

differentially regulated in rat prion disease and upon mapping these changes to mouse gene

expression however only 22 of these genes were in common with mRNAs responding to

prion infection in mice suggesting that the molecular pathology observed in mice may not be

applicable to other species The proteins are compared to the differentially regulated genes as

well as to previously published proteins showing changes consistent with other prion animal

models

112

Introduction

Prion diseases are an unusual class of fatal transmissible neurodegenerative disorders

that affect the mammalian central nervous system They are caused by the accumulation of an

abnormal conformation of the normal host encoded cellular prion protein PrPC This

conformational rearrangement of PrPC is brought about by template directed misfolding wherein

seed molecules of the abnormal isoform PrPScrapie

PrPSc

convert PrPC into new PrP

Sc molecules

Scrapie in sheep and goats is the prototypic prion disease and is so named because clinically

affected animals scrape their wool off on pen and stall walls Laboratory investigation of prion

diseases typically relies upon rodents which can be infected with natural isolates of scrapie1

albeit with some difficulty as ovid scrapie isolates need time to adapt to rodents This adaptation

is characteristic of prion disease interspecies transmissions and properly reflects the molecular

adaptation that must occur to allow interaction between exogenous foreign PrPSc

and host PrPC

molecules selecting for conformers which exhibit template directed misfolding In some cases

no conformational solution is found reflecting a species barrier to disease transmission

In recent years advances in genomics and proteomics technologies have allowed

unprecedented examination of the biomolecules that are altered upon exposure to prion agents

These studies2 3

have relied upon analysis of gene and protein expression changes in response to

prion infection with the aim of trying to identify pathways that might underlie the mechanism of

prion-induced neurotoxicity A second important aim has been to identify signature molecules

that might act as surrogate biomarkers for these diseases as there are significant analytical

challenges associated with sensitively detecting and specifically distinguishing disease-induced

conformational changes (PrPSc

) of the prion protein from normal host conformations (PrPC)

113

Mass spectrometry (MS)-based proteomics has become a promising tool for biomarker

discovery from biological fluids such as CSF blood and urine4-6

Two-dimensional gel

electrophoresis MS (2D-GE MS) and two-dimensional differential gel electrophoresis (2D-DIGE

MS) are the most widely used methods for proteomic profiling of prion infected CSF so far due

to the advantage of ready separation and quantification of proteins in complex biological samples

Studies using 2D-GE MS or 2D-DIGE MS to profile proteins in CSF have led to the

identifications of currently accepted biomarker for sCJD 14-3-3 protein and other potential

biomarkers for prion diseases7-9

However the application of this method in biomarker

discovery is limited by insufficient sensitivity and potential bias against certain classes of

proteins as gel-based separation does not work well for the low abundance proteins very basic

or acidic proteins very small or large proteins and hydrophobic proteins 10 11

In contrast to 2D-

GE or 2D-DIGE shotgun proteomics employs enzymatic digestion of the protein samples

followed by chromatographic separation prior to introduction into a mass spectrometer for

tandem MS analyses Shotgun proteomics has become increasingly popular in proteomic

research because these methods are reproducible highly automated and have a greater

likelihood of detecting low abundance proteins12 13

Due to the sample complexity in CSF and

because albumin comprises over half of the protein content in CSF removal of high-abundance

proteins including albumin is necessary to improve proteomic coverage and identify low-

abundance proteins One method is IgY immunodepletion14 15

which is performed prior to LC-

MSMS and uses species specific antibodies from chicken yolk to remove abundant proteins in

biological samples such as CSF In the present work CSF from control and rat adapted scrapie

animal models (biological N=5 technical N=3 replicates) were analyzed by LC-MSMS and we

114

indentified 512 proteins and 167 non-redundant protein isoforms (referred to as protein groups)

with 21 proteins being statistically up-regulated (p lt 005) and 7 statistically down-regulated

By and large this work has been performed using laboratory mice for the gene

expression studies and human cerebrospinal fluid for proteomics mice do not provide sufficient

volumes of CSF for proteomic studies Both approaches are exceedingly valuable as the mouse

model allows cross-sectional time course experiments to be performed including the important

pre-clinical phase of disease Critically however the relevance and generalizability of mouse

prion responses to other prion diseases especially human disease is unknown Human proteomic

studies on CSF guarantee relevant protein identifications but are limited to the clinical phase of

the disease when apparent markers may reflect gross neurodegeneration covering up subtle but

more specific responses To address these issues we have adapted mouse RML prions into rats

with the aim of expanding the knowledge of prion disease responses addressing the limitations

of mice and opening up the possibility of preclinical proteomic profiling in a laboratory rodent

In the present work CSF samples from control and rat adapted scrapie were analyzed by system

biology approaches Here we show that Rat Adapted Scrapie (RAS) is a powerful tool for an -

omics based approach to decipher the molecular impact of prion disease in vivo with

applicability to the molecular mechanisms of disease and biomarker discovery We identified

1048 genes which were differentially regulated in rat prion disease Astonishingly on the whole

mouse orthologs of these genes did not change in mouse adapted scrapie and vice versa

questioning the universality of previous mouse gene expression profiles These RAS gene

expression changes were identified in the CSF proteome where we detected 512 proteins and 167

protein groups with 21 proteins being statistically up-regulated (plt005) and 7 statistically down-

115

regulated in the CSF of prion diseased rats Many of the proteins detected have previously been

observed in human CSF from CJD patients

Materials and Methods

Ethics Statement

This study was carried out in accordance with the recommendations in the NIH Guide for Care

and Use of Laboratory Animals and the guidelines of the Canadian Council on Animal Care The

protocols used were approved by the Institutional Animal Care and Use Committees at the

University of Wisconsin and University of Alberta

Chemicals

Acetonitrile (HPLC grade) and ammonium bicarbonate (certified) were purchased from

Fisher Scientific (Fair Lawn NJ) Deionized water (182 MΩcm) was prepared with a Milli-Q

Millipore system (Billerica MA) Optima LCMS grade acetonitrile and water were purchased

from Fisher Scientific (Fair Lawn NJ) DL-Dithiothreitol (DTT) and sequencing grade modified

trypsin were purchased from Promega (Madison WI) Formic acid (ge98) was obtained from

Fluka (Buchs Switzerland) Iodoacetamide (IAA) tris(hydroxymethyl)aminomethane (Tris

ge999 ) ammonium formate (ge99995) glycine (ge985) and IgY-7 antibodies were

purchased from Sigma-Aldrich (Saint Louis MO)

Rat Transmission and Adaptation

Prion agents used were the rocky mountain lab RML strain of mouse adapted scrapie

Stetsonville transmissible mink encephalopathy16

(TME) Hyper (Hy) strain of Hamster TME 17

1st passage Skunk adapted TME prepared as described and C from genetically defined

transmissions18

116

Brains from animals clinically affected with prion disease were aseptically removed and

prepared as 10 wv homogenates in sterile water 50 microL of 10 brain homogenate was

inoculated into weanling wistar rats intracranially After one year of incubation preclinical rats

from RML infections were euthanized by CO2 inhalation and the brain excised homogenized

and re-inoculated into naive animals Subsequent serial passages were from rats clinically

affected with rat adapted scrapie

Brains from rat passages were aseptically removed and bisected sagittally Brain halves

were reserved for immunohistochemistry (IHC) in formalin or frozen for immunoblotting RNA

isolation or subsequent passage For IHC tissues were first fixed in 10 buffered formalin

followed by antigen retrieval at 121 oC 21 bar in 10mM citrate buffer for 2 min After cooling

to room temperature sections were treated with 88 formic acid for 30 min and 4M guanidine

thiocyanate for 2 h Endogenous peroxidases were inactivated by 003 hydrogen peroxide and

tissue was blocked with 1 normal mouse serum Mouse anti-PrP mAB SAF83 (Cayman

Chemical) was biotinylated and applied at a 1250 dilution in blocking buffer overnight at 4 oC

Washes were performed in 10mM PBS with 005 tween-20 Streptavidin-peroxidase

(Invitrogen) was added for Diaminobenzidine (DAB) color reaction Anti-GFAP

immunohistochemistry was performed as above except that formic acid and guanidine treatment

steps were excluded Mouse anti-GFAP mab G-A-5 (Sigma) was used at a 1400 dilution

Brain samples for immunoblot were treated with or without Proteinase K (Roche) at a

ratio of 35 microg PK 100 ug protein at 50 microg mL for 30 minutes at 37 oC Phosphotungstenic acid

enrichments were performed as described14 19

Bis-Tris SDS-PAGE was performed on 12

polyacrylamide gels (Invitrogen) and transferred to PVDF Immunoblotting was performed using

117

mABs SAF83 (Cayman Chemical) 6H4 (Prionics) or 3F10 (a kind gift from Yong-Sun Kim) all

at a 120000 dilution

Gene Expression Profiling

RNA was extracted from frozen brain halves from clinically affected and control animals with

the QIAshredder and RNeasy mini kit (Qiagen Valencia CA) in accordance with the

manufacturerrsquos recommendations Due to the relatively large mass of rat brain an initial

homogenization was performed with a needle and syringe in 5mL of buffer RLT before further

diluting an aliquot in accordance with the manufacturers protocol Total RNA was amplified and

labeled in preparation for chemical fragmentation and hybridization with the MessageAmp

Premier RNA amplification and labeling kit (Life Technologies Grand Island NY) Amplified

and labeled cRNAs were hybridized on Affymetrix (Santa Clara CA) rat genome 230 20 high

density oligonucleotide arrays in accordance with the manufacturers recommendations

Differentially Expressed Genes were identified using Arraystar 50 (DNAStar Madison WI)

Robust multi-array normalization using the quantile approach was used to normalize all

microarray data A moderated T-test with a multiple comparison adjustment20

was used to reduce

the false discovery rate yet preserve a meaningful number of genes for pathway analysis

Pathway analysis was performed using the DAVID Bioinformatics database21

Comparative

analysis of genes induced by prions in mouse22

and rat disease was performed on genes

exhibiting at least a 2-fold change in expression Orthologs of rat and mouse genes were

identified using ENSEMBLE biomart release 6823

CSF Proteomic Profiling

118

CSF was collected from rats anaesthetized with isoflorane via puncture of the cisterna

magna A volume of 100-200 microL was routinely obtained CSF was touch spun for 30s at 2000xg

on a benchtop nano centrifuge to identify any blood contamination by the presence of a red

pellet CSF with blood contamination was not used for proteomic analysis CSF was prepared

for profiling by first depleting abundant proteins with an antibody based immunopartitioning

column IgY-R7 (Sigma-Aldrich Saint Louis MO) The manufacturers recommendations were

followed with a few deviations 40 microg of protein (~100 microL CSF) was bound to 100 microL of IgY

bead slurry in total volume of 600 microL formulated in 150mM ammonium bicarbonate The flow

through was collected and denatured by the addition of 10 microL of 8 M guanidine thiocyanate and

lyophilized in a speed-vac apparatus The sample was then reconstituted in 10 microL of water and 1

microL of 050 M DTT were added and allowed to incubate at 37oC for 30 minutes After incubation

27 microL of 055 M IAA was added for carboxymethylation and the sample was allowed to

incubate in the dark for 15 minutes After that 1 microL of 050 M DTT was added and allowed to

sit at room temperature for 10 minutes To perform trypsin digestion 70 microL of 50 mM

NH4HCO3 was then added along with 025 microg trypsin Digestion was performed overnight at

37oC and quenched by addition of 5 microL of 10 formic acid The tryptic peptides were then

subjected to solid phase extraction using an Agilent OMIX Tip C18 100 microL (Santa Clara CA)

Peptides were eluted with 50 ACN in 01 formic acid concentrated and reconstituted in 30

microL H2O with 01 formic acid

RP nanoLC separation

The setup for nanoLC consisted of a nanoAcquity (Milford MA) with a 10 microL injection

loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B consisted of

ACN Injections consisted of 5 microL of prepared CSF tryptic peptides onto a Waters 5 microm

119

Symmetry C18 180 microm x 20 mm trap cartridge (Milford MA) at a flow rate of 75 microLmin for 5

minutes at 95 A 5 B Separation was performed on a Waters 17 microm BEH130 C18 75 microm x

100 mm (Milford MA) with a gradient of 5 to 15 B over 10 minutes followed by 15 to

40 B over 80 minutes at room temperature

Mass spectrometry data acquisitions

An ion trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with a Bruker CaptiveSpray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Waters Acquity console software to perform MS acquisitions for all experiments Smart

parameter setting (SPS) was set to 700 mz compound stability and trap drive level was set at

100 Optimization of the CaptiveSpray source resulted in dry gas temperature of 160oC dry

gas flow of 30 Lmin capillary voltage of -1325 V end plate offset of -500 V MSMS

fragmentation amplitude of 10V and SmartFragmentation set at 30-300

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms one spectral average rolling average 2 acquisition

range of 350-1500 mz and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS

generation the ICC target was set to 300000 maximum accumulation time 5000 ms two

spectral averages acquisition range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

120

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required for spectral counting to prevent

loss of spectra Identification of peptides were performed using Mascot24

(Version 24 Matrix

Science London UK) Database searching was performed against a forward and reversed

concatenated SwissProt Rattus rattus Mascot search parameters were as follows Allowed

missed cleavages 2 enzyme trypsin fixed modification carboxymethylation (C) variable

modifications oxidation (M) peptide tolerance plusmn12 Da maximum number of 13

C 1 MSMS

tolerance plusmn05 Da instrument type ESI-Trap Data was then transformed into Dat file formats

and transferred to ProteoIQ (NuSep Bogart GA) False positive analyses were calculated using

ProteoIQ and set at 1

Results

Development of Rat Adapted Scrapie

To develop a rat adapted strain of prion disease we introduced 6 different prion agents RML

TME Hy Skunk Adapted TME and Chronic Wasting Disease (CWD) from wild type 96G and

96S deer16-18

into 6 rats (Fig 1) Of these primary transmissions only RML induced the

accumulation of Proteinase K resistant PrP after one year of incubation as determined by western

blotting on 10 brain homogenates and PrPSc

enriched phoshotungstenic acid precipitated brain

homogenates Second passage of rat adapted RML scrapie resulted in clinically affected rats at

565 days post inoculation Serial passages of TME or CWD agents were not performed Clinical

symptoms consisted of ataxia lethargy wasting kyphosis and myoclonus Prion affected rats

121

also showed low level porphyrin staining around their head Subsequent serial passage decreased

incubation time to 215 days

Proteinase K resistant prion protein was observed from all clinically affected animals both by

immunoblot (Fig 2) and by immunohistochemistry (Fig 3) Di- and mono-glycosylated bands

were the most abundant isoforms of PrPSc

PrPSc

was extensively deposited in the cerebral cortex

hippocampus thalamus inferior colliculus and granular layer of the cerebellum GFAP

expressing activated astrocytes were found throughout the brain particularly in the white matter

of the hippocampus thalamus and cerebellum Spongiform change was a dominant feature of

clinical rat

Gene expression Profiling

In total 1048 genes were differentially regulated within a 95 confidence interval

(Supplementary Table 1) and 305 genes were differentially expressed by greater than 2-fold (Fig

4) The 1048 genes that were statistically significant were used for pathway analysis using

DAVID Pathway analysis suggested that the gene expression profile was consistent with

immune activation and maturation as well as inflammation (Supplementary Table 2) a likely

interpretation given the observable astrocytosis (Fig 3) and gliosis associated with prion disease

Other pathways highlighted by the analysis included increases in transcription of genes involved

in lysosomes and endosomes

To further probe the gene expression data we compared genes which were differentially

expressed in rat adapted scrapie against their orthologs expression in mouse scrapie and vice

versa (Figs 5 and 6) We did not observe a high degree of correlation between expression fold

changes For example GFAP a gene whose up-regulation in prion disease is well known was

122

increased 25 fold in rat adapted scrapie but up-regulated 93 fold in mouse scrapie A

qualitative analysis of expression of orthologs in prion disease suggests that many genes

deregulated at least 2-fold in mouse or rat do not have orthologs that are differentially expressed

For example the gene RNAseT2 was up-regulated greater than threefold in rat adapted scrapie

but was not significantly up-regulated in mouse Similarly three genes important in metals

homeostatsis metallothionein 1a and 2a and ceruloplasmin were up-regulated in rat 46 43 and

3 fold respectively but were not differentially expressed in mouse prion disease

CSF Proteomics

Each immunodepleted biological replicate (N=5 for each control and RAS) had technical

triplicates performed The triplicates were summed together using ProteoIQ Peptide spectral

counts were then summed for each protein and subjected to dNSAF analysis using ProteoIQ

internal algorithms Details for this method can be found elsewhere25 26

but briefly peptide

spectral counts are summed per protein (SpC) based on unique peptides and a weighted

distribution of any shared peptides with homologous proteins T-tests were used to identify

significant changes in protein expression 1032 unique peptides which identify 512 proteins and

167 protein groups were found Of these 512 proteins 437 were identified in both RAS and

control CSF samples (Fig 7) The complete list of all 512 proteins can be viewed in

Supplemental Table 3 Using the ProteoIQ the most likely isoform was selected such as 14-3-3

protein gamma

From Table 1 we observe five proteins that agree with the genomic data for up

regulations in RAS granulin macrophage colony-stimulating factor 1 receptor cathepsin D

complement factor H and ribonuclease T2 In addition serine protease inhibitor A3N was not

123

detected as up regulated in the RAS genomic data but was found to be up-regulated in previous

genomic profiling of the mouse prion model22

One interesting trend from the data in Table 1 is

that the majority of proteins found to be up-regulated in the RAS model were not detected in the

control samples The absence of the detection of those proteins such as ribonuclease T2 in the

control CSF does not necessarily suggest the absence of the protein nonetheless it is below the

detection limits for this current proteomics protocol and instrumentation

Discussion

Mice have been the preferred laboratory rodent for prion diseases research because they

can be inexpensively housed and are amenable to transgenesis which allows for short incubation

periods as well as hypothesis testing upon targeted pathways Pursuant to the determination of

the mouse genome and the development of high density transcriptional arrays for measurements

of gene expression profiling mice have been used extensively to examine the molecular

pathology of prion disease probing the impact of disease and animal strain In order to expand

upon this foundation we adapted mouse prions to infect rats We reasoned that by taking a

comparative approach to the molecular pathology of prion disease inferences could be obtained

into the variability of the molecular response to prion diseases and that understanding this

variability might suggest whether human prion disease responses are more or less similar to

mouse responses A second rationale is the desire to identify surrogate markers of prion disease

While this approach has been taken before using gene expression profiling a more direct

approach has been to use proteomic approaches on cerebral spinal fluid (CSF) identifying

proteins that are increase in abundance with disease A rat prion disease is valuable for this

because the rat proteome is established and rats allow for the collection of relatively large

volumes of CSF (~100 microL per animal) which allows for robust analytical separations enhancing

124

detection of biomarkers Finally rats unlike humans can be used in a time course study of prion

disease This allows for the identification of early transcriptional and proteomic responses to

prion infection responses which are particularly valuable for the identification of surrogate

disease biomarkers

To initiate the development of a rat prion disease we attempted to adapt six different

prion disease agents PrPres

molecules to rat via intracranial inoculation of weanling animals

(Figure 1) Of these six agents only mouse RML prions were able to surmount the species

barrier to prion disease transmission Mouse PrP differs from rat by eight amino acid changes

six in the mature protein and four in the N-terminal octa-peptide repeat region (Supplementary

Fig 1) As rat shares the most prion protein sequence homology with mouse it is perhaps not

surprising that it transmitted whereas the other did not confirming that the primary prion protein

sequence is the most important determinant for interspecies transmission We conclude that there

is a large molecular species barrier preventing conversion of rat PrPc into PrP

res

The transmission of mouse RML into rats was characterized by a shortening of the

incubation period following each passage This is indicative of agent adaption to the new host

and increases in the titer present in end-stage brain Overall our adaptation of mouse prion

disease into rats resulted in a similar agent to that observed by Kimberlin27

The differences in

incubation period at second passage are largely due to our collecting the animals at 365 days post

inoculation upon first passage whereas Kimberlin and Walker allowed the first passage animals

to reach end-stage clinical rats

Rat adapted scrapie is in many ways similar to other prion diseases The clinical period of

disease is marked by a progressive neurological dysfunction with prominent ataxia kyphosis and

125

wasting The protease resistant PrP observed is typical as is the widespread deposition of PrPSc

in

the brain Spongiosis and reactive astrogliosis are as expected of a prion disease

Gene expression profiles from rats clinically affected with prion disease revealed a strong

neuronal inflammation associated with proliferated and activated astrocytes This is perhaps best

observed through the up-regulation of GFAP GFAP positive astrocytes were omnipresent

throughout the brain and GFAP mRNA was up-regulated 25 fold The up-regulation of GFAP is

a hallmark of the molecular response to prion infection and has been routinely observed Our

comparative analysis of gene expression changes in mouse RML versus rat adapted scrapie

suggest substantial differences in gene expression in response to prion disease despite the fact

that the overall response is neuro-inflammatory This suggests that the potential overlap between

mouse expression profiles and a putative human CJD expression profile could be quite different

at the level of individual transcripts that might be expected to be changed

CSF Proteomics

CSF proteomics can be exceedingly challenging due to the small sample available large

dynamic range in proteins and abundant proteins limiting sample loading onto nanoscale

columns Dynamic range reduction in the CSF sample was achieved using a custom amount of

IgY-7 antibodies to abundant proteins such as albumin After immunodepletion the total

protein content was reduced by ~90 limiting the proteomics analysis to one dimensional

separation Label free quantitation spectral counting was performed because it requires less

protein and does not increase sample complexity The proteins identified from the affected and

control N=5 are shown in Supplemental Table 2 Consistent detection of proteins in CSF from

both control and infected rats was observed (Fig 7C) Only two proteins were identified in

126

controls that were not observed in RAS and only 10 proteins were only observed in RAS Some

of these proteins that were only identified in RAS are significantly changed (Supplemental Table

3) One concern in proteomics data is the variability from run to run and the possibility that

certain proteins are identified from different unique peptides Figure 7A shows that the vast

majority of the unique peptides detected (783) were identified with a 1 FDR in both RAS and

control CSF samples highlighting the analytical reproducibility of our methodology

Proteomic analysis of the infected rat CSF provides a reasonable approach to cross

validate biomarkers detected by gene expression profiling For example RNAseT2 a secreted

ribonuclease was up-regulated 3-fold at the mRNA level and was also enriched in CSF from

infected rats (Table 1) Similarly coordinated increases in detection of colony stimulating factor

1 receptor complement factor H granulin and cathepsin D were also observed Conversely

proteomic analysis of CSF also allows for the observation of post-transcriptional responses to

prion disease that might be absent in gene expression profiling The 14-3-3 proteins and neuron

specific enolase both known markers for CJD are only detected by proteomic analysis Thus

gene expression profiling and proteomic detection serve to increase confidence in the

observation of up-regulation enhancing the likelihood that proteins detected by both

methodologies are specific and perhaps may be more sensitive at earlier time points

Comparison to human CSF prion disease proteome

In our comparative analysis 21 proteins were found to be up-regulated while 7 proteins

down-regulated in prion infected rats Included in the up-regulated proteins were the 14-3-3

proteins epsilon and gamma 14-3-3 zetadelta was also up-regulated but not statistically

significant Enolase 2 or neuron specific enolase (NSE) was also up-regulated in CSF of infected

127

rats These proteins are all in agreement with results from previous proteomic profiling of human

CSF from patients with CJD8 9

The detection of 14-3-3 protein is included in the diagnostic

criteria approved by World Health Organization for the pre-mortem diagnosis of clinically

suspected cases of sCJD28

although its application in large-scale screening of CJD is still

debated due to high false positive rate where elevated 14-3-3 protein levels were also reported in

other conditions associated with acute neuronal damage29 30

It was suggested that other brain-

derived proteins that are also specific to CJD can be used in conjunction with 14-3-3 protein to

increase diagnosis accuracy and specificity31

NSE is present in high concentration in neurons

and in central and peripheral neuroendocrine cells NSE serves as another valuable biomarker in

diagnosis of CJD as the elevated level of NSE in CSF was found in early and middle stages of

CJD 32

Other proteins detected in CSF included cystatin C and serpina3N although both of

these were not statistically changed These proteins were both previously identified as being

putative biomarkers for CJD33 34

Utility for biomarker discovery with emphasis on onset of biomarker appearance in CSF

The investigation of the protein changes in CSF from RAS compared to control rats

provides a solid foundation when investigating potential biomarkers with prion disease onset

The cross-validation of the genomic and proteomics data further emphasizes the targets for

consideration during disease onset Biomarker discovery provides the potential to determine if

animals such as cows in bovine spongiform encephalopathy (BSE) are affected instead of

having to be euthanized when BSE is expected In addition CSF collection in mice or hamsters

Prion models is extremely difficult and limited alternatively with the advent of the RAS model

CSF can be collected in sufficient amounts for CSF proteomics CSF collection for mice or

hamsters can be ~10 microL and could be contaminated with blood further complicating proteomic

128

analysis unlike rats which over 10 times more CSF can be collected per animal35

Due to the

amount of CSF available time course studies analyzing CSF proteomics is possible in RAS due

to animal numbers that are manageable and reasonable The RAS model further allows

investigators to bypass working with highly infections CJD CSF samples to investigate the CSF

proteome changes

Conclusion

In this study we have described the gene and protein expression changes in brain and

spinal fluid from a transmission of mouse prions into rats We find that while the overall gene

expression profile in rats is similar to that in mice the specific genes that make up that profile

are different suggesting that genes that change in response to prion disease in different species

may not be as conserved Analysis of CSF from rat adapted scrapie identified similar protein

changes as known in human CJD The rat will be a useful model to identify surrogate markers

that appear prior to the onset of clinical disease and thus may be of higher specificity and

sensitivity

Supplemental Information Available Upon Request

1 Chandler R L Encephalopathy in mice produced by inoculation with scrapie brain material Lancet 1961 1 (7191) 1378-9 2 Silva C J Onisko B C Dynin I Erickson M L Requena J R Carter J M Utility of Mass Spectrometry in the Diagnosis of Prion Diseases Anal Chem 2011 3 Wei X Herbst A Ma D Aiken J Li L A quantitative proteomic approach to prion disease biomarker research delving into the glycoproteome J Proteome Res 2011 10 (6) 2687-702 4 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome Res 2008 7 (1) 386-99 5 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 2003 878 (22) 2003-12 6 Cunningham R Ma D Li L Mass spectrometry-based proteomics and peptidomics for systems biology and biomarker discovery Frontiers in Biology 2012 7 (4) 313-335

129

7 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66 8 Harrington M G Merril C R Asher D M Gajdusek D C Abnormal proteins in the cerebrospinal fluid of patients with Creutzfeldt-Jakob disease N Engl J Med 1986 315 (5) 279-83 9 Piubelli C Fiorini M Zanusso G Milli A Fasoli E Monaco S Righetti P G Searching for markers of Creutzfeldt-Jakob disease in cerebrospinal fluid by two-dimensional mapping Proteomics 2006 6 Suppl 1 S256-61 10 Lilley K S Razzaq A Dupree P Two-dimensional gel electrophoresis recent advances in sample preparation detection and quantitation Curr Opin Chem Biol 2002 6 (1) 46-50 11 Oh-Ishi M Maeda T Separation techniques for high-molecular-mass proteins J Chromatogr B Analyt Technol Biomed Life Sci 2002 771 (1-2) 49-66 12 Metz T O Qian W J Jacobs J M Gritsenko M A Moore R J Polpitiya A D Monroe M E Camp D G 2nd Mueller P W Smith R D Application of proteomics in the discovery of candidate protein biomarkers in a diabetes autoantibody standardization program sample subset J Proteome Res 2008 7 (2) 698-707 13 Aebersold R Mann M Mass spectrometry-based proteomics Nature 2003 422 (6928) 198-207 14 Huang L Harvie G Feitelson J S Gramatikoff K Herold D A Allen D L Amunngama R Hagler R A Pisano M R Zhang W W Fang X Immunoaffinity separation of plasma proteins by IgY microbeads meeting the needs of proteomic sample preparation and analysis Proteomics 2005 5 (13) 3314-28 15 Rajic A Stehmann C Autelitano D J Vrkic A K Hosking C G Rice G E Ilag L L Protein depletion using IgY from chickens immunised with human protein cocktails Prep Biochem Biotechnol 2009 39 (3) 221-47 16 Marsh R F Bessen R A Lehmann S Hartsough G R Epidemiological and experimental studies on a new incident of transmissible mink encephalopathy J Gen Virol 1991 72 ( Pt 3) 589-94 17 Bessen R A Marsh R F Biochemical and physical properties of the prion protein from two strains of the transmissible mink encephalopathy agent J Virol 1992 66 (4) 2096-101 18 Johnson C J Herbst A Duque-Velasquez C Vanderloo J P Bochsler P Chappell R McKenzie D Prion protein polymorphisms affect chronic wasting disease progression PLoS One 2011 6 (3) e17450 19 Safar J Wille H Itri V Groth D Serban H Torchia M Cohen F E Prusiner S B Eight prion strains have PrP(Sc) molecules with different conformations Nat Med 1998 4 (10) 1157-65 20 Benjamini Y Hochberg Y Controlling the False Discovery Rate A Practical and Powerful Approach to Multiple Testing Journal of the Royal Statistical Society Series B (Methodological) 1995 57 (1) 289-300 21 Huang da W Sherman B T Lempicki R A Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources Nat Protoc 2009 4 (1) 44-57 22 Moody L R Herbst A J Aiken J M Upregulation of interferon-gamma-induced genes during prion infection J Toxicol Environ Health A 2009 74 (2-4) 146-53 23 Flicek P Amode M R Barrell D Beal K Brent S Carvalho-Silva D Clapham P Coates G Fairley S Fitzgerald S Gil L Gordon L Hendrix M Hourlier T Johnson N Kahari A K Keefe D Keenan S Kinsella R Komorowska M Koscielny G Kulesha E Larsson P Longden I McLaren W Muffato M Overduin B Pignatelli M Pritchard B Riat H S Ritchie G R Ruffier M Schuster M Sobral D Tang Y A Taylor K Trevanion S Vandrovcova J White S Wilson M Wilder S P Aken B L Birney E Cunningham F Dunham I Durbin R Fernandez-Suarez X M Harrow J Herrero J

130

Hubbard T J Parker A Proctor G Spudich G Vogel J Yates A Zadissa A Searle S M Ensembl 2012 Nucleic Acids Res 2012 40 (Database issue) D84-90 24 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein identification by searching sequence databases using mass spectrometry data Electrophoresis 1999 20 (18) 3551-67 25 Zybailov B Mosley A L Sardiu M E Coleman M K Florens L Washburn M P Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae J Proteome Res 2006 5 (9) 2339-47 26 Zhang Y Wen Z Washburn M P Florens L Refinements to label free proteome quantitation how to deal with peptides shared by multiple proteins Anal Chem 2010 82 (6) 2272-81 27 Kimberlin R H Cole S Walker C A Temporary and permanent modifications to a single strain of mouse scrapie on transmission to rats and hamsters J Gen Virol 1987 68 ( Pt 7) 1875-81 28 World Health Organization Global surveillance diagnosis and therapy of human transmissible spongiform encephalopathies report from a WHO consultation 1998 World Health Organization Geneva Switzerland 9-11 February 1998 1-26 29 Bartosik-Psujek H Archelos J J Tau protein and 14-3-3 are elevated in the cerebrospinal fluid of patients with multiple sclerosis and correlate with intrathecal synthesis of IgG J Neurol 2004 251 (4) 414-20 30 Saiz A Graus F Dalmau J Pifarre A Marin C Tolosa E Detection of 14-3-3 brain protein in the cerebrospinal fluid of patients with paraneoplastic neurological disorders Ann Neurol 1999 46 (5) 774-7 31 Sanchez-Juan P Green A Ladogana A Cuadrado-Corrales N Saanchez-Valle R Mitrovaa E Stoeck K Sklaviadis T Kulczycki J Hess K Bodemer M Slivarichova D Saiz A Calero M Ingrosso L Knight R Janssens A C van Duijn C M Zerr I CSF tests in the differential diagnosis of Creutzfeldt-Jakob disease Neurology 2006 67 (4) 637-43 32 Zerr I Bodemer M Racker S Grosche S Poser S Kretzschmar H A Weber T Cerebrospinal fluid concentration of neuron-specific enolase in diagnosis of Creutzfeldt-Jakob disease Lancet 1995 345 (8965) 1609-10 33 Sanchez J C Guillaume E Lescuyer P Allard L Carrette O Scherl A Burgess J Corthals G L Burkhard P R Hochstrasser D F Cystatin C as a potential cerebrospinal fluid marker for the diagnosis of Creutzfeldt-Jakob disease Proteomics 2004 4 (8) 2229-33 34 Miele G Seeger H Marino D Eberhard R Heikenwalder M Stoeck K Basagni M Knight R Green A Chianini F Wuthrich R P Hock C Zerr I Aguzzi A Urinary alpha1-antichymotrypsin a biomarker of prion infection PLoS One 2008 3 (12) e3870 35 DeMattos R B Bales K R Parsadanian M ODell M A Foss E M Paul S M Holtzman D M Plaque-associated disruption of CSF and plasma amyloid-beta (Abeta) equilibrium in a mouse model of Alzheimers disease J Neurochem 2002 81 (2) 229-36

131

Figure 1 Interspecies transmission of prion disease to rats End-stage clinical brain homogenates

were used to passage prion disease After one year of incubation animals were euthanized to

determine the extent of PrPres

accumulation Protease resistance PrP was only observed in those

animals infected with RML scrapie prions This material was serially passaged for two more

incubations before becoming rat-adapted as indicated by the shortening of the incubation period

132

Table 1 A list of selected proteins and mRNAs up-regulated in RAS CSF and brain tissue If

the protein is detected in the RAS CSF but not the control CSF the CSF expression is reported

with a infin If there is no change or data on certain genes related to an up regulated protein nd is

noted The mouse genomic data presented here was previously published22

Gene Protein Symbol Accession CSF

Expression

Rat

GEX

Mouse

GEX

14-3-3 protein zetadelta Ywhaz NP_037143 infin nd nd

14-3-3 protein epsilon Ywhae NP_113791 infin nd nd

14-3-3 protein gamma Ywhag NP_062249 infin nd nd

serine protease inhibitor A3N Serpin A3N NP_113719 54 nd 975

enolase 2 (gamma neuronal) Eno2 NP_647541 infin nd nd

granulin GRN NP_058809 62 364 184

macrophage colony-stimulating

factor 1 receptor

Csf1r NP_001025072 infin 293 205

cathepsin D CTSD NP_599161 infin 255 299

complement factor H Cfh NP_569093 376 234 nd

ribonuclease T2 RNAset2 NP_001099680 infin 302 nd

133

Figure 2 Accumulation of PrPSc

in rat adapted scrapie First second and third passage brain

homogenates were assayed for the presence of protease resistant PrP by immunoblot PrPSc

was

observed following phosphotungstenic acid enrichment at first passage By clinical disease in 2nd

and 3rd

passage rats PrPSc

had substantially accumulated

134

Figure 3 Histological appearance of rat adapted scrapie in the hippocampus at clinical disease

Infected animals showed intense immuno-staining for deposits of PrPSc

and GFAP expressing

astrocytes Spongiform change is an abundant feature in rat adapted scrapie

135

Figure 4 Scatter plot of gene expression profiling of rat adapted scrapie The expression of

individual genes from uninfected and infected animals were plotted to display up and down

regulation The dashed green line is no change Solid green lines are 2-fold changes in gene

expression

136

Figure 5 Quantitative relationship between orthologous pairs of two-fold up-regulated genes in

mouse and rat scrapie Genes up-regulated more than two fold were mapped to their orthologs

and the fold change was plotted Expression is log2 transformed

137

Figure 6 Comparison of genes upregulated in rat and mouse prion diseases Genes up-regulated

two fold in rodent scrapie were identified and the expression of their orthologs was determined

138

Figure 7 Venn diagram comparison of the CSF proteomics data between rat adapted scrapie

(RAS) model and control rats (A) Unique peptides from tryptic peptides produced for the

proteins identified (B) The total proteins identified including all isoforms within the protein

groups (C) The protein groups comparing only the top protein hit of the protein isoforms

showing very consistent protein identifications between RAS and control

139

Chapter 5

Investigation of the differences in the phosphoproteome between starved vs

glucose fed Saccharomyces cerevisiae

Adapted from ldquoInvestigation of the differences in the phosphoproteome between starved vs

glucose fed Saccharomyces cerevisiaerdquo Cunningham R Grunwald D Wellner D Conway M

Heideman W Li L In preparation

140

Abstract

This work explores comparative proteomics between starved and glucose fed

Saccharomyces cerevisiae (bakers yeast) Yeast depends on nutrients such as glucose to

survive and need to cycle between states of growth and quiescence Kinases such as protein

kinase A (PKA) and target of rapamycin kinase (TOR) are involved in the glucose response

Thus performing a large scale phosphoproteomic MS study can be highly beneficial to map the

signaling networks and cascades involved in glucose-dependent stimulated yeast growth Yeast

cell extract was digested and phosphopeptides were enriched by immobilized metal affinity

chromatography (IMAC) followed by two dimensional high pH-low pH reversed phase (RP)-RP

separation The low pH separation was infused directly into an ion trap mass spectrometer with

neutral loss electron-transfer dissociation (ETD)-triggered fragmentation to improve

phosphopeptide identifications In total 477 unique phosphopeptides are identified with 06

false discovery rate (FDR) and 669 unique phosphopeptides identified with 54 FDR This

study includes comparative phosphoproteins for one specific motif of PKA xxxKRKRxSxxxx

which is presented and differences between starved vs glucose fed are highlighted Phosphosite

validation is performed using a localization algorithm Ascore to provide more confident and

site-specific characterization of phosphopeptides

141

Introduction

Microorganisms such as Saccharomyces cerevisiae (yeast) cycle between growth when

nutrients are available and quiescence when nutrients are scarce When glucose is limited yeast

go into growth arrest state but when glucose is added growth quickly resumes Kinases such as

protein kinase A (PKA) and target of rapamycin (TOR) regulate growth in response to nutrient

conditions and have been well studied through transcriptional control1-4

Yeast execute large

transcriptome alterations in response to changing environmental growth conditions5 6

Gene

regulation by glucose introduction in yeast has been studied including genes used for growth on

alternative carbon sources and activation of genes coding for glucose transport and protein

synthesis7-10

Response to nutrients for survival is not limited to yeast biology and indeed all

living creatures both prokaryotes and multicellular eukaryotes like mammals require nutrient

responsiveness and coordinating cellular functions to survive

With regulation of certain genes well studied by glucose introduction the mechanism and

global protein modulation caused by glucose introduction remain unknown6 Large-scale

phosphorylation analysis has been previously performed in yeast using mass spectrometry11-14

Mapping the phosphoproteome in yeast transitioning from quiescence to growth conditions to

better understand the roles of phosphorylation in orchestrating growth is needed The

phosphorylation state during growth or starvation could be used to engineer cells to drive ectopic

activity (or inhibition) to promote growth and ethanol production on non-native sugars like

xylose

It has been reported that the phosphorylation state can be affected by the introduction of

glucose to carbon-starved yeast15

and phosphorylation plays a significant role in the cell cycle

and signal transduction16

Specifically O-Phosphorylation can function as a molecular switch by

142

changing the structure of a protein via alteration of the chemical nature of an amino acid for

serine threonine or tyrosine It has been reported that up to 30 of the proteome can undergo

phophorylation17

Mass spectrometry has evolved as a powerful tool to accomplish phosphosite

mapping using shotgun proteomics With available technology tens of thousands of

phosphorylation sites can be mapped to amino acids on thousands of proteins using shotgun

proteomics18-20

Mass spectrometry can offer sensitive automated non-targeted global analysis of

phosphorylation events in proteomic samples but in any large scale phosphoproteomic

investigation enrichment of phosphoproteinspeptides is required First phosphorylation is

usually a sub-stoichiometric process where only a percentage of all protein copies are

phosphorylated21

Various enrichment methods have been used for phosphopeptide enrichment

including metal oxide affinity chromatography (MOAC)22

such as TiO223

immobilized metal

affinity chromatography (IMAC)12 24 25

electrostatic repulsion-hydrophilic interaction

chromatography (ERLIC)26

and immunoaffinity of tyrosine phosphorylation27 28

After

enrichment MS analyses of phosphorylated peptides are still challenging due to ion suppression

from non-phosphorylated peptides

Even after phosphopeptide enrichment further sample preparation is needed for large

scale proteomic experiments Additional fractionation can increase protein coverage of a

sample by over ten fold such as MudPIT29

(multidimensional protein identification technology)

In online MudPIT a sample is injected onto a strong cation exchange (SCX) column coupled to

a RP column Successive salt bumps followed by low pH gradients provide the separation of

peptides in two dimensions Since the phosphate groups on phosphopeptides have a low pKa

value due to being more acidic then their unmodified counterparts they tend to elute earlier and

143

disproportionally in ion exchange chromatography like SCX Alternatively reversed-phase

reversed-phase (RP-RP) separation has been proposed as an alternative to MudPIT for offline

two dimensional (2D) separation30

One of the caveats of 2D separation is the potential for

wasted mass spectrometry time from early and late fractions having very few peptides present

all while having too much sample for middle fractions One simple method to reduce these

ldquowasted fractionsrdquo is to combine early or late fractions with the middle fractions to reduce MS

runs with little peptide content to analyze thus shortening the overall analysis time31

In addition the labile phosphorylation group has a large propensity to undergo cleavage

during collision induced dissociation (CID) producing a neutral loss The neutral loss can

produce insufficient backbone fragment ions for MSMS identification32

A solution to neutral

loss fragmentation in phosphopeptides is MS3 using CID to produce improved backbone

fragmentation13 14 33

An alternative fragmentation method to CID for fast sampling ion traps is

electron transfer dissociation (ETD)34-36

ETD produces a more uniform back-bone cleavage

where the cation peptide receives an electron from a low affinity radical anion37

The transferred

electron induces a dissociation at the N-Cα bond to produce c- and zbull-type product ions while

retaining the labile post-translational modifications (PTM) such as phosphorylation bound to the

product ions38

The Bruker amaZon ETD ion trap mass spectrometer has the potential to trigger

ETD fragmentation after a labile PTM produces a neutral loss from CID fragmentation This

method is termed neutral loss-triggered ETD fragmentation and provides a complementary

fragmentation pathway to labile poor fragmenting phosphorylated peptides

In this work we provide a qualitative comparative list of yeast phosphopeptides observed

in glucose fed vs glucose starved conditions

144

Experimental

EXPERIMENTAL DETAILS

Chemicals

Acetonitrile (HPLC grade) ammonium hydroxide (ACS plus certified) urea (gt99)

sodium chloride (997) ammonium bicarbonate (certified) Optima LCMS grade acetonitrile

Optima LCMS grade water and EDTA disodium salt (99) were purchased from Fisher

Scientific (Fair Lawn NJ) Deionized water (182 MΩcm) was prepared with a Milli-Q

Millipore system (Billerica MA) DL-Dithiothreitol (DTT) and sequencing grade modified

trypsin were purchased from Promega (Madison WI) Formic acid (ge98) was obtained from

Fluka (Buchs Switzerland) Iodoacetamide (IAA) ammonium formate (ge99995) Trizma

hydrochloride (Tris HCl ge990) Triton X-100 (laboratory grade) and iron (III) chloride

hexahydrate (97) were purchased from Sigma-Aldrich (Saint Louis MO) Sodium dodecyl

sulfate (SDS) (998) was purchased from US Biological (Marblehead MA) Nickel

nitrilotriacetic acid (Ni-NTA) magnetic agarose beads were purchased from Qiagen (Valencia

CA) Autoclaved yeast peptone dextrose (YPD) media was prepared in 1 L of deionized water

using 10 g yeast extract 20 g Peptone from Becton Dickinson and Company (Sparks MD) and

20 g D-(+)-Glucose (ge995) purchased from Sigma-Aldrich (Saint Louis MO)

Modified Mary Miller Yeast Protein Isolation

The yeast culture was prepared and protein extraction was performed using a modified

Mary Miller protocol39

Briefly yeast strain s288c was inoculated with YPD media and shook

for 72 hours to allow the yeast to be glucose starved After the 72 hours the yeast culture was

partitioned into two flasks To one flask glucose was added at 2 of the final concentration and

allowed to incubate and shake for 10 minutes Then 100 optical density units (ODU) of the yeast

145

culture were collected from each flask Each yeast sample was spun down in a Beckman-Coulter

J6B centrifuge (Indianapolis IN) at 2500xg for three minutes The media was aspirated and the

tubes were flash frozen in liquid nitrogen and stored at -80oC The yeast pellets were thawed on

ice and resuspended in 1 mL of lysis buffer (50 mM Tris HCl 01 triton x-100 05 SDS

pH=80) and 1X protease and phosphatase inhibitor cocktail from Thermo Scientific (Rockford

IL) and transferred to an Eppendorf tube To the yeast 200 microL of glass beads were added and

amalgamated for 25 minutes at 4oC The sample was inverted and the Eppendorf tube was

pierced with a hot 23 gauge needle through the bottom and the tube was placed atop a 5 mL

culture tube and centrifuge in the Beckman-Coulter J6B at 2500xg for three minutes at 4oC to

collect the liquid containing the yeast cells while the glass beads remain trapped in the

Eppendorf tubes The protein lysate was then centrifuged at 3000xg for five minutes at 4oC and

the supernatant was collected and stored at -80oC

Preparation of tryptic digests

The Saccharomyces cerevisiae protein cell extract for fed or starved were analyzed with a

BCA protein assay kit (Thermo Scientific Rockford IL) and 2 mg of protein was added to four

parts -20oC acetone and allowed to precipitate the protein for 1 hour at -20

oC The samples were

then centrifuged at 14000xg for 5 minutes and the supernatant was removed and the protein

pellet was reconstituted in 360 microl of 8 M urea To each sample 20 microL of 050 M DTT was

added and allowed to incubate at 37oC for 45 minutes After incubation 54 microL of 055 M IAA

was added for carbamidomethyl of cysteine and the sample was allowed to incubate for 15

minutes in the dark To quench the IAA 20 microL of 050 M DTT was added and allowed to react

for 10 minutes To perform trypsin digestion 1400 microL of 50 mM NH4HCO3 was then added

along with 20 microg trypsin to each sample Digestion was performed overnight at 37oC and

146

quenched by addition of 25 microL of 10 formic acid For each sample the tryptic peptides were

then subjected to solid phase extraction using a Hypersep C18 100 mg solid phase extraction

(SPE) column (Thermo Scientific Bellefonte PA) Peptides were eluted with 50 ACN in

01 formic acid concentrated and reconstituted in 500 microL of 80 ACN and 01 formic acid

Phosphopeptide enrichment using immobilized metal affinity chromatography (IMAC)

One ml of Ni-NTA was washed three times with 800 microL of H2O and then the Ni was

removed with 800 microL of 100 mM EDTA in 50 mM ammonium bicarbonate and vortexed for 30

minutes The supernatant was removed and the NTA was washed with 800 microL of H2O three

times followed by addition of 800 microL 10 mM FeCl3 hexahydrate and vortexed for 30 minutes

The Fe-NTA was then washed three times with 800 microL of H2O and once with 80 ACN in 01

formic acid before being combined with the cell extract for phosphopeptide enrichment and

vortexed for 40 minutes The sample was washed three times with 200 microL of 80 ACN in 01

formic acid before elution with two aliquots of 200 microL of 5 ammonium hydroxide in 5050

ACNH2O with 15 minutes of vortexing The enriched phosphopeptides were then dried down

with a Thermo Scientific Savant SpeedVac (SVC100 Waltham MA) and reconstituted in 55 microL

25mM ammonium formate pH=75

First dimension neutral pH separation

Individually 50 microL of each yeast phosphopeptide enriched sample was injected onto a

Phenomenex Gemini-NX 3microm C18 110 Aring 150 x 2 mm column (Torrance CA) The Gemini

column was coupled to a Phenomenex SecurityGuard Gemini C18 2 mm guard cartridge

(Torrance CA) Mobile phase A consisted of 25 mM ammonium formate at pH=75 and mobile

phase B consisted of 25 mM ammonium formateACN with a 19 ratio respectively at pH=75

The column was operated at 50oC at a flow rate of 05 mLmin The flow profile was 2-30 B

147

over 65 minutes and then 5 min at 100 B A total of 22 fractions were collected every 3

minutes and the fractions were combined as follows 1 with 12 2 with 13 etc to 11 with 22

The combined fractions were dried down and reconstituted in 25 microL of 01 formic acid

RP nanoLC separation

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN Injections consisted of 5 microL from each fraction onto an Agilent Technologies

Zorbax 300 SB-C18 5 microm 5 x 03 mm trap cartridge (Santa Clara CA) at a flow rate of 5

microLmin for 5 minutes at 95 A 5 B Separation was performed on a Waters 3 microm Atlantis

dC18 75 microm x 150 mm (Milford MA) using a gradient from 5 to 45 mobile phase B at 250

nLmin over 90 minutes at room temperature Emitter tips were pulled from 75 microm inner

diameter 360 microm outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using

an in-house model P-2000 laser puller (Sutter Instrument Co Novato CA)

Mass spectrometry data acquisitions

An ion-trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with an on-line nanospray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Eksigent nanoLC software (Dublin CA Version 30 Beta Build 080715) for MS acquisition for

all experiments Smart parameter setting (SPS) was set to 700 mz compound stability and trap

drive level was set at 100 Optimization of the nanospray source resulted in dry gas

temperature of 125oC dry gas flow of 40 Lmin capillary voltage of -1300 V end plate offset of

-500 V MSMS fragmentation amplitude of 10V and SmartFragmentation set at 30-300

148

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions An additional mode of MSMS fragmentation electron transfer dissociation

(ETD) was triggered on the precursor ion when a neutral loss was observed in CID

fragmentation from phosphoric acid H3PO4 (Δmz of 49 and 327 for 2+ and 3+ charge states

respectively) or for metaphosphoric acid (HPO3) (Δmz of 40 and 267 for 2+ and 3+ charge

states respectively) For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms rolling average 2 acquisition range of 300-1500 mz

and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS generation the ICC target

was set to 300000 maximum accumulation time 5000 ms two spectral averages acquisition

range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required to prevent artificial data

reduction Identification of peptides were performed using Mascot40

(Version 23 Matrix

Science London UK) Database searching was performed against SwissProt Saccharomyces

cerevisiae database (version 575) Mascot search parameters were as follows Allowed missed

cleavages 2 enzyme trypsin fixed modification carboxymethylation (C) variable

modifications oxidation (M) and phosphorylation (STY) peptide tolerance plusmn12 Da maximum

number of 13

C 1 MSMS tolerance peptide decoy (Mascot) checked plusmn05 Da instrument type

149

ESI-Trap Results from the Mascot search was exported as DAT files for analysis by Scaffold 3

and Scaffold PTM

Scaffold and Ascore data processing

Scaffold 3 (Proteome Software Portland OR)was used for MSMS proteomics data

comparison and false discovery rate analysis The DAT files were loaded into Scaffold 3 and

the fractions for the two dimensional fractionation were combined The resulting biological

triplicates for starved and glucose fed yeast were filtered with a 1 false discovery rate (FDR)

on the protein level (Supplemental Table 1) Due to the inherent poor fragmentation of

phosphopeptides a lenient FDR level may still yield biologically relevant data and thus a 54

FDR was generated for phosphopeptides (Supplemental Table 2) For a more conservative list of

phosphopeptides with a 06 FDR was generated from Scaffold 3 (Supplemental Table 3) FDR

analysis is sufficient at preventing poor data from being reported but does not prevent false

phosphosite identification in phosphopeptides To provide confidence in site identification

Scaffold PTM was used to perform Ascore41

analysis Ascore uses an algorithm to score the

probability of the phosphosite from a phosphopeptide identified by a database searching

algorithm such as Mascot Ascore can be freely accessed at httpascoremedharvardedu

Cell collection RNA isolation and microarray data analysis

All experiments were performed in biological duplicates Cell samples (10 ODU) were

taken at starved and fed states spun at 5000xg for 2 minutes at 30 oC the supernatant was

removed and the pellets snap frozen using liquid nitrogen RNA was isolated using the Epicentre

MasterPure Yeast Purification Kit (Epicentre Technologies) and quality was checked using gel

electrophoresis cRNA synthesis was carried out using the GeneChipreg Expression 3

Amplification One-Cycle Target Labeling and Control Reagents kit (Affymetrix) All

150

experiments followed the manufactures instructions cRNA samples were hybridized to

GeneChip Yeast Genome 20 Arrays for 16 hours Arrays were washed stained and scanned

according the manufactures recommendations Affymetrix CEL files were RMA normalized

with R and the Bioconductor Suite Data analysis was performed within TIGR Multiexperiment

Viewer v451 in-house Perl scripting R and Bioconductor

Results

Sample preparation for shotgun proteomics

As discussed in the introduction the purpose of this study is to provide an exploratory list

of qualitative differences in the phosphoproteome between starved vs glucose fed yeast After

yeast cell lysate production a substantial amount of sample preparation is performed to enhance

the depth of the phosphoproteome coverage A workflow of the steps following cell lysis is

outlined in Figure 1A The yeast proteins are isolated via acetone precipitation followed by

digestion with trypsin to produce peptides Phosphopeptides are intermixed with the entire

tryptic peptide sample and therefore Fe-NTA IMAC is used to enrich the phosphopeptides To

improve upon the number of phosphopeptides we then performed two dimensional separation

with high pH RP followed by low pH RP C18 and infused the eluent directly onto an ion trap

mass spectrometer Figure 1B show an improved technique for the first dimension of separation

to combine the early eluting and late eluting fractions from the first phase of separation to reduce

overall MS analysis time The combination of fractions 1 and 12 2 and 13 etc essentially

improves the duty cycle of the mass spectrometer by ensuring sufficient peptide content is

injected onto a low pH nanoLC RP C18 column

ETD-triggered mass spectrometry

151

In the present study labile phosphorylation can lead to non-informative neutral loss

During MS scanning mode the instrument will choose the 6 most abundant peaks with correct

isotopic distribution and select them for MSMS CID fragmentation During CID fragmentation

it is common for labile PTMs such as phosphorylation to undergo neutral loss and thus limited

informative b and y-type ions are formed Alternatively ETD fragmentation can be used on

specific MSMS scans which produce a neutral loss indicative of phosphorylation (98Daz or

80Daz) The subsequent ETD fragmentation on these putative phosphopeptides will lead to

uniform backbone cleavage resulting in confident identification of phosphopeptides with site-

specific localization during MSMS It is important to note that CID fragmentation still produces

very informative fragmentation for phosphorylation but ETD provides an orthogonal

fragmentation pathway to further increase the phosphoproteome coverage Additionally the

duty cycle for ETD fragmentation is roughly half that of CID therefore about half as many

potential peptides would be fragmented and sequenced if the instrument was using ETD

fragmentation exclusively

Protein Data

Even with phosphopeptide enrichment it is inevitable that non-phosphopeptides will also

be identified All data were searched with Mascot and in total over 1000 proteins were identified

with a 1 FDR rate and are listed in Supplemental Table 1 The proteins listed in Supplemental

Table 1 include both phosphoproteins and non-phosphorylated proteins In addition to the

proteins identified in the fed and starved states the unique peptides and spectral counts are also

listed for reference The phosphopeptides are specifically listed with a 54 FDR rate in

Supplemental Table 2 and comparisons between starved and fed with Ascore analysis are listed

for every phosphopeptide identified A higher confidence of phosphopeptide identification is

152

sometimes required before investing in time consuming biological experiments so a list of

phosphopeptides identified in starved and fed was taken from a setting directing Scaffold to

produce a 05 FDR which resulted in the generated FDR of 06 FDR and listed in

Supplemental Table 3

A summary of specific phosphorylation sites are listed in Table 1 with a 06 FDR and

Table 2 with a 54 FDR Each table gives totals for unique phosphopeptides identified having

an Ascore localization score ge80 without Ascore and phosphorylation events on each unique

peptides As expected the majority of phosphorylation events (over 50) occurred on serine

whereas fewer phosphorylation events (~10) occurred on tyrosine In addition the vast

majority of phosphorylation events were single phosphorylation (ge65) with very few

identifications having more than two phosphosites per peptide For specific phosphopeptide

identification and subsequent phosphoprotein identification refer to Supplemental Table 2 and 3

Discussion

Transcriptional response to glucose feeding

Yeast responds to the repletion of glucose after glucose-depletion by broad

transcriptional changes to induce growth Roughly one-third of the yeast genome is altered by at

least two fold when glucose is introduced as shown in Figure 2 Figure 2 is a heat map from a

microarray analysis showing 1601 genes up regulated and 1457 genes down regulated after

addition of glucose compared to the starved state The arbitrary cut-offs for these values were as

follows two fold induction after nutrient repletion and a false discovery rate (q-value) of 001

Red color in the heat map indicates a particular gene is up-regulated in the fed state compared to

the starved state Alternatively genes coded in green are less expressed in the fed state

compared to the starved condition The intensity of the green or red colors is indicative of the

153

intensity of the fold change in gene expression These large transcriptional changes after glucose

repletion drive and complement the current phosphoproteomic study

PKA motif analysis

One benefit of a large scale phosphoproteomics experiment is to examine the different

phosphopeptides from known motifs for specific kinases PKA and TOR are responsible for the

majority of the transcriptional response and thus PKA is a good target for motif analysis Figure

3 shows the PKA motif sequence xxxRKRKxSxxxxxx with the phosphorylation occurring on

the serine Other than F16P (fructose-16-bisphosphatase) all the proteins listed are unique to the

starved or fed samples A motif sequence will inevitably show up by random chance in any

analysis For this study the control for motif analysis uses the swissprot protein list for the

entire yeast proteome for the background Compared to background this specific PKA kinase

from Figure 3 is up-regulated by 264 fold when compared to the background One interesting

protein emerged from this motif analysis in the fed sample but not the starved sample is

Ssd1which is implicated in the control of the cell cycle in G1 phase42

Ssd1 also is

phosphorylated by Cbk1 and Cbk1 has been proposed to inhibit Ssd143

and provides an

intriguing target for future studies on starved vs glucose fed yeast growth

Localization of the phosphorylation sites

When a phosphopeptide contains any number of serine threonine or tyrosine amino

acids the localization of the phosphosite can sometimes be ambiguous Database searches used

to identify peptides like Mascot do not provide any probability for localization of correct

phosphorylation sites Ascore is an algorithm that uses the same MSMS spectra as Mascot but

instead of comparing theoretical peptide MSMS to experimental spectra Ascore searches for

informative a b or y-type peptide fragments giving evidence for phosphosite The Scaffold

154

program adds a localization probability to the Ascore values and the values are listed in

Supplemental Table 2 and 3 Figure 4 has a representative Ascore mass spectra assigning the

peaks identified and providing evidence that the phosphorylation site occurs at the threonine

instead of the serine Incorporating Ascore into this study provides a layer of validation for

putative phosphosite identification

Plasma Membrane 2-ATPase

A previous study identified and localized phosphorylation sites on plasma membrane 1-

ATPase after glucose was introduced to starved yeast15

In the current study PMA2 (plasma

membrane ATPase 2) was identified in glucose fed and not starved samples The doubly

threonine phosphorylated peptide identified in fed yeast is PMA2 with amino acid sequence

IKMLTGDAVGIAKETCR (583-599) whereas the homologous peptide in PMA1 has the exact

same amino acid sequence except for the first isoleucine substituted for valine

VKMLTGDAVGIAKETCR (554-570 not observed in data) PMA2 was observed in the 06

FDR list of phosphopeptides and has an Ascore localization probably of 100 A plant study

showed that PMA2 phosphorylation level was higher in early growth phase than when in

stationary phase44

In addition PMA2 expression in yeast permits the growth of yeast and

threonine phosphorylation has been reported on Thr-95545

The identification of PMA2 in the

fed glucose cell extract provides an interesting target for future study on the molecular

mechanisms involved in regulation growth in starved vs glucose fed yeast

Conclusion

In conclusion this work provides a qualitative comparison in the phosphoproteome

between starved and glucose fed yeast A large scale IMAC enrichment of yeast cell lysate

followed by 2-D separation provided a rich source of phosphopeptides for neutral loss triggered

155

ETD analysis using mass spectrometry A specific motif for PKA is highlighted to show the

differences in proteins identified between starved vs fed conditions In total 477 unique

phosphopeptides are identified with 06 FDR and 669 unique phosphopeptides identified with

54 FDR Phosphosite validation is performed using a localization algorithm Ascore to

provide further confidence on the site-specific characterization of these phosphopeptides The

proteins Ssd1 and PMA2 emerged as potential intriguing targets for more in-depth studies on

protein phosphorylation involved in glucose response

Supplemental Tables 1 2 and 3 are available upon request

References

1 Martinez M J Roy S Archuletta A B Wentzell P D Anna-Arriola S S

Rodriguez A L Aragon A D Quinones G A Allen C Werner-Washburne M Genomic

analysis of stationary-phase and exit in Saccharomyces cerevisiae gene expression and

identification of novel essential genes Mol Biol Cell 2004 15 (12) 5295-305

2 Radonjic M Andrau J C Lijnzaad P Kemmeren P Kockelkorn T T van Leenen

D van Berkum N L Holstege F C Genome-wide analyses reveal RNA polymerase II

located upstream of genes poised for rapid response upon S cerevisiae stationary phase exit Mol

Cell 2005 18 (2) 171-83

3 Slattery M G Heideman W Coordinated regulation of growth genes in

Saccharomyces cerevisiae Cell Cycle 2007 6 (10) 1210-9

4 Wang Y Pierce M Schneper L GAtildefrac14ldal C G k e Zhang X Tavazoie S

Broach J R Ras and Gpa2 Mediate One Branch of a Redundant Glucose Signaling Pathway in

Yeast PLoS Biol 2004 2 (5) e128

5 Newcomb L L Hall D D Heideman W AZF1 is a glucose-dependent positive

regulator of CLN3 transcription in Saccharomyces cerevisiae Mol Cell Biol 2002 22 (5) 1607-

14

6 Newcomb L L Diderich J A Slattery M G Heideman W Glucose regulation of

Saccharomyces cerevisiae cell cycle genes Eukaryot Cell 2003 2 (1) 143-9

7 Carlson M Glucose repression in yeast Curr Opin Microbiol 1999 2 (2) 202-7

8 Gancedo J M Yeast carbon catabolite repression Microbiol Mol Biol Rev 1998 62

(2) 334-61

9 Johnston M Feasting fasting and fermenting Glucose sensing in yeast and other cells

Trends Genet 1999 15 (1) 29-33

156

10 Warner J R The economics of ribosome biosynthesis in yeast Trends Biochem Sci

1999 24 (11) 437-40

11 Li X Gerber S A Rudner A D Beausoleil S A Haas W Villen J Elias J E

Gygi S P Large-scale phosphorylation analysis of alpha-factor-arrested Saccharomyces

cerevisiae J Proteome Res 2007 6 (3) 1190-7

12 Ficarro S B McCleland M L Stukenberg P T Burke D J Ross M M

Shabanowitz J Hunt D F White F M Phosphoproteome analysis by mass spectrometry and

its application to Saccharomyces cerevisiae Nat Biotechnol 2002 20 (3) 301-5

13 Gruhler A Olsen J V Mohammed S Mortensen P Faergeman N J Mann M

Jensen O N Quantitative phosphoproteomics applied to the yeast pheromone signaling

pathway Mol Cell Proteomics 2005 4 (3) 310-27

14 Peng J Schwartz D Elias J E Thoreen C C Cheng D Marsischky G Roelofs

J Finley D Gygi S P A proteomics approach to understanding protein ubiquitination Nat

Biotechnol 2003 21 (8) 921-6

15 Lecchi S Nelson C J Allen K E Swaney D L Thompson K L Coon J J

Sussman M R Slayman C W Tandem phosphorylation of Ser-911 and Thr-912 at the C

terminus of yeast plasma membrane H+-ATPase leads to glucose-dependent activation J Biol

Chem 2007 282 (49) 35471-81

16 Cohen P The regulation of protein function by multisite phosphorylation--a 25 year

update Trends Biochem Sci 2000 25 (12) 596-601

17 Kalume D E Molina H Pandey A Tackling the phosphoproteome tools and

strategies Curr Opin Chem Biol 2003 7 (1) 64-9

18 Nagaraj N DSouza R C Cox J Olsen J V Mann M Feasibility of large-scale

phosphoproteomics with higher energy collisional dissociation fragmentation J Proteome Res

2010 9 (12) 6786-94

19 Olsen J V Vermeulen M Santamaria A Kumar C Miller M L Jensen L J

Gnad F Cox J Jensen T S Nigg E A Brunak S Mann M Quantitative

phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis Sci

Signal 2010 3 (104) ra3

20 Breitkopf S B Asara J M Determining In Vivo Phosphorylation Sites Using Mass

Spectrometry In Current Protocols in Molecular Biology John Wiley amp Sons Inc 2012

21 Steen H Jebanathirajah J A Rush J Morrice N Kirschner M W Phosphorylation

analysis by mass spectrometry myths facts and the consequences for qualitative and

quantitative measurements Mol Cell Proteomics 2006 5 (1) 172-81

22 Kweon H K Hakansson K Metal oxide-based enrichment combined with gas-phase

ion-electron reactions for improved mass spectrometric characterization of protein

phosphorylation J Proteome Res 2008 7 (2) 749-55

23 Larsen M R Thingholm T E Jensen O N Roepstorff P Jorgensen T J Highly

selective enrichment of phosphorylated peptides from peptide mixtures using titanium dioxide

microcolumns Mol Cell Proteomics 2005 4 (7) 873-86

24 Kokubu M Ishihama Y Sato T Nagasu T Oda Y Specificity of immobilized

metal affinity-based IMACC18 tip enrichment of phosphopeptides for protein phosphorylation

analysis Anal Chem 2005 77 (16) 5144-54

25 Swaney D L Wenger C D Thomson J A Coon J J Human embryonic stem cell

phosphoproteome revealed by electron transfer dissociation tandem mass spectrometry Proc

Natl Acad Sci U S A 2009 106 (4) 995-1000

157

26 Hao P Guo T Sze S K Simultaneous analysis of proteome phospho- and

glycoproteome of rat kidney tissue with electrostatic repulsion hydrophilic interaction

chromatography PLoS One 2011 6 (2) e16884

27 Rush J Moritz A Lee K A Guo A Goss V L Spek E J Zhang H Zha X

M Polakiewicz R D Comb M J Immunoaffinity profiling of tyrosine phosphorylation in

cancer cells Nat Biotechnol 2005 23 (1) 94-101

28 Ficarro S Chertihin O Westbrook V A White F Jayes F Kalab P Marto J A

Shabanowitz J Herr J C Hunt D F Visconti P E Phosphoproteome analysis of

capacitated human sperm Evidence of tyrosine phosphorylation of a kinase-anchoring protein 3

and valosin-containing proteinp97 during capacitation J Biol Chem 2003 278 (13) 11579-89

29 Washburn M P Wolters D Yates J R 3rd Large-scale analysis of the yeast

proteome by multidimensional protein identification technology Nat Biotechnol 2001 19 (3)

242-7

30 Dowell J A Frost D C Zhang J Li L Comparison of two-dimensional

fractionation techniques for shotgun proteomics Anal Chem 2008 80 (17) 6715-23

31 Song C Ye M Han G Jiang X Wang F Yu Z Chen R Zou H Reversed-

phase-reversed-phase liquid chromatography approach with high orthogonality for

multidimensional separation of phosphopeptides Anal Chem 2010 82 (1) 53-6

32 Palumbo A M Smith S A Kalcic C L Dantus M Stemmer P M Reid G E

Tandem mass spectrometry strategies for phosphoproteome analysis Mass Spectrom Rev 2011

30 (4) 600-25

33 Beausoleil S A Jedrychowski M Schwartz D Elias J E Villen J Li J Cohn M

A Cantley L C Gygi S P Large-scale characterization of HeLa cell nuclear

phosphoproteins Proc Natl Acad Sci U S A 2004 101 (33) 12130-5

34 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and

protein sequence analysis by electron transfer dissociation mass spectrometry Proc Natl Acad

Sci U S A 2004 101 (26) 9528-33

35 Coon J J Syka J E P Schwartz J C Shabanowitz J Hunt D F Anion

dependence in the partitioning between proton and electron transfer in ionion reactions

International Journal of Mass Spectrometry 2004 236 (1acirceuroldquo3) 33-42

36 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 2011 10 (9) 4219-29

37 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-

site identity and position on electron-transfer dissociation of polypeptide cations J Am Chem Soc

2007 129 (40) 12232-43

38 Coon J J Collisions or electrons Protein sequence analysis in the 21st century Anal

Chem 2009 81 (9) 3208-15

39 Miller M E Cross F R Distinct subcellular localization patterns contribute to

functional specificity of the Cln2 and Cln3 cyclins of Saccharomyces cerevisiae Mol Cell Biol

2000 20 (2) 542-55

40 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

158

41 Beausoleil S A Villen J Gerber S A Rush J Gygi S P A probability-based

approach for high-throughput protein phosphorylation analysis and site localization Nat

Biotechnol 2006 24 (10) 1285-92

42 Sutton A Immanuel D Arndt K T The SIT4 protein phosphatase functions in late

G1 for progression into S phase Mol Cell Biol 1991 11 (4) 2133-48

43 Jansen J M Wanless A G Seidel C W Weiss E L Cbk1 regulation of the RNA-

binding protein Ssd1 integrates cell fate with translational control Curr Biol 2009 19 (24)

2114-20

44 Kanczewska J Marco S Vandermeeren C Maudoux O Rigaud J L Boutry M

Activation of the plant plasma membrane H+-ATPase by phosphorylation and binding of 14-3-3

proteins converts a dimer into a hexamer Proc Natl Acad Sci U S A 2005 102 (33) 11675-80

45 Maudoux O Batoko H Oecking C Gevaert K Vandekerckhove J Boutry M

Morsomme P A plant plasma membrane H+-ATPase expressed in yeast is activated by

phosphorylation at its penultimate residue and binding of 14-3-3 regulatory proteins in the

absence of fusicoccin J Biol Chem 2000 275 (23) 17762-70

159

Figure 1 The general workflow indicating the major steps involved in sample collection

sample processing mass spectrometric data acquisition and analysis of comparative

phosphoproteomics for starved vs glucose fed yeast (A) The high pH RP separation

procedure for combining fractions to reduce low peptide containing fractions from the

UV-VIS trace is also shown (B)

160

Figure 2 Heat map for the global transcriptional response to glucose fed vs starved samples

S288C cells starved for glucose until growth was arrested and subsequently glucose was added

161

Samples for microarray analysis were taken at the starved state and 60 minutes after glucose was

added The heat map shows the fed log2 fold change for each gene relative to the starved state

across the genome performed in biological replicate (A) Black indicates no change in

expression level while red indicates higher expression for the fed relative to the starved state

Green indicates higher expression for the starved state compared to the fed state (Adapted from

Dr Michael Conways Thesis)

162

Figure 3 The motif for protein kinase A (PKA) that was observed for fed vs starved which is

xxxRKRKxSxxxxxx with the phosphorylation occurring on the serine The motif occurred at a

rate 264 fold higher than the yeast proteome used for background In addition one protein was

observed in both starved and fed with accession identification of F16P (Fructose-16-

bisphosphatase)

163

06 FDR phosphopeptide analysis

Table 1 The number of phosphopeptides identified with a 06 FDR rate for starved and

glucose fed yeast Only unique phosphopeptides are included in the numbers listed Ascore

localization probability ge80 and no Ascore value which totals all phosphopeptides regardless

of Ascore value even if the Ascore localization value was 0 are shown for starved or fed

samples The number of phosphorylation events on each unique peptide is also included The

vast majority of the phosphopeptides identified have a single phosphosite

Starved Fed All

Ascore ge80 score

unique

STY 164 153 317

S 87 (530) 82 (536) 169 (533)

T 60 (366) 55 (359) 115 (363)

Y 17 (104) 16 (105) 33 (104)

Unique no Ascore

STY 242 235 477

S 131 (541) 133 (566) 264 (553)

T 86 (355) 78 (332) 164 (344)

Y 25 (103) 24 (102) 49 (103)

Phosphorylation events

on each unique peptide

1 102 113 187

2 36 40 68

3 12 11 22

4 or more 8 3 11

164

54 FDR phosphopeptide analysis

Starved Fed All

Ascore ge80 score

unique

STY 217 217 434

S 115 (530) 113 (521) 228 (525)

T 78 (359) 78 (359) 156 (359)

Y 24 (111) 26 (120) 50 (115)

Unique no Ascore

STY 337 332 669

S 193 (573) 180 (542) 373 (558)

T 111 (329) 116 (349) 227 (339)

Y

Phosphorylation events

on each unique peptide

1

2

3

4 or more

33 (98)

135

56

16

11

36 (108)

169

55

14

3

69 (103)

280

100

27

13

Table 2 The number of phosphopeptides identified with a 54 FDR rate for starved and

glucose fed yeast Only unique phosphopeptides are included in the numbers listed Ascore

localization probability ge80 and no Ascore value which totals all phosphopeptides regardless

of Ascore value even if the Ascore localization value was 0 are shown for starved or fed

samples The number of phosphorylation events on each unique peptide is also included The

vast majority of the phosphopeptides identified have a single phosphosite

165

Figure 4 Ascore validation of the phosphopeptide LSLAtKK from the protein SGM1 (slow

growth on galactose and mannose protein 1) with 100 localization probability observed

in only fed but not starved yeast samples Ascorersquos algorithm identifies a b and y-type

ions and looks to identify peaks that provide evidence for a specific phosphorylation site

For this example Ascore assigns this phosphosite to threonine (T5) instead of the serine

(S2) The intense peaks at 4024 mz and 5250 mz are not identified by any a b or y-

type ions From the ladder sequence of the peptide shown numerous ions indicate the

threonine is phosphorylated while the serine is not Among these ions used for

localization are b2 y2 y5+H2O y3 y4 and y5

166

Chapter 6

Use of electron transfer dissociation for neuropeptide sequencing and

identification

Adapted from ldquoDiscovery and Characterization of the Crustacean Hyperglycemic Hormone

Precursor Related Peptides (CPRP) and Orcokinin Neuropeptides in the Sinus Glands of the Blue

Crab Callinectes sapidus Using Multiple Tandem Mass Spectrometry Techniquesrdquo Hui L

Cunningham R Zhang Z Cao W Jia C Li L J Proteome Res 2011 10(9) 4219-29

167

Abstract

The crustacean Callinectes sapidus sinus gland (SG) is a well-defined neuroendocrine site that

produces numerous hemolymph-borne agents including the most complex class of endocrine

signaling moleculesmdashneuropeptides One such peptide is the crustacean hyperglycemic hormone

(CHH) precursor-related peptide (CPRPs) which was not previously sequenced Electron

transfer dissociation (ETD) was used for sequencing of intact CPRPs due to their large size and

high charge state In addition ETD was used to sequence the sulfated peptide Cholecystokinin

CCK-like Homarus americanus using a salt adduct Collectively these two examples

demonstrated the utility of ETD in sequencing neuropeptides with large molecular weight or

with labile modifications

168

Introduction

Neuropeptides are the largest and most diverse group of endocrine signaling molecules in

the nervous system They are necessary and critical for initiation and regulation of numerous

physiological processes such as feeding reproduction and development1 2

Mass spectrometry

(MS) with unique advantages such as high sensitivity high throughput chemical specificity and

the capability of de novo sequencing with limited genomic information is well suited for the

detection and sequencing of neuropeptides in endocrine glands and simultaneously provides the

potential for information on post-translational modifications such as sulfonation3-6

The sinus glands (SG) are located between the medulla interna and medulla externa of the

eyestalk in Callinectes sapidus It is a well-known neuroendocrine site that synthesizes and

secretes peptide hormones regulating various physiological activities such as molting

hemolymph glucose levels integument color changes eye pigment movements and

hydro-mineral balance7 Our labrsquos previous neuropeptidomic studies of SG in several

crustacean species including Cancer borealis8-11

Carcinus maenas12

and Homarus americanus13

14 used multiple MS strategies combining MALDI-based high resolution accurate mass profiling

biochemical derivatization and nanoscale separation coupled to tandem MS for de novo

sequencing In the current study we explore the neuropeptidome of the SG in the blue crab

Callinectes sapidus a vital species of economic importance on the seafood market worldwide In

total 70 neuropeptides are identified including 27 novel neuropeptides and among them the

crustacean hyperglycemic hormone precursor-related peptides (CPRPs) and orcokinins represent

major neuropeptide families known in the SG

The crustacean hyperglycemic hormone (CHH) and its precursor-related peptide (CPRP) are

produced concurrently during the cleavage of CHH from the CHH preprohormone protein15

The

169

CPRP peptide is located between the signal peptide and the CHH sequence and is separated from

the CHH by a dibasic cleavage site The functions of CPRPs are currently unknown16

However

the complete characterization of CPRPs provides a foundation for future experiments elucidating

their functional roles in crustaceans The cDNA of CHH preprohormone in SG of Callinectes

sapidus has been characterized17

but the direct detection of CPRP has not been reported due to

its relatively large size and possible post-translational modifications While small fragments of

CPRPs can be identified using nanoLC-ESI-Q-TOF tandem mass spectrometry the intact

peptide is difficult to detect due to the large molecular weight of CPRPs

Peptidomics traditionally uses collision induced dissociation (CID) for tandem MS

experiments for de novo sequencing Recently an alternative peptide fragmentation method has

been developed using the transfer of an electron termed electron transfer dissociation (ETD)18 19

ETD involves a radical anion with low electron affinity to be transferred to peptide cation which

results in reduced sequence discrimination and thus provides improved sequencing for larger

peptides compared to CID20

Specifically for an ion trap ETD the fluoranthene radical anion is

allowed to react with peptide cations in the three dimensional trap The resulting dissociation

cleaves at the N-Cα bond to produce c and zbull type product ions The peptidome of model

organisms like Callinectes sapidus can be further explored and expanded utilizing ETD as a

complementary fragmentation technique to CID Previous peptidomic analysis has been

completed using ETD as an additional fragmentation method21

It was observed that

enzymatically produced peptides with a higher mz produced improved sequence coverage using

ETD This observation termed decision tree analysis determined that a charge state of ge6 all

peptides endogenous or enzymatic should be fragmented via ETD22

In the present study the

highly charged peptide CPRP with an intact molecular weight of 3837 readily produces plus six

170

charge states The higher charge state of CPRP is highly amenable to fragmentation by ETD

which produces remarkably improved fragmentation and thus increased sequence coverage when

compared to CID

Sulfonation of tyrosine is a post-translational modification (PTM) that is thought to occur on

trans-membrane spanning and secreted proteins23

Cholecystokinin-8 (CCK-8) is a sulfated

peptide which has been linked to satiety24-26

and a CCK-like peptide has been observed in

lobster27

Sulfonation is an extremely labile modification and is often lost during soft

ionization such as ESI or MALDI but can be avoided by optimizing the ionization process28

One potential strategy to identify sulfopeptides is to scan for a neutral loss of 80 Da after CID

but this method does not allow for identification of site of sulfonation and has the risk to be

mistaken for phosphorylation The sulfonation PTM is a negatively charged modification on

the peptide which allows for negative ion scanning in the mass spectrometer but provides

minimal MSMS sequence coverage during CID due to preferential loss of the sulfate group

Alternatively electron-based dissociation technique enables more rapid radical driven

fragmentation where the cleavage pattern is more uniform along the peptide backbone without

initially cleaving the labile sulfated group thus preserving the site of modification These types

of dissociation techniques only work for multiply-charged ions thus a method to install a

multiply-charged cation on the target peptide is desirable It has been shown that the electron

capture dissociation (ECD) can be used to sequence sulfated peptides when a multi-charged

cation is added to the solution29

We use a similar multi-charge cation solution technique to

sequence a sulfated peptide from Homarus americanus via ETD on an ion trap mass

spectrometer Here we presented the use of the ETD fragmentation technique for the analysis

of large peptides and peptides containing labile post-translational modification

171

Experimental Section

Chemical and materials

Acetonitrile methanol magnesium chloride (ACS grade) potassium chloride (ACS grade) and

calcium chloride (ACS grade) were purchased from Fisher Scientific (Pittsburgh PA) Formic

acid was from Sigma-Aldrich (St Louis MO) Human sulfated CCK-8 and CCK-like peptide

(E(pyro)FDEY(Sulfo)GHMRFamide) were purchased from American Peptide (Sunnyvale CA)

Fused-silica capillary with 75 μm id and 360 μm od was purchased from Polymicro

Technologies (Phoenix AZ) Millipore C18 Ziptip column was used for sample cleaning and all

water used in this study was deionized water (182 MΩcm) prepared from a Milli-Q Millipore

system (Billerica MA) The physiological saline consisted of 440 mM NaCl 11mM KCl 26

mM MgCl2 13 mM CaCl2 11 mM Trizma base and 5 mM maleic acid in pH 745

Animals and dissection

Callinectes sapidus (blue crab) were obtained from commercial food market and maintained

without food in artificial sea water at 10-12 ordmC Animals were cold-anesthetized by packing on

ice for 15-30min before dissection The optic ganglia with the SG attached were dissected in

chilled (~4ordmC) physiological saline and individual SGs were isolated from the optic ganglia by

micro-dissection and immediately placed in acidified methanol (90 methanol 9 glacial acetic

acid and 1 water) and stored at -80ordmC until tissue extraction

Tissue homogenization

Acidified methanol was used during the homogenization of SGs The homogenized SGs were

immediately centrifuged at 16100 g using an Eppendorf 5415D tabletop centrifuge (Eppendorf

172

AG) The pellet was washed using acidified methanol and combined with the supernatant and

further dried in a Savant SC 110 SpeedVac concentrator (Thermo Electron Corporation) The

resulting dried sample was re-suspended with a minimum amount (20 microL) of 01 formic acid

Fractionation of homogenates using reversed phase (RP)-HPLC

The re-suspended extracts were then vortexed and briefly centrifuged The resulting supernatants

were subsequently fractionated via high performance liquid chromatography (HPLC) HPLC

separations were performed using a Rainin Dynamax HPLC system equipped with a Dynamax

UV-D II absorbance detector (Rainin Instrument Inc Woburn MA) The mobile phases included

Solution A (deionized water containing 01 formic acid) and Solution B (acetonitrile containing

01 formic acid) About 20 μl of extract was injected onto a Macrosphere C18 column (21 mm

id x 250 mm length 5 microm particle size Alltech Assoc Inc Deerfield IL) The separation

consisted of a 120 minute gradient of 5-95 Solution B Fractions were automatically collected

every two minutes using a Rainin Dynamax FC-4 fraction collector (Rainin Instrument Inc

Woburn MA) The fractions were then concentrated using in Savant SC 110 SpeedVac

concentrator (Thermo Electron Corporation) and re-suspended with minimum amount of 01

formic acid

Nano-LC-ESI-Q-TOF MSMS

Nanoscale LC-ESI-Q-TOF MSMS was performed using a Waters nanoAcquity UPLC system

coupled to a Q-TOF Micro mass spectrometer (Waters Corp Milford MA)

Chromatographic separations were performed on a homemade C18 reversed phase capillary

column (75 microm internal diameter x100 mm length 3 microm particle size 100 Aring) The mobile phases

173

used were 01 formic acid in deionized water (A) 01 formic acid in acetonitrile (B) An

aliquot of 50 microl of a tissue extract or HPLC fraction was injected and loaded onto the trap

column (Zorbax 300SB-C18 Nano trapping column Agilent Technologies Santa Clara CA)

using 95 mobile phase A and 5 mobile phase Bat a flow rate of 10 microlmin for 10 minutes

Following this the stream select module was switched to a position at which the trap column

came in line with the analytical capillary column and a linear gradient of mobile phases A and B

was initiated For neuropeptides the linear gradient was from 5 buffer B to 45 buffer B over

90 min The nanoflow ESI source conditions were set as follows capillary voltage 3200 V

sample cone voltage 35 V extraction cone voltage 1 V source temperature 100ordmC A data

dependent acquisition was employed for the MS survey scan and the selection of three precursor

ions and subsequent MSMS of the selected parent ions The MS scan range was from mz

400-1800 and the MSMS scan was from mz 50-1800

Peptide Prediction De Novo Sequencing and Database Searching

De novo sequencing was performed using a combination of MassLynxTM

41 PepSeq software

(Waters) and manual sequencing Tandem mass spectra acquired from the Q-TOF were first

deconvoluted using MaxEnt 3 software (Waters) to convert multiply charged ions into their

singly charged forms The resulting spectra were pasted into the PepSeq window for sequencing

analysis The candidate sequences generated by the PepSeq software were compared and

evaluated for homology with previous known peptides The online program blastp (National

Center for Biotechnology Information Bethesda MD httpwwwncbinlmnihgovBLAST)

was used to search the existing NCBI crustacean protein database using the candidate peptide

sequences as queries For all searches the blastp database was set to non-redundant protein

174

sequences (ie nr) and restricted to crustacean sequences (ie taxid 6657) For each of the

proteins putatively identified via blastp the BLAST score and BLAST-generated E-value for

significant alignment are provided in the appropriate subsection of the results Peptides with

partial sequence homology were selected for further examination by comparing theoretical

MSMS fragmentation spectra generated by PepSeq with the raw MSMS spectra If the

fragmentation patterns did not match well manual sequencing was performed

NanoLC Coupled to MSMS by CID and ETD

Setup for RP nanoLC separation

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN (Fisher Scientific Optima LCMS grade Fair Lawn NJ) Injections

consisted of 5 microL of prepared tissue extract onto an Agilent Technologies Zorbax 300 SB-C18 5

microm 5 x 03 mm trap cartridge (Santa Clara CA) at a flow rate of 5 microLmin for 5 minutes at 95

A5 B Separation was performed on a Waters HPLC column with 3 microm Atlantis dC18 75 microm

x 150 mm (Milford MA) on a gradient from 5 to 45 mobile phase B at 250 nLmin over 90

minutes at room temperature Emitter tips were pulled from 75 microm inner diameter 360 microm

outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using a commercial

laser puller model P-2000 (Sutter Instrument Co Novato CA)

Mass Spectrometry Data Acquisitions for Collision Induced Dissociation (CID)

An ion-trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany) equipped

with an on-line nanospray source was used for mass spectrometry data acquisition Hystar

(Version 32 Bruker Daltonics Bremen Germany) was used to couple and control Eksigent

175

nanoLC software (Dublin CA Version 30 Beta Build 080715) to MS acquisition for all

experiments Smart parameter setting (SPS) was set to mz 700 compound stability and trap

drive level were set at 100 Optimization of the nanospray source resulted in dry gas

temperature 125oC dry gas 40 Lmin capillary voltage -1300 V end plate offset -500 V

MSMS fragmentation amplitude 10V and SmartFragmentation set at 30-300

Data was generated in data dependent mode with strict active exclusion set after two spectra and

released after one minute MSMS was obtained via CID fragmentation for the six most

abundant MS peaks with a preference for doubly charged ions and excluded singly charged ions

For MS generation the ion charge control (ICC) target was set to 200000 maximum

accumulation time 5000 ms four spectral average acquisition range of mz 300-1500 and scan

speed (enhanced resolution) of 8100 mz s-1

For MSMS generation the ICC target was set to

200000 maximum accumulation time 5000 ms three spectral averages acquisition range of

mz 100-2000 and scan speed (Ultrascan) of 32000 mz s-1

Mass Spectrometry Data Acquisitions for Electron Transfer Dissociation (ETD)

The amaZon ETD ion-trap mass spectrometer was operated in electron transfer dissociation for

MSMS fragmentation with the same optimized settings as reported for CID unless otherwise

stated Smart parameter setting (SPS) was set to 500 mz compound stability and trap drive

level were set at 100 MSMS was obtained via ETD fragmentation for the four most

abundant MS peaks with no preference for specifically charged ions except to exclude singly

charged ions The ETD reagent parameters were set to 400000 target ICC for the fluoranthene

radical anion The fluoranthene radical anion has an mz of 210 and therefore the 210 mz value

was removed from the resulting MSMS spectra and 160 mz was set for the MSMS low mz

cut-off

176

Direct Infusion of Sulfated Peptide Standards into the Mass Spectrometer using ETD and

CID Fragmentation

The sulfated peptide standards were infused into the mass spectrometer at a flow rate of 300

nlmin using a fused-silica capillary with 75 μm id and 360 μm od coupled to a custom pulled

tip The CCK-8 human peptide (20 microM) was infused in 5050 ACNH2O and detected in

negative ionization mode with an ICC of 70000 and fragmented with CID using the same

settings as the previous CID experiments Alternatively the CCK-like lobster sulfated peptide

(2 microM) was mixed with 30 microM of different salts (MgCl2 KCl or CaCl2) and 01 formic acid in

5050 ACNH2O The plus three charge of the CCK-like lobster peptide was selected for ETD

fragmentation in positive mode with the same setting as the previous ETD experiments The

data were then de novo sequenced for coverage and localization of the sulfation site

Data Analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen Germany)

Deviations in parameters from the default Protein Analysis in DataAnalysis were as follows

intensity threshold 1000 maximum number of compounds 1E7 and retention time window 05

minutes These parameter changes assisted in providing better quality spectra for sequencing

Identification of peptides was performed using Mascot (Version 23 Matrix Science London

UK) Searches were performed against a custom crustacean database none enzyme allow up to

1 missed cleavage amidated C-terminal as variable modifications peptide tolerance mass error

12 Da MSMS mass error tolerance is 06 Da

Results and Discussion

177

Identification and Characterization of Intact CPRPs Using ETD

Using ESI-Q-TOF 9 truncated CPRPs fragments were sequenced to cover the amino acid

sequence of the intact CPRP peptide RSAEGLGRMGRLLASLKSDTVTPLRGFEGETGHPLE

A representative CID MSMS spectrum of ASLKSDTVTPLR is shown in Figure 1 (a) CID

using ESI-Q-TOF mainly produces fragmentation for doubly and triply charge molecules which

is adequate to sequence peptides with relatively small molecular weight (less than 2000 Da)

However CID fragmentation is insufficient to sequence the larger intact CPRP in a complex

sample whose molecular mass is 3837 Da and therefore it is extremely difficult to directly

sequence by traditional CID ESI-Q-TOF ETD is an attractive fragmentation alternative to

sequence the highly charged CPRP peptide In ETD a protonated peptide reacts with an anion

(fluoranthene radical) where the anion transfers an electron to the peptide cation and the resulting

fragmentation occurs along the N-Cα bond by free-radical-driven-cleavage The large size of

CPRP offers multiple basic amino acid sites to sequester charge and reduce the sequence

coverage from collision induced dissociate by preventing random backbone cleavage whereas

ETD does not cleave the weakest bond and provides the alternative fragmentation pathway to

obtain superior sequence coverage to larger peptides As shown in Figure 1 (cd) the

fragmentation of ETD is highly amenable to ETD compared to the CID fragmentation in Figure

1 (b) As evident from comparison ETD offers more extensive fragmentation than CID thus

providing enhanced coverage for large intact peptides such as CPRPs Specifically we observe

125 of the b- and y- cleavages for CID as compared to an average of 535 of c- and zbull-

fragments More than a four-fold increase in fragments using ETD suggests the utility of this

relatively new tandem MS fragmentation method as complementary techniques for de novo

sequencing of large intact endogenous peptides and novel neuropeptide discovery endeavors

178

Negative Mode Sulfated Peptide Identification

An accepted method for identification and quantification for sulfated peptides is negative

ionization30

CCK-8 sulfated peptide standards show intense signal in negative ionization mode

without needing to have additives added such as salts Figure 2 shows the CID MSMS

spectrum from the negative ionization and produces an intense b6 ion at 430 mz The transition

from the doubly negatively charged MS ion of 570 mz to the b6 ion provides a selected reaction

monitoring transition for quantification studies but the sequence information is limited and the

presence of the methionine produces variable oxidation In addition Figure 2 shows the

MSMS product ions with the loss of the sulfate group thus making site-specific location of

sulfation more difficult

Investigation of Cation Salt Adducts to Produce Multiply Charged Sulfated Peptides

Sulfated peptides can be isolated in positive scanning mode but usually only in a plus one

state with low signal intensity If ETD is performed on the singly charged peptide cation a

neutral is formed and is lost in the mass spectrometer and thus no sequence information can be

obtained In order to remedy this situation a technique that adding metal salts to peptides to

increase charge state for ECD used in Fourier transform ion cyclotron resonance mass

spectrometry (FTICR-MS)29

inspired the investigation of increasing charge state of targeted

peptide via metal salt adduction followed by ETD analysis in a Bruker amaZon ion trap

Various salts were investigated including MgCl2 CaCl2 and KCl each used at a concentration of

30 microM and mixed with a sulfated peptide standard at 2 microM The addition of KCl produced

mostly doubly charged ion of the lobster CCK-like peptide standard K adduct which produced

insufficient sequence information from ETD fragmentation (data not shown) In comparison

the MgCl2 and CaCl2 produced intense triply charged peptide ions but CaCl2 produced lower

179

signal intensity compared to MgCl2 (data not shown)

Cation Assisted ETD Fragmentation of CCK-like Lobster Sulfated Peptide and Future

Directions

The addition of MgCl2 to the lobster CCK-like sulfated peptide is shown in Figure 3

Except for z1 the complete z-series is obtained including the z7 ion with and without the

sulfation PTM The spectrum is complicated by the additional Mg adducts but several peaks

are shown without any Mg adduct such as z2 z3 z4 etc Clearly the method of cation

assisted ETD fragmentation can be useful using ion trap instruments and can properly sequence

sulfated peptides that are prone to neutral loss from the labile PTM One concern about future

direction in sulfopeptide isolation in non-genome sequenced species is isolation of sulfopeptides

Currently antibodies to specific sulfopeptides is the only commonly used enrichment technique

for sulfopeptides Also since metal cations are needed for this method direct infusion into an

ESI mass spectrometer is preferred to prevent running large amounts of adduct forming salts

through the LC system With direct infusion the lack of separation confounds the problem in

sulfopeptide detection because any co-eluting peptides could have ionization efficiency and thus

reduce the signal of the sulfopeptide Once discovered and sequenced a selected reaction

monitoring (SRM) method could be developed using LC retention coupled with negative

ionization mode followed by CID MSMS or possibly MSMSMS to perform quantitative

studies for sulfopeptides

Conclusions

In this study ETD was performed to improve the sequence coverage of large endogenous

neuropeptides with higher charge and bigger size The intact CPRP from Callinectes sapidus was

identified and characterized with 68 sequence coverage by MS for the first time Cation

180

assisted ETD fragmentation was also shown to be a powerful tool in de novo sequencing of

sulfated neuropeptides These endeavors into using ETD for certain neuropeptides will assist in

future analysis of large neuropeptides and PTM containing neuropeptides

181

Reference

1 Schwartz M W Woods S C Porte D Jr Seeley R J Baskin D G Central nervous system control of

food intake Nature 2000 404 (6778) 661-71

2 Sweedler J V Li L Rubakhin S S Alexeeva V Dembrow N C Dowling O Jing J Weiss K R

Vilim F S Identification and characterization of the feeding circuit-activating peptides a novel neuropeptide

family of aplysia J Neurosci 2002 22 (17) 7797-808

3 Baggerman G Cerstiaens A De Loof A Schoofs L Peptidomics of the larval Drosophila melanogaster

central nervous system Journal of Biological Chemistry 2002 277 (43) 40368-40374

4 Desiderio D M Mass spectrometric analysis of neuropeptidergic systems in the human pituitary and

cerebrospinal fluid Journal of Chromatography B 1999 731 (1) 3-22

5 Li L J Kelley W P Billimoria C P Christie A E Pulver S R Sweedler J V Marder E Mass

spectrometric investigation of the neuropeptide complement and release in the pericardial organs of the crab Cancer

borealis Journal of Neurochemistry 2003 87 (3) 642-656

6 Li L J Moroz T P Garden R W Floyd P D Weiss K R Sweedler J V Mass spectrometric survey of

interganglionically transported peptides in Aplysia Peptides 1998 19 (8) 1425-1433

7 Strand F L Neuropeptides regulators of physiological processes 1st ed ed MIT Press Cambridge Mass

1999 p 658 p

8 Fu Q Goy M F Li L J Identification of neuropeptides from the decapod crustacean sinus glands using

nanoscale liquid chromatography tandem mass spectrometry Biochemical and Biophysical Research

Communications 2005 337 (3) 765-778

9 Fu Q Christie A E Li L J Mass spectrometric characterization of crustacean hyperglycemic hormone

precursor-related peptides (CPRPs) from the sinus gland of the crab Cancer productus Peptides 2005 26 (11)

2137-2150

10 Ma M M Chen R B Ge Y He H Marshall A G Li L J Combining Bottom-Up and Top-Down Mass

Spectrometric Strategies for De Novo Sequencing of the Crustacean Hyperglycemic Hormone from Cancer borealis

Analytical Chemistry 2009 81 (1) 240-247

11 Ma M M Sturm R M Kutz-Naber K K Fu Q Li L J Immunoaffinity-based mass spectrometric

characterization of the FMRFamide-related peptide family in the pericardial organ of Cancer borealis Biochemical

and Biophysical Research Communications 2009 390 (2) 325-330

12 Ma M M Bors E K Dickinson E S Kwiatkowski M A Sousa G L Henry R P Smith C M Towle

D W Christie A E Li L J Characterization of the Carcinus maenas neuropeptidome by mass spectrometry and

functional genomics General and Comparative Endocrinology 2009 161 (3) 320-334

13 Chen R B Jiang X Y Conaway M C P Mohtashemi I Hui L M Viner R Li L J Mass Spectral

Analysis of Neuropeptide Expression and Distribution in the Nervous System of the Lobster Homarus americanus

Journal of Proteome Research 2010 9 (2) 818-832

14 Ma M M Chen R B Sousa G L Bors E K Kwiatkowski M A Goiney C C Goy M F Christie A

E Li L J Mass spectral characterization of peptide transmittershormones in the nervous system and

neuroendocrine organs of the American lobster Homarus americanus General and Comparative Endocrinology

2008 156 (2) 395-409

15 Ollivaux C Gallois D Amiche M Boscameric M Soyez D Molecular and cellular specificity of

post-translational aminoacyl isomerization in the crustacean hyperglycaemic hormone family FEBS J 2009 276

(17) 4790-802

16 Wilcockson D C Chung J S Webster S G Is crustacean hyperglycaemic hormone precursor-related

peptide a circulating neurohormone in crabs Cell and Tissue Research 2002 307 (1) 129-138

17 Choi C Y Zheng J Watson R D Molecular cloning of a cDNA encoding a crustacean hyperglycemic

hormone from eyestalk ganglia of the blue crab Callinectes sapidus General and Comparative Endocrinology 2006

148 (3) 383-387

18 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and protein sequence analysis

by electron transfer dissociation mass spectrometry Proc Natl Acad Sci U S A 2004 101 (26) 9528-33

19 Coon J J Syka J E P Schwartz J C Shabanowitz J Hunt D F Anion dependence in the partitioning

between proton and electron transfer in ionion reactions International Journal of Mass Spectrometry 2004 236

(1-3) 33-42

20 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-site identity and

position on electron-transfer dissociation of polypeptide cations J Am Chem Soc 2007 129 (40) 12232-43

182

21 Altelaar A F Mohammed S Brans M A Adan R A Heck A J Improved identification of endogenous

peptides from murine nervous tissue by multiplexed peptide extraction methods and multiplexed mass spectrometric

analysis J Proteome Res 2009 8 (2) 870-6

22 Swaney D L McAlister G C Coon J J Decision tree-driven tandem mass spectrometry for shotgun

proteomics Nat Methods 2008 5 (11) 959-64

23 Monigatti F Hekking B Steen H Protein sulfation analysis--A primer Biochim Biophys Acta 2006 1764

(12) 1904-13

24 Chandler P C Wauford P K Oswald K D Maldonado C R Hagan M M Change in CCK-8 response

after diet-induced obesity and MC34-receptor blockade Peptides 2004 25 (2) 299-306

25 Little T J Feltrin K L Horowitz M Meyer J H Wishart J Chapman I M Feinle-Bisset C A

high-fat diet raises fasting plasma CCK but does not affect upper gut motility PYY and ghrelin or energy intake

during CCK-8 infusion in lean men Am J Physiol Regul Integr Comp Physiol 2008 294 (1) R45-51

26 Blevins J E Morton G J Williams D L Caldwell D W Bastian L S Wisse B E Schwartz M W

Baskin D G Forebrain melanocortin signaling enhances the hindbrain satiety response to CCK-8 Am J Physiol

Regul Integr Comp Physiol 2009 296 (3) R476-84

27 Turrigiano G G Selverston A I A cholecystokinin-like hormone activates a feeding-related neural circuit in

lobster Nature 1990 344 (6269) 866-8

28 Wolfender J L Chu F Ball H Wolfender F Fainzilber M Baldwin M A Burlingame A L

Identification of tyrosine sulfation in Conus pennaceus conotoxins alpha-PnIA and alpha-PnIB further investigation

of labile sulfo- and phosphopeptides by electrospray matrix-assisted laser desorptionionization (MALDI) and

atmospheric pressure MALDI mass spectrometry J Mass Spectrom 1999 34 (4) 447-54

29 Liu H Hakansson K Electron capture dissociation of tyrosine O-sulfated peptides complexed with divalent

metal cations Anal Chem 2006 78 (21) 7570-6

30 Young S A Julka S Bartley G Gilbert J R Wendelburg B M Hung S C Anderson W H

Yokoyama W H Quantification of the sulfated cholecystokinin CCK-8 in hamster plasma using

immunoprecipitation liquid chromatography-mass spectrometrymass spectrometry Anal Chem 2009 81 (21)

9120-8

183

Figure 1 Comparison of tandem MS spectra of truncated CPRP (ASLKSDTVTPL mz 128766)

by ESI-Q-TOF (a) and intact CPRP (MW 3837) by the amaZon ETD for both CID and ETD

fragmentation (a) MSn of precursor ion with mz 640 with charge state +6 by CID represent

loss of NH3 ordm represent loss of H2O (b) MS+6

of precursor ion with mz 640 with charge state +6

by ETD at z represent z+1 z represent z+2 (c) MS+5

of precursor ion with mz 768 with charge

state +5 by ETD z represent z+1 z represent z+2 For all labels in Figure 1 charge state +1 is

not specified

184

185

Figure 2 Negative CID spectrum of CCK-8 peptide standard (20 microM) via direct infusion show

the doubly charged b6 ion provides the most intense MSMS transition

186

Figure 3 ETD spectrum of E(pyro)FDEY(Sulfo)GHMRF-amide (2 microM) obtained on the

amaZon ETD via direct infusion with 30 microM MgCl2 The sulfated peptide can be identified

with a visible z-series of z2 to z9 and identified sulfate loss

187

Chapter 7

Investigation and reduction of sub-microgram peptide loss using molecular

weight cut-off fractionation prior to mass spectrometric analysis

Adapted from ldquoInvestigation and reduction of sub-microgram peptide loss using molecular

weight cut-off fractionation prior to mass spectrometric analysisrdquo Cunningham R Wang J

Wellner D Li L Journal of Mass Spectrometry In Press

188

ABSTRACT

This work investigates the introduction of methanol and a salt modifier to molecular

weight cut-off membrane-based centrifugal filters (MWCO) to enrich sub-microgram peptide

quantities Using a neuropeptide standard bradykinin sample loss was reduced over two orders

of magnitude with and without undigested protein present Additionally a bovine serum

albumin (BSA) tryptic digestion was investigated Twenty-seven tryptic peptides were identified

from MALDI mass spectra after enriching with methanol while only two tryptic peptides were

identified after the standard MWCO protocol The strategy presented here enhances recovery

from MWCO separation for sub-microg peptide samples

INTRODUCTION

Molecular weight cut-off membrane-based centrifugal filter devices (MWCO) are

commonly used to desalt and concentrate large molecular weight proteins [1] Greeing and

Simpson recently investigated various MWCO membranes for large amounts of starting material

(~6 mg) focusing on optimal conditions for the sub 25 kDa protein fraction [2] The authors

recovered 200 μg to 29 mg of protein from multiple MWCO experiments and demonstrated that

a 1090 acetonitrile (ACN)H2O elution solvent produced optimal results [3] In addition Manza

et al provided an alternative approach to isolate proteins with a 5 kDa MWCO by using

NH4HCO3 and recovering the retained proteins [4] Alternatively the elution from MWCOs can

be collected to recover only low molecular weight peptides Multiple peptidomic studies have

utilized MWCOs for peptide isolation during the first few steps of sample preparation [5 6]

When sample amount is limited or peptide content is below 1 microg sample loss is a significant

concern when using MWCOs to isolate endogenous peptides Optimized protocols have been

189

investigated using ACN [3 7] salt [4 8] SDS [5] or native sample [6 9 10] but these

experiments primarily focused on large sample amounts rather than sub-microgram peptide

quantities

MWCOs separate large molecules from small molecules The small molecule fraction

may be rich with signaling peptides (SP) cytokines and other small molecules involved in cell-

cell signaling Signaling peptides perform various functions in the body including cell growth

cell survival and hormonal signaling between organs [11] Individual SP contribute to different

aspects of behavior such as pain (enkephalins) [12] feeding (neuropeptide Y) [13] and blood

pressure (bradykinin) [14] MWCO separations can be used to enrich biologically important SP

and explore the peptide content from biological fluids with relatively low peptide content like

blood or cerebrospinal fluid (CSF) In a recent investigation the detection of neuropeptides and

standards in crustacean hemolymph was improved when methanol and protease inhibitors were

present before performing MWCO neuropeptide isolation The impact of methanol on MWCO

sample loss was not investigated in the study [15] In another study a large-scale mass

fingerprinting protocol of endogenous peptides from CSF used a combination of salts before

MWCO fractionation but the impact of adding salts was not discussed [16] The most

commonly used brand of MWCO in the publications and in peptidomic studies is Millipore

Therefore Millipore MWCOs (using regenerated cellulose as the membrane) were used in the

present study The purpose of this work is to provide an optimized sample preparation technique

for MWCO filtering to reduce sample loss and allow sub-microg detection of peptides using MALDI

mass spectrometry

MATERIALS AND METHODS

190

Materials and Chemicals

Water acetonitrile methanol (optima LCMS grade) and sodium chloride (995) were

purchased from Fisher Scientific (Fair Lawn NJ) The α-cyano-4-hydroxy-cinnamic acid (99)

formic acid (FA) (ge98) and bovine serum albumin (ge96) were purchased from Sigma-

Aldrich (St Louis MO) Amicon Ultra 05 mL 10000 MWCO centrifugal filters and ZipTips

packed with C18 reversed-phase resin were purchased from Millipore (Billerica MA) Trypsin-

digested bovine serum albumin (BSA) was purchased from Waters (Milford MA) Bradykinin

was purchased from American Peptide Company (Sunnyvale CA)

MALDI MS Instrumentation

An AutoFLEX III MALDI TOFTOF mass spectrometer (Bruker Daltonics Billerica

MA) was operated in positive ion reflectron mode The MALDI MS instrument is equipped with

a proprietary smart beam (Bruker Daltonics Billerica MA) laser operating at 200 Hz The

instrument was internally calibrated over the mass range of mz 500minus2500 using a standard

peptide mix Two thousand laser shots were collected per sample spot at an accelerating voltage

of 19 kV and a constant laser power using random shot selection The acquired data were

analyzed using FlexAnalysis software (Bruker Daltonics Billerica MA) Mass spectrometry

data acquisition was obtained by averaging 2000 laser shots

Molecular weight cut off separation procedure

The MWCO separations were performed using Amicon Ultra 05 mL 10000 MWCO

centrifugal filters (Billerica MA) Before MWCO separation three washing steps were

performed to remove contaminants from the filter The three washes were 500 μL of 5050

H2OMeOH 500 μL of H2O and 400 μL of the solution used for MWCO separation For the

191

100 H2O solution 1 microg of BSA or bradykinin was used for separation All the other MWCO

separation experiments used 500 ng of BSA or 100 ng or less of bradykinin The MWCO filter

was then centrifuged at 14000 rcf for 5 min at room temperature in an Eppendorf 5415 D

microcentrifuge (Brinkmann Instruments Inc Westbury NY) The filtrate was concentrated in a

Savant SC 110 SpeedVac concentrator (Thermo Electron Corporation West Palm Beach FL)

and acidified The resulting sample was desalted according to the manufacturer using C18

ZipTips from Millipore (Billerica MA) by washing the ZipTip with three times 100 ACN

three aqueous washes of 01 FA binding the peptides from the solution and one aqueous wash

of 01 FA Peptides were then eluted from the ZipTips using 15 microL of 50 ACN in 01 FA

Matrix deposition

Equal volumes of 05 μL from the 15 μL sample solution (including standards not subject

to MWCO filtering) and α-cyano-4-hydroxy-cinnamic acid (CHCA) matrix solution in 50

ACN50 H2O were mixed using a dried-droplet method and spotted on a MALDI target The

resulting droplets were allowed to air dry prior to mass spectrometry acquisition

RESULTS AND DISCUSSION

Analysis of two orders of magnitude increase for bradykinin standard

Bradykinin was selected to assess the potential peptide loss in the flow-through after

performing MWCO separation As shown in Figure 1 1 microg of bradykinin standard does not

produce a detectable signal by MALDI mass spectrometry analysis after a 10 kDa MWCO

separation in water (performed in triplicate) For comparison 1 ng of bradykinin standard

diluted to 15 microL produced an intense signal on the MALDI mass spectrometer suggesting

192

significant sample loss occurs when the target analyte is low in quantity (data not shown

performed in triplicate) Figure 1 shows that the addition of a salt in this case NaCl improves

the limits of detection and decreases sample loss when 7030 watermethanol was compared to

7030 aqueous 1 M NaClmethanol The reproducible results gave a relative standard deviation

(RSD) of 6 for peak intensity Figure 1 shows that even when starting with 1 μg of bradykinin

too much sample is lost during the MWCO separation in water to detect the remainder

However an intense signal is observed for 10 ng of bradykinin after the MWCO separation when

7030 aqueous 1 M NaClmethanol is used as an elution solvent Sample loss from zip-tipping

was estimated in triplicate by calculating the decreases in peak intensity When 10 ng of

bradykinin was used sample loss was ~41 In comparison the calculated yield for 10 ng of

bradykinin after the 7030 aqueous 1 M NaClmethanol MWCO separation and zip-tip desalting

showed an estimated sample loss of 63 meaning more loss can be attributed to sample clean-

up than MWCO filtration

A series of experiments were performed to determine if 7030 aqueous 1 M

NaClmethanol is an optimal solution to recover peptides during a MWCO separation (data not

shown) A 5050 aqueous 1 M NaClmethanol and a 5050 watermethanol elution were

performed in duplicate but signal intensity of the resulting bradykinin was poor and numerous

polymer peaks were detected in the flow through A lower salt concentration 01 M NaCl was

used but also produced greatly reduced signal when compared to aqueous 1 M NaCl To assess

the optimized methodrsquos compatibility with lower sample amounts the 7030 aqueous 1 M

NaClmethanol solution was added to 1 ng of bradykinin for MWCO separation but no signal

was obtained (data not shown) Using a neuropeptide standard the addition of methanol and

NaCl salt significantly improved the sample recovery in sub-microg amounts

193

BSA tryptic peptide mixture analysis

After demonstrating the importance of using an optimized solution for MWCO

separations with an individual peptide the new method was applied to 500 ng of BSA tryptic

digest to investigate its utility with more complex peptide mixtures Table 1 lists the BSA

tryptic peptides identified in the MALDI MS analysis from different solution conditions

processed by MWCO separation As shown in Table 1 a directly spotted BSA tryptic peptide

standard in the absence of any MWCO filtration enabled identification of 39 tryptic peptides by

accurate peptide mass measurements Once again when using 100 H2O for MWCO

separations the starting amount was doubled to 1 microg (also done with 500 ng data not shown)

However many tryptic peptides were not detected due to low signal intensities and non-optimal

elution conditions Instead of H2O a 1 M NaCl solution was used for the MWCO elution but

only two tryptic peptides were identified (Table 1) The addition of 30 methanol into the

sample before MWCO filtration produced the first increase in identified BSA tryptic peptides

The remaining data from Table 1 shows improved BSA tryptic peptide identifications as the

sample (elution) conditions were further optimized Figure 2 shows the actual mass spectra

associated with the three most promising elution solutions along with 100 H2O

The BSA tryptic peptide intensities are shown in Figure 2A and the most intense tryptic

peptide YLYEIAR mz 92749 was observed in the four different solutions shown in Figure 2B

but not in the 100 H2O or 100 1 M NaCl solutions (data not shown) In Figure 2A all mass

spectra are normalized to an intensity of 7 x 104 to illustrate two points First the MWCO

filtering step still produced sample loss regardless of the solvent conditions chosen Second

there is a noticeable increase in peptide peak intensity using the optimized solvent 6040

194

aqueous 1 M NaClmethanol (Figure 2A) Figure 2C displays a zoomed-in view of a BSA

tryptic peptide signal LKECC

DKPLLEK mz 153266 (

carbamidomethyl) observed only in

the optimized solvent The detection of the mz 153266 peptide in Figure 2C highlights the

potential gain in sample and detectable peptides by using an optimized saltMeOH combination

A non-optimized saltMeOH combination will still reduce sample loss but further minimizing

sample loss during sample preparation will always be desirable in any analytical protocol

MWCO composition

The purpose of this application note is to provide evidence of sub-microg sample loss during

MWCO separations of peptide samples and a solution to overcome this limitation The

explanation of why adding MeOH and NaCl to the sample solution provides a significant

reduction in sample loss is beyond the scope of this application note Regardless Supplemental

Table S1 is an expanded version of Table 1 showing the amino acid sequence hydrophobicity

calculated using GRAVY scores and pI of the identified peptides in this study No discernible

trend was obtained from the data The membrane of commonly used MWCO in peptidomics and

for this study is comprised of chemically treated (regenerated) cellulose which is a

polysaccharide containing β (1rarr4) linked D-glucose Glucose has numerous free hydroxyl

groups which could non-specifically adsorb peptides flowing through the MWCO The addition

of MeOH has the most significant effect on signal which could be due to disrupting the

interaction between peptides and hydroxyl groups from glucose NaCl has a less significant

effect on sample recovery compared to MeOH but a detectable reduction in sample loss is noted

This improvement in sample recovery could be analogous to the use of NaCl in

195

immunodepletion protocols to reduce non-specific binding which is accomplished by adding

150 mM NaCl [17]

Analysis of bradykinin in the presence of undigested BSA

When using MWCO for peptide isolation proteins are typically present in the samples

usually in larger amounts Figure 3 shows the effect that adding BSA protein to a 10 ng

bradykinin solution before MWCO fractionation has on the resulting recovery of bradykinin

Adding 10 μg of BSA to the optimized 7030 aqueous 1 M NaClmethanol solution slightly

decreased bradykininrsquos signal with a RSD of 10 More severe signal reduction occurred after

adding 100 microg BSA with a RSD of 2 (N=2) It is not unexpected that more signal reduction

due to sample loss would occur especially in the 100 microg BSA sample because the BSA protein

has a molar ratio of 160 BSA protein molecules to one bradykinin peptide Figure 3 shows the

usefulness of the MWCO method with samples containing large amounts of proteins

RecommendationConclusion

The present work provides solutions to reduce sample loss from the use of MWCO for

sub-microg peptide isolation with or without non-digested proteins in the sample Despite its

widespread utility significant sample loss often occurs during the MWCO fractionation step

which is particularly problematic when analyzing low-abundance peptides from limited starting

material This application note aims to reduce sample loss during MWCO separations

specifically for sub-microg peptide isolation If complex samples are processed with MWCO

separation the authors recommend eluting the sample with 6040 aqueous 1 M NaClmethanol

solution as a starting point to minimize sample loss This application note provides a viable

196

alternative for sub-microg peptide MWCO separation circumventing the need to increase the starting

material by minimizing sample loss from using a MWCO membrane-based centrifugal filter

device

References

[1] HM Georgiou GE Rice MS Baker Proteomic analysis of human plasma failure of

centrifugal ultrafiltration to remove albumin and other high molecular weight proteins

Proteomics 2001 1 1503

[2] DW Greening RJ Simpson Low-molecular weight plasma proteome analysis using

centrifugal ultrafiltration Methods Mol Biol 2011 278 109

[3] DW Greening RJ Simpson A centrifugal ultrafiltration strategy for isolating the low-

molecular weight (ltor=25K) component of human plasma proteome J Proteomics 2010 73

637

[4] LL Manza SL Stamer AJ Ham SG Codreanu DC Liebler Sample preparation and

digestion for proteomic analyses using spin filters Proteomics 2005 5 1742

[5] D Theodorescu D Fliser S Wittke H Mischak R Krebs M Walden M Ross E Eltze O

Bettendorf C Wulfing A Semjonow Pilot study of capillary electrophoresis coupled to mass

spectrometry as a tool to define potential prostate cancer biomarkers in urine Electrophoresis

2005 26 2797

[6] K Antwi G Hostetter MJ Demeure BA Katchman GA Decker Y Ruiz TD Sielaff LJ

Koep DF Lake Analysis of the plasma peptidome from pancreas cancer patients connects a

peptide in plasma to overexpression of the parent protein in tumors J Proteome Res 2009 8

4722

[7] LP Aristoteli MP Molloy MS Baker Evaluation of endogenous plasma peptide extraction

methods for mass spectrometric biomarker discovery J Proteome Res 2007 6 571

[8] A Zougman B Pilch A Podtelejnikov M Kiehntopf C Schnabel C Kumar M Mann

Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome Res 2008 7

386

[9] X Yuan DM Desiderio Human cerebrospinal fluid peptidomics J Mass Spectrom 2005 40

176

[10] X Zheng H Baker WS Hancock Analysis of the low molecular weight serum peptidome

using ultrafiltration and a hybrid ion trap-Fourier transform mass spectrometer J Chromatogr A

2006 1120 173

[11] L Li JV Sweedler Peptides in the brain mass spectrometry-based measurement approaches

and challenges Annu Rev Anal Chem 2008 1 451

[12] GB Stefano G Fricchione Y Goumon T Esch Pain immunity opiate and opioid

compounds and health Med Sci Monit 2005 11 MS47

[13] J Jensen Regulatory peptides and control of food intake in non-mammalian vertebrates Comp

Biochem Physiol A Mol Integr Physiol 2001 128 471

197

[14] A Kuoppala KA Lindstedt J Saarinen PT Kovanen JO Kokkonen Inactivation of

bradykinin by angiotensin-converting enzyme and by carboxypeptidase N in human plasma Am

J Physiol Heart Circ Physiol 2000 278 H1069

[15] R Chen M Ma L Hui J Zhang L Li Measurement of neuropeptides in crustacean

hemolymph via MALDI mass spectrometry J Am Soc Mass Spectrom 2009 20 708

[16] H Jahn S Wittke P Zurbig TJ Raedler S Arlt M Kellmann W Mullen M Eichenlaub H

Mischak K Wiedemann Peptide fingerprinting of Alzheimers disease in cerebrospinal fluid

identification and prospective evaluation of new synaptic biomarkers PLoS One 2011 6

e26540

[17] NA Cellar AS Karnoup DR Albers ML Langhorst SA Young Immunodepletion of high

abundance proteins coupled on-line with reversed-phase liquid chromatography a two-

dimensional LC sample enrichment and fractionation technique for mammalian proteomics J

Chromatogr B Analyt Technol Biomed Life Sci 2009 877 79

198

Table 1 Identified BSA tryptic peptides from various MWCO separation conditions

BSA tryptic

peptide (MH+)

100

H2O 1microg

100

1 M NaCl

70

H2O

80

1 M NaCl

70

1 M NaCl

60

H2O

60

1 M NaCl

5083

5453

6894

7124

8985

9275

10345

10725

11385

11636

12496

12837

13057

13997

14157

14197

14398

14636

14798

15026

15118

15328

15547

15677

15768

16399

16678

16738

17248

17408

17477

17497

18809

18890

19019

19079

20450

21139

22479

Total 39 2 2 6 8 15 15 27

199

Figure 1 Representative MALDI mass spectra after MWCO separation of a bradykinin

standard showing improvement over two orders of magnitude in detection limits Each MWCO

separation was performed at minimum in triplicate with representative spectrum selected for

each with a calculated RSD from the peak heights Three different amounts of bradykinin were

tested to assess the magnitude of sample loss under different MWCO solvent conditions The

top panel shows 1 microg of bradykinin standard after MWCO separation with 100 H2O elution

produced no signal The addition of 40 or 30 MeOH produced very low bradykinin signals

for both 100 ng (RSD of 28) and 10 ng (SNlt3 no RSD calculated) respectively In the

bottom two spectra each showed very large intensity but the 7030 aqueous 1 M NaClmethanol

10 ng bradykinin was processed with a 10kDa MWCO and zip-tipped and was reproducible with

200

a RSD of 6 The last spectrum was from 10 ng bradykinin (RSD of 3) which was diluted to

an equivalent volume as all the other experiments and directly spotted onto the MALDI plate

201

Figure 2 Representative MALDI mass spectra from MWCO separation of a BSA tryptic

peptide standard showing sample loss Stacked mass spectra from mz range 875-2150

normalized to 7 x 104 intensity representing the detection difference from a BSA tryptic peptide

standard from different MWCO separation conditions (A) It should be noted that when the

solvent for MWCO elution was 100 H2O 1 microg of BSA tryptic peptides was processed instead

of 500 ng A zoomed in view of the most abundant BSA tryptic peptide detected YLYEIAR

mz 92749 (B) Various percentages of MeOH produced significant signal but addition of a salt

(1 M NaCl) increases the signal which is closest to a directly spotted BSA tryptic peptide

standard A zoomed in view of a representative low intensity BSA tryptic peptide detected

LKECC

DKPLLEK mz 153266 (C) The optimized solution to be used for MWCO filtration

202

6040 aqueous 1 M NaClmethanol was the only procedure that enabled the detection of the

tryptic peptide in Figure 2C which was also detected in the directly spotted BSA tryptic peptide

standard All experiments were performed a minimum of two times with nearly identical results

) Carbamidomethyl amino acid modification

ordm) Tryptic peptide identified in three of the spectra in Figure 2A

dagger) Tryptic peptide identified in two of the spectra in Figure 2A

) Tryptic peptide identified in a single spectrum in Figure 2A

203

Figure 3 Representative MALDI mass spectra after MWCO separation of a bradykinin

standard with a BSA protein present showing optimized solvent conditions minimized samples

losses Each experiment was performed in duplicate Two different amounts of BSA protein

were tested to assess the magnitude of sample loss caused by the presence of a protein The top

panel shows 10 microg of BSA protein and the second panel shows 100 microg of BSA protein added

while only 10 ng of bradykinin was added Detectable sample loss was observed when the BSA

protein was added but panel two shows that the amount of BSA protein was 1 x 104 greater

(equivalent to 160 fold molar excess) than bradykinin The last two spectra were controls using

a MWCO with the optimized solution in panel 3 and panel 4 using 10 ng bradykinin which was

diluted to an equivalent volume as all the other experiments and directly spotted onto the

MALDI plate

204

Supplemental Table 1 Expanded Table 1 including grand average of hydropathicity (GRAVY)

score theoretical pI and the sequence from the underlying amino acid sequence for the peptides

identified in the BSA digest The GRAVY and pI scores were obtained from ExPASy

Bioinformatics and modifications were not taken into consideration

(httpwebexpasyorgprotparam) Peptide assignments to the recorded peaks were done by

BSA

tryptic

peptide

(MH+)

GRAVY

score

Theoretical

pI

Sequence 100

H2O

1microg

100

1 M

NaCl

70

H2O

80

1 M

NaCl

70

1 M

NaCl

60

H2O

60

1 M

NaCl

5083 NA NA FGER

5453 0900 972 VASLR

6894 0267 979 AWSVAR

7124 -0950 647 SEIAHR

8985 0529 674 LcVLHEK

9275 -0071 600 YLYEIAR

10345 -0725 674 NEcFLSHK

10725 -0211 538 SHcIAEVEK

11385 0 599 ccTESLVNR

11636 0130 453 LVNELTEFAK

12496 -1250 545 FKDLGEEHFK

12837 0264 675 HPEYAVSVLLR

13057 -0582 532 HLVDEPQNLIK

13997 0567 437 TVMENFVAFVDK

14157 0567 437 TVmENFVAFVDK

14197 0058 530 SLHTLFGDELcK

14398 -0133 875 RHPEYAVSVLLR

14636 -0515 465 TcVADESHAGcEK

14798 0292 600 LGEYGFQNALIVR

15026 -0625 409 EYEATLEEccAK

15118 0207 597 VPQVSTPTLVEVSR

15328 -0617 617 LKEccDKPLLEK

15547 -0823 441 DDPHAcYSTVFDK

15677 -0085 437 DAFLGSFLYEYSR

15768 -0985 456 LKPDPNTLcDEFK

16399 -0067 875 KVPQVSTPTLVEVSR

16678 0064 437 MPCTEDYLSLILNR

16738 -1723 550 QEPERNEcFLSHK

17248 0064 437 MPcTEDYLSLILNR

17408 0064 437 mPcTEDYLSLILNR

17477 -0914 414 YNGVFQEccQAEDK

17497 -0621 410 EccHGDLLEcADDR

18809 -0537 606 RPcFSALTPDETYVPK

18890 -0567 674 HPYFYAPELLYYANK

19019 -1275 466 NEcFLSHKDDSPDLPK

19079 0044 454 LFTFHADIcTLPDTEK

20450 -0812 839 RHPYFYAPELLYYANK

21139 -0682 480 VHKEccHGDLLEcADDR

22479 -0458 423 EccHGDLLEcADDRADLAK

Total 39 2 2 6 8 15 15 27

205

mass matching with no tandem mass spectrometry performed Lower case amino acids indicate

a modification present in the peptide of carbamidomethyl (c) or oxidation (m)

206

Chapter 8

Conclusions and Future Directions

207

Summary

Comparative shotgun proteomics investigating numerous biological changes in various

species and sample media and peptidomic method development have been reported The

developed comparative shotgun proteomics based on label-free spectral counting with nanoLC

MSMS platform have been employed in mouse CSF rat CSF yeast culture and other biological

specimens Mass spectrometry (MS)-based peptidomic enhancements using ETD and improved

sample preparation methods for molecular weight cut-offs have been reported Together these

studies add to the Li Labs capabilities in comparative shotgun proteomics and expand available

methods for peptidomic research

Immunodepletion of CSF for comparative proteomics

Chapters 3 and 4 used similar methods to generate a list of differentially expressed

proteins providing insight into how Alexander disease (AxD) perturbs the proteome and how the

new prion model rat adapted scrapie (RAS) proteome varies from control rats In the GFAP

overexpressor mouse CSF study in Chapter 3 we have produced a panel of proteins with

significant up or down regulation in the CSF of transgenic GFAP overexpressor mice MS-based

proteomic study of this mouse model for AxD was consistent with the previous studies showing

elevation of GFAP in CSF The development of a modified IgY-14 immunodepletion technique

for low amounts of CSF with recommendations for future antibody depletion techniques to deal

with the unique challenges of mouse CSF was presented Modified proteomics protocols were

employed to profile mouse CSF with biological and technical triplicates addressing the

variability and providing quantitation with dNSAF spectral counting Validation of the data was

performed using both ELISA and RNA microarray data to provide corroboration with the

208

changes in the putative biomarkers This work presents numerous interesting targets for future

study in AxD including CK-MCK-B cathepsin S L1 and B isoforms and contactin-1

Using the protocol developed in Chapter 3 we performed a similar study in RAS CSF

compared to control rat CSF Two differences in sample preparation for the rat CSF compared

to the mouse CSF are noted (1) rat IgY-7 immunodepletion was performed and (2) each rat

CSF sample was collected from one animal due to sufficient volume instead of pooling from

multiple animals for the mouse CSF sample After immunodepletion the CSF samples from

control and RAS (biological N=5 technical replicates N=3) were digested and separated using

one dimensional RP nanoLC separation We were able to identify 167 proteins with redundant

isoforms removed which was derived from total 512 proteins with a 1 FDR from all the CSF

samples Comparative analysis was performed using dNSAF spectral counting and 21 proteins

were significantly changed Our data were consistent with previous prion CSF studies showing

14-3-3 protein increased in CSF of prion infected animals Genomic analysis was also

performed and was used to cross-validate our proteomic data and numerous proteins were found

to be consistent including a novel marker related to prion disease RNAset2 (ribonuclease T2)

In summary this work provides a foundation for investigation of the perturbed proteome of a

new prion model RAS

Yeast Phosphoproteomic Comparison between Starved and Glucose Fed Conditions

This work presented a qualitative comparison of the phosphoproteome between starved

and glucose fed Saccharomyces cerevisiae (bakers yeast) A large scale IMAC enrichment of

yeast cell lysate followed by 2-D separations provided a rich source of phosphopeptides for CID

MSMS and neutral loss triggered ETD analysis using mass spectrometry A specific motif for

PKA was highlighted to show the differences in proteins identified between starved and glucose

209

fed conditions In total 477 unique phosphopeptides were identified with 06 FDR and 669

unique phosphopeptides identified with 54 FDR Phosphosite validation was performed using

a localization algorithm Ascore to provide further confidence on the site-specific

characterization of these phosphopeptides Two proteins Ssd1 and PMA2 emerged as potential

intriguing targets for more in-depth studies on protein phosphorylation involved in glucose

response

Methods for Peptide Sample Preparation and Sequencing

In this study ETD was performed to improve the sequence coverage of endogenous large

neuropeptides and labile tyrosine sulfation CCK-like neuropeptides found in the blue crab

Callinectes sapidus The intact CPRP from Callinectes sapidus was identified and characterized

with 68 sequence coverage by MS for the first time Cation assisted ETD fragmentation using

MgCl2 was also shown to be a powerful tool in de novo sequencing of sulfated neuropeptides

These endeavors into using ETD for certain neuropeptides will assist in future analysis of large

neuropeptides and PTM containing neuropeptides

In addition to ETD sequencing I presented a protocol on improving recovery of minute

quantities of peptides by adding methanol and a salt modifier (NaCl) to molecular weight cut-off

membrane-based centrifugal filters (MWCO) to enrich sub-microgram peptide quantities

Despite its widespread utility significant sample loss often occurs during the MWCO

fractionation step which is particularly problematic when analyzing low-abundance peptides

from limited starting material This work presented a method to reduce sample loss during

MWCO separations specifically for sub-microg peptide isolation Using a neuropeptide standard

bradykinin sample loss was reduced by over two orders of magnitude with and without

210

undigested protein present The presence of bovine serum albumin (BSA) undigested protein

and the bradykinin standard lends evidence that sample loss occurs because of the MWCO and

not the presence of the protein Additionally a BSA tryptic digestion is presented where twenty-

seven tryptic peptides are identified from MALDI mass spectra after enriching with methanol

while only two tryptic peptides are identified after the standard MWCO protocol

Ongoing Projects and Future Directions

CSF Projects

Rat Adapted Scrapie and Time Course Study of Rat CSF

In ongoing experiments from the work described in Chapter 4 related to rat adapted

scrapie (RAS) we are working towards more complete proteome coverage of CSF and a time

course study of RAS After the promising results of the 1-D proteomic comparison between

RAS and control CSF we want to perform IgY immunodepletion on 1 mL of rat CSF followed

by 2-D high pH-low pH RP-RP LC-MSMS IgY immunodepletion is still required and

afterwards approximately 40 microg of low abundance protein would remain Following traditional

urea tryptic digestion (see Appendix 1) and solid phase extraction (SPE) clean-up the sample

would be dried down and reconstituted in mobile phase A (25 mM ammonium formate basic

pH) for high pH RP separation Fractions would be dried and subjected to MS analysis similar to

the work described in Chapter 4 The purpose of this work would be to increase the proteome

coverage from a few hundred to a thousand CSF proteins identified A time course study of RAS

is also desirable to gain insight into disease progression Rats at different stages will be

sacrificed and CSF will be collected The goal is to perform isobaric labeling between the time

courses with spectral counting being an alternative for relative protein expression We will use

the targets identified in Chapter 4 to track certain proteins for time course analysis Overall

211

these future projects will dig deeper into the proteome and how this novel prion model RAS

perturbs the proteins expressed in rats over time

Glycoproteomics and Peptidomic Analysis of CSF Samples from Individuals with

Alzheimerrsquos Disease

Alzheimerrsquos disease is an incurable progressive neurodegenerative disorder which results

in cognitive impairment Our aim is to provide additional biomarkers andor targets for drug

treatment Currently we used 500 microL of human CSF for non-membrane glycoprotein

enrichment (see Appendix 1) followed by 2-D high-pH-low-pH RP-RP amaZon ion trap LC-

MSMS analysis The initial work was a failure due to low amount of signal and significant

sample loss (data not shown) Through a comparison with Dr Xin Weirsquos PhD thesis we

estimated roughly 20 microg of proteins was obtained after glycoprotein enrichment and 1-D analysis

produced over 100 protein identifications (data not shown) but the additional off-line 2-D

separation and sample clean up resulted in low number of protein identifications for each fraction

analyzed by MS (data not shown) We have recently obtained 1 mL of human CSF samples

from healthy controls and individuals suffering from Alzheimerrsquos disease and plan to perform

the same experiments with double the starting amount and reduce the fractions collected from 2-

D separation from 11 to 6 In addition to the glycoproteomics anything not enriched will be

subjected to MWCO separation using the method presented in Chapter 7 The obtained peptide

sample will be subjected to Q-TOF nanoLC MSE data acquisition followed by de novo

sequencing using various programs including PEAKS and Mascot Collectively we feel this

project has great potential to lead to interesting targets and further expand the proteomic

knowledge of Alzheimerrsquos disease

GFAP Knock-in Mouse CSF

212

In a direct follow-up to Chapter 3 we aim to perform comparative proteomics on control

vs glial fibrillary acidic protein (GFAP) knock-in mouse CSF samples The sample preparation

protocol as presented in Chapter 3 will be used For quantitative proteomics we plan on

performing isobaric labeling followed by performing high energy collision induced dissociation

(HCD) on the Orbitrap Elite The MS acquisition will also perform a top ten scan where the top

ten MS peaks will be selected for tandem MS analysis In addition targeted quantitation of

specific proteins using multiple reaction monitoring (MRM) can be performed on potential

biomarkers and identified in this follow-up study and the work presented in Chapter 3 Also any

CSF samples with noticeable blood content cannot be used for the exploratory proteomics

experiments but can potentially be used for the MRM analysis and should be kept for such

experiments in the future

Large Scale Proteomics

Proteomics of Mouse Amniotic Fluid for Lung Maturation

The overall goal of this project is to determine what proteins are present in amniotic fluid

when the embryo is 175 weeks compared to amniotic fluid at 155 weeks The rationale behind

why these two time points matter was investigated through a lung explant culture where amniotic

fluid was added at 155 weeks and 175 weeks Visual maturation of lungs occurred with the

175 week amniotic fluid and lung maturation genes were up regulated in the mRNA in the lung

explant culture when compared to the 155 week amniotic fluid The compound which is

causing the maturation of the lungs is unknown and search for a secreted protein might provide a

clue to this process In order to test this hypothesis we carried out discovery proteomics

experiments The workflow involves tryptic digestion SPE and 1-D low pH nanoLC separation

coupled to an amaZon ETD ion trap for tandem MS analysis The resulting mass spectrometric

213

acquisition identified less than 100 proteins with a 1 FDR (data not published) To address the

lack of depth in the proteome coverage we purchased an IgY immunodepletion column to

remove the most abundant proteins in amniotic fluid similarly to Chapter 3 and 4 but on a larger

scale One abundant protein alpha-fetoprotein is present in amniotic fluid but absent or present

in very low concentration in CSF serum or plasma Alpha-fetoprotein is similar to albumin and

thus we ran amniotic fluid on an IgY immunodepletion column and observed significant

reduction in alpha-fetoprotein amount (data not published) After immunodepletion 2-D high

pH-low pH RP-RP will be performed followed by nanoLCMSMS on the amaZon ETD ion trap

We will perform mass spectrometry experiments using the 155 week amniotic fluid and 175

week amniotic fluid which have average protein contents ~2 mgmL We anticipate a minimum

of tenfold increase in the proteome coverage of amniotic fluid and provide meaningful

hypothesis driven biological experiments from this work

Phosphoproteomics of JNK Activation

c-Jun N-terminal kinase (JNK) is an important cellular mediator of stress activated

signaling Under conditions of oxidative stress JNK is activated resulting in the downstream

phosphorylation of a large number of proteins including c-Jun However the cellular response

to JNK activation is extremely complex and JNK activation can result in extremely different

physiological outcomes For example JNK is at the crossroads of cellular death and survival

and exhibits both pro-death and pro-survival activities These highly disparate outcomes of JNK

activation are highly contextual and depend on the type of stressor and duration of stress In the

brain JNK has been shown to be activated in models of Alzheimerrsquos disease Parkinsonrsquos

disease amyotrophic lateral sclerosis and stroke It is thought that JNK is activated in these

diseases as a result of oxidative stress and it is unknown whether this activation is pro-survival or

214

pro-death To elucidate JNK signaling in the brain we chose to analyze primary cortical

astrocytes under conditions of oxidative stress Since hydrogen peroxide is a physiologically

relevant oxidant in the brain and known to activate JNK we chose to use this to treat astrocytes

and then analyze changes to the phosphoproteome by mass spectrometry By doing this we

hope to elucidate important JNK-dependent targets involved in stress signaling in the brain and

that identifying these targets could lead to the identification of novel disease mechanisms and

potentially new therapeutic targets for neurodegeneration Specifically we plan on performing

stable isotope labeling by amino acids (SILAC) of astrocytes for control hydrogen peroxide

treated and hydrogen peroxide treated with JNK inhibitor The SILAC labeled astrocyte cell

lysate will be digested enriched using Fe-NTA IMAC and separated using 2-D high pH-low pH

RP-RP and infused into the amaZon ETD ion trap The protocol is similar to the yeast

comparative phosphoproteomics from Chapter 5 After analysis we plan on processing the data

using ProteoIQ to identify phosphoproteins with significant changes

Immunoprecipitation Followed by Mass Spectrometry

Stb3 Mass Spectrometry Analysis

Stb3 (Sin3-binding protein) has previously been shown to change location depending on

the presence or absence of glucose (Heideman Lab Dr Michael Conways Thesis) An

immunoprecipitation of Stb3 was performed in parallel to the work described in Chapter 5 Two

separate experiments were performed one with wild type Stb3 and another tagged with myc for

improved protein immunoprecipitation Myc tagging is a polypeptide tag which can be

recognized by an antibody and provide a better enrichment compared to using a Stb3 antibody

alone The myc tagging was done because of the low abundance of Stb3 and the limited amount

of protein that was pulled down by the antibody for Stb3 Immunopreciptation experiments were

215

performed for both starved and glucose fed samples All samples were tryptically digested

followed by low pH RP nanoLCMSMS Both CID and ETD were used for fragmentation

analysis of each sample (n=2) The Stb3 wild type had two peptide hits in one MS run which is

not surprising due to low Stb3 antibody affinity In contrast a significant amount of Stb3 was

pulled down from Myc tagged starved and glucose fed samples For the glucose starved

samples 22 unique peptides from Stb3 were identified whereas glucose fed samples yielded 21

unique Stb3 peptides identified (unpublished data) The identified Stb3 in the myc samples

allowed us to investigate what other proteins were pulled down that are not present in the wild

type samples

From previous work by our collaborator Dr Heideman it had been suggested that Stb3

forms a complex with Sin3 and Rpd3 Our proteomic data shows multiple significant peptide

hits for Sin3 and Rpd3 in myc tagged but not in wild type where Stb3 is only observed once

with a low Mascot score When looking at the unique proteins identified in myc tagged glucose

fed sample but not included in the wild type samples the myc fed sample contained eight unique

ribosomal proteins with numerous unique peptides for each The ribosomal proteins identified in

myc fed sample supports the novel hypothesis that in the glucose fed state the complex of Stb3

Rpd3 and Sin3 interact with ribosomal proteins Other proteins unique to myc tagged glucose

starved samples are glyceraldehyde-3-phosphate dehydrogenase 2 and transcriptional regulatory

protein UME6 Also three proteins were observed in both myc fed and starved but not in the

wild type samples including transposon Ty1-DR1 Gag-Pol polyprotein(YD11B) SWIRM

domain-containing protein (FUN19) and RNA polymerase II degradation factor 1 (DEF1) Our

proteomics data for the Stb3 immunoprecipitation experiments with starved vs glucose fed

216

samples provide exciting evidence to support previous observation made by the Heideman group

and highlight the utility of MS-based approach to deciphering protein-protein interactions

Conclusions

The majority of the work described in this dissertation revolves around sample

preparation for proteomics and peptidomics with a focus on generating biologically testable

hypotheses In the area of neuropeptides CPRP was fully sequenced analytical methods were

transformed for use in an ETD ion trap and sub-microg peptides can now be analyzed by mass

spectrometry after MWCO separation In the field of comparative proteomics comparisons

between mouse CSF in GFAP overexpressors and control or rat adapted scrapie model and

control CSF yeast fed vs glucose starved cell extract have all been presented Collectively this

thesis has developed new techniques for neuropeptide sample preparation and presented

numerous comparative proteomic analyses of various biological samples and how the proteomes

are dynamically perturbed by various treatments and disease conditions

217

Appendix 1

Protocols for sample preparation for mass spectrometry based

proteomics and peptidomics

218

Small Scale Urea Tryptic Digestion (below 20 microg)

1 Concentratedilute sample to 10 μL starting volume

2 All solutions are in a 50mM NH4HCO3 buffer (395mgmL)

3 Add 1 μL of 05M Dithiothreitol (DTT) (77mg100μL) and 8 μL of urea solution

(400mg05mL) to the sample

4 Place sample in 37degC water bath for 45 minutes

5 Add 27μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

6 Quench reaction with 1μL of DTT solution and let sit for 10 minutes

7 Dilute with 70μL of NH4HCO3 solution

8 Use 1μL of trypsin (05μgμL)

9 Allow to digest overnight in 37degC water bath

10 Acidify with 10μL 10 formic acid

11 Perform solid phase extraction using tips dependent of sample amount

a Sub-5μg amounts ndash Millipore Ziptips

b 5-75μg amounts ndash Bond-Elut tips from Agilent (OMIX tips)

12 Dry down in Speedvac as needed for experiment

219

Small Scale ProteaseMAX Tryptic Digestion (below 20 microg)

1 Concentratedilute sample to 10 μL starting volume

2 All solutions are in a 50mM NH4HCO3 buffer (395mgmL)

3 Add 1 μL of 05M Dithiothreitol (DTT) (77mg100μL) and 2 μL of 1 of

ProtesaeMAX (Promega) to the sample

4 Place sample in 37degC water bath for 45 minutes

5 Add 27μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

6 Quench reaction with 1μL of DTT solution and let sit for 10 minutes

7 Dilute with 70μL of NH4HCO3 solution

8 Use 1μL of trypsin (05μgμL)

9 Add 1 μL ProteaseMAX and let sit for 3-4 hours

10 Acidify with 2μL 10 formic acid

11 Dry down in Speedvac as needed for experiment

220

Large Scale Urea Tryptic Digestion (mg of proteins)

1 All solutions are in a 50 mM NH4HCO3 buffer (395 mgmL)

2 Add 20μL of 05M Dithiothreitol (DTT) (77mg100μL) and 160μL of urea solution

(400mg05mL) to sample

3 Allow sample to denature 45-60 minutes in a 37degC water bath

4 Add 54μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

5 Quench reaction with 20μL of DTT solution

6 Dilute with 14mL of NH4HCO3 solution

7 Add 100μg of trypsin

8 Allow to digest overnight in 37degC water bath

9 Acidify sample with 100μL of 10 formic acid

10 Perform solid phase extraction with Hypersep C18 Cartridges with 100 mg of C18

bead volume (Thermo)

11 Dry down with the Speedvac as needed for experiment

221

Fe-NTA Preparation from Ni-NTA Beads

1 Centrifuge (5 min at 140 rcf) and use a magnet to keep beads in place as supernatant

is removed

2 Wash 3 times with 800μL of H2O (using same technique of centrifuging and using

magnet to keep beads in places as supernatant is removed)

3 Add 800μL of 100mM EDTA (372mgmL) in a 50mM NH4HCO3 (395mgmL)

buffered solution (pH around 8) and vortex for 30 minutes to chelate the Ni

centrifuge and remove supernatant

4 Wash 3 times with 800μL of H2O

5 Add 800μL of 10mM FeCl3 hexahydrate (27mgmL) and vortex for 30 minutes to

bind Fe to beads centrifuge and remove supernatant

6 Wash 3 times with 800μL H2O

7 Resuspend in a storage solution of 111 ACNMeOH001 Acetic Acid (1mL total)

222

Fe-NTA IMAC Phospho-enrichment

1 Use wash solution (80ACN 01Formic Acid) to rinse beads vortex 1 minute

centrifuge and remove supernatant

2 Add sample (around 500μL in 80ACN 01Formic Acid) and vortex 40 minutes to

allow sample to bind dispose of supernatant after centrifuging

3 Wash 3 times with 200μL of wash solution discard supernatant

4 Add 200μL of elution solution (5050 ACN5Ammonium Hydroxide) vortex 15

minutes and save supernatant

5 Add 200μL of elution solution vortex 10 minutes and save supernatant

6 Wash 2 time with wash solution (collect supernatant of first wash)

7 If reusing store in 1mL solution of 111 ACNMeOH001 Acetic Acid

223

High pH Off-line Separation

1) In general a minimum of 20 microg of peptides are needed to gain any benefit

from off-line 2D fractionation It is better to inject 100 microg of peptides on

column

2) Use a Gemini column or a similar column that can handle pH=10 and for this

protocol a 21 mm x 150 mm column was used

3) Prepare ldquobuffer Ardquo 25 mM NH4 formate pH=10 Also prepare ldquobuffer Brdquo

4) Dry down desired sample and reconstitute in buffer A

5) Inject based off sample loop (current Alliance HPLC has a 50 microL sample

loop)

6) Run gradient at bottom of the page collecting fractions every 3 minutes except

for the 1st minute which is the void volume

7) Optional If you want to reduce instrument time you can combine fractions 1

with 12 2 with 13 etc until 11 with 22

Time Mobile phase A Mobile phase B Flow Rate

05mlmin

0 98 2 05 mLmin

65rsquo 70 30 05 mLmin

65rsquo1rdquo 5 95 05 mLmin

70 5 95 05 mLmin

224

Non Membrane Glycoprotein Enrichment

1 For protocols involving membrane glycoprotein enrichment see Dr Xin Weirsquos

thesis

2 Add 75 microL of Wheat germ agglutinin (WGA) bound to agarose beads and 150 microL

of Concanavalin A (ConA) to a Handee spin cup (Thermo) Wash 2 times with

lectin affinity chromatography (LAC) binding buffer (015 M NaCl 002 M Tris-

HCl 1 mM CaCl2 pH=74) centrifuging at 400 g for 30 seconds

3 Place stopper on bottom of spin cup and add 200 microg of sample (500 microL for CSF)

Bring up to 300 microL using lectin LAC binding buffer

4 Incubate for 1 hour with continuous mixing at room temperature

5 Centrifuge at 400 g for 30 seconds

6 Perform two more 300 microL LAC binding washes followed by centrifugation

7 Add 300 microL LAC eluting buffer (0075 M NaCl 001 M Tris-HCl 02 M alpha-

methyl mannoside 02 M alpha-methyl glucoside and 05 M acetyl-D-

glucosamine) vortex for 10 minutes (have stopper in place while vortexing)

centrifuge and collect

7 Add another 300 microL LAC eluting buffer centrifuge and collect

225

MWCO separation for Sub-microg peptide concentrations

1 Wash molecular weight cut-off (MWCO) with 5050 waterMeOH centrifuge at

14000 g for 5 minutes for 10 kDa filter (time will vary Millipore Amicon Ultra

filters)

2 Wash with 100 water centrifuge at 14000 g for 5 minutes

3 Add methanol to the sample to get the concentration to 30 methanol and add

salt to achieve a roughly 05 M NaCl before adding the sample to the MWCO

4 Centrifuge at 14000 for 10 minutes collect flow through

226

Immunoprecipitation

Modified from Thermo Fisher Scientific Classic IP Kit

1 10 microL Polyclonal antibody added to 1 microg protein standard in a Handee spin cup

(Thermo) diluted with 1x TBS buffer to 400 microL and incubated overnight at on

shakerend-over-end rotator

2 Wash Protein GA beads with 2x 200microL TBS buffer and added to the

antibodysample for 15 hours at 4oC

3 Centrifuge at 400 g for 30 seconds and discard flow through

4 Wash 3x with 100microL TBS buffer centrifuge at 400 g for 30 seconds and discard

flow through

5 Wash with 1x conditioning solution (neutral pH buffer) centrifuge at 400 g for 30

seconds and discard flow through

6 Eluted with elution buffer (2x100microL) centrifuge at 400 g for 30 seconds and

collect flow through

227

C18 Solid Phase Extraction (SPE)

1 Determine amount of peptides for SPE ge5 microg use Ziptips from Millipore If

between 5-75 microg use OMIX from Agilent 100 microL version and if ge75 microg use SPE

cartridges such as 100 mg Hypersep from Thermo

2 Ensure no detergents are in the sample and it is acidified

3 The three SPE procedures all use the same sets of solutions only volumes vary

4 3x Wetting solution (100 ACN) for 60 microL OMIX (20 microL ziptip and 15 mL for

100 mg cartridge)

5 3x Wash solution (01 formic acid in 100 H2O) same volumes as 4

6 Draw up the sample about 10 times minimum (generally 30-40 is preferred)

without letting the bead volume dry out

7 1X Wash solution same volumes as 4

8 Back load the eluting solution 01 formic acid in 5050 H2OACN into the

Ziptips (15 microL) or OMIX (50 microL) tips For the SPE cartridge add 700 microL of

eluting solution

9 Dry down and prepare for next step in sample preparation

228

Laser Puller Programs and Protocols

1) Obtain about two feet of Fused-silica capillary with 75 μm id and 360 μm od

2) Wash with methanol and then air dry using the bomb

3) Cut into one foot or desired length

4) Use a lighter to burn the middle portion (about an inch in length) of the tubing

5) Remove the ash by rinsing with methanol and rubbing with a Kimwipe

6) Make sure the laser puller has been on for at least 30 minutes before use to allow

for the instrument to warm up

7) Place capillary in instrument with the burnedexposed portion in the center

making sure that the length of the capillary is pulled taut

8) Enter desired program (next page) and press pull

229

Laser Puller Programs

Program 99 (default lab program)

Heat Filament Velocity Delay Pull

250 0 25 150 15

240 0 25 150 15

235 0 25 150 15

245 0 25 150 15

Program 97 (developed for larger inner diameter tips)

Heat Filament Velocity Delay Pull

230 - 25 150 -

220 - 25 150 -

215 - 25 150 8

230

On column Immunodepletion (serum plasma CSF amniotic fluid)

1) Prepare buffer A ldquoDilution Bufferrdquo 10 mM Tris-HCl pH=74 150 mM NaCl

2) Prepare buffer B ldquoStripping Bufferrdquo 01 M Glycine-HCl pH=25

3) Prepare buffer C ldquoNeutralization Bufferrdquo 100 mM Tris-HCl pH=80

4) Determine loading amount (LC1=~1000 ug of serum or plasma less for CSF due

to the increased amount of albumin percentage in CSF)

5) Add Dilution buffer to sample before injection and ensure the sample is proper

pH (~7)

6) Use gradient below

Time A B C Flow Rate

(mLmin)

0rsquo 100 0 0 02

4rsquo59rdquo 100 0 0 02

5rsquo 100 0 0 05

8rsquo59rdquo 100 0 0 05

9rsquo 0 100 0 05

22rsquo 0 100 0 05

22rsquo1rdquo 0 0 100 05

39rsquo 0 0 100 05

7) Store the column in 1x dilution buffer until next use

231

Small Scale Immunodepletion (100 microL of CSF)

1) Use 75 microg (per 100 microL CSF) of Sigma Seppro IgY-14 or IgY-7 bead slurry

2) Add an equal volume of 2x dilution buffer (20 mM Tris-HCl pH=74 300 mM

NaCl) to the starting amount of CSF

3) Add to a spin cup with a filter and allow to mix for 30 minutes

4) Centrifuge at 400 g for 30 seconds and collect the flow through

5) Add 50 microL of 1x dilution buffer mix for 5 minutes centrifuge at 400 g for 30

seconds and collect the flow through

6) Use 1x stripping washes (01 M Glycine-HCl pH=25) centrifuge as before and

discard Repeat four times

7) Use 1x neutralization buffer (100 mM Tris-HCl pH=80) centrifuge as before

and discard Repeat two times

8) Store the beads in the spin column in 1x dilution buffer until next use

232

Alliance Maintenance Protocol

Seal Wash

10 methanol no acetonitrile This wash cleans behind the pump-head seals to

ensure proper lubrication Minimum once per week

1 On instrument interface navigate MenuStatus gt Diag gt Prime SealWsh gt Start

2 Press Stop after 5 minutes

Prime Injector

10 methanol for maintenance high organic solvent for dirty runs (eg 95

acetonitrile) Done before injecting any real samples to ensure no bubbles are

present in the injector Minimum once per week

1 On instrument interface navigate MenuStatus gt Diag gt Prime NdlWsh gt Start

2 After completion press Close

Purge Injector

Solvent is dependent on run Run this protocol at beginning of experiments each day

Minimum once per week for maintenance

1 On instrument interface navigate MenuStatus gt Status screen

2 Navigate Direct Function gt 4 Purge Injector gt OK

3 Set Sample loop volumes 60 leave Compression Check unchecked gt OK

Prime Solvent Pumps

Solvent is dependent on run If solvents are changed run this protocol Minimum

once per week for maintenance

1 On instrument interface navigate MenuStatus gt Status screen

2 Using arrow keys choose composition A type 100 Enter x4

3 Navigate Direct Function gt 3 Wet Prime gt OK

4 Set Flow Rate 7000 mLmin Time 100 min gt OK

5 Repeat for all changedactive solvent pumps

Condition Column

Dependent on user Use starting conditions for run

1 On instrument interface navigate MenuStatus gt Status screen

2 Using arrow keys type starting solvent compositions for run

3 Navigate Direct Function gt 6 Condition Column gt OK

4 Set Time as desired

233

Appendix 2

List of Publications and Presentations

234

PUBLICATIONS

ldquoDiscovery and characterization of the crustacean hyperglycemic hormone precursor related

peptides (CPRP) and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes

sapidus using multiple tandem mass spectrometry techniquesrdquo Hui L Cunningham R Zhang

Z Cao W Jia C Li L J Proteome Res 2011 10(9) 4219-29

ldquoInvestigation and reduction of sub-microgram peptide loss using molecular weight cut-off

fractionation prior to mass spectrometric analysisrdquo Cunningham R Wang J Wellner D Li L

Journal of Mass Spectrometry In Press

ldquoMass spectrometry-based proteomics and peptidomics for systems biology and biomarker

discoveryrdquo Cunningham R Ma D Li L Frontiers in Biology 2012 7(4) 313-35

ldquoProtein changes in immunodepleted cerebrospinal fluid from transgenic mouse models of

Alexander disease detected using mass spectrometryrdquo Cunningham R Jany P Messing A Li

L Journal of Proteome Research Submitted

ldquoInvestigation of the differences in the phosphoproteome between starved vs glucose fed

Saccharomyces cerevisiaerdquo Cunningham R Grunwald D Wellner D Conway M Heideman

W Li L In preparation

ldquoIdentification of astrocytic JNK targets by phosphoproteomics using mass spectrometryrdquo

Cunningham R Dowell J Wang J Wellner D Li L Milhelm M In preparation

ldquoGenomic and proteomic profiling of rat adapted scrapierdquo Herbst A Cunningham R Wellner

D Wang J Ma D Li L Aiken J In preparation

235

INVITED SEMINARS AND CONFERENCE PRESENTATIONS

Robert Cunningham Davenne Mayour and Shoshanna Coon ldquoMorphology and Thermal

Stability of Monolayers on Porous Siliconrdquo The 231th

ACS National Meeting 2006 Atlanta

GA

Robert Cunningham Xin Wei Paige Jany Albee Messing and Lingjun Li ldquoMass

Spectrometry-Based Analysis of Cerebrospinal Fluid Peptidome and Proteome for Biomarker

Discovery in Alexander Diseaserdquo The 57th

ASMS Conference 2009 Philadelphia PA

Robert Cunningham ldquoWhat to Expect in a PhD Graduate Schoolrdquo Invited seminar University

of Northern Iowa 2010 Cedar Falls IA

Robert Cunningham Dustin Frost Albee Messing and Lingjun Li ldquoInvestigation of an

Optimized ProteaseMAX Assisted Trypsin Digestion of Human CSF for Pseudo-SRM

Quantification of GFAP and Identification of Biomarkersrdquo The 58th

ASMS Conference 2010

Salt Lake City UT

Robert Cunningham Paige Jany Albee Messing and Lingjun Li ldquoMass Spectrometry-Based

Analysis of GFAP Overexpressor Micersquos Cerebrospinal Fluid for Protein Biomarker Discovery

in Alexander Diseaserdquo Oral presentation at Pittcon Conference and Expo 2011 2011 Atlanta

GA

Robert Cunningham Michael Conway Daniel Wellner Douglas Grunwald Warren

Heideman and Lingjun Li ldquoDevelopment of optimized phosphopeptide enrichment methods for

comparison of starved and glucose fed yeast Saccharomyces cerevisiaerdquo The 59th

ASMS

Conference 2011 Denver CO

Robert Cunningham Paige Jany Albee Messing and Lingjun Li ldquoMass Spectrometry-Based

Analysis of GFAP Overexpressor Micersquos Cerebrospinal Fluid for Protein Biomarker Discovery

in Alexander Diseaserdquo 2011 Wisconsin Human Proteomics Symposium 2011 Madison WI

Page 2: Mass Spectrometry Applications for Comparative Proteomics

i

Acknowledgements

I would like to acknowledge the support and guidance from professors colleagues

and friends at the University of Wisconsin-Madison who are indispensable to this thesis

First I would like to express my deep gratitude to my advisor Prof Lingjun Li for

allowing me the freedom to chase scientific endeavors all while offering her constant

guidance assistance and support through my PhD study Her constant energy and

enthusiasm in research have led by example in performing research and inspired me to

make the most of the time given to me Dr Li encouraged me to take on challenging

projects apply for awards travel and present my research to the larger scientific

community None of my work would be achieved without her and I want to thank Dr Li

for her support during these years

I would also like to thank the members of my committee Dr Lingjun Li Dr

Albee Messing Dr Lloyd Smith Dr Warren Heideman and Dr Tim Bugni I truly

appreciate the willingness of these professors to take time out of their busy schedules to

serve as members of my committee

I have benefited greatly from previous members of the Li Lab In particular I

would like to thank Dr James Dowell Dr Xin Wei Dr Robert Sturm and Dr Limei

Hui for their patient and valuable suggestions in my research and also teaching me

valuable experimental skills how to perform general shotgun proteomics and how to use

several instruments Specifically I would like to thank Daniel Wellner who has worked

with me on numerous projects over the past 2 years and has been a constant in my

research life I also want to thank my wonderful current colleagues Jingxin Wang Tyler

ii Greer Chris Lietz Chenxi Jia Dustin Frost Di Ma Hui (Vivian) Ye Nicole Woodards

and Claire Schmerberg for their collaboration in many challenging research projects and

fruitful discussions on various research areas There are too many people to thank each

one individually but every member of the Li lab has in some way contributed to my

learning experience Beyond research work their friendship also made my life here in

Madison much more enjoyable

I would also like to thank our collaborators Dr Albee Messing Dr Warren

Heideman Dr Xin Sun and Dr James Dowell It is my great pleasure to have the

opportunities to work with these amazing people and gain precious experience I have

learned so much from them and their achievements in the field have inspired me to strive

to do the best I could

Furthermore I would like to thank Gary Girdaukas and Dr Cameron Scarlett at

School of Pharmacy for the access of the MALDI-FTMS and Bruker amaZon ion trap

instruments

In particular I wish to thank my family my Mom and Step-Dad for raising me

and my Dad for always being there for me They all supported me in my decision to

pursue science and specifically a career in chemistry I would like to thank my Sister

who grew up with me and always led by example in academics Most importantly I

would like to thank my wife Na Liu for her constant support She has inspired and

helped me finish my PhD and always encouraged me to be the best I could be To them

I dedicate this thesis

iii

Table of Contents

Page

________________________________________________________________________

Acknowledgements i

Table of Contents iii

Abstract iv

Chapter 1 Introduction brief background and research summary 1

Chapter 2 Mass spectrometry-based proteomics and peptidomics for

biomarker discovery and the current state of the field 15

Chapter 3 Protein changes in immunodepleted cerebrospinal fluid from

transgenic mouse models of Alexander disease detected

using mass spectrometry 73

Chapter 4 Genomic and proteomic profiling of rat adapted scrapie 110

Chapter 5 Investigation of the differences in the phosphoproteome

between starved vs glucose fed Saccharomyces cerevisiae 139

Chapter 6 Use of electron transfer dissociation for neuropeptide

sequencing and identification 166

Chapter 7 Investigation and reduction of sub-microgram peptide loss

using molecular weight cut-off fractionation prior to

mass spectrometric analysis 187

Chapter 8 Conclusions and future directions 206

Appendix 1 Protocols for sample preparation for mass spectrometry

based proteomics and peptidomics 217

Appendix 2 Publications and presentations 233

_______________________________________________________________________

iv

Mass Spectrometry Applications for Comparative Proteomics and

Peptidomic Discovery

Robert Stewart Cunningham

Under the supervision of Professor Lingjun Li

At the University of Wisconsin-Madison

Abstract

In this thesis multiple biological samples from various diseases models or

treatments are investigated using shotgun proteomics and improved methods are

developed to enable extended characterization and detection of neuropeptides In general

this thesis aims to expand upon the rapidly evolving field of mass spectrometry (MS)-

based proteomics and peptidomics by primarily enhancing small scale sample analysis

A review of the current status and progress in the field of biomarker discovery in

peptidomics and proteomics is presented To this rapidly expanding body of literature

our critical review offers new insights into MS-based biomarker studies investigating

numerous biological samples methods for post-translational modifications quantitative

proteomics and biomarker validation Methods are developed and presented including

immunodepletion for small volume cerebrospinal fluid (CSF) samples for comparison of

the CSF proteomes between an Alexander disease transgenic mouse model with

overexpression of the glial fibrillary acidic protein and a control animal This thesis also

covers the application of the small scale immunodepletion of CSF for comparative

proteomic analysis of a novel rat adapted scrapie (RAS) model for prion disease and

v

compares the RAS CSF proteome to control rat CSF using MS Large scale

phosphoproteomics of starved vs glucose fed yeast is presented to better understand the

phosphoproteome changes that occur during glucose feeding Method development for

neuropeptide analysis is expanded upon using electron transfer dissociation (ETD)

fragmentation to successfully sequence for the first time the crustacean hyperglycemic

hormone precursor-related peptide (CPRP) from the blue crab Callinectes sapidus In

addition a method for ETD sequencing of sulfonated neuropeptides using a magnesium

salt adduct in an ion trap mass spectrometer is reported This thesis also reports on a

method for sub-microg peptide isolation when using a molecular weight cut-off filtration

device to improve sample recovery by over 2 orders of magnitude All the protocols used

throughout the work are provided in an easy to use step-by-step format in the Appendix

Collectively this body of work extends the capabilities of mass spectrometry as a

bioanalytical tool for shotgun proteomics and expands upon methods for neuropeptide

discovery and analysis

1

Chapter 1

Introduction Brief Background and Research Summary

2

Abstract

Mass spectrometry based comparative proteomics and improved methods for

neuropeptide discovery have been reported in this thesis A very brief introduction of crustacean

neuropeptidomics is covered in this chapter followed by Chapter 2 which covers in great detail

comparative proteomics using mass spectrometry with an emphasis on biomarker discovery

Chapter 3 reports a novel immunodepletion technique used for comparative proteomics between

glial fibrillary acidic protein (GFAP) overexpressor and control mouse cerebrospinal fluid (CSF)

Chapter 4 involves rat CSF comparative proteomics between rat adapted scrapie and control

animals Chapter 5 details comparisons of phosphoproteomes in Saccharomyces cerevisiae

(Bakerrsquos yeast) between starved vs glucose fed conditions Chapter 6 investigates the use of

electron transfer dissociation (ETD) for kDa sized neuropeptides and neuropeptides with tyrosine

sulfonation Chapter 7 presents a molecular weight cut-off (MWCO) protocol to allow sub-microg

peptides to be recovered Chapter 8 provides a conclusion to this thesis and outlines future

directions for certain projects

3

Background

Mass spectrometry (MS) requires gas phase ions for experimental measurement and

intact large nonvolatile biomolecules proved difficult to be analyzed by electron ionization or

chemical ionization until the invention of two soft ionization techniques matrix-assisted laser

desorptionionization (MALDI)1 and electrospray ionization (ESI)

2 ESI and MALDI are the

two most common soft ionization techniques for mass spectrometry Once ionized molecules

such as peptides or proteins can be separated by their mass to charge ratios (mz) using various

mass analyzers manipulating electric andor magnetic fields ESI and MALDI mass

spectrometric techniques have become central analytical methods in biological sciences because

they are suitable for nonvolatile thermally labile analytes in femtomole quantities3 ESI allows

the coupling of high pressure liquid chromatography and the constant flow of solvent is

electrically charged at the outlet to produce a Taylor cone4 During desolvation the Raleigh

limit is reached and a coulombic explosion occurs commonly producing multiply charged ions

A modification of ESI is nanoLC-ESI which operates at nLmin flow rates to analyze low sample

amounts such as proteins or peptides in biological samples5 NanoLC-ESI uses less solvent as

the source because of the reduced flow rate of standard LCMS with an ESI source and nanoLC-

ESI is the predominate inlet for shotgun proteomic ESI MS experiments6 Conversely MALDI

can use sub-microL volume of sample and be co-crystallized with an ultraviolet absorbing organic

matrix that is obliterated using a laser at appropriate wavelength to generate gas phase ions

Alternatively MALDI has the unique capability to work with tissue samples and ionize in the

solid state instead of liquid like ESI

4

Mass analyzers require an operating pressure between 10-4

-10-10

Torr to allow proper ion

transfer and separationisolation of the ion based upon its mz Numerous mass analyzers are

currently available and each have their own strengths and weaknesses as shown in Figure 1 The

biomolecules are separated by the mass analyzers and detected without fragmentation which is

termed MS analysis Following MS detection tandem mass spectrometry (MSMS) of the

original precursor ion can be performed to provide additional structural information such as a

ladder sequence of amino acids for peptides Numerous fragmentation techniques are available

for proteomics such as collision induced dissociation (CID) electron transfer dissociation (ETD)

or high energy collision induced dissociation (HCD) Each of these fragmentation techniques

have their own benefits in proteomics experiments and are discussed in depth in Chapter 2 The

background and current status for comparative proteomics with specific emphasis on biomarker

analysis are covered in Chapter 2

Neuropeptidomic Method Development in the Crustacean Model System

Utilizing Mass Spectrometry

Chapter 6 and 7 focus on sample preparation and ETD fragmentation techniques to

characterize neuropeptides in the neuroendocrine organs in the crustacean nervous system

Neuropeptides are short chains of amino acids which are a diverse class of cell-cell signaling

molecules in the nervous system Neuropeptides have been investigated for being involved in

numerous physiological processes such as memory7 learning

8 depression

9 pain

10 reward

11

reproduction12

sleep-wake cycles13

homeostasis14

and feeding15-17

Figure 2 depicts how

neuropeptides are synthesized as pre-prohormones in the rough endoplasmic reticulum and

5

packaged in the Golgi apparatus After being packaged these pre-prohormones are processed

into bioactive peptides within the vesicle which is occurring during vesicular transport down an

axon and released into the synaptic cleft Secreted neuropeptides stimulate the post synaptic

neurons by interacting with G-protein coupled receptors at the chemical synapse

The crustacean model nervous system is well-defined neural network which has been

used in neurobiology and neuroendocrine (neurosecretory) research and thus lends itself for

studying neuromodulation18-22

Figure 3 shows the locations of several neuroendocrine organs in

the crustacean Callinectes sapidus including the sinus gland (SG) which is studied in Chapter 6

The Lingjun Li lab has focused on neuropeptide discovery and function in crustacean

neuroendocrine organs using mass spectrometry23-25

The work presented in Chapters 6 and 7

expand on sample preparation and analytical tools to further investigate the neuropeptidome

Research Overview

Comparative Proteomics of Biological Samples

Chapter 2 provides a thorough review of comparative proteomics and biomarker analysis

using mass spectrometry The scientific community has shown great interest in the field of mass

spectrometry-based proteomics and peptidomics for its applications in biology Proteomics

technologies have evolved to generate large datasets of proteins or peptides involved in various

biological and disease progression processes producing testable hypotheses for complex

biological questions This chapter provides an introduction and insight into relevant topics in

proteomics and peptidomics including biological material selection sample preparation

separation techniques peptide fragmentation post-translational modifications quantification

6

bioinformatics and biomarker discovery and validation In addition current literature and

remaining challenges and emerging technologies for proteomics and peptidomics are discussed

Chapter 3 investigates comparative proteomics between a GFAP overexpressor mouse

model and a control mouses cerebrospinal fluid (CSF) CSF is a low protein content biological

fluid with dynamic range spanning at least nine orders of magnitude in protein content and is in

direct contact with the brain but consist of very abundant proteins similar to serum which require

removal A modified IgY-14 immunodepletion treatment is presented to remove abundant

proteins such as albumin to enhance analysis of the low volumes of CSF that are obtainable

from mice These GFAP overexpressor mice are models for Alexander disease (AxD) a severe

leukodystrophy in humans From the CSF of control and transgenic mice we present the

identification of 266 proteins with relative quantification of 106 proteins Biological and

technical triplicates are performed to address animal variability as well as reproducibility in mass

spectrometric analysis Relative quantitation is performed using distributive normalized spectral

abundance factor (dNSAF) spectral counting analysis A panel of biomarker proteins with

significant changes in the CSF of GFAP transgenic mice are identified with validation from

ELISA and microarray data demonstrating the utility of our methodology and providing

interesting targets for future investigations on the molecular and pathological aspects of AxD

Chapter 4 explores the CSF proteomics of a novel prion model called rat adapted scrapie

(RAS) A similar IgY immunodepletion technique is presented as in Chapter 3 and is similarly

used to reduce the abundant proteins in CSF CSF from control and RAS (biological N=5

technical replicates N=3) were digested and separated using one dimensional reversed-phase

nanoLC separation In total 512 proteins 167 non-redundant protein groups and 1032 unique

peptides are identified with a 1 FDR Comparative analysis was done using dNSAF spectral

7

counting and 21 proteins were significantly up or down-regulated The proteins are compared to

the 1048 differentially regulated genes and additionally compared to previously published

proteins showing changes consistent with other prion animal models Of particular interest is

RAS specific proteins like RNAseT2 which is not up-regulated in mouse prion disease but is

designated as upregulated in both the genomic and proteomics data for RAS

Chapter 5 explores comparative proteomics between starved and glucose fed

Saccharomyces cerevisiae (bakers yeast) Yeast depends on nutrients such as glucose to

survive and need to cycle between states of growth and quiescence Previous work by the

Heideman lab investigated the transcriptional response to fresh glucose in yeast26

Kinases such

as protein kinase A (PKA) and target of rapamycin kinase (TOR) are involved in the glucose

response so we described a large scale phosphoproteomic MS based study in this chapter

Yeast cell extract was digested and phosphopeptides were enriched by immobilized metal

affinity chromatography (IMAC) followed by two dimensional high pH-low pH reversed phase

(RP)-RP separation The low pH separation was infused directly into an ion trap mass

spectrometer with neutral loss triggered ETDfragmentation Specifically the CID fragmentation

can cause a neutral loss of 98z from the H3PO4 and can produce an incomplete fragmentation

pattern and lead to ambiguous identification so when the neutral loss is detected ETD MSMS

fragmentation is performed The neutral loss triggered ETD fragmentation is included in this

study to improve phosphopeptide identifications In total 477 phosphopeptides are identified

with 06 FDR and 669 phosphopeptides identified with 54 FDR Motif analysis of PKA and

phosphosite validation are performed as well

8

The future of comparative proteomics investigating small sample amounts or PTMs is

promising Further advances in enrichment separations science mass spectrometry

analyzersdetectors and bioinformatics will continue to create more powerful tools that enable

digging deeper in proteomics with both small (mouse CSF) and large (cell extracts) sample

amounts

Methods for Neuropeptide Analysis Using ETD fragmentation and Sample

Preparation

Chapter 6 investigates the utility of ETD fragmentation to sequence endogenous large

neuropeptides and labile tyrosine sulfation in the crustacean Callinectes sapidus The sinus

gland (SG) of the crustacean is a well-defined neuroendocrine site that produces numerous

hemolymph-borne agents including the most complex class of endocrine signaling moleculesmdash

neuropeptides One such peptide is the crustacean hyperglycemic hormone (CHH) precursor-

related peptide (CPRPs) which was not previously sequenced Electron transfer dissociation

(ETD) is used for sequencing of intact CPRPs due to their large size and high charge state In

addition ETD is used to sequence a sulfated peptide cholecystokinin (CCK)-like peptide in the

lobster Homarus americanus using a salt adduct Collectively this chapter presents two

examples of the utility of ETD in sequencing neuropeptides with large molecular weight or with

labile modifications

Chapter 7 reports a protocol on improving recovery of minute quantities of peptides by

adding methanol and a salt modifier (NaCl) to molecular weight cut-off membrane-based

centrifugal filters (MWCO) to enrich sub-microgram peptide quantities In some biological

9

fluids such as CSF the endogenous peptide content is very low and using pure water to perform

the MWCO separation produces too much sample loss Using a neuropeptide standard

bradykinin sample loss is reduced over two orders of magnitude with and without undigested

protein present The presence of bovine serum albumin (BSA) undigested protein and the

bradykinin standard lends evidence that sample loss occurs because of the MWCO and not the

presence of the protein Additionally a BSA tryptic digestion is presented where twenty-seven

tryptic peptides are identified from MALDI mass spectra after enriching with methanol while

only two tryptic peptides are identified after the standard MWCO protocol The strategy

presented in Chapter 7 greatly enhances recovery from MWCO separation for sub-microg peptide

samples

10

References

1 Tanaka K Waki H Ido Y Akita S Yoshida Y Yoshida T Matsuo T Protein and polymer analyses up to mz 100 000 by laser ionization time-of-flight mass spectrometry Rapid Communications in Mass Spectrometry 1988 2 (8) 151-153

2 Fenn J B Mann M Meng C K Wong S F Whitehouse C M Electrospray ionization for mass spectrometry of large biomolecules Science 1989 246 (4926) 64-71

3 Domon B Aebersold R Mass spectrometry and protein analysis Science 2006 312 (5771) 212-7

4 Whitehouse C M Dreyer R N Yamashita M Fenn J B Electrospray interface for liquid chromatographs and mass spectrometers Anal Chem 1985 57 (3) 675-9

5 Wilm M Mann M Analytical properties of the nanoelectrospray ion source Anal Chem 1996 68 (1) 1-8

6 Karas M Bahr U Dulcks T Nano-electrospray ionization mass spectrometry addressing analytical problems beyond routine Fresenius J Anal Chem 2000 366 (6-7) 669-76

7 Gulpinar M Yegen B The physiology of learning and memory Role of peptides and stress Curr Protein Pept Sci 2004 5 (6) 457-473

8 Ogren S O Kuteeva E Elvander-Tottie E Hokfelt T Neuropeptides in learning and memory processes with focus on galanin In Eur J Pharmacol 2010 Vol 626 pp 9-17

9 Strand F Neuropeptides general characteristics and neuropharmaceutical potential in treating CNS disorders Prog Drug Res 2003 61 1-37

10 Li W Chang M Peng Y-L Gao Y-H Zhang J-N Han R-W Wang R Neuropeptide S produces antinociceptive effects at the supraspinal level in mice Regul Pept 2009 156 (1-3) 90-95

11 Wu C-H Tao P-L Huang E Y-K Distribution of neuropeptide FF (NPFF) receptors in correlation with morphine-induced reward in the rat brain Peptides 2010 31 (7) 1374-1382

12 Tsutsui K Bentley G E Kriegsfeld L J Osugi T Seong J Y Vaudry H Discovery and evolutionary history of gonadotrophin-inhibitory hormone and kisspeptin new key neuropeptides controlling reproduction J Neuroendocrinol 2010 22 (7) 716-727

13 Piper D C Upton N Smith M I Hunter A J The novel brain neuropeptide orexin-A modulates the sleep-wake cycle of rats Eur J Neurosci 2000 12 (2) 726-730

14 Tsuneki H Wada T Sasaoka T Role of orexin in the regulation of glucose homeostasis - Tsuneki - 2009 - Acta Physiologica - Wiley Online Library Acta Physiol 2010

15 Kageyama H Takenoya F Shiba K Shioda S Neuronal circuits involving ghrelin in the hypothalamus-mediated regulation of feeding Neuropeptides 2010 44 (2) 133-138

16 Sweedler J V Li L Rubakhin S S Alexeeva V Dembrow N C Dowling O Jing J Weiss K R Vilim F S Identification and characterization of the feeding circuit-activating peptides a novel neuropeptide family of aplysia J Neurosci 2002 22 (17) 7797-7808

11

17 Jensen J Regulatory peptides and control of food intake in non-mammalian vertebrates Comp Biochem Physiol Part A Mol Integr Physiol 2001 128 (3) 469-477

18 Billimoria C P Li L Marder E Profiling of neuropeptides released at the stomatogastric ganglion of the crab Cancer borealis with mass spectrometry J Neurochem 2005 95 (1) 191-199

19 Cape S S Rehm K J Ma M Marder E Li L Mass spectral comparison of the neuropeptide complement of the stomatogastric ganglion and brain in the adult and embryonic lobster Homarus americanus J Neurochem 2008 105 (3) 690-702

20 Cruz Bermuacutedez N D Fu Q Kutz Naber K K Christie A E Li L Marder E Mass

spectrometric characterization and physiological actions of GAHKNYLRFamide a novel FMRFamide‐like peptide from crabs of the genus Cancer J Neurochem 2006 97 (3) 784-799

21 Grashow R Brookings T Marder E Reliable neuromodulation from circuits with variable underlying structure Proc Natl Acad Sci USA 2009 106 (28) 11742-11746

22 Ma M Szabo T M Jia C Marder E Li L Mass spectrometric characterization and physiological actions of novel crustacean C-type allatostatins Peptides 2009 30 (9) 1660-1668

23 Chen R Hui L Cape S S Wang J Li L Comparative Neuropeptidomic Analysis of Food Intake via a Multi-faceted Mass Spectrometric Approach ACS Chem Neurosci 2010 1 (3) 204-214

24 Li L Sweedler J V Peptides in the brain mass spectrometry-based measurement approaches and challenges Annu Rev Anal Chem 2008 1 451-483

25 Ma M Wang J Chen R Li L Expanding the crustacean neuropeptidome using a multifaceted mass spectrometric approach J Proteome Res 2009 8 (5) 2426-2437

26 Slattery M G Heideman W Coordinated regulation of growth genes in Saccharomyces cerevisiae Cell Cycle 2007 6 (10) 1210-9

12

Figure 1 Capabilities and performances of certain mass analyzers Check marks indicate

availability check marks in parentheses indicate optional + ++ and +++ indicate possible or

moderate goodhigh and excellentvery high respectively Adapted with permission from

reference 3

13

Figure 2 Synthesis processing and release of neuropeptides (a) Cartoon showing two

interacting neurons (b) The synthesis of neuropeptides in the neuronal organelles and their

transport down the axon to the presynaptic terminal is depicted (c) Shows neuropeptide release

and interaction with the dendrite of the post-synaptic cell (Adapted from PhD thesis by Dr

Stephanie Cape)

14

Figure 3 Schematic showing location of the neuronal tissues being analyzed in the MS studies

of neuropeptides in Callinectes sapidus The SGs (sinus glands located in the eyestalks of the

crab) and the POs (pericardial organs located in the chamber surrounding the heart) release

neurohormones into the hemolymph (crab circulatory fluid) The brain is near the STNS

(stomatogastric nervous system neural network that controls the motion of the gut and foregut)

which has direct connections to the STG (stomatogastric ganglion) The STG is located in an

artery so it is continually in contact with hemolymph (Adapted from PhD thesis by Dr Robert

Sturm)

15

Chapter 2

Mass Spectrometry-based Proteomics and Peptidomics for Biomarker

Discovery and the Current State of the Field

Adapted from ldquoMass spectrometry-based proteomics and peptidomics for systems biology and

biomarker discoveryrdquo Cunningham R Ma D Li L Frontiers in Biology 2012 7(4) 313-35

16

Abstract

The scientific community has shown great interest in the field of mass spectrometry-based

proteomics and peptidomics for its applications in biology Proteomics technologies have

evolved to produce large datasets of proteins or peptides involved in various biological and

disease progression processes producing testable hypothesis for complex biological questions

This review provides an introduction and insight to relevant topics in proteomics and

peptidomics including biological material selection sample preparation separation techniques

peptide fragmentation post-translation modifications quantification bioinformatics and

biomarker discovery and validation In addition current literature and remaining challenges and

emerging technologies for proteomics and peptidomics are presented

17

Introduction

The field of proteomics has seen a huge expansion in the last two decades Multiple factors have

contributed to the rapid expansion of this field including the ever evolving mass spectrometry

instrumentation new sample preparation methods genomic sequencing of numerous model

organisms allowing database searching of proteomes improved quantitation capabilities and

availability of bioinformatic tools The ability to investigate the proteomes of numerous

biological samples and the ability to generate future hypothesis driven experiments makes

proteomics and biomarker studies exceedingly popular in biological studies today In addition

the advances in post-translational modification (PTM) analysis and quantification ability further

enhance the utility of mass spectrometry (MS)-based proteomics A subset of proteomics

research is devoted to profiling and quantifying neurologically related proteins and endogenous

peptides which has progressed rapidly in the past decade This review provides a general

overview as outlined in Figure 1 of proteomics technology including methodological and

conceptual improvements with a focus on recent studies and neurological biomarker studies

Biological Material Selection

The choice of biological matrix is an important first step in any proteomics analysis The

ease of sample collection (eg urine plasma or saliva) versus usefulness or localization of

sample (eg specific tissue or proximity fluid) needs to be evaluated early on in a study design

Plasma derived by centrifugation of blood to remove whole cells is a very popular

choice in proteomics due to the high protein content (~65 mg ml1) and the ubiquitous nature of

blood in the body and the ability to obtain large sample amounts or various time points without

the need to sacrifice the animal or to perform invasive techniques Plasma is centrifuged

18

immediately after sample collection unlike serum where coagulation needs to occur first To

obtain plasma blood is collected in a tube with an anticoagulant added (ETDA heparin or

citrate) and centrifuged but previous reports have shown variable results when heparin has been

used as an anticoagulant2 Human Proteome Organization (HUPO) specifically recommends the

anticoagulants EDTA or citrate to treat plasma3 4

One of the primary concerns with plasma is

degradation of the protein content via endogenous proteases found in the sample5 One way to

address this problem is the use of protease inhibitors In addition freezethaw cycles need to be

minimized to prevent protein degradation and variability6 7

Plasma proteomics has seen

extensive coordinated efforts to start assessing the diagnostic needs using plasma8 HUPO also

has established a public human database for plasma and serum proteomics from 35 collaborating

labratories9 Large dynamic range studies have been performed on plasma with a starting sample

amount of 2625 microl (1575 mg) resulting in 3654 proteins identified with a sub 5 false

discovery rate10

The large dynamic range spanning across eleven orders of magnitude as visualized in

Figure 2 is one of the biggest obstacles in plasma proteomics Figure 2 also shows that as lower

abundance proteins are investigated the origins of those identified proteins are more diverse than

the most abundant proteins Recent mining of the plasma proteome showed an ability to search

for disease biomarker applications across seven orders of magnitude In addition the tissue of

origin for the identified plasma proteins were identified and its origin was more diverse as the

protein concentration decreased11

Plasma has been used as a source for biomarker studies such

as colorectal cancer12 13

cardiovascular disease14

and abdominal aortic aneurysm15

Even

though the blood brain barrier prevents direct blood to brain interaction neurological disorders

such as Alzheimerrsquos disease (AD) have had their proteomes studied using plasma16

19

An alternative sample derived from blood is serum which is plasma allowed to coagulate

instead of adding anti-coagulates The time for coagulation is usually 30 minutes and during that

time significant and random degradation from endogenous proteases can occur The additional

variability caused from the coagulation process can change the concentration of multiple

potentially valuable biomarkers As biodiversity between samples or organisms is a challenging

endeavor additional sample variability due to serum generation may be undesirable but serum is

still currently being used for biomarker disease studies17

Serum has been used to compare the

proteome differences in neurological diseases such as AD Parkinsonrsquos disease and amyotrophic

lateral sclerosis and a review can be found elsewhere discussing the subject18

Cerebrospinal fluid (CSF) has a long history as a surrogate biopsy of brain or spinal cord

in evaluating diseases of the central nervous system and has been used for studies in neurological

disorders due to being a rich source of neuro-related proteins and peptides19

The protein

composition of the most abundant proteins in CSF is well defined and numerous studies exist to

broaden the proteins identified20-22

CSF has an exceedingly low protein content (~04 μgμL)

which is ~100 times lower than serum or plasma and over 60 of the total protein content in

CSF consists of a single protein albumin23-25

In addition the variable concentrations of proteins

span up to twelve orders of magnitude further complicating analysis and masking biologically

relevant proteins to any given study26

One of the highest number of identified proteins is from

Schutzer et al with 2630 non-redundant proteins from 14 mL of pooled human CSF This study

involved the removal of highly abundant proteins by performing IgY-14 immunodepletion

followed by two dimensional (2D) liquid chromatography (LC) separation27

Studies have also

been performed to characterize individual biomarkers or complex patterns of biomarkers in

various diseases in the CSF28 29

One potential pitfall of CSF proteomic analysis is

20

contamination from blood which can be identified by counting red blood cells present or

examining surrogate markers from blood contamination other than hemoglobin such as

peroxiredoxin catalase and carbonic anhydrase30

A proof of principle CSF peptidomics study

identified numerous endogenous peptides associated with the central nervous system which can

be used as a bank for neurological disorder studies31

Numerous recent reports highlighted the

utility of CSF analysis for biomarker studies in AD32 33

medulloblastoma34

both post-mortem

and ante-mortem35

Cellular lysates offer the distinct advantage to work with a cell line yeast or bacteria

with large amounts of proteins available for analysis36 37

with Saccharomyces cerevisiae being

the most common cell lysate38 39

Other cell lines are also used including HeLa40

and E coli41

The ability to obtain milligrams of proteins easily to scale up experiments without animal

sacrifice offers a clear advantage in biological sample selection Current literature supports

cellular lysate as a valued and sought after source of proteins for large scale proteomics

experiments because of the ability to assess treatments conditions and testable hypotheses42-44

Cellular lysate from rat B104 neuroblastoma cell line was used as an in vitro model for cerebral

ischemia and showed abundance changes in multiple proteins involved in various neurological

disorders45

Other Sources of Biological Samples

Urine

The urine proteome appears to be another attractive reservoir for biomarker discovery

due to the relatively low complexity compared with the plasma proteome and the noninvasive

collection of urine Urine is often considered as an ideal source to identify biomarkers for renal

21

diseases due to the fact that in healthy adults approximately 70 of the urine proteome originate

from the kidney and the urinary tract 46

thus the use of urine to identify neurological disorders is

neglected However strong evidence have shown that proteins that are associated with

neurodegenerative diseases can be excreted in the urine47-49

indicating the application of urine

proteomics could be a useful approach to the discovery of biomarkers and development of

diagnostic assays for neurodegenerative diseases However the current view of urine proteome

is still limited by factors such as sample preparation techniques and sensitivity of the mass

spectrometers There has been a tremendous drive to increase the coverage of urine proteome

In a recent study Court et al compared and evaluated several different sample preparation

methods with the objective of developing a standardized robust and scalable protocol that could

be used in biomarkers development by shotgun proteomics50

In another study Marimuthu et al

reported the largest catalog of proteins in urine identified in a single study to date The

proteomic analysis of urine samples pooled from healthy individuals was conducted by using

high-resolution Fourier transform mass spectrometry A total of 1823 proteins were identified

of which 671 proteins have not been previously reported in urine 51

Saliva

For diagnosis purposes saliva collection has the advantage of being an easy and non-

invasive technique The recent studies on saliva proteins that are critically involved in AD and

Parkinsonrsquos diseases suggested that saliva could be a potentially important sample source to

identify biomarkers for neurodegenerative diseases Bermejo-Pareja et al reported the level of

salivary Aβ42 in patients with mild AD was noticeably increased compared to a group of

controls 52

In another study Devic et al identified two of the most important Parkinsons

22

disease related proteinsmdash α-synuclein (α-Syn) and DJ-1 in human saliva 53

They observed that

salivary α-Syn levels tended to decrease while DJ-1 levels tended to increase in Parkinsons

disease The published results from this study also suggest that α-Syn might correlate with the

severity of motor symptoms in Parkinsons disease Due in part to recent advancements in MS-

based proteomics has provided promising results in utilizing saliva to explore biomarkers for

both local and systemic diseases 54 55

the further profiling of saliva proteome will provide

valuable biomarker discovery source for neurodegenerative diseases

Tissue

Compared to body fluids such as plasma serum and urine where the proteomic analysis

is complicated by the wide dynamic range of protein concentration the analysis of tissue

homogenates using the well-established and conventional proteomic analysis techniques has the

advantage of reduced dynamic range However the homogenization and extraction process may

suffer from the caveat that spatial information is lost which would be inadequate for the

detection of biomarkers whose localization and distribution play important roles in disease

development and progression Matrix-assisted laser desorptionionization (MALDI) imaging

mass spectrometry (IMS) is a method that allows the investigation of a wide range of molecules

including proteins peptides lipids drugs and metabolites directly in thin slices of tissue 56-59

Because this technology allows for identification and simultaneous localization of biomolecules

of interests in tissue sections linking the spatial expression of molecules to histopathology

MALDI-IMS has been utilized as a powerful tool for the discovery of new cancer biomarker

candidates as well as other clinical applications60 61

The utilization of MALDI-IMS for human

or animal brain tissue to identify or map the distribution of molecules related to

neurodegenerative diseases were also recently reported62 63

23

Secretome

There has been an increasing interest in the study of proteins secreted by various cells

(the secretomes) from tissue-proximal fluids or conditioned media as a potential source of

biomarkers Cell secretomes mainly comprise proteins that are secreted or are shed from the cell

surface and these proteins can play important role in both physiological processes (eg cell

signaling communication and migration) and pathological processes including tumor

angiogenesis differentiation invasion and metastasis In particular the study of cancer cell

secretomes by MS based proteomics has offered new opportunities for cancer biomarker

discovery as tumor proteins may be secreted or shed into the bloodstream and could be used as

noninvasive biomarkers The latest advances and challenges of sample preparation sample

concentration and separation techniques used specifically for secretome analysis and its clinical

applications in the discovery of disease specific biomarkers have been comprehensively

reviewed64 65

Here we only highlight the proteomic profiling of neural cells secretome that has

been applied to neurosciences for a better understanding of the roles secreted proteins play in

response to brain injury and neurological diseases The LC-MS shotgun identification of

proteins released by astrocytes has been recently reported66-68

In these studies the changes

observed in the astrocyte secretomes induced by inflammatory cytokines or cholinergic

stimulation were investigated6667

Alternatively our group performed 2D-LC separation and

included cytoplasmic protein extract from astrocytes as a control to identify cytoplasmic protein

contaminants which are not actively secreted from cells68

Sample Preparation

24

Proteomic analysis and biomarker discovery research in biological samples such as body

fluids tissues and cells are often hampered by the vast complexity and large dynamic range of

the proteins Because disease identifying biomarkers are more likely to be low-abundance

proteins it is imperative to remove the high-abundance proteins or apply enrichment techniques

to allow detection and better coverage of the low-abundance proteins for MS analysis Several

strategies including depletion and protein equalizer approach have been used during sample

preparation to reduce sample complexity69 70

and the latest advances of these methods have been

reviewed by Selvaraju et al 71

Alternatively the complexity of biological samples can be

reduced by capturing a specific subproteome that may have the biological information of interest

The latter strategy is especially useful in the biomarker discovery where the changes in the

proteome are not solely reflected through the concentration level of specific proteins but also

through changes in the post-translational modifications (PTMs) Here we will mainly discuss

the enrichment of phosphoproteinpeptides glycoproteinpeptides and sample preparation for

peptidomics and membrane proteins

Phosphoproteomics

Phosphorylation can act as a molecular switch on a protein by turning it on or off within

the cell It is thought that up to 30 of the proteins can be phosphorylated72

and it plays

significant roles in such biological processes as the cell cycle and signal transduction73

Currently tens of thousands of phosphorylation sites can be proposed using analytical methods

available today74 75

The amino acids that are targeted for phosphorylation studies are serine

threonine and tyrosine with the abundance of detection decreasing typically in that order Other

25

amino acids have been reported to be phosphorylated but traditional phosphoproteomics

experiments ignore these rare events76

In a typical large-scale phosphoproteomics experiment the sample size is usually in

milligram amounts to account for the low stoichiometry of phosphorylated proteins The large

amount of protein is then digested typically with trypsin but alternatively experiments have

been performed with Lys-C digestion to produce large enzymatic peptides The larger peptides

produced from Lys-C render higher charged peptides during electrospray ionization (ESI) and

allow improved electron-based fragmentation to determine specific sites of phosphorylation77

From the pool of peptides phosphopeptides must be enriched otherwise they will be masked by

the vast number and higher ionization efficiency of non-phosphorylated peptides The two most

common enrichment techniques are immobilized metal ion affinity chromatography (IMAC) and

metal oxide affinity chromatography (MOAC) TiO2 being the most common oxide used for this

purpose A recent study reported that phosphorylation of neuronal intermediate filament proteins

in neurofibrillary tangles are involved in Alzheimerrsquos disease78

Glycoproteomics

Protein glycosylation is one of the most common and complicated forms of PTM Types

of protein glycosylation in eukaryotes are categorized as either N-linked where glycans are

attached to asparagine residues in a consensus sequence N-X-ST (X can be any amino acid

except proline) via an N-acetylglucosamine (N-GlcNAc) residue or the O-glycosylation where

the glycans are attached to serine or threonine Glycosylation plays a fundamental role in

numerous biological processes and aberrant alterations in protein glycosylation are associated

with neurodegenerative disease states such as Creutzfeld-Jakob Disease (CJD) and AD79 80

26

Due to the low abundance of glycosylated forms of proteins compared to non-glycosylated

proteins it is essential to enrich glycoproteins or glycopeptides in complex biological samples

prior to MS analysis Two of the most common enrichment methods used in glycoproteomics are

lectin affinity chromatography (LAC) and hydrazide chemistry The detailed methodologies of

LAC hydrazide chemistry and other enrichment methods in glycoproteomics have been

extensively reviewed in the past81 82

In particular LAC is of great interest in studies of

glycosylation alterations as markers of AD and other neurodegenerative diseases due to its recent

applications in brain glycoproteomics83

Our group has utilized multi-lectin affinity

chromatography containing concanavalin A (ConA) and wheat germ agglutinin (WGA) to enrich

N-linked glycoproteins in control and prion-infected mouse plasma84

This method enabled us to

identify a low-abundance glycoprotein serum amyloid P-component (SAP) PNGase F digestion

and Western blotting validation confirmed that the glycosylated form of SAP was significantly

elevated in mice with early prion infection and it could be potentially used as a diagnostic

biomarker for prion diseases

Membrane proteins

Membrane proteins play an indispensable role in maintaining cellular integrity of their

structure and perform many important functions including signaling transduction intercellular

communication vesicle trafficking ion transport and protein translocationintegration85

However due to being relatively insoluble in water and low abundance it is challenging to

analyze membrane proteins by traditional MS-based proteomics approaches Numerous efforts

have been made to improve the solubility and enrichment of membrane proteins during sample

preparation Several comprehensive studies recently covered the commonly used technologies in

27

membrane proteomics and different strategies that circumvent technical issues specific to the

membrane 86-90

Recently Sun et al reported using 1-butyl-3-methyl imidazolium

tetrafluoroborate (BMIM BF4) an ionic liquid (IL) as a sample preparation buffer for the

analysis of integral membrane proteins (IMPs) by microcolumn reversed phase liquid

chromatography (μRPLC)-electrospray ionization tandem mass spectrometry (ESI-MSMS)

The authors compared BMIM BF4 to the other commonly used solvents such as sodium dodecyl

sulfate methanol Rapigest and urea but they found that the number of identified IMPs from rat

brain extracted by ILs was significantly increased The improved identifications could be due to

the fact that BMIM BF4 has higher thermal stability and thus offered higher solubilizing ability

for IMPs which provided better compatibility for tryptic digestion than traditionally used solvent

systems38

In addition to characterization of membrane proteome the investigation of PTMs on

membrane proteins is equally important for characterization of disease markers and drug

treatment targets Phosphorylations and glycosylations are the two most important PTMs for

membrane proteins In many membrane protein receptors the cytoplasmic domains can be

phosphorylated reversibly and function as signal transducers whereas the receptor activities of

the extracellular domains are mediated via N-linked glycosylation Wiśniewski et al provides an

informative summary on recent advances in proteomic technology for the identification and

characterization of these modifications91

Our group has pioneered the development of detergent

assisted lectin affinity chromatography (DALAC) for the enrichment of hydrophobic

glycoproteins using mouse brain extract92

We compared the binding efficiency of lectin affinity

chromatography in the presence of four commonly used detergents and determined that under

certain concentrations detergents can minimize the nonspecific bindings and facilitate the

elution of hydrophobic glycoproteins In summary NP-40 was suggested as the most suitable

28

detergent for DALAC due to the higher membrane protein recovery glycoprotein recovery and

membranous glycoprotein identifications compared to other detergents tested In a different

study on mouse brain membrane proteome Zhang et al reported an optimized protocol using

electrostatic repulsion hydrophilic interaction chromatography (ERLIC) for the simultaneous

enrichment of glyco- and phosphopeptides from mouse brain membrane protein preparation93

Using this protocol they successfully identified 544 unique glycoproteins and 922 glycosylation

sites which were significantly higher than those using the hydrazide chemistry method

Additionally a total of 383 phosphoproteins and 915 phosphorylation sites were identified

suggesting that the ERLIC separation has the potential for simultaneous analysis of both glyco-

and phosphoproteomes

Peptidomics

Peptidomics can be loosely defined as the study of the low molecular weight fraction of

proteins encompassing biologically active endogenous peptides protein fragments from

endogenous protein degradation products or other small proteins such as cytokines and signaling

peptides Studies can involve endogenous peptides94

peptidomic profiling33

and de novo

sequencing of peptides95 96

Neuropeptidomics focuses on biologically active short segments of

peptides and have been investigated in numerous species including Rattus97 98

Mus musculus99

100 Bovine taurus

101 Japanese quail diencephalon

102 and invertebrates

103-106 The isolation of

peptides is typically performed through molecular weight cut-offs from either biofluids such as

CSF plasma or tissue extracts If the protein and peptide content is high such as for tissue or cell

lysates protein precipitation can be done via high organic solvents and the resulting supernatant

can be analyzed for extracted peptides where extraction solvent and conditions could have a

29

significant effect on what endogenous peptides are extracted from tissue107

A comparative

peptidomic study of human cell lines highlights the utility of finding peptide signatures as

potential biomarkers108

A thorough review of endogenous peptides and neuropeptides is beyond

the scope of this review and an excellent review on this topic is available elsewhere109

Fractionation and Separation

The mass spectrometer has a limited duty cycle and data dependent analysis can only

scan a limited number of mz peaks at any given time In addition significant ion suppression

can occur if there is a difference in concentration between co-eluting peptides or if too many

peptides co-elute Therefore one of the biggest challenges in biomarker discovery is the

complexity of the sample and the presence of high-abundance proteins in body fluids such as

CSF serum and plasma In addition to the removal of the most abundant proteins by

immunodepletion the reduction of the complexity of the sample by further fractionation is

indispensable to facilitate the characterization of unidentified biomarkers from the low

abundance proteins Traditionally used techniques for complex protein analysis include gel

based fractionation methods such as two-dimensional gel electrophoresis (2D-GE) and its

variation two-dimensional differential gel electrophoresis (2D-DIGE) or non-gel based such as

one- or multidimensional liquid chromatography (LC) and microscale separation techniques

such as capillary electrophoresis (CE)

2D-GE MS has been widely used as a powerful tool to separate proteins and identify

differentially expressed proteins ever since 2D gels were coupled to mass spectrometry In 2D-

GE MS thousands of proteins can be separated on a single gel according to pI and molecular

weight Individual protein spots that show differences in abundance between different samples

30

can then be excised from the gel digested into peptides and analyzed by MALDI MS or by

liquid chromatography tandem mass spectrometry (LC-MSMS) for protein identification The

introduction of 2D-DIGE adds a quantitative strategy to gel electrophoresis by enabling multiple

protein extracts to be separated on the same 2D gel thus providing comparative analysis of

proteomes in complex samples In 2D-DIGE protein extracts from two different conditions and

an internal standard can be labeled with fluorescent dyes for example Cy3 Cy5 and Cy2

respectively prior to two-dimensional gel electrophoresis Compared to traditional 2D-GE 2D-

DIGE provides the clear advantage of overcoming the inter-gel variation problem 110

Proteomic

profiling of CSF by 2D-GE and 2D-DIGE has led to the identification of putative biomarkers in

multiple neurological disorders For example Brechlin et al reported an optimized 2-DIGE

protocol profiled CSF from 36 CJD patients The applicability of their approach was proven by

the detection of known CJD biomarkers such as 14-3-3 protein neuron-specific enolase lactate

dehydrogenase and other proteins that are potentially relevant to CJD 111

In another study to

identify novel CSF biomarkers for multiple sclerosis CSF from 112 multiple sclerosis patients

and control individuals were analyzed by 2D-GE MS for comparative proteomics Ten potential

multiple sclerosis biomarkers were selected for validation by immunoassay 112

These

methodologies sample preparation techniques and applications of 2D-DIGE in

neuroproteomics were reviewed by Diez et al113

Although 2D gel provides excellent resolving

power and capability to visualize abundance changes there are some limitations to the method

For example gel based separation is not suitable for low abundance proteins extremely basic or

acidic proteins very small or large proteins and hydrophobic proteins114 115

Complementary to gel-based approaches shotgun proteomics coupled to LC have

become increasingly popular in proteomic research because they are reproducible highly

31

automated and capable of detecting low abundance proteins Furthermore another advantage of

LC-MS shotgun proteomics is the suitability for isotope labeling for protein quantification which

is reviewed in a later section In shotgun proteomics a protein mixture is digested and resulting

peptides are separated by LC prior to tandem MS fragmentation to identify different proteins by

peptide sequencing The most common separation for shotgun proteomics peptidomics or top-

down proteomics experiments use low-pH reversed phase (RP) C18 C8 or C4 columns RPLC

is well established which provides high resolution desalts the sample which can interfere with

ionization and the mobile phase is compatible with ESI Nanoscale C18 columns allow for

separation and introduction of sub microgram samples If larger amounts of sample are

available two dimensional separations are usually preferred to greatly enhance the coverage of

the investigated proteome which will be discussed in depth later It is preferable to have an

orthogonal separation method and since RP separates via hydrophobicity strong cation exchange

(SCX) was the original choice due to its separation by charge MudPIT (multidimensional

protein identification technology) usually refers to the use of SCX as the first phase of separation

and is a well-established platform116

SCX has the advantage over RP separation technologies to

effectively remove interfering detergents from the sample SCX separation is not based solely

off charge and hydrophobicity contributes to elution therefore a small amount of organic

modifier usually 10-15 is added to lessen the hydrophobicity effects117

The addition of

organic modifiers needs to be minimized otherwise binding to the C18 trap cartridge or C18

column will be reduced if performed on-line SCX can be used for PTMs and offers specific

applications for proteomic studies and an excellent current review is offered on this subject

elsewhere118

An alternative MudPIT separation scheme employing high pH RPLC as the first

phase of separation and low pH RPLC in the second dimension (RP-RP) has been successfully

32

applied to the proteomic analysis of complex biological samples119 120

The advantage of using

RP as the first dimension is the higher resolution for separation and better compatibility with

down-stream MS detection by eliminating salt Song et al reported a phosphoproteome analysis

based on this 2D RP-RP coupling scheme121

Hydrophilic interaction chromatography (HILIC) employs distinct separation modality

where the retention of peptides is increased with increasing polarity122

The loading of sample is

done by high organic and eluted by increasing the percentage of the aqueous phase or polarity of

the mobile phase opposite from RPLC thus establishing orthogonality of the two separation

modes123

HILIC has quickly become a very useful method and is actively used for proteomic

experiments124

for increased sensitivity125

phosphoproteomics126

glycoproteins127

and

quantification studies128

An alternative and modification to HILIC is ERLIC which adds an

additional mode of separation by electrostatic attraction An earlier study using ERLIC

demonstrated the ability to separate phosphopeptides from non-phosphorylated peptides at

pH=2129

A recent study looking into changes in the phosphoproteome of Marekrsquos Disease

applied ERLIC to chicken embryonic fibroblast lysate identifying only 13 phosphopeptides

out of all the identified peptides Due to the lack of isolation of phosphopeptides from ERLIC

the investigators performed immobilized metal affinity chromatography (IMAC) enrichment on

the fractions increasing identification of phosphopeptides over 50 fold130

A comparative study

of ERLIC to HILIC and SCX following TiO2 phospho-enrichment reported that

SCXgtERLICgtHILIC for phosphopeptide identifications126

Recent developments in instrumentation to combine LC with ion mobility spectrometry

(IMS) and MS (LC-IMS-MS) offered more advantages than conventional LC due to the rapid

high-resolution separations of analytes based on their charge mass and shape as reflected by

33

mobility in a given buffer gas The mobility of an ion in a buffer gas is determined by the ionrsquos

charge and its collision cross-section with the buffer gas The methodologies of IMS separations

and the application of LC-IMS-MS for the proteomics analysis of complex systems including

human plasma have been reviewed by Clemmerrsquos group131-133

They proposed a method that

employs intrinsic amino acid size parameters to obtain ion mobility predictions which can be

used to rank candidate peptide ion assignments and significantly improve peptide identification

134

Although 2D gel and LC are routinely used as separation techniques in MS-based

proteomics capillary electrophoresis (CE) has received increasing attention as a promising

alternative due to the fast and high-resolution separation it offers CE has a wide variety of

operation modes among which capillary zone electrophoresis (CZE) and capillary isoelectric

focusing (CIEF) have the greatest potential applications in MS-based proteomics thus will be

highlighted here CZE separates analytes by their charge-to-size ratios in buffers under a high

electrical field and is often used as the final dimension prior to MS analysis while the separation

feature of CIEF is based on isoelectric point and this technique is more suitable to be used as the

first dimension separation Detailed description of different CEndashMS interfaces sample

preconcentration and capillary coating to minimize analyte adsorption could be found in several

reviews135-141

CE technique is complementary to conventional LC in that it is suitable for the

analysis of polar and chargeable compounds Dovichirsquos group conducted proteomic analysis of

the secreted protein fraction of Mycobacterium marinum which has intermediate protein

complexity142

The tryptic digests were either analyzed by UPLC-ESI-MSMS in triplicates or

prefactionated by RPLC followed by CZE-ESI-MSMS It was demonstrated that the two

methods identified similar numbers of peptides and proteins within similar analysis times

34

However CZE-ESI-MSMS analysis of the prefractionated sample tended to identify more

peptides that are basic and have lower mz values than those identified by UPLC-ESI-MSMS

This analysis also presented the largest number of protein identifications by using CE-MSMS

suggesting the effectiveness of prefractionation of complex samples by LC method prior to CZE-

ESI-MSMS The use of CIEF as the first dimension of separation provides both sample

concentration and excellent resolving power The combination of CIEF and RPLC separation

has been applied to the proteomic analyses where the amount of protein sample is limited and

cannot meet the requirement of minimal load amount for 2D LC-MSMS143 144

So far CE-MS

has been widely applied to the proteomic analysis of various biological samples such as urine145

146 CSF

147 blood

148 frozen tissues

149 and the formalin-fixed and paraffin-embedded (FFPE)

tissue samples150

The recent CEndashMS applications to clinical proteomics have been summarized

in several reviews135 151 152

Protein Quantification

In 2D gel electrophoresis the quantitative analysis of protein mixtures is performed on

the gel by comparing the intensity of the protein stain The development of 2D-DIGE eliminated

the gel-to-gel variation and greatly improved the quantitative capability and reliability of 2D gel

methodology110

However the accuracy of 2D gel based protein quantification suffers from the

limitations that a seemingly single gel spot often contains multiple proteins and the difficulty of

detecting proteins with extreme molecular weights and pI values as well as highly hydrophobic

proteins such as membrane proteins Therefore non-gel based shotgun proteomics technology is

more suitable for accurate and large-scale protein identification and quantification in complex

samples Briefly the quantification in non-gel based shotgun proteomics can be categorized into

35

two major approaches stable isotope labeling-based and label-free methods The common

strategies for quantitative proteomic analysis are reviewed and summarized in Table 1

Isotope labeling methods

Because stable isotope-labeled peptides have the same chemical properties as their

unlabeled counterparts the two peptides within a mixture should exhibit identical behaviors in

MS ionization The mass difference introduced by isotope labeling enables the detection of a

pair of two distinct peptide masses by MS within the mixture and allowing for the measurement

of the relative abundance differences between two peptides Depending on how isotopes are

incorporated into the protein or peptide these labeling methods can be divided into two groups

In vitro chemical derivatization techniques which incorporate a label or tag into the peptide or

protein during sample preparation metabolic labeling techniques which introduce the isotope

label directly into the organism via isotope-enriched nutrients from food or media

1 In vitro derivatization techniques

There are multiple methods to introduce heavy isotopes into proteins or peptides in vitro

The commonly used strategies include 18

O 16

O enzymatic labeling Isotope-Coded Affinity Tag

(ICAT) Tandem Mass Tags (TMTs) and Isobaric Tags for Relative and Absolute Quantification

(iTRAQ) The 18

O labeling method enzymatically cleaves the peptide bond with trypsin in the

presence of 18

O-enriched H2O and introduces 4-Da mass shift in the tryptic peptides153

The

advantages of this method include 18

O-enriched water is extremely stable tryptic peptides will

be labeled with the same mass shift secondary reactions inherent to other chemical labeling can

be avoided Conversely widespread use of 18

O-labeling has been hindered due to the difficulty

of attaining complete 18

O incorporation and the lack of robustness154 155

Currently ICAT

36

TMTs and iTRAQ methods are extensively used in quantitative proteomics In ICAT cysteine

residues are specifically derivatized with a reagent containing either zero or eight deuterium

atoms as well as a biotin group for affinity purification of cysteine-containing peptides156 157

The advantage of ICAT is that the affinity purification via biotin moiety can facilitate the

detection of low-abundance cysteine-containing peptides In addition the mass difference

introduced by labeling increases mass spectral complexity with quantification from the different

precursor masses done by MS and peptide identification being achieved through tandem MS

(MSMS) This added complexity from different peptide masses was addressed by using isobaric

labeling methods such as TMTs and iTRAQ 158 159

where the same peptides in different samples

are isobaric after tagging and appear as single mz in MS scans thus enhancing the peptide limit

of detection and reducing the MS scan complexity Isobaric labeling reagents are composed of a

primary amine reactive group and an isotopic reporter group linked by an isotopic balancer group

for the normalization of the total mass of the tags The reporter group serves for quantification

purpose since it is cleaved during collision-induced dissociation (CID) to yield a characteristic

isotope-encoded fragment Moreover isobaric labeling methods allow the comparison of

multiple samples within a single experiment Recently a 6-plex version of TMTs was

reported160

and iTRAQ enables up to eight samples to be labeled and relatively quantified in a

single experiment161

8-plex iTRAQ reagents have been used for the comparison of complicated

biological samples such as CSF in the studies of neurodegenerative diseases 162

Recently our

group developed a novel N N-dimethyl leucine (DiLeu) 4-plex isobaric tandem mass (MS2)

tagging reagents with high quantitation efficacy DiLeu has the advantage of synthetic simplicity

and greatly reduced synthesis cost compared to TMTs and iTRAQ163

Xiang et al demonstrated

that DiLeu produced comparable iTRAQ ability for protein sequence coverage (~43) and

37

quantitation accuracy (lt15) for tryptically digested proteins More importantly DiLeu

reagents could promote enhanced fragmentation of labeled peptides thus allowing more

confident peptide and protein identifications

2 In Vivo Metabolic Labeling

Metabolic processes can also be employed for the incorporation of stable-isotope labels

into the proteins or organisms by enriching culture media or food with light or heavy versions of

isotope labels (2H

13C

15N) The advantage of in vivo labeling is that metabolic labeling does

not suffer from incomplete labeling which is an inherent drawback for in vitro derivatization

techniques In addition metabolic labeling occurs from the start of the experiment and proteins

with light or heavy labels are simultaneously extracted thus reducing the error and variability of

quantification introduced during sample preparation The most widely used strategy for

metabolic labeling is known as stable-isotope labeling of amino acids in cell culture (SILAC)

which was introduced by Mann and co-workers164 165

In SILAC one cell population is grown

in normal or light media while the other is grown in heavy media enriched with a heavy

isotope-encoded (typically 13

C or 15

N) amino acid such as arginine or leucine Cells from the

two populations are then combined proteins are extracted digested and analyzed by MS The

relative protein expression differences are then determined from the extracted ion

chromatograms from both the light and heavy peptide forms SILAC has been shown to be a

powerful tool for the study of intracellular signal transduction In addition this technique has

recently been applied to the quantitative analysis of phosphotyrosine (pTyr) proteomes to

characterize pTyr-dependent signaling pathways166 167

38

Labe-free quantification

Although various isotope labeling methods have provided powerful tools for quantitative

proteomics several limitations of these approaches are noted Labeling increases the cost and

complexity of sample preparation introduces potential errors during the labeling reaction It also

requires a higher sample concentration and complicates data processing and interpretation In

addition so far only TMTs and iTRAQ allow the comparison of multiple (up to eight) samples

simultaneously The comparison of more than eight samples in a single experiment cannot be

achieved by isotope labeling In order to address these concerns there has been significant

interest in the development of label-free quantitative approaches Current label-free

quantification methods for MS-based proteomics were developed based on the observation that

the chromatographic peak area of a peptide168 169

or frequency of MSMS spectra170

correlating

to the protein or peptide concentration Therefore the two most common label-free

quantification approaches are conducted by comparing (i) area under the curve (AUC) of any

given peptides171 172

or (ii) by frequency measurements of MSMS spectra assigned to a protein

commonly referred to as spectral counting173

Several recent reviews provided detailed and

comprehensive knowledge comparing label-free methods with labeling methods data processing

and commercially available software for label-free quantitative proteomics174-177

Dissociation Techniques

The vast majority of proteomic experiments have proteins or peptides being identified by

two critical pieces of data obtained from the mass spectrometer The first is the precursor ion

identified by its mz which is informative to the mass of the peptide being analyzed The second

is the use of tandem mass spectrometry to fragment or dissociate the precursor ion and record the

39

generated fragment ion pattern to discern the amino acid sequence The three most popular

dissociation or fragmentation techniques for peptides are CID electron-transfer dissociation

(ETD) and high-energy collision dissociation (HCD) A recent study on the human plasma

proteome demonstrated that combined fragmentation techniques enhance coverage by providing

complementary information for identifications CID enabled the greatest number of protein

identifications while HCD identified an additional 25 proteins and ETD contributed an

additional 13 protein identifications178

ETDECD

Electron capture dissociation (ECD) 179

preceded ETD but ECD was developed for use

in a Penning trap for Fourier transform ion cyclotron resonance (FTICR) mass spectrometers

ECD requires the ion of interest to be in contact with near-thermal electrons and for the electron

capture event to occur on the millisecond time scale but the time scale is inadequate for electron

trapping in Paul traps or quadrupoles in the majority of mass spectrometers180

ETD involves a

radical anion like fluoranthene with low electron affinity to be transferred to peptide cation

which results in more uniform cleavage along the peptide backbone The cation accepts an

electron and the newly formed odd-electron protonated peptide undergoes fragmentation by

cleavage of the N-Cα bond which results in fragmentation ions consisting of c- and zbull-type

product ions The uniformed cleavage results in reduced sequence discrimination to labile bonds

such as PTMs and also provides improved sequencing for larger peptides compared to CID181

The realization that larger peptides produced better MSMS quality spectra compared to CID led

to a decision tree analysis strategy where peptide charge states and size determined whether the

precursor peptide would be fragmented with CID or ETD182

One of the main benefits of

ETDECD is the ability to sequence peptides with labile PTMs such as phosphorylation77 183

40

sulfation184

glycosylation185

ubiquitination186

and histone modifications187

ETD also has the

benefit of providing better sequence information on larger neuropeptides when compared to

CID188

However a thorough analysis suggested that CID still yielded more peptideprotein

identifications than ETD in large scale proteoimcs189

HCD

High energy collision dissociation (HCD)190

is an emerging fragmentation technique that

offers improved detection of small reporter ions from iTRAQ-based studies191 192

HCD is

performed at a higher energy in a collision cell instead of an ion trap like CID thus HCD does

not suffer from the low-mass cutoff limitation Furthermore HCD offers enhanced

fragmentation efficiency assisting in MSMS spectra interpretation and protein identification193

A major drawback for HCD is that the spectral acquisition times are up to two-fold longer due to

increased ion requirement for Fourier transform detection in the orbitrap194

HCD has been

reported to increase phosphopeptide identifications over CID74

but in a different study CID was

reported to offer more phosphopeptide identifications over HCD194

Work has also been done to

transfer the decision tree analysis for HCD which basically switches CID with HCD claiming

better quality data determined by higher Mascot scores with more peptide identifications195

MSE

Data dependent acquisition (DDA) is the most commonly used ion selection process in

mass spectrometers for proteomic experiments An alternative process which does not have ion

selection nor switch between MS and MSMS modes is termed MSE MS

E is a data independent

mode and does not require precursor ions of a significant intensity to be selected for MSMS

analysis196

A data independent mode decouples the mass spectrometer choosing which

precursor ions to fragment and when the ions are fragmented MSE works by a low or high

41

energy scan and no ion isolation is occurring The low energy scan is where the precursor ion is

not fragmented and the high energy scan allows fragmentation The resulting mix of precursor

and fragmentation ions is then detected simultaneously197

The data will then need to be

deconvoluted using a proprietary time-aligned algorithm that is discussed elsewhere198

The

continuous data independent acquisition allows multiple MSMS spectra to be collected during

the natural analyte peak broadening observed in chromatography which provides more data

points for AUC label-free quantification In addition lower abundance peptides can be

sequenced as more MSMS spectra are collected throughout the elution of an LC peak allowing

better signal averaging for smaller analyte peak of interest during coelution and reducing

sampling bias in typical DDA experiments where only more abundant peaks can be selected for

fragmentation

A comparison of spiked internal protein standards into a complex protein digest provided

evidence that MSE was comparable to DDA analysis in LC-MS

199 MS

E has been used for label

free proteomics of immunodepleted serum in large scale proteomics samples200

In addition

MSE was performed for the characterization of human cerebellum and primary visual cortex

proteomes Hundreds of proteins were identified including many previously reported in

neurological disorders201

MSE is quickly becoming a versatile data acquisition method recently

used in such studies as cancer cells202

schizophrenia203

and pituitary proteome discovery204

The usefulness of MSE as an unbiased data acquisition method is being assimilated into multiple

proteomics studies including studies involving neurological disorders

Data Analysis

42

One of the major bottlenecks in non-targeted proteomic experiments is how to handle the

enormous amount of data obtained Database searches biostatistical analysis de novo

sequencing PTM validation all have their place and multiple available platforms are available

If the organism being studied has had its genome sequenced databases can be created

with a list of proteins in the FASTA format to be used in database searching There are

numerous database searching algorithms for sequence identification of MSMS data including

Mascot205

Sequest206

Xtandem207

OMSSA208

and PEAKS209

These searching algorithms are

performed by matching MSMS spectra and precursor mass to sequences found within proteins

How well the actual spectra match the theoretical spectra determines a score which is unique to

the searching algorithm and usually can be extrapolated to the probability of a random hit

Recently a database has been developed for PTM analysis by the use of the program SIMS210

Specifically for phosphopeptides Ascorersquos algorithm scans the MSMS data to determine the

likelihood of correct phosphosite identification from the presence of site identifying product

ions211

If the organism that is being analyzed has not had its genome sequenced and no (or very

limited) FASTA database is available a homology search can be performed using SPIDER212

available with PEAKS software Alternatively individual MSMS spectrum can be de novo

sequenced but software is available to perform automated de novo sequencing of numerous

spectra (PEAKS208

DeNovoX and PepSeq)

For large-scale protein identifications the false discovery rate (FDR) must be established

by the searching algorithm and that is accomplished by re-searching the data with a false

database created by reversing or scrambling the amino acid sequence of the original database

used for the protein search Any hits from the false database will contribute to the FDR and this

value can be adjusted usually around 1 An additional layer of confidence in the obtained data

43

can be achieved in shotgun proteomics experiments by removing all the proteins that are

identified by only one peptide

Once a set of confident proteins or peptides have been generated from database

searching bioinformatic analysis or biostatistical analysis is needed Numerous software

packages are available for different purposes FLEXIQuant is an example for absolute

quantitation of isotopically labeled protein or peptides of interest213

FDR analysis of

phosphopeptides or other specific PTMs can be adjusted with such software as Scaffold

providing data consisting only of a specific modification214

Bioinformatic tools such as

Scaffold or ProteoIQ also include gene ontology (GO) analysis which can classify identified

proteins by three categories cellular component molecular function or biological process

Custom bioinformatics programs can also be developed and are often useful in various proteomic

studies including biomarker discovery in neurological diseases215

More detailed review of

bioinformatics in peptidomics216

and proteomics217

can be found elsewhere

Validation of Biomarkers by Targeted Proteomics

The validation of putative biomarkers identified by MS-based proteomic analysis is often

required to provide orthogonal analysis to rule out a false positive by MS and providing

additional evidence for the biomarker candidate(s) from the study for future potential clinical

assays At present antibody-based assays such as Western blotting ELISA and

immunochemistry are the most widely used methods for biomarker validation Although accurate

and well established these methods rely on protein specific antibodies for the measurement of

the putative biomarker and could be difficult for large-scale validation of all or even a subset of a

long list of putative protein biomarkers typically obtained by MS-based comparative proteomic

44

analysis Large scale validation is impractical due to the cost for each antibody the labor to

develop a publishable Western blot or ELISA and the antibody availability for certain proteins

As an alternative strategy quantitative assays based on multiple-reaction monitoring (MRM) MS

using a triple quadrupole mass spectrometer have been employed in biomarker verification

MRM is the most common use of MSMS for absolute quantitation It is a hypothesis

driven experiment where the peptide of interest and its subsequent fragmentation pattern must be

known prior to the quantitative MRM experiments MRM involves selecting a specific mz (first

quadrupole) to be isolated for fragmentation (second quadrupole) followed by one or more of

the most intense fragment ions (third quadrupole) being monitored The ability to quantitate and

thus validate the proteins or peptides as potential biomarkers is achieved by performing MRM on

isotopically labeled reference peptide for targeted peptideprotein of interest The main obstacle

for quantification of peptides is interference and ion suppression effects from co-eluting

substances Since the isotopically labeled and native peptide will co-elute the same interference

and ion suppression will occur for both peptides and thus correcting these interfering effects

Peptides need to be systematically chosen for a highly sensitive and reproducible MRM

experiment to ensure proper validation of putative biomarkers Peptides require certain intrinsic

properties which include an mz within the practical mass detection range for the instrument and

high ionization efficiency If the desired peptide to be quantified is derived from a digestion

then peptides that have detectable incomplete digestion or missed cleavage site can be a major

source of variability Peptides with a methionine and to a lesser extent tryptophan are

traditionally removed from consideration from MRM quantitative experiments due to the

variable nature of the oxidation that can occur In addition if chromatographic separation is

performed the retention behavior of the peptide must be well behaved with little tailing effects

45

eluting late causing broadening of the peak and even irreversible binding to the column As an

example hydrophilic peptides being eluted off a C18 column may exhibit the previously

described concerns and a different chromatographic separation will need to be explored for

improved limits of detection quantitation and validation To determine consistent peptide

detection or usefulness of certain peptides databases such as Proteomics Database218

PRIDE219

PeptideAtlas220

have been developed to compile proteomic data repositories from initial

discovery experiments

After the peptide is selected for analysis the proper MRM transitions need to be selected

to optimize the sensitivity and selectivity of the experiment It is common for investigators to

select two or three of the most intense transitions for the proposed experiment It is imperative

that the same instrument is used for the determination of transition ions as different mass

spectrometers may have a bias towards different fragment ions

MRM experiments are still highly popular experiments for hypothesis directed

experiments221

biomarker analysis222

and validation223

Validation of putative biomarkers is

increasingly becoming a necessary step when performing large scale non-hypothesis driven

proteomics experiments The traditional validation techniques of ELISA Western blotting and

immunohistochemistry are still used but MRM experiments are becoming an attractive

alternative for validation of putative biomarkers due to its enhanced throughput and specificity

Current work is still being performed to both expand the linear dynamic range224

and

sensitivity225

of MRM A recent endeavor to increase the sensitivity for MRM experiments was

accomplished by ldquoPulsed MRMrdquo via the use of an ion funnel trap to enhance confinement and

accumulation of ions The authors claimed an increase by 5-fold for peak amplitude and a 2-3

fold reduction in chemical background225

46

Remaining Challenges and Emerging Technologies

Large sample numbers for mass spectrometry analysis

Multiple conventional studies in proteomics have been performed on a single or a few

biological samples As bio-variability can be exceedingly high the need for larger sample sizes

is currently being investigated Prentice et al used a starting point of 3200 patient samples

from the Womenrsquos Health Institute (WHI) to probe the plasma proteome using MS for

biomarkers The study did not test the 3200 patient samples by MS because even a simple one

hour one dimensional RP analysis on a mass spectrometer would take months of instrument time

for uninterrupted analysis Instead the authors pooled 100 samples together to bring the total

number of pooled samples to 32 To provide relevant plasma biomarkers the samples were then

subjected to immunodepletion 2-D protein separation (96 fractions total) and then 1-D RPLC of

tryptic peptide separation on-line interface to a mass spectrometer The large sample cohorts

help address bio-variability that can be a concern from small sample size proteomic experiments

and provide ample sample amounts to investigate the low abundance proteins226

Hemoglobin-derived neuropeptides and non-classical neuropeptides

Neuropeptides such as neuropeptide Y and enkephalin are short chains of amino acids

that are secreted from a range of neuronal cells that signal nearby cells In contrast non-classical

neuropeptides are termed as neuropeptides or ldquomicroproteinsrdquo which are derived from

intracellular protein fragments and synthesized from the cytosol227

MS was recently used to

determine that hemopressins which are hemoglobin-derived peptides are upregulated in Cpefatfat

mice brains Gelman et al designed an MS experiment to compare hemoglobin-derived

47

peptides comparing the brain blood and heart peptidome in mice The authors provided data

that specific hemoglobin peptides were produced in the brain and were not produced in the

blood Certain alpha and beta hemoglobin peptides were also up regulated in the brain for

Cpefatfat

mice and bind to CB1 cannabinoid receptors228

As discussed earlier in the review

peptidomics and specifically neuropeptidomics are popular fields of study utilizing MS and non-

classical neuropeptides is an exciting emerging area of research that could further expand the

diversity of cell-cell signaling molecules

Ultrasensitive mass spectrometry for single cell analysis

In addition to large scale analysis MS-based proteomics and peptidomics are making

progress into ultrasensitive single cell analysis The most successful MS-based techniques for

single cell analysis was performed with MALDI and studies that have been performed on

relatively large neurons are reviewed elsewhere229

The ultrasensitive MS analysis is currently

directed towards single cell analysis of smaller cells including cancer cells The first challenge

in single cell analysis is the isolation and further sample preparation to yield relevant data

Collection and isolation of a cell type can be accomplished using antibodies for fluorescence

activated cell sorting (FACS) and immune magnetic separation FACS works by flow cytometry

sorting cells by a laser that excites a fluorescent tag that is attached to an antibody Immune

magnetic separation allows separation by antibodies with magnetic properties such as

Dynabeads230

One exciting study combining FACS and MS termed mass cytometry This

technology works by infusing a droplet into an inductively coupled plasma mass spectrometer

(ICP-MS) containing a single cell bound to antibodies chelated to transition elements allowing a

quantifying response between single cells231

Clearly the future of single cell analysis for

48

biomarker analysis and proteomics is encouraging and has the potential to be an emerging field

in MS-based proteomics and peptidomics

Laserspray ionization (LSI)

Laserspray ionization (LSI) is an exciting new method to produce multiply charged mass

spectra from MALDI that is nearly identical to ESI232-234

Recently it has been reported that LSI

can be performed in lieu of matrix to produce a total solvent-free analysis234

The benefits of

being able to generate multiply charged peptides without any solvent may offer advantages

including MS analysis of insoluble membrane proteins or hydrophobic peptides avoidance of

chemical reactions while in solvents a reduction of sample loss due to liquid sample preparation

and ability to avoid diffusion effects from tissue imaging studies234

The multiply charged peptide and protein ions produced by LSI expand the mass range

for tissue imaging analysis More importantly the multiply-charged peptide ions are amenable

for electron-based fragmentation methods such as ETD or ECD which can be employed in

conjunction with tissue imaging experiments to yield in situ sequencing and identification of

peptides of interest235

Paper spray ionization

Paper spray (PS) is an ambient ionization method which was first reported using

chromatography paper allowing detection of metabolites from dried blood spots The original

method used a cut out piece of paper with a voltage clipped on the back while applying 10 microL of

methanolH2O236

Improvements have been made to this technology to enhance analysis

efficiency with a new solvent 91 dichloromethaneisopropanol (vv) and the use of silica paper

49

over chromatography paper237

Interesting applications or modifications have been made to PS

including direct analysis of biological tissue238

and leaf spray for direct analysis of plant

materials239

but both detect metabolites instead of proteins or peptides Paper spray ionization

was previously shown to enable detection of cytochrome c and bradykinin [2-9] standards in a

proof of principle study240

Clearly the utility of PS analysis in proteomics and peptidomics is

yet to be explored

niECD

New fragmentation techniques have been investigated for their utility in proteomics and

peptidomics including a recently reported negative-ion electron capture dissociation (niECD)

Acidic peptides which usually contain PTMs such as phosphorylation or sulfonation are often

difficult to be detected as multiply charged peptides in the positive ion mode As discussed

earlier multiply charged peptides are required for ECDETD fragmentation The fragmentation

of niECD is accomplished by a multiply negatively charged peptide adding an electron The

resulting fragmentation of multiply sulfated and phosphorylated peptide and protein standards

showed no sulfate loss and preserved phosphorylation site The resulting fragmentation pattern

from niECD was also improved in the peptide anions and provides a new strategy for de novo

sequencing with PTM localization241

Conclusions and Perspectives

Proteomics methodologies have produced large datasets of proteins involved in various

biological and disease progression processes Numerous mass spectrometry-based proteomics

and peptidomics tools have been developed and are continuously being improved in both

50

chromatographic or electrophoretic separation and MS hardware and software However several

important issues that remain to be addressed rely on further technical advances in proteomics

analysis When large proteomes consisting of thousands of proteins are analyzed and quantified

dynamic range is still limited with more abundant proteins being preferentially detected

Development and optimization of chemical tagging reagents that target specific protein classes

maybe necessary to help enrich important signaling proteins and assess cellular and molecular

heterogeneity of the proteome and peptidome Furthermore a significant bottleneck in

usefulness of proteomics research is the ability to validate the results and provide clear

significant biological relevance to the results The idea of P4 medicine242 243

is an attractive

concept where the four Prsquos stand for predictive preventive personalized and participatory

Proteomics is one of the critical ldquoomicsrdquo fields and has led to the development of enabling

innovative strategies to P4 medicine244

A goal of P4 medicine is to assess both early disease

detection and disease progression in a person A simplified example of how proteomics fits into

P4 medicine is that certain brain-specific proteins could be used for diagnosis with

presymptomatic prion disease244

The concept of proteomic experiments providing an individual

biomarker is becoming more obsolete with the revised vision being a biomolecular barcode that

could potentially be ldquoscannedrdquo or be a fingerprint for a specific disease or early onset to that

disease being closer to reality An excellent review on what biomarker analysis can do for true

patients is available245

Proteomics can also generate new hypothesis that can be tested by classical biochemical

approaches If a disease has an unknown pathogenesis proteomics is a good starting point to try

to assemble putative markers that can lead to further hypothesis for evaluation If a particular

protein or PTM is associated with a disease state either qualitatively or quantitatively potential

51

treatments could target that protein of interest or investigators could monitor that protein or

PTM during potential treatments of the disease Proteomics has expanded greatly over the last

few decades with the goal of providing revealing insights to some of the most complex

biological problems currently facing the scientific community

Acknowledgements

Preparation of this manuscript was supported in part by the University of Wisconsin Graduate

School Wisconsin Alzheimerrsquos Disease Research Center Pilot Grant and a Department of

Defense Pilot Award LL acknowledges an H I Romnes Faculty Fellowship

52

Figure 1 A summary of general workflows of biomarker discovery pipeline by MS-based

proteomic approaches

Biological sample (CSF blood urine saliva cell

lysate tissue homogenates secreted proteins etc)

Protein extraction Sample pretreatment

2D-GE2D-DIGE MS 1D or 2D LC-MSMS

MALDI-IMS

Identification of

differentially

expressed proteins

Protein identification

Potential biomarkers

Biomarker validation

- Antibody based immunoassays

- MRM

Quantitative analysis

- Isotope labeling

- Label free

Identification and

localization of

differentially expressed

biomolecules

Intact tissue

Sample preparation Slice frozen tissues

thaw-mounted on plate

Apply Matrix

53

Figure 2 Tissue expression and dynamic range of the human plasma proteome A pie chart

representing the tissue of origin for the high abundance proteins shows that the majority of

proteins come from the liver (A) Conversely the lower abundance plasma proteins have a much

more diverse tissue of origin (B and C) The large dynamic range of plasma proteins is presented

and the proteins can be grouped into three categories (classical plasma proteins tissue leakage

products interleukinscytokines) (D) Adapted from Zhang et al11

and Schiess et al246

with

permission

54

55

Table 1 A summary of the common strategies applied to MS-based quantitative proteomic

analysis

Gel based Stable isotope labeling Label free

2D-GE

2D-DIGE 110

In vitro derivatization

18O

16O

153

ICAT 156

TMT 159

iTRAQ 158

Formaldehyde 247

ICPL 248

In vivo metabolic labeling

14N

15N

249

SILAC 164

AUC measurement 169 172

Spectral counting 173

AUC Area Under Curve ICAT Isotope-Coded Affinity Tag TMT Tandem Mass Tags iTRAQ Isobaric Tags for

Relative and Absolute Quantification ICPL Isotope Coded Protein Labeling SILAC Stable Isotope Labeling by

Amino Acids in Cell Culture Isobaric Tags for Relative and Absolute Quantification (iTRAQ)

56

References

1 Liu T Qian W J Mottaz H M Gritsenko M A Norbeck A D Moore R J

Purvine S O Camp D G 2nd Smith R D Evaluation of multiprotein

immunoaffinity subtraction for plasma proteomics and candidate biomarker discovery

using mass spectrometry Mol Cell Proteomics 2006 5 (11) 2167-74

2 Holten-Andersen M N Murphy G Nielsen H J Pedersen A N Christensen I J

Hoyer-Hansen G Brunner N Stephens R W Quantitation of TIMP-1 in plasma of healthy

blood donors and patients with advanced cancer Br J Cancer 1999 80 (3-4) 495-503

3 Tammen H Schulte I Hess R Menzel C Kellmann M Mohring T Schulz-

Knappe P Peptidomic analysis of human blood specimens comparison between plasma

specimens and serum by differential peptide display Proteomics 2005 5 (13) 3414-22

4 Rai A J Gelfand C A Haywood B C Warunek D J Yi J Schuchard M D

Mehigh R J Cockrill S L Scott G B Tammen H Schulz-Knappe P Speicher D W

Vitzthum F Haab B B Siest G Chan D W HUPO Plasma Proteome Project specimen

collection and handling towards the standardization of parameters for plasma proteome samples

Proteomics 2005 5 (13) 3262-77

5 Lippi G Guidi G C Mattiuzzi C Plebani M Preanalytical variability the dark side

of the moon in laboratory testing Clin Chem Lab Med 2006 44 (4) 358-65

6 Holten-Andersen M N Schrohl A S Brunner N Nielsen H J Hogdall C K

Hogdall E V Evaluation of sample handling in relation to levels of tissue inhibitor of

metalloproteinases-1 measured in blood by immunoassay Int J Biol Markers 2003 18 (3) 170-

6

7 Ytting H Christensen I J Thiel S Jensenius J C Svendsen M N Nielsen L

Lottenburger T Nielsen H J Biological variation in circulating levels of mannan-binding

lectin (MBL) and MBL-associated serine protease-2 and the influence of age gender and

physical exercise Scand J Immunol 2007 66 (4) 458-64

8 Hanash S Building a foundation for the human proteome the role of the Human

Proteome Organization J Proteome Res 2004 3 (2) 197-9

9 Omenn G S States D J Adamski M Blackwell T W Menon R Hermjakob H

Apweiler R Haab B B Simpson R J Eddes J S Kapp E A Moritz R L Chan D W

Rai A J Admon A Aebersold R Eng J Hancock W S Hefta S A Meyer H Paik Y

K Yoo J S Ping P Pounds J Adkins J Qian X Wang R Wasinger V Wu C Y

Zhao X Zeng R Archakov A Tsugita A Beer I Pandey A Pisano M Andrews P

Tammen H Speicher D W Hanash S M Overview of the HUPO Plasma Proteome Project

results from the pilot phase with 35 collaborating laboratories and multiple analytical groups

generating a core dataset of 3020 proteins and a publicly-available database Proteomics 2005 5

(13) 3226-45

10 Liu T Qian W J Gritsenko M A Xiao W Moldawer L L Kaushal A Monroe

M E Varnum S M Moore R J Purvine S O Maier R V Davis R W Tompkins R

G Camp D G 2nd Smith R D High dynamic range characterization of the trauma patient

plasma proteome Mol Cell Proteomics 2006 5 (10) 1899-913

11 Zhang Q Faca V Hanash S Mining the plasma proteome for disease applications

across seven logs of protein abundance J Proteome Res 2011 10 (1) 46-50

12 Matsubara J Honda K Ono M Sekine S Tanaka Y Kobayashi M Jung G

Sakuma T Nakamori S Sata N Nagai H Ioka T Okusaka T Kosuge T Tsuchida A

57

Shimahara M Yasunami Y Chiba T Yamada T Identification of adipophilin as a potential

plasma biomarker for colorectal cancer using label-free quantitative mass spectrometry and

protein microarray Cancer Epidemiol Biomarkers Prev 2011 20 (10) 2195-203

13 Murakoshi Y Honda K Sasazuki S Ono M Negishi A Matsubara J Sakuma

T Kuwabara H Nakamori S Sata N Nagai H Ioka T Okusaka T Kosuge T

Shimahara M Yasunami Y Ino Y Tsuchida A Aoki T Tsugane S Yamada T Plasma

biomarker discovery and validation for colorectal cancer by quantitative shotgun mass

spectrometry and protein microarray Cancer Sci 2011 102 (3) 630-8

14 Addona T A Shi X Keshishian H Mani D R Burgess M Gillette M A

Clauser K R Shen D Lewis G D Farrell L A Fifer M A Sabatine M S Gerszten R

E Carr S A A pipeline that integrates the discovery and verification of plasma protein

biomarkers reveals candidate markers for cardiovascular disease Nat Biotechnol 2011 29 (7)

635-43

15 Acosta-Martin A E Panchaud A Chwastyniak M Dupont A Juthier F Gautier

C Jude B Amouyel P Goodlett D R Pinet F Quantitative mass spectrometry analysis

using PAcIFIC for the identification of plasma diagnostic biomarkers for abdominal aortic

aneurysm PLoS One 2011 6 (12) e28698

16 Ray S Britschgi M Herbert C Takeda-Uchimura Y Boxer A Blennow K

Friedman L F Galasko D R Jutel M Karydas A Kaye J A Leszek J Miller B L

Minthon L Quinn J F Rabinovici G D Robinson W H Sabbagh M N So Y T

Sparks D L Tabaton M Tinklenberg J Yesavage J A Tibshirani R Wyss-Coray T

Classification and prediction of clinical Alzheimers diagnosis based on plasma signaling

proteins Nat Med 2007 13 (11) 1359-62

17 Lopez M F Kuppusamy R Sarracino D A Prakash A Athanas M Krastins B

Rezai T Sutton J N Peterman S Nicolaides K Mass spectrometric discovery and selective

reaction monitoring (SRM) of putative protein biomarker candidates in first trimester Trisomy 21

maternal serum J Proteome Res 2011 10 (1) 133-42

18 Sheta E A Appel S H Goldknopf I L 2D gel blood serum biomarkers reveal

differential clinical proteomics of the neurodegenerative diseases Expert Rev Proteomics 2006

3 (1) 45-62

19 Zhang J Goodlett D R Montine T J Proteomic biomarker discovery in

cerebrospinal fluid for neurodegenerative diseases J Alzheimers Dis 2005 8 (4) 377-86

20 Yuan X Desiderio D M Proteomics analysis of prefractionated human lumbar

cerebrospinal fluid Proteomics 2005 5 (2) 541-50

21 Wenner B R Lovell M A Lynn B C Proteomic analysis of human ventricular

cerebrospinal fluid from neurologically normal elderly subjects using two-dimensional LC-

MSMS J Proteome Res 2004 3 (1) 97-103

22 Maccarrone G Milfay D Birg I Rosenhagen M Holsboer F Grimm R Bailey

J Zolotarjova N Turck C W Mining the human cerebrospinal fluid proteome by

immunodepletion and shotgun mass spectrometry Electrophoresis 2004 25 (14) 2402-12

23 Yuan X Desiderio D M Proteomics analysis of human cerebrospinal fluid J

Chromatogr B Analyt Technol Biomed Life Sci 2005 815 (1-2) 179-89

24 Wong M Schlaggar B L Buller R S Storch G A Landt M Cerebrospinal fluid

protein concentration in pediatric patients defining clinically relevant reference values Arch

Pediatr Adolesc Med 2000 154 (8) 827-31

58

25 Roche S Gabelle A Lehmann S Clinical proteomics of the cerebrospinal fluid

Towards the discovery of new biomarkers Proteomics Clin Appl 2008 2 (3) 428-36

26 Rozek W Ricardo-Dukelow M Holloway S Gendelman H E Wojna V

Melendez L M Ciborowski P Cerebrospinal fluid proteomic profiling of HIV-1-infected

patients with cognitive impairment J Proteome Res 2007 6 (11) 4189-99

27 Schutzer S E Liu T Natelson B H Angel T E Schepmoes A A Purvine S O

Hixson K K Lipton M S Camp D G Coyle P K Smith R D Bergquist J

Establishing the proteome of normal human cerebrospinal fluid PLoS One 2010 5 (6) e10980

28 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C

Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome

Res 2008 7 (1) 386-99

29 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from

patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma

biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 2010 878 (22) 2003-12

30 You J S Gelfanova V Knierman M D Witzmann F A Wang M Hale J E The

impact of blood contamination on the proteome of cerebrospinal fluid Proteomics 2005 5 (1)

290-6

31 Yuan X Desiderio D M Human cerebrospinal fluid peptidomics J Mass Spectrom

2005 40 (2) 176-81

32 Ringman J M Schulman H Becker C Jones T Bai Y Immermann F Cole G

Sokolow S Gylys K Geschwind D H Cummings J L Wan H I Proteomic changes in

cerebrospinal fluid of presymptomatic and affected persons carrying familial Alzheimer disease

mutations Arch Neurol 2012 69 (1) 96-104

33 Jahn H Wittke S Zurbig P Raedler T J Arlt S Kellmann M Mullen W

Eichenlaub M Mischak H Wiedemann K Peptide fingerprinting of Alzheimers disease in

cerebrospinal fluid identification and prospective evaluation of new synaptic biomarkers PLoS

One 2011 6 (10) e26540

34 Rajagopal M U Hathout Y MacDonald T J Kieran M W Gururangan S

Blaney S M Phillips P Packer R Gordish-Dressman H Rood B R Proteomic profiling

of cerebrospinal fluid identifies prostaglandin D2 synthase as a putative biomarker for pediatric

medulloblastoma A pediatric brain tumor consortium study Proteomics 2011 11 (5) 935-43

35 Giron P Dayon L Turck N Hoogland C Sanchez J C Quantitative analysis of

human cerebrospinal fluid proteins using a combination of cysteine tagging and amine-reactive

isobaric labeling J Proteome Res 2011 10 (1) 249-58

36 Ting L Rad R Gygi S P Haas W MS3 eliminates ratio distortion in isobaric

multiplexed quantitative proteomics Nat Methods 2011 8 (11) 937-40

37 Michalski A Damoc E Hauschild J P Lange O Wieghaus A Makarov A

Nagaraj N Cox J Mann M Horning S Mass spectrometry-based proteomics using Q

Exactive a high-performance benchtop quadrupole Orbitrap mass spectrometer Mol Cell

Proteomics 2011 10 (9) M111 011015

38 Spirin V Shpunt A Seebacher J Gentzel M Shevchenko A Gygi S Sunyaev

S Assigning spectrum-specific P-values to protein identifications by mass spectrometry

Bioinformatics 2011 27 (8) 1128-34

39 Kellie J F Catherman A D Durbin K R Tran J C Tipton J D Norris J L

Witkowski C E Thomas P M Kelleher N L Robust Analysis of the Yeast Proteome under

59

50 kDa by Molecular-Mass-Based Fractionation and Top-Down Mass Spectrometry Anal Chem

2012 84 (1) 209-15

40 Wilhelm M Kirchner M Steen J A Steen H mz5 Space- and Time-efficient

Storage of Mass Spectrometry Data Sets Mol Cell Proteomics 2012 11 (1) O111 011379

41 Zhou F Sikorski T W Ficarro S B Webber J T Marto J A Online nanoflow

reversed phase-strong anion exchange-reversed phase liquid chromatography-tandem mass

spectrometry platform for efficient and in-depth proteome sequence analysis of complex

organisms Anal Chem 2011 83 (18) 6996-7005

42 Winter D Steen H Optimization of cell lysis and protein digestion protocols for the

analysis of HeLa S3 cells by LC-MSMS Proteomics 2011 11 (24) 4726-30

43 Shteynberg D Deutsch E W Lam H Eng J K Sun Z Tasman N Mendoza L

Moritz R L Aebersold R Nesvizhskii A I iProphet multi-level integrative analysis of

shotgun proteomic data improves peptide and protein identification rates and error estimates

Mol Cell Proteomics 2011 10 (12) M111 007690

44 Kellie J F Catherman A D Durbin K R Tran J C Tipton J D Norris J L

Witkowski C E Thomas P M Kelleher N L Robust Analysis of the Yeast Proteome under

50 kDa by Molecular-Mass-Based Fractionation and Top-Down Mass Spectrometry Anal Chem

84 (1) 209-15

45 Datta A Park J E Li X Zhang H Ho Z S Heese K Lim S K Tam J P Sze

S K Phenotyping of an in vitro model of ischemic penumbra by iTRAQ-based shotgun

quantitative proteomics J Proteome Res 9 (1) 472-84

46 Decramer S Gonzalez de Peredo A Breuil B Mischak H Monsarrat B Bascands

J L Schanstra J P Urine in clinical proteomics Mol Cell Proteomics 2008 7 (10) 1850-62

47 De La Monte S M Wands J R The AD7c-NTP neuronal thread protein biomarker for

detecting Alzheimers disease J Alzheimers Dis 2001 3 (3) 345-353

48 Van Dorsselaer A Carapito C Delalande F Schaeffer-Reiss C Thierse D Diemer

H McNair D S Krewski D Cashman N R Detection of prion protein in urine-derived

injectable fertility products by a targeted proteomic approach PLoS One 2011 6 (3) e17815

49 Kuwabara Y Mine K Katayama A Inagawa T Akira S Takeshita T Proteomic

analyses of recombinant human follicle-stimulating hormone and urinary-derived gonadotropin

preparations J Reprod Med 2009 54 (8) 459-66

50 Court M Selevsek N Matondo M Allory Y Garin J Masselon C D Domon B

Toward a standardized urine proteome analysis methodology Proteomics 2011 11 (6) 1160-71

51 Marimuthu A OMeally R N Chaerkady R Subbannayya Y Nanjappa V Kumar

P Kelkar D S Pinto S M Sharma R Renuse S Goel R Christopher R Delanghe B

Cole R N Harsha H C Pandey A A comprehensive map of the human urinary proteome J

Proteome Res 2011 10 (6) 2734-43

52 Bermejo-Pareja F Antequera D Vargas T Molina J A Carro E Saliva levels of

Abeta1-42 as potential biomarker of Alzheimers disease a pilot study BMC Neurol 2010 10

108

53 Devic I Hwang H Edgar J S Izutsu K Presland R Pan C Goodlett D R

Wang Y Armaly J Tumas V Zabetian C P Leverenz J B Shi M Zhang J Salivary

alpha-synuclein and DJ-1 potential biomarkers for Parkinsons disease Brain 2011 134 (Pt 7)

e178

60

54 Al-Tarawneh S K Border M B Dibble C F Bencharit S Defining salivary

biomarkers using mass spectrometry-based proteomics a systematic review OMICS 2011 15

(6) 353-61

55 Castagnola M Cabras T Vitali A Sanna M T Messana I Biotechnological

implications of the salivary proteome Trends Biotechnol 2011 29 (8) 409-18

56 Caprioli R M Farmer T B Gile J Molecular imaging of biological samples

localization of peptides and proteins using MALDI-TOF MS Anal Chem 1997 69 (23) 4751-

60

57 Stoeckli M Chaurand P Hallahan D E Caprioli R M Imaging mass spectrometry

a new technology for the analysis of protein expression in mammalian tissues Nat Med 2001 7

(4) 493-6

58 Chaurand P Schwartz S A Caprioli R M Imaging mass spectrometry a new tool to

investigate the spatial organization of peptides and proteins in mammalian tissue sections Curr

Opin Chem Biol 2002 6 (5) 676-81

59 Chaurand P Stoeckli M Caprioli R M Direct profiling of proteins in biological

tissue sections by MALDI mass spectrometry Anal Chem 1999 71 (23) 5263-70

60 Seeley E H Caprioli R M MALDI imaging mass spectrometry of human tissue

method challenges and clinical perspectives Trends Biotechnol 2011 29 (3) 136-43

61 Cazares L H Troyer D A Wang B Drake R R Semmes O J MALDI tissue

imaging from biomarker discovery to clinical applications Anal Bioanal Chem 2011 401 (1)

17-27

62 Stauber J Lemaire R Franck J Bonnel D Croix D Day R Wisztorski M

Fournier I Salzet M MALDI imaging of formalin-fixed paraffin-embedded tissues

application to model animals of Parkinson disease for biomarker hunting J Proteome Res 2008

7 (3) 969-78

63 Yuki D Sugiura Y Zaima N Akatsu H Hashizume Y Yamamoto T Fujiwara

M Sugiyama K Setou M Hydroxylated and non-hydroxylated sulfatide are distinctly

distributed in the human cerebral cortex Neuroscience 2011 193 44-53

64 Makridakis M Vlahou A Secretome proteomics for discovery of cancer biomarkers J

Proteomics 2010 73 (12) 2291-305

65 Dowling P Clynes M Conditioned media from cell lines a complementary model to

clinical specimens for the discovery of disease-specific biomarkers Proteomics 2011 11 (4)

794-804

66 Moore N H Costa L G Shaffer S A Goodlett D R Guizzetti M Shotgun

proteomics implicates extracellular matrix proteins and protease systems in neuronal

development induced by astrocyte cholinergic stimulation J Neurochem 2009 108 (4) 891-908

67 Keene S D Greco T M Parastatidis I Lee S H Hughes E G Balice-Gordon R

J Speicher D W Ischiropoulos H Mass spectrometric and computational analysis of

cytokine-induced alterations in the astrocyte secretome Proteomics 2009 9 (3) 768-82

68 Dowell J A Johnson J A Li L Identification of astrocyte secreted proteins with a

combination of shotgun proteomics and bioinformatics J Proteome Res 2009 8 (8) 4135-43

69 Ahmed F E Sample preparation and fractionation for proteome analysis and cancer

biomarker discovery by mass spectrometry J Sep Sci 2009 32 (5-6) 771-98

70 Righetti P G Fasoli E Boschetti E Combinatorial peptide ligand libraries the

conquest of the hidden proteome advances at great strides Electrophoresis 2011 32 (9) 960-6

61

71 Selvaraju S El Rassi Z Liquid-phase-based separation systems for depletion

prefractionation and enrichment of proteins in biological fluids and matrices for in-depth

proteomics analysis - An update covering the period 2008-2011 Electrophoresis 2011 33 (1)

74-88

72 Kalume D E Molina H Pandey A Tackling the phosphoproteome tools and

strategies Curr Opin Chem Biol 2003 7 (1) 64-9

73 Cohen P The regulation of protein function by multisite phosphorylation--a 25 year

update Trends Biochem Sci 2000 25 (12) 596-601

74 Nagaraj N DSouza R C Cox J Olsen J V Mann M Feasibility of large-scale

phosphoproteomics with higher energy collisional dissociation fragmentation J Proteome Res

2010 9 (12) 6786-94

75 Olsen J V Vermeulen M Santamaria A Kumar C Miller M L Jensen L J

Gnad F Cox J Jensen T S Nigg E A Brunak S Mann M Quantitative

phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis Sci

Signal 2010 3 (104) ra3

76 Wagner P D Vu N D Histidine to aspartate phosphotransferase activity of nm23

proteins phosphorylation of aldolase C on Asp-319 Biochem J 2000 346 Pt 3 623-30

77 Chi A Huttenhower C Geer L Y Coon J J Syka J E Bai D L Shabanowitz

J Burke D J Troyanskaya O G Hunt D F Analysis of phosphorylation sites on proteins

from Saccharomyces cerevisiae by electron transfer dissociation (ETD) mass spectrometry Proc

Natl Acad Sci U S A 2007 104 (7) 2193-8

78 Rudrabhatla P Jaffe H Pant H C Direct evidence of phosphorylated neuronal

intermediate filament proteins in neurofibrillary tangles (NFTs) phosphoproteomics of

Alzheimers NFTs FASEB J 25 (11) 3896-905

79 Saez-Valero J Fodero L R Sjogren M Andreasen N Amici S Gallai V

Vanderstichele H Vanmechelen E Parnetti L Blennow K Small D H Glycosylation of

acetylcholinesterase and butyrylcholinesterase changes as a function of the duration of

Alzheimers disease J Neurosci Res 2003 72 (4) 520-6

80 Silveyra M X Cuadrado-Corrales N Marcos A Barquero M S Rabano A

Calero M Saez-Valero J Altered glycosylation of acetylcholinesterase in Creutzfeldt-Jakob

disease J Neurochem 2006 96 (1) 97-104

81 Wei X Li L Comparative glycoproteomics approaches and applications Brief Funct

Genomic Proteomic 2009 8 (2) 104-13

82 Tian Y Zhang H Glycoproteomics and clinical applications Proteomics Clin Appl

2010 4 (2) 124-32

83 Butterfield D A Owen J B Lectin-affinity chromatography brain glycoproteomics

and Alzheimer disease insights into protein alterations consistent with the pathology and

progression of this dementing disorder Proteomics Clin Appl 2011 5 (1-2) 50-6

84 Wei X Herbst A Ma D Aiken J Li L A quantitative proteomic approach to prion

disease biomarker research delving into the glycoproteome J Proteome Res 2010 10 (6) 2687-

702

85 Rucevic M Hixson D Josic D Mammalian plasma membrane proteins as potential

biomarkers and drug targets Electrophoresis 2011 32 (13) 1549-64

86 Gilmore J M Washburn M P Advances in shotgun proteomics and the analysis of

membrane proteomes J Proteomics 2010 73 (11) 2078-91

62

87 Weekes M P Antrobus R Lill J R Duncan L M Hor S Lehner P J

Comparative analysis of techniques to purify plasma membrane proteins J Biomol Tech 2010

21 (3) 108-15

88 Helbig A O Heck A J Slijper M Exploring the membrane proteome--challenges

and analytical strategies J Proteomics 2010 73 (5) 868-78

89 Griffin N M Schnitzer J E Overcoming key technological challenges in using mass

spectrometry for mapping cell surfaces in tissues Mol Cell Proteomics 2011 10 (2) R110

000935

90 Groen A J Lilley K S Proteomics of total membranes and subcellular membranes

Expert Rev Proteomics 2010 7 (6) 867-78

91 Wisniewski J R Tools for phospho- and glycoproteomics of plasma membranes Amino

Acids 2011 41 (2) 223-33

92 Wei X Dulberger C Li L Characterization of murine brain membrane glycoproteins

by detergent assisted lectin affinity chromatography Anal Chem 2010 82 (15) 6329-33

93 Zhang H Guo T Li X Datta A Park J E Yang J Lim S K Tam J P Sze S

K Simultaneous characterization of glyco- and phosphoproteomes of mouse brain membrane

proteome with electrostatic repulsion hydrophilic interaction chromatography Mol Cell

Proteomics 2011 9 (4) 635-47

94 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 2011 10 (9) 4219-29

95 Ma M Chen R Ge Y He H Marshall A G Li L Combining bottom-up and top-

down mass spectrometric strategies for de novo sequencing of the crustacean hyperglycemic

hormone from Cancer borealis Anal Chem 2009 81 (1) 240-7

96 Chen R Ma M Hui L Zhang J Li L Measurement of neuropeptides in crustacean

hemolymph via MALDI mass spectrometry J Am Soc Mass Spectrom 2009 20 (4) 708-18

97 Wei H Nolkrantz K Parkin M C Chisolm C N OCallaghan J P Kennedy R

T Identification and quantification of neuropeptides in brain tissue by capillary liquid

chromatography coupled off-line to MALDI-TOF and MALDI-TOFTOF-MS Anal Chem 2006

78 (13) 4342-51

98 Dowell J A Heyden W V Li L Rat neuropeptidomics by LC-MSMS and MALDI-

FTMS Enhanced dissection and extraction techniques coupled with 2D RP-RP HPLC J

Proteome Res 2006 5 (12) 3368-75

99 Che F Y Fricker L D Quantitative peptidomics of mouse pituitary comparison of

different stable isotopic tags J Mass Spectrom 2005 40 (2) 238-49

100 Fricker L D Analysis of mouse brain peptides using mass spectrometry-based

peptidomics implications for novel functions ranging from non-classical neuropeptides to

microproteins Mol Biosyst 2010 6 (8) 1355-65

101 Colgrave M L Xi L Lehnert S A Flatscher-Bader T Wadensten H Nilsson A

Andren P E Wijffels G Neuropeptide profiling of the bovine hypothalamus thermal

stabilization is an effective tool in inhibiting post-mortem degradation Proteomics 2011 11 (7)

1264-76

102 Scholz B Alm H Mattsson A Nilsson A Kultima K Savitski M M Falth M

Skold K Brunstrom B Andren P E Dencker L Neuropeptidomic analysis of the

embryonic Japanese quail diencephalon BMC Dev Biol 2010 10 30

63

103 Hummon A B Amare A Sweedler J V Discovering new invertebrate neuropeptides

using mass spectrometry Mass Spectrom Rev 2006 25 (1) 77-98

104 Vilim F S Sasaki K Rybak J Alexeeva V Cropper E C Jing J Orekhova I

V Brezina V Price D Romanova E V Rubakhin S S Hatcher N Sweedler J V

Weiss K R Distinct mechanisms produce functionally complementary actions of neuropeptides

that are structurally related but derived from different precursors J Neurosci 2010 30 (1) 131-

47

105 Fu Q Li L De novo sequencing of neuropeptides using reductive isotopic methylation

and investigation of ESI QTOF MSMS fragmentation pattern of neuropeptides with N-terminal

dimethylation Anal Chem 2005 77 (23) 7783-95

106 Chen R Jiang X Conaway M C Mohtashemi I Hui L Viner R Li L Mass

spectral analysis of neuropeptide expression and distribution in the nervous system of the lobster

Homarus americanus J Proteome Res 2010 9 (2) 818-32

107 Altelaar A F Mohammed S Brans M A Adan R A Heck A J Improved

identification of endogenous peptides from murine nervous tissue by multiplexed peptide

extraction methods and multiplexed mass spectrometric analysis J Proteome Res 2009 8 (2)

870-6

108 Gelman J S Sironi J Castro L M Ferro E S Fricker L D Peptidomic analysis

of human cell lines J Proteome Res 2011 10 (4) 1583-92

109 Li L Sweedler J V Peptides in the brain mass spectrometry-based measurement

approaches and challenges Annu Rev Anal Chem (Palo Alto Calif) 2008 1 451-83

110 Marouga R David S Hawkins E The development of the DIGE system 2D

fluorescence difference gel analysis technology Anal Bioanal Chem 2005 382 (3) 669-78

111 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S

Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-

dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of

Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66

112 Ottervald J Franzen B Nilsson K Andersson L I Khademi M Eriksson B

Kjellstrom S Marko-Varga G Vegvari A Harris R A Laurell T Miliotis T

Matusevicius D Salter H Ferm M Olsson T Multiple sclerosis Identification and clinical

evaluation of novel CSF biomarkers J Proteomics 2010 73 (6) 1117-32

113 Diez R Herbstreith M Osorio C Alzate O 2-D Fluorescence Difference Gel

Electrophoresis (DIGE) in Neuroproteomics 2010

114 Lilley K S Razzaq A Dupree P Two-dimensional gel electrophoresis recent

advances in sample preparation detection and quantitation Curr Opin Chem Biol 2002 6 (1)

46-50

115 Oh-Ishi M Maeda T Separation techniques for high-molecular-mass proteins J

Chromatogr B Analyt Technol Biomed Life Sci 2002 771 (1-2) 49-66

116 Washburn M P Wolters D Yates J R 3rd Large-scale analysis of the yeast

proteome by multidimensional protein identification technology Nat Biotechnol 2001 19 (3)

242-7

117 Burke T W Mant C T Black J A Hodges R S Strong cation-exchange high-

performance liquid chromatography of peptides Effect of non-specific hydrophobic interactions

and linearization of peptide retention behaviour J Chromatogr 1989 476 377-89

118 Edelmann M J Strong cation exchange chromatography in analysis of posttranslational

modifications innovations and perspectives J Biomed Biotechnol 2011 2011 936508

64

119 Francois I Sandra K Sandra P Comprehensive liquid chromatography fundamental

aspects and practical considerations--a review Anal Chim Acta 2009 641 (1-2) 14-31

120 Guiochon G Marchetti N Mriziq K Shalliker R A Implementations of two-

dimensional liquid chromatography J Chromatogr A 2008 1189 (1-2) 109-68

121 Song C Ye M Han G Jiang X Wang F Yu Z Chen R Zou H Reversed-

phase-reversed-phase liquid chromatography approach with high orthogonality for

multidimensional separation of phosphopeptides Anal Chem 2010 82 (1) 53-6

122 Alpert A J Hydrophilic-interaction chromatography for the separation of peptides

nucleic acids and other polar compounds J Chromatogr 1990 499 177-96

123 Gilar M Olivova P Daly A E Gebler J C Orthogonality of separation in two-

dimensional liquid chromatography Anal Chem 2005 77 (19) 6426-34

124 Di Palma S Stange D van de Wetering M Clevers H Heck A J Mohammed S

Highly sensitive proteome analysis of FACS-sorted adult colon stem cells J Proteome Res 2011

10 (8) 3814-9

125 Di Palma S Boersema P J Heck A J Mohammed S Zwitterionic hydrophilic

interaction liquid chromatography (ZIC-HILIC and ZIC-cHILIC) provide high resolution

separation and increase sensitivity in proteome analysis Anal Chem 2011 83 (9) 3440-7

126 Zarei M Sprenger A Metzger F Gretzmeier C Dengjel J Comparison of ERLIC-

TiO2 HILIC-TiO2 and SCX-TiO2 for global phosphoproteomics approaches J Proteome Res

2011 10 (8) 3474-83

127 Neue K Mormann M Peter-Katalinic J Pohlentz G Elucidation of glycoprotein

structures by unspecific proteolysis and direct nanoESI mass spectrometric analysis of ZIC-

HILIC-enriched glycopeptides J Proteome Res 2011 10 (5) 2248-60

128 Ow S Y Salim M Noirel J Evans C Wright P C Minimising iTRAQ ratio

compression through understanding LC-MS elution dependence and high-resolution HILIC

fractionation Proteomics 2011 11 (11) 2341-6

129 Alpert A J Electrostatic repulsion hydrophilic interaction chromatography for isocratic

separation of charged solutes and selective isolation of phosphopeptides Anal Chem 2008 80

(1) 62-76

130 Chien K Y Liu H C Goshe M B Development and application of a

phosphoproteomic method using electrostatic repulsion-hydrophilic interaction chromatography

(ERLIC) IMAC and LC-MSMS analysis to study Mareks Disease Virus infection J Proteome

Res 2011 10 (9) 4041-53

131 Liu X Plasencia M Ragg S Valentine S J Clemmer D E Development of high

throughput dispersive LC-ion mobility-TOFMS techniques for analysing the human plasma

proteome Brief Funct Genomic Proteomic 2004 3 (2) 177-86

132 Valentine S J Liu X Plasencia M D Hilderbrand A E Kurulugama R T

Koeniger S L Clemmer D E Developing liquid chromatography ion mobility mass

spectometry techniques Expert Rev Proteomics 2005 2 (4) 553-65

133 Valentine S J Plasencia M D Liu X Krishnan M Naylor S Udseth H R

Smith R D Clemmer D E Toward plasma proteome profiling with ion mobility-mass

spectrometry J Proteome Res 2006 5 (11) 2977-84

134 Valentine S J Ewing M A Dilger J M Glover M S Geromanos S Hughes C

Clemmer D E Using ion mobility data to improve peptide identification intrinsic amino acid

size parameters J Proteome Res 2011 10 (5) 2318-29

65

135 Ahmed F E The role of capillary electrophoresis-mass spectrometry to proteome

analysis and biomarker discovery J Chromatogr B Analyt Technol Biomed Life Sci 2009 877

(22) 1963-81

136 Fonslow B R Yates J R 3rd Capillary electrophoresis applied to proteomic analysis

J Sep Sci 2009 32 (8) 1175-88

137 Haselberg R de Jong G J Somsen G W Capillary electrophoresis-mass

spectrometry for the analysis of intact proteins J Chromatogr A 2007 1159 (1-2) 81-109

138 Haselberg R de Jong G J Somsen G W Capillary electrophoresis-mass

spectrometry for the analysis of intact proteins 2007-2010 Electrophoresis 2011 32 (1) 66-82

139 Huck C W Bakry R Huber L A Bonn G K Progress in capillary electrophoresis

coupled to matrix-assisted laser desorptionionization - time of flight mass spectrometry

Electrophoresis 2006 27 (11) 2063-74

140 Klampfl C W CE with MS detection a rapidly developing hyphenated technique

Electrophoresis 2009 30 Suppl 1 S83-91

141 Simpson D C Smith R D Combining capillary electrophoresis with mass

spectrometry for applications in proteomics Electrophoresis 2005 26 (7-8) 1291-305

142 Li Y Champion M M Sun L Champion P A Wojcik R Dovichi N J Capillary

zone electrophoresis-electrospray ionization-tandem mass spectrometry as an alternative

proteomics platform to ultraperformance liquid chromatography-electrospray ionization-tandem

mass spectrometry for samples of intermediate complexity Anal Chem 2012 84 (3) 1617-22

143 Dai L Li C Shedden K A Lee C J Quoc H Simeone D M Lubman D M

Quantitative proteomic profiling studies of pancreatic cancer stem cells J Proteome Res 2010 9

(7) 3394-402

144 Chen J Balgley B M DeVoe D L Lee C S Capillary isoelectric focusing-based

multidimensional concentrationseparation platform for proteome analysis Anal Chem 2003 75

(13) 3145-52

145 Mischak H Delles C Klein J Schanstra J P Urinary proteomics based on capillary

electrophoresis-coupled mass spectrometry in kidney disease discovery and validation of

biomarkers and clinical application Adv Chronic Kidney Dis 2010 17 (6) 493-506

146 Albalat A Mischak H Mullen W Urine proteomics in clinical applications

technologies principal considerations and clinical implementation Prilozi 2011 32 (1) 13-44

147 Zuberovic A Hanrieder J Hellman U Bergquist J Wetterhall M Proteome

profiling of human cerebrospinal fluid exploring the potential of capillary electrophoresis with

surface modified capillaries for analysis of complex biological samples Eur J Mass Spectrom

(Chichester Eng) 2008 14 (4) 249-60

148 Johannesson N Olsson L Backstrom D Wetterhall M Danielsson R Bergquist

J Screening for biomarkers in plasma from patients with gangrenous and phlegmonous

appendicitis using CE and CEC in combination with MS Electrophoresis 2007 28 (9) 1435-43

149 Guo T Lee C S Wang W DeVoe D L Balgley B M Capillary separations

enabling tissue proteomics-based biomarker discovery Electrophoresis 2006 27 (18) 3523-32

150 Hwang S I Thumar J Lundgren D H Rezaul K Mayya V Wu L Eng J

Wright M E Han D K Direct cancer tissue proteomics a method to identify candidate

cancer biomarkers from formalin-fixed paraffin-embedded archival tissues Oncogene 2007 26

(1) 65-76

66

151 Desiderio C Rossetti D V Iavarone F Messana I Castagnola M Capillary

electrophoresis--mass spectrometry recent trends in clinical proteomics J Pharm Biomed Anal

2010 53 (5) 1161-9

152 Albalat A Mischak H Mullen W Clinical application of urinary

proteomicspeptidomics Expert Rev Proteomics 2011 8 (5) 615-29

153 Ye X Luke B Andresson T Blonder J 18O stable isotope labeling in MS-based

proteomics Brief Funct Genomic Proteomic 2009 8 (2) 136-44

154 Staes A Demol H Van Damme J Martens L Vandekerckhove J Gevaert K

Global differential non-gel proteomics by quantitative and stable labeling of tryptic peptides with

oxygen-18 J Proteome Res 2004 3 (4) 786-91

155 Jorge I Navarro P Martinez-Acedo P Nunez E Serrano H Alfranca A

Redondo J M Vazquez J Statistical model to analyze quantitative proteomics data obtained

by 18O16O labeling and linear ion trap mass spectrometry application to the study of vascular

endothelial growth factor-induced angiogenesis in endothelial cells Mol Cell Proteomics 2009

8 (5) 1130-49

156 Gygi S P Rist B Gerber S A Turecek F Gelb M H Aebersold R Quantitative

analysis of complex protein mixtures using isotope-coded affinity tags Nat Biotechnol 1999 17

(10) 994-9

157 Haqqani A S Kelly J F Stanimirovic D B Quantitative protein profiling by mass

spectrometry using isotope-coded affinity tags Methods Mol Biol 2008 439 225-40

158 Ross P L Huang Y N Marchese J N Williamson B Parker K Hattan S

Khainovski N Pillai S Dey S Daniels S Purkayastha S Juhasz P Martin S Bartlet-

Jones M He F Jacobson A Pappin D J Multiplexed protein quantitation in

Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents Mol Cell Proteomics

2004 3 (12) 1154-69

159 Thompson A Schafer J Kuhn K Kienle S Schwarz J Schmidt G Neumann T

Johnstone R Mohammed A K Hamon C Tandem mass tags a novel quantification strategy

for comparative analysis of complex protein mixtures by MSMS Anal Chem 2003 75 (8)

1895-904

160 Dayon L Hainard A Licker V Turck N Kuhn K Hochstrasser D F Burkhard

P R Sanchez J C Relative quantification of proteins in human cerebrospinal fluids by

MSMS using 6-plex isobaric tags Anal Chem 2008 80 (8) 2921-31

161 DAscenzo M Choe L Lee K H iTRAQPak an R based analysis and visualization

package for 8-plex isobaric protein expression data Brief Funct Genomic Proteomic 2008 7 (2)

127-35

162 Choe L DAscenzo M Relkin N R Pappin D Ross P Williamson B Guertin

S Pribil P Lee K H 8-plex quantitation of changes in cerebrospinal fluid protein expression

in subjects undergoing intravenous immunoglobulin treatment for Alzheimers disease

Proteomics 2007 7 (20) 3651-60

163 Xiang F Ye H Chen R Fu Q Li L NN-dimethyl leucines as novel isobaric

tandem mass tags for quantitative proteomics and peptidomics Anal Chem 2010 82 (7) 2817-

25

164 Ong S E Blagoev B Kratchmarova I Kristensen D B Steen H Pandey A

Mann M Stable isotope labeling by amino acids in cell culture SILAC as a simple and

accurate approach to expression proteomics Mol Cell Proteomics 2002 1 (5) 376-86

67

165 Ong S E Kratchmarova I Mann M Properties of 13C-substituted arginine in stable

isotope labeling by amino acids in cell culture (SILAC) J Proteome Res 2003 2 (2) 173-81

166 Pimienta G Chaerkady R Pandey A SILAC for global phosphoproteomic analysis

Methods Mol Biol 2009 527 107-16 x

167 Zhang G Neubert T A Use of stable isotope labeling by amino acids in cell culture

(SILAC) for phosphotyrosine protein identification and quantitation Methods Mol Biol 2009

527 79-92 xi

168 Bondarenko P V Chelius D Shaler T A Identification and relative quantitation of

protein mixtures by enzymatic digestion followed by capillary reversed-phase liquid

chromatography-tandem mass spectrometry Anal Chem 2002 74 (18) 4741-9

169 Chelius D Bondarenko P V Quantitative profiling of proteins in complex mixtures

using liquid chromatography and mass spectrometry J Proteome Res 2002 1 (4) 317-23

170 Liu H Sadygov R G Yates J R 3rd A model for random sampling and estimation

of relative protein abundance in shotgun proteomics Anal Chem 2004 76 (14) 4193-201

171 Wang W Zhou H Lin H Roy S Shaler T A Hill L R Norton S Kumar P

Anderle M Becker C H Quantification of proteins and metabolites by mass spectrometry

without isotopic labeling or spiked standards Anal Chem 2003 75 (18) 4818-26

172 Silva J C Denny R Dorschel C A Gorenstein M Kass I J Li G Z McKenna

T Nold M J Richardson K Young P Geromanos S Quantitative proteomic analysis by

accurate mass retention time pairs Anal Chem 2005 77 (7) 2187-200

173 Zybailov B Coleman M K Florens L Washburn M P Correlation of relative

abundance ratios derived from peptide ion chromatograms and spectrum counting for

quantitative proteomic analysis using stable isotope labeling Anal Chem 2005 77 (19) 6218-

24

174 Filiou M D Martins-de-Souza D Guest P C Bahn S Turck C W To label or not

to label Applications of quantitative proteomics in neuroscience research Proteomics 2012

175 Neilson K A Ali N A Muralidharan S Mirzaei M Mariani M Assadourian G

Lee A van Sluyter S C Haynes P A Less label more free approaches in label-free

quantitative mass spectrometry Proteomics 2011 11 (4) 535-53

176 Zhu W Smith J W Huang C M Mass spectrometry-based label-free quantitative

proteomics J Biomed Biotechnol 2010 2010 840518

177 Xie F Liu T Qian W J Petyuk V A Smith R D Liquid chromatography-mass

spectrometry-based quantitative proteomics J Biol Chem 2012 286 (29) 25443-9

178 Shen Y Tolic N Xie F Zhao R Purvine S O Schepmoes A A Moore R J

Anderson G A Smith R D Effectiveness of CID HCD and ETD with FT MSMS for

degradomic-peptidomic analysis comparison of peptide identification methods J Proteome Res

2011 10 (9) 3929-43

179 Zubarev R A Kelleher N L McLafferty F W Electron Capture Dissociation of

Multiply Charged Protein Cations A Nonergodic Process Journal of the American Chemical

Society 1998 120 (13) 3265-3266

180 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and

protein sequence analysis by electron transfer dissociation mass spectrometry Proc Natl Acad

Sci U S A 2004 101 (26) 9528-33

181 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-

site identity and position on electron-transfer dissociation of polypeptide cations J Am Chem Soc

2007 129 (40) 12232-43

68

182 Swaney D L McAlister G C Coon J J Decision tree-driven tandem mass

spectrometry for shotgun proteomics Nat Methods 2008 5 (11) 959-64

183 Zhou W Ross M M Tessitore A Ornstein D Vanmeter A Liotta L A

Petricoin E F 3rd An initial characterization of the serum phosphoproteome J Proteome Res

2009 8 (12) 5523-31

184 Liu H Hakansson K Electron capture dissociation of tyrosine O-sulfated peptides

complexed with divalent metal cations Anal Chem 2006 78 (21) 7570-6

185 Snovida S I Bodnar E D Viner R Saba J Perreault H A simple cellulose

column procedure for selective enrichment of glycopeptides and characterization by nano LC

coupled with electron-transfer and high-energy collisional-dissociation tandem mass

spectrometry Carbohydr Res 2010 345 (6) 792-801

186 Sobott F Watt S J Smith J Edelmann M J Kramer H B Kessler B M

Comparison of CID versus ETD based MSMS fragmentation for the analysis of protein

ubiquitination J Am Soc Mass Spectrom 2009 20 (9) 1652-9

187 Eliuk S M Maltby D Panning B Burlingame A L High resolution electron

transfer dissociation studies of unfractionated intact histones from murine embryonic stem cells

using on-line capillary LC separation determination of abundant histone isoforms and post-

translational modifications Mol Cell Proteomics 2010 9 (5) 824-37

188 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 10 (9) 4219-29

189 Molina H Matthiesen R Kandasamy K Pandey A Comprehensive comparison of

collision induced dissociation and electron transfer dissociation Anal Chem 2008 80 (13)

4825-35

190 Olsen J V Macek B Lange O Makarov A Horning S Mann M Higher-energy

C-trap dissociation for peptide modification analysis Nat Methods 2007 4 (9) 709-12

191 McAlister G C Phanstiel D Wenger C D Lee M V Coon J J Analysis of

tandem mass spectra by FTMS for improved large-scale proteomics with superior protein

quantification Anal Chem 2010 82 (1) 316-22

192 Dayon L Pasquarello C Hoogland C Sanchez J C Scherl A Combining low-

and high-energy tandem mass spectra for optimized peptide quantification with isobaric tags J

Proteomics 2010 73 (4) 769-77

193 Second T P Blethrow J D Schwartz J C Merrihew G E MacCoss M J

Swaney D L Russell J D Coon J J Zabrouskov V Dual-pressure linear ion trap mass

spectrometer improving the analysis of complex protein mixtures Anal Chem 2009 81 (18)

7757-65

194 Jedrychowski M P Huttlin E L Haas W Sowa M E Rad R Gygi S P

Evaluation of HCD- and CID-type fragmentation within their respective detection platforms for

murine phosphoproteomics Mol Cell Proteomics 2011 10 (12) M111 009910

195 Frese C K Altelaar A F Hennrich M L Nolting D Zeller M Griep-Raming J

Heck A J Mohammed S Improved peptide identification by targeted fragmentation using

CID HCD and ETD on an LTQ-Orbitrap Velos J Proteome Res 2011 10 (5) 2377-88

196 Chakraborty A B Berger S J Gebler J C Use of an integrated MS--multiplexed

MSMS data acquisition strategy for high-coverage peptide mapping studies Rapid Commun

Mass Spectrom 2007 21 (5) 730-44

69

197 Ramos A A Yang H Rosen L E Yao X Tandem parallel fragmentation of

peptides for mass spectrometry Anal Chem 2006 78 (18) 6391-7

198 Barbara J E Castro-Perez J M High-resolution chromatographytime-of-flight MSE

with in silico data mining is an information-rich approach to reactive metabolite screening Rapid

Commun Mass Spectrom 2011 25 (20) 3029-40

199 Geromanos S J Vissers J P Silva J C Dorschel C A Li G Z Gorenstein M

V Bateman R H Langridge J I The detection correlation and comparison of peptide

precursor and product ions from data independent LC-MS with data dependant LC-MSMS

Proteomics 2009 9 (6) 1683-95

200 Koutroukides T A Guest P C Leweke F M Bailey D M Rahmoune H Bahn

S Martins-de-Souza D Characterization of the human serum depletome by label-free shotgun

proteomics J Sep Sci 34 (13) 1621-6

201 Martins-de-Souza D Guest P C Guest F L Bauder C Rahmoune H Pietsch S

Roeber S Kretzschmar H Mann D Baborie A Bahn S Characterization of the human

primary visual cortex and cerebellum proteomes using shotgun mass spectrometry-data-

independent analyses Proteomics 12 (3) 500-4

202 Scatena R Bottoni P Pontoglio A Giardina B Revisiting the Warburg effect in

cancer cells with proteomics The emergence of new approaches to diagnosis prognosis and

therapy Proteomics Clin Appl 4 (2) 143-58

203 Herberth M Koethe D Cheng T M Krzyszton N D Schoeffmann S Guest P

C Rahmoune H Harris L W Kranaster L Leweke F M Bahn S Impaired glycolytic

response in peripheral blood mononuclear cells of first-onset antipsychotic-naive schizophrenia

patients Mol Psychiatry 16 (8) 848-59

204 Krishnamurthy D Levin Y Harris L W Umrania Y Bahn S Guest P C

Analysis of the human pituitary proteome by data independent label-free liquid chromatography

tandem mass spectrometry Proteomics 11 (3) 495-500

205 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

206 Eng J K McCormack A L Yates Iii J R An approach to correlate tandem mass

spectral data of peptides with amino acid sequences in a protein database Journal of the

American Society for Mass Spectrometry 1994 5 (11) 976-989

207 Craig R Beavis R C TANDEM matching proteins with tandem mass spectra

Bioinformatics 2004 20 (9) 1466-7

208 Geer L Y Markey S P Kowalak J A Wagner L Xu M Maynard D M Yang

X Shi W Bryant S H Open mass spectrometry search algorithm J Proteome Res 2004 3

(5) 958-64

209 Zhang J Xin L Shan B Chen W Xie M Yuen D Zhang W Zhang Z Lajoie

G A Ma B PEAKS DB De Novo sequencing assisted database search for sensitive and

accurate peptide identification Mol Cell Proteomics 2011

210 Liu J Erassov A Halina P Canete M Nguyen D V Chung C Cagney G

Ignatchenko A Fong V Emili A Sequential interval motif search unrestricted database

surveys of global MSMS data sets for detection of putative post-translational modifications

Anal Chem 2008 80 (20) 7846-54

70

211 Beausoleil S A Villen J Gerber S A Rush J Gygi S P A probability-based

approach for high-throughput protein phosphorylation analysis and site localization Nat

Biotechnol 2006 24 (10) 1285-92

212 Han Y Ma B Zhang K SPIDER software for protein identification from sequence

tags with de novo sequencing error J Bioinform Comput Biol 2005 3 (3) 697-716

213 Singh S Springer M Steen J Kirschner M W Steen H FLEXIQuant a novel tool

for the absolute quantification of proteins and the simultaneous identification and quantification

of potentially modified peptides J Proteome Res 2009 8 (5) 2201-10

214 Searle B C Scaffold a bioinformatic tool for validating MSMS-based proteomic

studies Proteomics 10 (6) 1265-9

215 Herbst A McIlwain S Schmidt J J Aiken J M Page C D Li L Prion disease

diagnosis by proteomic profiling J Proteome Res 2009 8 (2) 1030-6

216 Menschaert G Vandekerckhove T T Baggerman G Schoofs L Luyten W Van

Criekinge W Peptidomics coming of age a review of contributions from a bioinformatics

angle J Proteome Res 2010 9 (5) 2051-61

217 Kumar C Mann M Bioinformatics analysis of mass spectrometry-based proteomics

data sets FEBS Lett 2009 583 (11) 1703-12

218 Craig R Cortens J P Beavis R C Open source system for analyzing validating and

storing protein identification data J Proteome Res 2004 3 (6) 1234-42

219 Jones P Cote R G Cho S Y Klie S Martens L Quinn A F Thorneycroft D

Hermjakob H PRIDE new developments and new datasets Nucleic Acids Res 2008 36

(Database issue) D878-83

220 Deutsch E W Lam H Aebersold R PeptideAtlas a resource for target selection for

emerging targeted proteomics workflows EMBO Rep 2008 9 (5) 429-34

221 Miliotis T Ali L Palm J E Lundqvist A J Ahnoff M Andersson T B

Hilgendorf C Development of a highly sensitive method using liquid chromatography-multiple

reaction monitoring to quantify membrane P-glycoprotein in biological matrices and relationship

to transport function Drug Metab Dispos 2011 39 (12) 2440-9

222 Xiang Y Koomen J M Evaluation of Direct Infusion-Multiple Reaction Monitoring

Mass Spectrometry for Quantification of Heat Shock Proteins Anal Chem 2012

223 Ossola R Schiess R Picotti P Rinner O Reiter L Aebersold R Biomarker

validation in blood specimens by selected reaction monitoring mass spectrometry of N-

glycosites Methods Mol Biol 2011 728 179-94

224 Liu H Lam L Dasgupta P K Expanding the linear dynamic range for multiple

reaction monitoring in quantitative liquid chromatography-tandem mass spectrometry utilizing

natural isotopologue transitions Talanta 2011 87 307-10

225 Belov M E Prasad S Prior D C Danielson W F 3rd Weitz K Ibrahim Y M

Smith R D Pulsed multiple reaction monitoring approach to enhancing sensitivity of a tandem

quadrupole mass spectrometer Anal Chem 2011 83 (6) 2162-71

226 Prentice R L Paczesny S Aragaki A Amon L M Chen L Pitteri S J

McIntosh M Wang P Buson Busald T Hsia J Jackson R D Rossouw J E Manson J

E Johnson K Eaton C Hanash S M Novel proteins associated with risk for coronary heart

disease or stroke among postmenopausal women identified by in-depth plasma proteome

profiling Genome Med 2 (7) 48

227 Gelman J S Fricker L D Hemopressin and other bioactive peptides from cytosolic

proteins are these non-classical neuropeptides AAPS J 2010 12 (3) 279-89

71

228 Gelman J S Sironi J Castro L M Ferro E S Fricker L D Hemopressins and

other hemoglobin-derived peptides in mouse brain comparison between brain blood and heart

peptidome and regulation in Cpefatfat mice J Neurochem 2010 113 (4) 871-80

229 Li L Garden R W Sweedler J V Single-cell MALDI a new tool for direct peptide

profiling Trends Biotechnol 2000 18 (4) 151-60

230 Altelaar A M Heck A J Trends in ultrasensitive proteomics Curr Opin Chem Biol

231 Bandura D R Baranov V I Ornatsky O I Antonov A Kinach R Lou X

Pavlov S Vorobiev S Dick J E Tanner S D Mass cytometry technique for real time

single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass

spectrometry Anal Chem 2009 81 (16) 6813-22

232 Trimpin S Inutan E D Herath T N McEwen C N Laserspray ionization a new

atmospheric pressure MALDI method for producing highly charged gas-phase ions of peptides

and proteins directly from solid solutions Mol Cell Proteomics 2010 9 (2) 362-7

233 McEwen C N Larsen B S Trimpin S Laserspray ionization on a commercial

atmospheric pressure-MALDI mass spectrometer ion source selecting singly or multiply

charged ions Anal Chem 2010 82 (12) 4998-5001

234 Wang B Lietz C B Inutan E D Leach S M Trimpin S Producing highly

charged ions without solvent using laserspray ionization a total solvent-free analysis approach at

atmospheric pressure Anal Chem 2011 83 (11) 4076-84

235 Inutan E D Richards A L Wager-Miller J Mackie K McEwen C N Trimpin

S Laserspray ionization a new method for protein analysis directly from tissue at atmospheric

pressure with ultrahigh mass resolution and electron transfer dissociation Mol Cell Proteomics

2010 10 (2) M110 000760

236 Wang H Liu J Cooks R G Ouyang Z Paper spray for direct analysis of complex

mixtures using mass spectrometry Angew Chem Int Ed Engl 49 (5) 877-80

237 Zhang Z Xu W Manicke N E Cooks R G Ouyang Z Silica coated paper

substrate for paper-spray analysis of therapeutic drugs in dried blood spots Anal Chem 84 (2)

931-8

238 Wang H Manicke N E Yang Q Zheng L Shi R Cooks R G Ouyang Z

Direct analysis of biological tissue by paper spray mass spectrometry Anal Chem 83 (4) 1197-

201

239 Liu J Wang H Cooks R G Ouyang Z Leaf spray direct chemical analysis of plant

material and living plants by mass spectrometry Anal Chem 83 (20) 7608-13

240 Liu J Wang H Manicke N E Lin J M Cooks R G Ouyang Z Development

characterization and application of paper spray ionization Anal Chem 82 (6) 2463-71

241 Yoo H J Wang N Zhuang S Song H Hakansson K Negative-ion electron

capture dissociation radical-driven fragmentation of charge-increased gaseous peptide anions J

Am Chem Soc 2011 133 (42) 16790-3

242 Tian Q Price N D Hood L Systems cancer medicine towards realization of

predictive preventive personalized and participatory (P4) medicine J Intern Med 271 (2) 111-

21

243 Hood L Friend S H Predictive personalized preventive participatory (P4) cancer

medicine Nat Rev Clin Oncol 2011 8 (3) 184-7

244 Tian Q Price N D Hood L Systems cancer medicine towards realization of

predictive preventive personalized and participatory (P4) medicine J Intern Med 2012 271

(2) 111-21

72

245 Belda-Iniesta C de Castro J Perona R Translational proteomics what can you do for

true patients J Proteome Res 2011 10 (1) 101-4

246 Schiess R Wollscheid B Aebersold R Targeted proteomic strategy for clinical

biomarker discovery Mol Oncol 2009 3 (1) 33-44

247 Hsu J L Huang S Y Chow N H Chen S H Stable-isotope dimethyl labeling for

quantitative proteomics Anal Chem 2003 75 (24) 6843-52

248 Schmidt A Kellermann J Lottspeich F A novel strategy for quantitative proteomics

using isotope-coded protein labels Proteomics 2005 5 (1) 4-15

249 Wang Y K Ma Z Quinn D F Fu E W Inverse 15

N-metabolic labelingmass

spectrometry for comparative proteomics and rapid identification of protein markerstargets

Rapid Commun Mass Spectrom 2002 16 (14) 1389-97

73

Chapter 3

Protein changes in immunodepleted cerebrospinal fluid from transgenic

mouse models of Alexander disease detected using mass spectrometry

Adapted from ldquoProtein changes in immunodepleted cerebrospinal fluid from transgenic mouse

models of Alexander disease detected using mass spectrometryrdquo Cunningham R Jany P

Messing A Li L Submitted

74

ABSTRACT

Cerebrospinal fluid (CSF) is a low protein content biological fluid with dynamic range

spanning at least nine orders of magnitude in protein content and is in direct contact with the

brain A modified IgY-14 immunodepletion treatment was performed to enhance analysis of the

low volumes of CSF that are obtainable from mice As a model system in which to test this

approach we utilized transgenic mice that over-express the intermediate filament glial fibrillary

acidic protein (GFAP) These mice are models for Alexander disease (AxD) a severe

leukodystrophy in humans From the CSF of control and transgenic mice we report the

identification of 266 proteins with relative quantification of 106 proteins Biological and

technical triplicates were performed to address animal variability as well as reproducibility in

mass spectrometric analysis Relative quantitation was performed using distributive normalized

spectral abundance factor (dNSAF) spectral counting analysis A panel of biomarker proteins

with significant changes in the CSF of GFAP transgenic mice has been identified with validation

from ELISA and microarray data demonstrating the utility of our methodology and providing

interesting targets for future investigations on the molecular and pathological aspects of AxD

75

INTRODUCTION

Alexander disease (AxD) is a fatal neurogenetic disorder caused by heterozygous point

mutations in the coding region of GFAP (encoding glial fibrillary acidic protein)1 The hallmark

diagnostic feature of this disease is the accumulation of astrocytic cytoplasmic inclusions known

as Rosenthal fibers containing GFAP Hsp27 αB-crystallin and other components2-5

Although

several potential treatment strategies6-8

are under investigation clinical trial design is hampered

by the absence of a standardized clinical scoring system or means to quantify lesions in MRI

that could serve to monitor severity and progression of disease One solution to this problem

would be the identification of biomarkers in readily sampled body fluids as indirect indicators of

disease

Cerebrospinal fluid (CSF) has a long history as a surrogate biopsy site for brain or spinal

cord in evaluating diseases of the central nervous system The protein composition of CSF is

well defined at least for the most abundant species of proteins and numerous studies exist that

characterize individual biomarkers or complex patterns of biomarkers in various diseases9 10

GFAP itself is present in CSF (albeit at much lower levels than in brain parenchyma) and in one

study of three Alexander disease patients its levels were markedly increased11

Whether an

increase in CSF GFAP will be a consistent finding in Alexander disease or whether other useful

biomarkers for this disease could be identified through an unbiased analysis of the CSF

proteome is not yet known

The rarity of Alexander disease makes analysis of human samples difficult However

mouse models exist that replicate key features of the disease such as formation of Rosenthal

fibers Unfortunately mouse CSF poses particular problems for proteomic studies and there is

76

an urgent need for technical improvements for dealing with this fluid For instance collection

from an adult mouse typically yields ~10 μL of CSF often with some contamination by blood12

To further complicate analysis CSF has an exceedingly low protein content (~04 μgμL) with

over 60 of the total protein content consisting of a single protein albumin13 14

A number of

techniques have been developed to remove albumin from biological samples including Cibacron

Blue15

IgG immunodepletion16

and IgY immunodepletion17-19

IgY which is avian in origin

offers reduced non-specific binding and increased avidity when compared to IgG antibodies from

rabbits goats and mice20-23

One widely used IgY cocktail is IgY-14 which contains fourteen

specific antibodies specific for albumin IgG transferrin fibrinogen α1-antitrypsin IgA IgM

α2-macroglobulin haptoglobin apolipoproteins A-I A-II and B complement C3 and α1-acid

glycoprotein Since existing protocols for IgY-14 depletion are optimized for use with large

volumes of serum new protocols must be developed to permit its use with the low volumes of a

low protein fluid represented by mouse CSF

Various improvements have also taken place in the field of proteomic analysis that could

facilitate analysis of mouse CSF Data dependent tandem mass spectrometry followed by

quantification of proteins is used in standard shotgun proteomics24-29

Several methods now exist

for introducing quantitation into mass spectrometry including stable isotope labeling30-32

isobaric tandem mass tags33 34

and spectral counting35 36

Spectral counting which is a

frequency measurement that uses MSMS counts of identified peptides as the metric to enable

protein quantitation is attractive because it is label-free and requires no additional sample

preparation Finally recent advances in spectral counting has produced a data refinement

strategy termed normalized spectral abundance factor (NSAF)37 38

and further developed into

distributive NSAF (dNSAF) to address issues with peptides shared by multiple proteins39

77

To identify potential biomarkers in AxD we report a novel scaled-down version of IgY

antibody depletion strategy to reduce the complexity and remove high abundance proteins in

mouse CSF samples The generated spectral counts data were then subjected to dNSAF natural

log data transformation and t-test analysis to determine which proteins differ in abundance when

comparing GFAP transgenics and controls with multiple biological and technical replicates

EXPERIMENTAL DETAILS

Chemicals

Acetonitrile (HPLC grade) urea (gt99) sodium chloride (997) and ammonium

bicarbonate (certified) were purchased from Fisher Scientific (Fair Lawn NJ) Deionized water

(182 MΩcm) was prepared with a Milli-Q Millipore system (Billerica MA) Optima LCMS

grade acetonitrile and water were purchased from Fisher Scientific (Fair Lawn NJ) DL-

Dithiothreitol (DTT) and sequencing grade modified trypsin were purchased from Promega

(Madison WI) Formic acid (ge98) was obtained from Fluka (Buchs Switzerland)

Iodoacetamide (IAA) tris(hydroxymethyl)aminomethane (Tris ge999 ) ammonium formate

(ge99995) glycine (ge985) and IgY-14 antibodies were purchased from Sigma-Aldrich

(Saint Louis MO)

Mice

Transgenic mice over-expressing the human GFAP gene (Tg737 line) were maintained

as hemizygotes in the FVBN background and genotyped via PCR on DNA prepared from tail

samples as described previously40

The mice were housed on a 14-10 light-dark cycle with ad

libitum access to food and water All procedures were conducted using protocols approved by

the UW-Madison IACUC

78

CSF collection

CSF was collected from mice as described previously12

Briefly mice were anesthetized

with avertin (400-600 mgkg ip) A midline sagittal incision was made over the dorsal aspect

of the hindbrain and three muscle layers carefully peeled back to expose the cisterna magna The

membrane covering the cisterna magna was pierced with a 30 gauge needle and CSF was

collected immediately using a flexible plastic pipette Approximately ten microliters of CSF was

collected per animal All samples used for MS analysis showed no visible contamination of

blood

Enzyme-linked immunosorbent assay (ELISA)

A sandwich ELISA was used to quantify GFAP8 Briefly a microtiter plate was coated

with a cocktail of monoclonal antibodies (Covance SMI-26R) Plates were blocked with 5

milk before addition of sample or standards diluted in PBS with Triton-X and BSA A rabbit

polyclonal antibody (DAKO Z334) was used to detect the GFAP followed by a peroxidase

conjugated anti-rabbit IgG antibody (Sigma A6154) secondary antibody The peroxidase activity

was detected with SuperSignal ELISA Pico Chemiluminescent Substrate (PIERCE) and

quantified with a GloRunner Microplate Luminometer Values below the biological limit of

detection (16ngL) were given the value 16ngL before multiplying by the dilution factor

Immunodepletion of abundant proteins

Currently there are no commercial immunodepletion products available for use with CSF

and to address the low protein content of this fluid (~04 microgmicroL) Therefore 100 microL of

purchased IgY-14 resin was coupled with a Pierce Spin Cup using a paper filter from Thermo

Scientific (Waltham MA) CSF samples from either transgenics or controls were first pooled to

100 microL and then added to 100 microL of 20 mM Tris-HCl 300 mM pH 74 (2x dilution buffer) and

79

allowed to mix with the custom IgY-14 depletion spin column on an end-over-end rotor for 30

minutes at 4oC The sample was then centrifuged at 04 rcf for 45 seconds with an Eppendorf

Centrifuge 5415D (Hamburg Germany) The antibodies were then washed with 50 microL 1x

dilution buffer vortexed for 5 minutes centrifuged at 04 rcf for 45 seconds and the flow through

was collected for tryptic digestion The antibodies were then stripped of the bound proteins with

four 025 mL washes of 01 M glycine pH 25 neutralized with two 025 mL washes of 01 M

Tris-HCl pH 80 and washed once with 025mL of 1x dilution buffer The immunodepletion

protocol was repeated for each transgenic or control pooled 100 microL sample (N=3)

Preparation of tryptic digests

The immunodepleted pooled mouse CSF samples (200 microL total volume) were

concentrated to 10 microL using a Thermo Scientific Savant SpeedVac (SVC100 Waltham MA)

To each sample 8 microL of 133 M urea and 1 microL of 050 M DTT were added and allowed to

incubate at 37oC for one hour After incubation 27 microL of 055 M IAA was added for

carboxymethylation and the sample was allowed to incubate for 15 minutes in the dark To

quench the IAA 1 microL of 050 M DTT was added and allowed to react for 10 minutes To

perform trypsin digestion 70 microL of 50 mM NH4HCO3 was then added along with 025 microg

trypsin which had previously been dissolved in 50 mM acetic acid at a concentration of 05

microgmicroL Digestion was performed overnight at 37oC and quenched by addition of 25 microL of 10

formic acid The tryptic peptides were then subjected to solid phase extraction using a Varian

Omix Tip C18 100 microL (Palo Alto CA) Peptides were eluted with 50 ACN in 01 formic

acid concentrated and reconstituted in 30 microL H2O in 01 formic acid

RP nanoLC separation

80

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN Injections consisted of 5 microL of prepared CSF tryptic peptides onto an Agilent

Technologies Zorbax 300 SB-C18 5 microm 5x03 mm trap cartridge (Santa Clara CA) at a flow

rate of 5 microLmin for 5 minutes at 95 A 5 B Separation was performed on a Waters 3 microm

Atlantis dC18 75 microm x 150 mm (Milford MA) using a gradient from 5 to 45 mobile phase B

at 250 nLmin over 90 minutes at room temperature Emitter tips were pulled from 75 microm inner

diameter 360 microm outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using

an in-house model P-2000 laser puller (Sutter Instrument Co Novato CA)

Mass spectrometry data acquisitions

An ion trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with an on-line nanospray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Eksigent nanoLC software (Dublin CA Version 30 Beta Build 080715) to MS acquisition for

all experiments Smart parameter setting (SPS) was set to 700 mz compound stability and trap

drive level was set at 100 Optimization of the nanospray source resulted in dry gas

temperature of 125oC dry gas flow of 40 Lmin capillary voltage of -1300 V end plate offset of

-500 V MSMS fragmentation amplitude of 10V and SmartFragmentation set at 30-300

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms one spectral average rolling average 2 acquisition

81

range of 300-1500 mz and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS

generation the ICC target was set to 300000 maximum accumulation time 5000 ms two

spectral averages acquisition range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required for spectral counting to prevent

loss of spectra Identification of peptides were performed using Mascot41

(Version 23 Matrix

Science London UK) Database searching was performed against SwissProt mus musculus

(house mouse) database (version 575) False positive analyses42

were calculated using an

automatic decoy option of Mascot Results from the Mascot results were reported using

Proteinscape 21 and technical replicates were combined and reported as a protein compilation

using ProteinExtractor (Bruker Daltonics Bremen Germany)

Mascot search parameters were as follows Allowed missed cleavages 2 enzyme

trypsin variable modifications carboxymethylation (C) and oxidation (M) peptide tolerance

plusmn12 Da maximum number of 13

C 1 MSMS tolerance plusmn05 Da instrument type ESI-Trap

Reported proteins required a minimum of one peptide with a Mascot score ge300 with bold red

characterization Spectral counts were determined from the number of MSMS spectra identified

from accepted proteins A bold red peptide combines a bold peptide which represents the first

query result from a submitted MSMS spectrum with the red peptide which indicates the top

peptide for the identified protein Requiring one bold red peptide assists in removal of

homologous redundant proteins and further improves protein results In addition requiring one

82

peptide to be identified by a score gt300 removes the ability for proteins to be identified by

multiple low Mascot scoring peptides

Each immunodepleted biological replicate had technical triplicates performed and the

technical triplicates were summed together by ProteinExtractor Peptide spectral counts were

then summed for each protein and subjected to dNSAF analysis Details for this method can be

found elsewhere37 39

but briefly peptide spectral counts are summed per protein (SpC) based on

unique peptides and a weighted distribution of any shared peptides with homologous proteins

ProteinScape removed 83 homologous proteins found in the current study to bring the total

number of proteins identified to 266 but some non-unique homologous peptides which are

shared by multiple proteins are still present in the resulting 266 remaining proteins To address

these non-unique homologous peptides distributive spectral counting was performed as

described elsewhere39

The dSpC is divided by the proteinrsquos length (L) and then divided by the

summation of the dSpCL from all proteins in the experiment (N) to produce each proteinrsquos

specific dNSAF value

N

i

i

kk

LdSpC

LdSpCdNSAF

1

)(

)()(

The resulting data were then transformed by taking the natural log of the dNSAF value The

means for each protein were calculated using the ln(dNSAF) from N=3 biological replicates and

the whole data set was subjected to the Shapiro-Wilk test for normalityGaussian distribution

performed on the software PAST (Version 198 University of Oslo Norway Osla) The

Shapiro-Wilk test was repeated multiple times to determine a non-zero value for zero spectral

83

counts A non-zero value is required to alleviate the errors of dividing by zero which was

experimentally determined to be 043 The Gaussian data were then subjected to the t-test to

identify statistically significant changes in protein expression

RESULTS AND DISCUSSION

General workflow

Individual CSF samples were manually inspected and samples were only selected that

showed no visual blood contamination Preliminary experiments showed that the maximum

degree of blood contamination estimated from counts of red blood cells in the CSF that was not

visible to the eye was less than 005 by total mass Approximately 10 individual mouse CSF

samples were pooled to achieve the desired 100 μL volume for a single biological replicate The

CSF samples were IgY immunodepleted followed by digestion with trypsin The resulting

digested samples were then de-salted concentrated reconstituted in 30 μL of 01 formic acid

and 5 μL was loaded onto a reversed phase nanoflow HPLC column subjected to a 90 minute

gradient and infused into the amaZon ETD ion trap mass spectrometer The workflow for

mouse CSF analysis is shown in Figure 1 The remainder of the desalted sample was saved for

technical replicates

Immunodepletion for CSF

Currently there are no immunodepletion techniques specifically designed for CSF

Nonetheless the protein profiles between CSF and serum are similar enough to use currently

available immunodepletion techniques designed for serum as a starting point The smallest

commercially available IgY-14 immunodepletion kit is for 15 μL of serum which is equivalent in

protein content to ~15 mL of CSF Therefore for 100 μL CSF one fifteenth of the total IgY-14

84

beads would be used for the equivalent immunodepletion which is 66 μL of proprietary bead

slurry The potential for irreversible binding of abundant proteins to their respective IgY

antibody even after an extra stripping wash and low amounts of total beads made using 66 μL

of IgY-14 bead slurry unrealistic In practice almost doubling the amount of IgY-14 beads (100

μL) used was not sufficient and resulted in albumin and transferrin peptides to be observed in

high abundance (data not shown) The most important protein to immunodeplete is albumin and

it has been reported to be a greater percentage of total CSF protein content (~60) than serum

(~49) in humans14

The difference in albumin percentage supports the results that proprietary

blends of immunodepletion beads for high abundance proteins such as albumin cannot be

scaled down on a strict protein scale and further modifications to the serum immunodepletion

protocol need to be made

Since IgY-14 beads were developed for use with serum all of its protocols need to be

taken into account to modify the protocol for CSF Serum samples should be diluted fifty times

before mixing with the IgY-14 beads but CSF protein concentration is at least one hundred times

lower than serum Therefore CSF is below half the recommended diluted protein concentration

for IgY immunodepletion Consequently multiple steps have been devised to address this

limitation First the binding time between the proteins targeted for removal from the CSF and

IgY-14 beads was doubled during the end-over-end rotor depletion step from the recommended

15 minutes to 30 minutes Second to prevent non-specific protein binding to the antibodies the

CSF samples were diluted with an equal amount (100 μL) of 2X dilution buffer The dilution

buffer of NaCl and Tris-HCl was needed to assist in reducing non-specific binding of proteins to

the 14 antibodies and ensuring the sample is held at physiological pH In addition to these

modifications a doubling of the starting IgY-14 bead slurry to 200 μL provided the desired

85

results Overall this modified protocol results in effective depletion of CSF abundant proteins

using only one-fifth of the antibodies provided by the smallest commercially available platform

Data Analysis

Spectral counting technique for relative quantitation provides numerous benefits for the

study of mouse CSF due to its label-free advantage In contrast isotopic labeling method often

involves additional sample processing that could cause sample loss which is highly undesirable

for low protein content and low volume samples Labeling methods also require a mixing of two

sets of isotopically labeled samples which would effectively increase the sample complexity and

reduce the amount of sample that can be loaded onto the nanoLC column by half In addition

more than two sets of samples can be compared by label-free methods The use of label-free

spectral counting method does not lead to an increase in sample complexity or interference in

quantitation from peptides in the mz window selected for tandem MS Using spectral counting

for relative quantitation however is dependent on reproducible HPLC separation and careful

mass spectrometry operation to minimize technical variability during the study To address

concerns of analytical reliability and run to run deviations base peak chromatograms from two

transgenic IgY-14 immunodepleted biological replicates including two technical replicates of

each were shown to be highly reproducible (Figure 2)

Each biological sample was analyzed in triplicate with the same protocols on the amaZon

ETD with three control and three transgenic samples From the three technical replicates for

each biological replicate the spectral counts of the peptides for the proteins identified were

summed The results from these mouse CSF biological triplicates are shown in Figure 3A for

GFAP overexpressor and Figure 3B for control The summation of spectral counts for each

biological replicate was performed to remove the inherent bias related to data dependent analysis

86

for protein identification One concern in grouping technical replicates is a potential loss of

information regarding analytical variability Figure 4 provides a graphical representation of

variability of technical replicates illustrating the standard deviation of technical replicates with

error bars Figure 4 shows that a statistically changed protein creatine kinase M (A) and an

unchanged protein kininogen-1 (B) have similar variability within (technical replicates) and

between samples (biological replicates) for each protein In addition Figure 4B illustrates that

even with the variability of kininogen-1 the resulting mean shown by the dashed line of control

and transgenic samples were almost equal whereas Figure 4A shows significantly different

expression level of creatine kinase M Performing replicate analysis of each biological sample

(n=3) helps correct for systematic errors deriving from sample handling and pooling of samples

helps reduce random error during the CSF sample collection process

Protein Identification and Spectral Counting Analysis

The data for dNSAF analysis like any mass spectrometry proteomics experiment

requires multiple layers of verification to ensure reliable data Our initial protein identifications

were subjected to a database search using a decoy database from Mascot which resulted in an

average false positive rate below 1 for all the experimental data collected Representative

MSMS spectra between control and GFAP transgenic samples are illustrated in Figure 5

Overall 266 proteins were identified in a combination of control and transgenic samples

(Supplemental Table 1) A total of 349 proteins were initially identified but 83 proteins were

isoforms of previously identified proteins and automatically excluded by ProteinExtractor The

next level of quality control was to only include ln(dNSAF) values from proteins identified by 2

or more unique peptides having a Mascot score of ge300 and observed in two out of three

biological replicates These selection parameters resulted in 106 proteins remaining after

87

dNSAF analysis (Supplemental Table 2) The resulting spectral counts were then subjected to

dSpC in order to account and correct for the systematic error of peptides shared by multiple

proteins (Supplemental Table 3)

It is inevitable in large scale and complex proteomics experiments that some proteins will

be seen in some samples and not others In addition when controls were compared to transgenic

samples as shown in Figure 3C 127 proteins were unique to either control or GFAP transgenic

mice and 139 proteins were seen in both samples From dNSAF equation if the spectral count

is zero the numerator is zero and the value will not be normalized between runs In order to

circumvent the zero spectral counts in dNSAF analysis zero spectral counts must be replaced by

an experimentally determined non-zero value determined to be 043 The 043 spectral counts

for zero spectral counts was calculated by serial Shapiro-Wilk tests to determine the lowest value

(043) for zero spectral counts and maintain a Gaussian distribution for all data sets The 043

value for zero spectral counts in the current study was higher than the 016 reported value for

zero spectral counts in the original NSAF spectral counting study37

Our study may have a

higher zero spectral count value than the previous study because the spectral counting data were

an addition of three technical replicates and three times 016 is close to 043 The normalized

Gaussian data were then subjected to t-test analysis and P-values below 005 are considered as

statistically significant and are presented in Table 1 The proteins with significant up or down

regulation from Table 1 can be further evaluated as how close significant proteins were to a p-

value of 005 such as ganglioside GM2 activator serine protease inhibitor A3N and collagen

alpha-2(I) chain having t-test scores of 0045 0047 and 0043 respectively Proteins exhibiting

a P-value close to 005 were more likely to be highly variable proteins or have smaller fold

changes between control and transgenic samples and thus provide less biological relevancy to

88

future studies The modest change in antithrobin-III which is reduced by 14-fold in transgenic

is included due a low pooled standard deviation in spectral counts

Spectral counting has been analyzed with fold changes derived directly from the average

spectral counts from the technical replicates and then the average of the three biological

replicates We decided to perform additional analysis using fold changes to dig deeper into

proteins that statistically support the null hypothesis from the dNSAF analysis To only pull out

highly confident protein identifications we used the same strict cut-off of two unique peptides

identified per protein as in dNSAF analysis We only accepted proteins with greater than three-

fold change in total spectral counts listed in Table 2 Two proteins contactin-1 (cntn1) and

cathepsin B (CB) illustrate the potential biomarkers missed by dNSAF analysis Cntn1 had zero

spectral count in the transgenic sample and had an average spectral count of 41 in control

samples The lack of any spectral counts in one biological control for cntn1 resulted in a large

standard deviation in the ln(dNSAF) means for the control which resulted in the t-test supporting

the null hypothesis Another example is CB which was detected by numerous spectral counts in

every GFAP transgenic biological replicate with an average spectral count of 97 (Table 2) The

presence of CB in one biological control sample (23 average spectral counts) resulted in a high

standard deviation in the mean of the control samples These examples exhibit a limitation of

dNSAF analysis which could cause a loss of potentially useful information

Previously Identified Proteins with Expression Changes

Previously three proteins have been described as increased in CSF from individual(s)

suffering from AxD In one study a single patientrsquos CSF was found to have elevated levels of

αβ-crystallin and HSP2744

In a second study three patients were reported to have elevated

levels of GFAP with concentrations from 4760 ngL to 30000 ngL (compared to lt175 ngL for

89

controls)11

GFAP was detected in our current study however the other two proteins were not

detected One possibility is that αB-crystallin and Hsp-27 levels in mouse CSF are too low for

detection by MS analysis In addition while the transgenic mice display the hallmark

pathological feature of AxD in the form of Rosenthal fibers they do not have any evident

leukodystrophy and thus may not display the full range of changes in CSF as might be found in

human patients

Creatine Kinase M

Creatine kinase M (M-CK) is a 43 kDa protein whose role is to reversibly catalyze

phosphate transfer between ATP and energy storage compounds M-CK has been primarily

found in muscle tissue and for humans creatine kinase M is diagnostically analyzed in the blood

for myocardial infarction (heart attack) In addition M-CK is also found in Purkinje neurons of

the cerebellum45 46

A related protein creatine kinase B (B-CK) also exhibited an apparent 21

fold increase in transgenic CSF over control but this difference was not statistically different

B-CK concentration is known to be elevated in CSF following head trauma47

or cerebral

infarction48

but decreased in astrocytes in individuals affected by multiple sclerosis49

Cathepsin

The data showed multiple cathepsins were up regulated in the CSF of transgenic mice

when compared to control mice The up regulated cathepsins were S L1 and B isoforms which

are all cysteine proteases Cathepsin S (CS) was never observed in control samples but

observed with an average of 73 spectral counts per analysis Cathepsin L1 (CL1) was up

regulated by 94 times in transgenic mice These two cathepsins exhibited significant changes

using dNSAF and t-test statistics as shown in Table 1 and Cathepsin B (CB) showed a 42 fold

increase in transgenic CSF as shown in Table 2

90

Cathepsins regulate apoptosis in cells50

which is the major mechanism for elimination of

cells deemed by the organism to be dangerous damaged or expendable CL and CB are

redundant in the system as thiol proteases and inhibition of CB by CA-047 causes a diminished

apoptosis response in multiple cell lines51

Intriguingly increased levels of CB or CL are

correlated with poor prognosis for cancer patients and shorter disease-free intervals It is

believed that these proteases degrade the extracellular membrane which allows tumor cells to

invade adjacent tissue and metastasize52

With regards to AxD the up regulation of these

cathepsins may be indicative of the bodyrsquos natural defense and response to Rosenthal fibers

Thus stimulation of these cathepsins may provide a further protective stress response but the

positive correlation between these proteases and cancer highlights the multiple roles of these

proteins in pathological response Alternatively it has been shown that increased CB is involved

with the tumor necrosis factor α (TNFα) induced apoptosis cascade53

The activation of the

TNFα could produce oligodendrocyte toxicity54

with the expression of TNFα being elevated in

tissue samples from mouse models and AxD patients55

The potential for a positive or a negative

effect in increasing or decreasing cathepsins warrant future research with cathepsins and AxD

Contactin-1

Contactin-1 (Cntn1) is a 133 kDa cell surface protein that is highly glycosylated and

belongs to a family of immunoglobulin domain-containing cell adhesion molecues56

Table 2

shows that Cntn1 is down regulated in transgenic mice since spectral counts were only observed

in control mice Cntn1 is expressed in neurons and oligodendrocytes and higher levels were

observed during brain development57

In addition Cntn1 leads to activation of Notch1 which

mediates differentiation and maturation of oligodendrocyte precursor cells (OPC) Although the

mechanism leading to a reduction of Cntn1 in CSF is not known it may reflect a change in

91

astrocyte interactions with either neurons or oligodendrocytes to alter their expression of this

protein

Validation of putative biomarkers and MS proteomics data using ELISA and RNA

microarray data

To further validate the relative protein expression data obtained via MS-based spectral

counting techniques orthogonal immunological and molecular biological approaches have been

examined As GFAP was shown to be elevated in the CSF of transgenic animals we used a

well-defined GFAP ELISA protocol to test its CSF concentration CSF from 8 week old male

mice was collected from both transgenic and control animals Five samples of transgenic CSF

was prepared by pooling four to eight animals for the GFAP ELISA In the case of controls

each sample represents a single animal GFAP concentrations observed by both the MS and

ELISA showed significant increases of GFAP in the CSF when comparing transgenic to control

animals

Another validation of MS spectral counts is observed in a microarray analysis performed

on transgenic mouse olfactory bulb tissue 55

In this paper nine of the proteins found by MS

showed similar gene expression in the microarray (Table 3) It is not surprising that all the genes

observed in the microarray are not the same as the proteins observed by MS analysis Gene

expression and protein synthesis and expression are not always correlated but the similarities

and overlapping trends observed with these two assays are encouraging As shown in Table 3

gene expression analysis also revealed the up regulation of several cathepsin proteins GFAP

and down regulation of contactin-1 in CSF collected from transgenic mice corroborating the

MS-based proteomics results

92

CONCLUSIONS

In this study we have produced a panel of proteins with significant up or down regulation

in the CSF of transgenic GFAP overexpressor mice This mouse model for AxD was consistent

with the previous studies showing elevation of GFAP in CSF The development of a modified

IgY-14 immunodepletion technique for low amounts of CSF was presented This improved

protocol is useful for future investigations to deal with the unique challenges of mouse CSF

analysis Modified proteomics protocols were employed to profile mouse CSF with biological

and technical triplicates addressing the variability and providing quantitation with dNSAF

spectral counting Validation of the MS-based proteomics data were performed using both

ELISA and RNA microarray data to provide further confidence in the changes in the putative

protein biomarkers This study presents three classes of interesting targets for future study in

AxD including CK-MCK-B cathepsin S L1 and B isoforms and contactin-1

93

REFERENCES

1 Brenner M Johnson A B Boespflug-Tanguy O Rodriguez D Goldman J E

Messing A Mutations in GFAP encoding glial fibrillary acidic protein are associated with

Alexander disease Nat Genet 2001 27 (1) 117-20

2 Herndon R M Rubinstein L J Freeman J M Mathieson G Light and electron

microscopic observations on Rosenthal fibers in Alexanders disease and in multiple sclerosis J

Neuropathol Exp Neurol 1970 29 (4) 524-51

3 Alexander W S Progressive fibrinoid degeneration of fibrillary astrocytes associated

with mental retardation in a hydrocephalic infant Brain 1949 72 (3) 373-81 3 pl

4 Iwaki T Kume-Iwaki A Liem R K Goldman J E Alpha B-crystallin is expressed

in non-lenticular tissues and accumulates in Alexanders disease brain Cell 1989 57 (1) 71-8

5 Head M W Goldman J E Small heat shock proteins the cytoskeleton and inclusion

body formation Neuropathol Appl Neurobiol 2000 26 (4) 304-12

6 Messing A Daniels C M Hagemann T L Strategies for treatment in alexander

disease Neurotherapeutics 7 (4) 507-15

7 Tang G Yue Z Talloczy Z Hagemann T Cho W Messing A Sulzer D L

Goldman J E Autophagy induced by Alexander disease-mutant GFAP accumulation is

regulated by p38MAPK and mTOR signaling pathways Hum Mol Genet 2008 17 (11) 1540-

55

8 Hagemann T L Boelens W C Wawrousek E F Messing A Suppression of GFAP

toxicity by alphaB-crystallin in mouse models of Alexander disease Hum Mol Genet 2009 18

(7) 1190-9

9 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C

Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome

Res 2008 7 (1) 386-99

10 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from

patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma

biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 878 (22) 2003-12

11 Kyllerman M Rosengren L Wiklund L M Holmberg E Increased levels of GFAP

in the cerebrospinal fluid in three subtypes of genetically confirmed Alexander disease

Neuropediatrics 2005 36 (5) 319-23

12 DeMattos R B Bales K R Parsadanian M ODell M A Foss E M Paul S M

Holtzman D M Plaque-associated disruption of CSF and plasma amyloid-beta (Abeta)

equilibrium in a mouse model of Alzheimers disease J Neurochem 2002 81 (2) 229-36

13 Wong M Schlaggar B L Buller R S Storch G A Landt M Cerebrospinal fluid

protein concentration in pediatric patients defining clinically relevant reference values Arch

Pediatr Adolesc Med 2000 154 (8) 827-31

14 Roche S Gabelle A Lehmann S Clinical proteomics of the cerebrospinal fluid

Towards the discovery of new biomarkers PROTEOMICS ndash Clinical Applications 2008 2 (3)

428-436

15 Li C Lee K H Affinity depletion of albumin from human cerebrospinal fluid using

Cibacron-blue-3G-A-derivatized photopatterned copolymer in a microfluidic device Anal

Biochem 2004 333 (2) 381-8

94

16 Maccarrone G Milfay D Birg I Rosenhagen M Holsboer F Grimm R Bailey

J Zolotarjova N Turck C W Mining the human cerebrospinal fluid proteome by

immunodepletion and shotgun mass spectrometry ELECTROPHORESIS 2004 25 (14) 2402-

2412

17 Huang L Harvie G Feitelson J S Gramatikoff K Herold D A Allen D L

Amunngama R Hagler R A Pisano M R Zhang W W Fang X Immunoaffinity

separation of plasma proteins by IgY microbeads meeting the needs of proteomic sample

preparation and analysis Proteomics 2005 5 (13) 3314-28

18 Rajic A Stehmann C Autelitano D J Vrkic A K Hosking C G Rice G E Ilag

L L Protein depletion using IgY from chickens immunised with human protein cocktails Prep

Biochem Biotechnol 2009 39 (3) 221-47

19 Huang L Fang X Immunoaffinity fractionation of plasma proteins by chicken IgY

antibodies Methods Mol Biol 2008 425 41-51

20 Greunke K Braren I Alpers I Blank S Sodenkamp J Bredehorst R Spillner E

Recombinant IgY for improvement of immunoglobulin-based analytical applications Clin

Biochem 2008 41 (14-15) 1237-44

21 Xiao Y Gao X Taratula O Treado S Urbas A Holbrook R D Cavicchi R E

Avedisian C T Mitra S Savla R Wagner P D Srivastava S He H Anti-HER2 IgY

antibody-functionalized single-walled carbon nanotubes for detection and selective destruction

of breast cancer cells BMC Cancer 2009 9 351

22 Liu T Qian W J Mottaz H M Gritsenko M A Norbeck A D Moore R J

Purvine S O Camp D G 2nd Smith R D Evaluation of multiprotein immunoaffinity

subtraction for plasma proteomics and candidate biomarker discovery using mass spectrometry

Mol Cell Proteomics 2006 5 (11) 2167-74

23 Hinerfeld D Innamorati D Pirro J Tam S W SerumPlasma depletion with

chicken immunoglobulin Y antibodies for proteomic analysis from multiple Mammalian species

J Biomol Tech 2004 15 (3) 184-90

24 Metz T O Qian W J Jacobs J M Gritsenko M A Moore R J Polpitiya A D

Monroe M E Camp D G 2nd Mueller P W Smith R D Application of proteomics in

the discovery of candidate protein biomarkers in a diabetes autoantibody standardization

program sample subset J Proteome Res 2008 7 (2) 698-707

25 Ru Q C Zhu L A Silberman J Shriver C D Label-free semiquantitative peptide

feature profiling of human breast cancer and breast disease sera via two-dimensional liquid

chromatography-mass spectrometry Mol Cell Proteomics 2006 5 (6) 1095-104

26 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S

Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-

dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of

Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66

27 Rao P V Reddy A P Lu X Dasari S Krishnaprasad A Biggs E Roberts C T

Nagalla S R Proteomic identification of salivary biomarkers of type-2 diabetes J Proteome

Res 2009 8 (1) 239-45

28 Yu K H Barry C G Austin D Busch C M Sangar V Rustgi A K Blair I A

Stable isotope dilution multidimensional liquid chromatography-tandem mass spectrometry for

pancreatic cancer serum biomarker discovery J Proteome Res 2009 8 (3) 1565-76

29 Aebersold R Mann M Mass spectrometry-based proteomics Nature 2003 422

(6928) 198-207

95

30 Ong S E Blagoev B Kratchmarova I Kristensen D B Steen H Pandey A

Mann M Stable isotope labeling by amino acids in cell culture SILAC as a simple and

accurate approach to expression proteomics Mol Cell Proteomics 2002 1 (5) 376-86

31 Hsu J L Huang S Y Chow N H Chen S H Stable-isotope dimethyl labeling for

quantitative proteomics Anal Chem 2003 75 (24) 6843-52

32 Liu H Sadygov R G Yates J R 3rd A model for random sampling and estimation

of relative protein abundance in shotgun proteomics Anal Chem 2004 76 (14) 4193-201

33 Xiang F Ye H Chen R Fu Q Li L NN-dimethyl leucines as novel isobaric

tandem mass tags for quantitative proteomics and peptidomics Anal Chem 82 (7) 2817-25

34 Ross P L Huang Y N Marchese J N Williamson B Parker K Hattan S

Khainovski N Pillai S Dey S Daniels S Purkayastha S Juhasz P Martin S Bartlet-

Jones M He F Jacobson A Pappin D J Multiplexed protein quantitation in

Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents Mol Cell Proteomics

2004 3 (12) 1154-69

35 Zybailov B Coleman M K Florens L Washburn M P Correlation of relative

abundance ratios derived from peptide ion chromatograms and spectrum counting for

quantitative proteomic analysis using stable isotope labeling Anal Chem 2005 77 (19) 6218-

24

36 Old W M Meyer-Arendt K Aveline-Wolf L Pierce K G Mendoza A Sevinsky

J R Resing K A Ahn N G Comparison of label-free methods for quantifying human

proteins by shotgun proteomics Mol Cell Proteomics 2005 4 (10) 1487-502

37 Zybailov B Mosley A L Sardiu M E Coleman M K Florens L Washburn M

P Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae J

Proteome Res 2006 5 (9) 2339-47

38 Mosley A L Florens L Wen Z Washburn M P A label free quantitative

proteomic analysis of the Saccharomyces cerevisiae nucleus J Proteomics 2009 72 (1) 110-20

39 Zhang Y Wen Z Washburn M P Florens L Refinements to label free proteome

quantitation how to deal with peptides shared by multiple proteins Anal Chem 82 (6) 2272-81

40 Messing A Head M W Galles K Galbreath E J Goldman J E Brenner M

Fatal encephalopathy with astrocyte inclusions in GFAP transgenic mice Am J Pathol 1998

152 (2) 391-8

41 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

42 Elias J E Gygi S P Target-decoy search strategy for increased confidence in large-

scale protein identifications by mass spectrometry Nat Methods 2007 4 (3) 207-14

43 You J S Gelfanova V Knierman M D Witzmann F A Wang M Hale J E The

impact of blood contamination on the proteome of cerebrospinal fluid Proteomics 2005 5 (1)

290-6

44 Takanashi J Sugita K Tanabe Y Niimi H Adolescent case of Alexander disease

MR imaging and MR spectroscopy Pediatr Neurol 1998 18 (1) 67-70

45 Hemmer W Wallimann T Functional Aspects of Creatine Kinase in Brain

Developmental Neuroscience 1993 15 (3-5) 249-260

46 Hemmer W Zanolla E Furter-Graves E M Eppenberger H M Wallimann T

Creatine kinase isoenzymes in chicken cerebellum specific localization of brain-type creatine

96

kinase in Bergmann glial cells and muscle-type creatine kinase in Purkinje neurons Eur J

Neurosci 1994 6 (4) 538-49

47 Nordby H K Tveit B Ruud I Creatine kinase and lactate dehydrogenase in the

cerebrospinal fluid in patients with head injuries Acta Neurochirurgica 1975 32 (3) 209-217

48 Bell R D Alexander G M Nguyen T Albin M S Quantification of cerebral

infarct size by creatine kinase BB isoenzyme Stroke 1986 17 (2) 254-60

49 Steen C Wilczak N Hoogduin J M Koch M De Keyser J Reduced Creatine

Kinase B Activity in Multiple Sclerosis Normal Appearing White Matter PLoS ONE 5 (5)

e10811

50 Chwieralski C Welte T Buumlhling F Cathepsin-regulated apoptosis Apoptosis 2006

11 (2) 143-149

51 Droga-Mazovec G Bojic L Petelin A Ivanova S Romih R Repnik U Salvesen

G S Stoka V Turk V Turk B Cysteine cathepsins trigger caspase-dependent cell death

through cleavage of bid and antiapoptotic Bcl-2 homologues J Biol Chem 2008 283 (27)

19140-50

52 Duffy M J Proteases as prognostic markers in cancer Clin Cancer Res 1996 2 (4)

613-8

53 Guicciardi M E Deussing J Miyoshi H Bronk S F Svingen P A Peters C

Kaufmann S H Gores G J Cathepsin B contributes to TNF-alpha-mediated hepatocyte

apoptosis by promoting mitochondrial release of cytochrome c J Clin Invest 2000 106 (9)

1127-37

54 Butt A M Jenkins H G Morphological changes in oligodendrocytes in the intact

mouse optic nerve following intravitreal injection of tumour necrosis factor J Neuroimmunol

1994 51 (1) 27-33

55 Hagemann T L Gaeta S A Smith M A Johnson D A Johnson J A Messing

A Gene expression analysis in mice with elevated glial fibrillary acidic protein and Rosenthal

fibers reveals a stress response followed by glial activation and neuronal dysfunction Hum Mol

Genet 2005 14 (16) 2443-58

56 Falk J Bonnon C Girault J A Faivre-Sarrailh C F3contactin a neuronal cell

adhesion molecule implicated in axogenesis and myelination Biol Cell 2002 94 (6) 327-34

57 Eckerich C Zapf S Ulbricht U Muumlller S Fillbrandt R Westphal M Lamszus

K Contactin is expressed in human astrocytic gliomas and mediates repulsive effects Glia

2006 53 (1) 1-12

97

Table 1 Statistically changed proteins between transgenic and control mouse CSF using

dNSAF analysis

Accession Protein Pa SC

b Fold

Changec

Control

dSpCd

Transgenic

dSpCd

KCRM_MOUSE Creatine kinase M-type 0034 425 30 182 541

HEXB_MOUSE Βeta-hexosaminidase subunit β 0012 183 uarr 0 59

CMGA_MOUSE Chromogranin-A 000078 153 darr 44 0

ANT3_MOUSE Antithrombin-III 0017 176 -14 67 47

SAP3_MOUSE Ganglioside GM2 activator 0045 415 darr 28 0

SPA3N_MOUSE Serine protease inhibitor A3N 0047 170 42 10 42

CO1A2_MOUSE Collagen alpha-2(I) chain 0043 56 darr 19 0

BLVRB_MOUSE Flavin reductase 00054 204 uarr 0 12

CATS_MOUSE Cathepsin S 00032 232 uarr 0 73

GFAP_MOUSE Glial fibrillary acidic protein 0021 107 uarr 0 21

RET4_MOUSE Retinol-binding protein 4 0040 189 darr 13 0

CBPE_MOUSE Carboxypeptidase E 0043 139 darr 11 0

CATL1_MOUSE Cathepsin L1 0015 87 94 02 19

The statistics are performed using the t-test from the ln(dNSAF) Gaussian data

a P p-value of the t-test where the null hypothesis states that there was no change in expression between

control and transgenic GFAP overexpressor b SC percentage of sequence coverage of the listed protein from

sequenced tryptic peptides c Fold Change positive values are indicative of a fold change for transgenic CSF

negative values are fold changes for control CSF (ie down-regulated in transgenic CSF) uarr indicates the protein

was only detected in transgenic CSF and darr indicates the protein was only detected in control CSF d dSpC

distributive spectral counts which represent the average spectral counts observed per run analysis on the mass

spectrometer and corrected using distributive analysis for peptides shared by more than one protein

98

Table 2 Proteins showing greater than three-fold changes with at least two unique

peptides identified for each protein

Accession Protein SC ()a Fold

Change b

Control

dSpC c

Transgenic

dSpC c

MUG1_MOUSE Murinoglobulin-1 precursor 147 42 155 37

CO4B_MOUSE Complement C4-B 113 54 22 118

PRDX6_MOUSE Peroxiredoxin-6 576 46 14 64

CNTN1_MOUSE Contactin-1 65 darr 41 0

CATB_MOUSE Cathepsin B 263 42 23 97

CAH3_MOUSE Carbonic anhydrase 3 396 76 11 84

APOH_MOUSE Beta-2-glycoprotein 1 128 44 44 1

CSF1R_MOUSE Macrophage colony-stimulating

factor 1 receptor

80 44 14 62

PGK1_MOUSE Phosphoglycerate kinase 1 355 36 17 61

NDKB_MOUSE Nucleoside diphosphate kinase B 408 34 13 44

FHL1_MOUSE

Four and a half LIM domains

protein 1 243 39 13 51

NELL2_MOUSE

Protein kinase C-binding protein

NELL2 45 -43 13 03

MDHM_MOUSE

Malate dehydrogenase

mitochondrial 385 41 12 49

CSF1R_MOUSE Macrophage colony-stimulating

factor 1 receptor

80 44 14 62

a SC percentage of sequence coverage of the listed protein from sequenced tryptic peptides b Fold

Change positive values are indicative of a fold change for transgenic CSF negative values are fold changes for

control CSF and darr indicates the protein was only detected in control CSF c dSpC distributive spectral counts

which represent the average spectral counts observed per run analysis on the mass spectrometer and corrected using

distributive analysis for peptides shared by more than one protein

99

Table 3 Validation of changes in proteins revealed by MS-based spectral counting

consistent with previously published microarray data

Consistent changes in RNA and proteomic data

uarr regulated in transgenic darr regulated in transgenic

Cathepsin S Contactin-1

Cathepsin B Carboxypeptidase E

Cathepsin L1

Peroxiredoxin-6

Complement C4-B

Glial fibrillary acidic protein

Serine protease inhibitor A3N

Note Validation of putative biomarkers from the current proteomics dataset by previously

published RNA microarray data55

Both up and down regulated proteins were consistent with the

RNA microarray data

_

100

___________________________________________

SUPPLEMENTAL INFORMATION (Available upon request)

Table S1 Compilation list of proteins identified from all the control and transgenic biological

replicates

Table S2 Distributive spectral counting calculations performed for proteins observed to share

identified peptides

Table S3 Proteins with dNSAF spectral counting performed including t-test analysis for a

comparison between transgenic and control CSF

101

FIGURE LEGENDS

Figure 1 The general workflow indicating the major steps involved in sample collection sample

processing mass spectrometric data acquisition and analysis of mouse CSF samples

Figure 2 Assessment of run to run variability of the base peak chromatograms within and

between two biological and technical replicates The peak profile and intensity scale is

consistent between the four chromatograms The four panels show two biological replicates (Tg

4 and Tg5) with two technical replicates for each biological sample

Figure 3 A Venn-diagram of the biological triplicates (1 2 and 3) of transgenic mouse

CSF showing 78 proteins identified in all three experiments B Venn-diagram of the biological

triplicates (1 2 and 3) of control mouse CSF having 61 proteins identified by all three

replicates C The overlap between control and transgenic CSF proteomic analysis showing 139

proteins identified by both groups and 73 and 54 uniquely identified by respective groups

Figure 4 Assessment of technical replicate variability between biological replicates The error

bars in both A and B are the standard deviation derived from the technical triplicates for each

biological replicate Panel A shows creatine kinase M having more or equal variability in the

biological triplicates than each technical triplicate The means of the biological triplicates are

illustrated by the dashed line Panel B represents kininogen-1 which is unchanged between

control (Ctl) and transgenic (Tg) samples The biological variability coupled with the technical

replicates provides a barely noticeable difference in the pooled mean between control and

102

transgenic spectral counts The difference in means is contrasted with the three fold change

observed from creatine kinase M (A)

Figure 5 MSMS profile of the tryptic peptide LSVEALNSLTGEFK from creatine kinase M

(A) The top MSMS is from a control sample with an elution at 463 minutes (B) The bottom

MSMS is from the GFAP transgenic sample with an elution at 46 minutes These tandem MS

spectra show instrument reliability and consistent fragmentation patterns which are necessary for

spectral counting analysis

Figure 6 Validation of MS spectral counts using a GFAP ELISA GFAP content (ngL)

measured within mouse CSF from both transgenic and control animals The data represents the

average with standard deviation (n=5 each sample represents pooling of 1 to 8 animals) The

statistics are performed using a student t-test plt00001

103

Figure 1

104

Figure 2

105

Figure3

106

Figure 4

107

Figure 5

108

Figure 6

Ctl Tg

100

1000

10000

100000

Mouse CSF Sample

GF

AP

(n

gL

)

109

Table of Contents Summary

Cerebrospinal fluid (CSF) was obtained from transgenic mouse models of Alexander disease as

well as healthy controls We immunodepleted pooled CSF from mice by a modified IgY-14

protocol Shotgun proteomics utilizing trypsin digestion 1-D nanoRP separation CID tandem

mass spectrometry analysis Mascot database searching and relative quantitation via distributive

normalized spectral abundance factor resulted in the identification of 266 proteins and 27

putative biomarkers

110

Chapter 4

Genomic and proteomic profiling of rat adapted scrapie

Adapted from ldquoGenomic and proteomic profiling of rat adapted scrapierdquo Herbst A

Cunningham R Wang J Wellner D Ma D Li L Aiken J In preparation

111

Abstract

A novel model of prion disease using rats called Rat Adapted Scrapie (RAS) was

developed with the genomics from brain and proteomics from cerebrospinal fluid (CSF) profiled

The CSF proteome from control and RAS (biological N=5 technical replicates N=3) were

digested and separated using one dimensional reversed-phase nanoLC coupled to data-

dependent tandem MS acquisition on an ion trap mass spectrometer In total 512 proteins 167

non-redundant protein groups and 1032 unique peptides were identified with a 1 false

discovery rate (FDR) Of the proteins identified 21 were statistically up-regulated (plt005) and

7 statistically down-regulated in the RAS CSF We also identified 1048 genes which were

differentially regulated in rat prion disease and upon mapping these changes to mouse gene

expression however only 22 of these genes were in common with mRNAs responding to

prion infection in mice suggesting that the molecular pathology observed in mice may not be

applicable to other species The proteins are compared to the differentially regulated genes as

well as to previously published proteins showing changes consistent with other prion animal

models

112

Introduction

Prion diseases are an unusual class of fatal transmissible neurodegenerative disorders

that affect the mammalian central nervous system They are caused by the accumulation of an

abnormal conformation of the normal host encoded cellular prion protein PrPC This

conformational rearrangement of PrPC is brought about by template directed misfolding wherein

seed molecules of the abnormal isoform PrPScrapie

PrPSc

convert PrPC into new PrP

Sc molecules

Scrapie in sheep and goats is the prototypic prion disease and is so named because clinically

affected animals scrape their wool off on pen and stall walls Laboratory investigation of prion

diseases typically relies upon rodents which can be infected with natural isolates of scrapie1

albeit with some difficulty as ovid scrapie isolates need time to adapt to rodents This adaptation

is characteristic of prion disease interspecies transmissions and properly reflects the molecular

adaptation that must occur to allow interaction between exogenous foreign PrPSc

and host PrPC

molecules selecting for conformers which exhibit template directed misfolding In some cases

no conformational solution is found reflecting a species barrier to disease transmission

In recent years advances in genomics and proteomics technologies have allowed

unprecedented examination of the biomolecules that are altered upon exposure to prion agents

These studies2 3

have relied upon analysis of gene and protein expression changes in response to

prion infection with the aim of trying to identify pathways that might underlie the mechanism of

prion-induced neurotoxicity A second important aim has been to identify signature molecules

that might act as surrogate biomarkers for these diseases as there are significant analytical

challenges associated with sensitively detecting and specifically distinguishing disease-induced

conformational changes (PrPSc

) of the prion protein from normal host conformations (PrPC)

113

Mass spectrometry (MS)-based proteomics has become a promising tool for biomarker

discovery from biological fluids such as CSF blood and urine4-6

Two-dimensional gel

electrophoresis MS (2D-GE MS) and two-dimensional differential gel electrophoresis (2D-DIGE

MS) are the most widely used methods for proteomic profiling of prion infected CSF so far due

to the advantage of ready separation and quantification of proteins in complex biological samples

Studies using 2D-GE MS or 2D-DIGE MS to profile proteins in CSF have led to the

identifications of currently accepted biomarker for sCJD 14-3-3 protein and other potential

biomarkers for prion diseases7-9

However the application of this method in biomarker

discovery is limited by insufficient sensitivity and potential bias against certain classes of

proteins as gel-based separation does not work well for the low abundance proteins very basic

or acidic proteins very small or large proteins and hydrophobic proteins 10 11

In contrast to 2D-

GE or 2D-DIGE shotgun proteomics employs enzymatic digestion of the protein samples

followed by chromatographic separation prior to introduction into a mass spectrometer for

tandem MS analyses Shotgun proteomics has become increasingly popular in proteomic

research because these methods are reproducible highly automated and have a greater

likelihood of detecting low abundance proteins12 13

Due to the sample complexity in CSF and

because albumin comprises over half of the protein content in CSF removal of high-abundance

proteins including albumin is necessary to improve proteomic coverage and identify low-

abundance proteins One method is IgY immunodepletion14 15

which is performed prior to LC-

MSMS and uses species specific antibodies from chicken yolk to remove abundant proteins in

biological samples such as CSF In the present work CSF from control and rat adapted scrapie

animal models (biological N=5 technical N=3 replicates) were analyzed by LC-MSMS and we

114

indentified 512 proteins and 167 non-redundant protein isoforms (referred to as protein groups)

with 21 proteins being statistically up-regulated (p lt 005) and 7 statistically down-regulated

By and large this work has been performed using laboratory mice for the gene

expression studies and human cerebrospinal fluid for proteomics mice do not provide sufficient

volumes of CSF for proteomic studies Both approaches are exceedingly valuable as the mouse

model allows cross-sectional time course experiments to be performed including the important

pre-clinical phase of disease Critically however the relevance and generalizability of mouse

prion responses to other prion diseases especially human disease is unknown Human proteomic

studies on CSF guarantee relevant protein identifications but are limited to the clinical phase of

the disease when apparent markers may reflect gross neurodegeneration covering up subtle but

more specific responses To address these issues we have adapted mouse RML prions into rats

with the aim of expanding the knowledge of prion disease responses addressing the limitations

of mice and opening up the possibility of preclinical proteomic profiling in a laboratory rodent

In the present work CSF samples from control and rat adapted scrapie were analyzed by system

biology approaches Here we show that Rat Adapted Scrapie (RAS) is a powerful tool for an -

omics based approach to decipher the molecular impact of prion disease in vivo with

applicability to the molecular mechanisms of disease and biomarker discovery We identified

1048 genes which were differentially regulated in rat prion disease Astonishingly on the whole

mouse orthologs of these genes did not change in mouse adapted scrapie and vice versa

questioning the universality of previous mouse gene expression profiles These RAS gene

expression changes were identified in the CSF proteome where we detected 512 proteins and 167

protein groups with 21 proteins being statistically up-regulated (plt005) and 7 statistically down-

115

regulated in the CSF of prion diseased rats Many of the proteins detected have previously been

observed in human CSF from CJD patients

Materials and Methods

Ethics Statement

This study was carried out in accordance with the recommendations in the NIH Guide for Care

and Use of Laboratory Animals and the guidelines of the Canadian Council on Animal Care The

protocols used were approved by the Institutional Animal Care and Use Committees at the

University of Wisconsin and University of Alberta

Chemicals

Acetonitrile (HPLC grade) and ammonium bicarbonate (certified) were purchased from

Fisher Scientific (Fair Lawn NJ) Deionized water (182 MΩcm) was prepared with a Milli-Q

Millipore system (Billerica MA) Optima LCMS grade acetonitrile and water were purchased

from Fisher Scientific (Fair Lawn NJ) DL-Dithiothreitol (DTT) and sequencing grade modified

trypsin were purchased from Promega (Madison WI) Formic acid (ge98) was obtained from

Fluka (Buchs Switzerland) Iodoacetamide (IAA) tris(hydroxymethyl)aminomethane (Tris

ge999 ) ammonium formate (ge99995) glycine (ge985) and IgY-7 antibodies were

purchased from Sigma-Aldrich (Saint Louis MO)

Rat Transmission and Adaptation

Prion agents used were the rocky mountain lab RML strain of mouse adapted scrapie

Stetsonville transmissible mink encephalopathy16

(TME) Hyper (Hy) strain of Hamster TME 17

1st passage Skunk adapted TME prepared as described and C from genetically defined

transmissions18

116

Brains from animals clinically affected with prion disease were aseptically removed and

prepared as 10 wv homogenates in sterile water 50 microL of 10 brain homogenate was

inoculated into weanling wistar rats intracranially After one year of incubation preclinical rats

from RML infections were euthanized by CO2 inhalation and the brain excised homogenized

and re-inoculated into naive animals Subsequent serial passages were from rats clinically

affected with rat adapted scrapie

Brains from rat passages were aseptically removed and bisected sagittally Brain halves

were reserved for immunohistochemistry (IHC) in formalin or frozen for immunoblotting RNA

isolation or subsequent passage For IHC tissues were first fixed in 10 buffered formalin

followed by antigen retrieval at 121 oC 21 bar in 10mM citrate buffer for 2 min After cooling

to room temperature sections were treated with 88 formic acid for 30 min and 4M guanidine

thiocyanate for 2 h Endogenous peroxidases were inactivated by 003 hydrogen peroxide and

tissue was blocked with 1 normal mouse serum Mouse anti-PrP mAB SAF83 (Cayman

Chemical) was biotinylated and applied at a 1250 dilution in blocking buffer overnight at 4 oC

Washes were performed in 10mM PBS with 005 tween-20 Streptavidin-peroxidase

(Invitrogen) was added for Diaminobenzidine (DAB) color reaction Anti-GFAP

immunohistochemistry was performed as above except that formic acid and guanidine treatment

steps were excluded Mouse anti-GFAP mab G-A-5 (Sigma) was used at a 1400 dilution

Brain samples for immunoblot were treated with or without Proteinase K (Roche) at a

ratio of 35 microg PK 100 ug protein at 50 microg mL for 30 minutes at 37 oC Phosphotungstenic acid

enrichments were performed as described14 19

Bis-Tris SDS-PAGE was performed on 12

polyacrylamide gels (Invitrogen) and transferred to PVDF Immunoblotting was performed using

117

mABs SAF83 (Cayman Chemical) 6H4 (Prionics) or 3F10 (a kind gift from Yong-Sun Kim) all

at a 120000 dilution

Gene Expression Profiling

RNA was extracted from frozen brain halves from clinically affected and control animals with

the QIAshredder and RNeasy mini kit (Qiagen Valencia CA) in accordance with the

manufacturerrsquos recommendations Due to the relatively large mass of rat brain an initial

homogenization was performed with a needle and syringe in 5mL of buffer RLT before further

diluting an aliquot in accordance with the manufacturers protocol Total RNA was amplified and

labeled in preparation for chemical fragmentation and hybridization with the MessageAmp

Premier RNA amplification and labeling kit (Life Technologies Grand Island NY) Amplified

and labeled cRNAs were hybridized on Affymetrix (Santa Clara CA) rat genome 230 20 high

density oligonucleotide arrays in accordance with the manufacturers recommendations

Differentially Expressed Genes were identified using Arraystar 50 (DNAStar Madison WI)

Robust multi-array normalization using the quantile approach was used to normalize all

microarray data A moderated T-test with a multiple comparison adjustment20

was used to reduce

the false discovery rate yet preserve a meaningful number of genes for pathway analysis

Pathway analysis was performed using the DAVID Bioinformatics database21

Comparative

analysis of genes induced by prions in mouse22

and rat disease was performed on genes

exhibiting at least a 2-fold change in expression Orthologs of rat and mouse genes were

identified using ENSEMBLE biomart release 6823

CSF Proteomic Profiling

118

CSF was collected from rats anaesthetized with isoflorane via puncture of the cisterna

magna A volume of 100-200 microL was routinely obtained CSF was touch spun for 30s at 2000xg

on a benchtop nano centrifuge to identify any blood contamination by the presence of a red

pellet CSF with blood contamination was not used for proteomic analysis CSF was prepared

for profiling by first depleting abundant proteins with an antibody based immunopartitioning

column IgY-R7 (Sigma-Aldrich Saint Louis MO) The manufacturers recommendations were

followed with a few deviations 40 microg of protein (~100 microL CSF) was bound to 100 microL of IgY

bead slurry in total volume of 600 microL formulated in 150mM ammonium bicarbonate The flow

through was collected and denatured by the addition of 10 microL of 8 M guanidine thiocyanate and

lyophilized in a speed-vac apparatus The sample was then reconstituted in 10 microL of water and 1

microL of 050 M DTT were added and allowed to incubate at 37oC for 30 minutes After incubation

27 microL of 055 M IAA was added for carboxymethylation and the sample was allowed to

incubate in the dark for 15 minutes After that 1 microL of 050 M DTT was added and allowed to

sit at room temperature for 10 minutes To perform trypsin digestion 70 microL of 50 mM

NH4HCO3 was then added along with 025 microg trypsin Digestion was performed overnight at

37oC and quenched by addition of 5 microL of 10 formic acid The tryptic peptides were then

subjected to solid phase extraction using an Agilent OMIX Tip C18 100 microL (Santa Clara CA)

Peptides were eluted with 50 ACN in 01 formic acid concentrated and reconstituted in 30

microL H2O with 01 formic acid

RP nanoLC separation

The setup for nanoLC consisted of a nanoAcquity (Milford MA) with a 10 microL injection

loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B consisted of

ACN Injections consisted of 5 microL of prepared CSF tryptic peptides onto a Waters 5 microm

119

Symmetry C18 180 microm x 20 mm trap cartridge (Milford MA) at a flow rate of 75 microLmin for 5

minutes at 95 A 5 B Separation was performed on a Waters 17 microm BEH130 C18 75 microm x

100 mm (Milford MA) with a gradient of 5 to 15 B over 10 minutes followed by 15 to

40 B over 80 minutes at room temperature

Mass spectrometry data acquisitions

An ion trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with a Bruker CaptiveSpray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Waters Acquity console software to perform MS acquisitions for all experiments Smart

parameter setting (SPS) was set to 700 mz compound stability and trap drive level was set at

100 Optimization of the CaptiveSpray source resulted in dry gas temperature of 160oC dry

gas flow of 30 Lmin capillary voltage of -1325 V end plate offset of -500 V MSMS

fragmentation amplitude of 10V and SmartFragmentation set at 30-300

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms one spectral average rolling average 2 acquisition

range of 350-1500 mz and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS

generation the ICC target was set to 300000 maximum accumulation time 5000 ms two

spectral averages acquisition range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

120

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required for spectral counting to prevent

loss of spectra Identification of peptides were performed using Mascot24

(Version 24 Matrix

Science London UK) Database searching was performed against a forward and reversed

concatenated SwissProt Rattus rattus Mascot search parameters were as follows Allowed

missed cleavages 2 enzyme trypsin fixed modification carboxymethylation (C) variable

modifications oxidation (M) peptide tolerance plusmn12 Da maximum number of 13

C 1 MSMS

tolerance plusmn05 Da instrument type ESI-Trap Data was then transformed into Dat file formats

and transferred to ProteoIQ (NuSep Bogart GA) False positive analyses were calculated using

ProteoIQ and set at 1

Results

Development of Rat Adapted Scrapie

To develop a rat adapted strain of prion disease we introduced 6 different prion agents RML

TME Hy Skunk Adapted TME and Chronic Wasting Disease (CWD) from wild type 96G and

96S deer16-18

into 6 rats (Fig 1) Of these primary transmissions only RML induced the

accumulation of Proteinase K resistant PrP after one year of incubation as determined by western

blotting on 10 brain homogenates and PrPSc

enriched phoshotungstenic acid precipitated brain

homogenates Second passage of rat adapted RML scrapie resulted in clinically affected rats at

565 days post inoculation Serial passages of TME or CWD agents were not performed Clinical

symptoms consisted of ataxia lethargy wasting kyphosis and myoclonus Prion affected rats

121

also showed low level porphyrin staining around their head Subsequent serial passage decreased

incubation time to 215 days

Proteinase K resistant prion protein was observed from all clinically affected animals both by

immunoblot (Fig 2) and by immunohistochemistry (Fig 3) Di- and mono-glycosylated bands

were the most abundant isoforms of PrPSc

PrPSc

was extensively deposited in the cerebral cortex

hippocampus thalamus inferior colliculus and granular layer of the cerebellum GFAP

expressing activated astrocytes were found throughout the brain particularly in the white matter

of the hippocampus thalamus and cerebellum Spongiform change was a dominant feature of

clinical rat

Gene expression Profiling

In total 1048 genes were differentially regulated within a 95 confidence interval

(Supplementary Table 1) and 305 genes were differentially expressed by greater than 2-fold (Fig

4) The 1048 genes that were statistically significant were used for pathway analysis using

DAVID Pathway analysis suggested that the gene expression profile was consistent with

immune activation and maturation as well as inflammation (Supplementary Table 2) a likely

interpretation given the observable astrocytosis (Fig 3) and gliosis associated with prion disease

Other pathways highlighted by the analysis included increases in transcription of genes involved

in lysosomes and endosomes

To further probe the gene expression data we compared genes which were differentially

expressed in rat adapted scrapie against their orthologs expression in mouse scrapie and vice

versa (Figs 5 and 6) We did not observe a high degree of correlation between expression fold

changes For example GFAP a gene whose up-regulation in prion disease is well known was

122

increased 25 fold in rat adapted scrapie but up-regulated 93 fold in mouse scrapie A

qualitative analysis of expression of orthologs in prion disease suggests that many genes

deregulated at least 2-fold in mouse or rat do not have orthologs that are differentially expressed

For example the gene RNAseT2 was up-regulated greater than threefold in rat adapted scrapie

but was not significantly up-regulated in mouse Similarly three genes important in metals

homeostatsis metallothionein 1a and 2a and ceruloplasmin were up-regulated in rat 46 43 and

3 fold respectively but were not differentially expressed in mouse prion disease

CSF Proteomics

Each immunodepleted biological replicate (N=5 for each control and RAS) had technical

triplicates performed The triplicates were summed together using ProteoIQ Peptide spectral

counts were then summed for each protein and subjected to dNSAF analysis using ProteoIQ

internal algorithms Details for this method can be found elsewhere25 26

but briefly peptide

spectral counts are summed per protein (SpC) based on unique peptides and a weighted

distribution of any shared peptides with homologous proteins T-tests were used to identify

significant changes in protein expression 1032 unique peptides which identify 512 proteins and

167 protein groups were found Of these 512 proteins 437 were identified in both RAS and

control CSF samples (Fig 7) The complete list of all 512 proteins can be viewed in

Supplemental Table 3 Using the ProteoIQ the most likely isoform was selected such as 14-3-3

protein gamma

From Table 1 we observe five proteins that agree with the genomic data for up

regulations in RAS granulin macrophage colony-stimulating factor 1 receptor cathepsin D

complement factor H and ribonuclease T2 In addition serine protease inhibitor A3N was not

123

detected as up regulated in the RAS genomic data but was found to be up-regulated in previous

genomic profiling of the mouse prion model22

One interesting trend from the data in Table 1 is

that the majority of proteins found to be up-regulated in the RAS model were not detected in the

control samples The absence of the detection of those proteins such as ribonuclease T2 in the

control CSF does not necessarily suggest the absence of the protein nonetheless it is below the

detection limits for this current proteomics protocol and instrumentation

Discussion

Mice have been the preferred laboratory rodent for prion diseases research because they

can be inexpensively housed and are amenable to transgenesis which allows for short incubation

periods as well as hypothesis testing upon targeted pathways Pursuant to the determination of

the mouse genome and the development of high density transcriptional arrays for measurements

of gene expression profiling mice have been used extensively to examine the molecular

pathology of prion disease probing the impact of disease and animal strain In order to expand

upon this foundation we adapted mouse prions to infect rats We reasoned that by taking a

comparative approach to the molecular pathology of prion disease inferences could be obtained

into the variability of the molecular response to prion diseases and that understanding this

variability might suggest whether human prion disease responses are more or less similar to

mouse responses A second rationale is the desire to identify surrogate markers of prion disease

While this approach has been taken before using gene expression profiling a more direct

approach has been to use proteomic approaches on cerebral spinal fluid (CSF) identifying

proteins that are increase in abundance with disease A rat prion disease is valuable for this

because the rat proteome is established and rats allow for the collection of relatively large

volumes of CSF (~100 microL per animal) which allows for robust analytical separations enhancing

124

detection of biomarkers Finally rats unlike humans can be used in a time course study of prion

disease This allows for the identification of early transcriptional and proteomic responses to

prion infection responses which are particularly valuable for the identification of surrogate

disease biomarkers

To initiate the development of a rat prion disease we attempted to adapt six different

prion disease agents PrPres

molecules to rat via intracranial inoculation of weanling animals

(Figure 1) Of these six agents only mouse RML prions were able to surmount the species

barrier to prion disease transmission Mouse PrP differs from rat by eight amino acid changes

six in the mature protein and four in the N-terminal octa-peptide repeat region (Supplementary

Fig 1) As rat shares the most prion protein sequence homology with mouse it is perhaps not

surprising that it transmitted whereas the other did not confirming that the primary prion protein

sequence is the most important determinant for interspecies transmission We conclude that there

is a large molecular species barrier preventing conversion of rat PrPc into PrP

res

The transmission of mouse RML into rats was characterized by a shortening of the

incubation period following each passage This is indicative of agent adaption to the new host

and increases in the titer present in end-stage brain Overall our adaptation of mouse prion

disease into rats resulted in a similar agent to that observed by Kimberlin27

The differences in

incubation period at second passage are largely due to our collecting the animals at 365 days post

inoculation upon first passage whereas Kimberlin and Walker allowed the first passage animals

to reach end-stage clinical rats

Rat adapted scrapie is in many ways similar to other prion diseases The clinical period of

disease is marked by a progressive neurological dysfunction with prominent ataxia kyphosis and

125

wasting The protease resistant PrP observed is typical as is the widespread deposition of PrPSc

in

the brain Spongiosis and reactive astrogliosis are as expected of a prion disease

Gene expression profiles from rats clinically affected with prion disease revealed a strong

neuronal inflammation associated with proliferated and activated astrocytes This is perhaps best

observed through the up-regulation of GFAP GFAP positive astrocytes were omnipresent

throughout the brain and GFAP mRNA was up-regulated 25 fold The up-regulation of GFAP is

a hallmark of the molecular response to prion infection and has been routinely observed Our

comparative analysis of gene expression changes in mouse RML versus rat adapted scrapie

suggest substantial differences in gene expression in response to prion disease despite the fact

that the overall response is neuro-inflammatory This suggests that the potential overlap between

mouse expression profiles and a putative human CJD expression profile could be quite different

at the level of individual transcripts that might be expected to be changed

CSF Proteomics

CSF proteomics can be exceedingly challenging due to the small sample available large

dynamic range in proteins and abundant proteins limiting sample loading onto nanoscale

columns Dynamic range reduction in the CSF sample was achieved using a custom amount of

IgY-7 antibodies to abundant proteins such as albumin After immunodepletion the total

protein content was reduced by ~90 limiting the proteomics analysis to one dimensional

separation Label free quantitation spectral counting was performed because it requires less

protein and does not increase sample complexity The proteins identified from the affected and

control N=5 are shown in Supplemental Table 2 Consistent detection of proteins in CSF from

both control and infected rats was observed (Fig 7C) Only two proteins were identified in

126

controls that were not observed in RAS and only 10 proteins were only observed in RAS Some

of these proteins that were only identified in RAS are significantly changed (Supplemental Table

3) One concern in proteomics data is the variability from run to run and the possibility that

certain proteins are identified from different unique peptides Figure 7A shows that the vast

majority of the unique peptides detected (783) were identified with a 1 FDR in both RAS and

control CSF samples highlighting the analytical reproducibility of our methodology

Proteomic analysis of the infected rat CSF provides a reasonable approach to cross

validate biomarkers detected by gene expression profiling For example RNAseT2 a secreted

ribonuclease was up-regulated 3-fold at the mRNA level and was also enriched in CSF from

infected rats (Table 1) Similarly coordinated increases in detection of colony stimulating factor

1 receptor complement factor H granulin and cathepsin D were also observed Conversely

proteomic analysis of CSF also allows for the observation of post-transcriptional responses to

prion disease that might be absent in gene expression profiling The 14-3-3 proteins and neuron

specific enolase both known markers for CJD are only detected by proteomic analysis Thus

gene expression profiling and proteomic detection serve to increase confidence in the

observation of up-regulation enhancing the likelihood that proteins detected by both

methodologies are specific and perhaps may be more sensitive at earlier time points

Comparison to human CSF prion disease proteome

In our comparative analysis 21 proteins were found to be up-regulated while 7 proteins

down-regulated in prion infected rats Included in the up-regulated proteins were the 14-3-3

proteins epsilon and gamma 14-3-3 zetadelta was also up-regulated but not statistically

significant Enolase 2 or neuron specific enolase (NSE) was also up-regulated in CSF of infected

127

rats These proteins are all in agreement with results from previous proteomic profiling of human

CSF from patients with CJD8 9

The detection of 14-3-3 protein is included in the diagnostic

criteria approved by World Health Organization for the pre-mortem diagnosis of clinically

suspected cases of sCJD28

although its application in large-scale screening of CJD is still

debated due to high false positive rate where elevated 14-3-3 protein levels were also reported in

other conditions associated with acute neuronal damage29 30

It was suggested that other brain-

derived proteins that are also specific to CJD can be used in conjunction with 14-3-3 protein to

increase diagnosis accuracy and specificity31

NSE is present in high concentration in neurons

and in central and peripheral neuroendocrine cells NSE serves as another valuable biomarker in

diagnosis of CJD as the elevated level of NSE in CSF was found in early and middle stages of

CJD 32

Other proteins detected in CSF included cystatin C and serpina3N although both of

these were not statistically changed These proteins were both previously identified as being

putative biomarkers for CJD33 34

Utility for biomarker discovery with emphasis on onset of biomarker appearance in CSF

The investigation of the protein changes in CSF from RAS compared to control rats

provides a solid foundation when investigating potential biomarkers with prion disease onset

The cross-validation of the genomic and proteomics data further emphasizes the targets for

consideration during disease onset Biomarker discovery provides the potential to determine if

animals such as cows in bovine spongiform encephalopathy (BSE) are affected instead of

having to be euthanized when BSE is expected In addition CSF collection in mice or hamsters

Prion models is extremely difficult and limited alternatively with the advent of the RAS model

CSF can be collected in sufficient amounts for CSF proteomics CSF collection for mice or

hamsters can be ~10 microL and could be contaminated with blood further complicating proteomic

128

analysis unlike rats which over 10 times more CSF can be collected per animal35

Due to the

amount of CSF available time course studies analyzing CSF proteomics is possible in RAS due

to animal numbers that are manageable and reasonable The RAS model further allows

investigators to bypass working with highly infections CJD CSF samples to investigate the CSF

proteome changes

Conclusion

In this study we have described the gene and protein expression changes in brain and

spinal fluid from a transmission of mouse prions into rats We find that while the overall gene

expression profile in rats is similar to that in mice the specific genes that make up that profile

are different suggesting that genes that change in response to prion disease in different species

may not be as conserved Analysis of CSF from rat adapted scrapie identified similar protein

changes as known in human CJD The rat will be a useful model to identify surrogate markers

that appear prior to the onset of clinical disease and thus may be of higher specificity and

sensitivity

Supplemental Information Available Upon Request

1 Chandler R L Encephalopathy in mice produced by inoculation with scrapie brain material Lancet 1961 1 (7191) 1378-9 2 Silva C J Onisko B C Dynin I Erickson M L Requena J R Carter J M Utility of Mass Spectrometry in the Diagnosis of Prion Diseases Anal Chem 2011 3 Wei X Herbst A Ma D Aiken J Li L A quantitative proteomic approach to prion disease biomarker research delving into the glycoproteome J Proteome Res 2011 10 (6) 2687-702 4 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome Res 2008 7 (1) 386-99 5 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 2003 878 (22) 2003-12 6 Cunningham R Ma D Li L Mass spectrometry-based proteomics and peptidomics for systems biology and biomarker discovery Frontiers in Biology 2012 7 (4) 313-335

129

7 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66 8 Harrington M G Merril C R Asher D M Gajdusek D C Abnormal proteins in the cerebrospinal fluid of patients with Creutzfeldt-Jakob disease N Engl J Med 1986 315 (5) 279-83 9 Piubelli C Fiorini M Zanusso G Milli A Fasoli E Monaco S Righetti P G Searching for markers of Creutzfeldt-Jakob disease in cerebrospinal fluid by two-dimensional mapping Proteomics 2006 6 Suppl 1 S256-61 10 Lilley K S Razzaq A Dupree P Two-dimensional gel electrophoresis recent advances in sample preparation detection and quantitation Curr Opin Chem Biol 2002 6 (1) 46-50 11 Oh-Ishi M Maeda T Separation techniques for high-molecular-mass proteins J Chromatogr B Analyt Technol Biomed Life Sci 2002 771 (1-2) 49-66 12 Metz T O Qian W J Jacobs J M Gritsenko M A Moore R J Polpitiya A D Monroe M E Camp D G 2nd Mueller P W Smith R D Application of proteomics in the discovery of candidate protein biomarkers in a diabetes autoantibody standardization program sample subset J Proteome Res 2008 7 (2) 698-707 13 Aebersold R Mann M Mass spectrometry-based proteomics Nature 2003 422 (6928) 198-207 14 Huang L Harvie G Feitelson J S Gramatikoff K Herold D A Allen D L Amunngama R Hagler R A Pisano M R Zhang W W Fang X Immunoaffinity separation of plasma proteins by IgY microbeads meeting the needs of proteomic sample preparation and analysis Proteomics 2005 5 (13) 3314-28 15 Rajic A Stehmann C Autelitano D J Vrkic A K Hosking C G Rice G E Ilag L L Protein depletion using IgY from chickens immunised with human protein cocktails Prep Biochem Biotechnol 2009 39 (3) 221-47 16 Marsh R F Bessen R A Lehmann S Hartsough G R Epidemiological and experimental studies on a new incident of transmissible mink encephalopathy J Gen Virol 1991 72 ( Pt 3) 589-94 17 Bessen R A Marsh R F Biochemical and physical properties of the prion protein from two strains of the transmissible mink encephalopathy agent J Virol 1992 66 (4) 2096-101 18 Johnson C J Herbst A Duque-Velasquez C Vanderloo J P Bochsler P Chappell R McKenzie D Prion protein polymorphisms affect chronic wasting disease progression PLoS One 2011 6 (3) e17450 19 Safar J Wille H Itri V Groth D Serban H Torchia M Cohen F E Prusiner S B Eight prion strains have PrP(Sc) molecules with different conformations Nat Med 1998 4 (10) 1157-65 20 Benjamini Y Hochberg Y Controlling the False Discovery Rate A Practical and Powerful Approach to Multiple Testing Journal of the Royal Statistical Society Series B (Methodological) 1995 57 (1) 289-300 21 Huang da W Sherman B T Lempicki R A Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources Nat Protoc 2009 4 (1) 44-57 22 Moody L R Herbst A J Aiken J M Upregulation of interferon-gamma-induced genes during prion infection J Toxicol Environ Health A 2009 74 (2-4) 146-53 23 Flicek P Amode M R Barrell D Beal K Brent S Carvalho-Silva D Clapham P Coates G Fairley S Fitzgerald S Gil L Gordon L Hendrix M Hourlier T Johnson N Kahari A K Keefe D Keenan S Kinsella R Komorowska M Koscielny G Kulesha E Larsson P Longden I McLaren W Muffato M Overduin B Pignatelli M Pritchard B Riat H S Ritchie G R Ruffier M Schuster M Sobral D Tang Y A Taylor K Trevanion S Vandrovcova J White S Wilson M Wilder S P Aken B L Birney E Cunningham F Dunham I Durbin R Fernandez-Suarez X M Harrow J Herrero J

130

Hubbard T J Parker A Proctor G Spudich G Vogel J Yates A Zadissa A Searle S M Ensembl 2012 Nucleic Acids Res 2012 40 (Database issue) D84-90 24 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein identification by searching sequence databases using mass spectrometry data Electrophoresis 1999 20 (18) 3551-67 25 Zybailov B Mosley A L Sardiu M E Coleman M K Florens L Washburn M P Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae J Proteome Res 2006 5 (9) 2339-47 26 Zhang Y Wen Z Washburn M P Florens L Refinements to label free proteome quantitation how to deal with peptides shared by multiple proteins Anal Chem 2010 82 (6) 2272-81 27 Kimberlin R H Cole S Walker C A Temporary and permanent modifications to a single strain of mouse scrapie on transmission to rats and hamsters J Gen Virol 1987 68 ( Pt 7) 1875-81 28 World Health Organization Global surveillance diagnosis and therapy of human transmissible spongiform encephalopathies report from a WHO consultation 1998 World Health Organization Geneva Switzerland 9-11 February 1998 1-26 29 Bartosik-Psujek H Archelos J J Tau protein and 14-3-3 are elevated in the cerebrospinal fluid of patients with multiple sclerosis and correlate with intrathecal synthesis of IgG J Neurol 2004 251 (4) 414-20 30 Saiz A Graus F Dalmau J Pifarre A Marin C Tolosa E Detection of 14-3-3 brain protein in the cerebrospinal fluid of patients with paraneoplastic neurological disorders Ann Neurol 1999 46 (5) 774-7 31 Sanchez-Juan P Green A Ladogana A Cuadrado-Corrales N Saanchez-Valle R Mitrovaa E Stoeck K Sklaviadis T Kulczycki J Hess K Bodemer M Slivarichova D Saiz A Calero M Ingrosso L Knight R Janssens A C van Duijn C M Zerr I CSF tests in the differential diagnosis of Creutzfeldt-Jakob disease Neurology 2006 67 (4) 637-43 32 Zerr I Bodemer M Racker S Grosche S Poser S Kretzschmar H A Weber T Cerebrospinal fluid concentration of neuron-specific enolase in diagnosis of Creutzfeldt-Jakob disease Lancet 1995 345 (8965) 1609-10 33 Sanchez J C Guillaume E Lescuyer P Allard L Carrette O Scherl A Burgess J Corthals G L Burkhard P R Hochstrasser D F Cystatin C as a potential cerebrospinal fluid marker for the diagnosis of Creutzfeldt-Jakob disease Proteomics 2004 4 (8) 2229-33 34 Miele G Seeger H Marino D Eberhard R Heikenwalder M Stoeck K Basagni M Knight R Green A Chianini F Wuthrich R P Hock C Zerr I Aguzzi A Urinary alpha1-antichymotrypsin a biomarker of prion infection PLoS One 2008 3 (12) e3870 35 DeMattos R B Bales K R Parsadanian M ODell M A Foss E M Paul S M Holtzman D M Plaque-associated disruption of CSF and plasma amyloid-beta (Abeta) equilibrium in a mouse model of Alzheimers disease J Neurochem 2002 81 (2) 229-36

131

Figure 1 Interspecies transmission of prion disease to rats End-stage clinical brain homogenates

were used to passage prion disease After one year of incubation animals were euthanized to

determine the extent of PrPres

accumulation Protease resistance PrP was only observed in those

animals infected with RML scrapie prions This material was serially passaged for two more

incubations before becoming rat-adapted as indicated by the shortening of the incubation period

132

Table 1 A list of selected proteins and mRNAs up-regulated in RAS CSF and brain tissue If

the protein is detected in the RAS CSF but not the control CSF the CSF expression is reported

with a infin If there is no change or data on certain genes related to an up regulated protein nd is

noted The mouse genomic data presented here was previously published22

Gene Protein Symbol Accession CSF

Expression

Rat

GEX

Mouse

GEX

14-3-3 protein zetadelta Ywhaz NP_037143 infin nd nd

14-3-3 protein epsilon Ywhae NP_113791 infin nd nd

14-3-3 protein gamma Ywhag NP_062249 infin nd nd

serine protease inhibitor A3N Serpin A3N NP_113719 54 nd 975

enolase 2 (gamma neuronal) Eno2 NP_647541 infin nd nd

granulin GRN NP_058809 62 364 184

macrophage colony-stimulating

factor 1 receptor

Csf1r NP_001025072 infin 293 205

cathepsin D CTSD NP_599161 infin 255 299

complement factor H Cfh NP_569093 376 234 nd

ribonuclease T2 RNAset2 NP_001099680 infin 302 nd

133

Figure 2 Accumulation of PrPSc

in rat adapted scrapie First second and third passage brain

homogenates were assayed for the presence of protease resistant PrP by immunoblot PrPSc

was

observed following phosphotungstenic acid enrichment at first passage By clinical disease in 2nd

and 3rd

passage rats PrPSc

had substantially accumulated

134

Figure 3 Histological appearance of rat adapted scrapie in the hippocampus at clinical disease

Infected animals showed intense immuno-staining for deposits of PrPSc

and GFAP expressing

astrocytes Spongiform change is an abundant feature in rat adapted scrapie

135

Figure 4 Scatter plot of gene expression profiling of rat adapted scrapie The expression of

individual genes from uninfected and infected animals were plotted to display up and down

regulation The dashed green line is no change Solid green lines are 2-fold changes in gene

expression

136

Figure 5 Quantitative relationship between orthologous pairs of two-fold up-regulated genes in

mouse and rat scrapie Genes up-regulated more than two fold were mapped to their orthologs

and the fold change was plotted Expression is log2 transformed

137

Figure 6 Comparison of genes upregulated in rat and mouse prion diseases Genes up-regulated

two fold in rodent scrapie were identified and the expression of their orthologs was determined

138

Figure 7 Venn diagram comparison of the CSF proteomics data between rat adapted scrapie

(RAS) model and control rats (A) Unique peptides from tryptic peptides produced for the

proteins identified (B) The total proteins identified including all isoforms within the protein

groups (C) The protein groups comparing only the top protein hit of the protein isoforms

showing very consistent protein identifications between RAS and control

139

Chapter 5

Investigation of the differences in the phosphoproteome between starved vs

glucose fed Saccharomyces cerevisiae

Adapted from ldquoInvestigation of the differences in the phosphoproteome between starved vs

glucose fed Saccharomyces cerevisiaerdquo Cunningham R Grunwald D Wellner D Conway M

Heideman W Li L In preparation

140

Abstract

This work explores comparative proteomics between starved and glucose fed

Saccharomyces cerevisiae (bakers yeast) Yeast depends on nutrients such as glucose to

survive and need to cycle between states of growth and quiescence Kinases such as protein

kinase A (PKA) and target of rapamycin kinase (TOR) are involved in the glucose response

Thus performing a large scale phosphoproteomic MS study can be highly beneficial to map the

signaling networks and cascades involved in glucose-dependent stimulated yeast growth Yeast

cell extract was digested and phosphopeptides were enriched by immobilized metal affinity

chromatography (IMAC) followed by two dimensional high pH-low pH reversed phase (RP)-RP

separation The low pH separation was infused directly into an ion trap mass spectrometer with

neutral loss electron-transfer dissociation (ETD)-triggered fragmentation to improve

phosphopeptide identifications In total 477 unique phosphopeptides are identified with 06

false discovery rate (FDR) and 669 unique phosphopeptides identified with 54 FDR This

study includes comparative phosphoproteins for one specific motif of PKA xxxKRKRxSxxxx

which is presented and differences between starved vs glucose fed are highlighted Phosphosite

validation is performed using a localization algorithm Ascore to provide more confident and

site-specific characterization of phosphopeptides

141

Introduction

Microorganisms such as Saccharomyces cerevisiae (yeast) cycle between growth when

nutrients are available and quiescence when nutrients are scarce When glucose is limited yeast

go into growth arrest state but when glucose is added growth quickly resumes Kinases such as

protein kinase A (PKA) and target of rapamycin (TOR) regulate growth in response to nutrient

conditions and have been well studied through transcriptional control1-4

Yeast execute large

transcriptome alterations in response to changing environmental growth conditions5 6

Gene

regulation by glucose introduction in yeast has been studied including genes used for growth on

alternative carbon sources and activation of genes coding for glucose transport and protein

synthesis7-10

Response to nutrients for survival is not limited to yeast biology and indeed all

living creatures both prokaryotes and multicellular eukaryotes like mammals require nutrient

responsiveness and coordinating cellular functions to survive

With regulation of certain genes well studied by glucose introduction the mechanism and

global protein modulation caused by glucose introduction remain unknown6 Large-scale

phosphorylation analysis has been previously performed in yeast using mass spectrometry11-14

Mapping the phosphoproteome in yeast transitioning from quiescence to growth conditions to

better understand the roles of phosphorylation in orchestrating growth is needed The

phosphorylation state during growth or starvation could be used to engineer cells to drive ectopic

activity (or inhibition) to promote growth and ethanol production on non-native sugars like

xylose

It has been reported that the phosphorylation state can be affected by the introduction of

glucose to carbon-starved yeast15

and phosphorylation plays a significant role in the cell cycle

and signal transduction16

Specifically O-Phosphorylation can function as a molecular switch by

142

changing the structure of a protein via alteration of the chemical nature of an amino acid for

serine threonine or tyrosine It has been reported that up to 30 of the proteome can undergo

phophorylation17

Mass spectrometry has evolved as a powerful tool to accomplish phosphosite

mapping using shotgun proteomics With available technology tens of thousands of

phosphorylation sites can be mapped to amino acids on thousands of proteins using shotgun

proteomics18-20

Mass spectrometry can offer sensitive automated non-targeted global analysis of

phosphorylation events in proteomic samples but in any large scale phosphoproteomic

investigation enrichment of phosphoproteinspeptides is required First phosphorylation is

usually a sub-stoichiometric process where only a percentage of all protein copies are

phosphorylated21

Various enrichment methods have been used for phosphopeptide enrichment

including metal oxide affinity chromatography (MOAC)22

such as TiO223

immobilized metal

affinity chromatography (IMAC)12 24 25

electrostatic repulsion-hydrophilic interaction

chromatography (ERLIC)26

and immunoaffinity of tyrosine phosphorylation27 28

After

enrichment MS analyses of phosphorylated peptides are still challenging due to ion suppression

from non-phosphorylated peptides

Even after phosphopeptide enrichment further sample preparation is needed for large

scale proteomic experiments Additional fractionation can increase protein coverage of a

sample by over ten fold such as MudPIT29

(multidimensional protein identification technology)

In online MudPIT a sample is injected onto a strong cation exchange (SCX) column coupled to

a RP column Successive salt bumps followed by low pH gradients provide the separation of

peptides in two dimensions Since the phosphate groups on phosphopeptides have a low pKa

value due to being more acidic then their unmodified counterparts they tend to elute earlier and

143

disproportionally in ion exchange chromatography like SCX Alternatively reversed-phase

reversed-phase (RP-RP) separation has been proposed as an alternative to MudPIT for offline

two dimensional (2D) separation30

One of the caveats of 2D separation is the potential for

wasted mass spectrometry time from early and late fractions having very few peptides present

all while having too much sample for middle fractions One simple method to reduce these

ldquowasted fractionsrdquo is to combine early or late fractions with the middle fractions to reduce MS

runs with little peptide content to analyze thus shortening the overall analysis time31

In addition the labile phosphorylation group has a large propensity to undergo cleavage

during collision induced dissociation (CID) producing a neutral loss The neutral loss can

produce insufficient backbone fragment ions for MSMS identification32

A solution to neutral

loss fragmentation in phosphopeptides is MS3 using CID to produce improved backbone

fragmentation13 14 33

An alternative fragmentation method to CID for fast sampling ion traps is

electron transfer dissociation (ETD)34-36

ETD produces a more uniform back-bone cleavage

where the cation peptide receives an electron from a low affinity radical anion37

The transferred

electron induces a dissociation at the N-Cα bond to produce c- and zbull-type product ions while

retaining the labile post-translational modifications (PTM) such as phosphorylation bound to the

product ions38

The Bruker amaZon ETD ion trap mass spectrometer has the potential to trigger

ETD fragmentation after a labile PTM produces a neutral loss from CID fragmentation This

method is termed neutral loss-triggered ETD fragmentation and provides a complementary

fragmentation pathway to labile poor fragmenting phosphorylated peptides

In this work we provide a qualitative comparative list of yeast phosphopeptides observed

in glucose fed vs glucose starved conditions

144

Experimental

EXPERIMENTAL DETAILS

Chemicals

Acetonitrile (HPLC grade) ammonium hydroxide (ACS plus certified) urea (gt99)

sodium chloride (997) ammonium bicarbonate (certified) Optima LCMS grade acetonitrile

Optima LCMS grade water and EDTA disodium salt (99) were purchased from Fisher

Scientific (Fair Lawn NJ) Deionized water (182 MΩcm) was prepared with a Milli-Q

Millipore system (Billerica MA) DL-Dithiothreitol (DTT) and sequencing grade modified

trypsin were purchased from Promega (Madison WI) Formic acid (ge98) was obtained from

Fluka (Buchs Switzerland) Iodoacetamide (IAA) ammonium formate (ge99995) Trizma

hydrochloride (Tris HCl ge990) Triton X-100 (laboratory grade) and iron (III) chloride

hexahydrate (97) were purchased from Sigma-Aldrich (Saint Louis MO) Sodium dodecyl

sulfate (SDS) (998) was purchased from US Biological (Marblehead MA) Nickel

nitrilotriacetic acid (Ni-NTA) magnetic agarose beads were purchased from Qiagen (Valencia

CA) Autoclaved yeast peptone dextrose (YPD) media was prepared in 1 L of deionized water

using 10 g yeast extract 20 g Peptone from Becton Dickinson and Company (Sparks MD) and

20 g D-(+)-Glucose (ge995) purchased from Sigma-Aldrich (Saint Louis MO)

Modified Mary Miller Yeast Protein Isolation

The yeast culture was prepared and protein extraction was performed using a modified

Mary Miller protocol39

Briefly yeast strain s288c was inoculated with YPD media and shook

for 72 hours to allow the yeast to be glucose starved After the 72 hours the yeast culture was

partitioned into two flasks To one flask glucose was added at 2 of the final concentration and

allowed to incubate and shake for 10 minutes Then 100 optical density units (ODU) of the yeast

145

culture were collected from each flask Each yeast sample was spun down in a Beckman-Coulter

J6B centrifuge (Indianapolis IN) at 2500xg for three minutes The media was aspirated and the

tubes were flash frozen in liquid nitrogen and stored at -80oC The yeast pellets were thawed on

ice and resuspended in 1 mL of lysis buffer (50 mM Tris HCl 01 triton x-100 05 SDS

pH=80) and 1X protease and phosphatase inhibitor cocktail from Thermo Scientific (Rockford

IL) and transferred to an Eppendorf tube To the yeast 200 microL of glass beads were added and

amalgamated for 25 minutes at 4oC The sample was inverted and the Eppendorf tube was

pierced with a hot 23 gauge needle through the bottom and the tube was placed atop a 5 mL

culture tube and centrifuge in the Beckman-Coulter J6B at 2500xg for three minutes at 4oC to

collect the liquid containing the yeast cells while the glass beads remain trapped in the

Eppendorf tubes The protein lysate was then centrifuged at 3000xg for five minutes at 4oC and

the supernatant was collected and stored at -80oC

Preparation of tryptic digests

The Saccharomyces cerevisiae protein cell extract for fed or starved were analyzed with a

BCA protein assay kit (Thermo Scientific Rockford IL) and 2 mg of protein was added to four

parts -20oC acetone and allowed to precipitate the protein for 1 hour at -20

oC The samples were

then centrifuged at 14000xg for 5 minutes and the supernatant was removed and the protein

pellet was reconstituted in 360 microl of 8 M urea To each sample 20 microL of 050 M DTT was

added and allowed to incubate at 37oC for 45 minutes After incubation 54 microL of 055 M IAA

was added for carbamidomethyl of cysteine and the sample was allowed to incubate for 15

minutes in the dark To quench the IAA 20 microL of 050 M DTT was added and allowed to react

for 10 minutes To perform trypsin digestion 1400 microL of 50 mM NH4HCO3 was then added

along with 20 microg trypsin to each sample Digestion was performed overnight at 37oC and

146

quenched by addition of 25 microL of 10 formic acid For each sample the tryptic peptides were

then subjected to solid phase extraction using a Hypersep C18 100 mg solid phase extraction

(SPE) column (Thermo Scientific Bellefonte PA) Peptides were eluted with 50 ACN in

01 formic acid concentrated and reconstituted in 500 microL of 80 ACN and 01 formic acid

Phosphopeptide enrichment using immobilized metal affinity chromatography (IMAC)

One ml of Ni-NTA was washed three times with 800 microL of H2O and then the Ni was

removed with 800 microL of 100 mM EDTA in 50 mM ammonium bicarbonate and vortexed for 30

minutes The supernatant was removed and the NTA was washed with 800 microL of H2O three

times followed by addition of 800 microL 10 mM FeCl3 hexahydrate and vortexed for 30 minutes

The Fe-NTA was then washed three times with 800 microL of H2O and once with 80 ACN in 01

formic acid before being combined with the cell extract for phosphopeptide enrichment and

vortexed for 40 minutes The sample was washed three times with 200 microL of 80 ACN in 01

formic acid before elution with two aliquots of 200 microL of 5 ammonium hydroxide in 5050

ACNH2O with 15 minutes of vortexing The enriched phosphopeptides were then dried down

with a Thermo Scientific Savant SpeedVac (SVC100 Waltham MA) and reconstituted in 55 microL

25mM ammonium formate pH=75

First dimension neutral pH separation

Individually 50 microL of each yeast phosphopeptide enriched sample was injected onto a

Phenomenex Gemini-NX 3microm C18 110 Aring 150 x 2 mm column (Torrance CA) The Gemini

column was coupled to a Phenomenex SecurityGuard Gemini C18 2 mm guard cartridge

(Torrance CA) Mobile phase A consisted of 25 mM ammonium formate at pH=75 and mobile

phase B consisted of 25 mM ammonium formateACN with a 19 ratio respectively at pH=75

The column was operated at 50oC at a flow rate of 05 mLmin The flow profile was 2-30 B

147

over 65 minutes and then 5 min at 100 B A total of 22 fractions were collected every 3

minutes and the fractions were combined as follows 1 with 12 2 with 13 etc to 11 with 22

The combined fractions were dried down and reconstituted in 25 microL of 01 formic acid

RP nanoLC separation

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN Injections consisted of 5 microL from each fraction onto an Agilent Technologies

Zorbax 300 SB-C18 5 microm 5 x 03 mm trap cartridge (Santa Clara CA) at a flow rate of 5

microLmin for 5 minutes at 95 A 5 B Separation was performed on a Waters 3 microm Atlantis

dC18 75 microm x 150 mm (Milford MA) using a gradient from 5 to 45 mobile phase B at 250

nLmin over 90 minutes at room temperature Emitter tips were pulled from 75 microm inner

diameter 360 microm outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using

an in-house model P-2000 laser puller (Sutter Instrument Co Novato CA)

Mass spectrometry data acquisitions

An ion-trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with an on-line nanospray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Eksigent nanoLC software (Dublin CA Version 30 Beta Build 080715) for MS acquisition for

all experiments Smart parameter setting (SPS) was set to 700 mz compound stability and trap

drive level was set at 100 Optimization of the nanospray source resulted in dry gas

temperature of 125oC dry gas flow of 40 Lmin capillary voltage of -1300 V end plate offset of

-500 V MSMS fragmentation amplitude of 10V and SmartFragmentation set at 30-300

148

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions An additional mode of MSMS fragmentation electron transfer dissociation

(ETD) was triggered on the precursor ion when a neutral loss was observed in CID

fragmentation from phosphoric acid H3PO4 (Δmz of 49 and 327 for 2+ and 3+ charge states

respectively) or for metaphosphoric acid (HPO3) (Δmz of 40 and 267 for 2+ and 3+ charge

states respectively) For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms rolling average 2 acquisition range of 300-1500 mz

and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS generation the ICC target

was set to 300000 maximum accumulation time 5000 ms two spectral averages acquisition

range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required to prevent artificial data

reduction Identification of peptides were performed using Mascot40

(Version 23 Matrix

Science London UK) Database searching was performed against SwissProt Saccharomyces

cerevisiae database (version 575) Mascot search parameters were as follows Allowed missed

cleavages 2 enzyme trypsin fixed modification carboxymethylation (C) variable

modifications oxidation (M) and phosphorylation (STY) peptide tolerance plusmn12 Da maximum

number of 13

C 1 MSMS tolerance peptide decoy (Mascot) checked plusmn05 Da instrument type

149

ESI-Trap Results from the Mascot search was exported as DAT files for analysis by Scaffold 3

and Scaffold PTM

Scaffold and Ascore data processing

Scaffold 3 (Proteome Software Portland OR)was used for MSMS proteomics data

comparison and false discovery rate analysis The DAT files were loaded into Scaffold 3 and

the fractions for the two dimensional fractionation were combined The resulting biological

triplicates for starved and glucose fed yeast were filtered with a 1 false discovery rate (FDR)

on the protein level (Supplemental Table 1) Due to the inherent poor fragmentation of

phosphopeptides a lenient FDR level may still yield biologically relevant data and thus a 54

FDR was generated for phosphopeptides (Supplemental Table 2) For a more conservative list of

phosphopeptides with a 06 FDR was generated from Scaffold 3 (Supplemental Table 3) FDR

analysis is sufficient at preventing poor data from being reported but does not prevent false

phosphosite identification in phosphopeptides To provide confidence in site identification

Scaffold PTM was used to perform Ascore41

analysis Ascore uses an algorithm to score the

probability of the phosphosite from a phosphopeptide identified by a database searching

algorithm such as Mascot Ascore can be freely accessed at httpascoremedharvardedu

Cell collection RNA isolation and microarray data analysis

All experiments were performed in biological duplicates Cell samples (10 ODU) were

taken at starved and fed states spun at 5000xg for 2 minutes at 30 oC the supernatant was

removed and the pellets snap frozen using liquid nitrogen RNA was isolated using the Epicentre

MasterPure Yeast Purification Kit (Epicentre Technologies) and quality was checked using gel

electrophoresis cRNA synthesis was carried out using the GeneChipreg Expression 3

Amplification One-Cycle Target Labeling and Control Reagents kit (Affymetrix) All

150

experiments followed the manufactures instructions cRNA samples were hybridized to

GeneChip Yeast Genome 20 Arrays for 16 hours Arrays were washed stained and scanned

according the manufactures recommendations Affymetrix CEL files were RMA normalized

with R and the Bioconductor Suite Data analysis was performed within TIGR Multiexperiment

Viewer v451 in-house Perl scripting R and Bioconductor

Results

Sample preparation for shotgun proteomics

As discussed in the introduction the purpose of this study is to provide an exploratory list

of qualitative differences in the phosphoproteome between starved vs glucose fed yeast After

yeast cell lysate production a substantial amount of sample preparation is performed to enhance

the depth of the phosphoproteome coverage A workflow of the steps following cell lysis is

outlined in Figure 1A The yeast proteins are isolated via acetone precipitation followed by

digestion with trypsin to produce peptides Phosphopeptides are intermixed with the entire

tryptic peptide sample and therefore Fe-NTA IMAC is used to enrich the phosphopeptides To

improve upon the number of phosphopeptides we then performed two dimensional separation

with high pH RP followed by low pH RP C18 and infused the eluent directly onto an ion trap

mass spectrometer Figure 1B show an improved technique for the first dimension of separation

to combine the early eluting and late eluting fractions from the first phase of separation to reduce

overall MS analysis time The combination of fractions 1 and 12 2 and 13 etc essentially

improves the duty cycle of the mass spectrometer by ensuring sufficient peptide content is

injected onto a low pH nanoLC RP C18 column

ETD-triggered mass spectrometry

151

In the present study labile phosphorylation can lead to non-informative neutral loss

During MS scanning mode the instrument will choose the 6 most abundant peaks with correct

isotopic distribution and select them for MSMS CID fragmentation During CID fragmentation

it is common for labile PTMs such as phosphorylation to undergo neutral loss and thus limited

informative b and y-type ions are formed Alternatively ETD fragmentation can be used on

specific MSMS scans which produce a neutral loss indicative of phosphorylation (98Daz or

80Daz) The subsequent ETD fragmentation on these putative phosphopeptides will lead to

uniform backbone cleavage resulting in confident identification of phosphopeptides with site-

specific localization during MSMS It is important to note that CID fragmentation still produces

very informative fragmentation for phosphorylation but ETD provides an orthogonal

fragmentation pathway to further increase the phosphoproteome coverage Additionally the

duty cycle for ETD fragmentation is roughly half that of CID therefore about half as many

potential peptides would be fragmented and sequenced if the instrument was using ETD

fragmentation exclusively

Protein Data

Even with phosphopeptide enrichment it is inevitable that non-phosphopeptides will also

be identified All data were searched with Mascot and in total over 1000 proteins were identified

with a 1 FDR rate and are listed in Supplemental Table 1 The proteins listed in Supplemental

Table 1 include both phosphoproteins and non-phosphorylated proteins In addition to the

proteins identified in the fed and starved states the unique peptides and spectral counts are also

listed for reference The phosphopeptides are specifically listed with a 54 FDR rate in

Supplemental Table 2 and comparisons between starved and fed with Ascore analysis are listed

for every phosphopeptide identified A higher confidence of phosphopeptide identification is

152

sometimes required before investing in time consuming biological experiments so a list of

phosphopeptides identified in starved and fed was taken from a setting directing Scaffold to

produce a 05 FDR which resulted in the generated FDR of 06 FDR and listed in

Supplemental Table 3

A summary of specific phosphorylation sites are listed in Table 1 with a 06 FDR and

Table 2 with a 54 FDR Each table gives totals for unique phosphopeptides identified having

an Ascore localization score ge80 without Ascore and phosphorylation events on each unique

peptides As expected the majority of phosphorylation events (over 50) occurred on serine

whereas fewer phosphorylation events (~10) occurred on tyrosine In addition the vast

majority of phosphorylation events were single phosphorylation (ge65) with very few

identifications having more than two phosphosites per peptide For specific phosphopeptide

identification and subsequent phosphoprotein identification refer to Supplemental Table 2 and 3

Discussion

Transcriptional response to glucose feeding

Yeast responds to the repletion of glucose after glucose-depletion by broad

transcriptional changes to induce growth Roughly one-third of the yeast genome is altered by at

least two fold when glucose is introduced as shown in Figure 2 Figure 2 is a heat map from a

microarray analysis showing 1601 genes up regulated and 1457 genes down regulated after

addition of glucose compared to the starved state The arbitrary cut-offs for these values were as

follows two fold induction after nutrient repletion and a false discovery rate (q-value) of 001

Red color in the heat map indicates a particular gene is up-regulated in the fed state compared to

the starved state Alternatively genes coded in green are less expressed in the fed state

compared to the starved condition The intensity of the green or red colors is indicative of the

153

intensity of the fold change in gene expression These large transcriptional changes after glucose

repletion drive and complement the current phosphoproteomic study

PKA motif analysis

One benefit of a large scale phosphoproteomics experiment is to examine the different

phosphopeptides from known motifs for specific kinases PKA and TOR are responsible for the

majority of the transcriptional response and thus PKA is a good target for motif analysis Figure

3 shows the PKA motif sequence xxxRKRKxSxxxxxx with the phosphorylation occurring on

the serine Other than F16P (fructose-16-bisphosphatase) all the proteins listed are unique to the

starved or fed samples A motif sequence will inevitably show up by random chance in any

analysis For this study the control for motif analysis uses the swissprot protein list for the

entire yeast proteome for the background Compared to background this specific PKA kinase

from Figure 3 is up-regulated by 264 fold when compared to the background One interesting

protein emerged from this motif analysis in the fed sample but not the starved sample is

Ssd1which is implicated in the control of the cell cycle in G1 phase42

Ssd1 also is

phosphorylated by Cbk1 and Cbk1 has been proposed to inhibit Ssd143

and provides an

intriguing target for future studies on starved vs glucose fed yeast growth

Localization of the phosphorylation sites

When a phosphopeptide contains any number of serine threonine or tyrosine amino

acids the localization of the phosphosite can sometimes be ambiguous Database searches used

to identify peptides like Mascot do not provide any probability for localization of correct

phosphorylation sites Ascore is an algorithm that uses the same MSMS spectra as Mascot but

instead of comparing theoretical peptide MSMS to experimental spectra Ascore searches for

informative a b or y-type peptide fragments giving evidence for phosphosite The Scaffold

154

program adds a localization probability to the Ascore values and the values are listed in

Supplemental Table 2 and 3 Figure 4 has a representative Ascore mass spectra assigning the

peaks identified and providing evidence that the phosphorylation site occurs at the threonine

instead of the serine Incorporating Ascore into this study provides a layer of validation for

putative phosphosite identification

Plasma Membrane 2-ATPase

A previous study identified and localized phosphorylation sites on plasma membrane 1-

ATPase after glucose was introduced to starved yeast15

In the current study PMA2 (plasma

membrane ATPase 2) was identified in glucose fed and not starved samples The doubly

threonine phosphorylated peptide identified in fed yeast is PMA2 with amino acid sequence

IKMLTGDAVGIAKETCR (583-599) whereas the homologous peptide in PMA1 has the exact

same amino acid sequence except for the first isoleucine substituted for valine

VKMLTGDAVGIAKETCR (554-570 not observed in data) PMA2 was observed in the 06

FDR list of phosphopeptides and has an Ascore localization probably of 100 A plant study

showed that PMA2 phosphorylation level was higher in early growth phase than when in

stationary phase44

In addition PMA2 expression in yeast permits the growth of yeast and

threonine phosphorylation has been reported on Thr-95545

The identification of PMA2 in the

fed glucose cell extract provides an interesting target for future study on the molecular

mechanisms involved in regulation growth in starved vs glucose fed yeast

Conclusion

In conclusion this work provides a qualitative comparison in the phosphoproteome

between starved and glucose fed yeast A large scale IMAC enrichment of yeast cell lysate

followed by 2-D separation provided a rich source of phosphopeptides for neutral loss triggered

155

ETD analysis using mass spectrometry A specific motif for PKA is highlighted to show the

differences in proteins identified between starved vs fed conditions In total 477 unique

phosphopeptides are identified with 06 FDR and 669 unique phosphopeptides identified with

54 FDR Phosphosite validation is performed using a localization algorithm Ascore to

provide further confidence on the site-specific characterization of these phosphopeptides The

proteins Ssd1 and PMA2 emerged as potential intriguing targets for more in-depth studies on

protein phosphorylation involved in glucose response

Supplemental Tables 1 2 and 3 are available upon request

References

1 Martinez M J Roy S Archuletta A B Wentzell P D Anna-Arriola S S

Rodriguez A L Aragon A D Quinones G A Allen C Werner-Washburne M Genomic

analysis of stationary-phase and exit in Saccharomyces cerevisiae gene expression and

identification of novel essential genes Mol Biol Cell 2004 15 (12) 5295-305

2 Radonjic M Andrau J C Lijnzaad P Kemmeren P Kockelkorn T T van Leenen

D van Berkum N L Holstege F C Genome-wide analyses reveal RNA polymerase II

located upstream of genes poised for rapid response upon S cerevisiae stationary phase exit Mol

Cell 2005 18 (2) 171-83

3 Slattery M G Heideman W Coordinated regulation of growth genes in

Saccharomyces cerevisiae Cell Cycle 2007 6 (10) 1210-9

4 Wang Y Pierce M Schneper L GAtildefrac14ldal C G k e Zhang X Tavazoie S

Broach J R Ras and Gpa2 Mediate One Branch of a Redundant Glucose Signaling Pathway in

Yeast PLoS Biol 2004 2 (5) e128

5 Newcomb L L Hall D D Heideman W AZF1 is a glucose-dependent positive

regulator of CLN3 transcription in Saccharomyces cerevisiae Mol Cell Biol 2002 22 (5) 1607-

14

6 Newcomb L L Diderich J A Slattery M G Heideman W Glucose regulation of

Saccharomyces cerevisiae cell cycle genes Eukaryot Cell 2003 2 (1) 143-9

7 Carlson M Glucose repression in yeast Curr Opin Microbiol 1999 2 (2) 202-7

8 Gancedo J M Yeast carbon catabolite repression Microbiol Mol Biol Rev 1998 62

(2) 334-61

9 Johnston M Feasting fasting and fermenting Glucose sensing in yeast and other cells

Trends Genet 1999 15 (1) 29-33

156

10 Warner J R The economics of ribosome biosynthesis in yeast Trends Biochem Sci

1999 24 (11) 437-40

11 Li X Gerber S A Rudner A D Beausoleil S A Haas W Villen J Elias J E

Gygi S P Large-scale phosphorylation analysis of alpha-factor-arrested Saccharomyces

cerevisiae J Proteome Res 2007 6 (3) 1190-7

12 Ficarro S B McCleland M L Stukenberg P T Burke D J Ross M M

Shabanowitz J Hunt D F White F M Phosphoproteome analysis by mass spectrometry and

its application to Saccharomyces cerevisiae Nat Biotechnol 2002 20 (3) 301-5

13 Gruhler A Olsen J V Mohammed S Mortensen P Faergeman N J Mann M

Jensen O N Quantitative phosphoproteomics applied to the yeast pheromone signaling

pathway Mol Cell Proteomics 2005 4 (3) 310-27

14 Peng J Schwartz D Elias J E Thoreen C C Cheng D Marsischky G Roelofs

J Finley D Gygi S P A proteomics approach to understanding protein ubiquitination Nat

Biotechnol 2003 21 (8) 921-6

15 Lecchi S Nelson C J Allen K E Swaney D L Thompson K L Coon J J

Sussman M R Slayman C W Tandem phosphorylation of Ser-911 and Thr-912 at the C

terminus of yeast plasma membrane H+-ATPase leads to glucose-dependent activation J Biol

Chem 2007 282 (49) 35471-81

16 Cohen P The regulation of protein function by multisite phosphorylation--a 25 year

update Trends Biochem Sci 2000 25 (12) 596-601

17 Kalume D E Molina H Pandey A Tackling the phosphoproteome tools and

strategies Curr Opin Chem Biol 2003 7 (1) 64-9

18 Nagaraj N DSouza R C Cox J Olsen J V Mann M Feasibility of large-scale

phosphoproteomics with higher energy collisional dissociation fragmentation J Proteome Res

2010 9 (12) 6786-94

19 Olsen J V Vermeulen M Santamaria A Kumar C Miller M L Jensen L J

Gnad F Cox J Jensen T S Nigg E A Brunak S Mann M Quantitative

phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis Sci

Signal 2010 3 (104) ra3

20 Breitkopf S B Asara J M Determining In Vivo Phosphorylation Sites Using Mass

Spectrometry In Current Protocols in Molecular Biology John Wiley amp Sons Inc 2012

21 Steen H Jebanathirajah J A Rush J Morrice N Kirschner M W Phosphorylation

analysis by mass spectrometry myths facts and the consequences for qualitative and

quantitative measurements Mol Cell Proteomics 2006 5 (1) 172-81

22 Kweon H K Hakansson K Metal oxide-based enrichment combined with gas-phase

ion-electron reactions for improved mass spectrometric characterization of protein

phosphorylation J Proteome Res 2008 7 (2) 749-55

23 Larsen M R Thingholm T E Jensen O N Roepstorff P Jorgensen T J Highly

selective enrichment of phosphorylated peptides from peptide mixtures using titanium dioxide

microcolumns Mol Cell Proteomics 2005 4 (7) 873-86

24 Kokubu M Ishihama Y Sato T Nagasu T Oda Y Specificity of immobilized

metal affinity-based IMACC18 tip enrichment of phosphopeptides for protein phosphorylation

analysis Anal Chem 2005 77 (16) 5144-54

25 Swaney D L Wenger C D Thomson J A Coon J J Human embryonic stem cell

phosphoproteome revealed by electron transfer dissociation tandem mass spectrometry Proc

Natl Acad Sci U S A 2009 106 (4) 995-1000

157

26 Hao P Guo T Sze S K Simultaneous analysis of proteome phospho- and

glycoproteome of rat kidney tissue with electrostatic repulsion hydrophilic interaction

chromatography PLoS One 2011 6 (2) e16884

27 Rush J Moritz A Lee K A Guo A Goss V L Spek E J Zhang H Zha X

M Polakiewicz R D Comb M J Immunoaffinity profiling of tyrosine phosphorylation in

cancer cells Nat Biotechnol 2005 23 (1) 94-101

28 Ficarro S Chertihin O Westbrook V A White F Jayes F Kalab P Marto J A

Shabanowitz J Herr J C Hunt D F Visconti P E Phosphoproteome analysis of

capacitated human sperm Evidence of tyrosine phosphorylation of a kinase-anchoring protein 3

and valosin-containing proteinp97 during capacitation J Biol Chem 2003 278 (13) 11579-89

29 Washburn M P Wolters D Yates J R 3rd Large-scale analysis of the yeast

proteome by multidimensional protein identification technology Nat Biotechnol 2001 19 (3)

242-7

30 Dowell J A Frost D C Zhang J Li L Comparison of two-dimensional

fractionation techniques for shotgun proteomics Anal Chem 2008 80 (17) 6715-23

31 Song C Ye M Han G Jiang X Wang F Yu Z Chen R Zou H Reversed-

phase-reversed-phase liquid chromatography approach with high orthogonality for

multidimensional separation of phosphopeptides Anal Chem 2010 82 (1) 53-6

32 Palumbo A M Smith S A Kalcic C L Dantus M Stemmer P M Reid G E

Tandem mass spectrometry strategies for phosphoproteome analysis Mass Spectrom Rev 2011

30 (4) 600-25

33 Beausoleil S A Jedrychowski M Schwartz D Elias J E Villen J Li J Cohn M

A Cantley L C Gygi S P Large-scale characterization of HeLa cell nuclear

phosphoproteins Proc Natl Acad Sci U S A 2004 101 (33) 12130-5

34 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and

protein sequence analysis by electron transfer dissociation mass spectrometry Proc Natl Acad

Sci U S A 2004 101 (26) 9528-33

35 Coon J J Syka J E P Schwartz J C Shabanowitz J Hunt D F Anion

dependence in the partitioning between proton and electron transfer in ionion reactions

International Journal of Mass Spectrometry 2004 236 (1acirceuroldquo3) 33-42

36 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 2011 10 (9) 4219-29

37 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-

site identity and position on electron-transfer dissociation of polypeptide cations J Am Chem Soc

2007 129 (40) 12232-43

38 Coon J J Collisions or electrons Protein sequence analysis in the 21st century Anal

Chem 2009 81 (9) 3208-15

39 Miller M E Cross F R Distinct subcellular localization patterns contribute to

functional specificity of the Cln2 and Cln3 cyclins of Saccharomyces cerevisiae Mol Cell Biol

2000 20 (2) 542-55

40 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

158

41 Beausoleil S A Villen J Gerber S A Rush J Gygi S P A probability-based

approach for high-throughput protein phosphorylation analysis and site localization Nat

Biotechnol 2006 24 (10) 1285-92

42 Sutton A Immanuel D Arndt K T The SIT4 protein phosphatase functions in late

G1 for progression into S phase Mol Cell Biol 1991 11 (4) 2133-48

43 Jansen J M Wanless A G Seidel C W Weiss E L Cbk1 regulation of the RNA-

binding protein Ssd1 integrates cell fate with translational control Curr Biol 2009 19 (24)

2114-20

44 Kanczewska J Marco S Vandermeeren C Maudoux O Rigaud J L Boutry M

Activation of the plant plasma membrane H+-ATPase by phosphorylation and binding of 14-3-3

proteins converts a dimer into a hexamer Proc Natl Acad Sci U S A 2005 102 (33) 11675-80

45 Maudoux O Batoko H Oecking C Gevaert K Vandekerckhove J Boutry M

Morsomme P A plant plasma membrane H+-ATPase expressed in yeast is activated by

phosphorylation at its penultimate residue and binding of 14-3-3 regulatory proteins in the

absence of fusicoccin J Biol Chem 2000 275 (23) 17762-70

159

Figure 1 The general workflow indicating the major steps involved in sample collection

sample processing mass spectrometric data acquisition and analysis of comparative

phosphoproteomics for starved vs glucose fed yeast (A) The high pH RP separation

procedure for combining fractions to reduce low peptide containing fractions from the

UV-VIS trace is also shown (B)

160

Figure 2 Heat map for the global transcriptional response to glucose fed vs starved samples

S288C cells starved for glucose until growth was arrested and subsequently glucose was added

161

Samples for microarray analysis were taken at the starved state and 60 minutes after glucose was

added The heat map shows the fed log2 fold change for each gene relative to the starved state

across the genome performed in biological replicate (A) Black indicates no change in

expression level while red indicates higher expression for the fed relative to the starved state

Green indicates higher expression for the starved state compared to the fed state (Adapted from

Dr Michael Conways Thesis)

162

Figure 3 The motif for protein kinase A (PKA) that was observed for fed vs starved which is

xxxRKRKxSxxxxxx with the phosphorylation occurring on the serine The motif occurred at a

rate 264 fold higher than the yeast proteome used for background In addition one protein was

observed in both starved and fed with accession identification of F16P (Fructose-16-

bisphosphatase)

163

06 FDR phosphopeptide analysis

Table 1 The number of phosphopeptides identified with a 06 FDR rate for starved and

glucose fed yeast Only unique phosphopeptides are included in the numbers listed Ascore

localization probability ge80 and no Ascore value which totals all phosphopeptides regardless

of Ascore value even if the Ascore localization value was 0 are shown for starved or fed

samples The number of phosphorylation events on each unique peptide is also included The

vast majority of the phosphopeptides identified have a single phosphosite

Starved Fed All

Ascore ge80 score

unique

STY 164 153 317

S 87 (530) 82 (536) 169 (533)

T 60 (366) 55 (359) 115 (363)

Y 17 (104) 16 (105) 33 (104)

Unique no Ascore

STY 242 235 477

S 131 (541) 133 (566) 264 (553)

T 86 (355) 78 (332) 164 (344)

Y 25 (103) 24 (102) 49 (103)

Phosphorylation events

on each unique peptide

1 102 113 187

2 36 40 68

3 12 11 22

4 or more 8 3 11

164

54 FDR phosphopeptide analysis

Starved Fed All

Ascore ge80 score

unique

STY 217 217 434

S 115 (530) 113 (521) 228 (525)

T 78 (359) 78 (359) 156 (359)

Y 24 (111) 26 (120) 50 (115)

Unique no Ascore

STY 337 332 669

S 193 (573) 180 (542) 373 (558)

T 111 (329) 116 (349) 227 (339)

Y

Phosphorylation events

on each unique peptide

1

2

3

4 or more

33 (98)

135

56

16

11

36 (108)

169

55

14

3

69 (103)

280

100

27

13

Table 2 The number of phosphopeptides identified with a 54 FDR rate for starved and

glucose fed yeast Only unique phosphopeptides are included in the numbers listed Ascore

localization probability ge80 and no Ascore value which totals all phosphopeptides regardless

of Ascore value even if the Ascore localization value was 0 are shown for starved or fed

samples The number of phosphorylation events on each unique peptide is also included The

vast majority of the phosphopeptides identified have a single phosphosite

165

Figure 4 Ascore validation of the phosphopeptide LSLAtKK from the protein SGM1 (slow

growth on galactose and mannose protein 1) with 100 localization probability observed

in only fed but not starved yeast samples Ascorersquos algorithm identifies a b and y-type

ions and looks to identify peaks that provide evidence for a specific phosphorylation site

For this example Ascore assigns this phosphosite to threonine (T5) instead of the serine

(S2) The intense peaks at 4024 mz and 5250 mz are not identified by any a b or y-

type ions From the ladder sequence of the peptide shown numerous ions indicate the

threonine is phosphorylated while the serine is not Among these ions used for

localization are b2 y2 y5+H2O y3 y4 and y5

166

Chapter 6

Use of electron transfer dissociation for neuropeptide sequencing and

identification

Adapted from ldquoDiscovery and Characterization of the Crustacean Hyperglycemic Hormone

Precursor Related Peptides (CPRP) and Orcokinin Neuropeptides in the Sinus Glands of the Blue

Crab Callinectes sapidus Using Multiple Tandem Mass Spectrometry Techniquesrdquo Hui L

Cunningham R Zhang Z Cao W Jia C Li L J Proteome Res 2011 10(9) 4219-29

167

Abstract

The crustacean Callinectes sapidus sinus gland (SG) is a well-defined neuroendocrine site that

produces numerous hemolymph-borne agents including the most complex class of endocrine

signaling moleculesmdashneuropeptides One such peptide is the crustacean hyperglycemic hormone

(CHH) precursor-related peptide (CPRPs) which was not previously sequenced Electron

transfer dissociation (ETD) was used for sequencing of intact CPRPs due to their large size and

high charge state In addition ETD was used to sequence the sulfated peptide Cholecystokinin

CCK-like Homarus americanus using a salt adduct Collectively these two examples

demonstrated the utility of ETD in sequencing neuropeptides with large molecular weight or

with labile modifications

168

Introduction

Neuropeptides are the largest and most diverse group of endocrine signaling molecules in

the nervous system They are necessary and critical for initiation and regulation of numerous

physiological processes such as feeding reproduction and development1 2

Mass spectrometry

(MS) with unique advantages such as high sensitivity high throughput chemical specificity and

the capability of de novo sequencing with limited genomic information is well suited for the

detection and sequencing of neuropeptides in endocrine glands and simultaneously provides the

potential for information on post-translational modifications such as sulfonation3-6

The sinus glands (SG) are located between the medulla interna and medulla externa of the

eyestalk in Callinectes sapidus It is a well-known neuroendocrine site that synthesizes and

secretes peptide hormones regulating various physiological activities such as molting

hemolymph glucose levels integument color changes eye pigment movements and

hydro-mineral balance7 Our labrsquos previous neuropeptidomic studies of SG in several

crustacean species including Cancer borealis8-11

Carcinus maenas12

and Homarus americanus13

14 used multiple MS strategies combining MALDI-based high resolution accurate mass profiling

biochemical derivatization and nanoscale separation coupled to tandem MS for de novo

sequencing In the current study we explore the neuropeptidome of the SG in the blue crab

Callinectes sapidus a vital species of economic importance on the seafood market worldwide In

total 70 neuropeptides are identified including 27 novel neuropeptides and among them the

crustacean hyperglycemic hormone precursor-related peptides (CPRPs) and orcokinins represent

major neuropeptide families known in the SG

The crustacean hyperglycemic hormone (CHH) and its precursor-related peptide (CPRP) are

produced concurrently during the cleavage of CHH from the CHH preprohormone protein15

The

169

CPRP peptide is located between the signal peptide and the CHH sequence and is separated from

the CHH by a dibasic cleavage site The functions of CPRPs are currently unknown16

However

the complete characterization of CPRPs provides a foundation for future experiments elucidating

their functional roles in crustaceans The cDNA of CHH preprohormone in SG of Callinectes

sapidus has been characterized17

but the direct detection of CPRP has not been reported due to

its relatively large size and possible post-translational modifications While small fragments of

CPRPs can be identified using nanoLC-ESI-Q-TOF tandem mass spectrometry the intact

peptide is difficult to detect due to the large molecular weight of CPRPs

Peptidomics traditionally uses collision induced dissociation (CID) for tandem MS

experiments for de novo sequencing Recently an alternative peptide fragmentation method has

been developed using the transfer of an electron termed electron transfer dissociation (ETD)18 19

ETD involves a radical anion with low electron affinity to be transferred to peptide cation which

results in reduced sequence discrimination and thus provides improved sequencing for larger

peptides compared to CID20

Specifically for an ion trap ETD the fluoranthene radical anion is

allowed to react with peptide cations in the three dimensional trap The resulting dissociation

cleaves at the N-Cα bond to produce c and zbull type product ions The peptidome of model

organisms like Callinectes sapidus can be further explored and expanded utilizing ETD as a

complementary fragmentation technique to CID Previous peptidomic analysis has been

completed using ETD as an additional fragmentation method21

It was observed that

enzymatically produced peptides with a higher mz produced improved sequence coverage using

ETD This observation termed decision tree analysis determined that a charge state of ge6 all

peptides endogenous or enzymatic should be fragmented via ETD22

In the present study the

highly charged peptide CPRP with an intact molecular weight of 3837 readily produces plus six

170

charge states The higher charge state of CPRP is highly amenable to fragmentation by ETD

which produces remarkably improved fragmentation and thus increased sequence coverage when

compared to CID

Sulfonation of tyrosine is a post-translational modification (PTM) that is thought to occur on

trans-membrane spanning and secreted proteins23

Cholecystokinin-8 (CCK-8) is a sulfated

peptide which has been linked to satiety24-26

and a CCK-like peptide has been observed in

lobster27

Sulfonation is an extremely labile modification and is often lost during soft

ionization such as ESI or MALDI but can be avoided by optimizing the ionization process28

One potential strategy to identify sulfopeptides is to scan for a neutral loss of 80 Da after CID

but this method does not allow for identification of site of sulfonation and has the risk to be

mistaken for phosphorylation The sulfonation PTM is a negatively charged modification on

the peptide which allows for negative ion scanning in the mass spectrometer but provides

minimal MSMS sequence coverage during CID due to preferential loss of the sulfate group

Alternatively electron-based dissociation technique enables more rapid radical driven

fragmentation where the cleavage pattern is more uniform along the peptide backbone without

initially cleaving the labile sulfated group thus preserving the site of modification These types

of dissociation techniques only work for multiply-charged ions thus a method to install a

multiply-charged cation on the target peptide is desirable It has been shown that the electron

capture dissociation (ECD) can be used to sequence sulfated peptides when a multi-charged

cation is added to the solution29

We use a similar multi-charge cation solution technique to

sequence a sulfated peptide from Homarus americanus via ETD on an ion trap mass

spectrometer Here we presented the use of the ETD fragmentation technique for the analysis

of large peptides and peptides containing labile post-translational modification

171

Experimental Section

Chemical and materials

Acetonitrile methanol magnesium chloride (ACS grade) potassium chloride (ACS grade) and

calcium chloride (ACS grade) were purchased from Fisher Scientific (Pittsburgh PA) Formic

acid was from Sigma-Aldrich (St Louis MO) Human sulfated CCK-8 and CCK-like peptide

(E(pyro)FDEY(Sulfo)GHMRFamide) were purchased from American Peptide (Sunnyvale CA)

Fused-silica capillary with 75 μm id and 360 μm od was purchased from Polymicro

Technologies (Phoenix AZ) Millipore C18 Ziptip column was used for sample cleaning and all

water used in this study was deionized water (182 MΩcm) prepared from a Milli-Q Millipore

system (Billerica MA) The physiological saline consisted of 440 mM NaCl 11mM KCl 26

mM MgCl2 13 mM CaCl2 11 mM Trizma base and 5 mM maleic acid in pH 745

Animals and dissection

Callinectes sapidus (blue crab) were obtained from commercial food market and maintained

without food in artificial sea water at 10-12 ordmC Animals were cold-anesthetized by packing on

ice for 15-30min before dissection The optic ganglia with the SG attached were dissected in

chilled (~4ordmC) physiological saline and individual SGs were isolated from the optic ganglia by

micro-dissection and immediately placed in acidified methanol (90 methanol 9 glacial acetic

acid and 1 water) and stored at -80ordmC until tissue extraction

Tissue homogenization

Acidified methanol was used during the homogenization of SGs The homogenized SGs were

immediately centrifuged at 16100 g using an Eppendorf 5415D tabletop centrifuge (Eppendorf

172

AG) The pellet was washed using acidified methanol and combined with the supernatant and

further dried in a Savant SC 110 SpeedVac concentrator (Thermo Electron Corporation) The

resulting dried sample was re-suspended with a minimum amount (20 microL) of 01 formic acid

Fractionation of homogenates using reversed phase (RP)-HPLC

The re-suspended extracts were then vortexed and briefly centrifuged The resulting supernatants

were subsequently fractionated via high performance liquid chromatography (HPLC) HPLC

separations were performed using a Rainin Dynamax HPLC system equipped with a Dynamax

UV-D II absorbance detector (Rainin Instrument Inc Woburn MA) The mobile phases included

Solution A (deionized water containing 01 formic acid) and Solution B (acetonitrile containing

01 formic acid) About 20 μl of extract was injected onto a Macrosphere C18 column (21 mm

id x 250 mm length 5 microm particle size Alltech Assoc Inc Deerfield IL) The separation

consisted of a 120 minute gradient of 5-95 Solution B Fractions were automatically collected

every two minutes using a Rainin Dynamax FC-4 fraction collector (Rainin Instrument Inc

Woburn MA) The fractions were then concentrated using in Savant SC 110 SpeedVac

concentrator (Thermo Electron Corporation) and re-suspended with minimum amount of 01

formic acid

Nano-LC-ESI-Q-TOF MSMS

Nanoscale LC-ESI-Q-TOF MSMS was performed using a Waters nanoAcquity UPLC system

coupled to a Q-TOF Micro mass spectrometer (Waters Corp Milford MA)

Chromatographic separations were performed on a homemade C18 reversed phase capillary

column (75 microm internal diameter x100 mm length 3 microm particle size 100 Aring) The mobile phases

173

used were 01 formic acid in deionized water (A) 01 formic acid in acetonitrile (B) An

aliquot of 50 microl of a tissue extract or HPLC fraction was injected and loaded onto the trap

column (Zorbax 300SB-C18 Nano trapping column Agilent Technologies Santa Clara CA)

using 95 mobile phase A and 5 mobile phase Bat a flow rate of 10 microlmin for 10 minutes

Following this the stream select module was switched to a position at which the trap column

came in line with the analytical capillary column and a linear gradient of mobile phases A and B

was initiated For neuropeptides the linear gradient was from 5 buffer B to 45 buffer B over

90 min The nanoflow ESI source conditions were set as follows capillary voltage 3200 V

sample cone voltage 35 V extraction cone voltage 1 V source temperature 100ordmC A data

dependent acquisition was employed for the MS survey scan and the selection of three precursor

ions and subsequent MSMS of the selected parent ions The MS scan range was from mz

400-1800 and the MSMS scan was from mz 50-1800

Peptide Prediction De Novo Sequencing and Database Searching

De novo sequencing was performed using a combination of MassLynxTM

41 PepSeq software

(Waters) and manual sequencing Tandem mass spectra acquired from the Q-TOF were first

deconvoluted using MaxEnt 3 software (Waters) to convert multiply charged ions into their

singly charged forms The resulting spectra were pasted into the PepSeq window for sequencing

analysis The candidate sequences generated by the PepSeq software were compared and

evaluated for homology with previous known peptides The online program blastp (National

Center for Biotechnology Information Bethesda MD httpwwwncbinlmnihgovBLAST)

was used to search the existing NCBI crustacean protein database using the candidate peptide

sequences as queries For all searches the blastp database was set to non-redundant protein

174

sequences (ie nr) and restricted to crustacean sequences (ie taxid 6657) For each of the

proteins putatively identified via blastp the BLAST score and BLAST-generated E-value for

significant alignment are provided in the appropriate subsection of the results Peptides with

partial sequence homology were selected for further examination by comparing theoretical

MSMS fragmentation spectra generated by PepSeq with the raw MSMS spectra If the

fragmentation patterns did not match well manual sequencing was performed

NanoLC Coupled to MSMS by CID and ETD

Setup for RP nanoLC separation

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN (Fisher Scientific Optima LCMS grade Fair Lawn NJ) Injections

consisted of 5 microL of prepared tissue extract onto an Agilent Technologies Zorbax 300 SB-C18 5

microm 5 x 03 mm trap cartridge (Santa Clara CA) at a flow rate of 5 microLmin for 5 minutes at 95

A5 B Separation was performed on a Waters HPLC column with 3 microm Atlantis dC18 75 microm

x 150 mm (Milford MA) on a gradient from 5 to 45 mobile phase B at 250 nLmin over 90

minutes at room temperature Emitter tips were pulled from 75 microm inner diameter 360 microm

outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using a commercial

laser puller model P-2000 (Sutter Instrument Co Novato CA)

Mass Spectrometry Data Acquisitions for Collision Induced Dissociation (CID)

An ion-trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany) equipped

with an on-line nanospray source was used for mass spectrometry data acquisition Hystar

(Version 32 Bruker Daltonics Bremen Germany) was used to couple and control Eksigent

175

nanoLC software (Dublin CA Version 30 Beta Build 080715) to MS acquisition for all

experiments Smart parameter setting (SPS) was set to mz 700 compound stability and trap

drive level were set at 100 Optimization of the nanospray source resulted in dry gas

temperature 125oC dry gas 40 Lmin capillary voltage -1300 V end plate offset -500 V

MSMS fragmentation amplitude 10V and SmartFragmentation set at 30-300

Data was generated in data dependent mode with strict active exclusion set after two spectra and

released after one minute MSMS was obtained via CID fragmentation for the six most

abundant MS peaks with a preference for doubly charged ions and excluded singly charged ions

For MS generation the ion charge control (ICC) target was set to 200000 maximum

accumulation time 5000 ms four spectral average acquisition range of mz 300-1500 and scan

speed (enhanced resolution) of 8100 mz s-1

For MSMS generation the ICC target was set to

200000 maximum accumulation time 5000 ms three spectral averages acquisition range of

mz 100-2000 and scan speed (Ultrascan) of 32000 mz s-1

Mass Spectrometry Data Acquisitions for Electron Transfer Dissociation (ETD)

The amaZon ETD ion-trap mass spectrometer was operated in electron transfer dissociation for

MSMS fragmentation with the same optimized settings as reported for CID unless otherwise

stated Smart parameter setting (SPS) was set to 500 mz compound stability and trap drive

level were set at 100 MSMS was obtained via ETD fragmentation for the four most

abundant MS peaks with no preference for specifically charged ions except to exclude singly

charged ions The ETD reagent parameters were set to 400000 target ICC for the fluoranthene

radical anion The fluoranthene radical anion has an mz of 210 and therefore the 210 mz value

was removed from the resulting MSMS spectra and 160 mz was set for the MSMS low mz

cut-off

176

Direct Infusion of Sulfated Peptide Standards into the Mass Spectrometer using ETD and

CID Fragmentation

The sulfated peptide standards were infused into the mass spectrometer at a flow rate of 300

nlmin using a fused-silica capillary with 75 μm id and 360 μm od coupled to a custom pulled

tip The CCK-8 human peptide (20 microM) was infused in 5050 ACNH2O and detected in

negative ionization mode with an ICC of 70000 and fragmented with CID using the same

settings as the previous CID experiments Alternatively the CCK-like lobster sulfated peptide

(2 microM) was mixed with 30 microM of different salts (MgCl2 KCl or CaCl2) and 01 formic acid in

5050 ACNH2O The plus three charge of the CCK-like lobster peptide was selected for ETD

fragmentation in positive mode with the same setting as the previous ETD experiments The

data were then de novo sequenced for coverage and localization of the sulfation site

Data Analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen Germany)

Deviations in parameters from the default Protein Analysis in DataAnalysis were as follows

intensity threshold 1000 maximum number of compounds 1E7 and retention time window 05

minutes These parameter changes assisted in providing better quality spectra for sequencing

Identification of peptides was performed using Mascot (Version 23 Matrix Science London

UK) Searches were performed against a custom crustacean database none enzyme allow up to

1 missed cleavage amidated C-terminal as variable modifications peptide tolerance mass error

12 Da MSMS mass error tolerance is 06 Da

Results and Discussion

177

Identification and Characterization of Intact CPRPs Using ETD

Using ESI-Q-TOF 9 truncated CPRPs fragments were sequenced to cover the amino acid

sequence of the intact CPRP peptide RSAEGLGRMGRLLASLKSDTVTPLRGFEGETGHPLE

A representative CID MSMS spectrum of ASLKSDTVTPLR is shown in Figure 1 (a) CID

using ESI-Q-TOF mainly produces fragmentation for doubly and triply charge molecules which

is adequate to sequence peptides with relatively small molecular weight (less than 2000 Da)

However CID fragmentation is insufficient to sequence the larger intact CPRP in a complex

sample whose molecular mass is 3837 Da and therefore it is extremely difficult to directly

sequence by traditional CID ESI-Q-TOF ETD is an attractive fragmentation alternative to

sequence the highly charged CPRP peptide In ETD a protonated peptide reacts with an anion

(fluoranthene radical) where the anion transfers an electron to the peptide cation and the resulting

fragmentation occurs along the N-Cα bond by free-radical-driven-cleavage The large size of

CPRP offers multiple basic amino acid sites to sequester charge and reduce the sequence

coverage from collision induced dissociate by preventing random backbone cleavage whereas

ETD does not cleave the weakest bond and provides the alternative fragmentation pathway to

obtain superior sequence coverage to larger peptides As shown in Figure 1 (cd) the

fragmentation of ETD is highly amenable to ETD compared to the CID fragmentation in Figure

1 (b) As evident from comparison ETD offers more extensive fragmentation than CID thus

providing enhanced coverage for large intact peptides such as CPRPs Specifically we observe

125 of the b- and y- cleavages for CID as compared to an average of 535 of c- and zbull-

fragments More than a four-fold increase in fragments using ETD suggests the utility of this

relatively new tandem MS fragmentation method as complementary techniques for de novo

sequencing of large intact endogenous peptides and novel neuropeptide discovery endeavors

178

Negative Mode Sulfated Peptide Identification

An accepted method for identification and quantification for sulfated peptides is negative

ionization30

CCK-8 sulfated peptide standards show intense signal in negative ionization mode

without needing to have additives added such as salts Figure 2 shows the CID MSMS

spectrum from the negative ionization and produces an intense b6 ion at 430 mz The transition

from the doubly negatively charged MS ion of 570 mz to the b6 ion provides a selected reaction

monitoring transition for quantification studies but the sequence information is limited and the

presence of the methionine produces variable oxidation In addition Figure 2 shows the

MSMS product ions with the loss of the sulfate group thus making site-specific location of

sulfation more difficult

Investigation of Cation Salt Adducts to Produce Multiply Charged Sulfated Peptides

Sulfated peptides can be isolated in positive scanning mode but usually only in a plus one

state with low signal intensity If ETD is performed on the singly charged peptide cation a

neutral is formed and is lost in the mass spectrometer and thus no sequence information can be

obtained In order to remedy this situation a technique that adding metal salts to peptides to

increase charge state for ECD used in Fourier transform ion cyclotron resonance mass

spectrometry (FTICR-MS)29

inspired the investigation of increasing charge state of targeted

peptide via metal salt adduction followed by ETD analysis in a Bruker amaZon ion trap

Various salts were investigated including MgCl2 CaCl2 and KCl each used at a concentration of

30 microM and mixed with a sulfated peptide standard at 2 microM The addition of KCl produced

mostly doubly charged ion of the lobster CCK-like peptide standard K adduct which produced

insufficient sequence information from ETD fragmentation (data not shown) In comparison

the MgCl2 and CaCl2 produced intense triply charged peptide ions but CaCl2 produced lower

179

signal intensity compared to MgCl2 (data not shown)

Cation Assisted ETD Fragmentation of CCK-like Lobster Sulfated Peptide and Future

Directions

The addition of MgCl2 to the lobster CCK-like sulfated peptide is shown in Figure 3

Except for z1 the complete z-series is obtained including the z7 ion with and without the

sulfation PTM The spectrum is complicated by the additional Mg adducts but several peaks

are shown without any Mg adduct such as z2 z3 z4 etc Clearly the method of cation

assisted ETD fragmentation can be useful using ion trap instruments and can properly sequence

sulfated peptides that are prone to neutral loss from the labile PTM One concern about future

direction in sulfopeptide isolation in non-genome sequenced species is isolation of sulfopeptides

Currently antibodies to specific sulfopeptides is the only commonly used enrichment technique

for sulfopeptides Also since metal cations are needed for this method direct infusion into an

ESI mass spectrometer is preferred to prevent running large amounts of adduct forming salts

through the LC system With direct infusion the lack of separation confounds the problem in

sulfopeptide detection because any co-eluting peptides could have ionization efficiency and thus

reduce the signal of the sulfopeptide Once discovered and sequenced a selected reaction

monitoring (SRM) method could be developed using LC retention coupled with negative

ionization mode followed by CID MSMS or possibly MSMSMS to perform quantitative

studies for sulfopeptides

Conclusions

In this study ETD was performed to improve the sequence coverage of large endogenous

neuropeptides with higher charge and bigger size The intact CPRP from Callinectes sapidus was

identified and characterized with 68 sequence coverage by MS for the first time Cation

180

assisted ETD fragmentation was also shown to be a powerful tool in de novo sequencing of

sulfated neuropeptides These endeavors into using ETD for certain neuropeptides will assist in

future analysis of large neuropeptides and PTM containing neuropeptides

181

Reference

1 Schwartz M W Woods S C Porte D Jr Seeley R J Baskin D G Central nervous system control of

food intake Nature 2000 404 (6778) 661-71

2 Sweedler J V Li L Rubakhin S S Alexeeva V Dembrow N C Dowling O Jing J Weiss K R

Vilim F S Identification and characterization of the feeding circuit-activating peptides a novel neuropeptide

family of aplysia J Neurosci 2002 22 (17) 7797-808

3 Baggerman G Cerstiaens A De Loof A Schoofs L Peptidomics of the larval Drosophila melanogaster

central nervous system Journal of Biological Chemistry 2002 277 (43) 40368-40374

4 Desiderio D M Mass spectrometric analysis of neuropeptidergic systems in the human pituitary and

cerebrospinal fluid Journal of Chromatography B 1999 731 (1) 3-22

5 Li L J Kelley W P Billimoria C P Christie A E Pulver S R Sweedler J V Marder E Mass

spectrometric investigation of the neuropeptide complement and release in the pericardial organs of the crab Cancer

borealis Journal of Neurochemistry 2003 87 (3) 642-656

6 Li L J Moroz T P Garden R W Floyd P D Weiss K R Sweedler J V Mass spectrometric survey of

interganglionically transported peptides in Aplysia Peptides 1998 19 (8) 1425-1433

7 Strand F L Neuropeptides regulators of physiological processes 1st ed ed MIT Press Cambridge Mass

1999 p 658 p

8 Fu Q Goy M F Li L J Identification of neuropeptides from the decapod crustacean sinus glands using

nanoscale liquid chromatography tandem mass spectrometry Biochemical and Biophysical Research

Communications 2005 337 (3) 765-778

9 Fu Q Christie A E Li L J Mass spectrometric characterization of crustacean hyperglycemic hormone

precursor-related peptides (CPRPs) from the sinus gland of the crab Cancer productus Peptides 2005 26 (11)

2137-2150

10 Ma M M Chen R B Ge Y He H Marshall A G Li L J Combining Bottom-Up and Top-Down Mass

Spectrometric Strategies for De Novo Sequencing of the Crustacean Hyperglycemic Hormone from Cancer borealis

Analytical Chemistry 2009 81 (1) 240-247

11 Ma M M Sturm R M Kutz-Naber K K Fu Q Li L J Immunoaffinity-based mass spectrometric

characterization of the FMRFamide-related peptide family in the pericardial organ of Cancer borealis Biochemical

and Biophysical Research Communications 2009 390 (2) 325-330

12 Ma M M Bors E K Dickinson E S Kwiatkowski M A Sousa G L Henry R P Smith C M Towle

D W Christie A E Li L J Characterization of the Carcinus maenas neuropeptidome by mass spectrometry and

functional genomics General and Comparative Endocrinology 2009 161 (3) 320-334

13 Chen R B Jiang X Y Conaway M C P Mohtashemi I Hui L M Viner R Li L J Mass Spectral

Analysis of Neuropeptide Expression and Distribution in the Nervous System of the Lobster Homarus americanus

Journal of Proteome Research 2010 9 (2) 818-832

14 Ma M M Chen R B Sousa G L Bors E K Kwiatkowski M A Goiney C C Goy M F Christie A

E Li L J Mass spectral characterization of peptide transmittershormones in the nervous system and

neuroendocrine organs of the American lobster Homarus americanus General and Comparative Endocrinology

2008 156 (2) 395-409

15 Ollivaux C Gallois D Amiche M Boscameric M Soyez D Molecular and cellular specificity of

post-translational aminoacyl isomerization in the crustacean hyperglycaemic hormone family FEBS J 2009 276

(17) 4790-802

16 Wilcockson D C Chung J S Webster S G Is crustacean hyperglycaemic hormone precursor-related

peptide a circulating neurohormone in crabs Cell and Tissue Research 2002 307 (1) 129-138

17 Choi C Y Zheng J Watson R D Molecular cloning of a cDNA encoding a crustacean hyperglycemic

hormone from eyestalk ganglia of the blue crab Callinectes sapidus General and Comparative Endocrinology 2006

148 (3) 383-387

18 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and protein sequence analysis

by electron transfer dissociation mass spectrometry Proc Natl Acad Sci U S A 2004 101 (26) 9528-33

19 Coon J J Syka J E P Schwartz J C Shabanowitz J Hunt D F Anion dependence in the partitioning

between proton and electron transfer in ionion reactions International Journal of Mass Spectrometry 2004 236

(1-3) 33-42

20 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-site identity and

position on electron-transfer dissociation of polypeptide cations J Am Chem Soc 2007 129 (40) 12232-43

182

21 Altelaar A F Mohammed S Brans M A Adan R A Heck A J Improved identification of endogenous

peptides from murine nervous tissue by multiplexed peptide extraction methods and multiplexed mass spectrometric

analysis J Proteome Res 2009 8 (2) 870-6

22 Swaney D L McAlister G C Coon J J Decision tree-driven tandem mass spectrometry for shotgun

proteomics Nat Methods 2008 5 (11) 959-64

23 Monigatti F Hekking B Steen H Protein sulfation analysis--A primer Biochim Biophys Acta 2006 1764

(12) 1904-13

24 Chandler P C Wauford P K Oswald K D Maldonado C R Hagan M M Change in CCK-8 response

after diet-induced obesity and MC34-receptor blockade Peptides 2004 25 (2) 299-306

25 Little T J Feltrin K L Horowitz M Meyer J H Wishart J Chapman I M Feinle-Bisset C A

high-fat diet raises fasting plasma CCK but does not affect upper gut motility PYY and ghrelin or energy intake

during CCK-8 infusion in lean men Am J Physiol Regul Integr Comp Physiol 2008 294 (1) R45-51

26 Blevins J E Morton G J Williams D L Caldwell D W Bastian L S Wisse B E Schwartz M W

Baskin D G Forebrain melanocortin signaling enhances the hindbrain satiety response to CCK-8 Am J Physiol

Regul Integr Comp Physiol 2009 296 (3) R476-84

27 Turrigiano G G Selverston A I A cholecystokinin-like hormone activates a feeding-related neural circuit in

lobster Nature 1990 344 (6269) 866-8

28 Wolfender J L Chu F Ball H Wolfender F Fainzilber M Baldwin M A Burlingame A L

Identification of tyrosine sulfation in Conus pennaceus conotoxins alpha-PnIA and alpha-PnIB further investigation

of labile sulfo- and phosphopeptides by electrospray matrix-assisted laser desorptionionization (MALDI) and

atmospheric pressure MALDI mass spectrometry J Mass Spectrom 1999 34 (4) 447-54

29 Liu H Hakansson K Electron capture dissociation of tyrosine O-sulfated peptides complexed with divalent

metal cations Anal Chem 2006 78 (21) 7570-6

30 Young S A Julka S Bartley G Gilbert J R Wendelburg B M Hung S C Anderson W H

Yokoyama W H Quantification of the sulfated cholecystokinin CCK-8 in hamster plasma using

immunoprecipitation liquid chromatography-mass spectrometrymass spectrometry Anal Chem 2009 81 (21)

9120-8

183

Figure 1 Comparison of tandem MS spectra of truncated CPRP (ASLKSDTVTPL mz 128766)

by ESI-Q-TOF (a) and intact CPRP (MW 3837) by the amaZon ETD for both CID and ETD

fragmentation (a) MSn of precursor ion with mz 640 with charge state +6 by CID represent

loss of NH3 ordm represent loss of H2O (b) MS+6

of precursor ion with mz 640 with charge state +6

by ETD at z represent z+1 z represent z+2 (c) MS+5

of precursor ion with mz 768 with charge

state +5 by ETD z represent z+1 z represent z+2 For all labels in Figure 1 charge state +1 is

not specified

184

185

Figure 2 Negative CID spectrum of CCK-8 peptide standard (20 microM) via direct infusion show

the doubly charged b6 ion provides the most intense MSMS transition

186

Figure 3 ETD spectrum of E(pyro)FDEY(Sulfo)GHMRF-amide (2 microM) obtained on the

amaZon ETD via direct infusion with 30 microM MgCl2 The sulfated peptide can be identified

with a visible z-series of z2 to z9 and identified sulfate loss

187

Chapter 7

Investigation and reduction of sub-microgram peptide loss using molecular

weight cut-off fractionation prior to mass spectrometric analysis

Adapted from ldquoInvestigation and reduction of sub-microgram peptide loss using molecular

weight cut-off fractionation prior to mass spectrometric analysisrdquo Cunningham R Wang J

Wellner D Li L Journal of Mass Spectrometry In Press

188

ABSTRACT

This work investigates the introduction of methanol and a salt modifier to molecular

weight cut-off membrane-based centrifugal filters (MWCO) to enrich sub-microgram peptide

quantities Using a neuropeptide standard bradykinin sample loss was reduced over two orders

of magnitude with and without undigested protein present Additionally a bovine serum

albumin (BSA) tryptic digestion was investigated Twenty-seven tryptic peptides were identified

from MALDI mass spectra after enriching with methanol while only two tryptic peptides were

identified after the standard MWCO protocol The strategy presented here enhances recovery

from MWCO separation for sub-microg peptide samples

INTRODUCTION

Molecular weight cut-off membrane-based centrifugal filter devices (MWCO) are

commonly used to desalt and concentrate large molecular weight proteins [1] Greeing and

Simpson recently investigated various MWCO membranes for large amounts of starting material

(~6 mg) focusing on optimal conditions for the sub 25 kDa protein fraction [2] The authors

recovered 200 μg to 29 mg of protein from multiple MWCO experiments and demonstrated that

a 1090 acetonitrile (ACN)H2O elution solvent produced optimal results [3] In addition Manza

et al provided an alternative approach to isolate proteins with a 5 kDa MWCO by using

NH4HCO3 and recovering the retained proteins [4] Alternatively the elution from MWCOs can

be collected to recover only low molecular weight peptides Multiple peptidomic studies have

utilized MWCOs for peptide isolation during the first few steps of sample preparation [5 6]

When sample amount is limited or peptide content is below 1 microg sample loss is a significant

concern when using MWCOs to isolate endogenous peptides Optimized protocols have been

189

investigated using ACN [3 7] salt [4 8] SDS [5] or native sample [6 9 10] but these

experiments primarily focused on large sample amounts rather than sub-microgram peptide

quantities

MWCOs separate large molecules from small molecules The small molecule fraction

may be rich with signaling peptides (SP) cytokines and other small molecules involved in cell-

cell signaling Signaling peptides perform various functions in the body including cell growth

cell survival and hormonal signaling between organs [11] Individual SP contribute to different

aspects of behavior such as pain (enkephalins) [12] feeding (neuropeptide Y) [13] and blood

pressure (bradykinin) [14] MWCO separations can be used to enrich biologically important SP

and explore the peptide content from biological fluids with relatively low peptide content like

blood or cerebrospinal fluid (CSF) In a recent investigation the detection of neuropeptides and

standards in crustacean hemolymph was improved when methanol and protease inhibitors were

present before performing MWCO neuropeptide isolation The impact of methanol on MWCO

sample loss was not investigated in the study [15] In another study a large-scale mass

fingerprinting protocol of endogenous peptides from CSF used a combination of salts before

MWCO fractionation but the impact of adding salts was not discussed [16] The most

commonly used brand of MWCO in the publications and in peptidomic studies is Millipore

Therefore Millipore MWCOs (using regenerated cellulose as the membrane) were used in the

present study The purpose of this work is to provide an optimized sample preparation technique

for MWCO filtering to reduce sample loss and allow sub-microg detection of peptides using MALDI

mass spectrometry

MATERIALS AND METHODS

190

Materials and Chemicals

Water acetonitrile methanol (optima LCMS grade) and sodium chloride (995) were

purchased from Fisher Scientific (Fair Lawn NJ) The α-cyano-4-hydroxy-cinnamic acid (99)

formic acid (FA) (ge98) and bovine serum albumin (ge96) were purchased from Sigma-

Aldrich (St Louis MO) Amicon Ultra 05 mL 10000 MWCO centrifugal filters and ZipTips

packed with C18 reversed-phase resin were purchased from Millipore (Billerica MA) Trypsin-

digested bovine serum albumin (BSA) was purchased from Waters (Milford MA) Bradykinin

was purchased from American Peptide Company (Sunnyvale CA)

MALDI MS Instrumentation

An AutoFLEX III MALDI TOFTOF mass spectrometer (Bruker Daltonics Billerica

MA) was operated in positive ion reflectron mode The MALDI MS instrument is equipped with

a proprietary smart beam (Bruker Daltonics Billerica MA) laser operating at 200 Hz The

instrument was internally calibrated over the mass range of mz 500minus2500 using a standard

peptide mix Two thousand laser shots were collected per sample spot at an accelerating voltage

of 19 kV and a constant laser power using random shot selection The acquired data were

analyzed using FlexAnalysis software (Bruker Daltonics Billerica MA) Mass spectrometry

data acquisition was obtained by averaging 2000 laser shots

Molecular weight cut off separation procedure

The MWCO separations were performed using Amicon Ultra 05 mL 10000 MWCO

centrifugal filters (Billerica MA) Before MWCO separation three washing steps were

performed to remove contaminants from the filter The three washes were 500 μL of 5050

H2OMeOH 500 μL of H2O and 400 μL of the solution used for MWCO separation For the

191

100 H2O solution 1 microg of BSA or bradykinin was used for separation All the other MWCO

separation experiments used 500 ng of BSA or 100 ng or less of bradykinin The MWCO filter

was then centrifuged at 14000 rcf for 5 min at room temperature in an Eppendorf 5415 D

microcentrifuge (Brinkmann Instruments Inc Westbury NY) The filtrate was concentrated in a

Savant SC 110 SpeedVac concentrator (Thermo Electron Corporation West Palm Beach FL)

and acidified The resulting sample was desalted according to the manufacturer using C18

ZipTips from Millipore (Billerica MA) by washing the ZipTip with three times 100 ACN

three aqueous washes of 01 FA binding the peptides from the solution and one aqueous wash

of 01 FA Peptides were then eluted from the ZipTips using 15 microL of 50 ACN in 01 FA

Matrix deposition

Equal volumes of 05 μL from the 15 μL sample solution (including standards not subject

to MWCO filtering) and α-cyano-4-hydroxy-cinnamic acid (CHCA) matrix solution in 50

ACN50 H2O were mixed using a dried-droplet method and spotted on a MALDI target The

resulting droplets were allowed to air dry prior to mass spectrometry acquisition

RESULTS AND DISCUSSION

Analysis of two orders of magnitude increase for bradykinin standard

Bradykinin was selected to assess the potential peptide loss in the flow-through after

performing MWCO separation As shown in Figure 1 1 microg of bradykinin standard does not

produce a detectable signal by MALDI mass spectrometry analysis after a 10 kDa MWCO

separation in water (performed in triplicate) For comparison 1 ng of bradykinin standard

diluted to 15 microL produced an intense signal on the MALDI mass spectrometer suggesting

192

significant sample loss occurs when the target analyte is low in quantity (data not shown

performed in triplicate) Figure 1 shows that the addition of a salt in this case NaCl improves

the limits of detection and decreases sample loss when 7030 watermethanol was compared to

7030 aqueous 1 M NaClmethanol The reproducible results gave a relative standard deviation

(RSD) of 6 for peak intensity Figure 1 shows that even when starting with 1 μg of bradykinin

too much sample is lost during the MWCO separation in water to detect the remainder

However an intense signal is observed for 10 ng of bradykinin after the MWCO separation when

7030 aqueous 1 M NaClmethanol is used as an elution solvent Sample loss from zip-tipping

was estimated in triplicate by calculating the decreases in peak intensity When 10 ng of

bradykinin was used sample loss was ~41 In comparison the calculated yield for 10 ng of

bradykinin after the 7030 aqueous 1 M NaClmethanol MWCO separation and zip-tip desalting

showed an estimated sample loss of 63 meaning more loss can be attributed to sample clean-

up than MWCO filtration

A series of experiments were performed to determine if 7030 aqueous 1 M

NaClmethanol is an optimal solution to recover peptides during a MWCO separation (data not

shown) A 5050 aqueous 1 M NaClmethanol and a 5050 watermethanol elution were

performed in duplicate but signal intensity of the resulting bradykinin was poor and numerous

polymer peaks were detected in the flow through A lower salt concentration 01 M NaCl was

used but also produced greatly reduced signal when compared to aqueous 1 M NaCl To assess

the optimized methodrsquos compatibility with lower sample amounts the 7030 aqueous 1 M

NaClmethanol solution was added to 1 ng of bradykinin for MWCO separation but no signal

was obtained (data not shown) Using a neuropeptide standard the addition of methanol and

NaCl salt significantly improved the sample recovery in sub-microg amounts

193

BSA tryptic peptide mixture analysis

After demonstrating the importance of using an optimized solution for MWCO

separations with an individual peptide the new method was applied to 500 ng of BSA tryptic

digest to investigate its utility with more complex peptide mixtures Table 1 lists the BSA

tryptic peptides identified in the MALDI MS analysis from different solution conditions

processed by MWCO separation As shown in Table 1 a directly spotted BSA tryptic peptide

standard in the absence of any MWCO filtration enabled identification of 39 tryptic peptides by

accurate peptide mass measurements Once again when using 100 H2O for MWCO

separations the starting amount was doubled to 1 microg (also done with 500 ng data not shown)

However many tryptic peptides were not detected due to low signal intensities and non-optimal

elution conditions Instead of H2O a 1 M NaCl solution was used for the MWCO elution but

only two tryptic peptides were identified (Table 1) The addition of 30 methanol into the

sample before MWCO filtration produced the first increase in identified BSA tryptic peptides

The remaining data from Table 1 shows improved BSA tryptic peptide identifications as the

sample (elution) conditions were further optimized Figure 2 shows the actual mass spectra

associated with the three most promising elution solutions along with 100 H2O

The BSA tryptic peptide intensities are shown in Figure 2A and the most intense tryptic

peptide YLYEIAR mz 92749 was observed in the four different solutions shown in Figure 2B

but not in the 100 H2O or 100 1 M NaCl solutions (data not shown) In Figure 2A all mass

spectra are normalized to an intensity of 7 x 104 to illustrate two points First the MWCO

filtering step still produced sample loss regardless of the solvent conditions chosen Second

there is a noticeable increase in peptide peak intensity using the optimized solvent 6040

194

aqueous 1 M NaClmethanol (Figure 2A) Figure 2C displays a zoomed-in view of a BSA

tryptic peptide signal LKECC

DKPLLEK mz 153266 (

carbamidomethyl) observed only in

the optimized solvent The detection of the mz 153266 peptide in Figure 2C highlights the

potential gain in sample and detectable peptides by using an optimized saltMeOH combination

A non-optimized saltMeOH combination will still reduce sample loss but further minimizing

sample loss during sample preparation will always be desirable in any analytical protocol

MWCO composition

The purpose of this application note is to provide evidence of sub-microg sample loss during

MWCO separations of peptide samples and a solution to overcome this limitation The

explanation of why adding MeOH and NaCl to the sample solution provides a significant

reduction in sample loss is beyond the scope of this application note Regardless Supplemental

Table S1 is an expanded version of Table 1 showing the amino acid sequence hydrophobicity

calculated using GRAVY scores and pI of the identified peptides in this study No discernible

trend was obtained from the data The membrane of commonly used MWCO in peptidomics and

for this study is comprised of chemically treated (regenerated) cellulose which is a

polysaccharide containing β (1rarr4) linked D-glucose Glucose has numerous free hydroxyl

groups which could non-specifically adsorb peptides flowing through the MWCO The addition

of MeOH has the most significant effect on signal which could be due to disrupting the

interaction between peptides and hydroxyl groups from glucose NaCl has a less significant

effect on sample recovery compared to MeOH but a detectable reduction in sample loss is noted

This improvement in sample recovery could be analogous to the use of NaCl in

195

immunodepletion protocols to reduce non-specific binding which is accomplished by adding

150 mM NaCl [17]

Analysis of bradykinin in the presence of undigested BSA

When using MWCO for peptide isolation proteins are typically present in the samples

usually in larger amounts Figure 3 shows the effect that adding BSA protein to a 10 ng

bradykinin solution before MWCO fractionation has on the resulting recovery of bradykinin

Adding 10 μg of BSA to the optimized 7030 aqueous 1 M NaClmethanol solution slightly

decreased bradykininrsquos signal with a RSD of 10 More severe signal reduction occurred after

adding 100 microg BSA with a RSD of 2 (N=2) It is not unexpected that more signal reduction

due to sample loss would occur especially in the 100 microg BSA sample because the BSA protein

has a molar ratio of 160 BSA protein molecules to one bradykinin peptide Figure 3 shows the

usefulness of the MWCO method with samples containing large amounts of proteins

RecommendationConclusion

The present work provides solutions to reduce sample loss from the use of MWCO for

sub-microg peptide isolation with or without non-digested proteins in the sample Despite its

widespread utility significant sample loss often occurs during the MWCO fractionation step

which is particularly problematic when analyzing low-abundance peptides from limited starting

material This application note aims to reduce sample loss during MWCO separations

specifically for sub-microg peptide isolation If complex samples are processed with MWCO

separation the authors recommend eluting the sample with 6040 aqueous 1 M NaClmethanol

solution as a starting point to minimize sample loss This application note provides a viable

196

alternative for sub-microg peptide MWCO separation circumventing the need to increase the starting

material by minimizing sample loss from using a MWCO membrane-based centrifugal filter

device

References

[1] HM Georgiou GE Rice MS Baker Proteomic analysis of human plasma failure of

centrifugal ultrafiltration to remove albumin and other high molecular weight proteins

Proteomics 2001 1 1503

[2] DW Greening RJ Simpson Low-molecular weight plasma proteome analysis using

centrifugal ultrafiltration Methods Mol Biol 2011 278 109

[3] DW Greening RJ Simpson A centrifugal ultrafiltration strategy for isolating the low-

molecular weight (ltor=25K) component of human plasma proteome J Proteomics 2010 73

637

[4] LL Manza SL Stamer AJ Ham SG Codreanu DC Liebler Sample preparation and

digestion for proteomic analyses using spin filters Proteomics 2005 5 1742

[5] D Theodorescu D Fliser S Wittke H Mischak R Krebs M Walden M Ross E Eltze O

Bettendorf C Wulfing A Semjonow Pilot study of capillary electrophoresis coupled to mass

spectrometry as a tool to define potential prostate cancer biomarkers in urine Electrophoresis

2005 26 2797

[6] K Antwi G Hostetter MJ Demeure BA Katchman GA Decker Y Ruiz TD Sielaff LJ

Koep DF Lake Analysis of the plasma peptidome from pancreas cancer patients connects a

peptide in plasma to overexpression of the parent protein in tumors J Proteome Res 2009 8

4722

[7] LP Aristoteli MP Molloy MS Baker Evaluation of endogenous plasma peptide extraction

methods for mass spectrometric biomarker discovery J Proteome Res 2007 6 571

[8] A Zougman B Pilch A Podtelejnikov M Kiehntopf C Schnabel C Kumar M Mann

Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome Res 2008 7

386

[9] X Yuan DM Desiderio Human cerebrospinal fluid peptidomics J Mass Spectrom 2005 40

176

[10] X Zheng H Baker WS Hancock Analysis of the low molecular weight serum peptidome

using ultrafiltration and a hybrid ion trap-Fourier transform mass spectrometer J Chromatogr A

2006 1120 173

[11] L Li JV Sweedler Peptides in the brain mass spectrometry-based measurement approaches

and challenges Annu Rev Anal Chem 2008 1 451

[12] GB Stefano G Fricchione Y Goumon T Esch Pain immunity opiate and opioid

compounds and health Med Sci Monit 2005 11 MS47

[13] J Jensen Regulatory peptides and control of food intake in non-mammalian vertebrates Comp

Biochem Physiol A Mol Integr Physiol 2001 128 471

197

[14] A Kuoppala KA Lindstedt J Saarinen PT Kovanen JO Kokkonen Inactivation of

bradykinin by angiotensin-converting enzyme and by carboxypeptidase N in human plasma Am

J Physiol Heart Circ Physiol 2000 278 H1069

[15] R Chen M Ma L Hui J Zhang L Li Measurement of neuropeptides in crustacean

hemolymph via MALDI mass spectrometry J Am Soc Mass Spectrom 2009 20 708

[16] H Jahn S Wittke P Zurbig TJ Raedler S Arlt M Kellmann W Mullen M Eichenlaub H

Mischak K Wiedemann Peptide fingerprinting of Alzheimers disease in cerebrospinal fluid

identification and prospective evaluation of new synaptic biomarkers PLoS One 2011 6

e26540

[17] NA Cellar AS Karnoup DR Albers ML Langhorst SA Young Immunodepletion of high

abundance proteins coupled on-line with reversed-phase liquid chromatography a two-

dimensional LC sample enrichment and fractionation technique for mammalian proteomics J

Chromatogr B Analyt Technol Biomed Life Sci 2009 877 79

198

Table 1 Identified BSA tryptic peptides from various MWCO separation conditions

BSA tryptic

peptide (MH+)

100

H2O 1microg

100

1 M NaCl

70

H2O

80

1 M NaCl

70

1 M NaCl

60

H2O

60

1 M NaCl

5083

5453

6894

7124

8985

9275

10345

10725

11385

11636

12496

12837

13057

13997

14157

14197

14398

14636

14798

15026

15118

15328

15547

15677

15768

16399

16678

16738

17248

17408

17477

17497

18809

18890

19019

19079

20450

21139

22479

Total 39 2 2 6 8 15 15 27

199

Figure 1 Representative MALDI mass spectra after MWCO separation of a bradykinin

standard showing improvement over two orders of magnitude in detection limits Each MWCO

separation was performed at minimum in triplicate with representative spectrum selected for

each with a calculated RSD from the peak heights Three different amounts of bradykinin were

tested to assess the magnitude of sample loss under different MWCO solvent conditions The

top panel shows 1 microg of bradykinin standard after MWCO separation with 100 H2O elution

produced no signal The addition of 40 or 30 MeOH produced very low bradykinin signals

for both 100 ng (RSD of 28) and 10 ng (SNlt3 no RSD calculated) respectively In the

bottom two spectra each showed very large intensity but the 7030 aqueous 1 M NaClmethanol

10 ng bradykinin was processed with a 10kDa MWCO and zip-tipped and was reproducible with

200

a RSD of 6 The last spectrum was from 10 ng bradykinin (RSD of 3) which was diluted to

an equivalent volume as all the other experiments and directly spotted onto the MALDI plate

201

Figure 2 Representative MALDI mass spectra from MWCO separation of a BSA tryptic

peptide standard showing sample loss Stacked mass spectra from mz range 875-2150

normalized to 7 x 104 intensity representing the detection difference from a BSA tryptic peptide

standard from different MWCO separation conditions (A) It should be noted that when the

solvent for MWCO elution was 100 H2O 1 microg of BSA tryptic peptides was processed instead

of 500 ng A zoomed in view of the most abundant BSA tryptic peptide detected YLYEIAR

mz 92749 (B) Various percentages of MeOH produced significant signal but addition of a salt

(1 M NaCl) increases the signal which is closest to a directly spotted BSA tryptic peptide

standard A zoomed in view of a representative low intensity BSA tryptic peptide detected

LKECC

DKPLLEK mz 153266 (C) The optimized solution to be used for MWCO filtration

202

6040 aqueous 1 M NaClmethanol was the only procedure that enabled the detection of the

tryptic peptide in Figure 2C which was also detected in the directly spotted BSA tryptic peptide

standard All experiments were performed a minimum of two times with nearly identical results

) Carbamidomethyl amino acid modification

ordm) Tryptic peptide identified in three of the spectra in Figure 2A

dagger) Tryptic peptide identified in two of the spectra in Figure 2A

) Tryptic peptide identified in a single spectrum in Figure 2A

203

Figure 3 Representative MALDI mass spectra after MWCO separation of a bradykinin

standard with a BSA protein present showing optimized solvent conditions minimized samples

losses Each experiment was performed in duplicate Two different amounts of BSA protein

were tested to assess the magnitude of sample loss caused by the presence of a protein The top

panel shows 10 microg of BSA protein and the second panel shows 100 microg of BSA protein added

while only 10 ng of bradykinin was added Detectable sample loss was observed when the BSA

protein was added but panel two shows that the amount of BSA protein was 1 x 104 greater

(equivalent to 160 fold molar excess) than bradykinin The last two spectra were controls using

a MWCO with the optimized solution in panel 3 and panel 4 using 10 ng bradykinin which was

diluted to an equivalent volume as all the other experiments and directly spotted onto the

MALDI plate

204

Supplemental Table 1 Expanded Table 1 including grand average of hydropathicity (GRAVY)

score theoretical pI and the sequence from the underlying amino acid sequence for the peptides

identified in the BSA digest The GRAVY and pI scores were obtained from ExPASy

Bioinformatics and modifications were not taken into consideration

(httpwebexpasyorgprotparam) Peptide assignments to the recorded peaks were done by

BSA

tryptic

peptide

(MH+)

GRAVY

score

Theoretical

pI

Sequence 100

H2O

1microg

100

1 M

NaCl

70

H2O

80

1 M

NaCl

70

1 M

NaCl

60

H2O

60

1 M

NaCl

5083 NA NA FGER

5453 0900 972 VASLR

6894 0267 979 AWSVAR

7124 -0950 647 SEIAHR

8985 0529 674 LcVLHEK

9275 -0071 600 YLYEIAR

10345 -0725 674 NEcFLSHK

10725 -0211 538 SHcIAEVEK

11385 0 599 ccTESLVNR

11636 0130 453 LVNELTEFAK

12496 -1250 545 FKDLGEEHFK

12837 0264 675 HPEYAVSVLLR

13057 -0582 532 HLVDEPQNLIK

13997 0567 437 TVMENFVAFVDK

14157 0567 437 TVmENFVAFVDK

14197 0058 530 SLHTLFGDELcK

14398 -0133 875 RHPEYAVSVLLR

14636 -0515 465 TcVADESHAGcEK

14798 0292 600 LGEYGFQNALIVR

15026 -0625 409 EYEATLEEccAK

15118 0207 597 VPQVSTPTLVEVSR

15328 -0617 617 LKEccDKPLLEK

15547 -0823 441 DDPHAcYSTVFDK

15677 -0085 437 DAFLGSFLYEYSR

15768 -0985 456 LKPDPNTLcDEFK

16399 -0067 875 KVPQVSTPTLVEVSR

16678 0064 437 MPCTEDYLSLILNR

16738 -1723 550 QEPERNEcFLSHK

17248 0064 437 MPcTEDYLSLILNR

17408 0064 437 mPcTEDYLSLILNR

17477 -0914 414 YNGVFQEccQAEDK

17497 -0621 410 EccHGDLLEcADDR

18809 -0537 606 RPcFSALTPDETYVPK

18890 -0567 674 HPYFYAPELLYYANK

19019 -1275 466 NEcFLSHKDDSPDLPK

19079 0044 454 LFTFHADIcTLPDTEK

20450 -0812 839 RHPYFYAPELLYYANK

21139 -0682 480 VHKEccHGDLLEcADDR

22479 -0458 423 EccHGDLLEcADDRADLAK

Total 39 2 2 6 8 15 15 27

205

mass matching with no tandem mass spectrometry performed Lower case amino acids indicate

a modification present in the peptide of carbamidomethyl (c) or oxidation (m)

206

Chapter 8

Conclusions and Future Directions

207

Summary

Comparative shotgun proteomics investigating numerous biological changes in various

species and sample media and peptidomic method development have been reported The

developed comparative shotgun proteomics based on label-free spectral counting with nanoLC

MSMS platform have been employed in mouse CSF rat CSF yeast culture and other biological

specimens Mass spectrometry (MS)-based peptidomic enhancements using ETD and improved

sample preparation methods for molecular weight cut-offs have been reported Together these

studies add to the Li Labs capabilities in comparative shotgun proteomics and expand available

methods for peptidomic research

Immunodepletion of CSF for comparative proteomics

Chapters 3 and 4 used similar methods to generate a list of differentially expressed

proteins providing insight into how Alexander disease (AxD) perturbs the proteome and how the

new prion model rat adapted scrapie (RAS) proteome varies from control rats In the GFAP

overexpressor mouse CSF study in Chapter 3 we have produced a panel of proteins with

significant up or down regulation in the CSF of transgenic GFAP overexpressor mice MS-based

proteomic study of this mouse model for AxD was consistent with the previous studies showing

elevation of GFAP in CSF The development of a modified IgY-14 immunodepletion technique

for low amounts of CSF with recommendations for future antibody depletion techniques to deal

with the unique challenges of mouse CSF was presented Modified proteomics protocols were

employed to profile mouse CSF with biological and technical triplicates addressing the

variability and providing quantitation with dNSAF spectral counting Validation of the data was

performed using both ELISA and RNA microarray data to provide corroboration with the

208

changes in the putative biomarkers This work presents numerous interesting targets for future

study in AxD including CK-MCK-B cathepsin S L1 and B isoforms and contactin-1

Using the protocol developed in Chapter 3 we performed a similar study in RAS CSF

compared to control rat CSF Two differences in sample preparation for the rat CSF compared

to the mouse CSF are noted (1) rat IgY-7 immunodepletion was performed and (2) each rat

CSF sample was collected from one animal due to sufficient volume instead of pooling from

multiple animals for the mouse CSF sample After immunodepletion the CSF samples from

control and RAS (biological N=5 technical replicates N=3) were digested and separated using

one dimensional RP nanoLC separation We were able to identify 167 proteins with redundant

isoforms removed which was derived from total 512 proteins with a 1 FDR from all the CSF

samples Comparative analysis was performed using dNSAF spectral counting and 21 proteins

were significantly changed Our data were consistent with previous prion CSF studies showing

14-3-3 protein increased in CSF of prion infected animals Genomic analysis was also

performed and was used to cross-validate our proteomic data and numerous proteins were found

to be consistent including a novel marker related to prion disease RNAset2 (ribonuclease T2)

In summary this work provides a foundation for investigation of the perturbed proteome of a

new prion model RAS

Yeast Phosphoproteomic Comparison between Starved and Glucose Fed Conditions

This work presented a qualitative comparison of the phosphoproteome between starved

and glucose fed Saccharomyces cerevisiae (bakers yeast) A large scale IMAC enrichment of

yeast cell lysate followed by 2-D separations provided a rich source of phosphopeptides for CID

MSMS and neutral loss triggered ETD analysis using mass spectrometry A specific motif for

PKA was highlighted to show the differences in proteins identified between starved and glucose

209

fed conditions In total 477 unique phosphopeptides were identified with 06 FDR and 669

unique phosphopeptides identified with 54 FDR Phosphosite validation was performed using

a localization algorithm Ascore to provide further confidence on the site-specific

characterization of these phosphopeptides Two proteins Ssd1 and PMA2 emerged as potential

intriguing targets for more in-depth studies on protein phosphorylation involved in glucose

response

Methods for Peptide Sample Preparation and Sequencing

In this study ETD was performed to improve the sequence coverage of endogenous large

neuropeptides and labile tyrosine sulfation CCK-like neuropeptides found in the blue crab

Callinectes sapidus The intact CPRP from Callinectes sapidus was identified and characterized

with 68 sequence coverage by MS for the first time Cation assisted ETD fragmentation using

MgCl2 was also shown to be a powerful tool in de novo sequencing of sulfated neuropeptides

These endeavors into using ETD for certain neuropeptides will assist in future analysis of large

neuropeptides and PTM containing neuropeptides

In addition to ETD sequencing I presented a protocol on improving recovery of minute

quantities of peptides by adding methanol and a salt modifier (NaCl) to molecular weight cut-off

membrane-based centrifugal filters (MWCO) to enrich sub-microgram peptide quantities

Despite its widespread utility significant sample loss often occurs during the MWCO

fractionation step which is particularly problematic when analyzing low-abundance peptides

from limited starting material This work presented a method to reduce sample loss during

MWCO separations specifically for sub-microg peptide isolation Using a neuropeptide standard

bradykinin sample loss was reduced by over two orders of magnitude with and without

210

undigested protein present The presence of bovine serum albumin (BSA) undigested protein

and the bradykinin standard lends evidence that sample loss occurs because of the MWCO and

not the presence of the protein Additionally a BSA tryptic digestion is presented where twenty-

seven tryptic peptides are identified from MALDI mass spectra after enriching with methanol

while only two tryptic peptides are identified after the standard MWCO protocol

Ongoing Projects and Future Directions

CSF Projects

Rat Adapted Scrapie and Time Course Study of Rat CSF

In ongoing experiments from the work described in Chapter 4 related to rat adapted

scrapie (RAS) we are working towards more complete proteome coverage of CSF and a time

course study of RAS After the promising results of the 1-D proteomic comparison between

RAS and control CSF we want to perform IgY immunodepletion on 1 mL of rat CSF followed

by 2-D high pH-low pH RP-RP LC-MSMS IgY immunodepletion is still required and

afterwards approximately 40 microg of low abundance protein would remain Following traditional

urea tryptic digestion (see Appendix 1) and solid phase extraction (SPE) clean-up the sample

would be dried down and reconstituted in mobile phase A (25 mM ammonium formate basic

pH) for high pH RP separation Fractions would be dried and subjected to MS analysis similar to

the work described in Chapter 4 The purpose of this work would be to increase the proteome

coverage from a few hundred to a thousand CSF proteins identified A time course study of RAS

is also desirable to gain insight into disease progression Rats at different stages will be

sacrificed and CSF will be collected The goal is to perform isobaric labeling between the time

courses with spectral counting being an alternative for relative protein expression We will use

the targets identified in Chapter 4 to track certain proteins for time course analysis Overall

211

these future projects will dig deeper into the proteome and how this novel prion model RAS

perturbs the proteins expressed in rats over time

Glycoproteomics and Peptidomic Analysis of CSF Samples from Individuals with

Alzheimerrsquos Disease

Alzheimerrsquos disease is an incurable progressive neurodegenerative disorder which results

in cognitive impairment Our aim is to provide additional biomarkers andor targets for drug

treatment Currently we used 500 microL of human CSF for non-membrane glycoprotein

enrichment (see Appendix 1) followed by 2-D high-pH-low-pH RP-RP amaZon ion trap LC-

MSMS analysis The initial work was a failure due to low amount of signal and significant

sample loss (data not shown) Through a comparison with Dr Xin Weirsquos PhD thesis we

estimated roughly 20 microg of proteins was obtained after glycoprotein enrichment and 1-D analysis

produced over 100 protein identifications (data not shown) but the additional off-line 2-D

separation and sample clean up resulted in low number of protein identifications for each fraction

analyzed by MS (data not shown) We have recently obtained 1 mL of human CSF samples

from healthy controls and individuals suffering from Alzheimerrsquos disease and plan to perform

the same experiments with double the starting amount and reduce the fractions collected from 2-

D separation from 11 to 6 In addition to the glycoproteomics anything not enriched will be

subjected to MWCO separation using the method presented in Chapter 7 The obtained peptide

sample will be subjected to Q-TOF nanoLC MSE data acquisition followed by de novo

sequencing using various programs including PEAKS and Mascot Collectively we feel this

project has great potential to lead to interesting targets and further expand the proteomic

knowledge of Alzheimerrsquos disease

GFAP Knock-in Mouse CSF

212

In a direct follow-up to Chapter 3 we aim to perform comparative proteomics on control

vs glial fibrillary acidic protein (GFAP) knock-in mouse CSF samples The sample preparation

protocol as presented in Chapter 3 will be used For quantitative proteomics we plan on

performing isobaric labeling followed by performing high energy collision induced dissociation

(HCD) on the Orbitrap Elite The MS acquisition will also perform a top ten scan where the top

ten MS peaks will be selected for tandem MS analysis In addition targeted quantitation of

specific proteins using multiple reaction monitoring (MRM) can be performed on potential

biomarkers and identified in this follow-up study and the work presented in Chapter 3 Also any

CSF samples with noticeable blood content cannot be used for the exploratory proteomics

experiments but can potentially be used for the MRM analysis and should be kept for such

experiments in the future

Large Scale Proteomics

Proteomics of Mouse Amniotic Fluid for Lung Maturation

The overall goal of this project is to determine what proteins are present in amniotic fluid

when the embryo is 175 weeks compared to amniotic fluid at 155 weeks The rationale behind

why these two time points matter was investigated through a lung explant culture where amniotic

fluid was added at 155 weeks and 175 weeks Visual maturation of lungs occurred with the

175 week amniotic fluid and lung maturation genes were up regulated in the mRNA in the lung

explant culture when compared to the 155 week amniotic fluid The compound which is

causing the maturation of the lungs is unknown and search for a secreted protein might provide a

clue to this process In order to test this hypothesis we carried out discovery proteomics

experiments The workflow involves tryptic digestion SPE and 1-D low pH nanoLC separation

coupled to an amaZon ETD ion trap for tandem MS analysis The resulting mass spectrometric

213

acquisition identified less than 100 proteins with a 1 FDR (data not published) To address the

lack of depth in the proteome coverage we purchased an IgY immunodepletion column to

remove the most abundant proteins in amniotic fluid similarly to Chapter 3 and 4 but on a larger

scale One abundant protein alpha-fetoprotein is present in amniotic fluid but absent or present

in very low concentration in CSF serum or plasma Alpha-fetoprotein is similar to albumin and

thus we ran amniotic fluid on an IgY immunodepletion column and observed significant

reduction in alpha-fetoprotein amount (data not published) After immunodepletion 2-D high

pH-low pH RP-RP will be performed followed by nanoLCMSMS on the amaZon ETD ion trap

We will perform mass spectrometry experiments using the 155 week amniotic fluid and 175

week amniotic fluid which have average protein contents ~2 mgmL We anticipate a minimum

of tenfold increase in the proteome coverage of amniotic fluid and provide meaningful

hypothesis driven biological experiments from this work

Phosphoproteomics of JNK Activation

c-Jun N-terminal kinase (JNK) is an important cellular mediator of stress activated

signaling Under conditions of oxidative stress JNK is activated resulting in the downstream

phosphorylation of a large number of proteins including c-Jun However the cellular response

to JNK activation is extremely complex and JNK activation can result in extremely different

physiological outcomes For example JNK is at the crossroads of cellular death and survival

and exhibits both pro-death and pro-survival activities These highly disparate outcomes of JNK

activation are highly contextual and depend on the type of stressor and duration of stress In the

brain JNK has been shown to be activated in models of Alzheimerrsquos disease Parkinsonrsquos

disease amyotrophic lateral sclerosis and stroke It is thought that JNK is activated in these

diseases as a result of oxidative stress and it is unknown whether this activation is pro-survival or

214

pro-death To elucidate JNK signaling in the brain we chose to analyze primary cortical

astrocytes under conditions of oxidative stress Since hydrogen peroxide is a physiologically

relevant oxidant in the brain and known to activate JNK we chose to use this to treat astrocytes

and then analyze changes to the phosphoproteome by mass spectrometry By doing this we

hope to elucidate important JNK-dependent targets involved in stress signaling in the brain and

that identifying these targets could lead to the identification of novel disease mechanisms and

potentially new therapeutic targets for neurodegeneration Specifically we plan on performing

stable isotope labeling by amino acids (SILAC) of astrocytes for control hydrogen peroxide

treated and hydrogen peroxide treated with JNK inhibitor The SILAC labeled astrocyte cell

lysate will be digested enriched using Fe-NTA IMAC and separated using 2-D high pH-low pH

RP-RP and infused into the amaZon ETD ion trap The protocol is similar to the yeast

comparative phosphoproteomics from Chapter 5 After analysis we plan on processing the data

using ProteoIQ to identify phosphoproteins with significant changes

Immunoprecipitation Followed by Mass Spectrometry

Stb3 Mass Spectrometry Analysis

Stb3 (Sin3-binding protein) has previously been shown to change location depending on

the presence or absence of glucose (Heideman Lab Dr Michael Conways Thesis) An

immunoprecipitation of Stb3 was performed in parallel to the work described in Chapter 5 Two

separate experiments were performed one with wild type Stb3 and another tagged with myc for

improved protein immunoprecipitation Myc tagging is a polypeptide tag which can be

recognized by an antibody and provide a better enrichment compared to using a Stb3 antibody

alone The myc tagging was done because of the low abundance of Stb3 and the limited amount

of protein that was pulled down by the antibody for Stb3 Immunopreciptation experiments were

215

performed for both starved and glucose fed samples All samples were tryptically digested

followed by low pH RP nanoLCMSMS Both CID and ETD were used for fragmentation

analysis of each sample (n=2) The Stb3 wild type had two peptide hits in one MS run which is

not surprising due to low Stb3 antibody affinity In contrast a significant amount of Stb3 was

pulled down from Myc tagged starved and glucose fed samples For the glucose starved

samples 22 unique peptides from Stb3 were identified whereas glucose fed samples yielded 21

unique Stb3 peptides identified (unpublished data) The identified Stb3 in the myc samples

allowed us to investigate what other proteins were pulled down that are not present in the wild

type samples

From previous work by our collaborator Dr Heideman it had been suggested that Stb3

forms a complex with Sin3 and Rpd3 Our proteomic data shows multiple significant peptide

hits for Sin3 and Rpd3 in myc tagged but not in wild type where Stb3 is only observed once

with a low Mascot score When looking at the unique proteins identified in myc tagged glucose

fed sample but not included in the wild type samples the myc fed sample contained eight unique

ribosomal proteins with numerous unique peptides for each The ribosomal proteins identified in

myc fed sample supports the novel hypothesis that in the glucose fed state the complex of Stb3

Rpd3 and Sin3 interact with ribosomal proteins Other proteins unique to myc tagged glucose

starved samples are glyceraldehyde-3-phosphate dehydrogenase 2 and transcriptional regulatory

protein UME6 Also three proteins were observed in both myc fed and starved but not in the

wild type samples including transposon Ty1-DR1 Gag-Pol polyprotein(YD11B) SWIRM

domain-containing protein (FUN19) and RNA polymerase II degradation factor 1 (DEF1) Our

proteomics data for the Stb3 immunoprecipitation experiments with starved vs glucose fed

216

samples provide exciting evidence to support previous observation made by the Heideman group

and highlight the utility of MS-based approach to deciphering protein-protein interactions

Conclusions

The majority of the work described in this dissertation revolves around sample

preparation for proteomics and peptidomics with a focus on generating biologically testable

hypotheses In the area of neuropeptides CPRP was fully sequenced analytical methods were

transformed for use in an ETD ion trap and sub-microg peptides can now be analyzed by mass

spectrometry after MWCO separation In the field of comparative proteomics comparisons

between mouse CSF in GFAP overexpressors and control or rat adapted scrapie model and

control CSF yeast fed vs glucose starved cell extract have all been presented Collectively this

thesis has developed new techniques for neuropeptide sample preparation and presented

numerous comparative proteomic analyses of various biological samples and how the proteomes

are dynamically perturbed by various treatments and disease conditions

217

Appendix 1

Protocols for sample preparation for mass spectrometry based

proteomics and peptidomics

218

Small Scale Urea Tryptic Digestion (below 20 microg)

1 Concentratedilute sample to 10 μL starting volume

2 All solutions are in a 50mM NH4HCO3 buffer (395mgmL)

3 Add 1 μL of 05M Dithiothreitol (DTT) (77mg100μL) and 8 μL of urea solution

(400mg05mL) to the sample

4 Place sample in 37degC water bath for 45 minutes

5 Add 27μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

6 Quench reaction with 1μL of DTT solution and let sit for 10 minutes

7 Dilute with 70μL of NH4HCO3 solution

8 Use 1μL of trypsin (05μgμL)

9 Allow to digest overnight in 37degC water bath

10 Acidify with 10μL 10 formic acid

11 Perform solid phase extraction using tips dependent of sample amount

a Sub-5μg amounts ndash Millipore Ziptips

b 5-75μg amounts ndash Bond-Elut tips from Agilent (OMIX tips)

12 Dry down in Speedvac as needed for experiment

219

Small Scale ProteaseMAX Tryptic Digestion (below 20 microg)

1 Concentratedilute sample to 10 μL starting volume

2 All solutions are in a 50mM NH4HCO3 buffer (395mgmL)

3 Add 1 μL of 05M Dithiothreitol (DTT) (77mg100μL) and 2 μL of 1 of

ProtesaeMAX (Promega) to the sample

4 Place sample in 37degC water bath for 45 minutes

5 Add 27μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

6 Quench reaction with 1μL of DTT solution and let sit for 10 minutes

7 Dilute with 70μL of NH4HCO3 solution

8 Use 1μL of trypsin (05μgμL)

9 Add 1 μL ProteaseMAX and let sit for 3-4 hours

10 Acidify with 2μL 10 formic acid

11 Dry down in Speedvac as needed for experiment

220

Large Scale Urea Tryptic Digestion (mg of proteins)

1 All solutions are in a 50 mM NH4HCO3 buffer (395 mgmL)

2 Add 20μL of 05M Dithiothreitol (DTT) (77mg100μL) and 160μL of urea solution

(400mg05mL) to sample

3 Allow sample to denature 45-60 minutes in a 37degC water bath

4 Add 54μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

5 Quench reaction with 20μL of DTT solution

6 Dilute with 14mL of NH4HCO3 solution

7 Add 100μg of trypsin

8 Allow to digest overnight in 37degC water bath

9 Acidify sample with 100μL of 10 formic acid

10 Perform solid phase extraction with Hypersep C18 Cartridges with 100 mg of C18

bead volume (Thermo)

11 Dry down with the Speedvac as needed for experiment

221

Fe-NTA Preparation from Ni-NTA Beads

1 Centrifuge (5 min at 140 rcf) and use a magnet to keep beads in place as supernatant

is removed

2 Wash 3 times with 800μL of H2O (using same technique of centrifuging and using

magnet to keep beads in places as supernatant is removed)

3 Add 800μL of 100mM EDTA (372mgmL) in a 50mM NH4HCO3 (395mgmL)

buffered solution (pH around 8) and vortex for 30 minutes to chelate the Ni

centrifuge and remove supernatant

4 Wash 3 times with 800μL of H2O

5 Add 800μL of 10mM FeCl3 hexahydrate (27mgmL) and vortex for 30 minutes to

bind Fe to beads centrifuge and remove supernatant

6 Wash 3 times with 800μL H2O

7 Resuspend in a storage solution of 111 ACNMeOH001 Acetic Acid (1mL total)

222

Fe-NTA IMAC Phospho-enrichment

1 Use wash solution (80ACN 01Formic Acid) to rinse beads vortex 1 minute

centrifuge and remove supernatant

2 Add sample (around 500μL in 80ACN 01Formic Acid) and vortex 40 minutes to

allow sample to bind dispose of supernatant after centrifuging

3 Wash 3 times with 200μL of wash solution discard supernatant

4 Add 200μL of elution solution (5050 ACN5Ammonium Hydroxide) vortex 15

minutes and save supernatant

5 Add 200μL of elution solution vortex 10 minutes and save supernatant

6 Wash 2 time with wash solution (collect supernatant of first wash)

7 If reusing store in 1mL solution of 111 ACNMeOH001 Acetic Acid

223

High pH Off-line Separation

1) In general a minimum of 20 microg of peptides are needed to gain any benefit

from off-line 2D fractionation It is better to inject 100 microg of peptides on

column

2) Use a Gemini column or a similar column that can handle pH=10 and for this

protocol a 21 mm x 150 mm column was used

3) Prepare ldquobuffer Ardquo 25 mM NH4 formate pH=10 Also prepare ldquobuffer Brdquo

4) Dry down desired sample and reconstitute in buffer A

5) Inject based off sample loop (current Alliance HPLC has a 50 microL sample

loop)

6) Run gradient at bottom of the page collecting fractions every 3 minutes except

for the 1st minute which is the void volume

7) Optional If you want to reduce instrument time you can combine fractions 1

with 12 2 with 13 etc until 11 with 22

Time Mobile phase A Mobile phase B Flow Rate

05mlmin

0 98 2 05 mLmin

65rsquo 70 30 05 mLmin

65rsquo1rdquo 5 95 05 mLmin

70 5 95 05 mLmin

224

Non Membrane Glycoprotein Enrichment

1 For protocols involving membrane glycoprotein enrichment see Dr Xin Weirsquos

thesis

2 Add 75 microL of Wheat germ agglutinin (WGA) bound to agarose beads and 150 microL

of Concanavalin A (ConA) to a Handee spin cup (Thermo) Wash 2 times with

lectin affinity chromatography (LAC) binding buffer (015 M NaCl 002 M Tris-

HCl 1 mM CaCl2 pH=74) centrifuging at 400 g for 30 seconds

3 Place stopper on bottom of spin cup and add 200 microg of sample (500 microL for CSF)

Bring up to 300 microL using lectin LAC binding buffer

4 Incubate for 1 hour with continuous mixing at room temperature

5 Centrifuge at 400 g for 30 seconds

6 Perform two more 300 microL LAC binding washes followed by centrifugation

7 Add 300 microL LAC eluting buffer (0075 M NaCl 001 M Tris-HCl 02 M alpha-

methyl mannoside 02 M alpha-methyl glucoside and 05 M acetyl-D-

glucosamine) vortex for 10 minutes (have stopper in place while vortexing)

centrifuge and collect

7 Add another 300 microL LAC eluting buffer centrifuge and collect

225

MWCO separation for Sub-microg peptide concentrations

1 Wash molecular weight cut-off (MWCO) with 5050 waterMeOH centrifuge at

14000 g for 5 minutes for 10 kDa filter (time will vary Millipore Amicon Ultra

filters)

2 Wash with 100 water centrifuge at 14000 g for 5 minutes

3 Add methanol to the sample to get the concentration to 30 methanol and add

salt to achieve a roughly 05 M NaCl before adding the sample to the MWCO

4 Centrifuge at 14000 for 10 minutes collect flow through

226

Immunoprecipitation

Modified from Thermo Fisher Scientific Classic IP Kit

1 10 microL Polyclonal antibody added to 1 microg protein standard in a Handee spin cup

(Thermo) diluted with 1x TBS buffer to 400 microL and incubated overnight at on

shakerend-over-end rotator

2 Wash Protein GA beads with 2x 200microL TBS buffer and added to the

antibodysample for 15 hours at 4oC

3 Centrifuge at 400 g for 30 seconds and discard flow through

4 Wash 3x with 100microL TBS buffer centrifuge at 400 g for 30 seconds and discard

flow through

5 Wash with 1x conditioning solution (neutral pH buffer) centrifuge at 400 g for 30

seconds and discard flow through

6 Eluted with elution buffer (2x100microL) centrifuge at 400 g for 30 seconds and

collect flow through

227

C18 Solid Phase Extraction (SPE)

1 Determine amount of peptides for SPE ge5 microg use Ziptips from Millipore If

between 5-75 microg use OMIX from Agilent 100 microL version and if ge75 microg use SPE

cartridges such as 100 mg Hypersep from Thermo

2 Ensure no detergents are in the sample and it is acidified

3 The three SPE procedures all use the same sets of solutions only volumes vary

4 3x Wetting solution (100 ACN) for 60 microL OMIX (20 microL ziptip and 15 mL for

100 mg cartridge)

5 3x Wash solution (01 formic acid in 100 H2O) same volumes as 4

6 Draw up the sample about 10 times minimum (generally 30-40 is preferred)

without letting the bead volume dry out

7 1X Wash solution same volumes as 4

8 Back load the eluting solution 01 formic acid in 5050 H2OACN into the

Ziptips (15 microL) or OMIX (50 microL) tips For the SPE cartridge add 700 microL of

eluting solution

9 Dry down and prepare for next step in sample preparation

228

Laser Puller Programs and Protocols

1) Obtain about two feet of Fused-silica capillary with 75 μm id and 360 μm od

2) Wash with methanol and then air dry using the bomb

3) Cut into one foot or desired length

4) Use a lighter to burn the middle portion (about an inch in length) of the tubing

5) Remove the ash by rinsing with methanol and rubbing with a Kimwipe

6) Make sure the laser puller has been on for at least 30 minutes before use to allow

for the instrument to warm up

7) Place capillary in instrument with the burnedexposed portion in the center

making sure that the length of the capillary is pulled taut

8) Enter desired program (next page) and press pull

229

Laser Puller Programs

Program 99 (default lab program)

Heat Filament Velocity Delay Pull

250 0 25 150 15

240 0 25 150 15

235 0 25 150 15

245 0 25 150 15

Program 97 (developed for larger inner diameter tips)

Heat Filament Velocity Delay Pull

230 - 25 150 -

220 - 25 150 -

215 - 25 150 8

230

On column Immunodepletion (serum plasma CSF amniotic fluid)

1) Prepare buffer A ldquoDilution Bufferrdquo 10 mM Tris-HCl pH=74 150 mM NaCl

2) Prepare buffer B ldquoStripping Bufferrdquo 01 M Glycine-HCl pH=25

3) Prepare buffer C ldquoNeutralization Bufferrdquo 100 mM Tris-HCl pH=80

4) Determine loading amount (LC1=~1000 ug of serum or plasma less for CSF due

to the increased amount of albumin percentage in CSF)

5) Add Dilution buffer to sample before injection and ensure the sample is proper

pH (~7)

6) Use gradient below

Time A B C Flow Rate

(mLmin)

0rsquo 100 0 0 02

4rsquo59rdquo 100 0 0 02

5rsquo 100 0 0 05

8rsquo59rdquo 100 0 0 05

9rsquo 0 100 0 05

22rsquo 0 100 0 05

22rsquo1rdquo 0 0 100 05

39rsquo 0 0 100 05

7) Store the column in 1x dilution buffer until next use

231

Small Scale Immunodepletion (100 microL of CSF)

1) Use 75 microg (per 100 microL CSF) of Sigma Seppro IgY-14 or IgY-7 bead slurry

2) Add an equal volume of 2x dilution buffer (20 mM Tris-HCl pH=74 300 mM

NaCl) to the starting amount of CSF

3) Add to a spin cup with a filter and allow to mix for 30 minutes

4) Centrifuge at 400 g for 30 seconds and collect the flow through

5) Add 50 microL of 1x dilution buffer mix for 5 minutes centrifuge at 400 g for 30

seconds and collect the flow through

6) Use 1x stripping washes (01 M Glycine-HCl pH=25) centrifuge as before and

discard Repeat four times

7) Use 1x neutralization buffer (100 mM Tris-HCl pH=80) centrifuge as before

and discard Repeat two times

8) Store the beads in the spin column in 1x dilution buffer until next use

232

Alliance Maintenance Protocol

Seal Wash

10 methanol no acetonitrile This wash cleans behind the pump-head seals to

ensure proper lubrication Minimum once per week

1 On instrument interface navigate MenuStatus gt Diag gt Prime SealWsh gt Start

2 Press Stop after 5 minutes

Prime Injector

10 methanol for maintenance high organic solvent for dirty runs (eg 95

acetonitrile) Done before injecting any real samples to ensure no bubbles are

present in the injector Minimum once per week

1 On instrument interface navigate MenuStatus gt Diag gt Prime NdlWsh gt Start

2 After completion press Close

Purge Injector

Solvent is dependent on run Run this protocol at beginning of experiments each day

Minimum once per week for maintenance

1 On instrument interface navigate MenuStatus gt Status screen

2 Navigate Direct Function gt 4 Purge Injector gt OK

3 Set Sample loop volumes 60 leave Compression Check unchecked gt OK

Prime Solvent Pumps

Solvent is dependent on run If solvents are changed run this protocol Minimum

once per week for maintenance

1 On instrument interface navigate MenuStatus gt Status screen

2 Using arrow keys choose composition A type 100 Enter x4

3 Navigate Direct Function gt 3 Wet Prime gt OK

4 Set Flow Rate 7000 mLmin Time 100 min gt OK

5 Repeat for all changedactive solvent pumps

Condition Column

Dependent on user Use starting conditions for run

1 On instrument interface navigate MenuStatus gt Status screen

2 Using arrow keys type starting solvent compositions for run

3 Navigate Direct Function gt 6 Condition Column gt OK

4 Set Time as desired

233

Appendix 2

List of Publications and Presentations

234

PUBLICATIONS

ldquoDiscovery and characterization of the crustacean hyperglycemic hormone precursor related

peptides (CPRP) and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes

sapidus using multiple tandem mass spectrometry techniquesrdquo Hui L Cunningham R Zhang

Z Cao W Jia C Li L J Proteome Res 2011 10(9) 4219-29

ldquoInvestigation and reduction of sub-microgram peptide loss using molecular weight cut-off

fractionation prior to mass spectrometric analysisrdquo Cunningham R Wang J Wellner D Li L

Journal of Mass Spectrometry In Press

ldquoMass spectrometry-based proteomics and peptidomics for systems biology and biomarker

discoveryrdquo Cunningham R Ma D Li L Frontiers in Biology 2012 7(4) 313-35

ldquoProtein changes in immunodepleted cerebrospinal fluid from transgenic mouse models of

Alexander disease detected using mass spectrometryrdquo Cunningham R Jany P Messing A Li

L Journal of Proteome Research Submitted

ldquoInvestigation of the differences in the phosphoproteome between starved vs glucose fed

Saccharomyces cerevisiaerdquo Cunningham R Grunwald D Wellner D Conway M Heideman

W Li L In preparation

ldquoIdentification of astrocytic JNK targets by phosphoproteomics using mass spectrometryrdquo

Cunningham R Dowell J Wang J Wellner D Li L Milhelm M In preparation

ldquoGenomic and proteomic profiling of rat adapted scrapierdquo Herbst A Cunningham R Wellner

D Wang J Ma D Li L Aiken J In preparation

235

INVITED SEMINARS AND CONFERENCE PRESENTATIONS

Robert Cunningham Davenne Mayour and Shoshanna Coon ldquoMorphology and Thermal

Stability of Monolayers on Porous Siliconrdquo The 231th

ACS National Meeting 2006 Atlanta

GA

Robert Cunningham Xin Wei Paige Jany Albee Messing and Lingjun Li ldquoMass

Spectrometry-Based Analysis of Cerebrospinal Fluid Peptidome and Proteome for Biomarker

Discovery in Alexander Diseaserdquo The 57th

ASMS Conference 2009 Philadelphia PA

Robert Cunningham ldquoWhat to Expect in a PhD Graduate Schoolrdquo Invited seminar University

of Northern Iowa 2010 Cedar Falls IA

Robert Cunningham Dustin Frost Albee Messing and Lingjun Li ldquoInvestigation of an

Optimized ProteaseMAX Assisted Trypsin Digestion of Human CSF for Pseudo-SRM

Quantification of GFAP and Identification of Biomarkersrdquo The 58th

ASMS Conference 2010

Salt Lake City UT

Robert Cunningham Paige Jany Albee Messing and Lingjun Li ldquoMass Spectrometry-Based

Analysis of GFAP Overexpressor Micersquos Cerebrospinal Fluid for Protein Biomarker Discovery

in Alexander Diseaserdquo Oral presentation at Pittcon Conference and Expo 2011 2011 Atlanta

GA

Robert Cunningham Michael Conway Daniel Wellner Douglas Grunwald Warren

Heideman and Lingjun Li ldquoDevelopment of optimized phosphopeptide enrichment methods for

comparison of starved and glucose fed yeast Saccharomyces cerevisiaerdquo The 59th

ASMS

Conference 2011 Denver CO

Robert Cunningham Paige Jany Albee Messing and Lingjun Li ldquoMass Spectrometry-Based

Analysis of GFAP Overexpressor Micersquos Cerebrospinal Fluid for Protein Biomarker Discovery

in Alexander Diseaserdquo 2011 Wisconsin Human Proteomics Symposium 2011 Madison WI

Page 3: Mass Spectrometry Applications for Comparative Proteomics

ii Greer Chris Lietz Chenxi Jia Dustin Frost Di Ma Hui (Vivian) Ye Nicole Woodards

and Claire Schmerberg for their collaboration in many challenging research projects and

fruitful discussions on various research areas There are too many people to thank each

one individually but every member of the Li lab has in some way contributed to my

learning experience Beyond research work their friendship also made my life here in

Madison much more enjoyable

I would also like to thank our collaborators Dr Albee Messing Dr Warren

Heideman Dr Xin Sun and Dr James Dowell It is my great pleasure to have the

opportunities to work with these amazing people and gain precious experience I have

learned so much from them and their achievements in the field have inspired me to strive

to do the best I could

Furthermore I would like to thank Gary Girdaukas and Dr Cameron Scarlett at

School of Pharmacy for the access of the MALDI-FTMS and Bruker amaZon ion trap

instruments

In particular I wish to thank my family my Mom and Step-Dad for raising me

and my Dad for always being there for me They all supported me in my decision to

pursue science and specifically a career in chemistry I would like to thank my Sister

who grew up with me and always led by example in academics Most importantly I

would like to thank my wife Na Liu for her constant support She has inspired and

helped me finish my PhD and always encouraged me to be the best I could be To them

I dedicate this thesis

iii

Table of Contents

Page

________________________________________________________________________

Acknowledgements i

Table of Contents iii

Abstract iv

Chapter 1 Introduction brief background and research summary 1

Chapter 2 Mass spectrometry-based proteomics and peptidomics for

biomarker discovery and the current state of the field 15

Chapter 3 Protein changes in immunodepleted cerebrospinal fluid from

transgenic mouse models of Alexander disease detected

using mass spectrometry 73

Chapter 4 Genomic and proteomic profiling of rat adapted scrapie 110

Chapter 5 Investigation of the differences in the phosphoproteome

between starved vs glucose fed Saccharomyces cerevisiae 139

Chapter 6 Use of electron transfer dissociation for neuropeptide

sequencing and identification 166

Chapter 7 Investigation and reduction of sub-microgram peptide loss

using molecular weight cut-off fractionation prior to

mass spectrometric analysis 187

Chapter 8 Conclusions and future directions 206

Appendix 1 Protocols for sample preparation for mass spectrometry

based proteomics and peptidomics 217

Appendix 2 Publications and presentations 233

_______________________________________________________________________

iv

Mass Spectrometry Applications for Comparative Proteomics and

Peptidomic Discovery

Robert Stewart Cunningham

Under the supervision of Professor Lingjun Li

At the University of Wisconsin-Madison

Abstract

In this thesis multiple biological samples from various diseases models or

treatments are investigated using shotgun proteomics and improved methods are

developed to enable extended characterization and detection of neuropeptides In general

this thesis aims to expand upon the rapidly evolving field of mass spectrometry (MS)-

based proteomics and peptidomics by primarily enhancing small scale sample analysis

A review of the current status and progress in the field of biomarker discovery in

peptidomics and proteomics is presented To this rapidly expanding body of literature

our critical review offers new insights into MS-based biomarker studies investigating

numerous biological samples methods for post-translational modifications quantitative

proteomics and biomarker validation Methods are developed and presented including

immunodepletion for small volume cerebrospinal fluid (CSF) samples for comparison of

the CSF proteomes between an Alexander disease transgenic mouse model with

overexpression of the glial fibrillary acidic protein and a control animal This thesis also

covers the application of the small scale immunodepletion of CSF for comparative

proteomic analysis of a novel rat adapted scrapie (RAS) model for prion disease and

v

compares the RAS CSF proteome to control rat CSF using MS Large scale

phosphoproteomics of starved vs glucose fed yeast is presented to better understand the

phosphoproteome changes that occur during glucose feeding Method development for

neuropeptide analysis is expanded upon using electron transfer dissociation (ETD)

fragmentation to successfully sequence for the first time the crustacean hyperglycemic

hormone precursor-related peptide (CPRP) from the blue crab Callinectes sapidus In

addition a method for ETD sequencing of sulfonated neuropeptides using a magnesium

salt adduct in an ion trap mass spectrometer is reported This thesis also reports on a

method for sub-microg peptide isolation when using a molecular weight cut-off filtration

device to improve sample recovery by over 2 orders of magnitude All the protocols used

throughout the work are provided in an easy to use step-by-step format in the Appendix

Collectively this body of work extends the capabilities of mass spectrometry as a

bioanalytical tool for shotgun proteomics and expands upon methods for neuropeptide

discovery and analysis

1

Chapter 1

Introduction Brief Background and Research Summary

2

Abstract

Mass spectrometry based comparative proteomics and improved methods for

neuropeptide discovery have been reported in this thesis A very brief introduction of crustacean

neuropeptidomics is covered in this chapter followed by Chapter 2 which covers in great detail

comparative proteomics using mass spectrometry with an emphasis on biomarker discovery

Chapter 3 reports a novel immunodepletion technique used for comparative proteomics between

glial fibrillary acidic protein (GFAP) overexpressor and control mouse cerebrospinal fluid (CSF)

Chapter 4 involves rat CSF comparative proteomics between rat adapted scrapie and control

animals Chapter 5 details comparisons of phosphoproteomes in Saccharomyces cerevisiae

(Bakerrsquos yeast) between starved vs glucose fed conditions Chapter 6 investigates the use of

electron transfer dissociation (ETD) for kDa sized neuropeptides and neuropeptides with tyrosine

sulfonation Chapter 7 presents a molecular weight cut-off (MWCO) protocol to allow sub-microg

peptides to be recovered Chapter 8 provides a conclusion to this thesis and outlines future

directions for certain projects

3

Background

Mass spectrometry (MS) requires gas phase ions for experimental measurement and

intact large nonvolatile biomolecules proved difficult to be analyzed by electron ionization or

chemical ionization until the invention of two soft ionization techniques matrix-assisted laser

desorptionionization (MALDI)1 and electrospray ionization (ESI)

2 ESI and MALDI are the

two most common soft ionization techniques for mass spectrometry Once ionized molecules

such as peptides or proteins can be separated by their mass to charge ratios (mz) using various

mass analyzers manipulating electric andor magnetic fields ESI and MALDI mass

spectrometric techniques have become central analytical methods in biological sciences because

they are suitable for nonvolatile thermally labile analytes in femtomole quantities3 ESI allows

the coupling of high pressure liquid chromatography and the constant flow of solvent is

electrically charged at the outlet to produce a Taylor cone4 During desolvation the Raleigh

limit is reached and a coulombic explosion occurs commonly producing multiply charged ions

A modification of ESI is nanoLC-ESI which operates at nLmin flow rates to analyze low sample

amounts such as proteins or peptides in biological samples5 NanoLC-ESI uses less solvent as

the source because of the reduced flow rate of standard LCMS with an ESI source and nanoLC-

ESI is the predominate inlet for shotgun proteomic ESI MS experiments6 Conversely MALDI

can use sub-microL volume of sample and be co-crystallized with an ultraviolet absorbing organic

matrix that is obliterated using a laser at appropriate wavelength to generate gas phase ions

Alternatively MALDI has the unique capability to work with tissue samples and ionize in the

solid state instead of liquid like ESI

4

Mass analyzers require an operating pressure between 10-4

-10-10

Torr to allow proper ion

transfer and separationisolation of the ion based upon its mz Numerous mass analyzers are

currently available and each have their own strengths and weaknesses as shown in Figure 1 The

biomolecules are separated by the mass analyzers and detected without fragmentation which is

termed MS analysis Following MS detection tandem mass spectrometry (MSMS) of the

original precursor ion can be performed to provide additional structural information such as a

ladder sequence of amino acids for peptides Numerous fragmentation techniques are available

for proteomics such as collision induced dissociation (CID) electron transfer dissociation (ETD)

or high energy collision induced dissociation (HCD) Each of these fragmentation techniques

have their own benefits in proteomics experiments and are discussed in depth in Chapter 2 The

background and current status for comparative proteomics with specific emphasis on biomarker

analysis are covered in Chapter 2

Neuropeptidomic Method Development in the Crustacean Model System

Utilizing Mass Spectrometry

Chapter 6 and 7 focus on sample preparation and ETD fragmentation techniques to

characterize neuropeptides in the neuroendocrine organs in the crustacean nervous system

Neuropeptides are short chains of amino acids which are a diverse class of cell-cell signaling

molecules in the nervous system Neuropeptides have been investigated for being involved in

numerous physiological processes such as memory7 learning

8 depression

9 pain

10 reward

11

reproduction12

sleep-wake cycles13

homeostasis14

and feeding15-17

Figure 2 depicts how

neuropeptides are synthesized as pre-prohormones in the rough endoplasmic reticulum and

5

packaged in the Golgi apparatus After being packaged these pre-prohormones are processed

into bioactive peptides within the vesicle which is occurring during vesicular transport down an

axon and released into the synaptic cleft Secreted neuropeptides stimulate the post synaptic

neurons by interacting with G-protein coupled receptors at the chemical synapse

The crustacean model nervous system is well-defined neural network which has been

used in neurobiology and neuroendocrine (neurosecretory) research and thus lends itself for

studying neuromodulation18-22

Figure 3 shows the locations of several neuroendocrine organs in

the crustacean Callinectes sapidus including the sinus gland (SG) which is studied in Chapter 6

The Lingjun Li lab has focused on neuropeptide discovery and function in crustacean

neuroendocrine organs using mass spectrometry23-25

The work presented in Chapters 6 and 7

expand on sample preparation and analytical tools to further investigate the neuropeptidome

Research Overview

Comparative Proteomics of Biological Samples

Chapter 2 provides a thorough review of comparative proteomics and biomarker analysis

using mass spectrometry The scientific community has shown great interest in the field of mass

spectrometry-based proteomics and peptidomics for its applications in biology Proteomics

technologies have evolved to generate large datasets of proteins or peptides involved in various

biological and disease progression processes producing testable hypotheses for complex

biological questions This chapter provides an introduction and insight into relevant topics in

proteomics and peptidomics including biological material selection sample preparation

separation techniques peptide fragmentation post-translational modifications quantification

6

bioinformatics and biomarker discovery and validation In addition current literature and

remaining challenges and emerging technologies for proteomics and peptidomics are discussed

Chapter 3 investigates comparative proteomics between a GFAP overexpressor mouse

model and a control mouses cerebrospinal fluid (CSF) CSF is a low protein content biological

fluid with dynamic range spanning at least nine orders of magnitude in protein content and is in

direct contact with the brain but consist of very abundant proteins similar to serum which require

removal A modified IgY-14 immunodepletion treatment is presented to remove abundant

proteins such as albumin to enhance analysis of the low volumes of CSF that are obtainable

from mice These GFAP overexpressor mice are models for Alexander disease (AxD) a severe

leukodystrophy in humans From the CSF of control and transgenic mice we present the

identification of 266 proteins with relative quantification of 106 proteins Biological and

technical triplicates are performed to address animal variability as well as reproducibility in mass

spectrometric analysis Relative quantitation is performed using distributive normalized spectral

abundance factor (dNSAF) spectral counting analysis A panel of biomarker proteins with

significant changes in the CSF of GFAP transgenic mice are identified with validation from

ELISA and microarray data demonstrating the utility of our methodology and providing

interesting targets for future investigations on the molecular and pathological aspects of AxD

Chapter 4 explores the CSF proteomics of a novel prion model called rat adapted scrapie

(RAS) A similar IgY immunodepletion technique is presented as in Chapter 3 and is similarly

used to reduce the abundant proteins in CSF CSF from control and RAS (biological N=5

technical replicates N=3) were digested and separated using one dimensional reversed-phase

nanoLC separation In total 512 proteins 167 non-redundant protein groups and 1032 unique

peptides are identified with a 1 FDR Comparative analysis was done using dNSAF spectral

7

counting and 21 proteins were significantly up or down-regulated The proteins are compared to

the 1048 differentially regulated genes and additionally compared to previously published

proteins showing changes consistent with other prion animal models Of particular interest is

RAS specific proteins like RNAseT2 which is not up-regulated in mouse prion disease but is

designated as upregulated in both the genomic and proteomics data for RAS

Chapter 5 explores comparative proteomics between starved and glucose fed

Saccharomyces cerevisiae (bakers yeast) Yeast depends on nutrients such as glucose to

survive and need to cycle between states of growth and quiescence Previous work by the

Heideman lab investigated the transcriptional response to fresh glucose in yeast26

Kinases such

as protein kinase A (PKA) and target of rapamycin kinase (TOR) are involved in the glucose

response so we described a large scale phosphoproteomic MS based study in this chapter

Yeast cell extract was digested and phosphopeptides were enriched by immobilized metal

affinity chromatography (IMAC) followed by two dimensional high pH-low pH reversed phase

(RP)-RP separation The low pH separation was infused directly into an ion trap mass

spectrometer with neutral loss triggered ETDfragmentation Specifically the CID fragmentation

can cause a neutral loss of 98z from the H3PO4 and can produce an incomplete fragmentation

pattern and lead to ambiguous identification so when the neutral loss is detected ETD MSMS

fragmentation is performed The neutral loss triggered ETD fragmentation is included in this

study to improve phosphopeptide identifications In total 477 phosphopeptides are identified

with 06 FDR and 669 phosphopeptides identified with 54 FDR Motif analysis of PKA and

phosphosite validation are performed as well

8

The future of comparative proteomics investigating small sample amounts or PTMs is

promising Further advances in enrichment separations science mass spectrometry

analyzersdetectors and bioinformatics will continue to create more powerful tools that enable

digging deeper in proteomics with both small (mouse CSF) and large (cell extracts) sample

amounts

Methods for Neuropeptide Analysis Using ETD fragmentation and Sample

Preparation

Chapter 6 investigates the utility of ETD fragmentation to sequence endogenous large

neuropeptides and labile tyrosine sulfation in the crustacean Callinectes sapidus The sinus

gland (SG) of the crustacean is a well-defined neuroendocrine site that produces numerous

hemolymph-borne agents including the most complex class of endocrine signaling moleculesmdash

neuropeptides One such peptide is the crustacean hyperglycemic hormone (CHH) precursor-

related peptide (CPRPs) which was not previously sequenced Electron transfer dissociation

(ETD) is used for sequencing of intact CPRPs due to their large size and high charge state In

addition ETD is used to sequence a sulfated peptide cholecystokinin (CCK)-like peptide in the

lobster Homarus americanus using a salt adduct Collectively this chapter presents two

examples of the utility of ETD in sequencing neuropeptides with large molecular weight or with

labile modifications

Chapter 7 reports a protocol on improving recovery of minute quantities of peptides by

adding methanol and a salt modifier (NaCl) to molecular weight cut-off membrane-based

centrifugal filters (MWCO) to enrich sub-microgram peptide quantities In some biological

9

fluids such as CSF the endogenous peptide content is very low and using pure water to perform

the MWCO separation produces too much sample loss Using a neuropeptide standard

bradykinin sample loss is reduced over two orders of magnitude with and without undigested

protein present The presence of bovine serum albumin (BSA) undigested protein and the

bradykinin standard lends evidence that sample loss occurs because of the MWCO and not the

presence of the protein Additionally a BSA tryptic digestion is presented where twenty-seven

tryptic peptides are identified from MALDI mass spectra after enriching with methanol while

only two tryptic peptides are identified after the standard MWCO protocol The strategy

presented in Chapter 7 greatly enhances recovery from MWCO separation for sub-microg peptide

samples

10

References

1 Tanaka K Waki H Ido Y Akita S Yoshida Y Yoshida T Matsuo T Protein and polymer analyses up to mz 100 000 by laser ionization time-of-flight mass spectrometry Rapid Communications in Mass Spectrometry 1988 2 (8) 151-153

2 Fenn J B Mann M Meng C K Wong S F Whitehouse C M Electrospray ionization for mass spectrometry of large biomolecules Science 1989 246 (4926) 64-71

3 Domon B Aebersold R Mass spectrometry and protein analysis Science 2006 312 (5771) 212-7

4 Whitehouse C M Dreyer R N Yamashita M Fenn J B Electrospray interface for liquid chromatographs and mass spectrometers Anal Chem 1985 57 (3) 675-9

5 Wilm M Mann M Analytical properties of the nanoelectrospray ion source Anal Chem 1996 68 (1) 1-8

6 Karas M Bahr U Dulcks T Nano-electrospray ionization mass spectrometry addressing analytical problems beyond routine Fresenius J Anal Chem 2000 366 (6-7) 669-76

7 Gulpinar M Yegen B The physiology of learning and memory Role of peptides and stress Curr Protein Pept Sci 2004 5 (6) 457-473

8 Ogren S O Kuteeva E Elvander-Tottie E Hokfelt T Neuropeptides in learning and memory processes with focus on galanin In Eur J Pharmacol 2010 Vol 626 pp 9-17

9 Strand F Neuropeptides general characteristics and neuropharmaceutical potential in treating CNS disorders Prog Drug Res 2003 61 1-37

10 Li W Chang M Peng Y-L Gao Y-H Zhang J-N Han R-W Wang R Neuropeptide S produces antinociceptive effects at the supraspinal level in mice Regul Pept 2009 156 (1-3) 90-95

11 Wu C-H Tao P-L Huang E Y-K Distribution of neuropeptide FF (NPFF) receptors in correlation with morphine-induced reward in the rat brain Peptides 2010 31 (7) 1374-1382

12 Tsutsui K Bentley G E Kriegsfeld L J Osugi T Seong J Y Vaudry H Discovery and evolutionary history of gonadotrophin-inhibitory hormone and kisspeptin new key neuropeptides controlling reproduction J Neuroendocrinol 2010 22 (7) 716-727

13 Piper D C Upton N Smith M I Hunter A J The novel brain neuropeptide orexin-A modulates the sleep-wake cycle of rats Eur J Neurosci 2000 12 (2) 726-730

14 Tsuneki H Wada T Sasaoka T Role of orexin in the regulation of glucose homeostasis - Tsuneki - 2009 - Acta Physiologica - Wiley Online Library Acta Physiol 2010

15 Kageyama H Takenoya F Shiba K Shioda S Neuronal circuits involving ghrelin in the hypothalamus-mediated regulation of feeding Neuropeptides 2010 44 (2) 133-138

16 Sweedler J V Li L Rubakhin S S Alexeeva V Dembrow N C Dowling O Jing J Weiss K R Vilim F S Identification and characterization of the feeding circuit-activating peptides a novel neuropeptide family of aplysia J Neurosci 2002 22 (17) 7797-7808

11

17 Jensen J Regulatory peptides and control of food intake in non-mammalian vertebrates Comp Biochem Physiol Part A Mol Integr Physiol 2001 128 (3) 469-477

18 Billimoria C P Li L Marder E Profiling of neuropeptides released at the stomatogastric ganglion of the crab Cancer borealis with mass spectrometry J Neurochem 2005 95 (1) 191-199

19 Cape S S Rehm K J Ma M Marder E Li L Mass spectral comparison of the neuropeptide complement of the stomatogastric ganglion and brain in the adult and embryonic lobster Homarus americanus J Neurochem 2008 105 (3) 690-702

20 Cruz Bermuacutedez N D Fu Q Kutz Naber K K Christie A E Li L Marder E Mass

spectrometric characterization and physiological actions of GAHKNYLRFamide a novel FMRFamide‐like peptide from crabs of the genus Cancer J Neurochem 2006 97 (3) 784-799

21 Grashow R Brookings T Marder E Reliable neuromodulation from circuits with variable underlying structure Proc Natl Acad Sci USA 2009 106 (28) 11742-11746

22 Ma M Szabo T M Jia C Marder E Li L Mass spectrometric characterization and physiological actions of novel crustacean C-type allatostatins Peptides 2009 30 (9) 1660-1668

23 Chen R Hui L Cape S S Wang J Li L Comparative Neuropeptidomic Analysis of Food Intake via a Multi-faceted Mass Spectrometric Approach ACS Chem Neurosci 2010 1 (3) 204-214

24 Li L Sweedler J V Peptides in the brain mass spectrometry-based measurement approaches and challenges Annu Rev Anal Chem 2008 1 451-483

25 Ma M Wang J Chen R Li L Expanding the crustacean neuropeptidome using a multifaceted mass spectrometric approach J Proteome Res 2009 8 (5) 2426-2437

26 Slattery M G Heideman W Coordinated regulation of growth genes in Saccharomyces cerevisiae Cell Cycle 2007 6 (10) 1210-9

12

Figure 1 Capabilities and performances of certain mass analyzers Check marks indicate

availability check marks in parentheses indicate optional + ++ and +++ indicate possible or

moderate goodhigh and excellentvery high respectively Adapted with permission from

reference 3

13

Figure 2 Synthesis processing and release of neuropeptides (a) Cartoon showing two

interacting neurons (b) The synthesis of neuropeptides in the neuronal organelles and their

transport down the axon to the presynaptic terminal is depicted (c) Shows neuropeptide release

and interaction with the dendrite of the post-synaptic cell (Adapted from PhD thesis by Dr

Stephanie Cape)

14

Figure 3 Schematic showing location of the neuronal tissues being analyzed in the MS studies

of neuropeptides in Callinectes sapidus The SGs (sinus glands located in the eyestalks of the

crab) and the POs (pericardial organs located in the chamber surrounding the heart) release

neurohormones into the hemolymph (crab circulatory fluid) The brain is near the STNS

(stomatogastric nervous system neural network that controls the motion of the gut and foregut)

which has direct connections to the STG (stomatogastric ganglion) The STG is located in an

artery so it is continually in contact with hemolymph (Adapted from PhD thesis by Dr Robert

Sturm)

15

Chapter 2

Mass Spectrometry-based Proteomics and Peptidomics for Biomarker

Discovery and the Current State of the Field

Adapted from ldquoMass spectrometry-based proteomics and peptidomics for systems biology and

biomarker discoveryrdquo Cunningham R Ma D Li L Frontiers in Biology 2012 7(4) 313-35

16

Abstract

The scientific community has shown great interest in the field of mass spectrometry-based

proteomics and peptidomics for its applications in biology Proteomics technologies have

evolved to produce large datasets of proteins or peptides involved in various biological and

disease progression processes producing testable hypothesis for complex biological questions

This review provides an introduction and insight to relevant topics in proteomics and

peptidomics including biological material selection sample preparation separation techniques

peptide fragmentation post-translation modifications quantification bioinformatics and

biomarker discovery and validation In addition current literature and remaining challenges and

emerging technologies for proteomics and peptidomics are presented

17

Introduction

The field of proteomics has seen a huge expansion in the last two decades Multiple factors have

contributed to the rapid expansion of this field including the ever evolving mass spectrometry

instrumentation new sample preparation methods genomic sequencing of numerous model

organisms allowing database searching of proteomes improved quantitation capabilities and

availability of bioinformatic tools The ability to investigate the proteomes of numerous

biological samples and the ability to generate future hypothesis driven experiments makes

proteomics and biomarker studies exceedingly popular in biological studies today In addition

the advances in post-translational modification (PTM) analysis and quantification ability further

enhance the utility of mass spectrometry (MS)-based proteomics A subset of proteomics

research is devoted to profiling and quantifying neurologically related proteins and endogenous

peptides which has progressed rapidly in the past decade This review provides a general

overview as outlined in Figure 1 of proteomics technology including methodological and

conceptual improvements with a focus on recent studies and neurological biomarker studies

Biological Material Selection

The choice of biological matrix is an important first step in any proteomics analysis The

ease of sample collection (eg urine plasma or saliva) versus usefulness or localization of

sample (eg specific tissue or proximity fluid) needs to be evaluated early on in a study design

Plasma derived by centrifugation of blood to remove whole cells is a very popular

choice in proteomics due to the high protein content (~65 mg ml1) and the ubiquitous nature of

blood in the body and the ability to obtain large sample amounts or various time points without

the need to sacrifice the animal or to perform invasive techniques Plasma is centrifuged

18

immediately after sample collection unlike serum where coagulation needs to occur first To

obtain plasma blood is collected in a tube with an anticoagulant added (ETDA heparin or

citrate) and centrifuged but previous reports have shown variable results when heparin has been

used as an anticoagulant2 Human Proteome Organization (HUPO) specifically recommends the

anticoagulants EDTA or citrate to treat plasma3 4

One of the primary concerns with plasma is

degradation of the protein content via endogenous proteases found in the sample5 One way to

address this problem is the use of protease inhibitors In addition freezethaw cycles need to be

minimized to prevent protein degradation and variability6 7

Plasma proteomics has seen

extensive coordinated efforts to start assessing the diagnostic needs using plasma8 HUPO also

has established a public human database for plasma and serum proteomics from 35 collaborating

labratories9 Large dynamic range studies have been performed on plasma with a starting sample

amount of 2625 microl (1575 mg) resulting in 3654 proteins identified with a sub 5 false

discovery rate10

The large dynamic range spanning across eleven orders of magnitude as visualized in

Figure 2 is one of the biggest obstacles in plasma proteomics Figure 2 also shows that as lower

abundance proteins are investigated the origins of those identified proteins are more diverse than

the most abundant proteins Recent mining of the plasma proteome showed an ability to search

for disease biomarker applications across seven orders of magnitude In addition the tissue of

origin for the identified plasma proteins were identified and its origin was more diverse as the

protein concentration decreased11

Plasma has been used as a source for biomarker studies such

as colorectal cancer12 13

cardiovascular disease14

and abdominal aortic aneurysm15

Even

though the blood brain barrier prevents direct blood to brain interaction neurological disorders

such as Alzheimerrsquos disease (AD) have had their proteomes studied using plasma16

19

An alternative sample derived from blood is serum which is plasma allowed to coagulate

instead of adding anti-coagulates The time for coagulation is usually 30 minutes and during that

time significant and random degradation from endogenous proteases can occur The additional

variability caused from the coagulation process can change the concentration of multiple

potentially valuable biomarkers As biodiversity between samples or organisms is a challenging

endeavor additional sample variability due to serum generation may be undesirable but serum is

still currently being used for biomarker disease studies17

Serum has been used to compare the

proteome differences in neurological diseases such as AD Parkinsonrsquos disease and amyotrophic

lateral sclerosis and a review can be found elsewhere discussing the subject18

Cerebrospinal fluid (CSF) has a long history as a surrogate biopsy of brain or spinal cord

in evaluating diseases of the central nervous system and has been used for studies in neurological

disorders due to being a rich source of neuro-related proteins and peptides19

The protein

composition of the most abundant proteins in CSF is well defined and numerous studies exist to

broaden the proteins identified20-22

CSF has an exceedingly low protein content (~04 μgμL)

which is ~100 times lower than serum or plasma and over 60 of the total protein content in

CSF consists of a single protein albumin23-25

In addition the variable concentrations of proteins

span up to twelve orders of magnitude further complicating analysis and masking biologically

relevant proteins to any given study26

One of the highest number of identified proteins is from

Schutzer et al with 2630 non-redundant proteins from 14 mL of pooled human CSF This study

involved the removal of highly abundant proteins by performing IgY-14 immunodepletion

followed by two dimensional (2D) liquid chromatography (LC) separation27

Studies have also

been performed to characterize individual biomarkers or complex patterns of biomarkers in

various diseases in the CSF28 29

One potential pitfall of CSF proteomic analysis is

20

contamination from blood which can be identified by counting red blood cells present or

examining surrogate markers from blood contamination other than hemoglobin such as

peroxiredoxin catalase and carbonic anhydrase30

A proof of principle CSF peptidomics study

identified numerous endogenous peptides associated with the central nervous system which can

be used as a bank for neurological disorder studies31

Numerous recent reports highlighted the

utility of CSF analysis for biomarker studies in AD32 33

medulloblastoma34

both post-mortem

and ante-mortem35

Cellular lysates offer the distinct advantage to work with a cell line yeast or bacteria

with large amounts of proteins available for analysis36 37

with Saccharomyces cerevisiae being

the most common cell lysate38 39

Other cell lines are also used including HeLa40

and E coli41

The ability to obtain milligrams of proteins easily to scale up experiments without animal

sacrifice offers a clear advantage in biological sample selection Current literature supports

cellular lysate as a valued and sought after source of proteins for large scale proteomics

experiments because of the ability to assess treatments conditions and testable hypotheses42-44

Cellular lysate from rat B104 neuroblastoma cell line was used as an in vitro model for cerebral

ischemia and showed abundance changes in multiple proteins involved in various neurological

disorders45

Other Sources of Biological Samples

Urine

The urine proteome appears to be another attractive reservoir for biomarker discovery

due to the relatively low complexity compared with the plasma proteome and the noninvasive

collection of urine Urine is often considered as an ideal source to identify biomarkers for renal

21

diseases due to the fact that in healthy adults approximately 70 of the urine proteome originate

from the kidney and the urinary tract 46

thus the use of urine to identify neurological disorders is

neglected However strong evidence have shown that proteins that are associated with

neurodegenerative diseases can be excreted in the urine47-49

indicating the application of urine

proteomics could be a useful approach to the discovery of biomarkers and development of

diagnostic assays for neurodegenerative diseases However the current view of urine proteome

is still limited by factors such as sample preparation techniques and sensitivity of the mass

spectrometers There has been a tremendous drive to increase the coverage of urine proteome

In a recent study Court et al compared and evaluated several different sample preparation

methods with the objective of developing a standardized robust and scalable protocol that could

be used in biomarkers development by shotgun proteomics50

In another study Marimuthu et al

reported the largest catalog of proteins in urine identified in a single study to date The

proteomic analysis of urine samples pooled from healthy individuals was conducted by using

high-resolution Fourier transform mass spectrometry A total of 1823 proteins were identified

of which 671 proteins have not been previously reported in urine 51

Saliva

For diagnosis purposes saliva collection has the advantage of being an easy and non-

invasive technique The recent studies on saliva proteins that are critically involved in AD and

Parkinsonrsquos diseases suggested that saliva could be a potentially important sample source to

identify biomarkers for neurodegenerative diseases Bermejo-Pareja et al reported the level of

salivary Aβ42 in patients with mild AD was noticeably increased compared to a group of

controls 52

In another study Devic et al identified two of the most important Parkinsons

22

disease related proteinsmdash α-synuclein (α-Syn) and DJ-1 in human saliva 53

They observed that

salivary α-Syn levels tended to decrease while DJ-1 levels tended to increase in Parkinsons

disease The published results from this study also suggest that α-Syn might correlate with the

severity of motor symptoms in Parkinsons disease Due in part to recent advancements in MS-

based proteomics has provided promising results in utilizing saliva to explore biomarkers for

both local and systemic diseases 54 55

the further profiling of saliva proteome will provide

valuable biomarker discovery source for neurodegenerative diseases

Tissue

Compared to body fluids such as plasma serum and urine where the proteomic analysis

is complicated by the wide dynamic range of protein concentration the analysis of tissue

homogenates using the well-established and conventional proteomic analysis techniques has the

advantage of reduced dynamic range However the homogenization and extraction process may

suffer from the caveat that spatial information is lost which would be inadequate for the

detection of biomarkers whose localization and distribution play important roles in disease

development and progression Matrix-assisted laser desorptionionization (MALDI) imaging

mass spectrometry (IMS) is a method that allows the investigation of a wide range of molecules

including proteins peptides lipids drugs and metabolites directly in thin slices of tissue 56-59

Because this technology allows for identification and simultaneous localization of biomolecules

of interests in tissue sections linking the spatial expression of molecules to histopathology

MALDI-IMS has been utilized as a powerful tool for the discovery of new cancer biomarker

candidates as well as other clinical applications60 61

The utilization of MALDI-IMS for human

or animal brain tissue to identify or map the distribution of molecules related to

neurodegenerative diseases were also recently reported62 63

23

Secretome

There has been an increasing interest in the study of proteins secreted by various cells

(the secretomes) from tissue-proximal fluids or conditioned media as a potential source of

biomarkers Cell secretomes mainly comprise proteins that are secreted or are shed from the cell

surface and these proteins can play important role in both physiological processes (eg cell

signaling communication and migration) and pathological processes including tumor

angiogenesis differentiation invasion and metastasis In particular the study of cancer cell

secretomes by MS based proteomics has offered new opportunities for cancer biomarker

discovery as tumor proteins may be secreted or shed into the bloodstream and could be used as

noninvasive biomarkers The latest advances and challenges of sample preparation sample

concentration and separation techniques used specifically for secretome analysis and its clinical

applications in the discovery of disease specific biomarkers have been comprehensively

reviewed64 65

Here we only highlight the proteomic profiling of neural cells secretome that has

been applied to neurosciences for a better understanding of the roles secreted proteins play in

response to brain injury and neurological diseases The LC-MS shotgun identification of

proteins released by astrocytes has been recently reported66-68

In these studies the changes

observed in the astrocyte secretomes induced by inflammatory cytokines or cholinergic

stimulation were investigated6667

Alternatively our group performed 2D-LC separation and

included cytoplasmic protein extract from astrocytes as a control to identify cytoplasmic protein

contaminants which are not actively secreted from cells68

Sample Preparation

24

Proteomic analysis and biomarker discovery research in biological samples such as body

fluids tissues and cells are often hampered by the vast complexity and large dynamic range of

the proteins Because disease identifying biomarkers are more likely to be low-abundance

proteins it is imperative to remove the high-abundance proteins or apply enrichment techniques

to allow detection and better coverage of the low-abundance proteins for MS analysis Several

strategies including depletion and protein equalizer approach have been used during sample

preparation to reduce sample complexity69 70

and the latest advances of these methods have been

reviewed by Selvaraju et al 71

Alternatively the complexity of biological samples can be

reduced by capturing a specific subproteome that may have the biological information of interest

The latter strategy is especially useful in the biomarker discovery where the changes in the

proteome are not solely reflected through the concentration level of specific proteins but also

through changes in the post-translational modifications (PTMs) Here we will mainly discuss

the enrichment of phosphoproteinpeptides glycoproteinpeptides and sample preparation for

peptidomics and membrane proteins

Phosphoproteomics

Phosphorylation can act as a molecular switch on a protein by turning it on or off within

the cell It is thought that up to 30 of the proteins can be phosphorylated72

and it plays

significant roles in such biological processes as the cell cycle and signal transduction73

Currently tens of thousands of phosphorylation sites can be proposed using analytical methods

available today74 75

The amino acids that are targeted for phosphorylation studies are serine

threonine and tyrosine with the abundance of detection decreasing typically in that order Other

25

amino acids have been reported to be phosphorylated but traditional phosphoproteomics

experiments ignore these rare events76

In a typical large-scale phosphoproteomics experiment the sample size is usually in

milligram amounts to account for the low stoichiometry of phosphorylated proteins The large

amount of protein is then digested typically with trypsin but alternatively experiments have

been performed with Lys-C digestion to produce large enzymatic peptides The larger peptides

produced from Lys-C render higher charged peptides during electrospray ionization (ESI) and

allow improved electron-based fragmentation to determine specific sites of phosphorylation77

From the pool of peptides phosphopeptides must be enriched otherwise they will be masked by

the vast number and higher ionization efficiency of non-phosphorylated peptides The two most

common enrichment techniques are immobilized metal ion affinity chromatography (IMAC) and

metal oxide affinity chromatography (MOAC) TiO2 being the most common oxide used for this

purpose A recent study reported that phosphorylation of neuronal intermediate filament proteins

in neurofibrillary tangles are involved in Alzheimerrsquos disease78

Glycoproteomics

Protein glycosylation is one of the most common and complicated forms of PTM Types

of protein glycosylation in eukaryotes are categorized as either N-linked where glycans are

attached to asparagine residues in a consensus sequence N-X-ST (X can be any amino acid

except proline) via an N-acetylglucosamine (N-GlcNAc) residue or the O-glycosylation where

the glycans are attached to serine or threonine Glycosylation plays a fundamental role in

numerous biological processes and aberrant alterations in protein glycosylation are associated

with neurodegenerative disease states such as Creutzfeld-Jakob Disease (CJD) and AD79 80

26

Due to the low abundance of glycosylated forms of proteins compared to non-glycosylated

proteins it is essential to enrich glycoproteins or glycopeptides in complex biological samples

prior to MS analysis Two of the most common enrichment methods used in glycoproteomics are

lectin affinity chromatography (LAC) and hydrazide chemistry The detailed methodologies of

LAC hydrazide chemistry and other enrichment methods in glycoproteomics have been

extensively reviewed in the past81 82

In particular LAC is of great interest in studies of

glycosylation alterations as markers of AD and other neurodegenerative diseases due to its recent

applications in brain glycoproteomics83

Our group has utilized multi-lectin affinity

chromatography containing concanavalin A (ConA) and wheat germ agglutinin (WGA) to enrich

N-linked glycoproteins in control and prion-infected mouse plasma84

This method enabled us to

identify a low-abundance glycoprotein serum amyloid P-component (SAP) PNGase F digestion

and Western blotting validation confirmed that the glycosylated form of SAP was significantly

elevated in mice with early prion infection and it could be potentially used as a diagnostic

biomarker for prion diseases

Membrane proteins

Membrane proteins play an indispensable role in maintaining cellular integrity of their

structure and perform many important functions including signaling transduction intercellular

communication vesicle trafficking ion transport and protein translocationintegration85

However due to being relatively insoluble in water and low abundance it is challenging to

analyze membrane proteins by traditional MS-based proteomics approaches Numerous efforts

have been made to improve the solubility and enrichment of membrane proteins during sample

preparation Several comprehensive studies recently covered the commonly used technologies in

27

membrane proteomics and different strategies that circumvent technical issues specific to the

membrane 86-90

Recently Sun et al reported using 1-butyl-3-methyl imidazolium

tetrafluoroborate (BMIM BF4) an ionic liquid (IL) as a sample preparation buffer for the

analysis of integral membrane proteins (IMPs) by microcolumn reversed phase liquid

chromatography (μRPLC)-electrospray ionization tandem mass spectrometry (ESI-MSMS)

The authors compared BMIM BF4 to the other commonly used solvents such as sodium dodecyl

sulfate methanol Rapigest and urea but they found that the number of identified IMPs from rat

brain extracted by ILs was significantly increased The improved identifications could be due to

the fact that BMIM BF4 has higher thermal stability and thus offered higher solubilizing ability

for IMPs which provided better compatibility for tryptic digestion than traditionally used solvent

systems38

In addition to characterization of membrane proteome the investigation of PTMs on

membrane proteins is equally important for characterization of disease markers and drug

treatment targets Phosphorylations and glycosylations are the two most important PTMs for

membrane proteins In many membrane protein receptors the cytoplasmic domains can be

phosphorylated reversibly and function as signal transducers whereas the receptor activities of

the extracellular domains are mediated via N-linked glycosylation Wiśniewski et al provides an

informative summary on recent advances in proteomic technology for the identification and

characterization of these modifications91

Our group has pioneered the development of detergent

assisted lectin affinity chromatography (DALAC) for the enrichment of hydrophobic

glycoproteins using mouse brain extract92

We compared the binding efficiency of lectin affinity

chromatography in the presence of four commonly used detergents and determined that under

certain concentrations detergents can minimize the nonspecific bindings and facilitate the

elution of hydrophobic glycoproteins In summary NP-40 was suggested as the most suitable

28

detergent for DALAC due to the higher membrane protein recovery glycoprotein recovery and

membranous glycoprotein identifications compared to other detergents tested In a different

study on mouse brain membrane proteome Zhang et al reported an optimized protocol using

electrostatic repulsion hydrophilic interaction chromatography (ERLIC) for the simultaneous

enrichment of glyco- and phosphopeptides from mouse brain membrane protein preparation93

Using this protocol they successfully identified 544 unique glycoproteins and 922 glycosylation

sites which were significantly higher than those using the hydrazide chemistry method

Additionally a total of 383 phosphoproteins and 915 phosphorylation sites were identified

suggesting that the ERLIC separation has the potential for simultaneous analysis of both glyco-

and phosphoproteomes

Peptidomics

Peptidomics can be loosely defined as the study of the low molecular weight fraction of

proteins encompassing biologically active endogenous peptides protein fragments from

endogenous protein degradation products or other small proteins such as cytokines and signaling

peptides Studies can involve endogenous peptides94

peptidomic profiling33

and de novo

sequencing of peptides95 96

Neuropeptidomics focuses on biologically active short segments of

peptides and have been investigated in numerous species including Rattus97 98

Mus musculus99

100 Bovine taurus

101 Japanese quail diencephalon

102 and invertebrates

103-106 The isolation of

peptides is typically performed through molecular weight cut-offs from either biofluids such as

CSF plasma or tissue extracts If the protein and peptide content is high such as for tissue or cell

lysates protein precipitation can be done via high organic solvents and the resulting supernatant

can be analyzed for extracted peptides where extraction solvent and conditions could have a

29

significant effect on what endogenous peptides are extracted from tissue107

A comparative

peptidomic study of human cell lines highlights the utility of finding peptide signatures as

potential biomarkers108

A thorough review of endogenous peptides and neuropeptides is beyond

the scope of this review and an excellent review on this topic is available elsewhere109

Fractionation and Separation

The mass spectrometer has a limited duty cycle and data dependent analysis can only

scan a limited number of mz peaks at any given time In addition significant ion suppression

can occur if there is a difference in concentration between co-eluting peptides or if too many

peptides co-elute Therefore one of the biggest challenges in biomarker discovery is the

complexity of the sample and the presence of high-abundance proteins in body fluids such as

CSF serum and plasma In addition to the removal of the most abundant proteins by

immunodepletion the reduction of the complexity of the sample by further fractionation is

indispensable to facilitate the characterization of unidentified biomarkers from the low

abundance proteins Traditionally used techniques for complex protein analysis include gel

based fractionation methods such as two-dimensional gel electrophoresis (2D-GE) and its

variation two-dimensional differential gel electrophoresis (2D-DIGE) or non-gel based such as

one- or multidimensional liquid chromatography (LC) and microscale separation techniques

such as capillary electrophoresis (CE)

2D-GE MS has been widely used as a powerful tool to separate proteins and identify

differentially expressed proteins ever since 2D gels were coupled to mass spectrometry In 2D-

GE MS thousands of proteins can be separated on a single gel according to pI and molecular

weight Individual protein spots that show differences in abundance between different samples

30

can then be excised from the gel digested into peptides and analyzed by MALDI MS or by

liquid chromatography tandem mass spectrometry (LC-MSMS) for protein identification The

introduction of 2D-DIGE adds a quantitative strategy to gel electrophoresis by enabling multiple

protein extracts to be separated on the same 2D gel thus providing comparative analysis of

proteomes in complex samples In 2D-DIGE protein extracts from two different conditions and

an internal standard can be labeled with fluorescent dyes for example Cy3 Cy5 and Cy2

respectively prior to two-dimensional gel electrophoresis Compared to traditional 2D-GE 2D-

DIGE provides the clear advantage of overcoming the inter-gel variation problem 110

Proteomic

profiling of CSF by 2D-GE and 2D-DIGE has led to the identification of putative biomarkers in

multiple neurological disorders For example Brechlin et al reported an optimized 2-DIGE

protocol profiled CSF from 36 CJD patients The applicability of their approach was proven by

the detection of known CJD biomarkers such as 14-3-3 protein neuron-specific enolase lactate

dehydrogenase and other proteins that are potentially relevant to CJD 111

In another study to

identify novel CSF biomarkers for multiple sclerosis CSF from 112 multiple sclerosis patients

and control individuals were analyzed by 2D-GE MS for comparative proteomics Ten potential

multiple sclerosis biomarkers were selected for validation by immunoassay 112

These

methodologies sample preparation techniques and applications of 2D-DIGE in

neuroproteomics were reviewed by Diez et al113

Although 2D gel provides excellent resolving

power and capability to visualize abundance changes there are some limitations to the method

For example gel based separation is not suitable for low abundance proteins extremely basic or

acidic proteins very small or large proteins and hydrophobic proteins114 115

Complementary to gel-based approaches shotgun proteomics coupled to LC have

become increasingly popular in proteomic research because they are reproducible highly

31

automated and capable of detecting low abundance proteins Furthermore another advantage of

LC-MS shotgun proteomics is the suitability for isotope labeling for protein quantification which

is reviewed in a later section In shotgun proteomics a protein mixture is digested and resulting

peptides are separated by LC prior to tandem MS fragmentation to identify different proteins by

peptide sequencing The most common separation for shotgun proteomics peptidomics or top-

down proteomics experiments use low-pH reversed phase (RP) C18 C8 or C4 columns RPLC

is well established which provides high resolution desalts the sample which can interfere with

ionization and the mobile phase is compatible with ESI Nanoscale C18 columns allow for

separation and introduction of sub microgram samples If larger amounts of sample are

available two dimensional separations are usually preferred to greatly enhance the coverage of

the investigated proteome which will be discussed in depth later It is preferable to have an

orthogonal separation method and since RP separates via hydrophobicity strong cation exchange

(SCX) was the original choice due to its separation by charge MudPIT (multidimensional

protein identification technology) usually refers to the use of SCX as the first phase of separation

and is a well-established platform116

SCX has the advantage over RP separation technologies to

effectively remove interfering detergents from the sample SCX separation is not based solely

off charge and hydrophobicity contributes to elution therefore a small amount of organic

modifier usually 10-15 is added to lessen the hydrophobicity effects117

The addition of

organic modifiers needs to be minimized otherwise binding to the C18 trap cartridge or C18

column will be reduced if performed on-line SCX can be used for PTMs and offers specific

applications for proteomic studies and an excellent current review is offered on this subject

elsewhere118

An alternative MudPIT separation scheme employing high pH RPLC as the first

phase of separation and low pH RPLC in the second dimension (RP-RP) has been successfully

32

applied to the proteomic analysis of complex biological samples119 120

The advantage of using

RP as the first dimension is the higher resolution for separation and better compatibility with

down-stream MS detection by eliminating salt Song et al reported a phosphoproteome analysis

based on this 2D RP-RP coupling scheme121

Hydrophilic interaction chromatography (HILIC) employs distinct separation modality

where the retention of peptides is increased with increasing polarity122

The loading of sample is

done by high organic and eluted by increasing the percentage of the aqueous phase or polarity of

the mobile phase opposite from RPLC thus establishing orthogonality of the two separation

modes123

HILIC has quickly become a very useful method and is actively used for proteomic

experiments124

for increased sensitivity125

phosphoproteomics126

glycoproteins127

and

quantification studies128

An alternative and modification to HILIC is ERLIC which adds an

additional mode of separation by electrostatic attraction An earlier study using ERLIC

demonstrated the ability to separate phosphopeptides from non-phosphorylated peptides at

pH=2129

A recent study looking into changes in the phosphoproteome of Marekrsquos Disease

applied ERLIC to chicken embryonic fibroblast lysate identifying only 13 phosphopeptides

out of all the identified peptides Due to the lack of isolation of phosphopeptides from ERLIC

the investigators performed immobilized metal affinity chromatography (IMAC) enrichment on

the fractions increasing identification of phosphopeptides over 50 fold130

A comparative study

of ERLIC to HILIC and SCX following TiO2 phospho-enrichment reported that

SCXgtERLICgtHILIC for phosphopeptide identifications126

Recent developments in instrumentation to combine LC with ion mobility spectrometry

(IMS) and MS (LC-IMS-MS) offered more advantages than conventional LC due to the rapid

high-resolution separations of analytes based on their charge mass and shape as reflected by

33

mobility in a given buffer gas The mobility of an ion in a buffer gas is determined by the ionrsquos

charge and its collision cross-section with the buffer gas The methodologies of IMS separations

and the application of LC-IMS-MS for the proteomics analysis of complex systems including

human plasma have been reviewed by Clemmerrsquos group131-133

They proposed a method that

employs intrinsic amino acid size parameters to obtain ion mobility predictions which can be

used to rank candidate peptide ion assignments and significantly improve peptide identification

134

Although 2D gel and LC are routinely used as separation techniques in MS-based

proteomics capillary electrophoresis (CE) has received increasing attention as a promising

alternative due to the fast and high-resolution separation it offers CE has a wide variety of

operation modes among which capillary zone electrophoresis (CZE) and capillary isoelectric

focusing (CIEF) have the greatest potential applications in MS-based proteomics thus will be

highlighted here CZE separates analytes by their charge-to-size ratios in buffers under a high

electrical field and is often used as the final dimension prior to MS analysis while the separation

feature of CIEF is based on isoelectric point and this technique is more suitable to be used as the

first dimension separation Detailed description of different CEndashMS interfaces sample

preconcentration and capillary coating to minimize analyte adsorption could be found in several

reviews135-141

CE technique is complementary to conventional LC in that it is suitable for the

analysis of polar and chargeable compounds Dovichirsquos group conducted proteomic analysis of

the secreted protein fraction of Mycobacterium marinum which has intermediate protein

complexity142

The tryptic digests were either analyzed by UPLC-ESI-MSMS in triplicates or

prefactionated by RPLC followed by CZE-ESI-MSMS It was demonstrated that the two

methods identified similar numbers of peptides and proteins within similar analysis times

34

However CZE-ESI-MSMS analysis of the prefractionated sample tended to identify more

peptides that are basic and have lower mz values than those identified by UPLC-ESI-MSMS

This analysis also presented the largest number of protein identifications by using CE-MSMS

suggesting the effectiveness of prefractionation of complex samples by LC method prior to CZE-

ESI-MSMS The use of CIEF as the first dimension of separation provides both sample

concentration and excellent resolving power The combination of CIEF and RPLC separation

has been applied to the proteomic analyses where the amount of protein sample is limited and

cannot meet the requirement of minimal load amount for 2D LC-MSMS143 144

So far CE-MS

has been widely applied to the proteomic analysis of various biological samples such as urine145

146 CSF

147 blood

148 frozen tissues

149 and the formalin-fixed and paraffin-embedded (FFPE)

tissue samples150

The recent CEndashMS applications to clinical proteomics have been summarized

in several reviews135 151 152

Protein Quantification

In 2D gel electrophoresis the quantitative analysis of protein mixtures is performed on

the gel by comparing the intensity of the protein stain The development of 2D-DIGE eliminated

the gel-to-gel variation and greatly improved the quantitative capability and reliability of 2D gel

methodology110

However the accuracy of 2D gel based protein quantification suffers from the

limitations that a seemingly single gel spot often contains multiple proteins and the difficulty of

detecting proteins with extreme molecular weights and pI values as well as highly hydrophobic

proteins such as membrane proteins Therefore non-gel based shotgun proteomics technology is

more suitable for accurate and large-scale protein identification and quantification in complex

samples Briefly the quantification in non-gel based shotgun proteomics can be categorized into

35

two major approaches stable isotope labeling-based and label-free methods The common

strategies for quantitative proteomic analysis are reviewed and summarized in Table 1

Isotope labeling methods

Because stable isotope-labeled peptides have the same chemical properties as their

unlabeled counterparts the two peptides within a mixture should exhibit identical behaviors in

MS ionization The mass difference introduced by isotope labeling enables the detection of a

pair of two distinct peptide masses by MS within the mixture and allowing for the measurement

of the relative abundance differences between two peptides Depending on how isotopes are

incorporated into the protein or peptide these labeling methods can be divided into two groups

In vitro chemical derivatization techniques which incorporate a label or tag into the peptide or

protein during sample preparation metabolic labeling techniques which introduce the isotope

label directly into the organism via isotope-enriched nutrients from food or media

1 In vitro derivatization techniques

There are multiple methods to introduce heavy isotopes into proteins or peptides in vitro

The commonly used strategies include 18

O 16

O enzymatic labeling Isotope-Coded Affinity Tag

(ICAT) Tandem Mass Tags (TMTs) and Isobaric Tags for Relative and Absolute Quantification

(iTRAQ) The 18

O labeling method enzymatically cleaves the peptide bond with trypsin in the

presence of 18

O-enriched H2O and introduces 4-Da mass shift in the tryptic peptides153

The

advantages of this method include 18

O-enriched water is extremely stable tryptic peptides will

be labeled with the same mass shift secondary reactions inherent to other chemical labeling can

be avoided Conversely widespread use of 18

O-labeling has been hindered due to the difficulty

of attaining complete 18

O incorporation and the lack of robustness154 155

Currently ICAT

36

TMTs and iTRAQ methods are extensively used in quantitative proteomics In ICAT cysteine

residues are specifically derivatized with a reagent containing either zero or eight deuterium

atoms as well as a biotin group for affinity purification of cysteine-containing peptides156 157

The advantage of ICAT is that the affinity purification via biotin moiety can facilitate the

detection of low-abundance cysteine-containing peptides In addition the mass difference

introduced by labeling increases mass spectral complexity with quantification from the different

precursor masses done by MS and peptide identification being achieved through tandem MS

(MSMS) This added complexity from different peptide masses was addressed by using isobaric

labeling methods such as TMTs and iTRAQ 158 159

where the same peptides in different samples

are isobaric after tagging and appear as single mz in MS scans thus enhancing the peptide limit

of detection and reducing the MS scan complexity Isobaric labeling reagents are composed of a

primary amine reactive group and an isotopic reporter group linked by an isotopic balancer group

for the normalization of the total mass of the tags The reporter group serves for quantification

purpose since it is cleaved during collision-induced dissociation (CID) to yield a characteristic

isotope-encoded fragment Moreover isobaric labeling methods allow the comparison of

multiple samples within a single experiment Recently a 6-plex version of TMTs was

reported160

and iTRAQ enables up to eight samples to be labeled and relatively quantified in a

single experiment161

8-plex iTRAQ reagents have been used for the comparison of complicated

biological samples such as CSF in the studies of neurodegenerative diseases 162

Recently our

group developed a novel N N-dimethyl leucine (DiLeu) 4-plex isobaric tandem mass (MS2)

tagging reagents with high quantitation efficacy DiLeu has the advantage of synthetic simplicity

and greatly reduced synthesis cost compared to TMTs and iTRAQ163

Xiang et al demonstrated

that DiLeu produced comparable iTRAQ ability for protein sequence coverage (~43) and

37

quantitation accuracy (lt15) for tryptically digested proteins More importantly DiLeu

reagents could promote enhanced fragmentation of labeled peptides thus allowing more

confident peptide and protein identifications

2 In Vivo Metabolic Labeling

Metabolic processes can also be employed for the incorporation of stable-isotope labels

into the proteins or organisms by enriching culture media or food with light or heavy versions of

isotope labels (2H

13C

15N) The advantage of in vivo labeling is that metabolic labeling does

not suffer from incomplete labeling which is an inherent drawback for in vitro derivatization

techniques In addition metabolic labeling occurs from the start of the experiment and proteins

with light or heavy labels are simultaneously extracted thus reducing the error and variability of

quantification introduced during sample preparation The most widely used strategy for

metabolic labeling is known as stable-isotope labeling of amino acids in cell culture (SILAC)

which was introduced by Mann and co-workers164 165

In SILAC one cell population is grown

in normal or light media while the other is grown in heavy media enriched with a heavy

isotope-encoded (typically 13

C or 15

N) amino acid such as arginine or leucine Cells from the

two populations are then combined proteins are extracted digested and analyzed by MS The

relative protein expression differences are then determined from the extracted ion

chromatograms from both the light and heavy peptide forms SILAC has been shown to be a

powerful tool for the study of intracellular signal transduction In addition this technique has

recently been applied to the quantitative analysis of phosphotyrosine (pTyr) proteomes to

characterize pTyr-dependent signaling pathways166 167

38

Labe-free quantification

Although various isotope labeling methods have provided powerful tools for quantitative

proteomics several limitations of these approaches are noted Labeling increases the cost and

complexity of sample preparation introduces potential errors during the labeling reaction It also

requires a higher sample concentration and complicates data processing and interpretation In

addition so far only TMTs and iTRAQ allow the comparison of multiple (up to eight) samples

simultaneously The comparison of more than eight samples in a single experiment cannot be

achieved by isotope labeling In order to address these concerns there has been significant

interest in the development of label-free quantitative approaches Current label-free

quantification methods for MS-based proteomics were developed based on the observation that

the chromatographic peak area of a peptide168 169

or frequency of MSMS spectra170

correlating

to the protein or peptide concentration Therefore the two most common label-free

quantification approaches are conducted by comparing (i) area under the curve (AUC) of any

given peptides171 172

or (ii) by frequency measurements of MSMS spectra assigned to a protein

commonly referred to as spectral counting173

Several recent reviews provided detailed and

comprehensive knowledge comparing label-free methods with labeling methods data processing

and commercially available software for label-free quantitative proteomics174-177

Dissociation Techniques

The vast majority of proteomic experiments have proteins or peptides being identified by

two critical pieces of data obtained from the mass spectrometer The first is the precursor ion

identified by its mz which is informative to the mass of the peptide being analyzed The second

is the use of tandem mass spectrometry to fragment or dissociate the precursor ion and record the

39

generated fragment ion pattern to discern the amino acid sequence The three most popular

dissociation or fragmentation techniques for peptides are CID electron-transfer dissociation

(ETD) and high-energy collision dissociation (HCD) A recent study on the human plasma

proteome demonstrated that combined fragmentation techniques enhance coverage by providing

complementary information for identifications CID enabled the greatest number of protein

identifications while HCD identified an additional 25 proteins and ETD contributed an

additional 13 protein identifications178

ETDECD

Electron capture dissociation (ECD) 179

preceded ETD but ECD was developed for use

in a Penning trap for Fourier transform ion cyclotron resonance (FTICR) mass spectrometers

ECD requires the ion of interest to be in contact with near-thermal electrons and for the electron

capture event to occur on the millisecond time scale but the time scale is inadequate for electron

trapping in Paul traps or quadrupoles in the majority of mass spectrometers180

ETD involves a

radical anion like fluoranthene with low electron affinity to be transferred to peptide cation

which results in more uniform cleavage along the peptide backbone The cation accepts an

electron and the newly formed odd-electron protonated peptide undergoes fragmentation by

cleavage of the N-Cα bond which results in fragmentation ions consisting of c- and zbull-type

product ions The uniformed cleavage results in reduced sequence discrimination to labile bonds

such as PTMs and also provides improved sequencing for larger peptides compared to CID181

The realization that larger peptides produced better MSMS quality spectra compared to CID led

to a decision tree analysis strategy where peptide charge states and size determined whether the

precursor peptide would be fragmented with CID or ETD182

One of the main benefits of

ETDECD is the ability to sequence peptides with labile PTMs such as phosphorylation77 183

40

sulfation184

glycosylation185

ubiquitination186

and histone modifications187

ETD also has the

benefit of providing better sequence information on larger neuropeptides when compared to

CID188

However a thorough analysis suggested that CID still yielded more peptideprotein

identifications than ETD in large scale proteoimcs189

HCD

High energy collision dissociation (HCD)190

is an emerging fragmentation technique that

offers improved detection of small reporter ions from iTRAQ-based studies191 192

HCD is

performed at a higher energy in a collision cell instead of an ion trap like CID thus HCD does

not suffer from the low-mass cutoff limitation Furthermore HCD offers enhanced

fragmentation efficiency assisting in MSMS spectra interpretation and protein identification193

A major drawback for HCD is that the spectral acquisition times are up to two-fold longer due to

increased ion requirement for Fourier transform detection in the orbitrap194

HCD has been

reported to increase phosphopeptide identifications over CID74

but in a different study CID was

reported to offer more phosphopeptide identifications over HCD194

Work has also been done to

transfer the decision tree analysis for HCD which basically switches CID with HCD claiming

better quality data determined by higher Mascot scores with more peptide identifications195

MSE

Data dependent acquisition (DDA) is the most commonly used ion selection process in

mass spectrometers for proteomic experiments An alternative process which does not have ion

selection nor switch between MS and MSMS modes is termed MSE MS

E is a data independent

mode and does not require precursor ions of a significant intensity to be selected for MSMS

analysis196

A data independent mode decouples the mass spectrometer choosing which

precursor ions to fragment and when the ions are fragmented MSE works by a low or high

41

energy scan and no ion isolation is occurring The low energy scan is where the precursor ion is

not fragmented and the high energy scan allows fragmentation The resulting mix of precursor

and fragmentation ions is then detected simultaneously197

The data will then need to be

deconvoluted using a proprietary time-aligned algorithm that is discussed elsewhere198

The

continuous data independent acquisition allows multiple MSMS spectra to be collected during

the natural analyte peak broadening observed in chromatography which provides more data

points for AUC label-free quantification In addition lower abundance peptides can be

sequenced as more MSMS spectra are collected throughout the elution of an LC peak allowing

better signal averaging for smaller analyte peak of interest during coelution and reducing

sampling bias in typical DDA experiments where only more abundant peaks can be selected for

fragmentation

A comparison of spiked internal protein standards into a complex protein digest provided

evidence that MSE was comparable to DDA analysis in LC-MS

199 MS

E has been used for label

free proteomics of immunodepleted serum in large scale proteomics samples200

In addition

MSE was performed for the characterization of human cerebellum and primary visual cortex

proteomes Hundreds of proteins were identified including many previously reported in

neurological disorders201

MSE is quickly becoming a versatile data acquisition method recently

used in such studies as cancer cells202

schizophrenia203

and pituitary proteome discovery204

The usefulness of MSE as an unbiased data acquisition method is being assimilated into multiple

proteomics studies including studies involving neurological disorders

Data Analysis

42

One of the major bottlenecks in non-targeted proteomic experiments is how to handle the

enormous amount of data obtained Database searches biostatistical analysis de novo

sequencing PTM validation all have their place and multiple available platforms are available

If the organism being studied has had its genome sequenced databases can be created

with a list of proteins in the FASTA format to be used in database searching There are

numerous database searching algorithms for sequence identification of MSMS data including

Mascot205

Sequest206

Xtandem207

OMSSA208

and PEAKS209

These searching algorithms are

performed by matching MSMS spectra and precursor mass to sequences found within proteins

How well the actual spectra match the theoretical spectra determines a score which is unique to

the searching algorithm and usually can be extrapolated to the probability of a random hit

Recently a database has been developed for PTM analysis by the use of the program SIMS210

Specifically for phosphopeptides Ascorersquos algorithm scans the MSMS data to determine the

likelihood of correct phosphosite identification from the presence of site identifying product

ions211

If the organism that is being analyzed has not had its genome sequenced and no (or very

limited) FASTA database is available a homology search can be performed using SPIDER212

available with PEAKS software Alternatively individual MSMS spectrum can be de novo

sequenced but software is available to perform automated de novo sequencing of numerous

spectra (PEAKS208

DeNovoX and PepSeq)

For large-scale protein identifications the false discovery rate (FDR) must be established

by the searching algorithm and that is accomplished by re-searching the data with a false

database created by reversing or scrambling the amino acid sequence of the original database

used for the protein search Any hits from the false database will contribute to the FDR and this

value can be adjusted usually around 1 An additional layer of confidence in the obtained data

43

can be achieved in shotgun proteomics experiments by removing all the proteins that are

identified by only one peptide

Once a set of confident proteins or peptides have been generated from database

searching bioinformatic analysis or biostatistical analysis is needed Numerous software

packages are available for different purposes FLEXIQuant is an example for absolute

quantitation of isotopically labeled protein or peptides of interest213

FDR analysis of

phosphopeptides or other specific PTMs can be adjusted with such software as Scaffold

providing data consisting only of a specific modification214

Bioinformatic tools such as

Scaffold or ProteoIQ also include gene ontology (GO) analysis which can classify identified

proteins by three categories cellular component molecular function or biological process

Custom bioinformatics programs can also be developed and are often useful in various proteomic

studies including biomarker discovery in neurological diseases215

More detailed review of

bioinformatics in peptidomics216

and proteomics217

can be found elsewhere

Validation of Biomarkers by Targeted Proteomics

The validation of putative biomarkers identified by MS-based proteomic analysis is often

required to provide orthogonal analysis to rule out a false positive by MS and providing

additional evidence for the biomarker candidate(s) from the study for future potential clinical

assays At present antibody-based assays such as Western blotting ELISA and

immunochemistry are the most widely used methods for biomarker validation Although accurate

and well established these methods rely on protein specific antibodies for the measurement of

the putative biomarker and could be difficult for large-scale validation of all or even a subset of a

long list of putative protein biomarkers typically obtained by MS-based comparative proteomic

44

analysis Large scale validation is impractical due to the cost for each antibody the labor to

develop a publishable Western blot or ELISA and the antibody availability for certain proteins

As an alternative strategy quantitative assays based on multiple-reaction monitoring (MRM) MS

using a triple quadrupole mass spectrometer have been employed in biomarker verification

MRM is the most common use of MSMS for absolute quantitation It is a hypothesis

driven experiment where the peptide of interest and its subsequent fragmentation pattern must be

known prior to the quantitative MRM experiments MRM involves selecting a specific mz (first

quadrupole) to be isolated for fragmentation (second quadrupole) followed by one or more of

the most intense fragment ions (third quadrupole) being monitored The ability to quantitate and

thus validate the proteins or peptides as potential biomarkers is achieved by performing MRM on

isotopically labeled reference peptide for targeted peptideprotein of interest The main obstacle

for quantification of peptides is interference and ion suppression effects from co-eluting

substances Since the isotopically labeled and native peptide will co-elute the same interference

and ion suppression will occur for both peptides and thus correcting these interfering effects

Peptides need to be systematically chosen for a highly sensitive and reproducible MRM

experiment to ensure proper validation of putative biomarkers Peptides require certain intrinsic

properties which include an mz within the practical mass detection range for the instrument and

high ionization efficiency If the desired peptide to be quantified is derived from a digestion

then peptides that have detectable incomplete digestion or missed cleavage site can be a major

source of variability Peptides with a methionine and to a lesser extent tryptophan are

traditionally removed from consideration from MRM quantitative experiments due to the

variable nature of the oxidation that can occur In addition if chromatographic separation is

performed the retention behavior of the peptide must be well behaved with little tailing effects

45

eluting late causing broadening of the peak and even irreversible binding to the column As an

example hydrophilic peptides being eluted off a C18 column may exhibit the previously

described concerns and a different chromatographic separation will need to be explored for

improved limits of detection quantitation and validation To determine consistent peptide

detection or usefulness of certain peptides databases such as Proteomics Database218

PRIDE219

PeptideAtlas220

have been developed to compile proteomic data repositories from initial

discovery experiments

After the peptide is selected for analysis the proper MRM transitions need to be selected

to optimize the sensitivity and selectivity of the experiment It is common for investigators to

select two or three of the most intense transitions for the proposed experiment It is imperative

that the same instrument is used for the determination of transition ions as different mass

spectrometers may have a bias towards different fragment ions

MRM experiments are still highly popular experiments for hypothesis directed

experiments221

biomarker analysis222

and validation223

Validation of putative biomarkers is

increasingly becoming a necessary step when performing large scale non-hypothesis driven

proteomics experiments The traditional validation techniques of ELISA Western blotting and

immunohistochemistry are still used but MRM experiments are becoming an attractive

alternative for validation of putative biomarkers due to its enhanced throughput and specificity

Current work is still being performed to both expand the linear dynamic range224

and

sensitivity225

of MRM A recent endeavor to increase the sensitivity for MRM experiments was

accomplished by ldquoPulsed MRMrdquo via the use of an ion funnel trap to enhance confinement and

accumulation of ions The authors claimed an increase by 5-fold for peak amplitude and a 2-3

fold reduction in chemical background225

46

Remaining Challenges and Emerging Technologies

Large sample numbers for mass spectrometry analysis

Multiple conventional studies in proteomics have been performed on a single or a few

biological samples As bio-variability can be exceedingly high the need for larger sample sizes

is currently being investigated Prentice et al used a starting point of 3200 patient samples

from the Womenrsquos Health Institute (WHI) to probe the plasma proteome using MS for

biomarkers The study did not test the 3200 patient samples by MS because even a simple one

hour one dimensional RP analysis on a mass spectrometer would take months of instrument time

for uninterrupted analysis Instead the authors pooled 100 samples together to bring the total

number of pooled samples to 32 To provide relevant plasma biomarkers the samples were then

subjected to immunodepletion 2-D protein separation (96 fractions total) and then 1-D RPLC of

tryptic peptide separation on-line interface to a mass spectrometer The large sample cohorts

help address bio-variability that can be a concern from small sample size proteomic experiments

and provide ample sample amounts to investigate the low abundance proteins226

Hemoglobin-derived neuropeptides and non-classical neuropeptides

Neuropeptides such as neuropeptide Y and enkephalin are short chains of amino acids

that are secreted from a range of neuronal cells that signal nearby cells In contrast non-classical

neuropeptides are termed as neuropeptides or ldquomicroproteinsrdquo which are derived from

intracellular protein fragments and synthesized from the cytosol227

MS was recently used to

determine that hemopressins which are hemoglobin-derived peptides are upregulated in Cpefatfat

mice brains Gelman et al designed an MS experiment to compare hemoglobin-derived

47

peptides comparing the brain blood and heart peptidome in mice The authors provided data

that specific hemoglobin peptides were produced in the brain and were not produced in the

blood Certain alpha and beta hemoglobin peptides were also up regulated in the brain for

Cpefatfat

mice and bind to CB1 cannabinoid receptors228

As discussed earlier in the review

peptidomics and specifically neuropeptidomics are popular fields of study utilizing MS and non-

classical neuropeptides is an exciting emerging area of research that could further expand the

diversity of cell-cell signaling molecules

Ultrasensitive mass spectrometry for single cell analysis

In addition to large scale analysis MS-based proteomics and peptidomics are making

progress into ultrasensitive single cell analysis The most successful MS-based techniques for

single cell analysis was performed with MALDI and studies that have been performed on

relatively large neurons are reviewed elsewhere229

The ultrasensitive MS analysis is currently

directed towards single cell analysis of smaller cells including cancer cells The first challenge

in single cell analysis is the isolation and further sample preparation to yield relevant data

Collection and isolation of a cell type can be accomplished using antibodies for fluorescence

activated cell sorting (FACS) and immune magnetic separation FACS works by flow cytometry

sorting cells by a laser that excites a fluorescent tag that is attached to an antibody Immune

magnetic separation allows separation by antibodies with magnetic properties such as

Dynabeads230

One exciting study combining FACS and MS termed mass cytometry This

technology works by infusing a droplet into an inductively coupled plasma mass spectrometer

(ICP-MS) containing a single cell bound to antibodies chelated to transition elements allowing a

quantifying response between single cells231

Clearly the future of single cell analysis for

48

biomarker analysis and proteomics is encouraging and has the potential to be an emerging field

in MS-based proteomics and peptidomics

Laserspray ionization (LSI)

Laserspray ionization (LSI) is an exciting new method to produce multiply charged mass

spectra from MALDI that is nearly identical to ESI232-234

Recently it has been reported that LSI

can be performed in lieu of matrix to produce a total solvent-free analysis234

The benefits of

being able to generate multiply charged peptides without any solvent may offer advantages

including MS analysis of insoluble membrane proteins or hydrophobic peptides avoidance of

chemical reactions while in solvents a reduction of sample loss due to liquid sample preparation

and ability to avoid diffusion effects from tissue imaging studies234

The multiply charged peptide and protein ions produced by LSI expand the mass range

for tissue imaging analysis More importantly the multiply-charged peptide ions are amenable

for electron-based fragmentation methods such as ETD or ECD which can be employed in

conjunction with tissue imaging experiments to yield in situ sequencing and identification of

peptides of interest235

Paper spray ionization

Paper spray (PS) is an ambient ionization method which was first reported using

chromatography paper allowing detection of metabolites from dried blood spots The original

method used a cut out piece of paper with a voltage clipped on the back while applying 10 microL of

methanolH2O236

Improvements have been made to this technology to enhance analysis

efficiency with a new solvent 91 dichloromethaneisopropanol (vv) and the use of silica paper

49

over chromatography paper237

Interesting applications or modifications have been made to PS

including direct analysis of biological tissue238

and leaf spray for direct analysis of plant

materials239

but both detect metabolites instead of proteins or peptides Paper spray ionization

was previously shown to enable detection of cytochrome c and bradykinin [2-9] standards in a

proof of principle study240

Clearly the utility of PS analysis in proteomics and peptidomics is

yet to be explored

niECD

New fragmentation techniques have been investigated for their utility in proteomics and

peptidomics including a recently reported negative-ion electron capture dissociation (niECD)

Acidic peptides which usually contain PTMs such as phosphorylation or sulfonation are often

difficult to be detected as multiply charged peptides in the positive ion mode As discussed

earlier multiply charged peptides are required for ECDETD fragmentation The fragmentation

of niECD is accomplished by a multiply negatively charged peptide adding an electron The

resulting fragmentation of multiply sulfated and phosphorylated peptide and protein standards

showed no sulfate loss and preserved phosphorylation site The resulting fragmentation pattern

from niECD was also improved in the peptide anions and provides a new strategy for de novo

sequencing with PTM localization241

Conclusions and Perspectives

Proteomics methodologies have produced large datasets of proteins involved in various

biological and disease progression processes Numerous mass spectrometry-based proteomics

and peptidomics tools have been developed and are continuously being improved in both

50

chromatographic or electrophoretic separation and MS hardware and software However several

important issues that remain to be addressed rely on further technical advances in proteomics

analysis When large proteomes consisting of thousands of proteins are analyzed and quantified

dynamic range is still limited with more abundant proteins being preferentially detected

Development and optimization of chemical tagging reagents that target specific protein classes

maybe necessary to help enrich important signaling proteins and assess cellular and molecular

heterogeneity of the proteome and peptidome Furthermore a significant bottleneck in

usefulness of proteomics research is the ability to validate the results and provide clear

significant biological relevance to the results The idea of P4 medicine242 243

is an attractive

concept where the four Prsquos stand for predictive preventive personalized and participatory

Proteomics is one of the critical ldquoomicsrdquo fields and has led to the development of enabling

innovative strategies to P4 medicine244

A goal of P4 medicine is to assess both early disease

detection and disease progression in a person A simplified example of how proteomics fits into

P4 medicine is that certain brain-specific proteins could be used for diagnosis with

presymptomatic prion disease244

The concept of proteomic experiments providing an individual

biomarker is becoming more obsolete with the revised vision being a biomolecular barcode that

could potentially be ldquoscannedrdquo or be a fingerprint for a specific disease or early onset to that

disease being closer to reality An excellent review on what biomarker analysis can do for true

patients is available245

Proteomics can also generate new hypothesis that can be tested by classical biochemical

approaches If a disease has an unknown pathogenesis proteomics is a good starting point to try

to assemble putative markers that can lead to further hypothesis for evaluation If a particular

protein or PTM is associated with a disease state either qualitatively or quantitatively potential

51

treatments could target that protein of interest or investigators could monitor that protein or

PTM during potential treatments of the disease Proteomics has expanded greatly over the last

few decades with the goal of providing revealing insights to some of the most complex

biological problems currently facing the scientific community

Acknowledgements

Preparation of this manuscript was supported in part by the University of Wisconsin Graduate

School Wisconsin Alzheimerrsquos Disease Research Center Pilot Grant and a Department of

Defense Pilot Award LL acknowledges an H I Romnes Faculty Fellowship

52

Figure 1 A summary of general workflows of biomarker discovery pipeline by MS-based

proteomic approaches

Biological sample (CSF blood urine saliva cell

lysate tissue homogenates secreted proteins etc)

Protein extraction Sample pretreatment

2D-GE2D-DIGE MS 1D or 2D LC-MSMS

MALDI-IMS

Identification of

differentially

expressed proteins

Protein identification

Potential biomarkers

Biomarker validation

- Antibody based immunoassays

- MRM

Quantitative analysis

- Isotope labeling

- Label free

Identification and

localization of

differentially expressed

biomolecules

Intact tissue

Sample preparation Slice frozen tissues

thaw-mounted on plate

Apply Matrix

53

Figure 2 Tissue expression and dynamic range of the human plasma proteome A pie chart

representing the tissue of origin for the high abundance proteins shows that the majority of

proteins come from the liver (A) Conversely the lower abundance plasma proteins have a much

more diverse tissue of origin (B and C) The large dynamic range of plasma proteins is presented

and the proteins can be grouped into three categories (classical plasma proteins tissue leakage

products interleukinscytokines) (D) Adapted from Zhang et al11

and Schiess et al246

with

permission

54

55

Table 1 A summary of the common strategies applied to MS-based quantitative proteomic

analysis

Gel based Stable isotope labeling Label free

2D-GE

2D-DIGE 110

In vitro derivatization

18O

16O

153

ICAT 156

TMT 159

iTRAQ 158

Formaldehyde 247

ICPL 248

In vivo metabolic labeling

14N

15N

249

SILAC 164

AUC measurement 169 172

Spectral counting 173

AUC Area Under Curve ICAT Isotope-Coded Affinity Tag TMT Tandem Mass Tags iTRAQ Isobaric Tags for

Relative and Absolute Quantification ICPL Isotope Coded Protein Labeling SILAC Stable Isotope Labeling by

Amino Acids in Cell Culture Isobaric Tags for Relative and Absolute Quantification (iTRAQ)

56

References

1 Liu T Qian W J Mottaz H M Gritsenko M A Norbeck A D Moore R J

Purvine S O Camp D G 2nd Smith R D Evaluation of multiprotein

immunoaffinity subtraction for plasma proteomics and candidate biomarker discovery

using mass spectrometry Mol Cell Proteomics 2006 5 (11) 2167-74

2 Holten-Andersen M N Murphy G Nielsen H J Pedersen A N Christensen I J

Hoyer-Hansen G Brunner N Stephens R W Quantitation of TIMP-1 in plasma of healthy

blood donors and patients with advanced cancer Br J Cancer 1999 80 (3-4) 495-503

3 Tammen H Schulte I Hess R Menzel C Kellmann M Mohring T Schulz-

Knappe P Peptidomic analysis of human blood specimens comparison between plasma

specimens and serum by differential peptide display Proteomics 2005 5 (13) 3414-22

4 Rai A J Gelfand C A Haywood B C Warunek D J Yi J Schuchard M D

Mehigh R J Cockrill S L Scott G B Tammen H Schulz-Knappe P Speicher D W

Vitzthum F Haab B B Siest G Chan D W HUPO Plasma Proteome Project specimen

collection and handling towards the standardization of parameters for plasma proteome samples

Proteomics 2005 5 (13) 3262-77

5 Lippi G Guidi G C Mattiuzzi C Plebani M Preanalytical variability the dark side

of the moon in laboratory testing Clin Chem Lab Med 2006 44 (4) 358-65

6 Holten-Andersen M N Schrohl A S Brunner N Nielsen H J Hogdall C K

Hogdall E V Evaluation of sample handling in relation to levels of tissue inhibitor of

metalloproteinases-1 measured in blood by immunoassay Int J Biol Markers 2003 18 (3) 170-

6

7 Ytting H Christensen I J Thiel S Jensenius J C Svendsen M N Nielsen L

Lottenburger T Nielsen H J Biological variation in circulating levels of mannan-binding

lectin (MBL) and MBL-associated serine protease-2 and the influence of age gender and

physical exercise Scand J Immunol 2007 66 (4) 458-64

8 Hanash S Building a foundation for the human proteome the role of the Human

Proteome Organization J Proteome Res 2004 3 (2) 197-9

9 Omenn G S States D J Adamski M Blackwell T W Menon R Hermjakob H

Apweiler R Haab B B Simpson R J Eddes J S Kapp E A Moritz R L Chan D W

Rai A J Admon A Aebersold R Eng J Hancock W S Hefta S A Meyer H Paik Y

K Yoo J S Ping P Pounds J Adkins J Qian X Wang R Wasinger V Wu C Y

Zhao X Zeng R Archakov A Tsugita A Beer I Pandey A Pisano M Andrews P

Tammen H Speicher D W Hanash S M Overview of the HUPO Plasma Proteome Project

results from the pilot phase with 35 collaborating laboratories and multiple analytical groups

generating a core dataset of 3020 proteins and a publicly-available database Proteomics 2005 5

(13) 3226-45

10 Liu T Qian W J Gritsenko M A Xiao W Moldawer L L Kaushal A Monroe

M E Varnum S M Moore R J Purvine S O Maier R V Davis R W Tompkins R

G Camp D G 2nd Smith R D High dynamic range characterization of the trauma patient

plasma proteome Mol Cell Proteomics 2006 5 (10) 1899-913

11 Zhang Q Faca V Hanash S Mining the plasma proteome for disease applications

across seven logs of protein abundance J Proteome Res 2011 10 (1) 46-50

12 Matsubara J Honda K Ono M Sekine S Tanaka Y Kobayashi M Jung G

Sakuma T Nakamori S Sata N Nagai H Ioka T Okusaka T Kosuge T Tsuchida A

57

Shimahara M Yasunami Y Chiba T Yamada T Identification of adipophilin as a potential

plasma biomarker for colorectal cancer using label-free quantitative mass spectrometry and

protein microarray Cancer Epidemiol Biomarkers Prev 2011 20 (10) 2195-203

13 Murakoshi Y Honda K Sasazuki S Ono M Negishi A Matsubara J Sakuma

T Kuwabara H Nakamori S Sata N Nagai H Ioka T Okusaka T Kosuge T

Shimahara M Yasunami Y Ino Y Tsuchida A Aoki T Tsugane S Yamada T Plasma

biomarker discovery and validation for colorectal cancer by quantitative shotgun mass

spectrometry and protein microarray Cancer Sci 2011 102 (3) 630-8

14 Addona T A Shi X Keshishian H Mani D R Burgess M Gillette M A

Clauser K R Shen D Lewis G D Farrell L A Fifer M A Sabatine M S Gerszten R

E Carr S A A pipeline that integrates the discovery and verification of plasma protein

biomarkers reveals candidate markers for cardiovascular disease Nat Biotechnol 2011 29 (7)

635-43

15 Acosta-Martin A E Panchaud A Chwastyniak M Dupont A Juthier F Gautier

C Jude B Amouyel P Goodlett D R Pinet F Quantitative mass spectrometry analysis

using PAcIFIC for the identification of plasma diagnostic biomarkers for abdominal aortic

aneurysm PLoS One 2011 6 (12) e28698

16 Ray S Britschgi M Herbert C Takeda-Uchimura Y Boxer A Blennow K

Friedman L F Galasko D R Jutel M Karydas A Kaye J A Leszek J Miller B L

Minthon L Quinn J F Rabinovici G D Robinson W H Sabbagh M N So Y T

Sparks D L Tabaton M Tinklenberg J Yesavage J A Tibshirani R Wyss-Coray T

Classification and prediction of clinical Alzheimers diagnosis based on plasma signaling

proteins Nat Med 2007 13 (11) 1359-62

17 Lopez M F Kuppusamy R Sarracino D A Prakash A Athanas M Krastins B

Rezai T Sutton J N Peterman S Nicolaides K Mass spectrometric discovery and selective

reaction monitoring (SRM) of putative protein biomarker candidates in first trimester Trisomy 21

maternal serum J Proteome Res 2011 10 (1) 133-42

18 Sheta E A Appel S H Goldknopf I L 2D gel blood serum biomarkers reveal

differential clinical proteomics of the neurodegenerative diseases Expert Rev Proteomics 2006

3 (1) 45-62

19 Zhang J Goodlett D R Montine T J Proteomic biomarker discovery in

cerebrospinal fluid for neurodegenerative diseases J Alzheimers Dis 2005 8 (4) 377-86

20 Yuan X Desiderio D M Proteomics analysis of prefractionated human lumbar

cerebrospinal fluid Proteomics 2005 5 (2) 541-50

21 Wenner B R Lovell M A Lynn B C Proteomic analysis of human ventricular

cerebrospinal fluid from neurologically normal elderly subjects using two-dimensional LC-

MSMS J Proteome Res 2004 3 (1) 97-103

22 Maccarrone G Milfay D Birg I Rosenhagen M Holsboer F Grimm R Bailey

J Zolotarjova N Turck C W Mining the human cerebrospinal fluid proteome by

immunodepletion and shotgun mass spectrometry Electrophoresis 2004 25 (14) 2402-12

23 Yuan X Desiderio D M Proteomics analysis of human cerebrospinal fluid J

Chromatogr B Analyt Technol Biomed Life Sci 2005 815 (1-2) 179-89

24 Wong M Schlaggar B L Buller R S Storch G A Landt M Cerebrospinal fluid

protein concentration in pediatric patients defining clinically relevant reference values Arch

Pediatr Adolesc Med 2000 154 (8) 827-31

58

25 Roche S Gabelle A Lehmann S Clinical proteomics of the cerebrospinal fluid

Towards the discovery of new biomarkers Proteomics Clin Appl 2008 2 (3) 428-36

26 Rozek W Ricardo-Dukelow M Holloway S Gendelman H E Wojna V

Melendez L M Ciborowski P Cerebrospinal fluid proteomic profiling of HIV-1-infected

patients with cognitive impairment J Proteome Res 2007 6 (11) 4189-99

27 Schutzer S E Liu T Natelson B H Angel T E Schepmoes A A Purvine S O

Hixson K K Lipton M S Camp D G Coyle P K Smith R D Bergquist J

Establishing the proteome of normal human cerebrospinal fluid PLoS One 2010 5 (6) e10980

28 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C

Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome

Res 2008 7 (1) 386-99

29 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from

patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma

biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 2010 878 (22) 2003-12

30 You J S Gelfanova V Knierman M D Witzmann F A Wang M Hale J E The

impact of blood contamination on the proteome of cerebrospinal fluid Proteomics 2005 5 (1)

290-6

31 Yuan X Desiderio D M Human cerebrospinal fluid peptidomics J Mass Spectrom

2005 40 (2) 176-81

32 Ringman J M Schulman H Becker C Jones T Bai Y Immermann F Cole G

Sokolow S Gylys K Geschwind D H Cummings J L Wan H I Proteomic changes in

cerebrospinal fluid of presymptomatic and affected persons carrying familial Alzheimer disease

mutations Arch Neurol 2012 69 (1) 96-104

33 Jahn H Wittke S Zurbig P Raedler T J Arlt S Kellmann M Mullen W

Eichenlaub M Mischak H Wiedemann K Peptide fingerprinting of Alzheimers disease in

cerebrospinal fluid identification and prospective evaluation of new synaptic biomarkers PLoS

One 2011 6 (10) e26540

34 Rajagopal M U Hathout Y MacDonald T J Kieran M W Gururangan S

Blaney S M Phillips P Packer R Gordish-Dressman H Rood B R Proteomic profiling

of cerebrospinal fluid identifies prostaglandin D2 synthase as a putative biomarker for pediatric

medulloblastoma A pediatric brain tumor consortium study Proteomics 2011 11 (5) 935-43

35 Giron P Dayon L Turck N Hoogland C Sanchez J C Quantitative analysis of

human cerebrospinal fluid proteins using a combination of cysteine tagging and amine-reactive

isobaric labeling J Proteome Res 2011 10 (1) 249-58

36 Ting L Rad R Gygi S P Haas W MS3 eliminates ratio distortion in isobaric

multiplexed quantitative proteomics Nat Methods 2011 8 (11) 937-40

37 Michalski A Damoc E Hauschild J P Lange O Wieghaus A Makarov A

Nagaraj N Cox J Mann M Horning S Mass spectrometry-based proteomics using Q

Exactive a high-performance benchtop quadrupole Orbitrap mass spectrometer Mol Cell

Proteomics 2011 10 (9) M111 011015

38 Spirin V Shpunt A Seebacher J Gentzel M Shevchenko A Gygi S Sunyaev

S Assigning spectrum-specific P-values to protein identifications by mass spectrometry

Bioinformatics 2011 27 (8) 1128-34

39 Kellie J F Catherman A D Durbin K R Tran J C Tipton J D Norris J L

Witkowski C E Thomas P M Kelleher N L Robust Analysis of the Yeast Proteome under

59

50 kDa by Molecular-Mass-Based Fractionation and Top-Down Mass Spectrometry Anal Chem

2012 84 (1) 209-15

40 Wilhelm M Kirchner M Steen J A Steen H mz5 Space- and Time-efficient

Storage of Mass Spectrometry Data Sets Mol Cell Proteomics 2012 11 (1) O111 011379

41 Zhou F Sikorski T W Ficarro S B Webber J T Marto J A Online nanoflow

reversed phase-strong anion exchange-reversed phase liquid chromatography-tandem mass

spectrometry platform for efficient and in-depth proteome sequence analysis of complex

organisms Anal Chem 2011 83 (18) 6996-7005

42 Winter D Steen H Optimization of cell lysis and protein digestion protocols for the

analysis of HeLa S3 cells by LC-MSMS Proteomics 2011 11 (24) 4726-30

43 Shteynberg D Deutsch E W Lam H Eng J K Sun Z Tasman N Mendoza L

Moritz R L Aebersold R Nesvizhskii A I iProphet multi-level integrative analysis of

shotgun proteomic data improves peptide and protein identification rates and error estimates

Mol Cell Proteomics 2011 10 (12) M111 007690

44 Kellie J F Catherman A D Durbin K R Tran J C Tipton J D Norris J L

Witkowski C E Thomas P M Kelleher N L Robust Analysis of the Yeast Proteome under

50 kDa by Molecular-Mass-Based Fractionation and Top-Down Mass Spectrometry Anal Chem

84 (1) 209-15

45 Datta A Park J E Li X Zhang H Ho Z S Heese K Lim S K Tam J P Sze

S K Phenotyping of an in vitro model of ischemic penumbra by iTRAQ-based shotgun

quantitative proteomics J Proteome Res 9 (1) 472-84

46 Decramer S Gonzalez de Peredo A Breuil B Mischak H Monsarrat B Bascands

J L Schanstra J P Urine in clinical proteomics Mol Cell Proteomics 2008 7 (10) 1850-62

47 De La Monte S M Wands J R The AD7c-NTP neuronal thread protein biomarker for

detecting Alzheimers disease J Alzheimers Dis 2001 3 (3) 345-353

48 Van Dorsselaer A Carapito C Delalande F Schaeffer-Reiss C Thierse D Diemer

H McNair D S Krewski D Cashman N R Detection of prion protein in urine-derived

injectable fertility products by a targeted proteomic approach PLoS One 2011 6 (3) e17815

49 Kuwabara Y Mine K Katayama A Inagawa T Akira S Takeshita T Proteomic

analyses of recombinant human follicle-stimulating hormone and urinary-derived gonadotropin

preparations J Reprod Med 2009 54 (8) 459-66

50 Court M Selevsek N Matondo M Allory Y Garin J Masselon C D Domon B

Toward a standardized urine proteome analysis methodology Proteomics 2011 11 (6) 1160-71

51 Marimuthu A OMeally R N Chaerkady R Subbannayya Y Nanjappa V Kumar

P Kelkar D S Pinto S M Sharma R Renuse S Goel R Christopher R Delanghe B

Cole R N Harsha H C Pandey A A comprehensive map of the human urinary proteome J

Proteome Res 2011 10 (6) 2734-43

52 Bermejo-Pareja F Antequera D Vargas T Molina J A Carro E Saliva levels of

Abeta1-42 as potential biomarker of Alzheimers disease a pilot study BMC Neurol 2010 10

108

53 Devic I Hwang H Edgar J S Izutsu K Presland R Pan C Goodlett D R

Wang Y Armaly J Tumas V Zabetian C P Leverenz J B Shi M Zhang J Salivary

alpha-synuclein and DJ-1 potential biomarkers for Parkinsons disease Brain 2011 134 (Pt 7)

e178

60

54 Al-Tarawneh S K Border M B Dibble C F Bencharit S Defining salivary

biomarkers using mass spectrometry-based proteomics a systematic review OMICS 2011 15

(6) 353-61

55 Castagnola M Cabras T Vitali A Sanna M T Messana I Biotechnological

implications of the salivary proteome Trends Biotechnol 2011 29 (8) 409-18

56 Caprioli R M Farmer T B Gile J Molecular imaging of biological samples

localization of peptides and proteins using MALDI-TOF MS Anal Chem 1997 69 (23) 4751-

60

57 Stoeckli M Chaurand P Hallahan D E Caprioli R M Imaging mass spectrometry

a new technology for the analysis of protein expression in mammalian tissues Nat Med 2001 7

(4) 493-6

58 Chaurand P Schwartz S A Caprioli R M Imaging mass spectrometry a new tool to

investigate the spatial organization of peptides and proteins in mammalian tissue sections Curr

Opin Chem Biol 2002 6 (5) 676-81

59 Chaurand P Stoeckli M Caprioli R M Direct profiling of proteins in biological

tissue sections by MALDI mass spectrometry Anal Chem 1999 71 (23) 5263-70

60 Seeley E H Caprioli R M MALDI imaging mass spectrometry of human tissue

method challenges and clinical perspectives Trends Biotechnol 2011 29 (3) 136-43

61 Cazares L H Troyer D A Wang B Drake R R Semmes O J MALDI tissue

imaging from biomarker discovery to clinical applications Anal Bioanal Chem 2011 401 (1)

17-27

62 Stauber J Lemaire R Franck J Bonnel D Croix D Day R Wisztorski M

Fournier I Salzet M MALDI imaging of formalin-fixed paraffin-embedded tissues

application to model animals of Parkinson disease for biomarker hunting J Proteome Res 2008

7 (3) 969-78

63 Yuki D Sugiura Y Zaima N Akatsu H Hashizume Y Yamamoto T Fujiwara

M Sugiyama K Setou M Hydroxylated and non-hydroxylated sulfatide are distinctly

distributed in the human cerebral cortex Neuroscience 2011 193 44-53

64 Makridakis M Vlahou A Secretome proteomics for discovery of cancer biomarkers J

Proteomics 2010 73 (12) 2291-305

65 Dowling P Clynes M Conditioned media from cell lines a complementary model to

clinical specimens for the discovery of disease-specific biomarkers Proteomics 2011 11 (4)

794-804

66 Moore N H Costa L G Shaffer S A Goodlett D R Guizzetti M Shotgun

proteomics implicates extracellular matrix proteins and protease systems in neuronal

development induced by astrocyte cholinergic stimulation J Neurochem 2009 108 (4) 891-908

67 Keene S D Greco T M Parastatidis I Lee S H Hughes E G Balice-Gordon R

J Speicher D W Ischiropoulos H Mass spectrometric and computational analysis of

cytokine-induced alterations in the astrocyte secretome Proteomics 2009 9 (3) 768-82

68 Dowell J A Johnson J A Li L Identification of astrocyte secreted proteins with a

combination of shotgun proteomics and bioinformatics J Proteome Res 2009 8 (8) 4135-43

69 Ahmed F E Sample preparation and fractionation for proteome analysis and cancer

biomarker discovery by mass spectrometry J Sep Sci 2009 32 (5-6) 771-98

70 Righetti P G Fasoli E Boschetti E Combinatorial peptide ligand libraries the

conquest of the hidden proteome advances at great strides Electrophoresis 2011 32 (9) 960-6

61

71 Selvaraju S El Rassi Z Liquid-phase-based separation systems for depletion

prefractionation and enrichment of proteins in biological fluids and matrices for in-depth

proteomics analysis - An update covering the period 2008-2011 Electrophoresis 2011 33 (1)

74-88

72 Kalume D E Molina H Pandey A Tackling the phosphoproteome tools and

strategies Curr Opin Chem Biol 2003 7 (1) 64-9

73 Cohen P The regulation of protein function by multisite phosphorylation--a 25 year

update Trends Biochem Sci 2000 25 (12) 596-601

74 Nagaraj N DSouza R C Cox J Olsen J V Mann M Feasibility of large-scale

phosphoproteomics with higher energy collisional dissociation fragmentation J Proteome Res

2010 9 (12) 6786-94

75 Olsen J V Vermeulen M Santamaria A Kumar C Miller M L Jensen L J

Gnad F Cox J Jensen T S Nigg E A Brunak S Mann M Quantitative

phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis Sci

Signal 2010 3 (104) ra3

76 Wagner P D Vu N D Histidine to aspartate phosphotransferase activity of nm23

proteins phosphorylation of aldolase C on Asp-319 Biochem J 2000 346 Pt 3 623-30

77 Chi A Huttenhower C Geer L Y Coon J J Syka J E Bai D L Shabanowitz

J Burke D J Troyanskaya O G Hunt D F Analysis of phosphorylation sites on proteins

from Saccharomyces cerevisiae by electron transfer dissociation (ETD) mass spectrometry Proc

Natl Acad Sci U S A 2007 104 (7) 2193-8

78 Rudrabhatla P Jaffe H Pant H C Direct evidence of phosphorylated neuronal

intermediate filament proteins in neurofibrillary tangles (NFTs) phosphoproteomics of

Alzheimers NFTs FASEB J 25 (11) 3896-905

79 Saez-Valero J Fodero L R Sjogren M Andreasen N Amici S Gallai V

Vanderstichele H Vanmechelen E Parnetti L Blennow K Small D H Glycosylation of

acetylcholinesterase and butyrylcholinesterase changes as a function of the duration of

Alzheimers disease J Neurosci Res 2003 72 (4) 520-6

80 Silveyra M X Cuadrado-Corrales N Marcos A Barquero M S Rabano A

Calero M Saez-Valero J Altered glycosylation of acetylcholinesterase in Creutzfeldt-Jakob

disease J Neurochem 2006 96 (1) 97-104

81 Wei X Li L Comparative glycoproteomics approaches and applications Brief Funct

Genomic Proteomic 2009 8 (2) 104-13

82 Tian Y Zhang H Glycoproteomics and clinical applications Proteomics Clin Appl

2010 4 (2) 124-32

83 Butterfield D A Owen J B Lectin-affinity chromatography brain glycoproteomics

and Alzheimer disease insights into protein alterations consistent with the pathology and

progression of this dementing disorder Proteomics Clin Appl 2011 5 (1-2) 50-6

84 Wei X Herbst A Ma D Aiken J Li L A quantitative proteomic approach to prion

disease biomarker research delving into the glycoproteome J Proteome Res 2010 10 (6) 2687-

702

85 Rucevic M Hixson D Josic D Mammalian plasma membrane proteins as potential

biomarkers and drug targets Electrophoresis 2011 32 (13) 1549-64

86 Gilmore J M Washburn M P Advances in shotgun proteomics and the analysis of

membrane proteomes J Proteomics 2010 73 (11) 2078-91

62

87 Weekes M P Antrobus R Lill J R Duncan L M Hor S Lehner P J

Comparative analysis of techniques to purify plasma membrane proteins J Biomol Tech 2010

21 (3) 108-15

88 Helbig A O Heck A J Slijper M Exploring the membrane proteome--challenges

and analytical strategies J Proteomics 2010 73 (5) 868-78

89 Griffin N M Schnitzer J E Overcoming key technological challenges in using mass

spectrometry for mapping cell surfaces in tissues Mol Cell Proteomics 2011 10 (2) R110

000935

90 Groen A J Lilley K S Proteomics of total membranes and subcellular membranes

Expert Rev Proteomics 2010 7 (6) 867-78

91 Wisniewski J R Tools for phospho- and glycoproteomics of plasma membranes Amino

Acids 2011 41 (2) 223-33

92 Wei X Dulberger C Li L Characterization of murine brain membrane glycoproteins

by detergent assisted lectin affinity chromatography Anal Chem 2010 82 (15) 6329-33

93 Zhang H Guo T Li X Datta A Park J E Yang J Lim S K Tam J P Sze S

K Simultaneous characterization of glyco- and phosphoproteomes of mouse brain membrane

proteome with electrostatic repulsion hydrophilic interaction chromatography Mol Cell

Proteomics 2011 9 (4) 635-47

94 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 2011 10 (9) 4219-29

95 Ma M Chen R Ge Y He H Marshall A G Li L Combining bottom-up and top-

down mass spectrometric strategies for de novo sequencing of the crustacean hyperglycemic

hormone from Cancer borealis Anal Chem 2009 81 (1) 240-7

96 Chen R Ma M Hui L Zhang J Li L Measurement of neuropeptides in crustacean

hemolymph via MALDI mass spectrometry J Am Soc Mass Spectrom 2009 20 (4) 708-18

97 Wei H Nolkrantz K Parkin M C Chisolm C N OCallaghan J P Kennedy R

T Identification and quantification of neuropeptides in brain tissue by capillary liquid

chromatography coupled off-line to MALDI-TOF and MALDI-TOFTOF-MS Anal Chem 2006

78 (13) 4342-51

98 Dowell J A Heyden W V Li L Rat neuropeptidomics by LC-MSMS and MALDI-

FTMS Enhanced dissection and extraction techniques coupled with 2D RP-RP HPLC J

Proteome Res 2006 5 (12) 3368-75

99 Che F Y Fricker L D Quantitative peptidomics of mouse pituitary comparison of

different stable isotopic tags J Mass Spectrom 2005 40 (2) 238-49

100 Fricker L D Analysis of mouse brain peptides using mass spectrometry-based

peptidomics implications for novel functions ranging from non-classical neuropeptides to

microproteins Mol Biosyst 2010 6 (8) 1355-65

101 Colgrave M L Xi L Lehnert S A Flatscher-Bader T Wadensten H Nilsson A

Andren P E Wijffels G Neuropeptide profiling of the bovine hypothalamus thermal

stabilization is an effective tool in inhibiting post-mortem degradation Proteomics 2011 11 (7)

1264-76

102 Scholz B Alm H Mattsson A Nilsson A Kultima K Savitski M M Falth M

Skold K Brunstrom B Andren P E Dencker L Neuropeptidomic analysis of the

embryonic Japanese quail diencephalon BMC Dev Biol 2010 10 30

63

103 Hummon A B Amare A Sweedler J V Discovering new invertebrate neuropeptides

using mass spectrometry Mass Spectrom Rev 2006 25 (1) 77-98

104 Vilim F S Sasaki K Rybak J Alexeeva V Cropper E C Jing J Orekhova I

V Brezina V Price D Romanova E V Rubakhin S S Hatcher N Sweedler J V

Weiss K R Distinct mechanisms produce functionally complementary actions of neuropeptides

that are structurally related but derived from different precursors J Neurosci 2010 30 (1) 131-

47

105 Fu Q Li L De novo sequencing of neuropeptides using reductive isotopic methylation

and investigation of ESI QTOF MSMS fragmentation pattern of neuropeptides with N-terminal

dimethylation Anal Chem 2005 77 (23) 7783-95

106 Chen R Jiang X Conaway M C Mohtashemi I Hui L Viner R Li L Mass

spectral analysis of neuropeptide expression and distribution in the nervous system of the lobster

Homarus americanus J Proteome Res 2010 9 (2) 818-32

107 Altelaar A F Mohammed S Brans M A Adan R A Heck A J Improved

identification of endogenous peptides from murine nervous tissue by multiplexed peptide

extraction methods and multiplexed mass spectrometric analysis J Proteome Res 2009 8 (2)

870-6

108 Gelman J S Sironi J Castro L M Ferro E S Fricker L D Peptidomic analysis

of human cell lines J Proteome Res 2011 10 (4) 1583-92

109 Li L Sweedler J V Peptides in the brain mass spectrometry-based measurement

approaches and challenges Annu Rev Anal Chem (Palo Alto Calif) 2008 1 451-83

110 Marouga R David S Hawkins E The development of the DIGE system 2D

fluorescence difference gel analysis technology Anal Bioanal Chem 2005 382 (3) 669-78

111 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S

Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-

dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of

Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66

112 Ottervald J Franzen B Nilsson K Andersson L I Khademi M Eriksson B

Kjellstrom S Marko-Varga G Vegvari A Harris R A Laurell T Miliotis T

Matusevicius D Salter H Ferm M Olsson T Multiple sclerosis Identification and clinical

evaluation of novel CSF biomarkers J Proteomics 2010 73 (6) 1117-32

113 Diez R Herbstreith M Osorio C Alzate O 2-D Fluorescence Difference Gel

Electrophoresis (DIGE) in Neuroproteomics 2010

114 Lilley K S Razzaq A Dupree P Two-dimensional gel electrophoresis recent

advances in sample preparation detection and quantitation Curr Opin Chem Biol 2002 6 (1)

46-50

115 Oh-Ishi M Maeda T Separation techniques for high-molecular-mass proteins J

Chromatogr B Analyt Technol Biomed Life Sci 2002 771 (1-2) 49-66

116 Washburn M P Wolters D Yates J R 3rd Large-scale analysis of the yeast

proteome by multidimensional protein identification technology Nat Biotechnol 2001 19 (3)

242-7

117 Burke T W Mant C T Black J A Hodges R S Strong cation-exchange high-

performance liquid chromatography of peptides Effect of non-specific hydrophobic interactions

and linearization of peptide retention behaviour J Chromatogr 1989 476 377-89

118 Edelmann M J Strong cation exchange chromatography in analysis of posttranslational

modifications innovations and perspectives J Biomed Biotechnol 2011 2011 936508

64

119 Francois I Sandra K Sandra P Comprehensive liquid chromatography fundamental

aspects and practical considerations--a review Anal Chim Acta 2009 641 (1-2) 14-31

120 Guiochon G Marchetti N Mriziq K Shalliker R A Implementations of two-

dimensional liquid chromatography J Chromatogr A 2008 1189 (1-2) 109-68

121 Song C Ye M Han G Jiang X Wang F Yu Z Chen R Zou H Reversed-

phase-reversed-phase liquid chromatography approach with high orthogonality for

multidimensional separation of phosphopeptides Anal Chem 2010 82 (1) 53-6

122 Alpert A J Hydrophilic-interaction chromatography for the separation of peptides

nucleic acids and other polar compounds J Chromatogr 1990 499 177-96

123 Gilar M Olivova P Daly A E Gebler J C Orthogonality of separation in two-

dimensional liquid chromatography Anal Chem 2005 77 (19) 6426-34

124 Di Palma S Stange D van de Wetering M Clevers H Heck A J Mohammed S

Highly sensitive proteome analysis of FACS-sorted adult colon stem cells J Proteome Res 2011

10 (8) 3814-9

125 Di Palma S Boersema P J Heck A J Mohammed S Zwitterionic hydrophilic

interaction liquid chromatography (ZIC-HILIC and ZIC-cHILIC) provide high resolution

separation and increase sensitivity in proteome analysis Anal Chem 2011 83 (9) 3440-7

126 Zarei M Sprenger A Metzger F Gretzmeier C Dengjel J Comparison of ERLIC-

TiO2 HILIC-TiO2 and SCX-TiO2 for global phosphoproteomics approaches J Proteome Res

2011 10 (8) 3474-83

127 Neue K Mormann M Peter-Katalinic J Pohlentz G Elucidation of glycoprotein

structures by unspecific proteolysis and direct nanoESI mass spectrometric analysis of ZIC-

HILIC-enriched glycopeptides J Proteome Res 2011 10 (5) 2248-60

128 Ow S Y Salim M Noirel J Evans C Wright P C Minimising iTRAQ ratio

compression through understanding LC-MS elution dependence and high-resolution HILIC

fractionation Proteomics 2011 11 (11) 2341-6

129 Alpert A J Electrostatic repulsion hydrophilic interaction chromatography for isocratic

separation of charged solutes and selective isolation of phosphopeptides Anal Chem 2008 80

(1) 62-76

130 Chien K Y Liu H C Goshe M B Development and application of a

phosphoproteomic method using electrostatic repulsion-hydrophilic interaction chromatography

(ERLIC) IMAC and LC-MSMS analysis to study Mareks Disease Virus infection J Proteome

Res 2011 10 (9) 4041-53

131 Liu X Plasencia M Ragg S Valentine S J Clemmer D E Development of high

throughput dispersive LC-ion mobility-TOFMS techniques for analysing the human plasma

proteome Brief Funct Genomic Proteomic 2004 3 (2) 177-86

132 Valentine S J Liu X Plasencia M D Hilderbrand A E Kurulugama R T

Koeniger S L Clemmer D E Developing liquid chromatography ion mobility mass

spectometry techniques Expert Rev Proteomics 2005 2 (4) 553-65

133 Valentine S J Plasencia M D Liu X Krishnan M Naylor S Udseth H R

Smith R D Clemmer D E Toward plasma proteome profiling with ion mobility-mass

spectrometry J Proteome Res 2006 5 (11) 2977-84

134 Valentine S J Ewing M A Dilger J M Glover M S Geromanos S Hughes C

Clemmer D E Using ion mobility data to improve peptide identification intrinsic amino acid

size parameters J Proteome Res 2011 10 (5) 2318-29

65

135 Ahmed F E The role of capillary electrophoresis-mass spectrometry to proteome

analysis and biomarker discovery J Chromatogr B Analyt Technol Biomed Life Sci 2009 877

(22) 1963-81

136 Fonslow B R Yates J R 3rd Capillary electrophoresis applied to proteomic analysis

J Sep Sci 2009 32 (8) 1175-88

137 Haselberg R de Jong G J Somsen G W Capillary electrophoresis-mass

spectrometry for the analysis of intact proteins J Chromatogr A 2007 1159 (1-2) 81-109

138 Haselberg R de Jong G J Somsen G W Capillary electrophoresis-mass

spectrometry for the analysis of intact proteins 2007-2010 Electrophoresis 2011 32 (1) 66-82

139 Huck C W Bakry R Huber L A Bonn G K Progress in capillary electrophoresis

coupled to matrix-assisted laser desorptionionization - time of flight mass spectrometry

Electrophoresis 2006 27 (11) 2063-74

140 Klampfl C W CE with MS detection a rapidly developing hyphenated technique

Electrophoresis 2009 30 Suppl 1 S83-91

141 Simpson D C Smith R D Combining capillary electrophoresis with mass

spectrometry for applications in proteomics Electrophoresis 2005 26 (7-8) 1291-305

142 Li Y Champion M M Sun L Champion P A Wojcik R Dovichi N J Capillary

zone electrophoresis-electrospray ionization-tandem mass spectrometry as an alternative

proteomics platform to ultraperformance liquid chromatography-electrospray ionization-tandem

mass spectrometry for samples of intermediate complexity Anal Chem 2012 84 (3) 1617-22

143 Dai L Li C Shedden K A Lee C J Quoc H Simeone D M Lubman D M

Quantitative proteomic profiling studies of pancreatic cancer stem cells J Proteome Res 2010 9

(7) 3394-402

144 Chen J Balgley B M DeVoe D L Lee C S Capillary isoelectric focusing-based

multidimensional concentrationseparation platform for proteome analysis Anal Chem 2003 75

(13) 3145-52

145 Mischak H Delles C Klein J Schanstra J P Urinary proteomics based on capillary

electrophoresis-coupled mass spectrometry in kidney disease discovery and validation of

biomarkers and clinical application Adv Chronic Kidney Dis 2010 17 (6) 493-506

146 Albalat A Mischak H Mullen W Urine proteomics in clinical applications

technologies principal considerations and clinical implementation Prilozi 2011 32 (1) 13-44

147 Zuberovic A Hanrieder J Hellman U Bergquist J Wetterhall M Proteome

profiling of human cerebrospinal fluid exploring the potential of capillary electrophoresis with

surface modified capillaries for analysis of complex biological samples Eur J Mass Spectrom

(Chichester Eng) 2008 14 (4) 249-60

148 Johannesson N Olsson L Backstrom D Wetterhall M Danielsson R Bergquist

J Screening for biomarkers in plasma from patients with gangrenous and phlegmonous

appendicitis using CE and CEC in combination with MS Electrophoresis 2007 28 (9) 1435-43

149 Guo T Lee C S Wang W DeVoe D L Balgley B M Capillary separations

enabling tissue proteomics-based biomarker discovery Electrophoresis 2006 27 (18) 3523-32

150 Hwang S I Thumar J Lundgren D H Rezaul K Mayya V Wu L Eng J

Wright M E Han D K Direct cancer tissue proteomics a method to identify candidate

cancer biomarkers from formalin-fixed paraffin-embedded archival tissues Oncogene 2007 26

(1) 65-76

66

151 Desiderio C Rossetti D V Iavarone F Messana I Castagnola M Capillary

electrophoresis--mass spectrometry recent trends in clinical proteomics J Pharm Biomed Anal

2010 53 (5) 1161-9

152 Albalat A Mischak H Mullen W Clinical application of urinary

proteomicspeptidomics Expert Rev Proteomics 2011 8 (5) 615-29

153 Ye X Luke B Andresson T Blonder J 18O stable isotope labeling in MS-based

proteomics Brief Funct Genomic Proteomic 2009 8 (2) 136-44

154 Staes A Demol H Van Damme J Martens L Vandekerckhove J Gevaert K

Global differential non-gel proteomics by quantitative and stable labeling of tryptic peptides with

oxygen-18 J Proteome Res 2004 3 (4) 786-91

155 Jorge I Navarro P Martinez-Acedo P Nunez E Serrano H Alfranca A

Redondo J M Vazquez J Statistical model to analyze quantitative proteomics data obtained

by 18O16O labeling and linear ion trap mass spectrometry application to the study of vascular

endothelial growth factor-induced angiogenesis in endothelial cells Mol Cell Proteomics 2009

8 (5) 1130-49

156 Gygi S P Rist B Gerber S A Turecek F Gelb M H Aebersold R Quantitative

analysis of complex protein mixtures using isotope-coded affinity tags Nat Biotechnol 1999 17

(10) 994-9

157 Haqqani A S Kelly J F Stanimirovic D B Quantitative protein profiling by mass

spectrometry using isotope-coded affinity tags Methods Mol Biol 2008 439 225-40

158 Ross P L Huang Y N Marchese J N Williamson B Parker K Hattan S

Khainovski N Pillai S Dey S Daniels S Purkayastha S Juhasz P Martin S Bartlet-

Jones M He F Jacobson A Pappin D J Multiplexed protein quantitation in

Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents Mol Cell Proteomics

2004 3 (12) 1154-69

159 Thompson A Schafer J Kuhn K Kienle S Schwarz J Schmidt G Neumann T

Johnstone R Mohammed A K Hamon C Tandem mass tags a novel quantification strategy

for comparative analysis of complex protein mixtures by MSMS Anal Chem 2003 75 (8)

1895-904

160 Dayon L Hainard A Licker V Turck N Kuhn K Hochstrasser D F Burkhard

P R Sanchez J C Relative quantification of proteins in human cerebrospinal fluids by

MSMS using 6-plex isobaric tags Anal Chem 2008 80 (8) 2921-31

161 DAscenzo M Choe L Lee K H iTRAQPak an R based analysis and visualization

package for 8-plex isobaric protein expression data Brief Funct Genomic Proteomic 2008 7 (2)

127-35

162 Choe L DAscenzo M Relkin N R Pappin D Ross P Williamson B Guertin

S Pribil P Lee K H 8-plex quantitation of changes in cerebrospinal fluid protein expression

in subjects undergoing intravenous immunoglobulin treatment for Alzheimers disease

Proteomics 2007 7 (20) 3651-60

163 Xiang F Ye H Chen R Fu Q Li L NN-dimethyl leucines as novel isobaric

tandem mass tags for quantitative proteomics and peptidomics Anal Chem 2010 82 (7) 2817-

25

164 Ong S E Blagoev B Kratchmarova I Kristensen D B Steen H Pandey A

Mann M Stable isotope labeling by amino acids in cell culture SILAC as a simple and

accurate approach to expression proteomics Mol Cell Proteomics 2002 1 (5) 376-86

67

165 Ong S E Kratchmarova I Mann M Properties of 13C-substituted arginine in stable

isotope labeling by amino acids in cell culture (SILAC) J Proteome Res 2003 2 (2) 173-81

166 Pimienta G Chaerkady R Pandey A SILAC for global phosphoproteomic analysis

Methods Mol Biol 2009 527 107-16 x

167 Zhang G Neubert T A Use of stable isotope labeling by amino acids in cell culture

(SILAC) for phosphotyrosine protein identification and quantitation Methods Mol Biol 2009

527 79-92 xi

168 Bondarenko P V Chelius D Shaler T A Identification and relative quantitation of

protein mixtures by enzymatic digestion followed by capillary reversed-phase liquid

chromatography-tandem mass spectrometry Anal Chem 2002 74 (18) 4741-9

169 Chelius D Bondarenko P V Quantitative profiling of proteins in complex mixtures

using liquid chromatography and mass spectrometry J Proteome Res 2002 1 (4) 317-23

170 Liu H Sadygov R G Yates J R 3rd A model for random sampling and estimation

of relative protein abundance in shotgun proteomics Anal Chem 2004 76 (14) 4193-201

171 Wang W Zhou H Lin H Roy S Shaler T A Hill L R Norton S Kumar P

Anderle M Becker C H Quantification of proteins and metabolites by mass spectrometry

without isotopic labeling or spiked standards Anal Chem 2003 75 (18) 4818-26

172 Silva J C Denny R Dorschel C A Gorenstein M Kass I J Li G Z McKenna

T Nold M J Richardson K Young P Geromanos S Quantitative proteomic analysis by

accurate mass retention time pairs Anal Chem 2005 77 (7) 2187-200

173 Zybailov B Coleman M K Florens L Washburn M P Correlation of relative

abundance ratios derived from peptide ion chromatograms and spectrum counting for

quantitative proteomic analysis using stable isotope labeling Anal Chem 2005 77 (19) 6218-

24

174 Filiou M D Martins-de-Souza D Guest P C Bahn S Turck C W To label or not

to label Applications of quantitative proteomics in neuroscience research Proteomics 2012

175 Neilson K A Ali N A Muralidharan S Mirzaei M Mariani M Assadourian G

Lee A van Sluyter S C Haynes P A Less label more free approaches in label-free

quantitative mass spectrometry Proteomics 2011 11 (4) 535-53

176 Zhu W Smith J W Huang C M Mass spectrometry-based label-free quantitative

proteomics J Biomed Biotechnol 2010 2010 840518

177 Xie F Liu T Qian W J Petyuk V A Smith R D Liquid chromatography-mass

spectrometry-based quantitative proteomics J Biol Chem 2012 286 (29) 25443-9

178 Shen Y Tolic N Xie F Zhao R Purvine S O Schepmoes A A Moore R J

Anderson G A Smith R D Effectiveness of CID HCD and ETD with FT MSMS for

degradomic-peptidomic analysis comparison of peptide identification methods J Proteome Res

2011 10 (9) 3929-43

179 Zubarev R A Kelleher N L McLafferty F W Electron Capture Dissociation of

Multiply Charged Protein Cations A Nonergodic Process Journal of the American Chemical

Society 1998 120 (13) 3265-3266

180 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and

protein sequence analysis by electron transfer dissociation mass spectrometry Proc Natl Acad

Sci U S A 2004 101 (26) 9528-33

181 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-

site identity and position on electron-transfer dissociation of polypeptide cations J Am Chem Soc

2007 129 (40) 12232-43

68

182 Swaney D L McAlister G C Coon J J Decision tree-driven tandem mass

spectrometry for shotgun proteomics Nat Methods 2008 5 (11) 959-64

183 Zhou W Ross M M Tessitore A Ornstein D Vanmeter A Liotta L A

Petricoin E F 3rd An initial characterization of the serum phosphoproteome J Proteome Res

2009 8 (12) 5523-31

184 Liu H Hakansson K Electron capture dissociation of tyrosine O-sulfated peptides

complexed with divalent metal cations Anal Chem 2006 78 (21) 7570-6

185 Snovida S I Bodnar E D Viner R Saba J Perreault H A simple cellulose

column procedure for selective enrichment of glycopeptides and characterization by nano LC

coupled with electron-transfer and high-energy collisional-dissociation tandem mass

spectrometry Carbohydr Res 2010 345 (6) 792-801

186 Sobott F Watt S J Smith J Edelmann M J Kramer H B Kessler B M

Comparison of CID versus ETD based MSMS fragmentation for the analysis of protein

ubiquitination J Am Soc Mass Spectrom 2009 20 (9) 1652-9

187 Eliuk S M Maltby D Panning B Burlingame A L High resolution electron

transfer dissociation studies of unfractionated intact histones from murine embryonic stem cells

using on-line capillary LC separation determination of abundant histone isoforms and post-

translational modifications Mol Cell Proteomics 2010 9 (5) 824-37

188 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 10 (9) 4219-29

189 Molina H Matthiesen R Kandasamy K Pandey A Comprehensive comparison of

collision induced dissociation and electron transfer dissociation Anal Chem 2008 80 (13)

4825-35

190 Olsen J V Macek B Lange O Makarov A Horning S Mann M Higher-energy

C-trap dissociation for peptide modification analysis Nat Methods 2007 4 (9) 709-12

191 McAlister G C Phanstiel D Wenger C D Lee M V Coon J J Analysis of

tandem mass spectra by FTMS for improved large-scale proteomics with superior protein

quantification Anal Chem 2010 82 (1) 316-22

192 Dayon L Pasquarello C Hoogland C Sanchez J C Scherl A Combining low-

and high-energy tandem mass spectra for optimized peptide quantification with isobaric tags J

Proteomics 2010 73 (4) 769-77

193 Second T P Blethrow J D Schwartz J C Merrihew G E MacCoss M J

Swaney D L Russell J D Coon J J Zabrouskov V Dual-pressure linear ion trap mass

spectrometer improving the analysis of complex protein mixtures Anal Chem 2009 81 (18)

7757-65

194 Jedrychowski M P Huttlin E L Haas W Sowa M E Rad R Gygi S P

Evaluation of HCD- and CID-type fragmentation within their respective detection platforms for

murine phosphoproteomics Mol Cell Proteomics 2011 10 (12) M111 009910

195 Frese C K Altelaar A F Hennrich M L Nolting D Zeller M Griep-Raming J

Heck A J Mohammed S Improved peptide identification by targeted fragmentation using

CID HCD and ETD on an LTQ-Orbitrap Velos J Proteome Res 2011 10 (5) 2377-88

196 Chakraborty A B Berger S J Gebler J C Use of an integrated MS--multiplexed

MSMS data acquisition strategy for high-coverage peptide mapping studies Rapid Commun

Mass Spectrom 2007 21 (5) 730-44

69

197 Ramos A A Yang H Rosen L E Yao X Tandem parallel fragmentation of

peptides for mass spectrometry Anal Chem 2006 78 (18) 6391-7

198 Barbara J E Castro-Perez J M High-resolution chromatographytime-of-flight MSE

with in silico data mining is an information-rich approach to reactive metabolite screening Rapid

Commun Mass Spectrom 2011 25 (20) 3029-40

199 Geromanos S J Vissers J P Silva J C Dorschel C A Li G Z Gorenstein M

V Bateman R H Langridge J I The detection correlation and comparison of peptide

precursor and product ions from data independent LC-MS with data dependant LC-MSMS

Proteomics 2009 9 (6) 1683-95

200 Koutroukides T A Guest P C Leweke F M Bailey D M Rahmoune H Bahn

S Martins-de-Souza D Characterization of the human serum depletome by label-free shotgun

proteomics J Sep Sci 34 (13) 1621-6

201 Martins-de-Souza D Guest P C Guest F L Bauder C Rahmoune H Pietsch S

Roeber S Kretzschmar H Mann D Baborie A Bahn S Characterization of the human

primary visual cortex and cerebellum proteomes using shotgun mass spectrometry-data-

independent analyses Proteomics 12 (3) 500-4

202 Scatena R Bottoni P Pontoglio A Giardina B Revisiting the Warburg effect in

cancer cells with proteomics The emergence of new approaches to diagnosis prognosis and

therapy Proteomics Clin Appl 4 (2) 143-58

203 Herberth M Koethe D Cheng T M Krzyszton N D Schoeffmann S Guest P

C Rahmoune H Harris L W Kranaster L Leweke F M Bahn S Impaired glycolytic

response in peripheral blood mononuclear cells of first-onset antipsychotic-naive schizophrenia

patients Mol Psychiatry 16 (8) 848-59

204 Krishnamurthy D Levin Y Harris L W Umrania Y Bahn S Guest P C

Analysis of the human pituitary proteome by data independent label-free liquid chromatography

tandem mass spectrometry Proteomics 11 (3) 495-500

205 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

206 Eng J K McCormack A L Yates Iii J R An approach to correlate tandem mass

spectral data of peptides with amino acid sequences in a protein database Journal of the

American Society for Mass Spectrometry 1994 5 (11) 976-989

207 Craig R Beavis R C TANDEM matching proteins with tandem mass spectra

Bioinformatics 2004 20 (9) 1466-7

208 Geer L Y Markey S P Kowalak J A Wagner L Xu M Maynard D M Yang

X Shi W Bryant S H Open mass spectrometry search algorithm J Proteome Res 2004 3

(5) 958-64

209 Zhang J Xin L Shan B Chen W Xie M Yuen D Zhang W Zhang Z Lajoie

G A Ma B PEAKS DB De Novo sequencing assisted database search for sensitive and

accurate peptide identification Mol Cell Proteomics 2011

210 Liu J Erassov A Halina P Canete M Nguyen D V Chung C Cagney G

Ignatchenko A Fong V Emili A Sequential interval motif search unrestricted database

surveys of global MSMS data sets for detection of putative post-translational modifications

Anal Chem 2008 80 (20) 7846-54

70

211 Beausoleil S A Villen J Gerber S A Rush J Gygi S P A probability-based

approach for high-throughput protein phosphorylation analysis and site localization Nat

Biotechnol 2006 24 (10) 1285-92

212 Han Y Ma B Zhang K SPIDER software for protein identification from sequence

tags with de novo sequencing error J Bioinform Comput Biol 2005 3 (3) 697-716

213 Singh S Springer M Steen J Kirschner M W Steen H FLEXIQuant a novel tool

for the absolute quantification of proteins and the simultaneous identification and quantification

of potentially modified peptides J Proteome Res 2009 8 (5) 2201-10

214 Searle B C Scaffold a bioinformatic tool for validating MSMS-based proteomic

studies Proteomics 10 (6) 1265-9

215 Herbst A McIlwain S Schmidt J J Aiken J M Page C D Li L Prion disease

diagnosis by proteomic profiling J Proteome Res 2009 8 (2) 1030-6

216 Menschaert G Vandekerckhove T T Baggerman G Schoofs L Luyten W Van

Criekinge W Peptidomics coming of age a review of contributions from a bioinformatics

angle J Proteome Res 2010 9 (5) 2051-61

217 Kumar C Mann M Bioinformatics analysis of mass spectrometry-based proteomics

data sets FEBS Lett 2009 583 (11) 1703-12

218 Craig R Cortens J P Beavis R C Open source system for analyzing validating and

storing protein identification data J Proteome Res 2004 3 (6) 1234-42

219 Jones P Cote R G Cho S Y Klie S Martens L Quinn A F Thorneycroft D

Hermjakob H PRIDE new developments and new datasets Nucleic Acids Res 2008 36

(Database issue) D878-83

220 Deutsch E W Lam H Aebersold R PeptideAtlas a resource for target selection for

emerging targeted proteomics workflows EMBO Rep 2008 9 (5) 429-34

221 Miliotis T Ali L Palm J E Lundqvist A J Ahnoff M Andersson T B

Hilgendorf C Development of a highly sensitive method using liquid chromatography-multiple

reaction monitoring to quantify membrane P-glycoprotein in biological matrices and relationship

to transport function Drug Metab Dispos 2011 39 (12) 2440-9

222 Xiang Y Koomen J M Evaluation of Direct Infusion-Multiple Reaction Monitoring

Mass Spectrometry for Quantification of Heat Shock Proteins Anal Chem 2012

223 Ossola R Schiess R Picotti P Rinner O Reiter L Aebersold R Biomarker

validation in blood specimens by selected reaction monitoring mass spectrometry of N-

glycosites Methods Mol Biol 2011 728 179-94

224 Liu H Lam L Dasgupta P K Expanding the linear dynamic range for multiple

reaction monitoring in quantitative liquid chromatography-tandem mass spectrometry utilizing

natural isotopologue transitions Talanta 2011 87 307-10

225 Belov M E Prasad S Prior D C Danielson W F 3rd Weitz K Ibrahim Y M

Smith R D Pulsed multiple reaction monitoring approach to enhancing sensitivity of a tandem

quadrupole mass spectrometer Anal Chem 2011 83 (6) 2162-71

226 Prentice R L Paczesny S Aragaki A Amon L M Chen L Pitteri S J

McIntosh M Wang P Buson Busald T Hsia J Jackson R D Rossouw J E Manson J

E Johnson K Eaton C Hanash S M Novel proteins associated with risk for coronary heart

disease or stroke among postmenopausal women identified by in-depth plasma proteome

profiling Genome Med 2 (7) 48

227 Gelman J S Fricker L D Hemopressin and other bioactive peptides from cytosolic

proteins are these non-classical neuropeptides AAPS J 2010 12 (3) 279-89

71

228 Gelman J S Sironi J Castro L M Ferro E S Fricker L D Hemopressins and

other hemoglobin-derived peptides in mouse brain comparison between brain blood and heart

peptidome and regulation in Cpefatfat mice J Neurochem 2010 113 (4) 871-80

229 Li L Garden R W Sweedler J V Single-cell MALDI a new tool for direct peptide

profiling Trends Biotechnol 2000 18 (4) 151-60

230 Altelaar A M Heck A J Trends in ultrasensitive proteomics Curr Opin Chem Biol

231 Bandura D R Baranov V I Ornatsky O I Antonov A Kinach R Lou X

Pavlov S Vorobiev S Dick J E Tanner S D Mass cytometry technique for real time

single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass

spectrometry Anal Chem 2009 81 (16) 6813-22

232 Trimpin S Inutan E D Herath T N McEwen C N Laserspray ionization a new

atmospheric pressure MALDI method for producing highly charged gas-phase ions of peptides

and proteins directly from solid solutions Mol Cell Proteomics 2010 9 (2) 362-7

233 McEwen C N Larsen B S Trimpin S Laserspray ionization on a commercial

atmospheric pressure-MALDI mass spectrometer ion source selecting singly or multiply

charged ions Anal Chem 2010 82 (12) 4998-5001

234 Wang B Lietz C B Inutan E D Leach S M Trimpin S Producing highly

charged ions without solvent using laserspray ionization a total solvent-free analysis approach at

atmospheric pressure Anal Chem 2011 83 (11) 4076-84

235 Inutan E D Richards A L Wager-Miller J Mackie K McEwen C N Trimpin

S Laserspray ionization a new method for protein analysis directly from tissue at atmospheric

pressure with ultrahigh mass resolution and electron transfer dissociation Mol Cell Proteomics

2010 10 (2) M110 000760

236 Wang H Liu J Cooks R G Ouyang Z Paper spray for direct analysis of complex

mixtures using mass spectrometry Angew Chem Int Ed Engl 49 (5) 877-80

237 Zhang Z Xu W Manicke N E Cooks R G Ouyang Z Silica coated paper

substrate for paper-spray analysis of therapeutic drugs in dried blood spots Anal Chem 84 (2)

931-8

238 Wang H Manicke N E Yang Q Zheng L Shi R Cooks R G Ouyang Z

Direct analysis of biological tissue by paper spray mass spectrometry Anal Chem 83 (4) 1197-

201

239 Liu J Wang H Cooks R G Ouyang Z Leaf spray direct chemical analysis of plant

material and living plants by mass spectrometry Anal Chem 83 (20) 7608-13

240 Liu J Wang H Manicke N E Lin J M Cooks R G Ouyang Z Development

characterization and application of paper spray ionization Anal Chem 82 (6) 2463-71

241 Yoo H J Wang N Zhuang S Song H Hakansson K Negative-ion electron

capture dissociation radical-driven fragmentation of charge-increased gaseous peptide anions J

Am Chem Soc 2011 133 (42) 16790-3

242 Tian Q Price N D Hood L Systems cancer medicine towards realization of

predictive preventive personalized and participatory (P4) medicine J Intern Med 271 (2) 111-

21

243 Hood L Friend S H Predictive personalized preventive participatory (P4) cancer

medicine Nat Rev Clin Oncol 2011 8 (3) 184-7

244 Tian Q Price N D Hood L Systems cancer medicine towards realization of

predictive preventive personalized and participatory (P4) medicine J Intern Med 2012 271

(2) 111-21

72

245 Belda-Iniesta C de Castro J Perona R Translational proteomics what can you do for

true patients J Proteome Res 2011 10 (1) 101-4

246 Schiess R Wollscheid B Aebersold R Targeted proteomic strategy for clinical

biomarker discovery Mol Oncol 2009 3 (1) 33-44

247 Hsu J L Huang S Y Chow N H Chen S H Stable-isotope dimethyl labeling for

quantitative proteomics Anal Chem 2003 75 (24) 6843-52

248 Schmidt A Kellermann J Lottspeich F A novel strategy for quantitative proteomics

using isotope-coded protein labels Proteomics 2005 5 (1) 4-15

249 Wang Y K Ma Z Quinn D F Fu E W Inverse 15

N-metabolic labelingmass

spectrometry for comparative proteomics and rapid identification of protein markerstargets

Rapid Commun Mass Spectrom 2002 16 (14) 1389-97

73

Chapter 3

Protein changes in immunodepleted cerebrospinal fluid from transgenic

mouse models of Alexander disease detected using mass spectrometry

Adapted from ldquoProtein changes in immunodepleted cerebrospinal fluid from transgenic mouse

models of Alexander disease detected using mass spectrometryrdquo Cunningham R Jany P

Messing A Li L Submitted

74

ABSTRACT

Cerebrospinal fluid (CSF) is a low protein content biological fluid with dynamic range

spanning at least nine orders of magnitude in protein content and is in direct contact with the

brain A modified IgY-14 immunodepletion treatment was performed to enhance analysis of the

low volumes of CSF that are obtainable from mice As a model system in which to test this

approach we utilized transgenic mice that over-express the intermediate filament glial fibrillary

acidic protein (GFAP) These mice are models for Alexander disease (AxD) a severe

leukodystrophy in humans From the CSF of control and transgenic mice we report the

identification of 266 proteins with relative quantification of 106 proteins Biological and

technical triplicates were performed to address animal variability as well as reproducibility in

mass spectrometric analysis Relative quantitation was performed using distributive normalized

spectral abundance factor (dNSAF) spectral counting analysis A panel of biomarker proteins

with significant changes in the CSF of GFAP transgenic mice has been identified with validation

from ELISA and microarray data demonstrating the utility of our methodology and providing

interesting targets for future investigations on the molecular and pathological aspects of AxD

75

INTRODUCTION

Alexander disease (AxD) is a fatal neurogenetic disorder caused by heterozygous point

mutations in the coding region of GFAP (encoding glial fibrillary acidic protein)1 The hallmark

diagnostic feature of this disease is the accumulation of astrocytic cytoplasmic inclusions known

as Rosenthal fibers containing GFAP Hsp27 αB-crystallin and other components2-5

Although

several potential treatment strategies6-8

are under investigation clinical trial design is hampered

by the absence of a standardized clinical scoring system or means to quantify lesions in MRI

that could serve to monitor severity and progression of disease One solution to this problem

would be the identification of biomarkers in readily sampled body fluids as indirect indicators of

disease

Cerebrospinal fluid (CSF) has a long history as a surrogate biopsy site for brain or spinal

cord in evaluating diseases of the central nervous system The protein composition of CSF is

well defined at least for the most abundant species of proteins and numerous studies exist that

characterize individual biomarkers or complex patterns of biomarkers in various diseases9 10

GFAP itself is present in CSF (albeit at much lower levels than in brain parenchyma) and in one

study of three Alexander disease patients its levels were markedly increased11

Whether an

increase in CSF GFAP will be a consistent finding in Alexander disease or whether other useful

biomarkers for this disease could be identified through an unbiased analysis of the CSF

proteome is not yet known

The rarity of Alexander disease makes analysis of human samples difficult However

mouse models exist that replicate key features of the disease such as formation of Rosenthal

fibers Unfortunately mouse CSF poses particular problems for proteomic studies and there is

76

an urgent need for technical improvements for dealing with this fluid For instance collection

from an adult mouse typically yields ~10 μL of CSF often with some contamination by blood12

To further complicate analysis CSF has an exceedingly low protein content (~04 μgμL) with

over 60 of the total protein content consisting of a single protein albumin13 14

A number of

techniques have been developed to remove albumin from biological samples including Cibacron

Blue15

IgG immunodepletion16

and IgY immunodepletion17-19

IgY which is avian in origin

offers reduced non-specific binding and increased avidity when compared to IgG antibodies from

rabbits goats and mice20-23

One widely used IgY cocktail is IgY-14 which contains fourteen

specific antibodies specific for albumin IgG transferrin fibrinogen α1-antitrypsin IgA IgM

α2-macroglobulin haptoglobin apolipoproteins A-I A-II and B complement C3 and α1-acid

glycoprotein Since existing protocols for IgY-14 depletion are optimized for use with large

volumes of serum new protocols must be developed to permit its use with the low volumes of a

low protein fluid represented by mouse CSF

Various improvements have also taken place in the field of proteomic analysis that could

facilitate analysis of mouse CSF Data dependent tandem mass spectrometry followed by

quantification of proteins is used in standard shotgun proteomics24-29

Several methods now exist

for introducing quantitation into mass spectrometry including stable isotope labeling30-32

isobaric tandem mass tags33 34

and spectral counting35 36

Spectral counting which is a

frequency measurement that uses MSMS counts of identified peptides as the metric to enable

protein quantitation is attractive because it is label-free and requires no additional sample

preparation Finally recent advances in spectral counting has produced a data refinement

strategy termed normalized spectral abundance factor (NSAF)37 38

and further developed into

distributive NSAF (dNSAF) to address issues with peptides shared by multiple proteins39

77

To identify potential biomarkers in AxD we report a novel scaled-down version of IgY

antibody depletion strategy to reduce the complexity and remove high abundance proteins in

mouse CSF samples The generated spectral counts data were then subjected to dNSAF natural

log data transformation and t-test analysis to determine which proteins differ in abundance when

comparing GFAP transgenics and controls with multiple biological and technical replicates

EXPERIMENTAL DETAILS

Chemicals

Acetonitrile (HPLC grade) urea (gt99) sodium chloride (997) and ammonium

bicarbonate (certified) were purchased from Fisher Scientific (Fair Lawn NJ) Deionized water

(182 MΩcm) was prepared with a Milli-Q Millipore system (Billerica MA) Optima LCMS

grade acetonitrile and water were purchased from Fisher Scientific (Fair Lawn NJ) DL-

Dithiothreitol (DTT) and sequencing grade modified trypsin were purchased from Promega

(Madison WI) Formic acid (ge98) was obtained from Fluka (Buchs Switzerland)

Iodoacetamide (IAA) tris(hydroxymethyl)aminomethane (Tris ge999 ) ammonium formate

(ge99995) glycine (ge985) and IgY-14 antibodies were purchased from Sigma-Aldrich

(Saint Louis MO)

Mice

Transgenic mice over-expressing the human GFAP gene (Tg737 line) were maintained

as hemizygotes in the FVBN background and genotyped via PCR on DNA prepared from tail

samples as described previously40

The mice were housed on a 14-10 light-dark cycle with ad

libitum access to food and water All procedures were conducted using protocols approved by

the UW-Madison IACUC

78

CSF collection

CSF was collected from mice as described previously12

Briefly mice were anesthetized

with avertin (400-600 mgkg ip) A midline sagittal incision was made over the dorsal aspect

of the hindbrain and three muscle layers carefully peeled back to expose the cisterna magna The

membrane covering the cisterna magna was pierced with a 30 gauge needle and CSF was

collected immediately using a flexible plastic pipette Approximately ten microliters of CSF was

collected per animal All samples used for MS analysis showed no visible contamination of

blood

Enzyme-linked immunosorbent assay (ELISA)

A sandwich ELISA was used to quantify GFAP8 Briefly a microtiter plate was coated

with a cocktail of monoclonal antibodies (Covance SMI-26R) Plates were blocked with 5

milk before addition of sample or standards diluted in PBS with Triton-X and BSA A rabbit

polyclonal antibody (DAKO Z334) was used to detect the GFAP followed by a peroxidase

conjugated anti-rabbit IgG antibody (Sigma A6154) secondary antibody The peroxidase activity

was detected with SuperSignal ELISA Pico Chemiluminescent Substrate (PIERCE) and

quantified with a GloRunner Microplate Luminometer Values below the biological limit of

detection (16ngL) were given the value 16ngL before multiplying by the dilution factor

Immunodepletion of abundant proteins

Currently there are no commercial immunodepletion products available for use with CSF

and to address the low protein content of this fluid (~04 microgmicroL) Therefore 100 microL of

purchased IgY-14 resin was coupled with a Pierce Spin Cup using a paper filter from Thermo

Scientific (Waltham MA) CSF samples from either transgenics or controls were first pooled to

100 microL and then added to 100 microL of 20 mM Tris-HCl 300 mM pH 74 (2x dilution buffer) and

79

allowed to mix with the custom IgY-14 depletion spin column on an end-over-end rotor for 30

minutes at 4oC The sample was then centrifuged at 04 rcf for 45 seconds with an Eppendorf

Centrifuge 5415D (Hamburg Germany) The antibodies were then washed with 50 microL 1x

dilution buffer vortexed for 5 minutes centrifuged at 04 rcf for 45 seconds and the flow through

was collected for tryptic digestion The antibodies were then stripped of the bound proteins with

four 025 mL washes of 01 M glycine pH 25 neutralized with two 025 mL washes of 01 M

Tris-HCl pH 80 and washed once with 025mL of 1x dilution buffer The immunodepletion

protocol was repeated for each transgenic or control pooled 100 microL sample (N=3)

Preparation of tryptic digests

The immunodepleted pooled mouse CSF samples (200 microL total volume) were

concentrated to 10 microL using a Thermo Scientific Savant SpeedVac (SVC100 Waltham MA)

To each sample 8 microL of 133 M urea and 1 microL of 050 M DTT were added and allowed to

incubate at 37oC for one hour After incubation 27 microL of 055 M IAA was added for

carboxymethylation and the sample was allowed to incubate for 15 minutes in the dark To

quench the IAA 1 microL of 050 M DTT was added and allowed to react for 10 minutes To

perform trypsin digestion 70 microL of 50 mM NH4HCO3 was then added along with 025 microg

trypsin which had previously been dissolved in 50 mM acetic acid at a concentration of 05

microgmicroL Digestion was performed overnight at 37oC and quenched by addition of 25 microL of 10

formic acid The tryptic peptides were then subjected to solid phase extraction using a Varian

Omix Tip C18 100 microL (Palo Alto CA) Peptides were eluted with 50 ACN in 01 formic

acid concentrated and reconstituted in 30 microL H2O in 01 formic acid

RP nanoLC separation

80

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN Injections consisted of 5 microL of prepared CSF tryptic peptides onto an Agilent

Technologies Zorbax 300 SB-C18 5 microm 5x03 mm trap cartridge (Santa Clara CA) at a flow

rate of 5 microLmin for 5 minutes at 95 A 5 B Separation was performed on a Waters 3 microm

Atlantis dC18 75 microm x 150 mm (Milford MA) using a gradient from 5 to 45 mobile phase B

at 250 nLmin over 90 minutes at room temperature Emitter tips were pulled from 75 microm inner

diameter 360 microm outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using

an in-house model P-2000 laser puller (Sutter Instrument Co Novato CA)

Mass spectrometry data acquisitions

An ion trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with an on-line nanospray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Eksigent nanoLC software (Dublin CA Version 30 Beta Build 080715) to MS acquisition for

all experiments Smart parameter setting (SPS) was set to 700 mz compound stability and trap

drive level was set at 100 Optimization of the nanospray source resulted in dry gas

temperature of 125oC dry gas flow of 40 Lmin capillary voltage of -1300 V end plate offset of

-500 V MSMS fragmentation amplitude of 10V and SmartFragmentation set at 30-300

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms one spectral average rolling average 2 acquisition

81

range of 300-1500 mz and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS

generation the ICC target was set to 300000 maximum accumulation time 5000 ms two

spectral averages acquisition range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required for spectral counting to prevent

loss of spectra Identification of peptides were performed using Mascot41

(Version 23 Matrix

Science London UK) Database searching was performed against SwissProt mus musculus

(house mouse) database (version 575) False positive analyses42

were calculated using an

automatic decoy option of Mascot Results from the Mascot results were reported using

Proteinscape 21 and technical replicates were combined and reported as a protein compilation

using ProteinExtractor (Bruker Daltonics Bremen Germany)

Mascot search parameters were as follows Allowed missed cleavages 2 enzyme

trypsin variable modifications carboxymethylation (C) and oxidation (M) peptide tolerance

plusmn12 Da maximum number of 13

C 1 MSMS tolerance plusmn05 Da instrument type ESI-Trap

Reported proteins required a minimum of one peptide with a Mascot score ge300 with bold red

characterization Spectral counts were determined from the number of MSMS spectra identified

from accepted proteins A bold red peptide combines a bold peptide which represents the first

query result from a submitted MSMS spectrum with the red peptide which indicates the top

peptide for the identified protein Requiring one bold red peptide assists in removal of

homologous redundant proteins and further improves protein results In addition requiring one

82

peptide to be identified by a score gt300 removes the ability for proteins to be identified by

multiple low Mascot scoring peptides

Each immunodepleted biological replicate had technical triplicates performed and the

technical triplicates were summed together by ProteinExtractor Peptide spectral counts were

then summed for each protein and subjected to dNSAF analysis Details for this method can be

found elsewhere37 39

but briefly peptide spectral counts are summed per protein (SpC) based on

unique peptides and a weighted distribution of any shared peptides with homologous proteins

ProteinScape removed 83 homologous proteins found in the current study to bring the total

number of proteins identified to 266 but some non-unique homologous peptides which are

shared by multiple proteins are still present in the resulting 266 remaining proteins To address

these non-unique homologous peptides distributive spectral counting was performed as

described elsewhere39

The dSpC is divided by the proteinrsquos length (L) and then divided by the

summation of the dSpCL from all proteins in the experiment (N) to produce each proteinrsquos

specific dNSAF value

N

i

i

kk

LdSpC

LdSpCdNSAF

1

)(

)()(

The resulting data were then transformed by taking the natural log of the dNSAF value The

means for each protein were calculated using the ln(dNSAF) from N=3 biological replicates and

the whole data set was subjected to the Shapiro-Wilk test for normalityGaussian distribution

performed on the software PAST (Version 198 University of Oslo Norway Osla) The

Shapiro-Wilk test was repeated multiple times to determine a non-zero value for zero spectral

83

counts A non-zero value is required to alleviate the errors of dividing by zero which was

experimentally determined to be 043 The Gaussian data were then subjected to the t-test to

identify statistically significant changes in protein expression

RESULTS AND DISCUSSION

General workflow

Individual CSF samples were manually inspected and samples were only selected that

showed no visual blood contamination Preliminary experiments showed that the maximum

degree of blood contamination estimated from counts of red blood cells in the CSF that was not

visible to the eye was less than 005 by total mass Approximately 10 individual mouse CSF

samples were pooled to achieve the desired 100 μL volume for a single biological replicate The

CSF samples were IgY immunodepleted followed by digestion with trypsin The resulting

digested samples were then de-salted concentrated reconstituted in 30 μL of 01 formic acid

and 5 μL was loaded onto a reversed phase nanoflow HPLC column subjected to a 90 minute

gradient and infused into the amaZon ETD ion trap mass spectrometer The workflow for

mouse CSF analysis is shown in Figure 1 The remainder of the desalted sample was saved for

technical replicates

Immunodepletion for CSF

Currently there are no immunodepletion techniques specifically designed for CSF

Nonetheless the protein profiles between CSF and serum are similar enough to use currently

available immunodepletion techniques designed for serum as a starting point The smallest

commercially available IgY-14 immunodepletion kit is for 15 μL of serum which is equivalent in

protein content to ~15 mL of CSF Therefore for 100 μL CSF one fifteenth of the total IgY-14

84

beads would be used for the equivalent immunodepletion which is 66 μL of proprietary bead

slurry The potential for irreversible binding of abundant proteins to their respective IgY

antibody even after an extra stripping wash and low amounts of total beads made using 66 μL

of IgY-14 bead slurry unrealistic In practice almost doubling the amount of IgY-14 beads (100

μL) used was not sufficient and resulted in albumin and transferrin peptides to be observed in

high abundance (data not shown) The most important protein to immunodeplete is albumin and

it has been reported to be a greater percentage of total CSF protein content (~60) than serum

(~49) in humans14

The difference in albumin percentage supports the results that proprietary

blends of immunodepletion beads for high abundance proteins such as albumin cannot be

scaled down on a strict protein scale and further modifications to the serum immunodepletion

protocol need to be made

Since IgY-14 beads were developed for use with serum all of its protocols need to be

taken into account to modify the protocol for CSF Serum samples should be diluted fifty times

before mixing with the IgY-14 beads but CSF protein concentration is at least one hundred times

lower than serum Therefore CSF is below half the recommended diluted protein concentration

for IgY immunodepletion Consequently multiple steps have been devised to address this

limitation First the binding time between the proteins targeted for removal from the CSF and

IgY-14 beads was doubled during the end-over-end rotor depletion step from the recommended

15 minutes to 30 minutes Second to prevent non-specific protein binding to the antibodies the

CSF samples were diluted with an equal amount (100 μL) of 2X dilution buffer The dilution

buffer of NaCl and Tris-HCl was needed to assist in reducing non-specific binding of proteins to

the 14 antibodies and ensuring the sample is held at physiological pH In addition to these

modifications a doubling of the starting IgY-14 bead slurry to 200 μL provided the desired

85

results Overall this modified protocol results in effective depletion of CSF abundant proteins

using only one-fifth of the antibodies provided by the smallest commercially available platform

Data Analysis

Spectral counting technique for relative quantitation provides numerous benefits for the

study of mouse CSF due to its label-free advantage In contrast isotopic labeling method often

involves additional sample processing that could cause sample loss which is highly undesirable

for low protein content and low volume samples Labeling methods also require a mixing of two

sets of isotopically labeled samples which would effectively increase the sample complexity and

reduce the amount of sample that can be loaded onto the nanoLC column by half In addition

more than two sets of samples can be compared by label-free methods The use of label-free

spectral counting method does not lead to an increase in sample complexity or interference in

quantitation from peptides in the mz window selected for tandem MS Using spectral counting

for relative quantitation however is dependent on reproducible HPLC separation and careful

mass spectrometry operation to minimize technical variability during the study To address

concerns of analytical reliability and run to run deviations base peak chromatograms from two

transgenic IgY-14 immunodepleted biological replicates including two technical replicates of

each were shown to be highly reproducible (Figure 2)

Each biological sample was analyzed in triplicate with the same protocols on the amaZon

ETD with three control and three transgenic samples From the three technical replicates for

each biological replicate the spectral counts of the peptides for the proteins identified were

summed The results from these mouse CSF biological triplicates are shown in Figure 3A for

GFAP overexpressor and Figure 3B for control The summation of spectral counts for each

biological replicate was performed to remove the inherent bias related to data dependent analysis

86

for protein identification One concern in grouping technical replicates is a potential loss of

information regarding analytical variability Figure 4 provides a graphical representation of

variability of technical replicates illustrating the standard deviation of technical replicates with

error bars Figure 4 shows that a statistically changed protein creatine kinase M (A) and an

unchanged protein kininogen-1 (B) have similar variability within (technical replicates) and

between samples (biological replicates) for each protein In addition Figure 4B illustrates that

even with the variability of kininogen-1 the resulting mean shown by the dashed line of control

and transgenic samples were almost equal whereas Figure 4A shows significantly different

expression level of creatine kinase M Performing replicate analysis of each biological sample

(n=3) helps correct for systematic errors deriving from sample handling and pooling of samples

helps reduce random error during the CSF sample collection process

Protein Identification and Spectral Counting Analysis

The data for dNSAF analysis like any mass spectrometry proteomics experiment

requires multiple layers of verification to ensure reliable data Our initial protein identifications

were subjected to a database search using a decoy database from Mascot which resulted in an

average false positive rate below 1 for all the experimental data collected Representative

MSMS spectra between control and GFAP transgenic samples are illustrated in Figure 5

Overall 266 proteins were identified in a combination of control and transgenic samples

(Supplemental Table 1) A total of 349 proteins were initially identified but 83 proteins were

isoforms of previously identified proteins and automatically excluded by ProteinExtractor The

next level of quality control was to only include ln(dNSAF) values from proteins identified by 2

or more unique peptides having a Mascot score of ge300 and observed in two out of three

biological replicates These selection parameters resulted in 106 proteins remaining after

87

dNSAF analysis (Supplemental Table 2) The resulting spectral counts were then subjected to

dSpC in order to account and correct for the systematic error of peptides shared by multiple

proteins (Supplemental Table 3)

It is inevitable in large scale and complex proteomics experiments that some proteins will

be seen in some samples and not others In addition when controls were compared to transgenic

samples as shown in Figure 3C 127 proteins were unique to either control or GFAP transgenic

mice and 139 proteins were seen in both samples From dNSAF equation if the spectral count

is zero the numerator is zero and the value will not be normalized between runs In order to

circumvent the zero spectral counts in dNSAF analysis zero spectral counts must be replaced by

an experimentally determined non-zero value determined to be 043 The 043 spectral counts

for zero spectral counts was calculated by serial Shapiro-Wilk tests to determine the lowest value

(043) for zero spectral counts and maintain a Gaussian distribution for all data sets The 043

value for zero spectral counts in the current study was higher than the 016 reported value for

zero spectral counts in the original NSAF spectral counting study37

Our study may have a

higher zero spectral count value than the previous study because the spectral counting data were

an addition of three technical replicates and three times 016 is close to 043 The normalized

Gaussian data were then subjected to t-test analysis and P-values below 005 are considered as

statistically significant and are presented in Table 1 The proteins with significant up or down

regulation from Table 1 can be further evaluated as how close significant proteins were to a p-

value of 005 such as ganglioside GM2 activator serine protease inhibitor A3N and collagen

alpha-2(I) chain having t-test scores of 0045 0047 and 0043 respectively Proteins exhibiting

a P-value close to 005 were more likely to be highly variable proteins or have smaller fold

changes between control and transgenic samples and thus provide less biological relevancy to

88

future studies The modest change in antithrobin-III which is reduced by 14-fold in transgenic

is included due a low pooled standard deviation in spectral counts

Spectral counting has been analyzed with fold changes derived directly from the average

spectral counts from the technical replicates and then the average of the three biological

replicates We decided to perform additional analysis using fold changes to dig deeper into

proteins that statistically support the null hypothesis from the dNSAF analysis To only pull out

highly confident protein identifications we used the same strict cut-off of two unique peptides

identified per protein as in dNSAF analysis We only accepted proteins with greater than three-

fold change in total spectral counts listed in Table 2 Two proteins contactin-1 (cntn1) and

cathepsin B (CB) illustrate the potential biomarkers missed by dNSAF analysis Cntn1 had zero

spectral count in the transgenic sample and had an average spectral count of 41 in control

samples The lack of any spectral counts in one biological control for cntn1 resulted in a large

standard deviation in the ln(dNSAF) means for the control which resulted in the t-test supporting

the null hypothesis Another example is CB which was detected by numerous spectral counts in

every GFAP transgenic biological replicate with an average spectral count of 97 (Table 2) The

presence of CB in one biological control sample (23 average spectral counts) resulted in a high

standard deviation in the mean of the control samples These examples exhibit a limitation of

dNSAF analysis which could cause a loss of potentially useful information

Previously Identified Proteins with Expression Changes

Previously three proteins have been described as increased in CSF from individual(s)

suffering from AxD In one study a single patientrsquos CSF was found to have elevated levels of

αβ-crystallin and HSP2744

In a second study three patients were reported to have elevated

levels of GFAP with concentrations from 4760 ngL to 30000 ngL (compared to lt175 ngL for

89

controls)11

GFAP was detected in our current study however the other two proteins were not

detected One possibility is that αB-crystallin and Hsp-27 levels in mouse CSF are too low for

detection by MS analysis In addition while the transgenic mice display the hallmark

pathological feature of AxD in the form of Rosenthal fibers they do not have any evident

leukodystrophy and thus may not display the full range of changes in CSF as might be found in

human patients

Creatine Kinase M

Creatine kinase M (M-CK) is a 43 kDa protein whose role is to reversibly catalyze

phosphate transfer between ATP and energy storage compounds M-CK has been primarily

found in muscle tissue and for humans creatine kinase M is diagnostically analyzed in the blood

for myocardial infarction (heart attack) In addition M-CK is also found in Purkinje neurons of

the cerebellum45 46

A related protein creatine kinase B (B-CK) also exhibited an apparent 21

fold increase in transgenic CSF over control but this difference was not statistically different

B-CK concentration is known to be elevated in CSF following head trauma47

or cerebral

infarction48

but decreased in astrocytes in individuals affected by multiple sclerosis49

Cathepsin

The data showed multiple cathepsins were up regulated in the CSF of transgenic mice

when compared to control mice The up regulated cathepsins were S L1 and B isoforms which

are all cysteine proteases Cathepsin S (CS) was never observed in control samples but

observed with an average of 73 spectral counts per analysis Cathepsin L1 (CL1) was up

regulated by 94 times in transgenic mice These two cathepsins exhibited significant changes

using dNSAF and t-test statistics as shown in Table 1 and Cathepsin B (CB) showed a 42 fold

increase in transgenic CSF as shown in Table 2

90

Cathepsins regulate apoptosis in cells50

which is the major mechanism for elimination of

cells deemed by the organism to be dangerous damaged or expendable CL and CB are

redundant in the system as thiol proteases and inhibition of CB by CA-047 causes a diminished

apoptosis response in multiple cell lines51

Intriguingly increased levels of CB or CL are

correlated with poor prognosis for cancer patients and shorter disease-free intervals It is

believed that these proteases degrade the extracellular membrane which allows tumor cells to

invade adjacent tissue and metastasize52

With regards to AxD the up regulation of these

cathepsins may be indicative of the bodyrsquos natural defense and response to Rosenthal fibers

Thus stimulation of these cathepsins may provide a further protective stress response but the

positive correlation between these proteases and cancer highlights the multiple roles of these

proteins in pathological response Alternatively it has been shown that increased CB is involved

with the tumor necrosis factor α (TNFα) induced apoptosis cascade53

The activation of the

TNFα could produce oligodendrocyte toxicity54

with the expression of TNFα being elevated in

tissue samples from mouse models and AxD patients55

The potential for a positive or a negative

effect in increasing or decreasing cathepsins warrant future research with cathepsins and AxD

Contactin-1

Contactin-1 (Cntn1) is a 133 kDa cell surface protein that is highly glycosylated and

belongs to a family of immunoglobulin domain-containing cell adhesion molecues56

Table 2

shows that Cntn1 is down regulated in transgenic mice since spectral counts were only observed

in control mice Cntn1 is expressed in neurons and oligodendrocytes and higher levels were

observed during brain development57

In addition Cntn1 leads to activation of Notch1 which

mediates differentiation and maturation of oligodendrocyte precursor cells (OPC) Although the

mechanism leading to a reduction of Cntn1 in CSF is not known it may reflect a change in

91

astrocyte interactions with either neurons or oligodendrocytes to alter their expression of this

protein

Validation of putative biomarkers and MS proteomics data using ELISA and RNA

microarray data

To further validate the relative protein expression data obtained via MS-based spectral

counting techniques orthogonal immunological and molecular biological approaches have been

examined As GFAP was shown to be elevated in the CSF of transgenic animals we used a

well-defined GFAP ELISA protocol to test its CSF concentration CSF from 8 week old male

mice was collected from both transgenic and control animals Five samples of transgenic CSF

was prepared by pooling four to eight animals for the GFAP ELISA In the case of controls

each sample represents a single animal GFAP concentrations observed by both the MS and

ELISA showed significant increases of GFAP in the CSF when comparing transgenic to control

animals

Another validation of MS spectral counts is observed in a microarray analysis performed

on transgenic mouse olfactory bulb tissue 55

In this paper nine of the proteins found by MS

showed similar gene expression in the microarray (Table 3) It is not surprising that all the genes

observed in the microarray are not the same as the proteins observed by MS analysis Gene

expression and protein synthesis and expression are not always correlated but the similarities

and overlapping trends observed with these two assays are encouraging As shown in Table 3

gene expression analysis also revealed the up regulation of several cathepsin proteins GFAP

and down regulation of contactin-1 in CSF collected from transgenic mice corroborating the

MS-based proteomics results

92

CONCLUSIONS

In this study we have produced a panel of proteins with significant up or down regulation

in the CSF of transgenic GFAP overexpressor mice This mouse model for AxD was consistent

with the previous studies showing elevation of GFAP in CSF The development of a modified

IgY-14 immunodepletion technique for low amounts of CSF was presented This improved

protocol is useful for future investigations to deal with the unique challenges of mouse CSF

analysis Modified proteomics protocols were employed to profile mouse CSF with biological

and technical triplicates addressing the variability and providing quantitation with dNSAF

spectral counting Validation of the MS-based proteomics data were performed using both

ELISA and RNA microarray data to provide further confidence in the changes in the putative

protein biomarkers This study presents three classes of interesting targets for future study in

AxD including CK-MCK-B cathepsin S L1 and B isoforms and contactin-1

93

REFERENCES

1 Brenner M Johnson A B Boespflug-Tanguy O Rodriguez D Goldman J E

Messing A Mutations in GFAP encoding glial fibrillary acidic protein are associated with

Alexander disease Nat Genet 2001 27 (1) 117-20

2 Herndon R M Rubinstein L J Freeman J M Mathieson G Light and electron

microscopic observations on Rosenthal fibers in Alexanders disease and in multiple sclerosis J

Neuropathol Exp Neurol 1970 29 (4) 524-51

3 Alexander W S Progressive fibrinoid degeneration of fibrillary astrocytes associated

with mental retardation in a hydrocephalic infant Brain 1949 72 (3) 373-81 3 pl

4 Iwaki T Kume-Iwaki A Liem R K Goldman J E Alpha B-crystallin is expressed

in non-lenticular tissues and accumulates in Alexanders disease brain Cell 1989 57 (1) 71-8

5 Head M W Goldman J E Small heat shock proteins the cytoskeleton and inclusion

body formation Neuropathol Appl Neurobiol 2000 26 (4) 304-12

6 Messing A Daniels C M Hagemann T L Strategies for treatment in alexander

disease Neurotherapeutics 7 (4) 507-15

7 Tang G Yue Z Talloczy Z Hagemann T Cho W Messing A Sulzer D L

Goldman J E Autophagy induced by Alexander disease-mutant GFAP accumulation is

regulated by p38MAPK and mTOR signaling pathways Hum Mol Genet 2008 17 (11) 1540-

55

8 Hagemann T L Boelens W C Wawrousek E F Messing A Suppression of GFAP

toxicity by alphaB-crystallin in mouse models of Alexander disease Hum Mol Genet 2009 18

(7) 1190-9

9 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C

Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome

Res 2008 7 (1) 386-99

10 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from

patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma

biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 878 (22) 2003-12

11 Kyllerman M Rosengren L Wiklund L M Holmberg E Increased levels of GFAP

in the cerebrospinal fluid in three subtypes of genetically confirmed Alexander disease

Neuropediatrics 2005 36 (5) 319-23

12 DeMattos R B Bales K R Parsadanian M ODell M A Foss E M Paul S M

Holtzman D M Plaque-associated disruption of CSF and plasma amyloid-beta (Abeta)

equilibrium in a mouse model of Alzheimers disease J Neurochem 2002 81 (2) 229-36

13 Wong M Schlaggar B L Buller R S Storch G A Landt M Cerebrospinal fluid

protein concentration in pediatric patients defining clinically relevant reference values Arch

Pediatr Adolesc Med 2000 154 (8) 827-31

14 Roche S Gabelle A Lehmann S Clinical proteomics of the cerebrospinal fluid

Towards the discovery of new biomarkers PROTEOMICS ndash Clinical Applications 2008 2 (3)

428-436

15 Li C Lee K H Affinity depletion of albumin from human cerebrospinal fluid using

Cibacron-blue-3G-A-derivatized photopatterned copolymer in a microfluidic device Anal

Biochem 2004 333 (2) 381-8

94

16 Maccarrone G Milfay D Birg I Rosenhagen M Holsboer F Grimm R Bailey

J Zolotarjova N Turck C W Mining the human cerebrospinal fluid proteome by

immunodepletion and shotgun mass spectrometry ELECTROPHORESIS 2004 25 (14) 2402-

2412

17 Huang L Harvie G Feitelson J S Gramatikoff K Herold D A Allen D L

Amunngama R Hagler R A Pisano M R Zhang W W Fang X Immunoaffinity

separation of plasma proteins by IgY microbeads meeting the needs of proteomic sample

preparation and analysis Proteomics 2005 5 (13) 3314-28

18 Rajic A Stehmann C Autelitano D J Vrkic A K Hosking C G Rice G E Ilag

L L Protein depletion using IgY from chickens immunised with human protein cocktails Prep

Biochem Biotechnol 2009 39 (3) 221-47

19 Huang L Fang X Immunoaffinity fractionation of plasma proteins by chicken IgY

antibodies Methods Mol Biol 2008 425 41-51

20 Greunke K Braren I Alpers I Blank S Sodenkamp J Bredehorst R Spillner E

Recombinant IgY for improvement of immunoglobulin-based analytical applications Clin

Biochem 2008 41 (14-15) 1237-44

21 Xiao Y Gao X Taratula O Treado S Urbas A Holbrook R D Cavicchi R E

Avedisian C T Mitra S Savla R Wagner P D Srivastava S He H Anti-HER2 IgY

antibody-functionalized single-walled carbon nanotubes for detection and selective destruction

of breast cancer cells BMC Cancer 2009 9 351

22 Liu T Qian W J Mottaz H M Gritsenko M A Norbeck A D Moore R J

Purvine S O Camp D G 2nd Smith R D Evaluation of multiprotein immunoaffinity

subtraction for plasma proteomics and candidate biomarker discovery using mass spectrometry

Mol Cell Proteomics 2006 5 (11) 2167-74

23 Hinerfeld D Innamorati D Pirro J Tam S W SerumPlasma depletion with

chicken immunoglobulin Y antibodies for proteomic analysis from multiple Mammalian species

J Biomol Tech 2004 15 (3) 184-90

24 Metz T O Qian W J Jacobs J M Gritsenko M A Moore R J Polpitiya A D

Monroe M E Camp D G 2nd Mueller P W Smith R D Application of proteomics in

the discovery of candidate protein biomarkers in a diabetes autoantibody standardization

program sample subset J Proteome Res 2008 7 (2) 698-707

25 Ru Q C Zhu L A Silberman J Shriver C D Label-free semiquantitative peptide

feature profiling of human breast cancer and breast disease sera via two-dimensional liquid

chromatography-mass spectrometry Mol Cell Proteomics 2006 5 (6) 1095-104

26 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S

Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-

dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of

Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66

27 Rao P V Reddy A P Lu X Dasari S Krishnaprasad A Biggs E Roberts C T

Nagalla S R Proteomic identification of salivary biomarkers of type-2 diabetes J Proteome

Res 2009 8 (1) 239-45

28 Yu K H Barry C G Austin D Busch C M Sangar V Rustgi A K Blair I A

Stable isotope dilution multidimensional liquid chromatography-tandem mass spectrometry for

pancreatic cancer serum biomarker discovery J Proteome Res 2009 8 (3) 1565-76

29 Aebersold R Mann M Mass spectrometry-based proteomics Nature 2003 422

(6928) 198-207

95

30 Ong S E Blagoev B Kratchmarova I Kristensen D B Steen H Pandey A

Mann M Stable isotope labeling by amino acids in cell culture SILAC as a simple and

accurate approach to expression proteomics Mol Cell Proteomics 2002 1 (5) 376-86

31 Hsu J L Huang S Y Chow N H Chen S H Stable-isotope dimethyl labeling for

quantitative proteomics Anal Chem 2003 75 (24) 6843-52

32 Liu H Sadygov R G Yates J R 3rd A model for random sampling and estimation

of relative protein abundance in shotgun proteomics Anal Chem 2004 76 (14) 4193-201

33 Xiang F Ye H Chen R Fu Q Li L NN-dimethyl leucines as novel isobaric

tandem mass tags for quantitative proteomics and peptidomics Anal Chem 82 (7) 2817-25

34 Ross P L Huang Y N Marchese J N Williamson B Parker K Hattan S

Khainovski N Pillai S Dey S Daniels S Purkayastha S Juhasz P Martin S Bartlet-

Jones M He F Jacobson A Pappin D J Multiplexed protein quantitation in

Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents Mol Cell Proteomics

2004 3 (12) 1154-69

35 Zybailov B Coleman M K Florens L Washburn M P Correlation of relative

abundance ratios derived from peptide ion chromatograms and spectrum counting for

quantitative proteomic analysis using stable isotope labeling Anal Chem 2005 77 (19) 6218-

24

36 Old W M Meyer-Arendt K Aveline-Wolf L Pierce K G Mendoza A Sevinsky

J R Resing K A Ahn N G Comparison of label-free methods for quantifying human

proteins by shotgun proteomics Mol Cell Proteomics 2005 4 (10) 1487-502

37 Zybailov B Mosley A L Sardiu M E Coleman M K Florens L Washburn M

P Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae J

Proteome Res 2006 5 (9) 2339-47

38 Mosley A L Florens L Wen Z Washburn M P A label free quantitative

proteomic analysis of the Saccharomyces cerevisiae nucleus J Proteomics 2009 72 (1) 110-20

39 Zhang Y Wen Z Washburn M P Florens L Refinements to label free proteome

quantitation how to deal with peptides shared by multiple proteins Anal Chem 82 (6) 2272-81

40 Messing A Head M W Galles K Galbreath E J Goldman J E Brenner M

Fatal encephalopathy with astrocyte inclusions in GFAP transgenic mice Am J Pathol 1998

152 (2) 391-8

41 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

42 Elias J E Gygi S P Target-decoy search strategy for increased confidence in large-

scale protein identifications by mass spectrometry Nat Methods 2007 4 (3) 207-14

43 You J S Gelfanova V Knierman M D Witzmann F A Wang M Hale J E The

impact of blood contamination on the proteome of cerebrospinal fluid Proteomics 2005 5 (1)

290-6

44 Takanashi J Sugita K Tanabe Y Niimi H Adolescent case of Alexander disease

MR imaging and MR spectroscopy Pediatr Neurol 1998 18 (1) 67-70

45 Hemmer W Wallimann T Functional Aspects of Creatine Kinase in Brain

Developmental Neuroscience 1993 15 (3-5) 249-260

46 Hemmer W Zanolla E Furter-Graves E M Eppenberger H M Wallimann T

Creatine kinase isoenzymes in chicken cerebellum specific localization of brain-type creatine

96

kinase in Bergmann glial cells and muscle-type creatine kinase in Purkinje neurons Eur J

Neurosci 1994 6 (4) 538-49

47 Nordby H K Tveit B Ruud I Creatine kinase and lactate dehydrogenase in the

cerebrospinal fluid in patients with head injuries Acta Neurochirurgica 1975 32 (3) 209-217

48 Bell R D Alexander G M Nguyen T Albin M S Quantification of cerebral

infarct size by creatine kinase BB isoenzyme Stroke 1986 17 (2) 254-60

49 Steen C Wilczak N Hoogduin J M Koch M De Keyser J Reduced Creatine

Kinase B Activity in Multiple Sclerosis Normal Appearing White Matter PLoS ONE 5 (5)

e10811

50 Chwieralski C Welte T Buumlhling F Cathepsin-regulated apoptosis Apoptosis 2006

11 (2) 143-149

51 Droga-Mazovec G Bojic L Petelin A Ivanova S Romih R Repnik U Salvesen

G S Stoka V Turk V Turk B Cysteine cathepsins trigger caspase-dependent cell death

through cleavage of bid and antiapoptotic Bcl-2 homologues J Biol Chem 2008 283 (27)

19140-50

52 Duffy M J Proteases as prognostic markers in cancer Clin Cancer Res 1996 2 (4)

613-8

53 Guicciardi M E Deussing J Miyoshi H Bronk S F Svingen P A Peters C

Kaufmann S H Gores G J Cathepsin B contributes to TNF-alpha-mediated hepatocyte

apoptosis by promoting mitochondrial release of cytochrome c J Clin Invest 2000 106 (9)

1127-37

54 Butt A M Jenkins H G Morphological changes in oligodendrocytes in the intact

mouse optic nerve following intravitreal injection of tumour necrosis factor J Neuroimmunol

1994 51 (1) 27-33

55 Hagemann T L Gaeta S A Smith M A Johnson D A Johnson J A Messing

A Gene expression analysis in mice with elevated glial fibrillary acidic protein and Rosenthal

fibers reveals a stress response followed by glial activation and neuronal dysfunction Hum Mol

Genet 2005 14 (16) 2443-58

56 Falk J Bonnon C Girault J A Faivre-Sarrailh C F3contactin a neuronal cell

adhesion molecule implicated in axogenesis and myelination Biol Cell 2002 94 (6) 327-34

57 Eckerich C Zapf S Ulbricht U Muumlller S Fillbrandt R Westphal M Lamszus

K Contactin is expressed in human astrocytic gliomas and mediates repulsive effects Glia

2006 53 (1) 1-12

97

Table 1 Statistically changed proteins between transgenic and control mouse CSF using

dNSAF analysis

Accession Protein Pa SC

b Fold

Changec

Control

dSpCd

Transgenic

dSpCd

KCRM_MOUSE Creatine kinase M-type 0034 425 30 182 541

HEXB_MOUSE Βeta-hexosaminidase subunit β 0012 183 uarr 0 59

CMGA_MOUSE Chromogranin-A 000078 153 darr 44 0

ANT3_MOUSE Antithrombin-III 0017 176 -14 67 47

SAP3_MOUSE Ganglioside GM2 activator 0045 415 darr 28 0

SPA3N_MOUSE Serine protease inhibitor A3N 0047 170 42 10 42

CO1A2_MOUSE Collagen alpha-2(I) chain 0043 56 darr 19 0

BLVRB_MOUSE Flavin reductase 00054 204 uarr 0 12

CATS_MOUSE Cathepsin S 00032 232 uarr 0 73

GFAP_MOUSE Glial fibrillary acidic protein 0021 107 uarr 0 21

RET4_MOUSE Retinol-binding protein 4 0040 189 darr 13 0

CBPE_MOUSE Carboxypeptidase E 0043 139 darr 11 0

CATL1_MOUSE Cathepsin L1 0015 87 94 02 19

The statistics are performed using the t-test from the ln(dNSAF) Gaussian data

a P p-value of the t-test where the null hypothesis states that there was no change in expression between

control and transgenic GFAP overexpressor b SC percentage of sequence coverage of the listed protein from

sequenced tryptic peptides c Fold Change positive values are indicative of a fold change for transgenic CSF

negative values are fold changes for control CSF (ie down-regulated in transgenic CSF) uarr indicates the protein

was only detected in transgenic CSF and darr indicates the protein was only detected in control CSF d dSpC

distributive spectral counts which represent the average spectral counts observed per run analysis on the mass

spectrometer and corrected using distributive analysis for peptides shared by more than one protein

98

Table 2 Proteins showing greater than three-fold changes with at least two unique

peptides identified for each protein

Accession Protein SC ()a Fold

Change b

Control

dSpC c

Transgenic

dSpC c

MUG1_MOUSE Murinoglobulin-1 precursor 147 42 155 37

CO4B_MOUSE Complement C4-B 113 54 22 118

PRDX6_MOUSE Peroxiredoxin-6 576 46 14 64

CNTN1_MOUSE Contactin-1 65 darr 41 0

CATB_MOUSE Cathepsin B 263 42 23 97

CAH3_MOUSE Carbonic anhydrase 3 396 76 11 84

APOH_MOUSE Beta-2-glycoprotein 1 128 44 44 1

CSF1R_MOUSE Macrophage colony-stimulating

factor 1 receptor

80 44 14 62

PGK1_MOUSE Phosphoglycerate kinase 1 355 36 17 61

NDKB_MOUSE Nucleoside diphosphate kinase B 408 34 13 44

FHL1_MOUSE

Four and a half LIM domains

protein 1 243 39 13 51

NELL2_MOUSE

Protein kinase C-binding protein

NELL2 45 -43 13 03

MDHM_MOUSE

Malate dehydrogenase

mitochondrial 385 41 12 49

CSF1R_MOUSE Macrophage colony-stimulating

factor 1 receptor

80 44 14 62

a SC percentage of sequence coverage of the listed protein from sequenced tryptic peptides b Fold

Change positive values are indicative of a fold change for transgenic CSF negative values are fold changes for

control CSF and darr indicates the protein was only detected in control CSF c dSpC distributive spectral counts

which represent the average spectral counts observed per run analysis on the mass spectrometer and corrected using

distributive analysis for peptides shared by more than one protein

99

Table 3 Validation of changes in proteins revealed by MS-based spectral counting

consistent with previously published microarray data

Consistent changes in RNA and proteomic data

uarr regulated in transgenic darr regulated in transgenic

Cathepsin S Contactin-1

Cathepsin B Carboxypeptidase E

Cathepsin L1

Peroxiredoxin-6

Complement C4-B

Glial fibrillary acidic protein

Serine protease inhibitor A3N

Note Validation of putative biomarkers from the current proteomics dataset by previously

published RNA microarray data55

Both up and down regulated proteins were consistent with the

RNA microarray data

_

100

___________________________________________

SUPPLEMENTAL INFORMATION (Available upon request)

Table S1 Compilation list of proteins identified from all the control and transgenic biological

replicates

Table S2 Distributive spectral counting calculations performed for proteins observed to share

identified peptides

Table S3 Proteins with dNSAF spectral counting performed including t-test analysis for a

comparison between transgenic and control CSF

101

FIGURE LEGENDS

Figure 1 The general workflow indicating the major steps involved in sample collection sample

processing mass spectrometric data acquisition and analysis of mouse CSF samples

Figure 2 Assessment of run to run variability of the base peak chromatograms within and

between two biological and technical replicates The peak profile and intensity scale is

consistent between the four chromatograms The four panels show two biological replicates (Tg

4 and Tg5) with two technical replicates for each biological sample

Figure 3 A Venn-diagram of the biological triplicates (1 2 and 3) of transgenic mouse

CSF showing 78 proteins identified in all three experiments B Venn-diagram of the biological

triplicates (1 2 and 3) of control mouse CSF having 61 proteins identified by all three

replicates C The overlap between control and transgenic CSF proteomic analysis showing 139

proteins identified by both groups and 73 and 54 uniquely identified by respective groups

Figure 4 Assessment of technical replicate variability between biological replicates The error

bars in both A and B are the standard deviation derived from the technical triplicates for each

biological replicate Panel A shows creatine kinase M having more or equal variability in the

biological triplicates than each technical triplicate The means of the biological triplicates are

illustrated by the dashed line Panel B represents kininogen-1 which is unchanged between

control (Ctl) and transgenic (Tg) samples The biological variability coupled with the technical

replicates provides a barely noticeable difference in the pooled mean between control and

102

transgenic spectral counts The difference in means is contrasted with the three fold change

observed from creatine kinase M (A)

Figure 5 MSMS profile of the tryptic peptide LSVEALNSLTGEFK from creatine kinase M

(A) The top MSMS is from a control sample with an elution at 463 minutes (B) The bottom

MSMS is from the GFAP transgenic sample with an elution at 46 minutes These tandem MS

spectra show instrument reliability and consistent fragmentation patterns which are necessary for

spectral counting analysis

Figure 6 Validation of MS spectral counts using a GFAP ELISA GFAP content (ngL)

measured within mouse CSF from both transgenic and control animals The data represents the

average with standard deviation (n=5 each sample represents pooling of 1 to 8 animals) The

statistics are performed using a student t-test plt00001

103

Figure 1

104

Figure 2

105

Figure3

106

Figure 4

107

Figure 5

108

Figure 6

Ctl Tg

100

1000

10000

100000

Mouse CSF Sample

GF

AP

(n

gL

)

109

Table of Contents Summary

Cerebrospinal fluid (CSF) was obtained from transgenic mouse models of Alexander disease as

well as healthy controls We immunodepleted pooled CSF from mice by a modified IgY-14

protocol Shotgun proteomics utilizing trypsin digestion 1-D nanoRP separation CID tandem

mass spectrometry analysis Mascot database searching and relative quantitation via distributive

normalized spectral abundance factor resulted in the identification of 266 proteins and 27

putative biomarkers

110

Chapter 4

Genomic and proteomic profiling of rat adapted scrapie

Adapted from ldquoGenomic and proteomic profiling of rat adapted scrapierdquo Herbst A

Cunningham R Wang J Wellner D Ma D Li L Aiken J In preparation

111

Abstract

A novel model of prion disease using rats called Rat Adapted Scrapie (RAS) was

developed with the genomics from brain and proteomics from cerebrospinal fluid (CSF) profiled

The CSF proteome from control and RAS (biological N=5 technical replicates N=3) were

digested and separated using one dimensional reversed-phase nanoLC coupled to data-

dependent tandem MS acquisition on an ion trap mass spectrometer In total 512 proteins 167

non-redundant protein groups and 1032 unique peptides were identified with a 1 false

discovery rate (FDR) Of the proteins identified 21 were statistically up-regulated (plt005) and

7 statistically down-regulated in the RAS CSF We also identified 1048 genes which were

differentially regulated in rat prion disease and upon mapping these changes to mouse gene

expression however only 22 of these genes were in common with mRNAs responding to

prion infection in mice suggesting that the molecular pathology observed in mice may not be

applicable to other species The proteins are compared to the differentially regulated genes as

well as to previously published proteins showing changes consistent with other prion animal

models

112

Introduction

Prion diseases are an unusual class of fatal transmissible neurodegenerative disorders

that affect the mammalian central nervous system They are caused by the accumulation of an

abnormal conformation of the normal host encoded cellular prion protein PrPC This

conformational rearrangement of PrPC is brought about by template directed misfolding wherein

seed molecules of the abnormal isoform PrPScrapie

PrPSc

convert PrPC into new PrP

Sc molecules

Scrapie in sheep and goats is the prototypic prion disease and is so named because clinically

affected animals scrape their wool off on pen and stall walls Laboratory investigation of prion

diseases typically relies upon rodents which can be infected with natural isolates of scrapie1

albeit with some difficulty as ovid scrapie isolates need time to adapt to rodents This adaptation

is characteristic of prion disease interspecies transmissions and properly reflects the molecular

adaptation that must occur to allow interaction between exogenous foreign PrPSc

and host PrPC

molecules selecting for conformers which exhibit template directed misfolding In some cases

no conformational solution is found reflecting a species barrier to disease transmission

In recent years advances in genomics and proteomics technologies have allowed

unprecedented examination of the biomolecules that are altered upon exposure to prion agents

These studies2 3

have relied upon analysis of gene and protein expression changes in response to

prion infection with the aim of trying to identify pathways that might underlie the mechanism of

prion-induced neurotoxicity A second important aim has been to identify signature molecules

that might act as surrogate biomarkers for these diseases as there are significant analytical

challenges associated with sensitively detecting and specifically distinguishing disease-induced

conformational changes (PrPSc

) of the prion protein from normal host conformations (PrPC)

113

Mass spectrometry (MS)-based proteomics has become a promising tool for biomarker

discovery from biological fluids such as CSF blood and urine4-6

Two-dimensional gel

electrophoresis MS (2D-GE MS) and two-dimensional differential gel electrophoresis (2D-DIGE

MS) are the most widely used methods for proteomic profiling of prion infected CSF so far due

to the advantage of ready separation and quantification of proteins in complex biological samples

Studies using 2D-GE MS or 2D-DIGE MS to profile proteins in CSF have led to the

identifications of currently accepted biomarker for sCJD 14-3-3 protein and other potential

biomarkers for prion diseases7-9

However the application of this method in biomarker

discovery is limited by insufficient sensitivity and potential bias against certain classes of

proteins as gel-based separation does not work well for the low abundance proteins very basic

or acidic proteins very small or large proteins and hydrophobic proteins 10 11

In contrast to 2D-

GE or 2D-DIGE shotgun proteomics employs enzymatic digestion of the protein samples

followed by chromatographic separation prior to introduction into a mass spectrometer for

tandem MS analyses Shotgun proteomics has become increasingly popular in proteomic

research because these methods are reproducible highly automated and have a greater

likelihood of detecting low abundance proteins12 13

Due to the sample complexity in CSF and

because albumin comprises over half of the protein content in CSF removal of high-abundance

proteins including albumin is necessary to improve proteomic coverage and identify low-

abundance proteins One method is IgY immunodepletion14 15

which is performed prior to LC-

MSMS and uses species specific antibodies from chicken yolk to remove abundant proteins in

biological samples such as CSF In the present work CSF from control and rat adapted scrapie

animal models (biological N=5 technical N=3 replicates) were analyzed by LC-MSMS and we

114

indentified 512 proteins and 167 non-redundant protein isoforms (referred to as protein groups)

with 21 proteins being statistically up-regulated (p lt 005) and 7 statistically down-regulated

By and large this work has been performed using laboratory mice for the gene

expression studies and human cerebrospinal fluid for proteomics mice do not provide sufficient

volumes of CSF for proteomic studies Both approaches are exceedingly valuable as the mouse

model allows cross-sectional time course experiments to be performed including the important

pre-clinical phase of disease Critically however the relevance and generalizability of mouse

prion responses to other prion diseases especially human disease is unknown Human proteomic

studies on CSF guarantee relevant protein identifications but are limited to the clinical phase of

the disease when apparent markers may reflect gross neurodegeneration covering up subtle but

more specific responses To address these issues we have adapted mouse RML prions into rats

with the aim of expanding the knowledge of prion disease responses addressing the limitations

of mice and opening up the possibility of preclinical proteomic profiling in a laboratory rodent

In the present work CSF samples from control and rat adapted scrapie were analyzed by system

biology approaches Here we show that Rat Adapted Scrapie (RAS) is a powerful tool for an -

omics based approach to decipher the molecular impact of prion disease in vivo with

applicability to the molecular mechanisms of disease and biomarker discovery We identified

1048 genes which were differentially regulated in rat prion disease Astonishingly on the whole

mouse orthologs of these genes did not change in mouse adapted scrapie and vice versa

questioning the universality of previous mouse gene expression profiles These RAS gene

expression changes were identified in the CSF proteome where we detected 512 proteins and 167

protein groups with 21 proteins being statistically up-regulated (plt005) and 7 statistically down-

115

regulated in the CSF of prion diseased rats Many of the proteins detected have previously been

observed in human CSF from CJD patients

Materials and Methods

Ethics Statement

This study was carried out in accordance with the recommendations in the NIH Guide for Care

and Use of Laboratory Animals and the guidelines of the Canadian Council on Animal Care The

protocols used were approved by the Institutional Animal Care and Use Committees at the

University of Wisconsin and University of Alberta

Chemicals

Acetonitrile (HPLC grade) and ammonium bicarbonate (certified) were purchased from

Fisher Scientific (Fair Lawn NJ) Deionized water (182 MΩcm) was prepared with a Milli-Q

Millipore system (Billerica MA) Optima LCMS grade acetonitrile and water were purchased

from Fisher Scientific (Fair Lawn NJ) DL-Dithiothreitol (DTT) and sequencing grade modified

trypsin were purchased from Promega (Madison WI) Formic acid (ge98) was obtained from

Fluka (Buchs Switzerland) Iodoacetamide (IAA) tris(hydroxymethyl)aminomethane (Tris

ge999 ) ammonium formate (ge99995) glycine (ge985) and IgY-7 antibodies were

purchased from Sigma-Aldrich (Saint Louis MO)

Rat Transmission and Adaptation

Prion agents used were the rocky mountain lab RML strain of mouse adapted scrapie

Stetsonville transmissible mink encephalopathy16

(TME) Hyper (Hy) strain of Hamster TME 17

1st passage Skunk adapted TME prepared as described and C from genetically defined

transmissions18

116

Brains from animals clinically affected with prion disease were aseptically removed and

prepared as 10 wv homogenates in sterile water 50 microL of 10 brain homogenate was

inoculated into weanling wistar rats intracranially After one year of incubation preclinical rats

from RML infections were euthanized by CO2 inhalation and the brain excised homogenized

and re-inoculated into naive animals Subsequent serial passages were from rats clinically

affected with rat adapted scrapie

Brains from rat passages were aseptically removed and bisected sagittally Brain halves

were reserved for immunohistochemistry (IHC) in formalin or frozen for immunoblotting RNA

isolation or subsequent passage For IHC tissues were first fixed in 10 buffered formalin

followed by antigen retrieval at 121 oC 21 bar in 10mM citrate buffer for 2 min After cooling

to room temperature sections were treated with 88 formic acid for 30 min and 4M guanidine

thiocyanate for 2 h Endogenous peroxidases were inactivated by 003 hydrogen peroxide and

tissue was blocked with 1 normal mouse serum Mouse anti-PrP mAB SAF83 (Cayman

Chemical) was biotinylated and applied at a 1250 dilution in blocking buffer overnight at 4 oC

Washes were performed in 10mM PBS with 005 tween-20 Streptavidin-peroxidase

(Invitrogen) was added for Diaminobenzidine (DAB) color reaction Anti-GFAP

immunohistochemistry was performed as above except that formic acid and guanidine treatment

steps were excluded Mouse anti-GFAP mab G-A-5 (Sigma) was used at a 1400 dilution

Brain samples for immunoblot were treated with or without Proteinase K (Roche) at a

ratio of 35 microg PK 100 ug protein at 50 microg mL for 30 minutes at 37 oC Phosphotungstenic acid

enrichments were performed as described14 19

Bis-Tris SDS-PAGE was performed on 12

polyacrylamide gels (Invitrogen) and transferred to PVDF Immunoblotting was performed using

117

mABs SAF83 (Cayman Chemical) 6H4 (Prionics) or 3F10 (a kind gift from Yong-Sun Kim) all

at a 120000 dilution

Gene Expression Profiling

RNA was extracted from frozen brain halves from clinically affected and control animals with

the QIAshredder and RNeasy mini kit (Qiagen Valencia CA) in accordance with the

manufacturerrsquos recommendations Due to the relatively large mass of rat brain an initial

homogenization was performed with a needle and syringe in 5mL of buffer RLT before further

diluting an aliquot in accordance with the manufacturers protocol Total RNA was amplified and

labeled in preparation for chemical fragmentation and hybridization with the MessageAmp

Premier RNA amplification and labeling kit (Life Technologies Grand Island NY) Amplified

and labeled cRNAs were hybridized on Affymetrix (Santa Clara CA) rat genome 230 20 high

density oligonucleotide arrays in accordance with the manufacturers recommendations

Differentially Expressed Genes were identified using Arraystar 50 (DNAStar Madison WI)

Robust multi-array normalization using the quantile approach was used to normalize all

microarray data A moderated T-test with a multiple comparison adjustment20

was used to reduce

the false discovery rate yet preserve a meaningful number of genes for pathway analysis

Pathway analysis was performed using the DAVID Bioinformatics database21

Comparative

analysis of genes induced by prions in mouse22

and rat disease was performed on genes

exhibiting at least a 2-fold change in expression Orthologs of rat and mouse genes were

identified using ENSEMBLE biomart release 6823

CSF Proteomic Profiling

118

CSF was collected from rats anaesthetized with isoflorane via puncture of the cisterna

magna A volume of 100-200 microL was routinely obtained CSF was touch spun for 30s at 2000xg

on a benchtop nano centrifuge to identify any blood contamination by the presence of a red

pellet CSF with blood contamination was not used for proteomic analysis CSF was prepared

for profiling by first depleting abundant proteins with an antibody based immunopartitioning

column IgY-R7 (Sigma-Aldrich Saint Louis MO) The manufacturers recommendations were

followed with a few deviations 40 microg of protein (~100 microL CSF) was bound to 100 microL of IgY

bead slurry in total volume of 600 microL formulated in 150mM ammonium bicarbonate The flow

through was collected and denatured by the addition of 10 microL of 8 M guanidine thiocyanate and

lyophilized in a speed-vac apparatus The sample was then reconstituted in 10 microL of water and 1

microL of 050 M DTT were added and allowed to incubate at 37oC for 30 minutes After incubation

27 microL of 055 M IAA was added for carboxymethylation and the sample was allowed to

incubate in the dark for 15 minutes After that 1 microL of 050 M DTT was added and allowed to

sit at room temperature for 10 minutes To perform trypsin digestion 70 microL of 50 mM

NH4HCO3 was then added along with 025 microg trypsin Digestion was performed overnight at

37oC and quenched by addition of 5 microL of 10 formic acid The tryptic peptides were then

subjected to solid phase extraction using an Agilent OMIX Tip C18 100 microL (Santa Clara CA)

Peptides were eluted with 50 ACN in 01 formic acid concentrated and reconstituted in 30

microL H2O with 01 formic acid

RP nanoLC separation

The setup for nanoLC consisted of a nanoAcquity (Milford MA) with a 10 microL injection

loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B consisted of

ACN Injections consisted of 5 microL of prepared CSF tryptic peptides onto a Waters 5 microm

119

Symmetry C18 180 microm x 20 mm trap cartridge (Milford MA) at a flow rate of 75 microLmin for 5

minutes at 95 A 5 B Separation was performed on a Waters 17 microm BEH130 C18 75 microm x

100 mm (Milford MA) with a gradient of 5 to 15 B over 10 minutes followed by 15 to

40 B over 80 minutes at room temperature

Mass spectrometry data acquisitions

An ion trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with a Bruker CaptiveSpray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Waters Acquity console software to perform MS acquisitions for all experiments Smart

parameter setting (SPS) was set to 700 mz compound stability and trap drive level was set at

100 Optimization of the CaptiveSpray source resulted in dry gas temperature of 160oC dry

gas flow of 30 Lmin capillary voltage of -1325 V end plate offset of -500 V MSMS

fragmentation amplitude of 10V and SmartFragmentation set at 30-300

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms one spectral average rolling average 2 acquisition

range of 350-1500 mz and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS

generation the ICC target was set to 300000 maximum accumulation time 5000 ms two

spectral averages acquisition range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

120

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required for spectral counting to prevent

loss of spectra Identification of peptides were performed using Mascot24

(Version 24 Matrix

Science London UK) Database searching was performed against a forward and reversed

concatenated SwissProt Rattus rattus Mascot search parameters were as follows Allowed

missed cleavages 2 enzyme trypsin fixed modification carboxymethylation (C) variable

modifications oxidation (M) peptide tolerance plusmn12 Da maximum number of 13

C 1 MSMS

tolerance plusmn05 Da instrument type ESI-Trap Data was then transformed into Dat file formats

and transferred to ProteoIQ (NuSep Bogart GA) False positive analyses were calculated using

ProteoIQ and set at 1

Results

Development of Rat Adapted Scrapie

To develop a rat adapted strain of prion disease we introduced 6 different prion agents RML

TME Hy Skunk Adapted TME and Chronic Wasting Disease (CWD) from wild type 96G and

96S deer16-18

into 6 rats (Fig 1) Of these primary transmissions only RML induced the

accumulation of Proteinase K resistant PrP after one year of incubation as determined by western

blotting on 10 brain homogenates and PrPSc

enriched phoshotungstenic acid precipitated brain

homogenates Second passage of rat adapted RML scrapie resulted in clinically affected rats at

565 days post inoculation Serial passages of TME or CWD agents were not performed Clinical

symptoms consisted of ataxia lethargy wasting kyphosis and myoclonus Prion affected rats

121

also showed low level porphyrin staining around their head Subsequent serial passage decreased

incubation time to 215 days

Proteinase K resistant prion protein was observed from all clinically affected animals both by

immunoblot (Fig 2) and by immunohistochemistry (Fig 3) Di- and mono-glycosylated bands

were the most abundant isoforms of PrPSc

PrPSc

was extensively deposited in the cerebral cortex

hippocampus thalamus inferior colliculus and granular layer of the cerebellum GFAP

expressing activated astrocytes were found throughout the brain particularly in the white matter

of the hippocampus thalamus and cerebellum Spongiform change was a dominant feature of

clinical rat

Gene expression Profiling

In total 1048 genes were differentially regulated within a 95 confidence interval

(Supplementary Table 1) and 305 genes were differentially expressed by greater than 2-fold (Fig

4) The 1048 genes that were statistically significant were used for pathway analysis using

DAVID Pathway analysis suggested that the gene expression profile was consistent with

immune activation and maturation as well as inflammation (Supplementary Table 2) a likely

interpretation given the observable astrocytosis (Fig 3) and gliosis associated with prion disease

Other pathways highlighted by the analysis included increases in transcription of genes involved

in lysosomes and endosomes

To further probe the gene expression data we compared genes which were differentially

expressed in rat adapted scrapie against their orthologs expression in mouse scrapie and vice

versa (Figs 5 and 6) We did not observe a high degree of correlation between expression fold

changes For example GFAP a gene whose up-regulation in prion disease is well known was

122

increased 25 fold in rat adapted scrapie but up-regulated 93 fold in mouse scrapie A

qualitative analysis of expression of orthologs in prion disease suggests that many genes

deregulated at least 2-fold in mouse or rat do not have orthologs that are differentially expressed

For example the gene RNAseT2 was up-regulated greater than threefold in rat adapted scrapie

but was not significantly up-regulated in mouse Similarly three genes important in metals

homeostatsis metallothionein 1a and 2a and ceruloplasmin were up-regulated in rat 46 43 and

3 fold respectively but were not differentially expressed in mouse prion disease

CSF Proteomics

Each immunodepleted biological replicate (N=5 for each control and RAS) had technical

triplicates performed The triplicates were summed together using ProteoIQ Peptide spectral

counts were then summed for each protein and subjected to dNSAF analysis using ProteoIQ

internal algorithms Details for this method can be found elsewhere25 26

but briefly peptide

spectral counts are summed per protein (SpC) based on unique peptides and a weighted

distribution of any shared peptides with homologous proteins T-tests were used to identify

significant changes in protein expression 1032 unique peptides which identify 512 proteins and

167 protein groups were found Of these 512 proteins 437 were identified in both RAS and

control CSF samples (Fig 7) The complete list of all 512 proteins can be viewed in

Supplemental Table 3 Using the ProteoIQ the most likely isoform was selected such as 14-3-3

protein gamma

From Table 1 we observe five proteins that agree with the genomic data for up

regulations in RAS granulin macrophage colony-stimulating factor 1 receptor cathepsin D

complement factor H and ribonuclease T2 In addition serine protease inhibitor A3N was not

123

detected as up regulated in the RAS genomic data but was found to be up-regulated in previous

genomic profiling of the mouse prion model22

One interesting trend from the data in Table 1 is

that the majority of proteins found to be up-regulated in the RAS model were not detected in the

control samples The absence of the detection of those proteins such as ribonuclease T2 in the

control CSF does not necessarily suggest the absence of the protein nonetheless it is below the

detection limits for this current proteomics protocol and instrumentation

Discussion

Mice have been the preferred laboratory rodent for prion diseases research because they

can be inexpensively housed and are amenable to transgenesis which allows for short incubation

periods as well as hypothesis testing upon targeted pathways Pursuant to the determination of

the mouse genome and the development of high density transcriptional arrays for measurements

of gene expression profiling mice have been used extensively to examine the molecular

pathology of prion disease probing the impact of disease and animal strain In order to expand

upon this foundation we adapted mouse prions to infect rats We reasoned that by taking a

comparative approach to the molecular pathology of prion disease inferences could be obtained

into the variability of the molecular response to prion diseases and that understanding this

variability might suggest whether human prion disease responses are more or less similar to

mouse responses A second rationale is the desire to identify surrogate markers of prion disease

While this approach has been taken before using gene expression profiling a more direct

approach has been to use proteomic approaches on cerebral spinal fluid (CSF) identifying

proteins that are increase in abundance with disease A rat prion disease is valuable for this

because the rat proteome is established and rats allow for the collection of relatively large

volumes of CSF (~100 microL per animal) which allows for robust analytical separations enhancing

124

detection of biomarkers Finally rats unlike humans can be used in a time course study of prion

disease This allows for the identification of early transcriptional and proteomic responses to

prion infection responses which are particularly valuable for the identification of surrogate

disease biomarkers

To initiate the development of a rat prion disease we attempted to adapt six different

prion disease agents PrPres

molecules to rat via intracranial inoculation of weanling animals

(Figure 1) Of these six agents only mouse RML prions were able to surmount the species

barrier to prion disease transmission Mouse PrP differs from rat by eight amino acid changes

six in the mature protein and four in the N-terminal octa-peptide repeat region (Supplementary

Fig 1) As rat shares the most prion protein sequence homology with mouse it is perhaps not

surprising that it transmitted whereas the other did not confirming that the primary prion protein

sequence is the most important determinant for interspecies transmission We conclude that there

is a large molecular species barrier preventing conversion of rat PrPc into PrP

res

The transmission of mouse RML into rats was characterized by a shortening of the

incubation period following each passage This is indicative of agent adaption to the new host

and increases in the titer present in end-stage brain Overall our adaptation of mouse prion

disease into rats resulted in a similar agent to that observed by Kimberlin27

The differences in

incubation period at second passage are largely due to our collecting the animals at 365 days post

inoculation upon first passage whereas Kimberlin and Walker allowed the first passage animals

to reach end-stage clinical rats

Rat adapted scrapie is in many ways similar to other prion diseases The clinical period of

disease is marked by a progressive neurological dysfunction with prominent ataxia kyphosis and

125

wasting The protease resistant PrP observed is typical as is the widespread deposition of PrPSc

in

the brain Spongiosis and reactive astrogliosis are as expected of a prion disease

Gene expression profiles from rats clinically affected with prion disease revealed a strong

neuronal inflammation associated with proliferated and activated astrocytes This is perhaps best

observed through the up-regulation of GFAP GFAP positive astrocytes were omnipresent

throughout the brain and GFAP mRNA was up-regulated 25 fold The up-regulation of GFAP is

a hallmark of the molecular response to prion infection and has been routinely observed Our

comparative analysis of gene expression changes in mouse RML versus rat adapted scrapie

suggest substantial differences in gene expression in response to prion disease despite the fact

that the overall response is neuro-inflammatory This suggests that the potential overlap between

mouse expression profiles and a putative human CJD expression profile could be quite different

at the level of individual transcripts that might be expected to be changed

CSF Proteomics

CSF proteomics can be exceedingly challenging due to the small sample available large

dynamic range in proteins and abundant proteins limiting sample loading onto nanoscale

columns Dynamic range reduction in the CSF sample was achieved using a custom amount of

IgY-7 antibodies to abundant proteins such as albumin After immunodepletion the total

protein content was reduced by ~90 limiting the proteomics analysis to one dimensional

separation Label free quantitation spectral counting was performed because it requires less

protein and does not increase sample complexity The proteins identified from the affected and

control N=5 are shown in Supplemental Table 2 Consistent detection of proteins in CSF from

both control and infected rats was observed (Fig 7C) Only two proteins were identified in

126

controls that were not observed in RAS and only 10 proteins were only observed in RAS Some

of these proteins that were only identified in RAS are significantly changed (Supplemental Table

3) One concern in proteomics data is the variability from run to run and the possibility that

certain proteins are identified from different unique peptides Figure 7A shows that the vast

majority of the unique peptides detected (783) were identified with a 1 FDR in both RAS and

control CSF samples highlighting the analytical reproducibility of our methodology

Proteomic analysis of the infected rat CSF provides a reasonable approach to cross

validate biomarkers detected by gene expression profiling For example RNAseT2 a secreted

ribonuclease was up-regulated 3-fold at the mRNA level and was also enriched in CSF from

infected rats (Table 1) Similarly coordinated increases in detection of colony stimulating factor

1 receptor complement factor H granulin and cathepsin D were also observed Conversely

proteomic analysis of CSF also allows for the observation of post-transcriptional responses to

prion disease that might be absent in gene expression profiling The 14-3-3 proteins and neuron

specific enolase both known markers for CJD are only detected by proteomic analysis Thus

gene expression profiling and proteomic detection serve to increase confidence in the

observation of up-regulation enhancing the likelihood that proteins detected by both

methodologies are specific and perhaps may be more sensitive at earlier time points

Comparison to human CSF prion disease proteome

In our comparative analysis 21 proteins were found to be up-regulated while 7 proteins

down-regulated in prion infected rats Included in the up-regulated proteins were the 14-3-3

proteins epsilon and gamma 14-3-3 zetadelta was also up-regulated but not statistically

significant Enolase 2 or neuron specific enolase (NSE) was also up-regulated in CSF of infected

127

rats These proteins are all in agreement with results from previous proteomic profiling of human

CSF from patients with CJD8 9

The detection of 14-3-3 protein is included in the diagnostic

criteria approved by World Health Organization for the pre-mortem diagnosis of clinically

suspected cases of sCJD28

although its application in large-scale screening of CJD is still

debated due to high false positive rate where elevated 14-3-3 protein levels were also reported in

other conditions associated with acute neuronal damage29 30

It was suggested that other brain-

derived proteins that are also specific to CJD can be used in conjunction with 14-3-3 protein to

increase diagnosis accuracy and specificity31

NSE is present in high concentration in neurons

and in central and peripheral neuroendocrine cells NSE serves as another valuable biomarker in

diagnosis of CJD as the elevated level of NSE in CSF was found in early and middle stages of

CJD 32

Other proteins detected in CSF included cystatin C and serpina3N although both of

these were not statistically changed These proteins were both previously identified as being

putative biomarkers for CJD33 34

Utility for biomarker discovery with emphasis on onset of biomarker appearance in CSF

The investigation of the protein changes in CSF from RAS compared to control rats

provides a solid foundation when investigating potential biomarkers with prion disease onset

The cross-validation of the genomic and proteomics data further emphasizes the targets for

consideration during disease onset Biomarker discovery provides the potential to determine if

animals such as cows in bovine spongiform encephalopathy (BSE) are affected instead of

having to be euthanized when BSE is expected In addition CSF collection in mice or hamsters

Prion models is extremely difficult and limited alternatively with the advent of the RAS model

CSF can be collected in sufficient amounts for CSF proteomics CSF collection for mice or

hamsters can be ~10 microL and could be contaminated with blood further complicating proteomic

128

analysis unlike rats which over 10 times more CSF can be collected per animal35

Due to the

amount of CSF available time course studies analyzing CSF proteomics is possible in RAS due

to animal numbers that are manageable and reasonable The RAS model further allows

investigators to bypass working with highly infections CJD CSF samples to investigate the CSF

proteome changes

Conclusion

In this study we have described the gene and protein expression changes in brain and

spinal fluid from a transmission of mouse prions into rats We find that while the overall gene

expression profile in rats is similar to that in mice the specific genes that make up that profile

are different suggesting that genes that change in response to prion disease in different species

may not be as conserved Analysis of CSF from rat adapted scrapie identified similar protein

changes as known in human CJD The rat will be a useful model to identify surrogate markers

that appear prior to the onset of clinical disease and thus may be of higher specificity and

sensitivity

Supplemental Information Available Upon Request

1 Chandler R L Encephalopathy in mice produced by inoculation with scrapie brain material Lancet 1961 1 (7191) 1378-9 2 Silva C J Onisko B C Dynin I Erickson M L Requena J R Carter J M Utility of Mass Spectrometry in the Diagnosis of Prion Diseases Anal Chem 2011 3 Wei X Herbst A Ma D Aiken J Li L A quantitative proteomic approach to prion disease biomarker research delving into the glycoproteome J Proteome Res 2011 10 (6) 2687-702 4 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome Res 2008 7 (1) 386-99 5 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 2003 878 (22) 2003-12 6 Cunningham R Ma D Li L Mass spectrometry-based proteomics and peptidomics for systems biology and biomarker discovery Frontiers in Biology 2012 7 (4) 313-335

129

7 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66 8 Harrington M G Merril C R Asher D M Gajdusek D C Abnormal proteins in the cerebrospinal fluid of patients with Creutzfeldt-Jakob disease N Engl J Med 1986 315 (5) 279-83 9 Piubelli C Fiorini M Zanusso G Milli A Fasoli E Monaco S Righetti P G Searching for markers of Creutzfeldt-Jakob disease in cerebrospinal fluid by two-dimensional mapping Proteomics 2006 6 Suppl 1 S256-61 10 Lilley K S Razzaq A Dupree P Two-dimensional gel electrophoresis recent advances in sample preparation detection and quantitation Curr Opin Chem Biol 2002 6 (1) 46-50 11 Oh-Ishi M Maeda T Separation techniques for high-molecular-mass proteins J Chromatogr B Analyt Technol Biomed Life Sci 2002 771 (1-2) 49-66 12 Metz T O Qian W J Jacobs J M Gritsenko M A Moore R J Polpitiya A D Monroe M E Camp D G 2nd Mueller P W Smith R D Application of proteomics in the discovery of candidate protein biomarkers in a diabetes autoantibody standardization program sample subset J Proteome Res 2008 7 (2) 698-707 13 Aebersold R Mann M Mass spectrometry-based proteomics Nature 2003 422 (6928) 198-207 14 Huang L Harvie G Feitelson J S Gramatikoff K Herold D A Allen D L Amunngama R Hagler R A Pisano M R Zhang W W Fang X Immunoaffinity separation of plasma proteins by IgY microbeads meeting the needs of proteomic sample preparation and analysis Proteomics 2005 5 (13) 3314-28 15 Rajic A Stehmann C Autelitano D J Vrkic A K Hosking C G Rice G E Ilag L L Protein depletion using IgY from chickens immunised with human protein cocktails Prep Biochem Biotechnol 2009 39 (3) 221-47 16 Marsh R F Bessen R A Lehmann S Hartsough G R Epidemiological and experimental studies on a new incident of transmissible mink encephalopathy J Gen Virol 1991 72 ( Pt 3) 589-94 17 Bessen R A Marsh R F Biochemical and physical properties of the prion protein from two strains of the transmissible mink encephalopathy agent J Virol 1992 66 (4) 2096-101 18 Johnson C J Herbst A Duque-Velasquez C Vanderloo J P Bochsler P Chappell R McKenzie D Prion protein polymorphisms affect chronic wasting disease progression PLoS One 2011 6 (3) e17450 19 Safar J Wille H Itri V Groth D Serban H Torchia M Cohen F E Prusiner S B Eight prion strains have PrP(Sc) molecules with different conformations Nat Med 1998 4 (10) 1157-65 20 Benjamini Y Hochberg Y Controlling the False Discovery Rate A Practical and Powerful Approach to Multiple Testing Journal of the Royal Statistical Society Series B (Methodological) 1995 57 (1) 289-300 21 Huang da W Sherman B T Lempicki R A Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources Nat Protoc 2009 4 (1) 44-57 22 Moody L R Herbst A J Aiken J M Upregulation of interferon-gamma-induced genes during prion infection J Toxicol Environ Health A 2009 74 (2-4) 146-53 23 Flicek P Amode M R Barrell D Beal K Brent S Carvalho-Silva D Clapham P Coates G Fairley S Fitzgerald S Gil L Gordon L Hendrix M Hourlier T Johnson N Kahari A K Keefe D Keenan S Kinsella R Komorowska M Koscielny G Kulesha E Larsson P Longden I McLaren W Muffato M Overduin B Pignatelli M Pritchard B Riat H S Ritchie G R Ruffier M Schuster M Sobral D Tang Y A Taylor K Trevanion S Vandrovcova J White S Wilson M Wilder S P Aken B L Birney E Cunningham F Dunham I Durbin R Fernandez-Suarez X M Harrow J Herrero J

130

Hubbard T J Parker A Proctor G Spudich G Vogel J Yates A Zadissa A Searle S M Ensembl 2012 Nucleic Acids Res 2012 40 (Database issue) D84-90 24 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein identification by searching sequence databases using mass spectrometry data Electrophoresis 1999 20 (18) 3551-67 25 Zybailov B Mosley A L Sardiu M E Coleman M K Florens L Washburn M P Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae J Proteome Res 2006 5 (9) 2339-47 26 Zhang Y Wen Z Washburn M P Florens L Refinements to label free proteome quantitation how to deal with peptides shared by multiple proteins Anal Chem 2010 82 (6) 2272-81 27 Kimberlin R H Cole S Walker C A Temporary and permanent modifications to a single strain of mouse scrapie on transmission to rats and hamsters J Gen Virol 1987 68 ( Pt 7) 1875-81 28 World Health Organization Global surveillance diagnosis and therapy of human transmissible spongiform encephalopathies report from a WHO consultation 1998 World Health Organization Geneva Switzerland 9-11 February 1998 1-26 29 Bartosik-Psujek H Archelos J J Tau protein and 14-3-3 are elevated in the cerebrospinal fluid of patients with multiple sclerosis and correlate with intrathecal synthesis of IgG J Neurol 2004 251 (4) 414-20 30 Saiz A Graus F Dalmau J Pifarre A Marin C Tolosa E Detection of 14-3-3 brain protein in the cerebrospinal fluid of patients with paraneoplastic neurological disorders Ann Neurol 1999 46 (5) 774-7 31 Sanchez-Juan P Green A Ladogana A Cuadrado-Corrales N Saanchez-Valle R Mitrovaa E Stoeck K Sklaviadis T Kulczycki J Hess K Bodemer M Slivarichova D Saiz A Calero M Ingrosso L Knight R Janssens A C van Duijn C M Zerr I CSF tests in the differential diagnosis of Creutzfeldt-Jakob disease Neurology 2006 67 (4) 637-43 32 Zerr I Bodemer M Racker S Grosche S Poser S Kretzschmar H A Weber T Cerebrospinal fluid concentration of neuron-specific enolase in diagnosis of Creutzfeldt-Jakob disease Lancet 1995 345 (8965) 1609-10 33 Sanchez J C Guillaume E Lescuyer P Allard L Carrette O Scherl A Burgess J Corthals G L Burkhard P R Hochstrasser D F Cystatin C as a potential cerebrospinal fluid marker for the diagnosis of Creutzfeldt-Jakob disease Proteomics 2004 4 (8) 2229-33 34 Miele G Seeger H Marino D Eberhard R Heikenwalder M Stoeck K Basagni M Knight R Green A Chianini F Wuthrich R P Hock C Zerr I Aguzzi A Urinary alpha1-antichymotrypsin a biomarker of prion infection PLoS One 2008 3 (12) e3870 35 DeMattos R B Bales K R Parsadanian M ODell M A Foss E M Paul S M Holtzman D M Plaque-associated disruption of CSF and plasma amyloid-beta (Abeta) equilibrium in a mouse model of Alzheimers disease J Neurochem 2002 81 (2) 229-36

131

Figure 1 Interspecies transmission of prion disease to rats End-stage clinical brain homogenates

were used to passage prion disease After one year of incubation animals were euthanized to

determine the extent of PrPres

accumulation Protease resistance PrP was only observed in those

animals infected with RML scrapie prions This material was serially passaged for two more

incubations before becoming rat-adapted as indicated by the shortening of the incubation period

132

Table 1 A list of selected proteins and mRNAs up-regulated in RAS CSF and brain tissue If

the protein is detected in the RAS CSF but not the control CSF the CSF expression is reported

with a infin If there is no change or data on certain genes related to an up regulated protein nd is

noted The mouse genomic data presented here was previously published22

Gene Protein Symbol Accession CSF

Expression

Rat

GEX

Mouse

GEX

14-3-3 protein zetadelta Ywhaz NP_037143 infin nd nd

14-3-3 protein epsilon Ywhae NP_113791 infin nd nd

14-3-3 protein gamma Ywhag NP_062249 infin nd nd

serine protease inhibitor A3N Serpin A3N NP_113719 54 nd 975

enolase 2 (gamma neuronal) Eno2 NP_647541 infin nd nd

granulin GRN NP_058809 62 364 184

macrophage colony-stimulating

factor 1 receptor

Csf1r NP_001025072 infin 293 205

cathepsin D CTSD NP_599161 infin 255 299

complement factor H Cfh NP_569093 376 234 nd

ribonuclease T2 RNAset2 NP_001099680 infin 302 nd

133

Figure 2 Accumulation of PrPSc

in rat adapted scrapie First second and third passage brain

homogenates were assayed for the presence of protease resistant PrP by immunoblot PrPSc

was

observed following phosphotungstenic acid enrichment at first passage By clinical disease in 2nd

and 3rd

passage rats PrPSc

had substantially accumulated

134

Figure 3 Histological appearance of rat adapted scrapie in the hippocampus at clinical disease

Infected animals showed intense immuno-staining for deposits of PrPSc

and GFAP expressing

astrocytes Spongiform change is an abundant feature in rat adapted scrapie

135

Figure 4 Scatter plot of gene expression profiling of rat adapted scrapie The expression of

individual genes from uninfected and infected animals were plotted to display up and down

regulation The dashed green line is no change Solid green lines are 2-fold changes in gene

expression

136

Figure 5 Quantitative relationship between orthologous pairs of two-fold up-regulated genes in

mouse and rat scrapie Genes up-regulated more than two fold were mapped to their orthologs

and the fold change was plotted Expression is log2 transformed

137

Figure 6 Comparison of genes upregulated in rat and mouse prion diseases Genes up-regulated

two fold in rodent scrapie were identified and the expression of their orthologs was determined

138

Figure 7 Venn diagram comparison of the CSF proteomics data between rat adapted scrapie

(RAS) model and control rats (A) Unique peptides from tryptic peptides produced for the

proteins identified (B) The total proteins identified including all isoforms within the protein

groups (C) The protein groups comparing only the top protein hit of the protein isoforms

showing very consistent protein identifications between RAS and control

139

Chapter 5

Investigation of the differences in the phosphoproteome between starved vs

glucose fed Saccharomyces cerevisiae

Adapted from ldquoInvestigation of the differences in the phosphoproteome between starved vs

glucose fed Saccharomyces cerevisiaerdquo Cunningham R Grunwald D Wellner D Conway M

Heideman W Li L In preparation

140

Abstract

This work explores comparative proteomics between starved and glucose fed

Saccharomyces cerevisiae (bakers yeast) Yeast depends on nutrients such as glucose to

survive and need to cycle between states of growth and quiescence Kinases such as protein

kinase A (PKA) and target of rapamycin kinase (TOR) are involved in the glucose response

Thus performing a large scale phosphoproteomic MS study can be highly beneficial to map the

signaling networks and cascades involved in glucose-dependent stimulated yeast growth Yeast

cell extract was digested and phosphopeptides were enriched by immobilized metal affinity

chromatography (IMAC) followed by two dimensional high pH-low pH reversed phase (RP)-RP

separation The low pH separation was infused directly into an ion trap mass spectrometer with

neutral loss electron-transfer dissociation (ETD)-triggered fragmentation to improve

phosphopeptide identifications In total 477 unique phosphopeptides are identified with 06

false discovery rate (FDR) and 669 unique phosphopeptides identified with 54 FDR This

study includes comparative phosphoproteins for one specific motif of PKA xxxKRKRxSxxxx

which is presented and differences between starved vs glucose fed are highlighted Phosphosite

validation is performed using a localization algorithm Ascore to provide more confident and

site-specific characterization of phosphopeptides

141

Introduction

Microorganisms such as Saccharomyces cerevisiae (yeast) cycle between growth when

nutrients are available and quiescence when nutrients are scarce When glucose is limited yeast

go into growth arrest state but when glucose is added growth quickly resumes Kinases such as

protein kinase A (PKA) and target of rapamycin (TOR) regulate growth in response to nutrient

conditions and have been well studied through transcriptional control1-4

Yeast execute large

transcriptome alterations in response to changing environmental growth conditions5 6

Gene

regulation by glucose introduction in yeast has been studied including genes used for growth on

alternative carbon sources and activation of genes coding for glucose transport and protein

synthesis7-10

Response to nutrients for survival is not limited to yeast biology and indeed all

living creatures both prokaryotes and multicellular eukaryotes like mammals require nutrient

responsiveness and coordinating cellular functions to survive

With regulation of certain genes well studied by glucose introduction the mechanism and

global protein modulation caused by glucose introduction remain unknown6 Large-scale

phosphorylation analysis has been previously performed in yeast using mass spectrometry11-14

Mapping the phosphoproteome in yeast transitioning from quiescence to growth conditions to

better understand the roles of phosphorylation in orchestrating growth is needed The

phosphorylation state during growth or starvation could be used to engineer cells to drive ectopic

activity (or inhibition) to promote growth and ethanol production on non-native sugars like

xylose

It has been reported that the phosphorylation state can be affected by the introduction of

glucose to carbon-starved yeast15

and phosphorylation plays a significant role in the cell cycle

and signal transduction16

Specifically O-Phosphorylation can function as a molecular switch by

142

changing the structure of a protein via alteration of the chemical nature of an amino acid for

serine threonine or tyrosine It has been reported that up to 30 of the proteome can undergo

phophorylation17

Mass spectrometry has evolved as a powerful tool to accomplish phosphosite

mapping using shotgun proteomics With available technology tens of thousands of

phosphorylation sites can be mapped to amino acids on thousands of proteins using shotgun

proteomics18-20

Mass spectrometry can offer sensitive automated non-targeted global analysis of

phosphorylation events in proteomic samples but in any large scale phosphoproteomic

investigation enrichment of phosphoproteinspeptides is required First phosphorylation is

usually a sub-stoichiometric process where only a percentage of all protein copies are

phosphorylated21

Various enrichment methods have been used for phosphopeptide enrichment

including metal oxide affinity chromatography (MOAC)22

such as TiO223

immobilized metal

affinity chromatography (IMAC)12 24 25

electrostatic repulsion-hydrophilic interaction

chromatography (ERLIC)26

and immunoaffinity of tyrosine phosphorylation27 28

After

enrichment MS analyses of phosphorylated peptides are still challenging due to ion suppression

from non-phosphorylated peptides

Even after phosphopeptide enrichment further sample preparation is needed for large

scale proteomic experiments Additional fractionation can increase protein coverage of a

sample by over ten fold such as MudPIT29

(multidimensional protein identification technology)

In online MudPIT a sample is injected onto a strong cation exchange (SCX) column coupled to

a RP column Successive salt bumps followed by low pH gradients provide the separation of

peptides in two dimensions Since the phosphate groups on phosphopeptides have a low pKa

value due to being more acidic then their unmodified counterparts they tend to elute earlier and

143

disproportionally in ion exchange chromatography like SCX Alternatively reversed-phase

reversed-phase (RP-RP) separation has been proposed as an alternative to MudPIT for offline

two dimensional (2D) separation30

One of the caveats of 2D separation is the potential for

wasted mass spectrometry time from early and late fractions having very few peptides present

all while having too much sample for middle fractions One simple method to reduce these

ldquowasted fractionsrdquo is to combine early or late fractions with the middle fractions to reduce MS

runs with little peptide content to analyze thus shortening the overall analysis time31

In addition the labile phosphorylation group has a large propensity to undergo cleavage

during collision induced dissociation (CID) producing a neutral loss The neutral loss can

produce insufficient backbone fragment ions for MSMS identification32

A solution to neutral

loss fragmentation in phosphopeptides is MS3 using CID to produce improved backbone

fragmentation13 14 33

An alternative fragmentation method to CID for fast sampling ion traps is

electron transfer dissociation (ETD)34-36

ETD produces a more uniform back-bone cleavage

where the cation peptide receives an electron from a low affinity radical anion37

The transferred

electron induces a dissociation at the N-Cα bond to produce c- and zbull-type product ions while

retaining the labile post-translational modifications (PTM) such as phosphorylation bound to the

product ions38

The Bruker amaZon ETD ion trap mass spectrometer has the potential to trigger

ETD fragmentation after a labile PTM produces a neutral loss from CID fragmentation This

method is termed neutral loss-triggered ETD fragmentation and provides a complementary

fragmentation pathway to labile poor fragmenting phosphorylated peptides

In this work we provide a qualitative comparative list of yeast phosphopeptides observed

in glucose fed vs glucose starved conditions

144

Experimental

EXPERIMENTAL DETAILS

Chemicals

Acetonitrile (HPLC grade) ammonium hydroxide (ACS plus certified) urea (gt99)

sodium chloride (997) ammonium bicarbonate (certified) Optima LCMS grade acetonitrile

Optima LCMS grade water and EDTA disodium salt (99) were purchased from Fisher

Scientific (Fair Lawn NJ) Deionized water (182 MΩcm) was prepared with a Milli-Q

Millipore system (Billerica MA) DL-Dithiothreitol (DTT) and sequencing grade modified

trypsin were purchased from Promega (Madison WI) Formic acid (ge98) was obtained from

Fluka (Buchs Switzerland) Iodoacetamide (IAA) ammonium formate (ge99995) Trizma

hydrochloride (Tris HCl ge990) Triton X-100 (laboratory grade) and iron (III) chloride

hexahydrate (97) were purchased from Sigma-Aldrich (Saint Louis MO) Sodium dodecyl

sulfate (SDS) (998) was purchased from US Biological (Marblehead MA) Nickel

nitrilotriacetic acid (Ni-NTA) magnetic agarose beads were purchased from Qiagen (Valencia

CA) Autoclaved yeast peptone dextrose (YPD) media was prepared in 1 L of deionized water

using 10 g yeast extract 20 g Peptone from Becton Dickinson and Company (Sparks MD) and

20 g D-(+)-Glucose (ge995) purchased from Sigma-Aldrich (Saint Louis MO)

Modified Mary Miller Yeast Protein Isolation

The yeast culture was prepared and protein extraction was performed using a modified

Mary Miller protocol39

Briefly yeast strain s288c was inoculated with YPD media and shook

for 72 hours to allow the yeast to be glucose starved After the 72 hours the yeast culture was

partitioned into two flasks To one flask glucose was added at 2 of the final concentration and

allowed to incubate and shake for 10 minutes Then 100 optical density units (ODU) of the yeast

145

culture were collected from each flask Each yeast sample was spun down in a Beckman-Coulter

J6B centrifuge (Indianapolis IN) at 2500xg for three minutes The media was aspirated and the

tubes were flash frozen in liquid nitrogen and stored at -80oC The yeast pellets were thawed on

ice and resuspended in 1 mL of lysis buffer (50 mM Tris HCl 01 triton x-100 05 SDS

pH=80) and 1X protease and phosphatase inhibitor cocktail from Thermo Scientific (Rockford

IL) and transferred to an Eppendorf tube To the yeast 200 microL of glass beads were added and

amalgamated for 25 minutes at 4oC The sample was inverted and the Eppendorf tube was

pierced with a hot 23 gauge needle through the bottom and the tube was placed atop a 5 mL

culture tube and centrifuge in the Beckman-Coulter J6B at 2500xg for three minutes at 4oC to

collect the liquid containing the yeast cells while the glass beads remain trapped in the

Eppendorf tubes The protein lysate was then centrifuged at 3000xg for five minutes at 4oC and

the supernatant was collected and stored at -80oC

Preparation of tryptic digests

The Saccharomyces cerevisiae protein cell extract for fed or starved were analyzed with a

BCA protein assay kit (Thermo Scientific Rockford IL) and 2 mg of protein was added to four

parts -20oC acetone and allowed to precipitate the protein for 1 hour at -20

oC The samples were

then centrifuged at 14000xg for 5 minutes and the supernatant was removed and the protein

pellet was reconstituted in 360 microl of 8 M urea To each sample 20 microL of 050 M DTT was

added and allowed to incubate at 37oC for 45 minutes After incubation 54 microL of 055 M IAA

was added for carbamidomethyl of cysteine and the sample was allowed to incubate for 15

minutes in the dark To quench the IAA 20 microL of 050 M DTT was added and allowed to react

for 10 minutes To perform trypsin digestion 1400 microL of 50 mM NH4HCO3 was then added

along with 20 microg trypsin to each sample Digestion was performed overnight at 37oC and

146

quenched by addition of 25 microL of 10 formic acid For each sample the tryptic peptides were

then subjected to solid phase extraction using a Hypersep C18 100 mg solid phase extraction

(SPE) column (Thermo Scientific Bellefonte PA) Peptides were eluted with 50 ACN in

01 formic acid concentrated and reconstituted in 500 microL of 80 ACN and 01 formic acid

Phosphopeptide enrichment using immobilized metal affinity chromatography (IMAC)

One ml of Ni-NTA was washed three times with 800 microL of H2O and then the Ni was

removed with 800 microL of 100 mM EDTA in 50 mM ammonium bicarbonate and vortexed for 30

minutes The supernatant was removed and the NTA was washed with 800 microL of H2O three

times followed by addition of 800 microL 10 mM FeCl3 hexahydrate and vortexed for 30 minutes

The Fe-NTA was then washed three times with 800 microL of H2O and once with 80 ACN in 01

formic acid before being combined with the cell extract for phosphopeptide enrichment and

vortexed for 40 minutes The sample was washed three times with 200 microL of 80 ACN in 01

formic acid before elution with two aliquots of 200 microL of 5 ammonium hydroxide in 5050

ACNH2O with 15 minutes of vortexing The enriched phosphopeptides were then dried down

with a Thermo Scientific Savant SpeedVac (SVC100 Waltham MA) and reconstituted in 55 microL

25mM ammonium formate pH=75

First dimension neutral pH separation

Individually 50 microL of each yeast phosphopeptide enriched sample was injected onto a

Phenomenex Gemini-NX 3microm C18 110 Aring 150 x 2 mm column (Torrance CA) The Gemini

column was coupled to a Phenomenex SecurityGuard Gemini C18 2 mm guard cartridge

(Torrance CA) Mobile phase A consisted of 25 mM ammonium formate at pH=75 and mobile

phase B consisted of 25 mM ammonium formateACN with a 19 ratio respectively at pH=75

The column was operated at 50oC at a flow rate of 05 mLmin The flow profile was 2-30 B

147

over 65 minutes and then 5 min at 100 B A total of 22 fractions were collected every 3

minutes and the fractions were combined as follows 1 with 12 2 with 13 etc to 11 with 22

The combined fractions were dried down and reconstituted in 25 microL of 01 formic acid

RP nanoLC separation

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN Injections consisted of 5 microL from each fraction onto an Agilent Technologies

Zorbax 300 SB-C18 5 microm 5 x 03 mm trap cartridge (Santa Clara CA) at a flow rate of 5

microLmin for 5 minutes at 95 A 5 B Separation was performed on a Waters 3 microm Atlantis

dC18 75 microm x 150 mm (Milford MA) using a gradient from 5 to 45 mobile phase B at 250

nLmin over 90 minutes at room temperature Emitter tips were pulled from 75 microm inner

diameter 360 microm outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using

an in-house model P-2000 laser puller (Sutter Instrument Co Novato CA)

Mass spectrometry data acquisitions

An ion-trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with an on-line nanospray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Eksigent nanoLC software (Dublin CA Version 30 Beta Build 080715) for MS acquisition for

all experiments Smart parameter setting (SPS) was set to 700 mz compound stability and trap

drive level was set at 100 Optimization of the nanospray source resulted in dry gas

temperature of 125oC dry gas flow of 40 Lmin capillary voltage of -1300 V end plate offset of

-500 V MSMS fragmentation amplitude of 10V and SmartFragmentation set at 30-300

148

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions An additional mode of MSMS fragmentation electron transfer dissociation

(ETD) was triggered on the precursor ion when a neutral loss was observed in CID

fragmentation from phosphoric acid H3PO4 (Δmz of 49 and 327 for 2+ and 3+ charge states

respectively) or for metaphosphoric acid (HPO3) (Δmz of 40 and 267 for 2+ and 3+ charge

states respectively) For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms rolling average 2 acquisition range of 300-1500 mz

and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS generation the ICC target

was set to 300000 maximum accumulation time 5000 ms two spectral averages acquisition

range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required to prevent artificial data

reduction Identification of peptides were performed using Mascot40

(Version 23 Matrix

Science London UK) Database searching was performed against SwissProt Saccharomyces

cerevisiae database (version 575) Mascot search parameters were as follows Allowed missed

cleavages 2 enzyme trypsin fixed modification carboxymethylation (C) variable

modifications oxidation (M) and phosphorylation (STY) peptide tolerance plusmn12 Da maximum

number of 13

C 1 MSMS tolerance peptide decoy (Mascot) checked plusmn05 Da instrument type

149

ESI-Trap Results from the Mascot search was exported as DAT files for analysis by Scaffold 3

and Scaffold PTM

Scaffold and Ascore data processing

Scaffold 3 (Proteome Software Portland OR)was used for MSMS proteomics data

comparison and false discovery rate analysis The DAT files were loaded into Scaffold 3 and

the fractions for the two dimensional fractionation were combined The resulting biological

triplicates for starved and glucose fed yeast were filtered with a 1 false discovery rate (FDR)

on the protein level (Supplemental Table 1) Due to the inherent poor fragmentation of

phosphopeptides a lenient FDR level may still yield biologically relevant data and thus a 54

FDR was generated for phosphopeptides (Supplemental Table 2) For a more conservative list of

phosphopeptides with a 06 FDR was generated from Scaffold 3 (Supplemental Table 3) FDR

analysis is sufficient at preventing poor data from being reported but does not prevent false

phosphosite identification in phosphopeptides To provide confidence in site identification

Scaffold PTM was used to perform Ascore41

analysis Ascore uses an algorithm to score the

probability of the phosphosite from a phosphopeptide identified by a database searching

algorithm such as Mascot Ascore can be freely accessed at httpascoremedharvardedu

Cell collection RNA isolation and microarray data analysis

All experiments were performed in biological duplicates Cell samples (10 ODU) were

taken at starved and fed states spun at 5000xg for 2 minutes at 30 oC the supernatant was

removed and the pellets snap frozen using liquid nitrogen RNA was isolated using the Epicentre

MasterPure Yeast Purification Kit (Epicentre Technologies) and quality was checked using gel

electrophoresis cRNA synthesis was carried out using the GeneChipreg Expression 3

Amplification One-Cycle Target Labeling and Control Reagents kit (Affymetrix) All

150

experiments followed the manufactures instructions cRNA samples were hybridized to

GeneChip Yeast Genome 20 Arrays for 16 hours Arrays were washed stained and scanned

according the manufactures recommendations Affymetrix CEL files were RMA normalized

with R and the Bioconductor Suite Data analysis was performed within TIGR Multiexperiment

Viewer v451 in-house Perl scripting R and Bioconductor

Results

Sample preparation for shotgun proteomics

As discussed in the introduction the purpose of this study is to provide an exploratory list

of qualitative differences in the phosphoproteome between starved vs glucose fed yeast After

yeast cell lysate production a substantial amount of sample preparation is performed to enhance

the depth of the phosphoproteome coverage A workflow of the steps following cell lysis is

outlined in Figure 1A The yeast proteins are isolated via acetone precipitation followed by

digestion with trypsin to produce peptides Phosphopeptides are intermixed with the entire

tryptic peptide sample and therefore Fe-NTA IMAC is used to enrich the phosphopeptides To

improve upon the number of phosphopeptides we then performed two dimensional separation

with high pH RP followed by low pH RP C18 and infused the eluent directly onto an ion trap

mass spectrometer Figure 1B show an improved technique for the first dimension of separation

to combine the early eluting and late eluting fractions from the first phase of separation to reduce

overall MS analysis time The combination of fractions 1 and 12 2 and 13 etc essentially

improves the duty cycle of the mass spectrometer by ensuring sufficient peptide content is

injected onto a low pH nanoLC RP C18 column

ETD-triggered mass spectrometry

151

In the present study labile phosphorylation can lead to non-informative neutral loss

During MS scanning mode the instrument will choose the 6 most abundant peaks with correct

isotopic distribution and select them for MSMS CID fragmentation During CID fragmentation

it is common for labile PTMs such as phosphorylation to undergo neutral loss and thus limited

informative b and y-type ions are formed Alternatively ETD fragmentation can be used on

specific MSMS scans which produce a neutral loss indicative of phosphorylation (98Daz or

80Daz) The subsequent ETD fragmentation on these putative phosphopeptides will lead to

uniform backbone cleavage resulting in confident identification of phosphopeptides with site-

specific localization during MSMS It is important to note that CID fragmentation still produces

very informative fragmentation for phosphorylation but ETD provides an orthogonal

fragmentation pathway to further increase the phosphoproteome coverage Additionally the

duty cycle for ETD fragmentation is roughly half that of CID therefore about half as many

potential peptides would be fragmented and sequenced if the instrument was using ETD

fragmentation exclusively

Protein Data

Even with phosphopeptide enrichment it is inevitable that non-phosphopeptides will also

be identified All data were searched with Mascot and in total over 1000 proteins were identified

with a 1 FDR rate and are listed in Supplemental Table 1 The proteins listed in Supplemental

Table 1 include both phosphoproteins and non-phosphorylated proteins In addition to the

proteins identified in the fed and starved states the unique peptides and spectral counts are also

listed for reference The phosphopeptides are specifically listed with a 54 FDR rate in

Supplemental Table 2 and comparisons between starved and fed with Ascore analysis are listed

for every phosphopeptide identified A higher confidence of phosphopeptide identification is

152

sometimes required before investing in time consuming biological experiments so a list of

phosphopeptides identified in starved and fed was taken from a setting directing Scaffold to

produce a 05 FDR which resulted in the generated FDR of 06 FDR and listed in

Supplemental Table 3

A summary of specific phosphorylation sites are listed in Table 1 with a 06 FDR and

Table 2 with a 54 FDR Each table gives totals for unique phosphopeptides identified having

an Ascore localization score ge80 without Ascore and phosphorylation events on each unique

peptides As expected the majority of phosphorylation events (over 50) occurred on serine

whereas fewer phosphorylation events (~10) occurred on tyrosine In addition the vast

majority of phosphorylation events were single phosphorylation (ge65) with very few

identifications having more than two phosphosites per peptide For specific phosphopeptide

identification and subsequent phosphoprotein identification refer to Supplemental Table 2 and 3

Discussion

Transcriptional response to glucose feeding

Yeast responds to the repletion of glucose after glucose-depletion by broad

transcriptional changes to induce growth Roughly one-third of the yeast genome is altered by at

least two fold when glucose is introduced as shown in Figure 2 Figure 2 is a heat map from a

microarray analysis showing 1601 genes up regulated and 1457 genes down regulated after

addition of glucose compared to the starved state The arbitrary cut-offs for these values were as

follows two fold induction after nutrient repletion and a false discovery rate (q-value) of 001

Red color in the heat map indicates a particular gene is up-regulated in the fed state compared to

the starved state Alternatively genes coded in green are less expressed in the fed state

compared to the starved condition The intensity of the green or red colors is indicative of the

153

intensity of the fold change in gene expression These large transcriptional changes after glucose

repletion drive and complement the current phosphoproteomic study

PKA motif analysis

One benefit of a large scale phosphoproteomics experiment is to examine the different

phosphopeptides from known motifs for specific kinases PKA and TOR are responsible for the

majority of the transcriptional response and thus PKA is a good target for motif analysis Figure

3 shows the PKA motif sequence xxxRKRKxSxxxxxx with the phosphorylation occurring on

the serine Other than F16P (fructose-16-bisphosphatase) all the proteins listed are unique to the

starved or fed samples A motif sequence will inevitably show up by random chance in any

analysis For this study the control for motif analysis uses the swissprot protein list for the

entire yeast proteome for the background Compared to background this specific PKA kinase

from Figure 3 is up-regulated by 264 fold when compared to the background One interesting

protein emerged from this motif analysis in the fed sample but not the starved sample is

Ssd1which is implicated in the control of the cell cycle in G1 phase42

Ssd1 also is

phosphorylated by Cbk1 and Cbk1 has been proposed to inhibit Ssd143

and provides an

intriguing target for future studies on starved vs glucose fed yeast growth

Localization of the phosphorylation sites

When a phosphopeptide contains any number of serine threonine or tyrosine amino

acids the localization of the phosphosite can sometimes be ambiguous Database searches used

to identify peptides like Mascot do not provide any probability for localization of correct

phosphorylation sites Ascore is an algorithm that uses the same MSMS spectra as Mascot but

instead of comparing theoretical peptide MSMS to experimental spectra Ascore searches for

informative a b or y-type peptide fragments giving evidence for phosphosite The Scaffold

154

program adds a localization probability to the Ascore values and the values are listed in

Supplemental Table 2 and 3 Figure 4 has a representative Ascore mass spectra assigning the

peaks identified and providing evidence that the phosphorylation site occurs at the threonine

instead of the serine Incorporating Ascore into this study provides a layer of validation for

putative phosphosite identification

Plasma Membrane 2-ATPase

A previous study identified and localized phosphorylation sites on plasma membrane 1-

ATPase after glucose was introduced to starved yeast15

In the current study PMA2 (plasma

membrane ATPase 2) was identified in glucose fed and not starved samples The doubly

threonine phosphorylated peptide identified in fed yeast is PMA2 with amino acid sequence

IKMLTGDAVGIAKETCR (583-599) whereas the homologous peptide in PMA1 has the exact

same amino acid sequence except for the first isoleucine substituted for valine

VKMLTGDAVGIAKETCR (554-570 not observed in data) PMA2 was observed in the 06

FDR list of phosphopeptides and has an Ascore localization probably of 100 A plant study

showed that PMA2 phosphorylation level was higher in early growth phase than when in

stationary phase44

In addition PMA2 expression in yeast permits the growth of yeast and

threonine phosphorylation has been reported on Thr-95545

The identification of PMA2 in the

fed glucose cell extract provides an interesting target for future study on the molecular

mechanisms involved in regulation growth in starved vs glucose fed yeast

Conclusion

In conclusion this work provides a qualitative comparison in the phosphoproteome

between starved and glucose fed yeast A large scale IMAC enrichment of yeast cell lysate

followed by 2-D separation provided a rich source of phosphopeptides for neutral loss triggered

155

ETD analysis using mass spectrometry A specific motif for PKA is highlighted to show the

differences in proteins identified between starved vs fed conditions In total 477 unique

phosphopeptides are identified with 06 FDR and 669 unique phosphopeptides identified with

54 FDR Phosphosite validation is performed using a localization algorithm Ascore to

provide further confidence on the site-specific characterization of these phosphopeptides The

proteins Ssd1 and PMA2 emerged as potential intriguing targets for more in-depth studies on

protein phosphorylation involved in glucose response

Supplemental Tables 1 2 and 3 are available upon request

References

1 Martinez M J Roy S Archuletta A B Wentzell P D Anna-Arriola S S

Rodriguez A L Aragon A D Quinones G A Allen C Werner-Washburne M Genomic

analysis of stationary-phase and exit in Saccharomyces cerevisiae gene expression and

identification of novel essential genes Mol Biol Cell 2004 15 (12) 5295-305

2 Radonjic M Andrau J C Lijnzaad P Kemmeren P Kockelkorn T T van Leenen

D van Berkum N L Holstege F C Genome-wide analyses reveal RNA polymerase II

located upstream of genes poised for rapid response upon S cerevisiae stationary phase exit Mol

Cell 2005 18 (2) 171-83

3 Slattery M G Heideman W Coordinated regulation of growth genes in

Saccharomyces cerevisiae Cell Cycle 2007 6 (10) 1210-9

4 Wang Y Pierce M Schneper L GAtildefrac14ldal C G k e Zhang X Tavazoie S

Broach J R Ras and Gpa2 Mediate One Branch of a Redundant Glucose Signaling Pathway in

Yeast PLoS Biol 2004 2 (5) e128

5 Newcomb L L Hall D D Heideman W AZF1 is a glucose-dependent positive

regulator of CLN3 transcription in Saccharomyces cerevisiae Mol Cell Biol 2002 22 (5) 1607-

14

6 Newcomb L L Diderich J A Slattery M G Heideman W Glucose regulation of

Saccharomyces cerevisiae cell cycle genes Eukaryot Cell 2003 2 (1) 143-9

7 Carlson M Glucose repression in yeast Curr Opin Microbiol 1999 2 (2) 202-7

8 Gancedo J M Yeast carbon catabolite repression Microbiol Mol Biol Rev 1998 62

(2) 334-61

9 Johnston M Feasting fasting and fermenting Glucose sensing in yeast and other cells

Trends Genet 1999 15 (1) 29-33

156

10 Warner J R The economics of ribosome biosynthesis in yeast Trends Biochem Sci

1999 24 (11) 437-40

11 Li X Gerber S A Rudner A D Beausoleil S A Haas W Villen J Elias J E

Gygi S P Large-scale phosphorylation analysis of alpha-factor-arrested Saccharomyces

cerevisiae J Proteome Res 2007 6 (3) 1190-7

12 Ficarro S B McCleland M L Stukenberg P T Burke D J Ross M M

Shabanowitz J Hunt D F White F M Phosphoproteome analysis by mass spectrometry and

its application to Saccharomyces cerevisiae Nat Biotechnol 2002 20 (3) 301-5

13 Gruhler A Olsen J V Mohammed S Mortensen P Faergeman N J Mann M

Jensen O N Quantitative phosphoproteomics applied to the yeast pheromone signaling

pathway Mol Cell Proteomics 2005 4 (3) 310-27

14 Peng J Schwartz D Elias J E Thoreen C C Cheng D Marsischky G Roelofs

J Finley D Gygi S P A proteomics approach to understanding protein ubiquitination Nat

Biotechnol 2003 21 (8) 921-6

15 Lecchi S Nelson C J Allen K E Swaney D L Thompson K L Coon J J

Sussman M R Slayman C W Tandem phosphorylation of Ser-911 and Thr-912 at the C

terminus of yeast plasma membrane H+-ATPase leads to glucose-dependent activation J Biol

Chem 2007 282 (49) 35471-81

16 Cohen P The regulation of protein function by multisite phosphorylation--a 25 year

update Trends Biochem Sci 2000 25 (12) 596-601

17 Kalume D E Molina H Pandey A Tackling the phosphoproteome tools and

strategies Curr Opin Chem Biol 2003 7 (1) 64-9

18 Nagaraj N DSouza R C Cox J Olsen J V Mann M Feasibility of large-scale

phosphoproteomics with higher energy collisional dissociation fragmentation J Proteome Res

2010 9 (12) 6786-94

19 Olsen J V Vermeulen M Santamaria A Kumar C Miller M L Jensen L J

Gnad F Cox J Jensen T S Nigg E A Brunak S Mann M Quantitative

phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis Sci

Signal 2010 3 (104) ra3

20 Breitkopf S B Asara J M Determining In Vivo Phosphorylation Sites Using Mass

Spectrometry In Current Protocols in Molecular Biology John Wiley amp Sons Inc 2012

21 Steen H Jebanathirajah J A Rush J Morrice N Kirschner M W Phosphorylation

analysis by mass spectrometry myths facts and the consequences for qualitative and

quantitative measurements Mol Cell Proteomics 2006 5 (1) 172-81

22 Kweon H K Hakansson K Metal oxide-based enrichment combined with gas-phase

ion-electron reactions for improved mass spectrometric characterization of protein

phosphorylation J Proteome Res 2008 7 (2) 749-55

23 Larsen M R Thingholm T E Jensen O N Roepstorff P Jorgensen T J Highly

selective enrichment of phosphorylated peptides from peptide mixtures using titanium dioxide

microcolumns Mol Cell Proteomics 2005 4 (7) 873-86

24 Kokubu M Ishihama Y Sato T Nagasu T Oda Y Specificity of immobilized

metal affinity-based IMACC18 tip enrichment of phosphopeptides for protein phosphorylation

analysis Anal Chem 2005 77 (16) 5144-54

25 Swaney D L Wenger C D Thomson J A Coon J J Human embryonic stem cell

phosphoproteome revealed by electron transfer dissociation tandem mass spectrometry Proc

Natl Acad Sci U S A 2009 106 (4) 995-1000

157

26 Hao P Guo T Sze S K Simultaneous analysis of proteome phospho- and

glycoproteome of rat kidney tissue with electrostatic repulsion hydrophilic interaction

chromatography PLoS One 2011 6 (2) e16884

27 Rush J Moritz A Lee K A Guo A Goss V L Spek E J Zhang H Zha X

M Polakiewicz R D Comb M J Immunoaffinity profiling of tyrosine phosphorylation in

cancer cells Nat Biotechnol 2005 23 (1) 94-101

28 Ficarro S Chertihin O Westbrook V A White F Jayes F Kalab P Marto J A

Shabanowitz J Herr J C Hunt D F Visconti P E Phosphoproteome analysis of

capacitated human sperm Evidence of tyrosine phosphorylation of a kinase-anchoring protein 3

and valosin-containing proteinp97 during capacitation J Biol Chem 2003 278 (13) 11579-89

29 Washburn M P Wolters D Yates J R 3rd Large-scale analysis of the yeast

proteome by multidimensional protein identification technology Nat Biotechnol 2001 19 (3)

242-7

30 Dowell J A Frost D C Zhang J Li L Comparison of two-dimensional

fractionation techniques for shotgun proteomics Anal Chem 2008 80 (17) 6715-23

31 Song C Ye M Han G Jiang X Wang F Yu Z Chen R Zou H Reversed-

phase-reversed-phase liquid chromatography approach with high orthogonality for

multidimensional separation of phosphopeptides Anal Chem 2010 82 (1) 53-6

32 Palumbo A M Smith S A Kalcic C L Dantus M Stemmer P M Reid G E

Tandem mass spectrometry strategies for phosphoproteome analysis Mass Spectrom Rev 2011

30 (4) 600-25

33 Beausoleil S A Jedrychowski M Schwartz D Elias J E Villen J Li J Cohn M

A Cantley L C Gygi S P Large-scale characterization of HeLa cell nuclear

phosphoproteins Proc Natl Acad Sci U S A 2004 101 (33) 12130-5

34 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and

protein sequence analysis by electron transfer dissociation mass spectrometry Proc Natl Acad

Sci U S A 2004 101 (26) 9528-33

35 Coon J J Syka J E P Schwartz J C Shabanowitz J Hunt D F Anion

dependence in the partitioning between proton and electron transfer in ionion reactions

International Journal of Mass Spectrometry 2004 236 (1acirceuroldquo3) 33-42

36 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 2011 10 (9) 4219-29

37 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-

site identity and position on electron-transfer dissociation of polypeptide cations J Am Chem Soc

2007 129 (40) 12232-43

38 Coon J J Collisions or electrons Protein sequence analysis in the 21st century Anal

Chem 2009 81 (9) 3208-15

39 Miller M E Cross F R Distinct subcellular localization patterns contribute to

functional specificity of the Cln2 and Cln3 cyclins of Saccharomyces cerevisiae Mol Cell Biol

2000 20 (2) 542-55

40 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

158

41 Beausoleil S A Villen J Gerber S A Rush J Gygi S P A probability-based

approach for high-throughput protein phosphorylation analysis and site localization Nat

Biotechnol 2006 24 (10) 1285-92

42 Sutton A Immanuel D Arndt K T The SIT4 protein phosphatase functions in late

G1 for progression into S phase Mol Cell Biol 1991 11 (4) 2133-48

43 Jansen J M Wanless A G Seidel C W Weiss E L Cbk1 regulation of the RNA-

binding protein Ssd1 integrates cell fate with translational control Curr Biol 2009 19 (24)

2114-20

44 Kanczewska J Marco S Vandermeeren C Maudoux O Rigaud J L Boutry M

Activation of the plant plasma membrane H+-ATPase by phosphorylation and binding of 14-3-3

proteins converts a dimer into a hexamer Proc Natl Acad Sci U S A 2005 102 (33) 11675-80

45 Maudoux O Batoko H Oecking C Gevaert K Vandekerckhove J Boutry M

Morsomme P A plant plasma membrane H+-ATPase expressed in yeast is activated by

phosphorylation at its penultimate residue and binding of 14-3-3 regulatory proteins in the

absence of fusicoccin J Biol Chem 2000 275 (23) 17762-70

159

Figure 1 The general workflow indicating the major steps involved in sample collection

sample processing mass spectrometric data acquisition and analysis of comparative

phosphoproteomics for starved vs glucose fed yeast (A) The high pH RP separation

procedure for combining fractions to reduce low peptide containing fractions from the

UV-VIS trace is also shown (B)

160

Figure 2 Heat map for the global transcriptional response to glucose fed vs starved samples

S288C cells starved for glucose until growth was arrested and subsequently glucose was added

161

Samples for microarray analysis were taken at the starved state and 60 minutes after glucose was

added The heat map shows the fed log2 fold change for each gene relative to the starved state

across the genome performed in biological replicate (A) Black indicates no change in

expression level while red indicates higher expression for the fed relative to the starved state

Green indicates higher expression for the starved state compared to the fed state (Adapted from

Dr Michael Conways Thesis)

162

Figure 3 The motif for protein kinase A (PKA) that was observed for fed vs starved which is

xxxRKRKxSxxxxxx with the phosphorylation occurring on the serine The motif occurred at a

rate 264 fold higher than the yeast proteome used for background In addition one protein was

observed in both starved and fed with accession identification of F16P (Fructose-16-

bisphosphatase)

163

06 FDR phosphopeptide analysis

Table 1 The number of phosphopeptides identified with a 06 FDR rate for starved and

glucose fed yeast Only unique phosphopeptides are included in the numbers listed Ascore

localization probability ge80 and no Ascore value which totals all phosphopeptides regardless

of Ascore value even if the Ascore localization value was 0 are shown for starved or fed

samples The number of phosphorylation events on each unique peptide is also included The

vast majority of the phosphopeptides identified have a single phosphosite

Starved Fed All

Ascore ge80 score

unique

STY 164 153 317

S 87 (530) 82 (536) 169 (533)

T 60 (366) 55 (359) 115 (363)

Y 17 (104) 16 (105) 33 (104)

Unique no Ascore

STY 242 235 477

S 131 (541) 133 (566) 264 (553)

T 86 (355) 78 (332) 164 (344)

Y 25 (103) 24 (102) 49 (103)

Phosphorylation events

on each unique peptide

1 102 113 187

2 36 40 68

3 12 11 22

4 or more 8 3 11

164

54 FDR phosphopeptide analysis

Starved Fed All

Ascore ge80 score

unique

STY 217 217 434

S 115 (530) 113 (521) 228 (525)

T 78 (359) 78 (359) 156 (359)

Y 24 (111) 26 (120) 50 (115)

Unique no Ascore

STY 337 332 669

S 193 (573) 180 (542) 373 (558)

T 111 (329) 116 (349) 227 (339)

Y

Phosphorylation events

on each unique peptide

1

2

3

4 or more

33 (98)

135

56

16

11

36 (108)

169

55

14

3

69 (103)

280

100

27

13

Table 2 The number of phosphopeptides identified with a 54 FDR rate for starved and

glucose fed yeast Only unique phosphopeptides are included in the numbers listed Ascore

localization probability ge80 and no Ascore value which totals all phosphopeptides regardless

of Ascore value even if the Ascore localization value was 0 are shown for starved or fed

samples The number of phosphorylation events on each unique peptide is also included The

vast majority of the phosphopeptides identified have a single phosphosite

165

Figure 4 Ascore validation of the phosphopeptide LSLAtKK from the protein SGM1 (slow

growth on galactose and mannose protein 1) with 100 localization probability observed

in only fed but not starved yeast samples Ascorersquos algorithm identifies a b and y-type

ions and looks to identify peaks that provide evidence for a specific phosphorylation site

For this example Ascore assigns this phosphosite to threonine (T5) instead of the serine

(S2) The intense peaks at 4024 mz and 5250 mz are not identified by any a b or y-

type ions From the ladder sequence of the peptide shown numerous ions indicate the

threonine is phosphorylated while the serine is not Among these ions used for

localization are b2 y2 y5+H2O y3 y4 and y5

166

Chapter 6

Use of electron transfer dissociation for neuropeptide sequencing and

identification

Adapted from ldquoDiscovery and Characterization of the Crustacean Hyperglycemic Hormone

Precursor Related Peptides (CPRP) and Orcokinin Neuropeptides in the Sinus Glands of the Blue

Crab Callinectes sapidus Using Multiple Tandem Mass Spectrometry Techniquesrdquo Hui L

Cunningham R Zhang Z Cao W Jia C Li L J Proteome Res 2011 10(9) 4219-29

167

Abstract

The crustacean Callinectes sapidus sinus gland (SG) is a well-defined neuroendocrine site that

produces numerous hemolymph-borne agents including the most complex class of endocrine

signaling moleculesmdashneuropeptides One such peptide is the crustacean hyperglycemic hormone

(CHH) precursor-related peptide (CPRPs) which was not previously sequenced Electron

transfer dissociation (ETD) was used for sequencing of intact CPRPs due to their large size and

high charge state In addition ETD was used to sequence the sulfated peptide Cholecystokinin

CCK-like Homarus americanus using a salt adduct Collectively these two examples

demonstrated the utility of ETD in sequencing neuropeptides with large molecular weight or

with labile modifications

168

Introduction

Neuropeptides are the largest and most diverse group of endocrine signaling molecules in

the nervous system They are necessary and critical for initiation and regulation of numerous

physiological processes such as feeding reproduction and development1 2

Mass spectrometry

(MS) with unique advantages such as high sensitivity high throughput chemical specificity and

the capability of de novo sequencing with limited genomic information is well suited for the

detection and sequencing of neuropeptides in endocrine glands and simultaneously provides the

potential for information on post-translational modifications such as sulfonation3-6

The sinus glands (SG) are located between the medulla interna and medulla externa of the

eyestalk in Callinectes sapidus It is a well-known neuroendocrine site that synthesizes and

secretes peptide hormones regulating various physiological activities such as molting

hemolymph glucose levels integument color changes eye pigment movements and

hydro-mineral balance7 Our labrsquos previous neuropeptidomic studies of SG in several

crustacean species including Cancer borealis8-11

Carcinus maenas12

and Homarus americanus13

14 used multiple MS strategies combining MALDI-based high resolution accurate mass profiling

biochemical derivatization and nanoscale separation coupled to tandem MS for de novo

sequencing In the current study we explore the neuropeptidome of the SG in the blue crab

Callinectes sapidus a vital species of economic importance on the seafood market worldwide In

total 70 neuropeptides are identified including 27 novel neuropeptides and among them the

crustacean hyperglycemic hormone precursor-related peptides (CPRPs) and orcokinins represent

major neuropeptide families known in the SG

The crustacean hyperglycemic hormone (CHH) and its precursor-related peptide (CPRP) are

produced concurrently during the cleavage of CHH from the CHH preprohormone protein15

The

169

CPRP peptide is located between the signal peptide and the CHH sequence and is separated from

the CHH by a dibasic cleavage site The functions of CPRPs are currently unknown16

However

the complete characterization of CPRPs provides a foundation for future experiments elucidating

their functional roles in crustaceans The cDNA of CHH preprohormone in SG of Callinectes

sapidus has been characterized17

but the direct detection of CPRP has not been reported due to

its relatively large size and possible post-translational modifications While small fragments of

CPRPs can be identified using nanoLC-ESI-Q-TOF tandem mass spectrometry the intact

peptide is difficult to detect due to the large molecular weight of CPRPs

Peptidomics traditionally uses collision induced dissociation (CID) for tandem MS

experiments for de novo sequencing Recently an alternative peptide fragmentation method has

been developed using the transfer of an electron termed electron transfer dissociation (ETD)18 19

ETD involves a radical anion with low electron affinity to be transferred to peptide cation which

results in reduced sequence discrimination and thus provides improved sequencing for larger

peptides compared to CID20

Specifically for an ion trap ETD the fluoranthene radical anion is

allowed to react with peptide cations in the three dimensional trap The resulting dissociation

cleaves at the N-Cα bond to produce c and zbull type product ions The peptidome of model

organisms like Callinectes sapidus can be further explored and expanded utilizing ETD as a

complementary fragmentation technique to CID Previous peptidomic analysis has been

completed using ETD as an additional fragmentation method21

It was observed that

enzymatically produced peptides with a higher mz produced improved sequence coverage using

ETD This observation termed decision tree analysis determined that a charge state of ge6 all

peptides endogenous or enzymatic should be fragmented via ETD22

In the present study the

highly charged peptide CPRP with an intact molecular weight of 3837 readily produces plus six

170

charge states The higher charge state of CPRP is highly amenable to fragmentation by ETD

which produces remarkably improved fragmentation and thus increased sequence coverage when

compared to CID

Sulfonation of tyrosine is a post-translational modification (PTM) that is thought to occur on

trans-membrane spanning and secreted proteins23

Cholecystokinin-8 (CCK-8) is a sulfated

peptide which has been linked to satiety24-26

and a CCK-like peptide has been observed in

lobster27

Sulfonation is an extremely labile modification and is often lost during soft

ionization such as ESI or MALDI but can be avoided by optimizing the ionization process28

One potential strategy to identify sulfopeptides is to scan for a neutral loss of 80 Da after CID

but this method does not allow for identification of site of sulfonation and has the risk to be

mistaken for phosphorylation The sulfonation PTM is a negatively charged modification on

the peptide which allows for negative ion scanning in the mass spectrometer but provides

minimal MSMS sequence coverage during CID due to preferential loss of the sulfate group

Alternatively electron-based dissociation technique enables more rapid radical driven

fragmentation where the cleavage pattern is more uniform along the peptide backbone without

initially cleaving the labile sulfated group thus preserving the site of modification These types

of dissociation techniques only work for multiply-charged ions thus a method to install a

multiply-charged cation on the target peptide is desirable It has been shown that the electron

capture dissociation (ECD) can be used to sequence sulfated peptides when a multi-charged

cation is added to the solution29

We use a similar multi-charge cation solution technique to

sequence a sulfated peptide from Homarus americanus via ETD on an ion trap mass

spectrometer Here we presented the use of the ETD fragmentation technique for the analysis

of large peptides and peptides containing labile post-translational modification

171

Experimental Section

Chemical and materials

Acetonitrile methanol magnesium chloride (ACS grade) potassium chloride (ACS grade) and

calcium chloride (ACS grade) were purchased from Fisher Scientific (Pittsburgh PA) Formic

acid was from Sigma-Aldrich (St Louis MO) Human sulfated CCK-8 and CCK-like peptide

(E(pyro)FDEY(Sulfo)GHMRFamide) were purchased from American Peptide (Sunnyvale CA)

Fused-silica capillary with 75 μm id and 360 μm od was purchased from Polymicro

Technologies (Phoenix AZ) Millipore C18 Ziptip column was used for sample cleaning and all

water used in this study was deionized water (182 MΩcm) prepared from a Milli-Q Millipore

system (Billerica MA) The physiological saline consisted of 440 mM NaCl 11mM KCl 26

mM MgCl2 13 mM CaCl2 11 mM Trizma base and 5 mM maleic acid in pH 745

Animals and dissection

Callinectes sapidus (blue crab) were obtained from commercial food market and maintained

without food in artificial sea water at 10-12 ordmC Animals were cold-anesthetized by packing on

ice for 15-30min before dissection The optic ganglia with the SG attached were dissected in

chilled (~4ordmC) physiological saline and individual SGs were isolated from the optic ganglia by

micro-dissection and immediately placed in acidified methanol (90 methanol 9 glacial acetic

acid and 1 water) and stored at -80ordmC until tissue extraction

Tissue homogenization

Acidified methanol was used during the homogenization of SGs The homogenized SGs were

immediately centrifuged at 16100 g using an Eppendorf 5415D tabletop centrifuge (Eppendorf

172

AG) The pellet was washed using acidified methanol and combined with the supernatant and

further dried in a Savant SC 110 SpeedVac concentrator (Thermo Electron Corporation) The

resulting dried sample was re-suspended with a minimum amount (20 microL) of 01 formic acid

Fractionation of homogenates using reversed phase (RP)-HPLC

The re-suspended extracts were then vortexed and briefly centrifuged The resulting supernatants

were subsequently fractionated via high performance liquid chromatography (HPLC) HPLC

separations were performed using a Rainin Dynamax HPLC system equipped with a Dynamax

UV-D II absorbance detector (Rainin Instrument Inc Woburn MA) The mobile phases included

Solution A (deionized water containing 01 formic acid) and Solution B (acetonitrile containing

01 formic acid) About 20 μl of extract was injected onto a Macrosphere C18 column (21 mm

id x 250 mm length 5 microm particle size Alltech Assoc Inc Deerfield IL) The separation

consisted of a 120 minute gradient of 5-95 Solution B Fractions were automatically collected

every two minutes using a Rainin Dynamax FC-4 fraction collector (Rainin Instrument Inc

Woburn MA) The fractions were then concentrated using in Savant SC 110 SpeedVac

concentrator (Thermo Electron Corporation) and re-suspended with minimum amount of 01

formic acid

Nano-LC-ESI-Q-TOF MSMS

Nanoscale LC-ESI-Q-TOF MSMS was performed using a Waters nanoAcquity UPLC system

coupled to a Q-TOF Micro mass spectrometer (Waters Corp Milford MA)

Chromatographic separations were performed on a homemade C18 reversed phase capillary

column (75 microm internal diameter x100 mm length 3 microm particle size 100 Aring) The mobile phases

173

used were 01 formic acid in deionized water (A) 01 formic acid in acetonitrile (B) An

aliquot of 50 microl of a tissue extract or HPLC fraction was injected and loaded onto the trap

column (Zorbax 300SB-C18 Nano trapping column Agilent Technologies Santa Clara CA)

using 95 mobile phase A and 5 mobile phase Bat a flow rate of 10 microlmin for 10 minutes

Following this the stream select module was switched to a position at which the trap column

came in line with the analytical capillary column and a linear gradient of mobile phases A and B

was initiated For neuropeptides the linear gradient was from 5 buffer B to 45 buffer B over

90 min The nanoflow ESI source conditions were set as follows capillary voltage 3200 V

sample cone voltage 35 V extraction cone voltage 1 V source temperature 100ordmC A data

dependent acquisition was employed for the MS survey scan and the selection of three precursor

ions and subsequent MSMS of the selected parent ions The MS scan range was from mz

400-1800 and the MSMS scan was from mz 50-1800

Peptide Prediction De Novo Sequencing and Database Searching

De novo sequencing was performed using a combination of MassLynxTM

41 PepSeq software

(Waters) and manual sequencing Tandem mass spectra acquired from the Q-TOF were first

deconvoluted using MaxEnt 3 software (Waters) to convert multiply charged ions into their

singly charged forms The resulting spectra were pasted into the PepSeq window for sequencing

analysis The candidate sequences generated by the PepSeq software were compared and

evaluated for homology with previous known peptides The online program blastp (National

Center for Biotechnology Information Bethesda MD httpwwwncbinlmnihgovBLAST)

was used to search the existing NCBI crustacean protein database using the candidate peptide

sequences as queries For all searches the blastp database was set to non-redundant protein

174

sequences (ie nr) and restricted to crustacean sequences (ie taxid 6657) For each of the

proteins putatively identified via blastp the BLAST score and BLAST-generated E-value for

significant alignment are provided in the appropriate subsection of the results Peptides with

partial sequence homology were selected for further examination by comparing theoretical

MSMS fragmentation spectra generated by PepSeq with the raw MSMS spectra If the

fragmentation patterns did not match well manual sequencing was performed

NanoLC Coupled to MSMS by CID and ETD

Setup for RP nanoLC separation

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN (Fisher Scientific Optima LCMS grade Fair Lawn NJ) Injections

consisted of 5 microL of prepared tissue extract onto an Agilent Technologies Zorbax 300 SB-C18 5

microm 5 x 03 mm trap cartridge (Santa Clara CA) at a flow rate of 5 microLmin for 5 minutes at 95

A5 B Separation was performed on a Waters HPLC column with 3 microm Atlantis dC18 75 microm

x 150 mm (Milford MA) on a gradient from 5 to 45 mobile phase B at 250 nLmin over 90

minutes at room temperature Emitter tips were pulled from 75 microm inner diameter 360 microm

outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using a commercial

laser puller model P-2000 (Sutter Instrument Co Novato CA)

Mass Spectrometry Data Acquisitions for Collision Induced Dissociation (CID)

An ion-trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany) equipped

with an on-line nanospray source was used for mass spectrometry data acquisition Hystar

(Version 32 Bruker Daltonics Bremen Germany) was used to couple and control Eksigent

175

nanoLC software (Dublin CA Version 30 Beta Build 080715) to MS acquisition for all

experiments Smart parameter setting (SPS) was set to mz 700 compound stability and trap

drive level were set at 100 Optimization of the nanospray source resulted in dry gas

temperature 125oC dry gas 40 Lmin capillary voltage -1300 V end plate offset -500 V

MSMS fragmentation amplitude 10V and SmartFragmentation set at 30-300

Data was generated in data dependent mode with strict active exclusion set after two spectra and

released after one minute MSMS was obtained via CID fragmentation for the six most

abundant MS peaks with a preference for doubly charged ions and excluded singly charged ions

For MS generation the ion charge control (ICC) target was set to 200000 maximum

accumulation time 5000 ms four spectral average acquisition range of mz 300-1500 and scan

speed (enhanced resolution) of 8100 mz s-1

For MSMS generation the ICC target was set to

200000 maximum accumulation time 5000 ms three spectral averages acquisition range of

mz 100-2000 and scan speed (Ultrascan) of 32000 mz s-1

Mass Spectrometry Data Acquisitions for Electron Transfer Dissociation (ETD)

The amaZon ETD ion-trap mass spectrometer was operated in electron transfer dissociation for

MSMS fragmentation with the same optimized settings as reported for CID unless otherwise

stated Smart parameter setting (SPS) was set to 500 mz compound stability and trap drive

level were set at 100 MSMS was obtained via ETD fragmentation for the four most

abundant MS peaks with no preference for specifically charged ions except to exclude singly

charged ions The ETD reagent parameters were set to 400000 target ICC for the fluoranthene

radical anion The fluoranthene radical anion has an mz of 210 and therefore the 210 mz value

was removed from the resulting MSMS spectra and 160 mz was set for the MSMS low mz

cut-off

176

Direct Infusion of Sulfated Peptide Standards into the Mass Spectrometer using ETD and

CID Fragmentation

The sulfated peptide standards were infused into the mass spectrometer at a flow rate of 300

nlmin using a fused-silica capillary with 75 μm id and 360 μm od coupled to a custom pulled

tip The CCK-8 human peptide (20 microM) was infused in 5050 ACNH2O and detected in

negative ionization mode with an ICC of 70000 and fragmented with CID using the same

settings as the previous CID experiments Alternatively the CCK-like lobster sulfated peptide

(2 microM) was mixed with 30 microM of different salts (MgCl2 KCl or CaCl2) and 01 formic acid in

5050 ACNH2O The plus three charge of the CCK-like lobster peptide was selected for ETD

fragmentation in positive mode with the same setting as the previous ETD experiments The

data were then de novo sequenced for coverage and localization of the sulfation site

Data Analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen Germany)

Deviations in parameters from the default Protein Analysis in DataAnalysis were as follows

intensity threshold 1000 maximum number of compounds 1E7 and retention time window 05

minutes These parameter changes assisted in providing better quality spectra for sequencing

Identification of peptides was performed using Mascot (Version 23 Matrix Science London

UK) Searches were performed against a custom crustacean database none enzyme allow up to

1 missed cleavage amidated C-terminal as variable modifications peptide tolerance mass error

12 Da MSMS mass error tolerance is 06 Da

Results and Discussion

177

Identification and Characterization of Intact CPRPs Using ETD

Using ESI-Q-TOF 9 truncated CPRPs fragments were sequenced to cover the amino acid

sequence of the intact CPRP peptide RSAEGLGRMGRLLASLKSDTVTPLRGFEGETGHPLE

A representative CID MSMS spectrum of ASLKSDTVTPLR is shown in Figure 1 (a) CID

using ESI-Q-TOF mainly produces fragmentation for doubly and triply charge molecules which

is adequate to sequence peptides with relatively small molecular weight (less than 2000 Da)

However CID fragmentation is insufficient to sequence the larger intact CPRP in a complex

sample whose molecular mass is 3837 Da and therefore it is extremely difficult to directly

sequence by traditional CID ESI-Q-TOF ETD is an attractive fragmentation alternative to

sequence the highly charged CPRP peptide In ETD a protonated peptide reacts with an anion

(fluoranthene radical) where the anion transfers an electron to the peptide cation and the resulting

fragmentation occurs along the N-Cα bond by free-radical-driven-cleavage The large size of

CPRP offers multiple basic amino acid sites to sequester charge and reduce the sequence

coverage from collision induced dissociate by preventing random backbone cleavage whereas

ETD does not cleave the weakest bond and provides the alternative fragmentation pathway to

obtain superior sequence coverage to larger peptides As shown in Figure 1 (cd) the

fragmentation of ETD is highly amenable to ETD compared to the CID fragmentation in Figure

1 (b) As evident from comparison ETD offers more extensive fragmentation than CID thus

providing enhanced coverage for large intact peptides such as CPRPs Specifically we observe

125 of the b- and y- cleavages for CID as compared to an average of 535 of c- and zbull-

fragments More than a four-fold increase in fragments using ETD suggests the utility of this

relatively new tandem MS fragmentation method as complementary techniques for de novo

sequencing of large intact endogenous peptides and novel neuropeptide discovery endeavors

178

Negative Mode Sulfated Peptide Identification

An accepted method for identification and quantification for sulfated peptides is negative

ionization30

CCK-8 sulfated peptide standards show intense signal in negative ionization mode

without needing to have additives added such as salts Figure 2 shows the CID MSMS

spectrum from the negative ionization and produces an intense b6 ion at 430 mz The transition

from the doubly negatively charged MS ion of 570 mz to the b6 ion provides a selected reaction

monitoring transition for quantification studies but the sequence information is limited and the

presence of the methionine produces variable oxidation In addition Figure 2 shows the

MSMS product ions with the loss of the sulfate group thus making site-specific location of

sulfation more difficult

Investigation of Cation Salt Adducts to Produce Multiply Charged Sulfated Peptides

Sulfated peptides can be isolated in positive scanning mode but usually only in a plus one

state with low signal intensity If ETD is performed on the singly charged peptide cation a

neutral is formed and is lost in the mass spectrometer and thus no sequence information can be

obtained In order to remedy this situation a technique that adding metal salts to peptides to

increase charge state for ECD used in Fourier transform ion cyclotron resonance mass

spectrometry (FTICR-MS)29

inspired the investigation of increasing charge state of targeted

peptide via metal salt adduction followed by ETD analysis in a Bruker amaZon ion trap

Various salts were investigated including MgCl2 CaCl2 and KCl each used at a concentration of

30 microM and mixed with a sulfated peptide standard at 2 microM The addition of KCl produced

mostly doubly charged ion of the lobster CCK-like peptide standard K adduct which produced

insufficient sequence information from ETD fragmentation (data not shown) In comparison

the MgCl2 and CaCl2 produced intense triply charged peptide ions but CaCl2 produced lower

179

signal intensity compared to MgCl2 (data not shown)

Cation Assisted ETD Fragmentation of CCK-like Lobster Sulfated Peptide and Future

Directions

The addition of MgCl2 to the lobster CCK-like sulfated peptide is shown in Figure 3

Except for z1 the complete z-series is obtained including the z7 ion with and without the

sulfation PTM The spectrum is complicated by the additional Mg adducts but several peaks

are shown without any Mg adduct such as z2 z3 z4 etc Clearly the method of cation

assisted ETD fragmentation can be useful using ion trap instruments and can properly sequence

sulfated peptides that are prone to neutral loss from the labile PTM One concern about future

direction in sulfopeptide isolation in non-genome sequenced species is isolation of sulfopeptides

Currently antibodies to specific sulfopeptides is the only commonly used enrichment technique

for sulfopeptides Also since metal cations are needed for this method direct infusion into an

ESI mass spectrometer is preferred to prevent running large amounts of adduct forming salts

through the LC system With direct infusion the lack of separation confounds the problem in

sulfopeptide detection because any co-eluting peptides could have ionization efficiency and thus

reduce the signal of the sulfopeptide Once discovered and sequenced a selected reaction

monitoring (SRM) method could be developed using LC retention coupled with negative

ionization mode followed by CID MSMS or possibly MSMSMS to perform quantitative

studies for sulfopeptides

Conclusions

In this study ETD was performed to improve the sequence coverage of large endogenous

neuropeptides with higher charge and bigger size The intact CPRP from Callinectes sapidus was

identified and characterized with 68 sequence coverage by MS for the first time Cation

180

assisted ETD fragmentation was also shown to be a powerful tool in de novo sequencing of

sulfated neuropeptides These endeavors into using ETD for certain neuropeptides will assist in

future analysis of large neuropeptides and PTM containing neuropeptides

181

Reference

1 Schwartz M W Woods S C Porte D Jr Seeley R J Baskin D G Central nervous system control of

food intake Nature 2000 404 (6778) 661-71

2 Sweedler J V Li L Rubakhin S S Alexeeva V Dembrow N C Dowling O Jing J Weiss K R

Vilim F S Identification and characterization of the feeding circuit-activating peptides a novel neuropeptide

family of aplysia J Neurosci 2002 22 (17) 7797-808

3 Baggerman G Cerstiaens A De Loof A Schoofs L Peptidomics of the larval Drosophila melanogaster

central nervous system Journal of Biological Chemistry 2002 277 (43) 40368-40374

4 Desiderio D M Mass spectrometric analysis of neuropeptidergic systems in the human pituitary and

cerebrospinal fluid Journal of Chromatography B 1999 731 (1) 3-22

5 Li L J Kelley W P Billimoria C P Christie A E Pulver S R Sweedler J V Marder E Mass

spectrometric investigation of the neuropeptide complement and release in the pericardial organs of the crab Cancer

borealis Journal of Neurochemistry 2003 87 (3) 642-656

6 Li L J Moroz T P Garden R W Floyd P D Weiss K R Sweedler J V Mass spectrometric survey of

interganglionically transported peptides in Aplysia Peptides 1998 19 (8) 1425-1433

7 Strand F L Neuropeptides regulators of physiological processes 1st ed ed MIT Press Cambridge Mass

1999 p 658 p

8 Fu Q Goy M F Li L J Identification of neuropeptides from the decapod crustacean sinus glands using

nanoscale liquid chromatography tandem mass spectrometry Biochemical and Biophysical Research

Communications 2005 337 (3) 765-778

9 Fu Q Christie A E Li L J Mass spectrometric characterization of crustacean hyperglycemic hormone

precursor-related peptides (CPRPs) from the sinus gland of the crab Cancer productus Peptides 2005 26 (11)

2137-2150

10 Ma M M Chen R B Ge Y He H Marshall A G Li L J Combining Bottom-Up and Top-Down Mass

Spectrometric Strategies for De Novo Sequencing of the Crustacean Hyperglycemic Hormone from Cancer borealis

Analytical Chemistry 2009 81 (1) 240-247

11 Ma M M Sturm R M Kutz-Naber K K Fu Q Li L J Immunoaffinity-based mass spectrometric

characterization of the FMRFamide-related peptide family in the pericardial organ of Cancer borealis Biochemical

and Biophysical Research Communications 2009 390 (2) 325-330

12 Ma M M Bors E K Dickinson E S Kwiatkowski M A Sousa G L Henry R P Smith C M Towle

D W Christie A E Li L J Characterization of the Carcinus maenas neuropeptidome by mass spectrometry and

functional genomics General and Comparative Endocrinology 2009 161 (3) 320-334

13 Chen R B Jiang X Y Conaway M C P Mohtashemi I Hui L M Viner R Li L J Mass Spectral

Analysis of Neuropeptide Expression and Distribution in the Nervous System of the Lobster Homarus americanus

Journal of Proteome Research 2010 9 (2) 818-832

14 Ma M M Chen R B Sousa G L Bors E K Kwiatkowski M A Goiney C C Goy M F Christie A

E Li L J Mass spectral characterization of peptide transmittershormones in the nervous system and

neuroendocrine organs of the American lobster Homarus americanus General and Comparative Endocrinology

2008 156 (2) 395-409

15 Ollivaux C Gallois D Amiche M Boscameric M Soyez D Molecular and cellular specificity of

post-translational aminoacyl isomerization in the crustacean hyperglycaemic hormone family FEBS J 2009 276

(17) 4790-802

16 Wilcockson D C Chung J S Webster S G Is crustacean hyperglycaemic hormone precursor-related

peptide a circulating neurohormone in crabs Cell and Tissue Research 2002 307 (1) 129-138

17 Choi C Y Zheng J Watson R D Molecular cloning of a cDNA encoding a crustacean hyperglycemic

hormone from eyestalk ganglia of the blue crab Callinectes sapidus General and Comparative Endocrinology 2006

148 (3) 383-387

18 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and protein sequence analysis

by electron transfer dissociation mass spectrometry Proc Natl Acad Sci U S A 2004 101 (26) 9528-33

19 Coon J J Syka J E P Schwartz J C Shabanowitz J Hunt D F Anion dependence in the partitioning

between proton and electron transfer in ionion reactions International Journal of Mass Spectrometry 2004 236

(1-3) 33-42

20 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-site identity and

position on electron-transfer dissociation of polypeptide cations J Am Chem Soc 2007 129 (40) 12232-43

182

21 Altelaar A F Mohammed S Brans M A Adan R A Heck A J Improved identification of endogenous

peptides from murine nervous tissue by multiplexed peptide extraction methods and multiplexed mass spectrometric

analysis J Proteome Res 2009 8 (2) 870-6

22 Swaney D L McAlister G C Coon J J Decision tree-driven tandem mass spectrometry for shotgun

proteomics Nat Methods 2008 5 (11) 959-64

23 Monigatti F Hekking B Steen H Protein sulfation analysis--A primer Biochim Biophys Acta 2006 1764

(12) 1904-13

24 Chandler P C Wauford P K Oswald K D Maldonado C R Hagan M M Change in CCK-8 response

after diet-induced obesity and MC34-receptor blockade Peptides 2004 25 (2) 299-306

25 Little T J Feltrin K L Horowitz M Meyer J H Wishart J Chapman I M Feinle-Bisset C A

high-fat diet raises fasting plasma CCK but does not affect upper gut motility PYY and ghrelin or energy intake

during CCK-8 infusion in lean men Am J Physiol Regul Integr Comp Physiol 2008 294 (1) R45-51

26 Blevins J E Morton G J Williams D L Caldwell D W Bastian L S Wisse B E Schwartz M W

Baskin D G Forebrain melanocortin signaling enhances the hindbrain satiety response to CCK-8 Am J Physiol

Regul Integr Comp Physiol 2009 296 (3) R476-84

27 Turrigiano G G Selverston A I A cholecystokinin-like hormone activates a feeding-related neural circuit in

lobster Nature 1990 344 (6269) 866-8

28 Wolfender J L Chu F Ball H Wolfender F Fainzilber M Baldwin M A Burlingame A L

Identification of tyrosine sulfation in Conus pennaceus conotoxins alpha-PnIA and alpha-PnIB further investigation

of labile sulfo- and phosphopeptides by electrospray matrix-assisted laser desorptionionization (MALDI) and

atmospheric pressure MALDI mass spectrometry J Mass Spectrom 1999 34 (4) 447-54

29 Liu H Hakansson K Electron capture dissociation of tyrosine O-sulfated peptides complexed with divalent

metal cations Anal Chem 2006 78 (21) 7570-6

30 Young S A Julka S Bartley G Gilbert J R Wendelburg B M Hung S C Anderson W H

Yokoyama W H Quantification of the sulfated cholecystokinin CCK-8 in hamster plasma using

immunoprecipitation liquid chromatography-mass spectrometrymass spectrometry Anal Chem 2009 81 (21)

9120-8

183

Figure 1 Comparison of tandem MS spectra of truncated CPRP (ASLKSDTVTPL mz 128766)

by ESI-Q-TOF (a) and intact CPRP (MW 3837) by the amaZon ETD for both CID and ETD

fragmentation (a) MSn of precursor ion with mz 640 with charge state +6 by CID represent

loss of NH3 ordm represent loss of H2O (b) MS+6

of precursor ion with mz 640 with charge state +6

by ETD at z represent z+1 z represent z+2 (c) MS+5

of precursor ion with mz 768 with charge

state +5 by ETD z represent z+1 z represent z+2 For all labels in Figure 1 charge state +1 is

not specified

184

185

Figure 2 Negative CID spectrum of CCK-8 peptide standard (20 microM) via direct infusion show

the doubly charged b6 ion provides the most intense MSMS transition

186

Figure 3 ETD spectrum of E(pyro)FDEY(Sulfo)GHMRF-amide (2 microM) obtained on the

amaZon ETD via direct infusion with 30 microM MgCl2 The sulfated peptide can be identified

with a visible z-series of z2 to z9 and identified sulfate loss

187

Chapter 7

Investigation and reduction of sub-microgram peptide loss using molecular

weight cut-off fractionation prior to mass spectrometric analysis

Adapted from ldquoInvestigation and reduction of sub-microgram peptide loss using molecular

weight cut-off fractionation prior to mass spectrometric analysisrdquo Cunningham R Wang J

Wellner D Li L Journal of Mass Spectrometry In Press

188

ABSTRACT

This work investigates the introduction of methanol and a salt modifier to molecular

weight cut-off membrane-based centrifugal filters (MWCO) to enrich sub-microgram peptide

quantities Using a neuropeptide standard bradykinin sample loss was reduced over two orders

of magnitude with and without undigested protein present Additionally a bovine serum

albumin (BSA) tryptic digestion was investigated Twenty-seven tryptic peptides were identified

from MALDI mass spectra after enriching with methanol while only two tryptic peptides were

identified after the standard MWCO protocol The strategy presented here enhances recovery

from MWCO separation for sub-microg peptide samples

INTRODUCTION

Molecular weight cut-off membrane-based centrifugal filter devices (MWCO) are

commonly used to desalt and concentrate large molecular weight proteins [1] Greeing and

Simpson recently investigated various MWCO membranes for large amounts of starting material

(~6 mg) focusing on optimal conditions for the sub 25 kDa protein fraction [2] The authors

recovered 200 μg to 29 mg of protein from multiple MWCO experiments and demonstrated that

a 1090 acetonitrile (ACN)H2O elution solvent produced optimal results [3] In addition Manza

et al provided an alternative approach to isolate proteins with a 5 kDa MWCO by using

NH4HCO3 and recovering the retained proteins [4] Alternatively the elution from MWCOs can

be collected to recover only low molecular weight peptides Multiple peptidomic studies have

utilized MWCOs for peptide isolation during the first few steps of sample preparation [5 6]

When sample amount is limited or peptide content is below 1 microg sample loss is a significant

concern when using MWCOs to isolate endogenous peptides Optimized protocols have been

189

investigated using ACN [3 7] salt [4 8] SDS [5] or native sample [6 9 10] but these

experiments primarily focused on large sample amounts rather than sub-microgram peptide

quantities

MWCOs separate large molecules from small molecules The small molecule fraction

may be rich with signaling peptides (SP) cytokines and other small molecules involved in cell-

cell signaling Signaling peptides perform various functions in the body including cell growth

cell survival and hormonal signaling between organs [11] Individual SP contribute to different

aspects of behavior such as pain (enkephalins) [12] feeding (neuropeptide Y) [13] and blood

pressure (bradykinin) [14] MWCO separations can be used to enrich biologically important SP

and explore the peptide content from biological fluids with relatively low peptide content like

blood or cerebrospinal fluid (CSF) In a recent investigation the detection of neuropeptides and

standards in crustacean hemolymph was improved when methanol and protease inhibitors were

present before performing MWCO neuropeptide isolation The impact of methanol on MWCO

sample loss was not investigated in the study [15] In another study a large-scale mass

fingerprinting protocol of endogenous peptides from CSF used a combination of salts before

MWCO fractionation but the impact of adding salts was not discussed [16] The most

commonly used brand of MWCO in the publications and in peptidomic studies is Millipore

Therefore Millipore MWCOs (using regenerated cellulose as the membrane) were used in the

present study The purpose of this work is to provide an optimized sample preparation technique

for MWCO filtering to reduce sample loss and allow sub-microg detection of peptides using MALDI

mass spectrometry

MATERIALS AND METHODS

190

Materials and Chemicals

Water acetonitrile methanol (optima LCMS grade) and sodium chloride (995) were

purchased from Fisher Scientific (Fair Lawn NJ) The α-cyano-4-hydroxy-cinnamic acid (99)

formic acid (FA) (ge98) and bovine serum albumin (ge96) were purchased from Sigma-

Aldrich (St Louis MO) Amicon Ultra 05 mL 10000 MWCO centrifugal filters and ZipTips

packed with C18 reversed-phase resin were purchased from Millipore (Billerica MA) Trypsin-

digested bovine serum albumin (BSA) was purchased from Waters (Milford MA) Bradykinin

was purchased from American Peptide Company (Sunnyvale CA)

MALDI MS Instrumentation

An AutoFLEX III MALDI TOFTOF mass spectrometer (Bruker Daltonics Billerica

MA) was operated in positive ion reflectron mode The MALDI MS instrument is equipped with

a proprietary smart beam (Bruker Daltonics Billerica MA) laser operating at 200 Hz The

instrument was internally calibrated over the mass range of mz 500minus2500 using a standard

peptide mix Two thousand laser shots were collected per sample spot at an accelerating voltage

of 19 kV and a constant laser power using random shot selection The acquired data were

analyzed using FlexAnalysis software (Bruker Daltonics Billerica MA) Mass spectrometry

data acquisition was obtained by averaging 2000 laser shots

Molecular weight cut off separation procedure

The MWCO separations were performed using Amicon Ultra 05 mL 10000 MWCO

centrifugal filters (Billerica MA) Before MWCO separation three washing steps were

performed to remove contaminants from the filter The three washes were 500 μL of 5050

H2OMeOH 500 μL of H2O and 400 μL of the solution used for MWCO separation For the

191

100 H2O solution 1 microg of BSA or bradykinin was used for separation All the other MWCO

separation experiments used 500 ng of BSA or 100 ng or less of bradykinin The MWCO filter

was then centrifuged at 14000 rcf for 5 min at room temperature in an Eppendorf 5415 D

microcentrifuge (Brinkmann Instruments Inc Westbury NY) The filtrate was concentrated in a

Savant SC 110 SpeedVac concentrator (Thermo Electron Corporation West Palm Beach FL)

and acidified The resulting sample was desalted according to the manufacturer using C18

ZipTips from Millipore (Billerica MA) by washing the ZipTip with three times 100 ACN

three aqueous washes of 01 FA binding the peptides from the solution and one aqueous wash

of 01 FA Peptides were then eluted from the ZipTips using 15 microL of 50 ACN in 01 FA

Matrix deposition

Equal volumes of 05 μL from the 15 μL sample solution (including standards not subject

to MWCO filtering) and α-cyano-4-hydroxy-cinnamic acid (CHCA) matrix solution in 50

ACN50 H2O were mixed using a dried-droplet method and spotted on a MALDI target The

resulting droplets were allowed to air dry prior to mass spectrometry acquisition

RESULTS AND DISCUSSION

Analysis of two orders of magnitude increase for bradykinin standard

Bradykinin was selected to assess the potential peptide loss in the flow-through after

performing MWCO separation As shown in Figure 1 1 microg of bradykinin standard does not

produce a detectable signal by MALDI mass spectrometry analysis after a 10 kDa MWCO

separation in water (performed in triplicate) For comparison 1 ng of bradykinin standard

diluted to 15 microL produced an intense signal on the MALDI mass spectrometer suggesting

192

significant sample loss occurs when the target analyte is low in quantity (data not shown

performed in triplicate) Figure 1 shows that the addition of a salt in this case NaCl improves

the limits of detection and decreases sample loss when 7030 watermethanol was compared to

7030 aqueous 1 M NaClmethanol The reproducible results gave a relative standard deviation

(RSD) of 6 for peak intensity Figure 1 shows that even when starting with 1 μg of bradykinin

too much sample is lost during the MWCO separation in water to detect the remainder

However an intense signal is observed for 10 ng of bradykinin after the MWCO separation when

7030 aqueous 1 M NaClmethanol is used as an elution solvent Sample loss from zip-tipping

was estimated in triplicate by calculating the decreases in peak intensity When 10 ng of

bradykinin was used sample loss was ~41 In comparison the calculated yield for 10 ng of

bradykinin after the 7030 aqueous 1 M NaClmethanol MWCO separation and zip-tip desalting

showed an estimated sample loss of 63 meaning more loss can be attributed to sample clean-

up than MWCO filtration

A series of experiments were performed to determine if 7030 aqueous 1 M

NaClmethanol is an optimal solution to recover peptides during a MWCO separation (data not

shown) A 5050 aqueous 1 M NaClmethanol and a 5050 watermethanol elution were

performed in duplicate but signal intensity of the resulting bradykinin was poor and numerous

polymer peaks were detected in the flow through A lower salt concentration 01 M NaCl was

used but also produced greatly reduced signal when compared to aqueous 1 M NaCl To assess

the optimized methodrsquos compatibility with lower sample amounts the 7030 aqueous 1 M

NaClmethanol solution was added to 1 ng of bradykinin for MWCO separation but no signal

was obtained (data not shown) Using a neuropeptide standard the addition of methanol and

NaCl salt significantly improved the sample recovery in sub-microg amounts

193

BSA tryptic peptide mixture analysis

After demonstrating the importance of using an optimized solution for MWCO

separations with an individual peptide the new method was applied to 500 ng of BSA tryptic

digest to investigate its utility with more complex peptide mixtures Table 1 lists the BSA

tryptic peptides identified in the MALDI MS analysis from different solution conditions

processed by MWCO separation As shown in Table 1 a directly spotted BSA tryptic peptide

standard in the absence of any MWCO filtration enabled identification of 39 tryptic peptides by

accurate peptide mass measurements Once again when using 100 H2O for MWCO

separations the starting amount was doubled to 1 microg (also done with 500 ng data not shown)

However many tryptic peptides were not detected due to low signal intensities and non-optimal

elution conditions Instead of H2O a 1 M NaCl solution was used for the MWCO elution but

only two tryptic peptides were identified (Table 1) The addition of 30 methanol into the

sample before MWCO filtration produced the first increase in identified BSA tryptic peptides

The remaining data from Table 1 shows improved BSA tryptic peptide identifications as the

sample (elution) conditions were further optimized Figure 2 shows the actual mass spectra

associated with the three most promising elution solutions along with 100 H2O

The BSA tryptic peptide intensities are shown in Figure 2A and the most intense tryptic

peptide YLYEIAR mz 92749 was observed in the four different solutions shown in Figure 2B

but not in the 100 H2O or 100 1 M NaCl solutions (data not shown) In Figure 2A all mass

spectra are normalized to an intensity of 7 x 104 to illustrate two points First the MWCO

filtering step still produced sample loss regardless of the solvent conditions chosen Second

there is a noticeable increase in peptide peak intensity using the optimized solvent 6040

194

aqueous 1 M NaClmethanol (Figure 2A) Figure 2C displays a zoomed-in view of a BSA

tryptic peptide signal LKECC

DKPLLEK mz 153266 (

carbamidomethyl) observed only in

the optimized solvent The detection of the mz 153266 peptide in Figure 2C highlights the

potential gain in sample and detectable peptides by using an optimized saltMeOH combination

A non-optimized saltMeOH combination will still reduce sample loss but further minimizing

sample loss during sample preparation will always be desirable in any analytical protocol

MWCO composition

The purpose of this application note is to provide evidence of sub-microg sample loss during

MWCO separations of peptide samples and a solution to overcome this limitation The

explanation of why adding MeOH and NaCl to the sample solution provides a significant

reduction in sample loss is beyond the scope of this application note Regardless Supplemental

Table S1 is an expanded version of Table 1 showing the amino acid sequence hydrophobicity

calculated using GRAVY scores and pI of the identified peptides in this study No discernible

trend was obtained from the data The membrane of commonly used MWCO in peptidomics and

for this study is comprised of chemically treated (regenerated) cellulose which is a

polysaccharide containing β (1rarr4) linked D-glucose Glucose has numerous free hydroxyl

groups which could non-specifically adsorb peptides flowing through the MWCO The addition

of MeOH has the most significant effect on signal which could be due to disrupting the

interaction between peptides and hydroxyl groups from glucose NaCl has a less significant

effect on sample recovery compared to MeOH but a detectable reduction in sample loss is noted

This improvement in sample recovery could be analogous to the use of NaCl in

195

immunodepletion protocols to reduce non-specific binding which is accomplished by adding

150 mM NaCl [17]

Analysis of bradykinin in the presence of undigested BSA

When using MWCO for peptide isolation proteins are typically present in the samples

usually in larger amounts Figure 3 shows the effect that adding BSA protein to a 10 ng

bradykinin solution before MWCO fractionation has on the resulting recovery of bradykinin

Adding 10 μg of BSA to the optimized 7030 aqueous 1 M NaClmethanol solution slightly

decreased bradykininrsquos signal with a RSD of 10 More severe signal reduction occurred after

adding 100 microg BSA with a RSD of 2 (N=2) It is not unexpected that more signal reduction

due to sample loss would occur especially in the 100 microg BSA sample because the BSA protein

has a molar ratio of 160 BSA protein molecules to one bradykinin peptide Figure 3 shows the

usefulness of the MWCO method with samples containing large amounts of proteins

RecommendationConclusion

The present work provides solutions to reduce sample loss from the use of MWCO for

sub-microg peptide isolation with or without non-digested proteins in the sample Despite its

widespread utility significant sample loss often occurs during the MWCO fractionation step

which is particularly problematic when analyzing low-abundance peptides from limited starting

material This application note aims to reduce sample loss during MWCO separations

specifically for sub-microg peptide isolation If complex samples are processed with MWCO

separation the authors recommend eluting the sample with 6040 aqueous 1 M NaClmethanol

solution as a starting point to minimize sample loss This application note provides a viable

196

alternative for sub-microg peptide MWCO separation circumventing the need to increase the starting

material by minimizing sample loss from using a MWCO membrane-based centrifugal filter

device

References

[1] HM Georgiou GE Rice MS Baker Proteomic analysis of human plasma failure of

centrifugal ultrafiltration to remove albumin and other high molecular weight proteins

Proteomics 2001 1 1503

[2] DW Greening RJ Simpson Low-molecular weight plasma proteome analysis using

centrifugal ultrafiltration Methods Mol Biol 2011 278 109

[3] DW Greening RJ Simpson A centrifugal ultrafiltration strategy for isolating the low-

molecular weight (ltor=25K) component of human plasma proteome J Proteomics 2010 73

637

[4] LL Manza SL Stamer AJ Ham SG Codreanu DC Liebler Sample preparation and

digestion for proteomic analyses using spin filters Proteomics 2005 5 1742

[5] D Theodorescu D Fliser S Wittke H Mischak R Krebs M Walden M Ross E Eltze O

Bettendorf C Wulfing A Semjonow Pilot study of capillary electrophoresis coupled to mass

spectrometry as a tool to define potential prostate cancer biomarkers in urine Electrophoresis

2005 26 2797

[6] K Antwi G Hostetter MJ Demeure BA Katchman GA Decker Y Ruiz TD Sielaff LJ

Koep DF Lake Analysis of the plasma peptidome from pancreas cancer patients connects a

peptide in plasma to overexpression of the parent protein in tumors J Proteome Res 2009 8

4722

[7] LP Aristoteli MP Molloy MS Baker Evaluation of endogenous plasma peptide extraction

methods for mass spectrometric biomarker discovery J Proteome Res 2007 6 571

[8] A Zougman B Pilch A Podtelejnikov M Kiehntopf C Schnabel C Kumar M Mann

Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome Res 2008 7

386

[9] X Yuan DM Desiderio Human cerebrospinal fluid peptidomics J Mass Spectrom 2005 40

176

[10] X Zheng H Baker WS Hancock Analysis of the low molecular weight serum peptidome

using ultrafiltration and a hybrid ion trap-Fourier transform mass spectrometer J Chromatogr A

2006 1120 173

[11] L Li JV Sweedler Peptides in the brain mass spectrometry-based measurement approaches

and challenges Annu Rev Anal Chem 2008 1 451

[12] GB Stefano G Fricchione Y Goumon T Esch Pain immunity opiate and opioid

compounds and health Med Sci Monit 2005 11 MS47

[13] J Jensen Regulatory peptides and control of food intake in non-mammalian vertebrates Comp

Biochem Physiol A Mol Integr Physiol 2001 128 471

197

[14] A Kuoppala KA Lindstedt J Saarinen PT Kovanen JO Kokkonen Inactivation of

bradykinin by angiotensin-converting enzyme and by carboxypeptidase N in human plasma Am

J Physiol Heart Circ Physiol 2000 278 H1069

[15] R Chen M Ma L Hui J Zhang L Li Measurement of neuropeptides in crustacean

hemolymph via MALDI mass spectrometry J Am Soc Mass Spectrom 2009 20 708

[16] H Jahn S Wittke P Zurbig TJ Raedler S Arlt M Kellmann W Mullen M Eichenlaub H

Mischak K Wiedemann Peptide fingerprinting of Alzheimers disease in cerebrospinal fluid

identification and prospective evaluation of new synaptic biomarkers PLoS One 2011 6

e26540

[17] NA Cellar AS Karnoup DR Albers ML Langhorst SA Young Immunodepletion of high

abundance proteins coupled on-line with reversed-phase liquid chromatography a two-

dimensional LC sample enrichment and fractionation technique for mammalian proteomics J

Chromatogr B Analyt Technol Biomed Life Sci 2009 877 79

198

Table 1 Identified BSA tryptic peptides from various MWCO separation conditions

BSA tryptic

peptide (MH+)

100

H2O 1microg

100

1 M NaCl

70

H2O

80

1 M NaCl

70

1 M NaCl

60

H2O

60

1 M NaCl

5083

5453

6894

7124

8985

9275

10345

10725

11385

11636

12496

12837

13057

13997

14157

14197

14398

14636

14798

15026

15118

15328

15547

15677

15768

16399

16678

16738

17248

17408

17477

17497

18809

18890

19019

19079

20450

21139

22479

Total 39 2 2 6 8 15 15 27

199

Figure 1 Representative MALDI mass spectra after MWCO separation of a bradykinin

standard showing improvement over two orders of magnitude in detection limits Each MWCO

separation was performed at minimum in triplicate with representative spectrum selected for

each with a calculated RSD from the peak heights Three different amounts of bradykinin were

tested to assess the magnitude of sample loss under different MWCO solvent conditions The

top panel shows 1 microg of bradykinin standard after MWCO separation with 100 H2O elution

produced no signal The addition of 40 or 30 MeOH produced very low bradykinin signals

for both 100 ng (RSD of 28) and 10 ng (SNlt3 no RSD calculated) respectively In the

bottom two spectra each showed very large intensity but the 7030 aqueous 1 M NaClmethanol

10 ng bradykinin was processed with a 10kDa MWCO and zip-tipped and was reproducible with

200

a RSD of 6 The last spectrum was from 10 ng bradykinin (RSD of 3) which was diluted to

an equivalent volume as all the other experiments and directly spotted onto the MALDI plate

201

Figure 2 Representative MALDI mass spectra from MWCO separation of a BSA tryptic

peptide standard showing sample loss Stacked mass spectra from mz range 875-2150

normalized to 7 x 104 intensity representing the detection difference from a BSA tryptic peptide

standard from different MWCO separation conditions (A) It should be noted that when the

solvent for MWCO elution was 100 H2O 1 microg of BSA tryptic peptides was processed instead

of 500 ng A zoomed in view of the most abundant BSA tryptic peptide detected YLYEIAR

mz 92749 (B) Various percentages of MeOH produced significant signal but addition of a salt

(1 M NaCl) increases the signal which is closest to a directly spotted BSA tryptic peptide

standard A zoomed in view of a representative low intensity BSA tryptic peptide detected

LKECC

DKPLLEK mz 153266 (C) The optimized solution to be used for MWCO filtration

202

6040 aqueous 1 M NaClmethanol was the only procedure that enabled the detection of the

tryptic peptide in Figure 2C which was also detected in the directly spotted BSA tryptic peptide

standard All experiments were performed a minimum of two times with nearly identical results

) Carbamidomethyl amino acid modification

ordm) Tryptic peptide identified in three of the spectra in Figure 2A

dagger) Tryptic peptide identified in two of the spectra in Figure 2A

) Tryptic peptide identified in a single spectrum in Figure 2A

203

Figure 3 Representative MALDI mass spectra after MWCO separation of a bradykinin

standard with a BSA protein present showing optimized solvent conditions minimized samples

losses Each experiment was performed in duplicate Two different amounts of BSA protein

were tested to assess the magnitude of sample loss caused by the presence of a protein The top

panel shows 10 microg of BSA protein and the second panel shows 100 microg of BSA protein added

while only 10 ng of bradykinin was added Detectable sample loss was observed when the BSA

protein was added but panel two shows that the amount of BSA protein was 1 x 104 greater

(equivalent to 160 fold molar excess) than bradykinin The last two spectra were controls using

a MWCO with the optimized solution in panel 3 and panel 4 using 10 ng bradykinin which was

diluted to an equivalent volume as all the other experiments and directly spotted onto the

MALDI plate

204

Supplemental Table 1 Expanded Table 1 including grand average of hydropathicity (GRAVY)

score theoretical pI and the sequence from the underlying amino acid sequence for the peptides

identified in the BSA digest The GRAVY and pI scores were obtained from ExPASy

Bioinformatics and modifications were not taken into consideration

(httpwebexpasyorgprotparam) Peptide assignments to the recorded peaks were done by

BSA

tryptic

peptide

(MH+)

GRAVY

score

Theoretical

pI

Sequence 100

H2O

1microg

100

1 M

NaCl

70

H2O

80

1 M

NaCl

70

1 M

NaCl

60

H2O

60

1 M

NaCl

5083 NA NA FGER

5453 0900 972 VASLR

6894 0267 979 AWSVAR

7124 -0950 647 SEIAHR

8985 0529 674 LcVLHEK

9275 -0071 600 YLYEIAR

10345 -0725 674 NEcFLSHK

10725 -0211 538 SHcIAEVEK

11385 0 599 ccTESLVNR

11636 0130 453 LVNELTEFAK

12496 -1250 545 FKDLGEEHFK

12837 0264 675 HPEYAVSVLLR

13057 -0582 532 HLVDEPQNLIK

13997 0567 437 TVMENFVAFVDK

14157 0567 437 TVmENFVAFVDK

14197 0058 530 SLHTLFGDELcK

14398 -0133 875 RHPEYAVSVLLR

14636 -0515 465 TcVADESHAGcEK

14798 0292 600 LGEYGFQNALIVR

15026 -0625 409 EYEATLEEccAK

15118 0207 597 VPQVSTPTLVEVSR

15328 -0617 617 LKEccDKPLLEK

15547 -0823 441 DDPHAcYSTVFDK

15677 -0085 437 DAFLGSFLYEYSR

15768 -0985 456 LKPDPNTLcDEFK

16399 -0067 875 KVPQVSTPTLVEVSR

16678 0064 437 MPCTEDYLSLILNR

16738 -1723 550 QEPERNEcFLSHK

17248 0064 437 MPcTEDYLSLILNR

17408 0064 437 mPcTEDYLSLILNR

17477 -0914 414 YNGVFQEccQAEDK

17497 -0621 410 EccHGDLLEcADDR

18809 -0537 606 RPcFSALTPDETYVPK

18890 -0567 674 HPYFYAPELLYYANK

19019 -1275 466 NEcFLSHKDDSPDLPK

19079 0044 454 LFTFHADIcTLPDTEK

20450 -0812 839 RHPYFYAPELLYYANK

21139 -0682 480 VHKEccHGDLLEcADDR

22479 -0458 423 EccHGDLLEcADDRADLAK

Total 39 2 2 6 8 15 15 27

205

mass matching with no tandem mass spectrometry performed Lower case amino acids indicate

a modification present in the peptide of carbamidomethyl (c) or oxidation (m)

206

Chapter 8

Conclusions and Future Directions

207

Summary

Comparative shotgun proteomics investigating numerous biological changes in various

species and sample media and peptidomic method development have been reported The

developed comparative shotgun proteomics based on label-free spectral counting with nanoLC

MSMS platform have been employed in mouse CSF rat CSF yeast culture and other biological

specimens Mass spectrometry (MS)-based peptidomic enhancements using ETD and improved

sample preparation methods for molecular weight cut-offs have been reported Together these

studies add to the Li Labs capabilities in comparative shotgun proteomics and expand available

methods for peptidomic research

Immunodepletion of CSF for comparative proteomics

Chapters 3 and 4 used similar methods to generate a list of differentially expressed

proteins providing insight into how Alexander disease (AxD) perturbs the proteome and how the

new prion model rat adapted scrapie (RAS) proteome varies from control rats In the GFAP

overexpressor mouse CSF study in Chapter 3 we have produced a panel of proteins with

significant up or down regulation in the CSF of transgenic GFAP overexpressor mice MS-based

proteomic study of this mouse model for AxD was consistent with the previous studies showing

elevation of GFAP in CSF The development of a modified IgY-14 immunodepletion technique

for low amounts of CSF with recommendations for future antibody depletion techniques to deal

with the unique challenges of mouse CSF was presented Modified proteomics protocols were

employed to profile mouse CSF with biological and technical triplicates addressing the

variability and providing quantitation with dNSAF spectral counting Validation of the data was

performed using both ELISA and RNA microarray data to provide corroboration with the

208

changes in the putative biomarkers This work presents numerous interesting targets for future

study in AxD including CK-MCK-B cathepsin S L1 and B isoforms and contactin-1

Using the protocol developed in Chapter 3 we performed a similar study in RAS CSF

compared to control rat CSF Two differences in sample preparation for the rat CSF compared

to the mouse CSF are noted (1) rat IgY-7 immunodepletion was performed and (2) each rat

CSF sample was collected from one animal due to sufficient volume instead of pooling from

multiple animals for the mouse CSF sample After immunodepletion the CSF samples from

control and RAS (biological N=5 technical replicates N=3) were digested and separated using

one dimensional RP nanoLC separation We were able to identify 167 proteins with redundant

isoforms removed which was derived from total 512 proteins with a 1 FDR from all the CSF

samples Comparative analysis was performed using dNSAF spectral counting and 21 proteins

were significantly changed Our data were consistent with previous prion CSF studies showing

14-3-3 protein increased in CSF of prion infected animals Genomic analysis was also

performed and was used to cross-validate our proteomic data and numerous proteins were found

to be consistent including a novel marker related to prion disease RNAset2 (ribonuclease T2)

In summary this work provides a foundation for investigation of the perturbed proteome of a

new prion model RAS

Yeast Phosphoproteomic Comparison between Starved and Glucose Fed Conditions

This work presented a qualitative comparison of the phosphoproteome between starved

and glucose fed Saccharomyces cerevisiae (bakers yeast) A large scale IMAC enrichment of

yeast cell lysate followed by 2-D separations provided a rich source of phosphopeptides for CID

MSMS and neutral loss triggered ETD analysis using mass spectrometry A specific motif for

PKA was highlighted to show the differences in proteins identified between starved and glucose

209

fed conditions In total 477 unique phosphopeptides were identified with 06 FDR and 669

unique phosphopeptides identified with 54 FDR Phosphosite validation was performed using

a localization algorithm Ascore to provide further confidence on the site-specific

characterization of these phosphopeptides Two proteins Ssd1 and PMA2 emerged as potential

intriguing targets for more in-depth studies on protein phosphorylation involved in glucose

response

Methods for Peptide Sample Preparation and Sequencing

In this study ETD was performed to improve the sequence coverage of endogenous large

neuropeptides and labile tyrosine sulfation CCK-like neuropeptides found in the blue crab

Callinectes sapidus The intact CPRP from Callinectes sapidus was identified and characterized

with 68 sequence coverage by MS for the first time Cation assisted ETD fragmentation using

MgCl2 was also shown to be a powerful tool in de novo sequencing of sulfated neuropeptides

These endeavors into using ETD for certain neuropeptides will assist in future analysis of large

neuropeptides and PTM containing neuropeptides

In addition to ETD sequencing I presented a protocol on improving recovery of minute

quantities of peptides by adding methanol and a salt modifier (NaCl) to molecular weight cut-off

membrane-based centrifugal filters (MWCO) to enrich sub-microgram peptide quantities

Despite its widespread utility significant sample loss often occurs during the MWCO

fractionation step which is particularly problematic when analyzing low-abundance peptides

from limited starting material This work presented a method to reduce sample loss during

MWCO separations specifically for sub-microg peptide isolation Using a neuropeptide standard

bradykinin sample loss was reduced by over two orders of magnitude with and without

210

undigested protein present The presence of bovine serum albumin (BSA) undigested protein

and the bradykinin standard lends evidence that sample loss occurs because of the MWCO and

not the presence of the protein Additionally a BSA tryptic digestion is presented where twenty-

seven tryptic peptides are identified from MALDI mass spectra after enriching with methanol

while only two tryptic peptides are identified after the standard MWCO protocol

Ongoing Projects and Future Directions

CSF Projects

Rat Adapted Scrapie and Time Course Study of Rat CSF

In ongoing experiments from the work described in Chapter 4 related to rat adapted

scrapie (RAS) we are working towards more complete proteome coverage of CSF and a time

course study of RAS After the promising results of the 1-D proteomic comparison between

RAS and control CSF we want to perform IgY immunodepletion on 1 mL of rat CSF followed

by 2-D high pH-low pH RP-RP LC-MSMS IgY immunodepletion is still required and

afterwards approximately 40 microg of low abundance protein would remain Following traditional

urea tryptic digestion (see Appendix 1) and solid phase extraction (SPE) clean-up the sample

would be dried down and reconstituted in mobile phase A (25 mM ammonium formate basic

pH) for high pH RP separation Fractions would be dried and subjected to MS analysis similar to

the work described in Chapter 4 The purpose of this work would be to increase the proteome

coverage from a few hundred to a thousand CSF proteins identified A time course study of RAS

is also desirable to gain insight into disease progression Rats at different stages will be

sacrificed and CSF will be collected The goal is to perform isobaric labeling between the time

courses with spectral counting being an alternative for relative protein expression We will use

the targets identified in Chapter 4 to track certain proteins for time course analysis Overall

211

these future projects will dig deeper into the proteome and how this novel prion model RAS

perturbs the proteins expressed in rats over time

Glycoproteomics and Peptidomic Analysis of CSF Samples from Individuals with

Alzheimerrsquos Disease

Alzheimerrsquos disease is an incurable progressive neurodegenerative disorder which results

in cognitive impairment Our aim is to provide additional biomarkers andor targets for drug

treatment Currently we used 500 microL of human CSF for non-membrane glycoprotein

enrichment (see Appendix 1) followed by 2-D high-pH-low-pH RP-RP amaZon ion trap LC-

MSMS analysis The initial work was a failure due to low amount of signal and significant

sample loss (data not shown) Through a comparison with Dr Xin Weirsquos PhD thesis we

estimated roughly 20 microg of proteins was obtained after glycoprotein enrichment and 1-D analysis

produced over 100 protein identifications (data not shown) but the additional off-line 2-D

separation and sample clean up resulted in low number of protein identifications for each fraction

analyzed by MS (data not shown) We have recently obtained 1 mL of human CSF samples

from healthy controls and individuals suffering from Alzheimerrsquos disease and plan to perform

the same experiments with double the starting amount and reduce the fractions collected from 2-

D separation from 11 to 6 In addition to the glycoproteomics anything not enriched will be

subjected to MWCO separation using the method presented in Chapter 7 The obtained peptide

sample will be subjected to Q-TOF nanoLC MSE data acquisition followed by de novo

sequencing using various programs including PEAKS and Mascot Collectively we feel this

project has great potential to lead to interesting targets and further expand the proteomic

knowledge of Alzheimerrsquos disease

GFAP Knock-in Mouse CSF

212

In a direct follow-up to Chapter 3 we aim to perform comparative proteomics on control

vs glial fibrillary acidic protein (GFAP) knock-in mouse CSF samples The sample preparation

protocol as presented in Chapter 3 will be used For quantitative proteomics we plan on

performing isobaric labeling followed by performing high energy collision induced dissociation

(HCD) on the Orbitrap Elite The MS acquisition will also perform a top ten scan where the top

ten MS peaks will be selected for tandem MS analysis In addition targeted quantitation of

specific proteins using multiple reaction monitoring (MRM) can be performed on potential

biomarkers and identified in this follow-up study and the work presented in Chapter 3 Also any

CSF samples with noticeable blood content cannot be used for the exploratory proteomics

experiments but can potentially be used for the MRM analysis and should be kept for such

experiments in the future

Large Scale Proteomics

Proteomics of Mouse Amniotic Fluid for Lung Maturation

The overall goal of this project is to determine what proteins are present in amniotic fluid

when the embryo is 175 weeks compared to amniotic fluid at 155 weeks The rationale behind

why these two time points matter was investigated through a lung explant culture where amniotic

fluid was added at 155 weeks and 175 weeks Visual maturation of lungs occurred with the

175 week amniotic fluid and lung maturation genes were up regulated in the mRNA in the lung

explant culture when compared to the 155 week amniotic fluid The compound which is

causing the maturation of the lungs is unknown and search for a secreted protein might provide a

clue to this process In order to test this hypothesis we carried out discovery proteomics

experiments The workflow involves tryptic digestion SPE and 1-D low pH nanoLC separation

coupled to an amaZon ETD ion trap for tandem MS analysis The resulting mass spectrometric

213

acquisition identified less than 100 proteins with a 1 FDR (data not published) To address the

lack of depth in the proteome coverage we purchased an IgY immunodepletion column to

remove the most abundant proteins in amniotic fluid similarly to Chapter 3 and 4 but on a larger

scale One abundant protein alpha-fetoprotein is present in amniotic fluid but absent or present

in very low concentration in CSF serum or plasma Alpha-fetoprotein is similar to albumin and

thus we ran amniotic fluid on an IgY immunodepletion column and observed significant

reduction in alpha-fetoprotein amount (data not published) After immunodepletion 2-D high

pH-low pH RP-RP will be performed followed by nanoLCMSMS on the amaZon ETD ion trap

We will perform mass spectrometry experiments using the 155 week amniotic fluid and 175

week amniotic fluid which have average protein contents ~2 mgmL We anticipate a minimum

of tenfold increase in the proteome coverage of amniotic fluid and provide meaningful

hypothesis driven biological experiments from this work

Phosphoproteomics of JNK Activation

c-Jun N-terminal kinase (JNK) is an important cellular mediator of stress activated

signaling Under conditions of oxidative stress JNK is activated resulting in the downstream

phosphorylation of a large number of proteins including c-Jun However the cellular response

to JNK activation is extremely complex and JNK activation can result in extremely different

physiological outcomes For example JNK is at the crossroads of cellular death and survival

and exhibits both pro-death and pro-survival activities These highly disparate outcomes of JNK

activation are highly contextual and depend on the type of stressor and duration of stress In the

brain JNK has been shown to be activated in models of Alzheimerrsquos disease Parkinsonrsquos

disease amyotrophic lateral sclerosis and stroke It is thought that JNK is activated in these

diseases as a result of oxidative stress and it is unknown whether this activation is pro-survival or

214

pro-death To elucidate JNK signaling in the brain we chose to analyze primary cortical

astrocytes under conditions of oxidative stress Since hydrogen peroxide is a physiologically

relevant oxidant in the brain and known to activate JNK we chose to use this to treat astrocytes

and then analyze changes to the phosphoproteome by mass spectrometry By doing this we

hope to elucidate important JNK-dependent targets involved in stress signaling in the brain and

that identifying these targets could lead to the identification of novel disease mechanisms and

potentially new therapeutic targets for neurodegeneration Specifically we plan on performing

stable isotope labeling by amino acids (SILAC) of astrocytes for control hydrogen peroxide

treated and hydrogen peroxide treated with JNK inhibitor The SILAC labeled astrocyte cell

lysate will be digested enriched using Fe-NTA IMAC and separated using 2-D high pH-low pH

RP-RP and infused into the amaZon ETD ion trap The protocol is similar to the yeast

comparative phosphoproteomics from Chapter 5 After analysis we plan on processing the data

using ProteoIQ to identify phosphoproteins with significant changes

Immunoprecipitation Followed by Mass Spectrometry

Stb3 Mass Spectrometry Analysis

Stb3 (Sin3-binding protein) has previously been shown to change location depending on

the presence or absence of glucose (Heideman Lab Dr Michael Conways Thesis) An

immunoprecipitation of Stb3 was performed in parallel to the work described in Chapter 5 Two

separate experiments were performed one with wild type Stb3 and another tagged with myc for

improved protein immunoprecipitation Myc tagging is a polypeptide tag which can be

recognized by an antibody and provide a better enrichment compared to using a Stb3 antibody

alone The myc tagging was done because of the low abundance of Stb3 and the limited amount

of protein that was pulled down by the antibody for Stb3 Immunopreciptation experiments were

215

performed for both starved and glucose fed samples All samples were tryptically digested

followed by low pH RP nanoLCMSMS Both CID and ETD were used for fragmentation

analysis of each sample (n=2) The Stb3 wild type had two peptide hits in one MS run which is

not surprising due to low Stb3 antibody affinity In contrast a significant amount of Stb3 was

pulled down from Myc tagged starved and glucose fed samples For the glucose starved

samples 22 unique peptides from Stb3 were identified whereas glucose fed samples yielded 21

unique Stb3 peptides identified (unpublished data) The identified Stb3 in the myc samples

allowed us to investigate what other proteins were pulled down that are not present in the wild

type samples

From previous work by our collaborator Dr Heideman it had been suggested that Stb3

forms a complex with Sin3 and Rpd3 Our proteomic data shows multiple significant peptide

hits for Sin3 and Rpd3 in myc tagged but not in wild type where Stb3 is only observed once

with a low Mascot score When looking at the unique proteins identified in myc tagged glucose

fed sample but not included in the wild type samples the myc fed sample contained eight unique

ribosomal proteins with numerous unique peptides for each The ribosomal proteins identified in

myc fed sample supports the novel hypothesis that in the glucose fed state the complex of Stb3

Rpd3 and Sin3 interact with ribosomal proteins Other proteins unique to myc tagged glucose

starved samples are glyceraldehyde-3-phosphate dehydrogenase 2 and transcriptional regulatory

protein UME6 Also three proteins were observed in both myc fed and starved but not in the

wild type samples including transposon Ty1-DR1 Gag-Pol polyprotein(YD11B) SWIRM

domain-containing protein (FUN19) and RNA polymerase II degradation factor 1 (DEF1) Our

proteomics data for the Stb3 immunoprecipitation experiments with starved vs glucose fed

216

samples provide exciting evidence to support previous observation made by the Heideman group

and highlight the utility of MS-based approach to deciphering protein-protein interactions

Conclusions

The majority of the work described in this dissertation revolves around sample

preparation for proteomics and peptidomics with a focus on generating biologically testable

hypotheses In the area of neuropeptides CPRP was fully sequenced analytical methods were

transformed for use in an ETD ion trap and sub-microg peptides can now be analyzed by mass

spectrometry after MWCO separation In the field of comparative proteomics comparisons

between mouse CSF in GFAP overexpressors and control or rat adapted scrapie model and

control CSF yeast fed vs glucose starved cell extract have all been presented Collectively this

thesis has developed new techniques for neuropeptide sample preparation and presented

numerous comparative proteomic analyses of various biological samples and how the proteomes

are dynamically perturbed by various treatments and disease conditions

217

Appendix 1

Protocols for sample preparation for mass spectrometry based

proteomics and peptidomics

218

Small Scale Urea Tryptic Digestion (below 20 microg)

1 Concentratedilute sample to 10 μL starting volume

2 All solutions are in a 50mM NH4HCO3 buffer (395mgmL)

3 Add 1 μL of 05M Dithiothreitol (DTT) (77mg100μL) and 8 μL of urea solution

(400mg05mL) to the sample

4 Place sample in 37degC water bath for 45 minutes

5 Add 27μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

6 Quench reaction with 1μL of DTT solution and let sit for 10 minutes

7 Dilute with 70μL of NH4HCO3 solution

8 Use 1μL of trypsin (05μgμL)

9 Allow to digest overnight in 37degC water bath

10 Acidify with 10μL 10 formic acid

11 Perform solid phase extraction using tips dependent of sample amount

a Sub-5μg amounts ndash Millipore Ziptips

b 5-75μg amounts ndash Bond-Elut tips from Agilent (OMIX tips)

12 Dry down in Speedvac as needed for experiment

219

Small Scale ProteaseMAX Tryptic Digestion (below 20 microg)

1 Concentratedilute sample to 10 μL starting volume

2 All solutions are in a 50mM NH4HCO3 buffer (395mgmL)

3 Add 1 μL of 05M Dithiothreitol (DTT) (77mg100μL) and 2 μL of 1 of

ProtesaeMAX (Promega) to the sample

4 Place sample in 37degC water bath for 45 minutes

5 Add 27μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

6 Quench reaction with 1μL of DTT solution and let sit for 10 minutes

7 Dilute with 70μL of NH4HCO3 solution

8 Use 1μL of trypsin (05μgμL)

9 Add 1 μL ProteaseMAX and let sit for 3-4 hours

10 Acidify with 2μL 10 formic acid

11 Dry down in Speedvac as needed for experiment

220

Large Scale Urea Tryptic Digestion (mg of proteins)

1 All solutions are in a 50 mM NH4HCO3 buffer (395 mgmL)

2 Add 20μL of 05M Dithiothreitol (DTT) (77mg100μL) and 160μL of urea solution

(400mg05mL) to sample

3 Allow sample to denature 45-60 minutes in a 37degC water bath

4 Add 54μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

5 Quench reaction with 20μL of DTT solution

6 Dilute with 14mL of NH4HCO3 solution

7 Add 100μg of trypsin

8 Allow to digest overnight in 37degC water bath

9 Acidify sample with 100μL of 10 formic acid

10 Perform solid phase extraction with Hypersep C18 Cartridges with 100 mg of C18

bead volume (Thermo)

11 Dry down with the Speedvac as needed for experiment

221

Fe-NTA Preparation from Ni-NTA Beads

1 Centrifuge (5 min at 140 rcf) and use a magnet to keep beads in place as supernatant

is removed

2 Wash 3 times with 800μL of H2O (using same technique of centrifuging and using

magnet to keep beads in places as supernatant is removed)

3 Add 800μL of 100mM EDTA (372mgmL) in a 50mM NH4HCO3 (395mgmL)

buffered solution (pH around 8) and vortex for 30 minutes to chelate the Ni

centrifuge and remove supernatant

4 Wash 3 times with 800μL of H2O

5 Add 800μL of 10mM FeCl3 hexahydrate (27mgmL) and vortex for 30 minutes to

bind Fe to beads centrifuge and remove supernatant

6 Wash 3 times with 800μL H2O

7 Resuspend in a storage solution of 111 ACNMeOH001 Acetic Acid (1mL total)

222

Fe-NTA IMAC Phospho-enrichment

1 Use wash solution (80ACN 01Formic Acid) to rinse beads vortex 1 minute

centrifuge and remove supernatant

2 Add sample (around 500μL in 80ACN 01Formic Acid) and vortex 40 minutes to

allow sample to bind dispose of supernatant after centrifuging

3 Wash 3 times with 200μL of wash solution discard supernatant

4 Add 200μL of elution solution (5050 ACN5Ammonium Hydroxide) vortex 15

minutes and save supernatant

5 Add 200μL of elution solution vortex 10 minutes and save supernatant

6 Wash 2 time with wash solution (collect supernatant of first wash)

7 If reusing store in 1mL solution of 111 ACNMeOH001 Acetic Acid

223

High pH Off-line Separation

1) In general a minimum of 20 microg of peptides are needed to gain any benefit

from off-line 2D fractionation It is better to inject 100 microg of peptides on

column

2) Use a Gemini column or a similar column that can handle pH=10 and for this

protocol a 21 mm x 150 mm column was used

3) Prepare ldquobuffer Ardquo 25 mM NH4 formate pH=10 Also prepare ldquobuffer Brdquo

4) Dry down desired sample and reconstitute in buffer A

5) Inject based off sample loop (current Alliance HPLC has a 50 microL sample

loop)

6) Run gradient at bottom of the page collecting fractions every 3 minutes except

for the 1st minute which is the void volume

7) Optional If you want to reduce instrument time you can combine fractions 1

with 12 2 with 13 etc until 11 with 22

Time Mobile phase A Mobile phase B Flow Rate

05mlmin

0 98 2 05 mLmin

65rsquo 70 30 05 mLmin

65rsquo1rdquo 5 95 05 mLmin

70 5 95 05 mLmin

224

Non Membrane Glycoprotein Enrichment

1 For protocols involving membrane glycoprotein enrichment see Dr Xin Weirsquos

thesis

2 Add 75 microL of Wheat germ agglutinin (WGA) bound to agarose beads and 150 microL

of Concanavalin A (ConA) to a Handee spin cup (Thermo) Wash 2 times with

lectin affinity chromatography (LAC) binding buffer (015 M NaCl 002 M Tris-

HCl 1 mM CaCl2 pH=74) centrifuging at 400 g for 30 seconds

3 Place stopper on bottom of spin cup and add 200 microg of sample (500 microL for CSF)

Bring up to 300 microL using lectin LAC binding buffer

4 Incubate for 1 hour with continuous mixing at room temperature

5 Centrifuge at 400 g for 30 seconds

6 Perform two more 300 microL LAC binding washes followed by centrifugation

7 Add 300 microL LAC eluting buffer (0075 M NaCl 001 M Tris-HCl 02 M alpha-

methyl mannoside 02 M alpha-methyl glucoside and 05 M acetyl-D-

glucosamine) vortex for 10 minutes (have stopper in place while vortexing)

centrifuge and collect

7 Add another 300 microL LAC eluting buffer centrifuge and collect

225

MWCO separation for Sub-microg peptide concentrations

1 Wash molecular weight cut-off (MWCO) with 5050 waterMeOH centrifuge at

14000 g for 5 minutes for 10 kDa filter (time will vary Millipore Amicon Ultra

filters)

2 Wash with 100 water centrifuge at 14000 g for 5 minutes

3 Add methanol to the sample to get the concentration to 30 methanol and add

salt to achieve a roughly 05 M NaCl before adding the sample to the MWCO

4 Centrifuge at 14000 for 10 minutes collect flow through

226

Immunoprecipitation

Modified from Thermo Fisher Scientific Classic IP Kit

1 10 microL Polyclonal antibody added to 1 microg protein standard in a Handee spin cup

(Thermo) diluted with 1x TBS buffer to 400 microL and incubated overnight at on

shakerend-over-end rotator

2 Wash Protein GA beads with 2x 200microL TBS buffer and added to the

antibodysample for 15 hours at 4oC

3 Centrifuge at 400 g for 30 seconds and discard flow through

4 Wash 3x with 100microL TBS buffer centrifuge at 400 g for 30 seconds and discard

flow through

5 Wash with 1x conditioning solution (neutral pH buffer) centrifuge at 400 g for 30

seconds and discard flow through

6 Eluted with elution buffer (2x100microL) centrifuge at 400 g for 30 seconds and

collect flow through

227

C18 Solid Phase Extraction (SPE)

1 Determine amount of peptides for SPE ge5 microg use Ziptips from Millipore If

between 5-75 microg use OMIX from Agilent 100 microL version and if ge75 microg use SPE

cartridges such as 100 mg Hypersep from Thermo

2 Ensure no detergents are in the sample and it is acidified

3 The three SPE procedures all use the same sets of solutions only volumes vary

4 3x Wetting solution (100 ACN) for 60 microL OMIX (20 microL ziptip and 15 mL for

100 mg cartridge)

5 3x Wash solution (01 formic acid in 100 H2O) same volumes as 4

6 Draw up the sample about 10 times minimum (generally 30-40 is preferred)

without letting the bead volume dry out

7 1X Wash solution same volumes as 4

8 Back load the eluting solution 01 formic acid in 5050 H2OACN into the

Ziptips (15 microL) or OMIX (50 microL) tips For the SPE cartridge add 700 microL of

eluting solution

9 Dry down and prepare for next step in sample preparation

228

Laser Puller Programs and Protocols

1) Obtain about two feet of Fused-silica capillary with 75 μm id and 360 μm od

2) Wash with methanol and then air dry using the bomb

3) Cut into one foot or desired length

4) Use a lighter to burn the middle portion (about an inch in length) of the tubing

5) Remove the ash by rinsing with methanol and rubbing with a Kimwipe

6) Make sure the laser puller has been on for at least 30 minutes before use to allow

for the instrument to warm up

7) Place capillary in instrument with the burnedexposed portion in the center

making sure that the length of the capillary is pulled taut

8) Enter desired program (next page) and press pull

229

Laser Puller Programs

Program 99 (default lab program)

Heat Filament Velocity Delay Pull

250 0 25 150 15

240 0 25 150 15

235 0 25 150 15

245 0 25 150 15

Program 97 (developed for larger inner diameter tips)

Heat Filament Velocity Delay Pull

230 - 25 150 -

220 - 25 150 -

215 - 25 150 8

230

On column Immunodepletion (serum plasma CSF amniotic fluid)

1) Prepare buffer A ldquoDilution Bufferrdquo 10 mM Tris-HCl pH=74 150 mM NaCl

2) Prepare buffer B ldquoStripping Bufferrdquo 01 M Glycine-HCl pH=25

3) Prepare buffer C ldquoNeutralization Bufferrdquo 100 mM Tris-HCl pH=80

4) Determine loading amount (LC1=~1000 ug of serum or plasma less for CSF due

to the increased amount of albumin percentage in CSF)

5) Add Dilution buffer to sample before injection and ensure the sample is proper

pH (~7)

6) Use gradient below

Time A B C Flow Rate

(mLmin)

0rsquo 100 0 0 02

4rsquo59rdquo 100 0 0 02

5rsquo 100 0 0 05

8rsquo59rdquo 100 0 0 05

9rsquo 0 100 0 05

22rsquo 0 100 0 05

22rsquo1rdquo 0 0 100 05

39rsquo 0 0 100 05

7) Store the column in 1x dilution buffer until next use

231

Small Scale Immunodepletion (100 microL of CSF)

1) Use 75 microg (per 100 microL CSF) of Sigma Seppro IgY-14 or IgY-7 bead slurry

2) Add an equal volume of 2x dilution buffer (20 mM Tris-HCl pH=74 300 mM

NaCl) to the starting amount of CSF

3) Add to a spin cup with a filter and allow to mix for 30 minutes

4) Centrifuge at 400 g for 30 seconds and collect the flow through

5) Add 50 microL of 1x dilution buffer mix for 5 minutes centrifuge at 400 g for 30

seconds and collect the flow through

6) Use 1x stripping washes (01 M Glycine-HCl pH=25) centrifuge as before and

discard Repeat four times

7) Use 1x neutralization buffer (100 mM Tris-HCl pH=80) centrifuge as before

and discard Repeat two times

8) Store the beads in the spin column in 1x dilution buffer until next use

232

Alliance Maintenance Protocol

Seal Wash

10 methanol no acetonitrile This wash cleans behind the pump-head seals to

ensure proper lubrication Minimum once per week

1 On instrument interface navigate MenuStatus gt Diag gt Prime SealWsh gt Start

2 Press Stop after 5 minutes

Prime Injector

10 methanol for maintenance high organic solvent for dirty runs (eg 95

acetonitrile) Done before injecting any real samples to ensure no bubbles are

present in the injector Minimum once per week

1 On instrument interface navigate MenuStatus gt Diag gt Prime NdlWsh gt Start

2 After completion press Close

Purge Injector

Solvent is dependent on run Run this protocol at beginning of experiments each day

Minimum once per week for maintenance

1 On instrument interface navigate MenuStatus gt Status screen

2 Navigate Direct Function gt 4 Purge Injector gt OK

3 Set Sample loop volumes 60 leave Compression Check unchecked gt OK

Prime Solvent Pumps

Solvent is dependent on run If solvents are changed run this protocol Minimum

once per week for maintenance

1 On instrument interface navigate MenuStatus gt Status screen

2 Using arrow keys choose composition A type 100 Enter x4

3 Navigate Direct Function gt 3 Wet Prime gt OK

4 Set Flow Rate 7000 mLmin Time 100 min gt OK

5 Repeat for all changedactive solvent pumps

Condition Column

Dependent on user Use starting conditions for run

1 On instrument interface navigate MenuStatus gt Status screen

2 Using arrow keys type starting solvent compositions for run

3 Navigate Direct Function gt 6 Condition Column gt OK

4 Set Time as desired

233

Appendix 2

List of Publications and Presentations

234

PUBLICATIONS

ldquoDiscovery and characterization of the crustacean hyperglycemic hormone precursor related

peptides (CPRP) and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes

sapidus using multiple tandem mass spectrometry techniquesrdquo Hui L Cunningham R Zhang

Z Cao W Jia C Li L J Proteome Res 2011 10(9) 4219-29

ldquoInvestigation and reduction of sub-microgram peptide loss using molecular weight cut-off

fractionation prior to mass spectrometric analysisrdquo Cunningham R Wang J Wellner D Li L

Journal of Mass Spectrometry In Press

ldquoMass spectrometry-based proteomics and peptidomics for systems biology and biomarker

discoveryrdquo Cunningham R Ma D Li L Frontiers in Biology 2012 7(4) 313-35

ldquoProtein changes in immunodepleted cerebrospinal fluid from transgenic mouse models of

Alexander disease detected using mass spectrometryrdquo Cunningham R Jany P Messing A Li

L Journal of Proteome Research Submitted

ldquoInvestigation of the differences in the phosphoproteome between starved vs glucose fed

Saccharomyces cerevisiaerdquo Cunningham R Grunwald D Wellner D Conway M Heideman

W Li L In preparation

ldquoIdentification of astrocytic JNK targets by phosphoproteomics using mass spectrometryrdquo

Cunningham R Dowell J Wang J Wellner D Li L Milhelm M In preparation

ldquoGenomic and proteomic profiling of rat adapted scrapierdquo Herbst A Cunningham R Wellner

D Wang J Ma D Li L Aiken J In preparation

235

INVITED SEMINARS AND CONFERENCE PRESENTATIONS

Robert Cunningham Davenne Mayour and Shoshanna Coon ldquoMorphology and Thermal

Stability of Monolayers on Porous Siliconrdquo The 231th

ACS National Meeting 2006 Atlanta

GA

Robert Cunningham Xin Wei Paige Jany Albee Messing and Lingjun Li ldquoMass

Spectrometry-Based Analysis of Cerebrospinal Fluid Peptidome and Proteome for Biomarker

Discovery in Alexander Diseaserdquo The 57th

ASMS Conference 2009 Philadelphia PA

Robert Cunningham ldquoWhat to Expect in a PhD Graduate Schoolrdquo Invited seminar University

of Northern Iowa 2010 Cedar Falls IA

Robert Cunningham Dustin Frost Albee Messing and Lingjun Li ldquoInvestigation of an

Optimized ProteaseMAX Assisted Trypsin Digestion of Human CSF for Pseudo-SRM

Quantification of GFAP and Identification of Biomarkersrdquo The 58th

ASMS Conference 2010

Salt Lake City UT

Robert Cunningham Paige Jany Albee Messing and Lingjun Li ldquoMass Spectrometry-Based

Analysis of GFAP Overexpressor Micersquos Cerebrospinal Fluid for Protein Biomarker Discovery

in Alexander Diseaserdquo Oral presentation at Pittcon Conference and Expo 2011 2011 Atlanta

GA

Robert Cunningham Michael Conway Daniel Wellner Douglas Grunwald Warren

Heideman and Lingjun Li ldquoDevelopment of optimized phosphopeptide enrichment methods for

comparison of starved and glucose fed yeast Saccharomyces cerevisiaerdquo The 59th

ASMS

Conference 2011 Denver CO

Robert Cunningham Paige Jany Albee Messing and Lingjun Li ldquoMass Spectrometry-Based

Analysis of GFAP Overexpressor Micersquos Cerebrospinal Fluid for Protein Biomarker Discovery

in Alexander Diseaserdquo 2011 Wisconsin Human Proteomics Symposium 2011 Madison WI

Page 4: Mass Spectrometry Applications for Comparative Proteomics

iii

Table of Contents

Page

________________________________________________________________________

Acknowledgements i

Table of Contents iii

Abstract iv

Chapter 1 Introduction brief background and research summary 1

Chapter 2 Mass spectrometry-based proteomics and peptidomics for

biomarker discovery and the current state of the field 15

Chapter 3 Protein changes in immunodepleted cerebrospinal fluid from

transgenic mouse models of Alexander disease detected

using mass spectrometry 73

Chapter 4 Genomic and proteomic profiling of rat adapted scrapie 110

Chapter 5 Investigation of the differences in the phosphoproteome

between starved vs glucose fed Saccharomyces cerevisiae 139

Chapter 6 Use of electron transfer dissociation for neuropeptide

sequencing and identification 166

Chapter 7 Investigation and reduction of sub-microgram peptide loss

using molecular weight cut-off fractionation prior to

mass spectrometric analysis 187

Chapter 8 Conclusions and future directions 206

Appendix 1 Protocols for sample preparation for mass spectrometry

based proteomics and peptidomics 217

Appendix 2 Publications and presentations 233

_______________________________________________________________________

iv

Mass Spectrometry Applications for Comparative Proteomics and

Peptidomic Discovery

Robert Stewart Cunningham

Under the supervision of Professor Lingjun Li

At the University of Wisconsin-Madison

Abstract

In this thesis multiple biological samples from various diseases models or

treatments are investigated using shotgun proteomics and improved methods are

developed to enable extended characterization and detection of neuropeptides In general

this thesis aims to expand upon the rapidly evolving field of mass spectrometry (MS)-

based proteomics and peptidomics by primarily enhancing small scale sample analysis

A review of the current status and progress in the field of biomarker discovery in

peptidomics and proteomics is presented To this rapidly expanding body of literature

our critical review offers new insights into MS-based biomarker studies investigating

numerous biological samples methods for post-translational modifications quantitative

proteomics and biomarker validation Methods are developed and presented including

immunodepletion for small volume cerebrospinal fluid (CSF) samples for comparison of

the CSF proteomes between an Alexander disease transgenic mouse model with

overexpression of the glial fibrillary acidic protein and a control animal This thesis also

covers the application of the small scale immunodepletion of CSF for comparative

proteomic analysis of a novel rat adapted scrapie (RAS) model for prion disease and

v

compares the RAS CSF proteome to control rat CSF using MS Large scale

phosphoproteomics of starved vs glucose fed yeast is presented to better understand the

phosphoproteome changes that occur during glucose feeding Method development for

neuropeptide analysis is expanded upon using electron transfer dissociation (ETD)

fragmentation to successfully sequence for the first time the crustacean hyperglycemic

hormone precursor-related peptide (CPRP) from the blue crab Callinectes sapidus In

addition a method for ETD sequencing of sulfonated neuropeptides using a magnesium

salt adduct in an ion trap mass spectrometer is reported This thesis also reports on a

method for sub-microg peptide isolation when using a molecular weight cut-off filtration

device to improve sample recovery by over 2 orders of magnitude All the protocols used

throughout the work are provided in an easy to use step-by-step format in the Appendix

Collectively this body of work extends the capabilities of mass spectrometry as a

bioanalytical tool for shotgun proteomics and expands upon methods for neuropeptide

discovery and analysis

1

Chapter 1

Introduction Brief Background and Research Summary

2

Abstract

Mass spectrometry based comparative proteomics and improved methods for

neuropeptide discovery have been reported in this thesis A very brief introduction of crustacean

neuropeptidomics is covered in this chapter followed by Chapter 2 which covers in great detail

comparative proteomics using mass spectrometry with an emphasis on biomarker discovery

Chapter 3 reports a novel immunodepletion technique used for comparative proteomics between

glial fibrillary acidic protein (GFAP) overexpressor and control mouse cerebrospinal fluid (CSF)

Chapter 4 involves rat CSF comparative proteomics between rat adapted scrapie and control

animals Chapter 5 details comparisons of phosphoproteomes in Saccharomyces cerevisiae

(Bakerrsquos yeast) between starved vs glucose fed conditions Chapter 6 investigates the use of

electron transfer dissociation (ETD) for kDa sized neuropeptides and neuropeptides with tyrosine

sulfonation Chapter 7 presents a molecular weight cut-off (MWCO) protocol to allow sub-microg

peptides to be recovered Chapter 8 provides a conclusion to this thesis and outlines future

directions for certain projects

3

Background

Mass spectrometry (MS) requires gas phase ions for experimental measurement and

intact large nonvolatile biomolecules proved difficult to be analyzed by electron ionization or

chemical ionization until the invention of two soft ionization techniques matrix-assisted laser

desorptionionization (MALDI)1 and electrospray ionization (ESI)

2 ESI and MALDI are the

two most common soft ionization techniques for mass spectrometry Once ionized molecules

such as peptides or proteins can be separated by their mass to charge ratios (mz) using various

mass analyzers manipulating electric andor magnetic fields ESI and MALDI mass

spectrometric techniques have become central analytical methods in biological sciences because

they are suitable for nonvolatile thermally labile analytes in femtomole quantities3 ESI allows

the coupling of high pressure liquid chromatography and the constant flow of solvent is

electrically charged at the outlet to produce a Taylor cone4 During desolvation the Raleigh

limit is reached and a coulombic explosion occurs commonly producing multiply charged ions

A modification of ESI is nanoLC-ESI which operates at nLmin flow rates to analyze low sample

amounts such as proteins or peptides in biological samples5 NanoLC-ESI uses less solvent as

the source because of the reduced flow rate of standard LCMS with an ESI source and nanoLC-

ESI is the predominate inlet for shotgun proteomic ESI MS experiments6 Conversely MALDI

can use sub-microL volume of sample and be co-crystallized with an ultraviolet absorbing organic

matrix that is obliterated using a laser at appropriate wavelength to generate gas phase ions

Alternatively MALDI has the unique capability to work with tissue samples and ionize in the

solid state instead of liquid like ESI

4

Mass analyzers require an operating pressure between 10-4

-10-10

Torr to allow proper ion

transfer and separationisolation of the ion based upon its mz Numerous mass analyzers are

currently available and each have their own strengths and weaknesses as shown in Figure 1 The

biomolecules are separated by the mass analyzers and detected without fragmentation which is

termed MS analysis Following MS detection tandem mass spectrometry (MSMS) of the

original precursor ion can be performed to provide additional structural information such as a

ladder sequence of amino acids for peptides Numerous fragmentation techniques are available

for proteomics such as collision induced dissociation (CID) electron transfer dissociation (ETD)

or high energy collision induced dissociation (HCD) Each of these fragmentation techniques

have their own benefits in proteomics experiments and are discussed in depth in Chapter 2 The

background and current status for comparative proteomics with specific emphasis on biomarker

analysis are covered in Chapter 2

Neuropeptidomic Method Development in the Crustacean Model System

Utilizing Mass Spectrometry

Chapter 6 and 7 focus on sample preparation and ETD fragmentation techniques to

characterize neuropeptides in the neuroendocrine organs in the crustacean nervous system

Neuropeptides are short chains of amino acids which are a diverse class of cell-cell signaling

molecules in the nervous system Neuropeptides have been investigated for being involved in

numerous physiological processes such as memory7 learning

8 depression

9 pain

10 reward

11

reproduction12

sleep-wake cycles13

homeostasis14

and feeding15-17

Figure 2 depicts how

neuropeptides are synthesized as pre-prohormones in the rough endoplasmic reticulum and

5

packaged in the Golgi apparatus After being packaged these pre-prohormones are processed

into bioactive peptides within the vesicle which is occurring during vesicular transport down an

axon and released into the synaptic cleft Secreted neuropeptides stimulate the post synaptic

neurons by interacting with G-protein coupled receptors at the chemical synapse

The crustacean model nervous system is well-defined neural network which has been

used in neurobiology and neuroendocrine (neurosecretory) research and thus lends itself for

studying neuromodulation18-22

Figure 3 shows the locations of several neuroendocrine organs in

the crustacean Callinectes sapidus including the sinus gland (SG) which is studied in Chapter 6

The Lingjun Li lab has focused on neuropeptide discovery and function in crustacean

neuroendocrine organs using mass spectrometry23-25

The work presented in Chapters 6 and 7

expand on sample preparation and analytical tools to further investigate the neuropeptidome

Research Overview

Comparative Proteomics of Biological Samples

Chapter 2 provides a thorough review of comparative proteomics and biomarker analysis

using mass spectrometry The scientific community has shown great interest in the field of mass

spectrometry-based proteomics and peptidomics for its applications in biology Proteomics

technologies have evolved to generate large datasets of proteins or peptides involved in various

biological and disease progression processes producing testable hypotheses for complex

biological questions This chapter provides an introduction and insight into relevant topics in

proteomics and peptidomics including biological material selection sample preparation

separation techniques peptide fragmentation post-translational modifications quantification

6

bioinformatics and biomarker discovery and validation In addition current literature and

remaining challenges and emerging technologies for proteomics and peptidomics are discussed

Chapter 3 investigates comparative proteomics between a GFAP overexpressor mouse

model and a control mouses cerebrospinal fluid (CSF) CSF is a low protein content biological

fluid with dynamic range spanning at least nine orders of magnitude in protein content and is in

direct contact with the brain but consist of very abundant proteins similar to serum which require

removal A modified IgY-14 immunodepletion treatment is presented to remove abundant

proteins such as albumin to enhance analysis of the low volumes of CSF that are obtainable

from mice These GFAP overexpressor mice are models for Alexander disease (AxD) a severe

leukodystrophy in humans From the CSF of control and transgenic mice we present the

identification of 266 proteins with relative quantification of 106 proteins Biological and

technical triplicates are performed to address animal variability as well as reproducibility in mass

spectrometric analysis Relative quantitation is performed using distributive normalized spectral

abundance factor (dNSAF) spectral counting analysis A panel of biomarker proteins with

significant changes in the CSF of GFAP transgenic mice are identified with validation from

ELISA and microarray data demonstrating the utility of our methodology and providing

interesting targets for future investigations on the molecular and pathological aspects of AxD

Chapter 4 explores the CSF proteomics of a novel prion model called rat adapted scrapie

(RAS) A similar IgY immunodepletion technique is presented as in Chapter 3 and is similarly

used to reduce the abundant proteins in CSF CSF from control and RAS (biological N=5

technical replicates N=3) were digested and separated using one dimensional reversed-phase

nanoLC separation In total 512 proteins 167 non-redundant protein groups and 1032 unique

peptides are identified with a 1 FDR Comparative analysis was done using dNSAF spectral

7

counting and 21 proteins were significantly up or down-regulated The proteins are compared to

the 1048 differentially regulated genes and additionally compared to previously published

proteins showing changes consistent with other prion animal models Of particular interest is

RAS specific proteins like RNAseT2 which is not up-regulated in mouse prion disease but is

designated as upregulated in both the genomic and proteomics data for RAS

Chapter 5 explores comparative proteomics between starved and glucose fed

Saccharomyces cerevisiae (bakers yeast) Yeast depends on nutrients such as glucose to

survive and need to cycle between states of growth and quiescence Previous work by the

Heideman lab investigated the transcriptional response to fresh glucose in yeast26

Kinases such

as protein kinase A (PKA) and target of rapamycin kinase (TOR) are involved in the glucose

response so we described a large scale phosphoproteomic MS based study in this chapter

Yeast cell extract was digested and phosphopeptides were enriched by immobilized metal

affinity chromatography (IMAC) followed by two dimensional high pH-low pH reversed phase

(RP)-RP separation The low pH separation was infused directly into an ion trap mass

spectrometer with neutral loss triggered ETDfragmentation Specifically the CID fragmentation

can cause a neutral loss of 98z from the H3PO4 and can produce an incomplete fragmentation

pattern and lead to ambiguous identification so when the neutral loss is detected ETD MSMS

fragmentation is performed The neutral loss triggered ETD fragmentation is included in this

study to improve phosphopeptide identifications In total 477 phosphopeptides are identified

with 06 FDR and 669 phosphopeptides identified with 54 FDR Motif analysis of PKA and

phosphosite validation are performed as well

8

The future of comparative proteomics investigating small sample amounts or PTMs is

promising Further advances in enrichment separations science mass spectrometry

analyzersdetectors and bioinformatics will continue to create more powerful tools that enable

digging deeper in proteomics with both small (mouse CSF) and large (cell extracts) sample

amounts

Methods for Neuropeptide Analysis Using ETD fragmentation and Sample

Preparation

Chapter 6 investigates the utility of ETD fragmentation to sequence endogenous large

neuropeptides and labile tyrosine sulfation in the crustacean Callinectes sapidus The sinus

gland (SG) of the crustacean is a well-defined neuroendocrine site that produces numerous

hemolymph-borne agents including the most complex class of endocrine signaling moleculesmdash

neuropeptides One such peptide is the crustacean hyperglycemic hormone (CHH) precursor-

related peptide (CPRPs) which was not previously sequenced Electron transfer dissociation

(ETD) is used for sequencing of intact CPRPs due to their large size and high charge state In

addition ETD is used to sequence a sulfated peptide cholecystokinin (CCK)-like peptide in the

lobster Homarus americanus using a salt adduct Collectively this chapter presents two

examples of the utility of ETD in sequencing neuropeptides with large molecular weight or with

labile modifications

Chapter 7 reports a protocol on improving recovery of minute quantities of peptides by

adding methanol and a salt modifier (NaCl) to molecular weight cut-off membrane-based

centrifugal filters (MWCO) to enrich sub-microgram peptide quantities In some biological

9

fluids such as CSF the endogenous peptide content is very low and using pure water to perform

the MWCO separation produces too much sample loss Using a neuropeptide standard

bradykinin sample loss is reduced over two orders of magnitude with and without undigested

protein present The presence of bovine serum albumin (BSA) undigested protein and the

bradykinin standard lends evidence that sample loss occurs because of the MWCO and not the

presence of the protein Additionally a BSA tryptic digestion is presented where twenty-seven

tryptic peptides are identified from MALDI mass spectra after enriching with methanol while

only two tryptic peptides are identified after the standard MWCO protocol The strategy

presented in Chapter 7 greatly enhances recovery from MWCO separation for sub-microg peptide

samples

10

References

1 Tanaka K Waki H Ido Y Akita S Yoshida Y Yoshida T Matsuo T Protein and polymer analyses up to mz 100 000 by laser ionization time-of-flight mass spectrometry Rapid Communications in Mass Spectrometry 1988 2 (8) 151-153

2 Fenn J B Mann M Meng C K Wong S F Whitehouse C M Electrospray ionization for mass spectrometry of large biomolecules Science 1989 246 (4926) 64-71

3 Domon B Aebersold R Mass spectrometry and protein analysis Science 2006 312 (5771) 212-7

4 Whitehouse C M Dreyer R N Yamashita M Fenn J B Electrospray interface for liquid chromatographs and mass spectrometers Anal Chem 1985 57 (3) 675-9

5 Wilm M Mann M Analytical properties of the nanoelectrospray ion source Anal Chem 1996 68 (1) 1-8

6 Karas M Bahr U Dulcks T Nano-electrospray ionization mass spectrometry addressing analytical problems beyond routine Fresenius J Anal Chem 2000 366 (6-7) 669-76

7 Gulpinar M Yegen B The physiology of learning and memory Role of peptides and stress Curr Protein Pept Sci 2004 5 (6) 457-473

8 Ogren S O Kuteeva E Elvander-Tottie E Hokfelt T Neuropeptides in learning and memory processes with focus on galanin In Eur J Pharmacol 2010 Vol 626 pp 9-17

9 Strand F Neuropeptides general characteristics and neuropharmaceutical potential in treating CNS disorders Prog Drug Res 2003 61 1-37

10 Li W Chang M Peng Y-L Gao Y-H Zhang J-N Han R-W Wang R Neuropeptide S produces antinociceptive effects at the supraspinal level in mice Regul Pept 2009 156 (1-3) 90-95

11 Wu C-H Tao P-L Huang E Y-K Distribution of neuropeptide FF (NPFF) receptors in correlation with morphine-induced reward in the rat brain Peptides 2010 31 (7) 1374-1382

12 Tsutsui K Bentley G E Kriegsfeld L J Osugi T Seong J Y Vaudry H Discovery and evolutionary history of gonadotrophin-inhibitory hormone and kisspeptin new key neuropeptides controlling reproduction J Neuroendocrinol 2010 22 (7) 716-727

13 Piper D C Upton N Smith M I Hunter A J The novel brain neuropeptide orexin-A modulates the sleep-wake cycle of rats Eur J Neurosci 2000 12 (2) 726-730

14 Tsuneki H Wada T Sasaoka T Role of orexin in the regulation of glucose homeostasis - Tsuneki - 2009 - Acta Physiologica - Wiley Online Library Acta Physiol 2010

15 Kageyama H Takenoya F Shiba K Shioda S Neuronal circuits involving ghrelin in the hypothalamus-mediated regulation of feeding Neuropeptides 2010 44 (2) 133-138

16 Sweedler J V Li L Rubakhin S S Alexeeva V Dembrow N C Dowling O Jing J Weiss K R Vilim F S Identification and characterization of the feeding circuit-activating peptides a novel neuropeptide family of aplysia J Neurosci 2002 22 (17) 7797-7808

11

17 Jensen J Regulatory peptides and control of food intake in non-mammalian vertebrates Comp Biochem Physiol Part A Mol Integr Physiol 2001 128 (3) 469-477

18 Billimoria C P Li L Marder E Profiling of neuropeptides released at the stomatogastric ganglion of the crab Cancer borealis with mass spectrometry J Neurochem 2005 95 (1) 191-199

19 Cape S S Rehm K J Ma M Marder E Li L Mass spectral comparison of the neuropeptide complement of the stomatogastric ganglion and brain in the adult and embryonic lobster Homarus americanus J Neurochem 2008 105 (3) 690-702

20 Cruz Bermuacutedez N D Fu Q Kutz Naber K K Christie A E Li L Marder E Mass

spectrometric characterization and physiological actions of GAHKNYLRFamide a novel FMRFamide‐like peptide from crabs of the genus Cancer J Neurochem 2006 97 (3) 784-799

21 Grashow R Brookings T Marder E Reliable neuromodulation from circuits with variable underlying structure Proc Natl Acad Sci USA 2009 106 (28) 11742-11746

22 Ma M Szabo T M Jia C Marder E Li L Mass spectrometric characterization and physiological actions of novel crustacean C-type allatostatins Peptides 2009 30 (9) 1660-1668

23 Chen R Hui L Cape S S Wang J Li L Comparative Neuropeptidomic Analysis of Food Intake via a Multi-faceted Mass Spectrometric Approach ACS Chem Neurosci 2010 1 (3) 204-214

24 Li L Sweedler J V Peptides in the brain mass spectrometry-based measurement approaches and challenges Annu Rev Anal Chem 2008 1 451-483

25 Ma M Wang J Chen R Li L Expanding the crustacean neuropeptidome using a multifaceted mass spectrometric approach J Proteome Res 2009 8 (5) 2426-2437

26 Slattery M G Heideman W Coordinated regulation of growth genes in Saccharomyces cerevisiae Cell Cycle 2007 6 (10) 1210-9

12

Figure 1 Capabilities and performances of certain mass analyzers Check marks indicate

availability check marks in parentheses indicate optional + ++ and +++ indicate possible or

moderate goodhigh and excellentvery high respectively Adapted with permission from

reference 3

13

Figure 2 Synthesis processing and release of neuropeptides (a) Cartoon showing two

interacting neurons (b) The synthesis of neuropeptides in the neuronal organelles and their

transport down the axon to the presynaptic terminal is depicted (c) Shows neuropeptide release

and interaction with the dendrite of the post-synaptic cell (Adapted from PhD thesis by Dr

Stephanie Cape)

14

Figure 3 Schematic showing location of the neuronal tissues being analyzed in the MS studies

of neuropeptides in Callinectes sapidus The SGs (sinus glands located in the eyestalks of the

crab) and the POs (pericardial organs located in the chamber surrounding the heart) release

neurohormones into the hemolymph (crab circulatory fluid) The brain is near the STNS

(stomatogastric nervous system neural network that controls the motion of the gut and foregut)

which has direct connections to the STG (stomatogastric ganglion) The STG is located in an

artery so it is continually in contact with hemolymph (Adapted from PhD thesis by Dr Robert

Sturm)

15

Chapter 2

Mass Spectrometry-based Proteomics and Peptidomics for Biomarker

Discovery and the Current State of the Field

Adapted from ldquoMass spectrometry-based proteomics and peptidomics for systems biology and

biomarker discoveryrdquo Cunningham R Ma D Li L Frontiers in Biology 2012 7(4) 313-35

16

Abstract

The scientific community has shown great interest in the field of mass spectrometry-based

proteomics and peptidomics for its applications in biology Proteomics technologies have

evolved to produce large datasets of proteins or peptides involved in various biological and

disease progression processes producing testable hypothesis for complex biological questions

This review provides an introduction and insight to relevant topics in proteomics and

peptidomics including biological material selection sample preparation separation techniques

peptide fragmentation post-translation modifications quantification bioinformatics and

biomarker discovery and validation In addition current literature and remaining challenges and

emerging technologies for proteomics and peptidomics are presented

17

Introduction

The field of proteomics has seen a huge expansion in the last two decades Multiple factors have

contributed to the rapid expansion of this field including the ever evolving mass spectrometry

instrumentation new sample preparation methods genomic sequencing of numerous model

organisms allowing database searching of proteomes improved quantitation capabilities and

availability of bioinformatic tools The ability to investigate the proteomes of numerous

biological samples and the ability to generate future hypothesis driven experiments makes

proteomics and biomarker studies exceedingly popular in biological studies today In addition

the advances in post-translational modification (PTM) analysis and quantification ability further

enhance the utility of mass spectrometry (MS)-based proteomics A subset of proteomics

research is devoted to profiling and quantifying neurologically related proteins and endogenous

peptides which has progressed rapidly in the past decade This review provides a general

overview as outlined in Figure 1 of proteomics technology including methodological and

conceptual improvements with a focus on recent studies and neurological biomarker studies

Biological Material Selection

The choice of biological matrix is an important first step in any proteomics analysis The

ease of sample collection (eg urine plasma or saliva) versus usefulness or localization of

sample (eg specific tissue or proximity fluid) needs to be evaluated early on in a study design

Plasma derived by centrifugation of blood to remove whole cells is a very popular

choice in proteomics due to the high protein content (~65 mg ml1) and the ubiquitous nature of

blood in the body and the ability to obtain large sample amounts or various time points without

the need to sacrifice the animal or to perform invasive techniques Plasma is centrifuged

18

immediately after sample collection unlike serum where coagulation needs to occur first To

obtain plasma blood is collected in a tube with an anticoagulant added (ETDA heparin or

citrate) and centrifuged but previous reports have shown variable results when heparin has been

used as an anticoagulant2 Human Proteome Organization (HUPO) specifically recommends the

anticoagulants EDTA or citrate to treat plasma3 4

One of the primary concerns with plasma is

degradation of the protein content via endogenous proteases found in the sample5 One way to

address this problem is the use of protease inhibitors In addition freezethaw cycles need to be

minimized to prevent protein degradation and variability6 7

Plasma proteomics has seen

extensive coordinated efforts to start assessing the diagnostic needs using plasma8 HUPO also

has established a public human database for plasma and serum proteomics from 35 collaborating

labratories9 Large dynamic range studies have been performed on plasma with a starting sample

amount of 2625 microl (1575 mg) resulting in 3654 proteins identified with a sub 5 false

discovery rate10

The large dynamic range spanning across eleven orders of magnitude as visualized in

Figure 2 is one of the biggest obstacles in plasma proteomics Figure 2 also shows that as lower

abundance proteins are investigated the origins of those identified proteins are more diverse than

the most abundant proteins Recent mining of the plasma proteome showed an ability to search

for disease biomarker applications across seven orders of magnitude In addition the tissue of

origin for the identified plasma proteins were identified and its origin was more diverse as the

protein concentration decreased11

Plasma has been used as a source for biomarker studies such

as colorectal cancer12 13

cardiovascular disease14

and abdominal aortic aneurysm15

Even

though the blood brain barrier prevents direct blood to brain interaction neurological disorders

such as Alzheimerrsquos disease (AD) have had their proteomes studied using plasma16

19

An alternative sample derived from blood is serum which is plasma allowed to coagulate

instead of adding anti-coagulates The time for coagulation is usually 30 minutes and during that

time significant and random degradation from endogenous proteases can occur The additional

variability caused from the coagulation process can change the concentration of multiple

potentially valuable biomarkers As biodiversity between samples or organisms is a challenging

endeavor additional sample variability due to serum generation may be undesirable but serum is

still currently being used for biomarker disease studies17

Serum has been used to compare the

proteome differences in neurological diseases such as AD Parkinsonrsquos disease and amyotrophic

lateral sclerosis and a review can be found elsewhere discussing the subject18

Cerebrospinal fluid (CSF) has a long history as a surrogate biopsy of brain or spinal cord

in evaluating diseases of the central nervous system and has been used for studies in neurological

disorders due to being a rich source of neuro-related proteins and peptides19

The protein

composition of the most abundant proteins in CSF is well defined and numerous studies exist to

broaden the proteins identified20-22

CSF has an exceedingly low protein content (~04 μgμL)

which is ~100 times lower than serum or plasma and over 60 of the total protein content in

CSF consists of a single protein albumin23-25

In addition the variable concentrations of proteins

span up to twelve orders of magnitude further complicating analysis and masking biologically

relevant proteins to any given study26

One of the highest number of identified proteins is from

Schutzer et al with 2630 non-redundant proteins from 14 mL of pooled human CSF This study

involved the removal of highly abundant proteins by performing IgY-14 immunodepletion

followed by two dimensional (2D) liquid chromatography (LC) separation27

Studies have also

been performed to characterize individual biomarkers or complex patterns of biomarkers in

various diseases in the CSF28 29

One potential pitfall of CSF proteomic analysis is

20

contamination from blood which can be identified by counting red blood cells present or

examining surrogate markers from blood contamination other than hemoglobin such as

peroxiredoxin catalase and carbonic anhydrase30

A proof of principle CSF peptidomics study

identified numerous endogenous peptides associated with the central nervous system which can

be used as a bank for neurological disorder studies31

Numerous recent reports highlighted the

utility of CSF analysis for biomarker studies in AD32 33

medulloblastoma34

both post-mortem

and ante-mortem35

Cellular lysates offer the distinct advantage to work with a cell line yeast or bacteria

with large amounts of proteins available for analysis36 37

with Saccharomyces cerevisiae being

the most common cell lysate38 39

Other cell lines are also used including HeLa40

and E coli41

The ability to obtain milligrams of proteins easily to scale up experiments without animal

sacrifice offers a clear advantage in biological sample selection Current literature supports

cellular lysate as a valued and sought after source of proteins for large scale proteomics

experiments because of the ability to assess treatments conditions and testable hypotheses42-44

Cellular lysate from rat B104 neuroblastoma cell line was used as an in vitro model for cerebral

ischemia and showed abundance changes in multiple proteins involved in various neurological

disorders45

Other Sources of Biological Samples

Urine

The urine proteome appears to be another attractive reservoir for biomarker discovery

due to the relatively low complexity compared with the plasma proteome and the noninvasive

collection of urine Urine is often considered as an ideal source to identify biomarkers for renal

21

diseases due to the fact that in healthy adults approximately 70 of the urine proteome originate

from the kidney and the urinary tract 46

thus the use of urine to identify neurological disorders is

neglected However strong evidence have shown that proteins that are associated with

neurodegenerative diseases can be excreted in the urine47-49

indicating the application of urine

proteomics could be a useful approach to the discovery of biomarkers and development of

diagnostic assays for neurodegenerative diseases However the current view of urine proteome

is still limited by factors such as sample preparation techniques and sensitivity of the mass

spectrometers There has been a tremendous drive to increase the coverage of urine proteome

In a recent study Court et al compared and evaluated several different sample preparation

methods with the objective of developing a standardized robust and scalable protocol that could

be used in biomarkers development by shotgun proteomics50

In another study Marimuthu et al

reported the largest catalog of proteins in urine identified in a single study to date The

proteomic analysis of urine samples pooled from healthy individuals was conducted by using

high-resolution Fourier transform mass spectrometry A total of 1823 proteins were identified

of which 671 proteins have not been previously reported in urine 51

Saliva

For diagnosis purposes saliva collection has the advantage of being an easy and non-

invasive technique The recent studies on saliva proteins that are critically involved in AD and

Parkinsonrsquos diseases suggested that saliva could be a potentially important sample source to

identify biomarkers for neurodegenerative diseases Bermejo-Pareja et al reported the level of

salivary Aβ42 in patients with mild AD was noticeably increased compared to a group of

controls 52

In another study Devic et al identified two of the most important Parkinsons

22

disease related proteinsmdash α-synuclein (α-Syn) and DJ-1 in human saliva 53

They observed that

salivary α-Syn levels tended to decrease while DJ-1 levels tended to increase in Parkinsons

disease The published results from this study also suggest that α-Syn might correlate with the

severity of motor symptoms in Parkinsons disease Due in part to recent advancements in MS-

based proteomics has provided promising results in utilizing saliva to explore biomarkers for

both local and systemic diseases 54 55

the further profiling of saliva proteome will provide

valuable biomarker discovery source for neurodegenerative diseases

Tissue

Compared to body fluids such as plasma serum and urine where the proteomic analysis

is complicated by the wide dynamic range of protein concentration the analysis of tissue

homogenates using the well-established and conventional proteomic analysis techniques has the

advantage of reduced dynamic range However the homogenization and extraction process may

suffer from the caveat that spatial information is lost which would be inadequate for the

detection of biomarkers whose localization and distribution play important roles in disease

development and progression Matrix-assisted laser desorptionionization (MALDI) imaging

mass spectrometry (IMS) is a method that allows the investigation of a wide range of molecules

including proteins peptides lipids drugs and metabolites directly in thin slices of tissue 56-59

Because this technology allows for identification and simultaneous localization of biomolecules

of interests in tissue sections linking the spatial expression of molecules to histopathology

MALDI-IMS has been utilized as a powerful tool for the discovery of new cancer biomarker

candidates as well as other clinical applications60 61

The utilization of MALDI-IMS for human

or animal brain tissue to identify or map the distribution of molecules related to

neurodegenerative diseases were also recently reported62 63

23

Secretome

There has been an increasing interest in the study of proteins secreted by various cells

(the secretomes) from tissue-proximal fluids or conditioned media as a potential source of

biomarkers Cell secretomes mainly comprise proteins that are secreted or are shed from the cell

surface and these proteins can play important role in both physiological processes (eg cell

signaling communication and migration) and pathological processes including tumor

angiogenesis differentiation invasion and metastasis In particular the study of cancer cell

secretomes by MS based proteomics has offered new opportunities for cancer biomarker

discovery as tumor proteins may be secreted or shed into the bloodstream and could be used as

noninvasive biomarkers The latest advances and challenges of sample preparation sample

concentration and separation techniques used specifically for secretome analysis and its clinical

applications in the discovery of disease specific biomarkers have been comprehensively

reviewed64 65

Here we only highlight the proteomic profiling of neural cells secretome that has

been applied to neurosciences for a better understanding of the roles secreted proteins play in

response to brain injury and neurological diseases The LC-MS shotgun identification of

proteins released by astrocytes has been recently reported66-68

In these studies the changes

observed in the astrocyte secretomes induced by inflammatory cytokines or cholinergic

stimulation were investigated6667

Alternatively our group performed 2D-LC separation and

included cytoplasmic protein extract from astrocytes as a control to identify cytoplasmic protein

contaminants which are not actively secreted from cells68

Sample Preparation

24

Proteomic analysis and biomarker discovery research in biological samples such as body

fluids tissues and cells are often hampered by the vast complexity and large dynamic range of

the proteins Because disease identifying biomarkers are more likely to be low-abundance

proteins it is imperative to remove the high-abundance proteins or apply enrichment techniques

to allow detection and better coverage of the low-abundance proteins for MS analysis Several

strategies including depletion and protein equalizer approach have been used during sample

preparation to reduce sample complexity69 70

and the latest advances of these methods have been

reviewed by Selvaraju et al 71

Alternatively the complexity of biological samples can be

reduced by capturing a specific subproteome that may have the biological information of interest

The latter strategy is especially useful in the biomarker discovery where the changes in the

proteome are not solely reflected through the concentration level of specific proteins but also

through changes in the post-translational modifications (PTMs) Here we will mainly discuss

the enrichment of phosphoproteinpeptides glycoproteinpeptides and sample preparation for

peptidomics and membrane proteins

Phosphoproteomics

Phosphorylation can act as a molecular switch on a protein by turning it on or off within

the cell It is thought that up to 30 of the proteins can be phosphorylated72

and it plays

significant roles in such biological processes as the cell cycle and signal transduction73

Currently tens of thousands of phosphorylation sites can be proposed using analytical methods

available today74 75

The amino acids that are targeted for phosphorylation studies are serine

threonine and tyrosine with the abundance of detection decreasing typically in that order Other

25

amino acids have been reported to be phosphorylated but traditional phosphoproteomics

experiments ignore these rare events76

In a typical large-scale phosphoproteomics experiment the sample size is usually in

milligram amounts to account for the low stoichiometry of phosphorylated proteins The large

amount of protein is then digested typically with trypsin but alternatively experiments have

been performed with Lys-C digestion to produce large enzymatic peptides The larger peptides

produced from Lys-C render higher charged peptides during electrospray ionization (ESI) and

allow improved electron-based fragmentation to determine specific sites of phosphorylation77

From the pool of peptides phosphopeptides must be enriched otherwise they will be masked by

the vast number and higher ionization efficiency of non-phosphorylated peptides The two most

common enrichment techniques are immobilized metal ion affinity chromatography (IMAC) and

metal oxide affinity chromatography (MOAC) TiO2 being the most common oxide used for this

purpose A recent study reported that phosphorylation of neuronal intermediate filament proteins

in neurofibrillary tangles are involved in Alzheimerrsquos disease78

Glycoproteomics

Protein glycosylation is one of the most common and complicated forms of PTM Types

of protein glycosylation in eukaryotes are categorized as either N-linked where glycans are

attached to asparagine residues in a consensus sequence N-X-ST (X can be any amino acid

except proline) via an N-acetylglucosamine (N-GlcNAc) residue or the O-glycosylation where

the glycans are attached to serine or threonine Glycosylation plays a fundamental role in

numerous biological processes and aberrant alterations in protein glycosylation are associated

with neurodegenerative disease states such as Creutzfeld-Jakob Disease (CJD) and AD79 80

26

Due to the low abundance of glycosylated forms of proteins compared to non-glycosylated

proteins it is essential to enrich glycoproteins or glycopeptides in complex biological samples

prior to MS analysis Two of the most common enrichment methods used in glycoproteomics are

lectin affinity chromatography (LAC) and hydrazide chemistry The detailed methodologies of

LAC hydrazide chemistry and other enrichment methods in glycoproteomics have been

extensively reviewed in the past81 82

In particular LAC is of great interest in studies of

glycosylation alterations as markers of AD and other neurodegenerative diseases due to its recent

applications in brain glycoproteomics83

Our group has utilized multi-lectin affinity

chromatography containing concanavalin A (ConA) and wheat germ agglutinin (WGA) to enrich

N-linked glycoproteins in control and prion-infected mouse plasma84

This method enabled us to

identify a low-abundance glycoprotein serum amyloid P-component (SAP) PNGase F digestion

and Western blotting validation confirmed that the glycosylated form of SAP was significantly

elevated in mice with early prion infection and it could be potentially used as a diagnostic

biomarker for prion diseases

Membrane proteins

Membrane proteins play an indispensable role in maintaining cellular integrity of their

structure and perform many important functions including signaling transduction intercellular

communication vesicle trafficking ion transport and protein translocationintegration85

However due to being relatively insoluble in water and low abundance it is challenging to

analyze membrane proteins by traditional MS-based proteomics approaches Numerous efforts

have been made to improve the solubility and enrichment of membrane proteins during sample

preparation Several comprehensive studies recently covered the commonly used technologies in

27

membrane proteomics and different strategies that circumvent technical issues specific to the

membrane 86-90

Recently Sun et al reported using 1-butyl-3-methyl imidazolium

tetrafluoroborate (BMIM BF4) an ionic liquid (IL) as a sample preparation buffer for the

analysis of integral membrane proteins (IMPs) by microcolumn reversed phase liquid

chromatography (μRPLC)-electrospray ionization tandem mass spectrometry (ESI-MSMS)

The authors compared BMIM BF4 to the other commonly used solvents such as sodium dodecyl

sulfate methanol Rapigest and urea but they found that the number of identified IMPs from rat

brain extracted by ILs was significantly increased The improved identifications could be due to

the fact that BMIM BF4 has higher thermal stability and thus offered higher solubilizing ability

for IMPs which provided better compatibility for tryptic digestion than traditionally used solvent

systems38

In addition to characterization of membrane proteome the investigation of PTMs on

membrane proteins is equally important for characterization of disease markers and drug

treatment targets Phosphorylations and glycosylations are the two most important PTMs for

membrane proteins In many membrane protein receptors the cytoplasmic domains can be

phosphorylated reversibly and function as signal transducers whereas the receptor activities of

the extracellular domains are mediated via N-linked glycosylation Wiśniewski et al provides an

informative summary on recent advances in proteomic technology for the identification and

characterization of these modifications91

Our group has pioneered the development of detergent

assisted lectin affinity chromatography (DALAC) for the enrichment of hydrophobic

glycoproteins using mouse brain extract92

We compared the binding efficiency of lectin affinity

chromatography in the presence of four commonly used detergents and determined that under

certain concentrations detergents can minimize the nonspecific bindings and facilitate the

elution of hydrophobic glycoproteins In summary NP-40 was suggested as the most suitable

28

detergent for DALAC due to the higher membrane protein recovery glycoprotein recovery and

membranous glycoprotein identifications compared to other detergents tested In a different

study on mouse brain membrane proteome Zhang et al reported an optimized protocol using

electrostatic repulsion hydrophilic interaction chromatography (ERLIC) for the simultaneous

enrichment of glyco- and phosphopeptides from mouse brain membrane protein preparation93

Using this protocol they successfully identified 544 unique glycoproteins and 922 glycosylation

sites which were significantly higher than those using the hydrazide chemistry method

Additionally a total of 383 phosphoproteins and 915 phosphorylation sites were identified

suggesting that the ERLIC separation has the potential for simultaneous analysis of both glyco-

and phosphoproteomes

Peptidomics

Peptidomics can be loosely defined as the study of the low molecular weight fraction of

proteins encompassing biologically active endogenous peptides protein fragments from

endogenous protein degradation products or other small proteins such as cytokines and signaling

peptides Studies can involve endogenous peptides94

peptidomic profiling33

and de novo

sequencing of peptides95 96

Neuropeptidomics focuses on biologically active short segments of

peptides and have been investigated in numerous species including Rattus97 98

Mus musculus99

100 Bovine taurus

101 Japanese quail diencephalon

102 and invertebrates

103-106 The isolation of

peptides is typically performed through molecular weight cut-offs from either biofluids such as

CSF plasma or tissue extracts If the protein and peptide content is high such as for tissue or cell

lysates protein precipitation can be done via high organic solvents and the resulting supernatant

can be analyzed for extracted peptides where extraction solvent and conditions could have a

29

significant effect on what endogenous peptides are extracted from tissue107

A comparative

peptidomic study of human cell lines highlights the utility of finding peptide signatures as

potential biomarkers108

A thorough review of endogenous peptides and neuropeptides is beyond

the scope of this review and an excellent review on this topic is available elsewhere109

Fractionation and Separation

The mass spectrometer has a limited duty cycle and data dependent analysis can only

scan a limited number of mz peaks at any given time In addition significant ion suppression

can occur if there is a difference in concentration between co-eluting peptides or if too many

peptides co-elute Therefore one of the biggest challenges in biomarker discovery is the

complexity of the sample and the presence of high-abundance proteins in body fluids such as

CSF serum and plasma In addition to the removal of the most abundant proteins by

immunodepletion the reduction of the complexity of the sample by further fractionation is

indispensable to facilitate the characterization of unidentified biomarkers from the low

abundance proteins Traditionally used techniques for complex protein analysis include gel

based fractionation methods such as two-dimensional gel electrophoresis (2D-GE) and its

variation two-dimensional differential gel electrophoresis (2D-DIGE) or non-gel based such as

one- or multidimensional liquid chromatography (LC) and microscale separation techniques

such as capillary electrophoresis (CE)

2D-GE MS has been widely used as a powerful tool to separate proteins and identify

differentially expressed proteins ever since 2D gels were coupled to mass spectrometry In 2D-

GE MS thousands of proteins can be separated on a single gel according to pI and molecular

weight Individual protein spots that show differences in abundance between different samples

30

can then be excised from the gel digested into peptides and analyzed by MALDI MS or by

liquid chromatography tandem mass spectrometry (LC-MSMS) for protein identification The

introduction of 2D-DIGE adds a quantitative strategy to gel electrophoresis by enabling multiple

protein extracts to be separated on the same 2D gel thus providing comparative analysis of

proteomes in complex samples In 2D-DIGE protein extracts from two different conditions and

an internal standard can be labeled with fluorescent dyes for example Cy3 Cy5 and Cy2

respectively prior to two-dimensional gel electrophoresis Compared to traditional 2D-GE 2D-

DIGE provides the clear advantage of overcoming the inter-gel variation problem 110

Proteomic

profiling of CSF by 2D-GE and 2D-DIGE has led to the identification of putative biomarkers in

multiple neurological disorders For example Brechlin et al reported an optimized 2-DIGE

protocol profiled CSF from 36 CJD patients The applicability of their approach was proven by

the detection of known CJD biomarkers such as 14-3-3 protein neuron-specific enolase lactate

dehydrogenase and other proteins that are potentially relevant to CJD 111

In another study to

identify novel CSF biomarkers for multiple sclerosis CSF from 112 multiple sclerosis patients

and control individuals were analyzed by 2D-GE MS for comparative proteomics Ten potential

multiple sclerosis biomarkers were selected for validation by immunoassay 112

These

methodologies sample preparation techniques and applications of 2D-DIGE in

neuroproteomics were reviewed by Diez et al113

Although 2D gel provides excellent resolving

power and capability to visualize abundance changes there are some limitations to the method

For example gel based separation is not suitable for low abundance proteins extremely basic or

acidic proteins very small or large proteins and hydrophobic proteins114 115

Complementary to gel-based approaches shotgun proteomics coupled to LC have

become increasingly popular in proteomic research because they are reproducible highly

31

automated and capable of detecting low abundance proteins Furthermore another advantage of

LC-MS shotgun proteomics is the suitability for isotope labeling for protein quantification which

is reviewed in a later section In shotgun proteomics a protein mixture is digested and resulting

peptides are separated by LC prior to tandem MS fragmentation to identify different proteins by

peptide sequencing The most common separation for shotgun proteomics peptidomics or top-

down proteomics experiments use low-pH reversed phase (RP) C18 C8 or C4 columns RPLC

is well established which provides high resolution desalts the sample which can interfere with

ionization and the mobile phase is compatible with ESI Nanoscale C18 columns allow for

separation and introduction of sub microgram samples If larger amounts of sample are

available two dimensional separations are usually preferred to greatly enhance the coverage of

the investigated proteome which will be discussed in depth later It is preferable to have an

orthogonal separation method and since RP separates via hydrophobicity strong cation exchange

(SCX) was the original choice due to its separation by charge MudPIT (multidimensional

protein identification technology) usually refers to the use of SCX as the first phase of separation

and is a well-established platform116

SCX has the advantage over RP separation technologies to

effectively remove interfering detergents from the sample SCX separation is not based solely

off charge and hydrophobicity contributes to elution therefore a small amount of organic

modifier usually 10-15 is added to lessen the hydrophobicity effects117

The addition of

organic modifiers needs to be minimized otherwise binding to the C18 trap cartridge or C18

column will be reduced if performed on-line SCX can be used for PTMs and offers specific

applications for proteomic studies and an excellent current review is offered on this subject

elsewhere118

An alternative MudPIT separation scheme employing high pH RPLC as the first

phase of separation and low pH RPLC in the second dimension (RP-RP) has been successfully

32

applied to the proteomic analysis of complex biological samples119 120

The advantage of using

RP as the first dimension is the higher resolution for separation and better compatibility with

down-stream MS detection by eliminating salt Song et al reported a phosphoproteome analysis

based on this 2D RP-RP coupling scheme121

Hydrophilic interaction chromatography (HILIC) employs distinct separation modality

where the retention of peptides is increased with increasing polarity122

The loading of sample is

done by high organic and eluted by increasing the percentage of the aqueous phase or polarity of

the mobile phase opposite from RPLC thus establishing orthogonality of the two separation

modes123

HILIC has quickly become a very useful method and is actively used for proteomic

experiments124

for increased sensitivity125

phosphoproteomics126

glycoproteins127

and

quantification studies128

An alternative and modification to HILIC is ERLIC which adds an

additional mode of separation by electrostatic attraction An earlier study using ERLIC

demonstrated the ability to separate phosphopeptides from non-phosphorylated peptides at

pH=2129

A recent study looking into changes in the phosphoproteome of Marekrsquos Disease

applied ERLIC to chicken embryonic fibroblast lysate identifying only 13 phosphopeptides

out of all the identified peptides Due to the lack of isolation of phosphopeptides from ERLIC

the investigators performed immobilized metal affinity chromatography (IMAC) enrichment on

the fractions increasing identification of phosphopeptides over 50 fold130

A comparative study

of ERLIC to HILIC and SCX following TiO2 phospho-enrichment reported that

SCXgtERLICgtHILIC for phosphopeptide identifications126

Recent developments in instrumentation to combine LC with ion mobility spectrometry

(IMS) and MS (LC-IMS-MS) offered more advantages than conventional LC due to the rapid

high-resolution separations of analytes based on their charge mass and shape as reflected by

33

mobility in a given buffer gas The mobility of an ion in a buffer gas is determined by the ionrsquos

charge and its collision cross-section with the buffer gas The methodologies of IMS separations

and the application of LC-IMS-MS for the proteomics analysis of complex systems including

human plasma have been reviewed by Clemmerrsquos group131-133

They proposed a method that

employs intrinsic amino acid size parameters to obtain ion mobility predictions which can be

used to rank candidate peptide ion assignments and significantly improve peptide identification

134

Although 2D gel and LC are routinely used as separation techniques in MS-based

proteomics capillary electrophoresis (CE) has received increasing attention as a promising

alternative due to the fast and high-resolution separation it offers CE has a wide variety of

operation modes among which capillary zone electrophoresis (CZE) and capillary isoelectric

focusing (CIEF) have the greatest potential applications in MS-based proteomics thus will be

highlighted here CZE separates analytes by their charge-to-size ratios in buffers under a high

electrical field and is often used as the final dimension prior to MS analysis while the separation

feature of CIEF is based on isoelectric point and this technique is more suitable to be used as the

first dimension separation Detailed description of different CEndashMS interfaces sample

preconcentration and capillary coating to minimize analyte adsorption could be found in several

reviews135-141

CE technique is complementary to conventional LC in that it is suitable for the

analysis of polar and chargeable compounds Dovichirsquos group conducted proteomic analysis of

the secreted protein fraction of Mycobacterium marinum which has intermediate protein

complexity142

The tryptic digests were either analyzed by UPLC-ESI-MSMS in triplicates or

prefactionated by RPLC followed by CZE-ESI-MSMS It was demonstrated that the two

methods identified similar numbers of peptides and proteins within similar analysis times

34

However CZE-ESI-MSMS analysis of the prefractionated sample tended to identify more

peptides that are basic and have lower mz values than those identified by UPLC-ESI-MSMS

This analysis also presented the largest number of protein identifications by using CE-MSMS

suggesting the effectiveness of prefractionation of complex samples by LC method prior to CZE-

ESI-MSMS The use of CIEF as the first dimension of separation provides both sample

concentration and excellent resolving power The combination of CIEF and RPLC separation

has been applied to the proteomic analyses where the amount of protein sample is limited and

cannot meet the requirement of minimal load amount for 2D LC-MSMS143 144

So far CE-MS

has been widely applied to the proteomic analysis of various biological samples such as urine145

146 CSF

147 blood

148 frozen tissues

149 and the formalin-fixed and paraffin-embedded (FFPE)

tissue samples150

The recent CEndashMS applications to clinical proteomics have been summarized

in several reviews135 151 152

Protein Quantification

In 2D gel electrophoresis the quantitative analysis of protein mixtures is performed on

the gel by comparing the intensity of the protein stain The development of 2D-DIGE eliminated

the gel-to-gel variation and greatly improved the quantitative capability and reliability of 2D gel

methodology110

However the accuracy of 2D gel based protein quantification suffers from the

limitations that a seemingly single gel spot often contains multiple proteins and the difficulty of

detecting proteins with extreme molecular weights and pI values as well as highly hydrophobic

proteins such as membrane proteins Therefore non-gel based shotgun proteomics technology is

more suitable for accurate and large-scale protein identification and quantification in complex

samples Briefly the quantification in non-gel based shotgun proteomics can be categorized into

35

two major approaches stable isotope labeling-based and label-free methods The common

strategies for quantitative proteomic analysis are reviewed and summarized in Table 1

Isotope labeling methods

Because stable isotope-labeled peptides have the same chemical properties as their

unlabeled counterparts the two peptides within a mixture should exhibit identical behaviors in

MS ionization The mass difference introduced by isotope labeling enables the detection of a

pair of two distinct peptide masses by MS within the mixture and allowing for the measurement

of the relative abundance differences between two peptides Depending on how isotopes are

incorporated into the protein or peptide these labeling methods can be divided into two groups

In vitro chemical derivatization techniques which incorporate a label or tag into the peptide or

protein during sample preparation metabolic labeling techniques which introduce the isotope

label directly into the organism via isotope-enriched nutrients from food or media

1 In vitro derivatization techniques

There are multiple methods to introduce heavy isotopes into proteins or peptides in vitro

The commonly used strategies include 18

O 16

O enzymatic labeling Isotope-Coded Affinity Tag

(ICAT) Tandem Mass Tags (TMTs) and Isobaric Tags for Relative and Absolute Quantification

(iTRAQ) The 18

O labeling method enzymatically cleaves the peptide bond with trypsin in the

presence of 18

O-enriched H2O and introduces 4-Da mass shift in the tryptic peptides153

The

advantages of this method include 18

O-enriched water is extremely stable tryptic peptides will

be labeled with the same mass shift secondary reactions inherent to other chemical labeling can

be avoided Conversely widespread use of 18

O-labeling has been hindered due to the difficulty

of attaining complete 18

O incorporation and the lack of robustness154 155

Currently ICAT

36

TMTs and iTRAQ methods are extensively used in quantitative proteomics In ICAT cysteine

residues are specifically derivatized with a reagent containing either zero or eight deuterium

atoms as well as a biotin group for affinity purification of cysteine-containing peptides156 157

The advantage of ICAT is that the affinity purification via biotin moiety can facilitate the

detection of low-abundance cysteine-containing peptides In addition the mass difference

introduced by labeling increases mass spectral complexity with quantification from the different

precursor masses done by MS and peptide identification being achieved through tandem MS

(MSMS) This added complexity from different peptide masses was addressed by using isobaric

labeling methods such as TMTs and iTRAQ 158 159

where the same peptides in different samples

are isobaric after tagging and appear as single mz in MS scans thus enhancing the peptide limit

of detection and reducing the MS scan complexity Isobaric labeling reagents are composed of a

primary amine reactive group and an isotopic reporter group linked by an isotopic balancer group

for the normalization of the total mass of the tags The reporter group serves for quantification

purpose since it is cleaved during collision-induced dissociation (CID) to yield a characteristic

isotope-encoded fragment Moreover isobaric labeling methods allow the comparison of

multiple samples within a single experiment Recently a 6-plex version of TMTs was

reported160

and iTRAQ enables up to eight samples to be labeled and relatively quantified in a

single experiment161

8-plex iTRAQ reagents have been used for the comparison of complicated

biological samples such as CSF in the studies of neurodegenerative diseases 162

Recently our

group developed a novel N N-dimethyl leucine (DiLeu) 4-plex isobaric tandem mass (MS2)

tagging reagents with high quantitation efficacy DiLeu has the advantage of synthetic simplicity

and greatly reduced synthesis cost compared to TMTs and iTRAQ163

Xiang et al demonstrated

that DiLeu produced comparable iTRAQ ability for protein sequence coverage (~43) and

37

quantitation accuracy (lt15) for tryptically digested proteins More importantly DiLeu

reagents could promote enhanced fragmentation of labeled peptides thus allowing more

confident peptide and protein identifications

2 In Vivo Metabolic Labeling

Metabolic processes can also be employed for the incorporation of stable-isotope labels

into the proteins or organisms by enriching culture media or food with light or heavy versions of

isotope labels (2H

13C

15N) The advantage of in vivo labeling is that metabolic labeling does

not suffer from incomplete labeling which is an inherent drawback for in vitro derivatization

techniques In addition metabolic labeling occurs from the start of the experiment and proteins

with light or heavy labels are simultaneously extracted thus reducing the error and variability of

quantification introduced during sample preparation The most widely used strategy for

metabolic labeling is known as stable-isotope labeling of amino acids in cell culture (SILAC)

which was introduced by Mann and co-workers164 165

In SILAC one cell population is grown

in normal or light media while the other is grown in heavy media enriched with a heavy

isotope-encoded (typically 13

C or 15

N) amino acid such as arginine or leucine Cells from the

two populations are then combined proteins are extracted digested and analyzed by MS The

relative protein expression differences are then determined from the extracted ion

chromatograms from both the light and heavy peptide forms SILAC has been shown to be a

powerful tool for the study of intracellular signal transduction In addition this technique has

recently been applied to the quantitative analysis of phosphotyrosine (pTyr) proteomes to

characterize pTyr-dependent signaling pathways166 167

38

Labe-free quantification

Although various isotope labeling methods have provided powerful tools for quantitative

proteomics several limitations of these approaches are noted Labeling increases the cost and

complexity of sample preparation introduces potential errors during the labeling reaction It also

requires a higher sample concentration and complicates data processing and interpretation In

addition so far only TMTs and iTRAQ allow the comparison of multiple (up to eight) samples

simultaneously The comparison of more than eight samples in a single experiment cannot be

achieved by isotope labeling In order to address these concerns there has been significant

interest in the development of label-free quantitative approaches Current label-free

quantification methods for MS-based proteomics were developed based on the observation that

the chromatographic peak area of a peptide168 169

or frequency of MSMS spectra170

correlating

to the protein or peptide concentration Therefore the two most common label-free

quantification approaches are conducted by comparing (i) area under the curve (AUC) of any

given peptides171 172

or (ii) by frequency measurements of MSMS spectra assigned to a protein

commonly referred to as spectral counting173

Several recent reviews provided detailed and

comprehensive knowledge comparing label-free methods with labeling methods data processing

and commercially available software for label-free quantitative proteomics174-177

Dissociation Techniques

The vast majority of proteomic experiments have proteins or peptides being identified by

two critical pieces of data obtained from the mass spectrometer The first is the precursor ion

identified by its mz which is informative to the mass of the peptide being analyzed The second

is the use of tandem mass spectrometry to fragment or dissociate the precursor ion and record the

39

generated fragment ion pattern to discern the amino acid sequence The three most popular

dissociation or fragmentation techniques for peptides are CID electron-transfer dissociation

(ETD) and high-energy collision dissociation (HCD) A recent study on the human plasma

proteome demonstrated that combined fragmentation techniques enhance coverage by providing

complementary information for identifications CID enabled the greatest number of protein

identifications while HCD identified an additional 25 proteins and ETD contributed an

additional 13 protein identifications178

ETDECD

Electron capture dissociation (ECD) 179

preceded ETD but ECD was developed for use

in a Penning trap for Fourier transform ion cyclotron resonance (FTICR) mass spectrometers

ECD requires the ion of interest to be in contact with near-thermal electrons and for the electron

capture event to occur on the millisecond time scale but the time scale is inadequate for electron

trapping in Paul traps or quadrupoles in the majority of mass spectrometers180

ETD involves a

radical anion like fluoranthene with low electron affinity to be transferred to peptide cation

which results in more uniform cleavage along the peptide backbone The cation accepts an

electron and the newly formed odd-electron protonated peptide undergoes fragmentation by

cleavage of the N-Cα bond which results in fragmentation ions consisting of c- and zbull-type

product ions The uniformed cleavage results in reduced sequence discrimination to labile bonds

such as PTMs and also provides improved sequencing for larger peptides compared to CID181

The realization that larger peptides produced better MSMS quality spectra compared to CID led

to a decision tree analysis strategy where peptide charge states and size determined whether the

precursor peptide would be fragmented with CID or ETD182

One of the main benefits of

ETDECD is the ability to sequence peptides with labile PTMs such as phosphorylation77 183

40

sulfation184

glycosylation185

ubiquitination186

and histone modifications187

ETD also has the

benefit of providing better sequence information on larger neuropeptides when compared to

CID188

However a thorough analysis suggested that CID still yielded more peptideprotein

identifications than ETD in large scale proteoimcs189

HCD

High energy collision dissociation (HCD)190

is an emerging fragmentation technique that

offers improved detection of small reporter ions from iTRAQ-based studies191 192

HCD is

performed at a higher energy in a collision cell instead of an ion trap like CID thus HCD does

not suffer from the low-mass cutoff limitation Furthermore HCD offers enhanced

fragmentation efficiency assisting in MSMS spectra interpretation and protein identification193

A major drawback for HCD is that the spectral acquisition times are up to two-fold longer due to

increased ion requirement for Fourier transform detection in the orbitrap194

HCD has been

reported to increase phosphopeptide identifications over CID74

but in a different study CID was

reported to offer more phosphopeptide identifications over HCD194

Work has also been done to

transfer the decision tree analysis for HCD which basically switches CID with HCD claiming

better quality data determined by higher Mascot scores with more peptide identifications195

MSE

Data dependent acquisition (DDA) is the most commonly used ion selection process in

mass spectrometers for proteomic experiments An alternative process which does not have ion

selection nor switch between MS and MSMS modes is termed MSE MS

E is a data independent

mode and does not require precursor ions of a significant intensity to be selected for MSMS

analysis196

A data independent mode decouples the mass spectrometer choosing which

precursor ions to fragment and when the ions are fragmented MSE works by a low or high

41

energy scan and no ion isolation is occurring The low energy scan is where the precursor ion is

not fragmented and the high energy scan allows fragmentation The resulting mix of precursor

and fragmentation ions is then detected simultaneously197

The data will then need to be

deconvoluted using a proprietary time-aligned algorithm that is discussed elsewhere198

The

continuous data independent acquisition allows multiple MSMS spectra to be collected during

the natural analyte peak broadening observed in chromatography which provides more data

points for AUC label-free quantification In addition lower abundance peptides can be

sequenced as more MSMS spectra are collected throughout the elution of an LC peak allowing

better signal averaging for smaller analyte peak of interest during coelution and reducing

sampling bias in typical DDA experiments where only more abundant peaks can be selected for

fragmentation

A comparison of spiked internal protein standards into a complex protein digest provided

evidence that MSE was comparable to DDA analysis in LC-MS

199 MS

E has been used for label

free proteomics of immunodepleted serum in large scale proteomics samples200

In addition

MSE was performed for the characterization of human cerebellum and primary visual cortex

proteomes Hundreds of proteins were identified including many previously reported in

neurological disorders201

MSE is quickly becoming a versatile data acquisition method recently

used in such studies as cancer cells202

schizophrenia203

and pituitary proteome discovery204

The usefulness of MSE as an unbiased data acquisition method is being assimilated into multiple

proteomics studies including studies involving neurological disorders

Data Analysis

42

One of the major bottlenecks in non-targeted proteomic experiments is how to handle the

enormous amount of data obtained Database searches biostatistical analysis de novo

sequencing PTM validation all have their place and multiple available platforms are available

If the organism being studied has had its genome sequenced databases can be created

with a list of proteins in the FASTA format to be used in database searching There are

numerous database searching algorithms for sequence identification of MSMS data including

Mascot205

Sequest206

Xtandem207

OMSSA208

and PEAKS209

These searching algorithms are

performed by matching MSMS spectra and precursor mass to sequences found within proteins

How well the actual spectra match the theoretical spectra determines a score which is unique to

the searching algorithm and usually can be extrapolated to the probability of a random hit

Recently a database has been developed for PTM analysis by the use of the program SIMS210

Specifically for phosphopeptides Ascorersquos algorithm scans the MSMS data to determine the

likelihood of correct phosphosite identification from the presence of site identifying product

ions211

If the organism that is being analyzed has not had its genome sequenced and no (or very

limited) FASTA database is available a homology search can be performed using SPIDER212

available with PEAKS software Alternatively individual MSMS spectrum can be de novo

sequenced but software is available to perform automated de novo sequencing of numerous

spectra (PEAKS208

DeNovoX and PepSeq)

For large-scale protein identifications the false discovery rate (FDR) must be established

by the searching algorithm and that is accomplished by re-searching the data with a false

database created by reversing or scrambling the amino acid sequence of the original database

used for the protein search Any hits from the false database will contribute to the FDR and this

value can be adjusted usually around 1 An additional layer of confidence in the obtained data

43

can be achieved in shotgun proteomics experiments by removing all the proteins that are

identified by only one peptide

Once a set of confident proteins or peptides have been generated from database

searching bioinformatic analysis or biostatistical analysis is needed Numerous software

packages are available for different purposes FLEXIQuant is an example for absolute

quantitation of isotopically labeled protein or peptides of interest213

FDR analysis of

phosphopeptides or other specific PTMs can be adjusted with such software as Scaffold

providing data consisting only of a specific modification214

Bioinformatic tools such as

Scaffold or ProteoIQ also include gene ontology (GO) analysis which can classify identified

proteins by three categories cellular component molecular function or biological process

Custom bioinformatics programs can also be developed and are often useful in various proteomic

studies including biomarker discovery in neurological diseases215

More detailed review of

bioinformatics in peptidomics216

and proteomics217

can be found elsewhere

Validation of Biomarkers by Targeted Proteomics

The validation of putative biomarkers identified by MS-based proteomic analysis is often

required to provide orthogonal analysis to rule out a false positive by MS and providing

additional evidence for the biomarker candidate(s) from the study for future potential clinical

assays At present antibody-based assays such as Western blotting ELISA and

immunochemistry are the most widely used methods for biomarker validation Although accurate

and well established these methods rely on protein specific antibodies for the measurement of

the putative biomarker and could be difficult for large-scale validation of all or even a subset of a

long list of putative protein biomarkers typically obtained by MS-based comparative proteomic

44

analysis Large scale validation is impractical due to the cost for each antibody the labor to

develop a publishable Western blot or ELISA and the antibody availability for certain proteins

As an alternative strategy quantitative assays based on multiple-reaction monitoring (MRM) MS

using a triple quadrupole mass spectrometer have been employed in biomarker verification

MRM is the most common use of MSMS for absolute quantitation It is a hypothesis

driven experiment where the peptide of interest and its subsequent fragmentation pattern must be

known prior to the quantitative MRM experiments MRM involves selecting a specific mz (first

quadrupole) to be isolated for fragmentation (second quadrupole) followed by one or more of

the most intense fragment ions (third quadrupole) being monitored The ability to quantitate and

thus validate the proteins or peptides as potential biomarkers is achieved by performing MRM on

isotopically labeled reference peptide for targeted peptideprotein of interest The main obstacle

for quantification of peptides is interference and ion suppression effects from co-eluting

substances Since the isotopically labeled and native peptide will co-elute the same interference

and ion suppression will occur for both peptides and thus correcting these interfering effects

Peptides need to be systematically chosen for a highly sensitive and reproducible MRM

experiment to ensure proper validation of putative biomarkers Peptides require certain intrinsic

properties which include an mz within the practical mass detection range for the instrument and

high ionization efficiency If the desired peptide to be quantified is derived from a digestion

then peptides that have detectable incomplete digestion or missed cleavage site can be a major

source of variability Peptides with a methionine and to a lesser extent tryptophan are

traditionally removed from consideration from MRM quantitative experiments due to the

variable nature of the oxidation that can occur In addition if chromatographic separation is

performed the retention behavior of the peptide must be well behaved with little tailing effects

45

eluting late causing broadening of the peak and even irreversible binding to the column As an

example hydrophilic peptides being eluted off a C18 column may exhibit the previously

described concerns and a different chromatographic separation will need to be explored for

improved limits of detection quantitation and validation To determine consistent peptide

detection or usefulness of certain peptides databases such as Proteomics Database218

PRIDE219

PeptideAtlas220

have been developed to compile proteomic data repositories from initial

discovery experiments

After the peptide is selected for analysis the proper MRM transitions need to be selected

to optimize the sensitivity and selectivity of the experiment It is common for investigators to

select two or three of the most intense transitions for the proposed experiment It is imperative

that the same instrument is used for the determination of transition ions as different mass

spectrometers may have a bias towards different fragment ions

MRM experiments are still highly popular experiments for hypothesis directed

experiments221

biomarker analysis222

and validation223

Validation of putative biomarkers is

increasingly becoming a necessary step when performing large scale non-hypothesis driven

proteomics experiments The traditional validation techniques of ELISA Western blotting and

immunohistochemistry are still used but MRM experiments are becoming an attractive

alternative for validation of putative biomarkers due to its enhanced throughput and specificity

Current work is still being performed to both expand the linear dynamic range224

and

sensitivity225

of MRM A recent endeavor to increase the sensitivity for MRM experiments was

accomplished by ldquoPulsed MRMrdquo via the use of an ion funnel trap to enhance confinement and

accumulation of ions The authors claimed an increase by 5-fold for peak amplitude and a 2-3

fold reduction in chemical background225

46

Remaining Challenges and Emerging Technologies

Large sample numbers for mass spectrometry analysis

Multiple conventional studies in proteomics have been performed on a single or a few

biological samples As bio-variability can be exceedingly high the need for larger sample sizes

is currently being investigated Prentice et al used a starting point of 3200 patient samples

from the Womenrsquos Health Institute (WHI) to probe the plasma proteome using MS for

biomarkers The study did not test the 3200 patient samples by MS because even a simple one

hour one dimensional RP analysis on a mass spectrometer would take months of instrument time

for uninterrupted analysis Instead the authors pooled 100 samples together to bring the total

number of pooled samples to 32 To provide relevant plasma biomarkers the samples were then

subjected to immunodepletion 2-D protein separation (96 fractions total) and then 1-D RPLC of

tryptic peptide separation on-line interface to a mass spectrometer The large sample cohorts

help address bio-variability that can be a concern from small sample size proteomic experiments

and provide ample sample amounts to investigate the low abundance proteins226

Hemoglobin-derived neuropeptides and non-classical neuropeptides

Neuropeptides such as neuropeptide Y and enkephalin are short chains of amino acids

that are secreted from a range of neuronal cells that signal nearby cells In contrast non-classical

neuropeptides are termed as neuropeptides or ldquomicroproteinsrdquo which are derived from

intracellular protein fragments and synthesized from the cytosol227

MS was recently used to

determine that hemopressins which are hemoglobin-derived peptides are upregulated in Cpefatfat

mice brains Gelman et al designed an MS experiment to compare hemoglobin-derived

47

peptides comparing the brain blood and heart peptidome in mice The authors provided data

that specific hemoglobin peptides were produced in the brain and were not produced in the

blood Certain alpha and beta hemoglobin peptides were also up regulated in the brain for

Cpefatfat

mice and bind to CB1 cannabinoid receptors228

As discussed earlier in the review

peptidomics and specifically neuropeptidomics are popular fields of study utilizing MS and non-

classical neuropeptides is an exciting emerging area of research that could further expand the

diversity of cell-cell signaling molecules

Ultrasensitive mass spectrometry for single cell analysis

In addition to large scale analysis MS-based proteomics and peptidomics are making

progress into ultrasensitive single cell analysis The most successful MS-based techniques for

single cell analysis was performed with MALDI and studies that have been performed on

relatively large neurons are reviewed elsewhere229

The ultrasensitive MS analysis is currently

directed towards single cell analysis of smaller cells including cancer cells The first challenge

in single cell analysis is the isolation and further sample preparation to yield relevant data

Collection and isolation of a cell type can be accomplished using antibodies for fluorescence

activated cell sorting (FACS) and immune magnetic separation FACS works by flow cytometry

sorting cells by a laser that excites a fluorescent tag that is attached to an antibody Immune

magnetic separation allows separation by antibodies with magnetic properties such as

Dynabeads230

One exciting study combining FACS and MS termed mass cytometry This

technology works by infusing a droplet into an inductively coupled plasma mass spectrometer

(ICP-MS) containing a single cell bound to antibodies chelated to transition elements allowing a

quantifying response between single cells231

Clearly the future of single cell analysis for

48

biomarker analysis and proteomics is encouraging and has the potential to be an emerging field

in MS-based proteomics and peptidomics

Laserspray ionization (LSI)

Laserspray ionization (LSI) is an exciting new method to produce multiply charged mass

spectra from MALDI that is nearly identical to ESI232-234

Recently it has been reported that LSI

can be performed in lieu of matrix to produce a total solvent-free analysis234

The benefits of

being able to generate multiply charged peptides without any solvent may offer advantages

including MS analysis of insoluble membrane proteins or hydrophobic peptides avoidance of

chemical reactions while in solvents a reduction of sample loss due to liquid sample preparation

and ability to avoid diffusion effects from tissue imaging studies234

The multiply charged peptide and protein ions produced by LSI expand the mass range

for tissue imaging analysis More importantly the multiply-charged peptide ions are amenable

for electron-based fragmentation methods such as ETD or ECD which can be employed in

conjunction with tissue imaging experiments to yield in situ sequencing and identification of

peptides of interest235

Paper spray ionization

Paper spray (PS) is an ambient ionization method which was first reported using

chromatography paper allowing detection of metabolites from dried blood spots The original

method used a cut out piece of paper with a voltage clipped on the back while applying 10 microL of

methanolH2O236

Improvements have been made to this technology to enhance analysis

efficiency with a new solvent 91 dichloromethaneisopropanol (vv) and the use of silica paper

49

over chromatography paper237

Interesting applications or modifications have been made to PS

including direct analysis of biological tissue238

and leaf spray for direct analysis of plant

materials239

but both detect metabolites instead of proteins or peptides Paper spray ionization

was previously shown to enable detection of cytochrome c and bradykinin [2-9] standards in a

proof of principle study240

Clearly the utility of PS analysis in proteomics and peptidomics is

yet to be explored

niECD

New fragmentation techniques have been investigated for their utility in proteomics and

peptidomics including a recently reported negative-ion electron capture dissociation (niECD)

Acidic peptides which usually contain PTMs such as phosphorylation or sulfonation are often

difficult to be detected as multiply charged peptides in the positive ion mode As discussed

earlier multiply charged peptides are required for ECDETD fragmentation The fragmentation

of niECD is accomplished by a multiply negatively charged peptide adding an electron The

resulting fragmentation of multiply sulfated and phosphorylated peptide and protein standards

showed no sulfate loss and preserved phosphorylation site The resulting fragmentation pattern

from niECD was also improved in the peptide anions and provides a new strategy for de novo

sequencing with PTM localization241

Conclusions and Perspectives

Proteomics methodologies have produced large datasets of proteins involved in various

biological and disease progression processes Numerous mass spectrometry-based proteomics

and peptidomics tools have been developed and are continuously being improved in both

50

chromatographic or electrophoretic separation and MS hardware and software However several

important issues that remain to be addressed rely on further technical advances in proteomics

analysis When large proteomes consisting of thousands of proteins are analyzed and quantified

dynamic range is still limited with more abundant proteins being preferentially detected

Development and optimization of chemical tagging reagents that target specific protein classes

maybe necessary to help enrich important signaling proteins and assess cellular and molecular

heterogeneity of the proteome and peptidome Furthermore a significant bottleneck in

usefulness of proteomics research is the ability to validate the results and provide clear

significant biological relevance to the results The idea of P4 medicine242 243

is an attractive

concept where the four Prsquos stand for predictive preventive personalized and participatory

Proteomics is one of the critical ldquoomicsrdquo fields and has led to the development of enabling

innovative strategies to P4 medicine244

A goal of P4 medicine is to assess both early disease

detection and disease progression in a person A simplified example of how proteomics fits into

P4 medicine is that certain brain-specific proteins could be used for diagnosis with

presymptomatic prion disease244

The concept of proteomic experiments providing an individual

biomarker is becoming more obsolete with the revised vision being a biomolecular barcode that

could potentially be ldquoscannedrdquo or be a fingerprint for a specific disease or early onset to that

disease being closer to reality An excellent review on what biomarker analysis can do for true

patients is available245

Proteomics can also generate new hypothesis that can be tested by classical biochemical

approaches If a disease has an unknown pathogenesis proteomics is a good starting point to try

to assemble putative markers that can lead to further hypothesis for evaluation If a particular

protein or PTM is associated with a disease state either qualitatively or quantitatively potential

51

treatments could target that protein of interest or investigators could monitor that protein or

PTM during potential treatments of the disease Proteomics has expanded greatly over the last

few decades with the goal of providing revealing insights to some of the most complex

biological problems currently facing the scientific community

Acknowledgements

Preparation of this manuscript was supported in part by the University of Wisconsin Graduate

School Wisconsin Alzheimerrsquos Disease Research Center Pilot Grant and a Department of

Defense Pilot Award LL acknowledges an H I Romnes Faculty Fellowship

52

Figure 1 A summary of general workflows of biomarker discovery pipeline by MS-based

proteomic approaches

Biological sample (CSF blood urine saliva cell

lysate tissue homogenates secreted proteins etc)

Protein extraction Sample pretreatment

2D-GE2D-DIGE MS 1D or 2D LC-MSMS

MALDI-IMS

Identification of

differentially

expressed proteins

Protein identification

Potential biomarkers

Biomarker validation

- Antibody based immunoassays

- MRM

Quantitative analysis

- Isotope labeling

- Label free

Identification and

localization of

differentially expressed

biomolecules

Intact tissue

Sample preparation Slice frozen tissues

thaw-mounted on plate

Apply Matrix

53

Figure 2 Tissue expression and dynamic range of the human plasma proteome A pie chart

representing the tissue of origin for the high abundance proteins shows that the majority of

proteins come from the liver (A) Conversely the lower abundance plasma proteins have a much

more diverse tissue of origin (B and C) The large dynamic range of plasma proteins is presented

and the proteins can be grouped into three categories (classical plasma proteins tissue leakage

products interleukinscytokines) (D) Adapted from Zhang et al11

and Schiess et al246

with

permission

54

55

Table 1 A summary of the common strategies applied to MS-based quantitative proteomic

analysis

Gel based Stable isotope labeling Label free

2D-GE

2D-DIGE 110

In vitro derivatization

18O

16O

153

ICAT 156

TMT 159

iTRAQ 158

Formaldehyde 247

ICPL 248

In vivo metabolic labeling

14N

15N

249

SILAC 164

AUC measurement 169 172

Spectral counting 173

AUC Area Under Curve ICAT Isotope-Coded Affinity Tag TMT Tandem Mass Tags iTRAQ Isobaric Tags for

Relative and Absolute Quantification ICPL Isotope Coded Protein Labeling SILAC Stable Isotope Labeling by

Amino Acids in Cell Culture Isobaric Tags for Relative and Absolute Quantification (iTRAQ)

56

References

1 Liu T Qian W J Mottaz H M Gritsenko M A Norbeck A D Moore R J

Purvine S O Camp D G 2nd Smith R D Evaluation of multiprotein

immunoaffinity subtraction for plasma proteomics and candidate biomarker discovery

using mass spectrometry Mol Cell Proteomics 2006 5 (11) 2167-74

2 Holten-Andersen M N Murphy G Nielsen H J Pedersen A N Christensen I J

Hoyer-Hansen G Brunner N Stephens R W Quantitation of TIMP-1 in plasma of healthy

blood donors and patients with advanced cancer Br J Cancer 1999 80 (3-4) 495-503

3 Tammen H Schulte I Hess R Menzel C Kellmann M Mohring T Schulz-

Knappe P Peptidomic analysis of human blood specimens comparison between plasma

specimens and serum by differential peptide display Proteomics 2005 5 (13) 3414-22

4 Rai A J Gelfand C A Haywood B C Warunek D J Yi J Schuchard M D

Mehigh R J Cockrill S L Scott G B Tammen H Schulz-Knappe P Speicher D W

Vitzthum F Haab B B Siest G Chan D W HUPO Plasma Proteome Project specimen

collection and handling towards the standardization of parameters for plasma proteome samples

Proteomics 2005 5 (13) 3262-77

5 Lippi G Guidi G C Mattiuzzi C Plebani M Preanalytical variability the dark side

of the moon in laboratory testing Clin Chem Lab Med 2006 44 (4) 358-65

6 Holten-Andersen M N Schrohl A S Brunner N Nielsen H J Hogdall C K

Hogdall E V Evaluation of sample handling in relation to levels of tissue inhibitor of

metalloproteinases-1 measured in blood by immunoassay Int J Biol Markers 2003 18 (3) 170-

6

7 Ytting H Christensen I J Thiel S Jensenius J C Svendsen M N Nielsen L

Lottenburger T Nielsen H J Biological variation in circulating levels of mannan-binding

lectin (MBL) and MBL-associated serine protease-2 and the influence of age gender and

physical exercise Scand J Immunol 2007 66 (4) 458-64

8 Hanash S Building a foundation for the human proteome the role of the Human

Proteome Organization J Proteome Res 2004 3 (2) 197-9

9 Omenn G S States D J Adamski M Blackwell T W Menon R Hermjakob H

Apweiler R Haab B B Simpson R J Eddes J S Kapp E A Moritz R L Chan D W

Rai A J Admon A Aebersold R Eng J Hancock W S Hefta S A Meyer H Paik Y

K Yoo J S Ping P Pounds J Adkins J Qian X Wang R Wasinger V Wu C Y

Zhao X Zeng R Archakov A Tsugita A Beer I Pandey A Pisano M Andrews P

Tammen H Speicher D W Hanash S M Overview of the HUPO Plasma Proteome Project

results from the pilot phase with 35 collaborating laboratories and multiple analytical groups

generating a core dataset of 3020 proteins and a publicly-available database Proteomics 2005 5

(13) 3226-45

10 Liu T Qian W J Gritsenko M A Xiao W Moldawer L L Kaushal A Monroe

M E Varnum S M Moore R J Purvine S O Maier R V Davis R W Tompkins R

G Camp D G 2nd Smith R D High dynamic range characterization of the trauma patient

plasma proteome Mol Cell Proteomics 2006 5 (10) 1899-913

11 Zhang Q Faca V Hanash S Mining the plasma proteome for disease applications

across seven logs of protein abundance J Proteome Res 2011 10 (1) 46-50

12 Matsubara J Honda K Ono M Sekine S Tanaka Y Kobayashi M Jung G

Sakuma T Nakamori S Sata N Nagai H Ioka T Okusaka T Kosuge T Tsuchida A

57

Shimahara M Yasunami Y Chiba T Yamada T Identification of adipophilin as a potential

plasma biomarker for colorectal cancer using label-free quantitative mass spectrometry and

protein microarray Cancer Epidemiol Biomarkers Prev 2011 20 (10) 2195-203

13 Murakoshi Y Honda K Sasazuki S Ono M Negishi A Matsubara J Sakuma

T Kuwabara H Nakamori S Sata N Nagai H Ioka T Okusaka T Kosuge T

Shimahara M Yasunami Y Ino Y Tsuchida A Aoki T Tsugane S Yamada T Plasma

biomarker discovery and validation for colorectal cancer by quantitative shotgun mass

spectrometry and protein microarray Cancer Sci 2011 102 (3) 630-8

14 Addona T A Shi X Keshishian H Mani D R Burgess M Gillette M A

Clauser K R Shen D Lewis G D Farrell L A Fifer M A Sabatine M S Gerszten R

E Carr S A A pipeline that integrates the discovery and verification of plasma protein

biomarkers reveals candidate markers for cardiovascular disease Nat Biotechnol 2011 29 (7)

635-43

15 Acosta-Martin A E Panchaud A Chwastyniak M Dupont A Juthier F Gautier

C Jude B Amouyel P Goodlett D R Pinet F Quantitative mass spectrometry analysis

using PAcIFIC for the identification of plasma diagnostic biomarkers for abdominal aortic

aneurysm PLoS One 2011 6 (12) e28698

16 Ray S Britschgi M Herbert C Takeda-Uchimura Y Boxer A Blennow K

Friedman L F Galasko D R Jutel M Karydas A Kaye J A Leszek J Miller B L

Minthon L Quinn J F Rabinovici G D Robinson W H Sabbagh M N So Y T

Sparks D L Tabaton M Tinklenberg J Yesavage J A Tibshirani R Wyss-Coray T

Classification and prediction of clinical Alzheimers diagnosis based on plasma signaling

proteins Nat Med 2007 13 (11) 1359-62

17 Lopez M F Kuppusamy R Sarracino D A Prakash A Athanas M Krastins B

Rezai T Sutton J N Peterman S Nicolaides K Mass spectrometric discovery and selective

reaction monitoring (SRM) of putative protein biomarker candidates in first trimester Trisomy 21

maternal serum J Proteome Res 2011 10 (1) 133-42

18 Sheta E A Appel S H Goldknopf I L 2D gel blood serum biomarkers reveal

differential clinical proteomics of the neurodegenerative diseases Expert Rev Proteomics 2006

3 (1) 45-62

19 Zhang J Goodlett D R Montine T J Proteomic biomarker discovery in

cerebrospinal fluid for neurodegenerative diseases J Alzheimers Dis 2005 8 (4) 377-86

20 Yuan X Desiderio D M Proteomics analysis of prefractionated human lumbar

cerebrospinal fluid Proteomics 2005 5 (2) 541-50

21 Wenner B R Lovell M A Lynn B C Proteomic analysis of human ventricular

cerebrospinal fluid from neurologically normal elderly subjects using two-dimensional LC-

MSMS J Proteome Res 2004 3 (1) 97-103

22 Maccarrone G Milfay D Birg I Rosenhagen M Holsboer F Grimm R Bailey

J Zolotarjova N Turck C W Mining the human cerebrospinal fluid proteome by

immunodepletion and shotgun mass spectrometry Electrophoresis 2004 25 (14) 2402-12

23 Yuan X Desiderio D M Proteomics analysis of human cerebrospinal fluid J

Chromatogr B Analyt Technol Biomed Life Sci 2005 815 (1-2) 179-89

24 Wong M Schlaggar B L Buller R S Storch G A Landt M Cerebrospinal fluid

protein concentration in pediatric patients defining clinically relevant reference values Arch

Pediatr Adolesc Med 2000 154 (8) 827-31

58

25 Roche S Gabelle A Lehmann S Clinical proteomics of the cerebrospinal fluid

Towards the discovery of new biomarkers Proteomics Clin Appl 2008 2 (3) 428-36

26 Rozek W Ricardo-Dukelow M Holloway S Gendelman H E Wojna V

Melendez L M Ciborowski P Cerebrospinal fluid proteomic profiling of HIV-1-infected

patients with cognitive impairment J Proteome Res 2007 6 (11) 4189-99

27 Schutzer S E Liu T Natelson B H Angel T E Schepmoes A A Purvine S O

Hixson K K Lipton M S Camp D G Coyle P K Smith R D Bergquist J

Establishing the proteome of normal human cerebrospinal fluid PLoS One 2010 5 (6) e10980

28 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C

Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome

Res 2008 7 (1) 386-99

29 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from

patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma

biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 2010 878 (22) 2003-12

30 You J S Gelfanova V Knierman M D Witzmann F A Wang M Hale J E The

impact of blood contamination on the proteome of cerebrospinal fluid Proteomics 2005 5 (1)

290-6

31 Yuan X Desiderio D M Human cerebrospinal fluid peptidomics J Mass Spectrom

2005 40 (2) 176-81

32 Ringman J M Schulman H Becker C Jones T Bai Y Immermann F Cole G

Sokolow S Gylys K Geschwind D H Cummings J L Wan H I Proteomic changes in

cerebrospinal fluid of presymptomatic and affected persons carrying familial Alzheimer disease

mutations Arch Neurol 2012 69 (1) 96-104

33 Jahn H Wittke S Zurbig P Raedler T J Arlt S Kellmann M Mullen W

Eichenlaub M Mischak H Wiedemann K Peptide fingerprinting of Alzheimers disease in

cerebrospinal fluid identification and prospective evaluation of new synaptic biomarkers PLoS

One 2011 6 (10) e26540

34 Rajagopal M U Hathout Y MacDonald T J Kieran M W Gururangan S

Blaney S M Phillips P Packer R Gordish-Dressman H Rood B R Proteomic profiling

of cerebrospinal fluid identifies prostaglandin D2 synthase as a putative biomarker for pediatric

medulloblastoma A pediatric brain tumor consortium study Proteomics 2011 11 (5) 935-43

35 Giron P Dayon L Turck N Hoogland C Sanchez J C Quantitative analysis of

human cerebrospinal fluid proteins using a combination of cysteine tagging and amine-reactive

isobaric labeling J Proteome Res 2011 10 (1) 249-58

36 Ting L Rad R Gygi S P Haas W MS3 eliminates ratio distortion in isobaric

multiplexed quantitative proteomics Nat Methods 2011 8 (11) 937-40

37 Michalski A Damoc E Hauschild J P Lange O Wieghaus A Makarov A

Nagaraj N Cox J Mann M Horning S Mass spectrometry-based proteomics using Q

Exactive a high-performance benchtop quadrupole Orbitrap mass spectrometer Mol Cell

Proteomics 2011 10 (9) M111 011015

38 Spirin V Shpunt A Seebacher J Gentzel M Shevchenko A Gygi S Sunyaev

S Assigning spectrum-specific P-values to protein identifications by mass spectrometry

Bioinformatics 2011 27 (8) 1128-34

39 Kellie J F Catherman A D Durbin K R Tran J C Tipton J D Norris J L

Witkowski C E Thomas P M Kelleher N L Robust Analysis of the Yeast Proteome under

59

50 kDa by Molecular-Mass-Based Fractionation and Top-Down Mass Spectrometry Anal Chem

2012 84 (1) 209-15

40 Wilhelm M Kirchner M Steen J A Steen H mz5 Space- and Time-efficient

Storage of Mass Spectrometry Data Sets Mol Cell Proteomics 2012 11 (1) O111 011379

41 Zhou F Sikorski T W Ficarro S B Webber J T Marto J A Online nanoflow

reversed phase-strong anion exchange-reversed phase liquid chromatography-tandem mass

spectrometry platform for efficient and in-depth proteome sequence analysis of complex

organisms Anal Chem 2011 83 (18) 6996-7005

42 Winter D Steen H Optimization of cell lysis and protein digestion protocols for the

analysis of HeLa S3 cells by LC-MSMS Proteomics 2011 11 (24) 4726-30

43 Shteynberg D Deutsch E W Lam H Eng J K Sun Z Tasman N Mendoza L

Moritz R L Aebersold R Nesvizhskii A I iProphet multi-level integrative analysis of

shotgun proteomic data improves peptide and protein identification rates and error estimates

Mol Cell Proteomics 2011 10 (12) M111 007690

44 Kellie J F Catherman A D Durbin K R Tran J C Tipton J D Norris J L

Witkowski C E Thomas P M Kelleher N L Robust Analysis of the Yeast Proteome under

50 kDa by Molecular-Mass-Based Fractionation and Top-Down Mass Spectrometry Anal Chem

84 (1) 209-15

45 Datta A Park J E Li X Zhang H Ho Z S Heese K Lim S K Tam J P Sze

S K Phenotyping of an in vitro model of ischemic penumbra by iTRAQ-based shotgun

quantitative proteomics J Proteome Res 9 (1) 472-84

46 Decramer S Gonzalez de Peredo A Breuil B Mischak H Monsarrat B Bascands

J L Schanstra J P Urine in clinical proteomics Mol Cell Proteomics 2008 7 (10) 1850-62

47 De La Monte S M Wands J R The AD7c-NTP neuronal thread protein biomarker for

detecting Alzheimers disease J Alzheimers Dis 2001 3 (3) 345-353

48 Van Dorsselaer A Carapito C Delalande F Schaeffer-Reiss C Thierse D Diemer

H McNair D S Krewski D Cashman N R Detection of prion protein in urine-derived

injectable fertility products by a targeted proteomic approach PLoS One 2011 6 (3) e17815

49 Kuwabara Y Mine K Katayama A Inagawa T Akira S Takeshita T Proteomic

analyses of recombinant human follicle-stimulating hormone and urinary-derived gonadotropin

preparations J Reprod Med 2009 54 (8) 459-66

50 Court M Selevsek N Matondo M Allory Y Garin J Masselon C D Domon B

Toward a standardized urine proteome analysis methodology Proteomics 2011 11 (6) 1160-71

51 Marimuthu A OMeally R N Chaerkady R Subbannayya Y Nanjappa V Kumar

P Kelkar D S Pinto S M Sharma R Renuse S Goel R Christopher R Delanghe B

Cole R N Harsha H C Pandey A A comprehensive map of the human urinary proteome J

Proteome Res 2011 10 (6) 2734-43

52 Bermejo-Pareja F Antequera D Vargas T Molina J A Carro E Saliva levels of

Abeta1-42 as potential biomarker of Alzheimers disease a pilot study BMC Neurol 2010 10

108

53 Devic I Hwang H Edgar J S Izutsu K Presland R Pan C Goodlett D R

Wang Y Armaly J Tumas V Zabetian C P Leverenz J B Shi M Zhang J Salivary

alpha-synuclein and DJ-1 potential biomarkers for Parkinsons disease Brain 2011 134 (Pt 7)

e178

60

54 Al-Tarawneh S K Border M B Dibble C F Bencharit S Defining salivary

biomarkers using mass spectrometry-based proteomics a systematic review OMICS 2011 15

(6) 353-61

55 Castagnola M Cabras T Vitali A Sanna M T Messana I Biotechnological

implications of the salivary proteome Trends Biotechnol 2011 29 (8) 409-18

56 Caprioli R M Farmer T B Gile J Molecular imaging of biological samples

localization of peptides and proteins using MALDI-TOF MS Anal Chem 1997 69 (23) 4751-

60

57 Stoeckli M Chaurand P Hallahan D E Caprioli R M Imaging mass spectrometry

a new technology for the analysis of protein expression in mammalian tissues Nat Med 2001 7

(4) 493-6

58 Chaurand P Schwartz S A Caprioli R M Imaging mass spectrometry a new tool to

investigate the spatial organization of peptides and proteins in mammalian tissue sections Curr

Opin Chem Biol 2002 6 (5) 676-81

59 Chaurand P Stoeckli M Caprioli R M Direct profiling of proteins in biological

tissue sections by MALDI mass spectrometry Anal Chem 1999 71 (23) 5263-70

60 Seeley E H Caprioli R M MALDI imaging mass spectrometry of human tissue

method challenges and clinical perspectives Trends Biotechnol 2011 29 (3) 136-43

61 Cazares L H Troyer D A Wang B Drake R R Semmes O J MALDI tissue

imaging from biomarker discovery to clinical applications Anal Bioanal Chem 2011 401 (1)

17-27

62 Stauber J Lemaire R Franck J Bonnel D Croix D Day R Wisztorski M

Fournier I Salzet M MALDI imaging of formalin-fixed paraffin-embedded tissues

application to model animals of Parkinson disease for biomarker hunting J Proteome Res 2008

7 (3) 969-78

63 Yuki D Sugiura Y Zaima N Akatsu H Hashizume Y Yamamoto T Fujiwara

M Sugiyama K Setou M Hydroxylated and non-hydroxylated sulfatide are distinctly

distributed in the human cerebral cortex Neuroscience 2011 193 44-53

64 Makridakis M Vlahou A Secretome proteomics for discovery of cancer biomarkers J

Proteomics 2010 73 (12) 2291-305

65 Dowling P Clynes M Conditioned media from cell lines a complementary model to

clinical specimens for the discovery of disease-specific biomarkers Proteomics 2011 11 (4)

794-804

66 Moore N H Costa L G Shaffer S A Goodlett D R Guizzetti M Shotgun

proteomics implicates extracellular matrix proteins and protease systems in neuronal

development induced by astrocyte cholinergic stimulation J Neurochem 2009 108 (4) 891-908

67 Keene S D Greco T M Parastatidis I Lee S H Hughes E G Balice-Gordon R

J Speicher D W Ischiropoulos H Mass spectrometric and computational analysis of

cytokine-induced alterations in the astrocyte secretome Proteomics 2009 9 (3) 768-82

68 Dowell J A Johnson J A Li L Identification of astrocyte secreted proteins with a

combination of shotgun proteomics and bioinformatics J Proteome Res 2009 8 (8) 4135-43

69 Ahmed F E Sample preparation and fractionation for proteome analysis and cancer

biomarker discovery by mass spectrometry J Sep Sci 2009 32 (5-6) 771-98

70 Righetti P G Fasoli E Boschetti E Combinatorial peptide ligand libraries the

conquest of the hidden proteome advances at great strides Electrophoresis 2011 32 (9) 960-6

61

71 Selvaraju S El Rassi Z Liquid-phase-based separation systems for depletion

prefractionation and enrichment of proteins in biological fluids and matrices for in-depth

proteomics analysis - An update covering the period 2008-2011 Electrophoresis 2011 33 (1)

74-88

72 Kalume D E Molina H Pandey A Tackling the phosphoproteome tools and

strategies Curr Opin Chem Biol 2003 7 (1) 64-9

73 Cohen P The regulation of protein function by multisite phosphorylation--a 25 year

update Trends Biochem Sci 2000 25 (12) 596-601

74 Nagaraj N DSouza R C Cox J Olsen J V Mann M Feasibility of large-scale

phosphoproteomics with higher energy collisional dissociation fragmentation J Proteome Res

2010 9 (12) 6786-94

75 Olsen J V Vermeulen M Santamaria A Kumar C Miller M L Jensen L J

Gnad F Cox J Jensen T S Nigg E A Brunak S Mann M Quantitative

phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis Sci

Signal 2010 3 (104) ra3

76 Wagner P D Vu N D Histidine to aspartate phosphotransferase activity of nm23

proteins phosphorylation of aldolase C on Asp-319 Biochem J 2000 346 Pt 3 623-30

77 Chi A Huttenhower C Geer L Y Coon J J Syka J E Bai D L Shabanowitz

J Burke D J Troyanskaya O G Hunt D F Analysis of phosphorylation sites on proteins

from Saccharomyces cerevisiae by electron transfer dissociation (ETD) mass spectrometry Proc

Natl Acad Sci U S A 2007 104 (7) 2193-8

78 Rudrabhatla P Jaffe H Pant H C Direct evidence of phosphorylated neuronal

intermediate filament proteins in neurofibrillary tangles (NFTs) phosphoproteomics of

Alzheimers NFTs FASEB J 25 (11) 3896-905

79 Saez-Valero J Fodero L R Sjogren M Andreasen N Amici S Gallai V

Vanderstichele H Vanmechelen E Parnetti L Blennow K Small D H Glycosylation of

acetylcholinesterase and butyrylcholinesterase changes as a function of the duration of

Alzheimers disease J Neurosci Res 2003 72 (4) 520-6

80 Silveyra M X Cuadrado-Corrales N Marcos A Barquero M S Rabano A

Calero M Saez-Valero J Altered glycosylation of acetylcholinesterase in Creutzfeldt-Jakob

disease J Neurochem 2006 96 (1) 97-104

81 Wei X Li L Comparative glycoproteomics approaches and applications Brief Funct

Genomic Proteomic 2009 8 (2) 104-13

82 Tian Y Zhang H Glycoproteomics and clinical applications Proteomics Clin Appl

2010 4 (2) 124-32

83 Butterfield D A Owen J B Lectin-affinity chromatography brain glycoproteomics

and Alzheimer disease insights into protein alterations consistent with the pathology and

progression of this dementing disorder Proteomics Clin Appl 2011 5 (1-2) 50-6

84 Wei X Herbst A Ma D Aiken J Li L A quantitative proteomic approach to prion

disease biomarker research delving into the glycoproteome J Proteome Res 2010 10 (6) 2687-

702

85 Rucevic M Hixson D Josic D Mammalian plasma membrane proteins as potential

biomarkers and drug targets Electrophoresis 2011 32 (13) 1549-64

86 Gilmore J M Washburn M P Advances in shotgun proteomics and the analysis of

membrane proteomes J Proteomics 2010 73 (11) 2078-91

62

87 Weekes M P Antrobus R Lill J R Duncan L M Hor S Lehner P J

Comparative analysis of techniques to purify plasma membrane proteins J Biomol Tech 2010

21 (3) 108-15

88 Helbig A O Heck A J Slijper M Exploring the membrane proteome--challenges

and analytical strategies J Proteomics 2010 73 (5) 868-78

89 Griffin N M Schnitzer J E Overcoming key technological challenges in using mass

spectrometry for mapping cell surfaces in tissues Mol Cell Proteomics 2011 10 (2) R110

000935

90 Groen A J Lilley K S Proteomics of total membranes and subcellular membranes

Expert Rev Proteomics 2010 7 (6) 867-78

91 Wisniewski J R Tools for phospho- and glycoproteomics of plasma membranes Amino

Acids 2011 41 (2) 223-33

92 Wei X Dulberger C Li L Characterization of murine brain membrane glycoproteins

by detergent assisted lectin affinity chromatography Anal Chem 2010 82 (15) 6329-33

93 Zhang H Guo T Li X Datta A Park J E Yang J Lim S K Tam J P Sze S

K Simultaneous characterization of glyco- and phosphoproteomes of mouse brain membrane

proteome with electrostatic repulsion hydrophilic interaction chromatography Mol Cell

Proteomics 2011 9 (4) 635-47

94 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 2011 10 (9) 4219-29

95 Ma M Chen R Ge Y He H Marshall A G Li L Combining bottom-up and top-

down mass spectrometric strategies for de novo sequencing of the crustacean hyperglycemic

hormone from Cancer borealis Anal Chem 2009 81 (1) 240-7

96 Chen R Ma M Hui L Zhang J Li L Measurement of neuropeptides in crustacean

hemolymph via MALDI mass spectrometry J Am Soc Mass Spectrom 2009 20 (4) 708-18

97 Wei H Nolkrantz K Parkin M C Chisolm C N OCallaghan J P Kennedy R

T Identification and quantification of neuropeptides in brain tissue by capillary liquid

chromatography coupled off-line to MALDI-TOF and MALDI-TOFTOF-MS Anal Chem 2006

78 (13) 4342-51

98 Dowell J A Heyden W V Li L Rat neuropeptidomics by LC-MSMS and MALDI-

FTMS Enhanced dissection and extraction techniques coupled with 2D RP-RP HPLC J

Proteome Res 2006 5 (12) 3368-75

99 Che F Y Fricker L D Quantitative peptidomics of mouse pituitary comparison of

different stable isotopic tags J Mass Spectrom 2005 40 (2) 238-49

100 Fricker L D Analysis of mouse brain peptides using mass spectrometry-based

peptidomics implications for novel functions ranging from non-classical neuropeptides to

microproteins Mol Biosyst 2010 6 (8) 1355-65

101 Colgrave M L Xi L Lehnert S A Flatscher-Bader T Wadensten H Nilsson A

Andren P E Wijffels G Neuropeptide profiling of the bovine hypothalamus thermal

stabilization is an effective tool in inhibiting post-mortem degradation Proteomics 2011 11 (7)

1264-76

102 Scholz B Alm H Mattsson A Nilsson A Kultima K Savitski M M Falth M

Skold K Brunstrom B Andren P E Dencker L Neuropeptidomic analysis of the

embryonic Japanese quail diencephalon BMC Dev Biol 2010 10 30

63

103 Hummon A B Amare A Sweedler J V Discovering new invertebrate neuropeptides

using mass spectrometry Mass Spectrom Rev 2006 25 (1) 77-98

104 Vilim F S Sasaki K Rybak J Alexeeva V Cropper E C Jing J Orekhova I

V Brezina V Price D Romanova E V Rubakhin S S Hatcher N Sweedler J V

Weiss K R Distinct mechanisms produce functionally complementary actions of neuropeptides

that are structurally related but derived from different precursors J Neurosci 2010 30 (1) 131-

47

105 Fu Q Li L De novo sequencing of neuropeptides using reductive isotopic methylation

and investigation of ESI QTOF MSMS fragmentation pattern of neuropeptides with N-terminal

dimethylation Anal Chem 2005 77 (23) 7783-95

106 Chen R Jiang X Conaway M C Mohtashemi I Hui L Viner R Li L Mass

spectral analysis of neuropeptide expression and distribution in the nervous system of the lobster

Homarus americanus J Proteome Res 2010 9 (2) 818-32

107 Altelaar A F Mohammed S Brans M A Adan R A Heck A J Improved

identification of endogenous peptides from murine nervous tissue by multiplexed peptide

extraction methods and multiplexed mass spectrometric analysis J Proteome Res 2009 8 (2)

870-6

108 Gelman J S Sironi J Castro L M Ferro E S Fricker L D Peptidomic analysis

of human cell lines J Proteome Res 2011 10 (4) 1583-92

109 Li L Sweedler J V Peptides in the brain mass spectrometry-based measurement

approaches and challenges Annu Rev Anal Chem (Palo Alto Calif) 2008 1 451-83

110 Marouga R David S Hawkins E The development of the DIGE system 2D

fluorescence difference gel analysis technology Anal Bioanal Chem 2005 382 (3) 669-78

111 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S

Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-

dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of

Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66

112 Ottervald J Franzen B Nilsson K Andersson L I Khademi M Eriksson B

Kjellstrom S Marko-Varga G Vegvari A Harris R A Laurell T Miliotis T

Matusevicius D Salter H Ferm M Olsson T Multiple sclerosis Identification and clinical

evaluation of novel CSF biomarkers J Proteomics 2010 73 (6) 1117-32

113 Diez R Herbstreith M Osorio C Alzate O 2-D Fluorescence Difference Gel

Electrophoresis (DIGE) in Neuroproteomics 2010

114 Lilley K S Razzaq A Dupree P Two-dimensional gel electrophoresis recent

advances in sample preparation detection and quantitation Curr Opin Chem Biol 2002 6 (1)

46-50

115 Oh-Ishi M Maeda T Separation techniques for high-molecular-mass proteins J

Chromatogr B Analyt Technol Biomed Life Sci 2002 771 (1-2) 49-66

116 Washburn M P Wolters D Yates J R 3rd Large-scale analysis of the yeast

proteome by multidimensional protein identification technology Nat Biotechnol 2001 19 (3)

242-7

117 Burke T W Mant C T Black J A Hodges R S Strong cation-exchange high-

performance liquid chromatography of peptides Effect of non-specific hydrophobic interactions

and linearization of peptide retention behaviour J Chromatogr 1989 476 377-89

118 Edelmann M J Strong cation exchange chromatography in analysis of posttranslational

modifications innovations and perspectives J Biomed Biotechnol 2011 2011 936508

64

119 Francois I Sandra K Sandra P Comprehensive liquid chromatography fundamental

aspects and practical considerations--a review Anal Chim Acta 2009 641 (1-2) 14-31

120 Guiochon G Marchetti N Mriziq K Shalliker R A Implementations of two-

dimensional liquid chromatography J Chromatogr A 2008 1189 (1-2) 109-68

121 Song C Ye M Han G Jiang X Wang F Yu Z Chen R Zou H Reversed-

phase-reversed-phase liquid chromatography approach with high orthogonality for

multidimensional separation of phosphopeptides Anal Chem 2010 82 (1) 53-6

122 Alpert A J Hydrophilic-interaction chromatography for the separation of peptides

nucleic acids and other polar compounds J Chromatogr 1990 499 177-96

123 Gilar M Olivova P Daly A E Gebler J C Orthogonality of separation in two-

dimensional liquid chromatography Anal Chem 2005 77 (19) 6426-34

124 Di Palma S Stange D van de Wetering M Clevers H Heck A J Mohammed S

Highly sensitive proteome analysis of FACS-sorted adult colon stem cells J Proteome Res 2011

10 (8) 3814-9

125 Di Palma S Boersema P J Heck A J Mohammed S Zwitterionic hydrophilic

interaction liquid chromatography (ZIC-HILIC and ZIC-cHILIC) provide high resolution

separation and increase sensitivity in proteome analysis Anal Chem 2011 83 (9) 3440-7

126 Zarei M Sprenger A Metzger F Gretzmeier C Dengjel J Comparison of ERLIC-

TiO2 HILIC-TiO2 and SCX-TiO2 for global phosphoproteomics approaches J Proteome Res

2011 10 (8) 3474-83

127 Neue K Mormann M Peter-Katalinic J Pohlentz G Elucidation of glycoprotein

structures by unspecific proteolysis and direct nanoESI mass spectrometric analysis of ZIC-

HILIC-enriched glycopeptides J Proteome Res 2011 10 (5) 2248-60

128 Ow S Y Salim M Noirel J Evans C Wright P C Minimising iTRAQ ratio

compression through understanding LC-MS elution dependence and high-resolution HILIC

fractionation Proteomics 2011 11 (11) 2341-6

129 Alpert A J Electrostatic repulsion hydrophilic interaction chromatography for isocratic

separation of charged solutes and selective isolation of phosphopeptides Anal Chem 2008 80

(1) 62-76

130 Chien K Y Liu H C Goshe M B Development and application of a

phosphoproteomic method using electrostatic repulsion-hydrophilic interaction chromatography

(ERLIC) IMAC and LC-MSMS analysis to study Mareks Disease Virus infection J Proteome

Res 2011 10 (9) 4041-53

131 Liu X Plasencia M Ragg S Valentine S J Clemmer D E Development of high

throughput dispersive LC-ion mobility-TOFMS techniques for analysing the human plasma

proteome Brief Funct Genomic Proteomic 2004 3 (2) 177-86

132 Valentine S J Liu X Plasencia M D Hilderbrand A E Kurulugama R T

Koeniger S L Clemmer D E Developing liquid chromatography ion mobility mass

spectometry techniques Expert Rev Proteomics 2005 2 (4) 553-65

133 Valentine S J Plasencia M D Liu X Krishnan M Naylor S Udseth H R

Smith R D Clemmer D E Toward plasma proteome profiling with ion mobility-mass

spectrometry J Proteome Res 2006 5 (11) 2977-84

134 Valentine S J Ewing M A Dilger J M Glover M S Geromanos S Hughes C

Clemmer D E Using ion mobility data to improve peptide identification intrinsic amino acid

size parameters J Proteome Res 2011 10 (5) 2318-29

65

135 Ahmed F E The role of capillary electrophoresis-mass spectrometry to proteome

analysis and biomarker discovery J Chromatogr B Analyt Technol Biomed Life Sci 2009 877

(22) 1963-81

136 Fonslow B R Yates J R 3rd Capillary electrophoresis applied to proteomic analysis

J Sep Sci 2009 32 (8) 1175-88

137 Haselberg R de Jong G J Somsen G W Capillary electrophoresis-mass

spectrometry for the analysis of intact proteins J Chromatogr A 2007 1159 (1-2) 81-109

138 Haselberg R de Jong G J Somsen G W Capillary electrophoresis-mass

spectrometry for the analysis of intact proteins 2007-2010 Electrophoresis 2011 32 (1) 66-82

139 Huck C W Bakry R Huber L A Bonn G K Progress in capillary electrophoresis

coupled to matrix-assisted laser desorptionionization - time of flight mass spectrometry

Electrophoresis 2006 27 (11) 2063-74

140 Klampfl C W CE with MS detection a rapidly developing hyphenated technique

Electrophoresis 2009 30 Suppl 1 S83-91

141 Simpson D C Smith R D Combining capillary electrophoresis with mass

spectrometry for applications in proteomics Electrophoresis 2005 26 (7-8) 1291-305

142 Li Y Champion M M Sun L Champion P A Wojcik R Dovichi N J Capillary

zone electrophoresis-electrospray ionization-tandem mass spectrometry as an alternative

proteomics platform to ultraperformance liquid chromatography-electrospray ionization-tandem

mass spectrometry for samples of intermediate complexity Anal Chem 2012 84 (3) 1617-22

143 Dai L Li C Shedden K A Lee C J Quoc H Simeone D M Lubman D M

Quantitative proteomic profiling studies of pancreatic cancer stem cells J Proteome Res 2010 9

(7) 3394-402

144 Chen J Balgley B M DeVoe D L Lee C S Capillary isoelectric focusing-based

multidimensional concentrationseparation platform for proteome analysis Anal Chem 2003 75

(13) 3145-52

145 Mischak H Delles C Klein J Schanstra J P Urinary proteomics based on capillary

electrophoresis-coupled mass spectrometry in kidney disease discovery and validation of

biomarkers and clinical application Adv Chronic Kidney Dis 2010 17 (6) 493-506

146 Albalat A Mischak H Mullen W Urine proteomics in clinical applications

technologies principal considerations and clinical implementation Prilozi 2011 32 (1) 13-44

147 Zuberovic A Hanrieder J Hellman U Bergquist J Wetterhall M Proteome

profiling of human cerebrospinal fluid exploring the potential of capillary electrophoresis with

surface modified capillaries for analysis of complex biological samples Eur J Mass Spectrom

(Chichester Eng) 2008 14 (4) 249-60

148 Johannesson N Olsson L Backstrom D Wetterhall M Danielsson R Bergquist

J Screening for biomarkers in plasma from patients with gangrenous and phlegmonous

appendicitis using CE and CEC in combination with MS Electrophoresis 2007 28 (9) 1435-43

149 Guo T Lee C S Wang W DeVoe D L Balgley B M Capillary separations

enabling tissue proteomics-based biomarker discovery Electrophoresis 2006 27 (18) 3523-32

150 Hwang S I Thumar J Lundgren D H Rezaul K Mayya V Wu L Eng J

Wright M E Han D K Direct cancer tissue proteomics a method to identify candidate

cancer biomarkers from formalin-fixed paraffin-embedded archival tissues Oncogene 2007 26

(1) 65-76

66

151 Desiderio C Rossetti D V Iavarone F Messana I Castagnola M Capillary

electrophoresis--mass spectrometry recent trends in clinical proteomics J Pharm Biomed Anal

2010 53 (5) 1161-9

152 Albalat A Mischak H Mullen W Clinical application of urinary

proteomicspeptidomics Expert Rev Proteomics 2011 8 (5) 615-29

153 Ye X Luke B Andresson T Blonder J 18O stable isotope labeling in MS-based

proteomics Brief Funct Genomic Proteomic 2009 8 (2) 136-44

154 Staes A Demol H Van Damme J Martens L Vandekerckhove J Gevaert K

Global differential non-gel proteomics by quantitative and stable labeling of tryptic peptides with

oxygen-18 J Proteome Res 2004 3 (4) 786-91

155 Jorge I Navarro P Martinez-Acedo P Nunez E Serrano H Alfranca A

Redondo J M Vazquez J Statistical model to analyze quantitative proteomics data obtained

by 18O16O labeling and linear ion trap mass spectrometry application to the study of vascular

endothelial growth factor-induced angiogenesis in endothelial cells Mol Cell Proteomics 2009

8 (5) 1130-49

156 Gygi S P Rist B Gerber S A Turecek F Gelb M H Aebersold R Quantitative

analysis of complex protein mixtures using isotope-coded affinity tags Nat Biotechnol 1999 17

(10) 994-9

157 Haqqani A S Kelly J F Stanimirovic D B Quantitative protein profiling by mass

spectrometry using isotope-coded affinity tags Methods Mol Biol 2008 439 225-40

158 Ross P L Huang Y N Marchese J N Williamson B Parker K Hattan S

Khainovski N Pillai S Dey S Daniels S Purkayastha S Juhasz P Martin S Bartlet-

Jones M He F Jacobson A Pappin D J Multiplexed protein quantitation in

Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents Mol Cell Proteomics

2004 3 (12) 1154-69

159 Thompson A Schafer J Kuhn K Kienle S Schwarz J Schmidt G Neumann T

Johnstone R Mohammed A K Hamon C Tandem mass tags a novel quantification strategy

for comparative analysis of complex protein mixtures by MSMS Anal Chem 2003 75 (8)

1895-904

160 Dayon L Hainard A Licker V Turck N Kuhn K Hochstrasser D F Burkhard

P R Sanchez J C Relative quantification of proteins in human cerebrospinal fluids by

MSMS using 6-plex isobaric tags Anal Chem 2008 80 (8) 2921-31

161 DAscenzo M Choe L Lee K H iTRAQPak an R based analysis and visualization

package for 8-plex isobaric protein expression data Brief Funct Genomic Proteomic 2008 7 (2)

127-35

162 Choe L DAscenzo M Relkin N R Pappin D Ross P Williamson B Guertin

S Pribil P Lee K H 8-plex quantitation of changes in cerebrospinal fluid protein expression

in subjects undergoing intravenous immunoglobulin treatment for Alzheimers disease

Proteomics 2007 7 (20) 3651-60

163 Xiang F Ye H Chen R Fu Q Li L NN-dimethyl leucines as novel isobaric

tandem mass tags for quantitative proteomics and peptidomics Anal Chem 2010 82 (7) 2817-

25

164 Ong S E Blagoev B Kratchmarova I Kristensen D B Steen H Pandey A

Mann M Stable isotope labeling by amino acids in cell culture SILAC as a simple and

accurate approach to expression proteomics Mol Cell Proteomics 2002 1 (5) 376-86

67

165 Ong S E Kratchmarova I Mann M Properties of 13C-substituted arginine in stable

isotope labeling by amino acids in cell culture (SILAC) J Proteome Res 2003 2 (2) 173-81

166 Pimienta G Chaerkady R Pandey A SILAC for global phosphoproteomic analysis

Methods Mol Biol 2009 527 107-16 x

167 Zhang G Neubert T A Use of stable isotope labeling by amino acids in cell culture

(SILAC) for phosphotyrosine protein identification and quantitation Methods Mol Biol 2009

527 79-92 xi

168 Bondarenko P V Chelius D Shaler T A Identification and relative quantitation of

protein mixtures by enzymatic digestion followed by capillary reversed-phase liquid

chromatography-tandem mass spectrometry Anal Chem 2002 74 (18) 4741-9

169 Chelius D Bondarenko P V Quantitative profiling of proteins in complex mixtures

using liquid chromatography and mass spectrometry J Proteome Res 2002 1 (4) 317-23

170 Liu H Sadygov R G Yates J R 3rd A model for random sampling and estimation

of relative protein abundance in shotgun proteomics Anal Chem 2004 76 (14) 4193-201

171 Wang W Zhou H Lin H Roy S Shaler T A Hill L R Norton S Kumar P

Anderle M Becker C H Quantification of proteins and metabolites by mass spectrometry

without isotopic labeling or spiked standards Anal Chem 2003 75 (18) 4818-26

172 Silva J C Denny R Dorschel C A Gorenstein M Kass I J Li G Z McKenna

T Nold M J Richardson K Young P Geromanos S Quantitative proteomic analysis by

accurate mass retention time pairs Anal Chem 2005 77 (7) 2187-200

173 Zybailov B Coleman M K Florens L Washburn M P Correlation of relative

abundance ratios derived from peptide ion chromatograms and spectrum counting for

quantitative proteomic analysis using stable isotope labeling Anal Chem 2005 77 (19) 6218-

24

174 Filiou M D Martins-de-Souza D Guest P C Bahn S Turck C W To label or not

to label Applications of quantitative proteomics in neuroscience research Proteomics 2012

175 Neilson K A Ali N A Muralidharan S Mirzaei M Mariani M Assadourian G

Lee A van Sluyter S C Haynes P A Less label more free approaches in label-free

quantitative mass spectrometry Proteomics 2011 11 (4) 535-53

176 Zhu W Smith J W Huang C M Mass spectrometry-based label-free quantitative

proteomics J Biomed Biotechnol 2010 2010 840518

177 Xie F Liu T Qian W J Petyuk V A Smith R D Liquid chromatography-mass

spectrometry-based quantitative proteomics J Biol Chem 2012 286 (29) 25443-9

178 Shen Y Tolic N Xie F Zhao R Purvine S O Schepmoes A A Moore R J

Anderson G A Smith R D Effectiveness of CID HCD and ETD with FT MSMS for

degradomic-peptidomic analysis comparison of peptide identification methods J Proteome Res

2011 10 (9) 3929-43

179 Zubarev R A Kelleher N L McLafferty F W Electron Capture Dissociation of

Multiply Charged Protein Cations A Nonergodic Process Journal of the American Chemical

Society 1998 120 (13) 3265-3266

180 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and

protein sequence analysis by electron transfer dissociation mass spectrometry Proc Natl Acad

Sci U S A 2004 101 (26) 9528-33

181 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-

site identity and position on electron-transfer dissociation of polypeptide cations J Am Chem Soc

2007 129 (40) 12232-43

68

182 Swaney D L McAlister G C Coon J J Decision tree-driven tandem mass

spectrometry for shotgun proteomics Nat Methods 2008 5 (11) 959-64

183 Zhou W Ross M M Tessitore A Ornstein D Vanmeter A Liotta L A

Petricoin E F 3rd An initial characterization of the serum phosphoproteome J Proteome Res

2009 8 (12) 5523-31

184 Liu H Hakansson K Electron capture dissociation of tyrosine O-sulfated peptides

complexed with divalent metal cations Anal Chem 2006 78 (21) 7570-6

185 Snovida S I Bodnar E D Viner R Saba J Perreault H A simple cellulose

column procedure for selective enrichment of glycopeptides and characterization by nano LC

coupled with electron-transfer and high-energy collisional-dissociation tandem mass

spectrometry Carbohydr Res 2010 345 (6) 792-801

186 Sobott F Watt S J Smith J Edelmann M J Kramer H B Kessler B M

Comparison of CID versus ETD based MSMS fragmentation for the analysis of protein

ubiquitination J Am Soc Mass Spectrom 2009 20 (9) 1652-9

187 Eliuk S M Maltby D Panning B Burlingame A L High resolution electron

transfer dissociation studies of unfractionated intact histones from murine embryonic stem cells

using on-line capillary LC separation determination of abundant histone isoforms and post-

translational modifications Mol Cell Proteomics 2010 9 (5) 824-37

188 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 10 (9) 4219-29

189 Molina H Matthiesen R Kandasamy K Pandey A Comprehensive comparison of

collision induced dissociation and electron transfer dissociation Anal Chem 2008 80 (13)

4825-35

190 Olsen J V Macek B Lange O Makarov A Horning S Mann M Higher-energy

C-trap dissociation for peptide modification analysis Nat Methods 2007 4 (9) 709-12

191 McAlister G C Phanstiel D Wenger C D Lee M V Coon J J Analysis of

tandem mass spectra by FTMS for improved large-scale proteomics with superior protein

quantification Anal Chem 2010 82 (1) 316-22

192 Dayon L Pasquarello C Hoogland C Sanchez J C Scherl A Combining low-

and high-energy tandem mass spectra for optimized peptide quantification with isobaric tags J

Proteomics 2010 73 (4) 769-77

193 Second T P Blethrow J D Schwartz J C Merrihew G E MacCoss M J

Swaney D L Russell J D Coon J J Zabrouskov V Dual-pressure linear ion trap mass

spectrometer improving the analysis of complex protein mixtures Anal Chem 2009 81 (18)

7757-65

194 Jedrychowski M P Huttlin E L Haas W Sowa M E Rad R Gygi S P

Evaluation of HCD- and CID-type fragmentation within their respective detection platforms for

murine phosphoproteomics Mol Cell Proteomics 2011 10 (12) M111 009910

195 Frese C K Altelaar A F Hennrich M L Nolting D Zeller M Griep-Raming J

Heck A J Mohammed S Improved peptide identification by targeted fragmentation using

CID HCD and ETD on an LTQ-Orbitrap Velos J Proteome Res 2011 10 (5) 2377-88

196 Chakraborty A B Berger S J Gebler J C Use of an integrated MS--multiplexed

MSMS data acquisition strategy for high-coverage peptide mapping studies Rapid Commun

Mass Spectrom 2007 21 (5) 730-44

69

197 Ramos A A Yang H Rosen L E Yao X Tandem parallel fragmentation of

peptides for mass spectrometry Anal Chem 2006 78 (18) 6391-7

198 Barbara J E Castro-Perez J M High-resolution chromatographytime-of-flight MSE

with in silico data mining is an information-rich approach to reactive metabolite screening Rapid

Commun Mass Spectrom 2011 25 (20) 3029-40

199 Geromanos S J Vissers J P Silva J C Dorschel C A Li G Z Gorenstein M

V Bateman R H Langridge J I The detection correlation and comparison of peptide

precursor and product ions from data independent LC-MS with data dependant LC-MSMS

Proteomics 2009 9 (6) 1683-95

200 Koutroukides T A Guest P C Leweke F M Bailey D M Rahmoune H Bahn

S Martins-de-Souza D Characterization of the human serum depletome by label-free shotgun

proteomics J Sep Sci 34 (13) 1621-6

201 Martins-de-Souza D Guest P C Guest F L Bauder C Rahmoune H Pietsch S

Roeber S Kretzschmar H Mann D Baborie A Bahn S Characterization of the human

primary visual cortex and cerebellum proteomes using shotgun mass spectrometry-data-

independent analyses Proteomics 12 (3) 500-4

202 Scatena R Bottoni P Pontoglio A Giardina B Revisiting the Warburg effect in

cancer cells with proteomics The emergence of new approaches to diagnosis prognosis and

therapy Proteomics Clin Appl 4 (2) 143-58

203 Herberth M Koethe D Cheng T M Krzyszton N D Schoeffmann S Guest P

C Rahmoune H Harris L W Kranaster L Leweke F M Bahn S Impaired glycolytic

response in peripheral blood mononuclear cells of first-onset antipsychotic-naive schizophrenia

patients Mol Psychiatry 16 (8) 848-59

204 Krishnamurthy D Levin Y Harris L W Umrania Y Bahn S Guest P C

Analysis of the human pituitary proteome by data independent label-free liquid chromatography

tandem mass spectrometry Proteomics 11 (3) 495-500

205 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

206 Eng J K McCormack A L Yates Iii J R An approach to correlate tandem mass

spectral data of peptides with amino acid sequences in a protein database Journal of the

American Society for Mass Spectrometry 1994 5 (11) 976-989

207 Craig R Beavis R C TANDEM matching proteins with tandem mass spectra

Bioinformatics 2004 20 (9) 1466-7

208 Geer L Y Markey S P Kowalak J A Wagner L Xu M Maynard D M Yang

X Shi W Bryant S H Open mass spectrometry search algorithm J Proteome Res 2004 3

(5) 958-64

209 Zhang J Xin L Shan B Chen W Xie M Yuen D Zhang W Zhang Z Lajoie

G A Ma B PEAKS DB De Novo sequencing assisted database search for sensitive and

accurate peptide identification Mol Cell Proteomics 2011

210 Liu J Erassov A Halina P Canete M Nguyen D V Chung C Cagney G

Ignatchenko A Fong V Emili A Sequential interval motif search unrestricted database

surveys of global MSMS data sets for detection of putative post-translational modifications

Anal Chem 2008 80 (20) 7846-54

70

211 Beausoleil S A Villen J Gerber S A Rush J Gygi S P A probability-based

approach for high-throughput protein phosphorylation analysis and site localization Nat

Biotechnol 2006 24 (10) 1285-92

212 Han Y Ma B Zhang K SPIDER software for protein identification from sequence

tags with de novo sequencing error J Bioinform Comput Biol 2005 3 (3) 697-716

213 Singh S Springer M Steen J Kirschner M W Steen H FLEXIQuant a novel tool

for the absolute quantification of proteins and the simultaneous identification and quantification

of potentially modified peptides J Proteome Res 2009 8 (5) 2201-10

214 Searle B C Scaffold a bioinformatic tool for validating MSMS-based proteomic

studies Proteomics 10 (6) 1265-9

215 Herbst A McIlwain S Schmidt J J Aiken J M Page C D Li L Prion disease

diagnosis by proteomic profiling J Proteome Res 2009 8 (2) 1030-6

216 Menschaert G Vandekerckhove T T Baggerman G Schoofs L Luyten W Van

Criekinge W Peptidomics coming of age a review of contributions from a bioinformatics

angle J Proteome Res 2010 9 (5) 2051-61

217 Kumar C Mann M Bioinformatics analysis of mass spectrometry-based proteomics

data sets FEBS Lett 2009 583 (11) 1703-12

218 Craig R Cortens J P Beavis R C Open source system for analyzing validating and

storing protein identification data J Proteome Res 2004 3 (6) 1234-42

219 Jones P Cote R G Cho S Y Klie S Martens L Quinn A F Thorneycroft D

Hermjakob H PRIDE new developments and new datasets Nucleic Acids Res 2008 36

(Database issue) D878-83

220 Deutsch E W Lam H Aebersold R PeptideAtlas a resource for target selection for

emerging targeted proteomics workflows EMBO Rep 2008 9 (5) 429-34

221 Miliotis T Ali L Palm J E Lundqvist A J Ahnoff M Andersson T B

Hilgendorf C Development of a highly sensitive method using liquid chromatography-multiple

reaction monitoring to quantify membrane P-glycoprotein in biological matrices and relationship

to transport function Drug Metab Dispos 2011 39 (12) 2440-9

222 Xiang Y Koomen J M Evaluation of Direct Infusion-Multiple Reaction Monitoring

Mass Spectrometry for Quantification of Heat Shock Proteins Anal Chem 2012

223 Ossola R Schiess R Picotti P Rinner O Reiter L Aebersold R Biomarker

validation in blood specimens by selected reaction monitoring mass spectrometry of N-

glycosites Methods Mol Biol 2011 728 179-94

224 Liu H Lam L Dasgupta P K Expanding the linear dynamic range for multiple

reaction monitoring in quantitative liquid chromatography-tandem mass spectrometry utilizing

natural isotopologue transitions Talanta 2011 87 307-10

225 Belov M E Prasad S Prior D C Danielson W F 3rd Weitz K Ibrahim Y M

Smith R D Pulsed multiple reaction monitoring approach to enhancing sensitivity of a tandem

quadrupole mass spectrometer Anal Chem 2011 83 (6) 2162-71

226 Prentice R L Paczesny S Aragaki A Amon L M Chen L Pitteri S J

McIntosh M Wang P Buson Busald T Hsia J Jackson R D Rossouw J E Manson J

E Johnson K Eaton C Hanash S M Novel proteins associated with risk for coronary heart

disease or stroke among postmenopausal women identified by in-depth plasma proteome

profiling Genome Med 2 (7) 48

227 Gelman J S Fricker L D Hemopressin and other bioactive peptides from cytosolic

proteins are these non-classical neuropeptides AAPS J 2010 12 (3) 279-89

71

228 Gelman J S Sironi J Castro L M Ferro E S Fricker L D Hemopressins and

other hemoglobin-derived peptides in mouse brain comparison between brain blood and heart

peptidome and regulation in Cpefatfat mice J Neurochem 2010 113 (4) 871-80

229 Li L Garden R W Sweedler J V Single-cell MALDI a new tool for direct peptide

profiling Trends Biotechnol 2000 18 (4) 151-60

230 Altelaar A M Heck A J Trends in ultrasensitive proteomics Curr Opin Chem Biol

231 Bandura D R Baranov V I Ornatsky O I Antonov A Kinach R Lou X

Pavlov S Vorobiev S Dick J E Tanner S D Mass cytometry technique for real time

single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass

spectrometry Anal Chem 2009 81 (16) 6813-22

232 Trimpin S Inutan E D Herath T N McEwen C N Laserspray ionization a new

atmospheric pressure MALDI method for producing highly charged gas-phase ions of peptides

and proteins directly from solid solutions Mol Cell Proteomics 2010 9 (2) 362-7

233 McEwen C N Larsen B S Trimpin S Laserspray ionization on a commercial

atmospheric pressure-MALDI mass spectrometer ion source selecting singly or multiply

charged ions Anal Chem 2010 82 (12) 4998-5001

234 Wang B Lietz C B Inutan E D Leach S M Trimpin S Producing highly

charged ions without solvent using laserspray ionization a total solvent-free analysis approach at

atmospheric pressure Anal Chem 2011 83 (11) 4076-84

235 Inutan E D Richards A L Wager-Miller J Mackie K McEwen C N Trimpin

S Laserspray ionization a new method for protein analysis directly from tissue at atmospheric

pressure with ultrahigh mass resolution and electron transfer dissociation Mol Cell Proteomics

2010 10 (2) M110 000760

236 Wang H Liu J Cooks R G Ouyang Z Paper spray for direct analysis of complex

mixtures using mass spectrometry Angew Chem Int Ed Engl 49 (5) 877-80

237 Zhang Z Xu W Manicke N E Cooks R G Ouyang Z Silica coated paper

substrate for paper-spray analysis of therapeutic drugs in dried blood spots Anal Chem 84 (2)

931-8

238 Wang H Manicke N E Yang Q Zheng L Shi R Cooks R G Ouyang Z

Direct analysis of biological tissue by paper spray mass spectrometry Anal Chem 83 (4) 1197-

201

239 Liu J Wang H Cooks R G Ouyang Z Leaf spray direct chemical analysis of plant

material and living plants by mass spectrometry Anal Chem 83 (20) 7608-13

240 Liu J Wang H Manicke N E Lin J M Cooks R G Ouyang Z Development

characterization and application of paper spray ionization Anal Chem 82 (6) 2463-71

241 Yoo H J Wang N Zhuang S Song H Hakansson K Negative-ion electron

capture dissociation radical-driven fragmentation of charge-increased gaseous peptide anions J

Am Chem Soc 2011 133 (42) 16790-3

242 Tian Q Price N D Hood L Systems cancer medicine towards realization of

predictive preventive personalized and participatory (P4) medicine J Intern Med 271 (2) 111-

21

243 Hood L Friend S H Predictive personalized preventive participatory (P4) cancer

medicine Nat Rev Clin Oncol 2011 8 (3) 184-7

244 Tian Q Price N D Hood L Systems cancer medicine towards realization of

predictive preventive personalized and participatory (P4) medicine J Intern Med 2012 271

(2) 111-21

72

245 Belda-Iniesta C de Castro J Perona R Translational proteomics what can you do for

true patients J Proteome Res 2011 10 (1) 101-4

246 Schiess R Wollscheid B Aebersold R Targeted proteomic strategy for clinical

biomarker discovery Mol Oncol 2009 3 (1) 33-44

247 Hsu J L Huang S Y Chow N H Chen S H Stable-isotope dimethyl labeling for

quantitative proteomics Anal Chem 2003 75 (24) 6843-52

248 Schmidt A Kellermann J Lottspeich F A novel strategy for quantitative proteomics

using isotope-coded protein labels Proteomics 2005 5 (1) 4-15

249 Wang Y K Ma Z Quinn D F Fu E W Inverse 15

N-metabolic labelingmass

spectrometry for comparative proteomics and rapid identification of protein markerstargets

Rapid Commun Mass Spectrom 2002 16 (14) 1389-97

73

Chapter 3

Protein changes in immunodepleted cerebrospinal fluid from transgenic

mouse models of Alexander disease detected using mass spectrometry

Adapted from ldquoProtein changes in immunodepleted cerebrospinal fluid from transgenic mouse

models of Alexander disease detected using mass spectrometryrdquo Cunningham R Jany P

Messing A Li L Submitted

74

ABSTRACT

Cerebrospinal fluid (CSF) is a low protein content biological fluid with dynamic range

spanning at least nine orders of magnitude in protein content and is in direct contact with the

brain A modified IgY-14 immunodepletion treatment was performed to enhance analysis of the

low volumes of CSF that are obtainable from mice As a model system in which to test this

approach we utilized transgenic mice that over-express the intermediate filament glial fibrillary

acidic protein (GFAP) These mice are models for Alexander disease (AxD) a severe

leukodystrophy in humans From the CSF of control and transgenic mice we report the

identification of 266 proteins with relative quantification of 106 proteins Biological and

technical triplicates were performed to address animal variability as well as reproducibility in

mass spectrometric analysis Relative quantitation was performed using distributive normalized

spectral abundance factor (dNSAF) spectral counting analysis A panel of biomarker proteins

with significant changes in the CSF of GFAP transgenic mice has been identified with validation

from ELISA and microarray data demonstrating the utility of our methodology and providing

interesting targets for future investigations on the molecular and pathological aspects of AxD

75

INTRODUCTION

Alexander disease (AxD) is a fatal neurogenetic disorder caused by heterozygous point

mutations in the coding region of GFAP (encoding glial fibrillary acidic protein)1 The hallmark

diagnostic feature of this disease is the accumulation of astrocytic cytoplasmic inclusions known

as Rosenthal fibers containing GFAP Hsp27 αB-crystallin and other components2-5

Although

several potential treatment strategies6-8

are under investigation clinical trial design is hampered

by the absence of a standardized clinical scoring system or means to quantify lesions in MRI

that could serve to monitor severity and progression of disease One solution to this problem

would be the identification of biomarkers in readily sampled body fluids as indirect indicators of

disease

Cerebrospinal fluid (CSF) has a long history as a surrogate biopsy site for brain or spinal

cord in evaluating diseases of the central nervous system The protein composition of CSF is

well defined at least for the most abundant species of proteins and numerous studies exist that

characterize individual biomarkers or complex patterns of biomarkers in various diseases9 10

GFAP itself is present in CSF (albeit at much lower levels than in brain parenchyma) and in one

study of three Alexander disease patients its levels were markedly increased11

Whether an

increase in CSF GFAP will be a consistent finding in Alexander disease or whether other useful

biomarkers for this disease could be identified through an unbiased analysis of the CSF

proteome is not yet known

The rarity of Alexander disease makes analysis of human samples difficult However

mouse models exist that replicate key features of the disease such as formation of Rosenthal

fibers Unfortunately mouse CSF poses particular problems for proteomic studies and there is

76

an urgent need for technical improvements for dealing with this fluid For instance collection

from an adult mouse typically yields ~10 μL of CSF often with some contamination by blood12

To further complicate analysis CSF has an exceedingly low protein content (~04 μgμL) with

over 60 of the total protein content consisting of a single protein albumin13 14

A number of

techniques have been developed to remove albumin from biological samples including Cibacron

Blue15

IgG immunodepletion16

and IgY immunodepletion17-19

IgY which is avian in origin

offers reduced non-specific binding and increased avidity when compared to IgG antibodies from

rabbits goats and mice20-23

One widely used IgY cocktail is IgY-14 which contains fourteen

specific antibodies specific for albumin IgG transferrin fibrinogen α1-antitrypsin IgA IgM

α2-macroglobulin haptoglobin apolipoproteins A-I A-II and B complement C3 and α1-acid

glycoprotein Since existing protocols for IgY-14 depletion are optimized for use with large

volumes of serum new protocols must be developed to permit its use with the low volumes of a

low protein fluid represented by mouse CSF

Various improvements have also taken place in the field of proteomic analysis that could

facilitate analysis of mouse CSF Data dependent tandem mass spectrometry followed by

quantification of proteins is used in standard shotgun proteomics24-29

Several methods now exist

for introducing quantitation into mass spectrometry including stable isotope labeling30-32

isobaric tandem mass tags33 34

and spectral counting35 36

Spectral counting which is a

frequency measurement that uses MSMS counts of identified peptides as the metric to enable

protein quantitation is attractive because it is label-free and requires no additional sample

preparation Finally recent advances in spectral counting has produced a data refinement

strategy termed normalized spectral abundance factor (NSAF)37 38

and further developed into

distributive NSAF (dNSAF) to address issues with peptides shared by multiple proteins39

77

To identify potential biomarkers in AxD we report a novel scaled-down version of IgY

antibody depletion strategy to reduce the complexity and remove high abundance proteins in

mouse CSF samples The generated spectral counts data were then subjected to dNSAF natural

log data transformation and t-test analysis to determine which proteins differ in abundance when

comparing GFAP transgenics and controls with multiple biological and technical replicates

EXPERIMENTAL DETAILS

Chemicals

Acetonitrile (HPLC grade) urea (gt99) sodium chloride (997) and ammonium

bicarbonate (certified) were purchased from Fisher Scientific (Fair Lawn NJ) Deionized water

(182 MΩcm) was prepared with a Milli-Q Millipore system (Billerica MA) Optima LCMS

grade acetonitrile and water were purchased from Fisher Scientific (Fair Lawn NJ) DL-

Dithiothreitol (DTT) and sequencing grade modified trypsin were purchased from Promega

(Madison WI) Formic acid (ge98) was obtained from Fluka (Buchs Switzerland)

Iodoacetamide (IAA) tris(hydroxymethyl)aminomethane (Tris ge999 ) ammonium formate

(ge99995) glycine (ge985) and IgY-14 antibodies were purchased from Sigma-Aldrich

(Saint Louis MO)

Mice

Transgenic mice over-expressing the human GFAP gene (Tg737 line) were maintained

as hemizygotes in the FVBN background and genotyped via PCR on DNA prepared from tail

samples as described previously40

The mice were housed on a 14-10 light-dark cycle with ad

libitum access to food and water All procedures were conducted using protocols approved by

the UW-Madison IACUC

78

CSF collection

CSF was collected from mice as described previously12

Briefly mice were anesthetized

with avertin (400-600 mgkg ip) A midline sagittal incision was made over the dorsal aspect

of the hindbrain and three muscle layers carefully peeled back to expose the cisterna magna The

membrane covering the cisterna magna was pierced with a 30 gauge needle and CSF was

collected immediately using a flexible plastic pipette Approximately ten microliters of CSF was

collected per animal All samples used for MS analysis showed no visible contamination of

blood

Enzyme-linked immunosorbent assay (ELISA)

A sandwich ELISA was used to quantify GFAP8 Briefly a microtiter plate was coated

with a cocktail of monoclonal antibodies (Covance SMI-26R) Plates were blocked with 5

milk before addition of sample or standards diluted in PBS with Triton-X and BSA A rabbit

polyclonal antibody (DAKO Z334) was used to detect the GFAP followed by a peroxidase

conjugated anti-rabbit IgG antibody (Sigma A6154) secondary antibody The peroxidase activity

was detected with SuperSignal ELISA Pico Chemiluminescent Substrate (PIERCE) and

quantified with a GloRunner Microplate Luminometer Values below the biological limit of

detection (16ngL) were given the value 16ngL before multiplying by the dilution factor

Immunodepletion of abundant proteins

Currently there are no commercial immunodepletion products available for use with CSF

and to address the low protein content of this fluid (~04 microgmicroL) Therefore 100 microL of

purchased IgY-14 resin was coupled with a Pierce Spin Cup using a paper filter from Thermo

Scientific (Waltham MA) CSF samples from either transgenics or controls were first pooled to

100 microL and then added to 100 microL of 20 mM Tris-HCl 300 mM pH 74 (2x dilution buffer) and

79

allowed to mix with the custom IgY-14 depletion spin column on an end-over-end rotor for 30

minutes at 4oC The sample was then centrifuged at 04 rcf for 45 seconds with an Eppendorf

Centrifuge 5415D (Hamburg Germany) The antibodies were then washed with 50 microL 1x

dilution buffer vortexed for 5 minutes centrifuged at 04 rcf for 45 seconds and the flow through

was collected for tryptic digestion The antibodies were then stripped of the bound proteins with

four 025 mL washes of 01 M glycine pH 25 neutralized with two 025 mL washes of 01 M

Tris-HCl pH 80 and washed once with 025mL of 1x dilution buffer The immunodepletion

protocol was repeated for each transgenic or control pooled 100 microL sample (N=3)

Preparation of tryptic digests

The immunodepleted pooled mouse CSF samples (200 microL total volume) were

concentrated to 10 microL using a Thermo Scientific Savant SpeedVac (SVC100 Waltham MA)

To each sample 8 microL of 133 M urea and 1 microL of 050 M DTT were added and allowed to

incubate at 37oC for one hour After incubation 27 microL of 055 M IAA was added for

carboxymethylation and the sample was allowed to incubate for 15 minutes in the dark To

quench the IAA 1 microL of 050 M DTT was added and allowed to react for 10 minutes To

perform trypsin digestion 70 microL of 50 mM NH4HCO3 was then added along with 025 microg

trypsin which had previously been dissolved in 50 mM acetic acid at a concentration of 05

microgmicroL Digestion was performed overnight at 37oC and quenched by addition of 25 microL of 10

formic acid The tryptic peptides were then subjected to solid phase extraction using a Varian

Omix Tip C18 100 microL (Palo Alto CA) Peptides were eluted with 50 ACN in 01 formic

acid concentrated and reconstituted in 30 microL H2O in 01 formic acid

RP nanoLC separation

80

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN Injections consisted of 5 microL of prepared CSF tryptic peptides onto an Agilent

Technologies Zorbax 300 SB-C18 5 microm 5x03 mm trap cartridge (Santa Clara CA) at a flow

rate of 5 microLmin for 5 minutes at 95 A 5 B Separation was performed on a Waters 3 microm

Atlantis dC18 75 microm x 150 mm (Milford MA) using a gradient from 5 to 45 mobile phase B

at 250 nLmin over 90 minutes at room temperature Emitter tips were pulled from 75 microm inner

diameter 360 microm outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using

an in-house model P-2000 laser puller (Sutter Instrument Co Novato CA)

Mass spectrometry data acquisitions

An ion trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with an on-line nanospray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Eksigent nanoLC software (Dublin CA Version 30 Beta Build 080715) to MS acquisition for

all experiments Smart parameter setting (SPS) was set to 700 mz compound stability and trap

drive level was set at 100 Optimization of the nanospray source resulted in dry gas

temperature of 125oC dry gas flow of 40 Lmin capillary voltage of -1300 V end plate offset of

-500 V MSMS fragmentation amplitude of 10V and SmartFragmentation set at 30-300

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms one spectral average rolling average 2 acquisition

81

range of 300-1500 mz and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS

generation the ICC target was set to 300000 maximum accumulation time 5000 ms two

spectral averages acquisition range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required for spectral counting to prevent

loss of spectra Identification of peptides were performed using Mascot41

(Version 23 Matrix

Science London UK) Database searching was performed against SwissProt mus musculus

(house mouse) database (version 575) False positive analyses42

were calculated using an

automatic decoy option of Mascot Results from the Mascot results were reported using

Proteinscape 21 and technical replicates were combined and reported as a protein compilation

using ProteinExtractor (Bruker Daltonics Bremen Germany)

Mascot search parameters were as follows Allowed missed cleavages 2 enzyme

trypsin variable modifications carboxymethylation (C) and oxidation (M) peptide tolerance

plusmn12 Da maximum number of 13

C 1 MSMS tolerance plusmn05 Da instrument type ESI-Trap

Reported proteins required a minimum of one peptide with a Mascot score ge300 with bold red

characterization Spectral counts were determined from the number of MSMS spectra identified

from accepted proteins A bold red peptide combines a bold peptide which represents the first

query result from a submitted MSMS spectrum with the red peptide which indicates the top

peptide for the identified protein Requiring one bold red peptide assists in removal of

homologous redundant proteins and further improves protein results In addition requiring one

82

peptide to be identified by a score gt300 removes the ability for proteins to be identified by

multiple low Mascot scoring peptides

Each immunodepleted biological replicate had technical triplicates performed and the

technical triplicates were summed together by ProteinExtractor Peptide spectral counts were

then summed for each protein and subjected to dNSAF analysis Details for this method can be

found elsewhere37 39

but briefly peptide spectral counts are summed per protein (SpC) based on

unique peptides and a weighted distribution of any shared peptides with homologous proteins

ProteinScape removed 83 homologous proteins found in the current study to bring the total

number of proteins identified to 266 but some non-unique homologous peptides which are

shared by multiple proteins are still present in the resulting 266 remaining proteins To address

these non-unique homologous peptides distributive spectral counting was performed as

described elsewhere39

The dSpC is divided by the proteinrsquos length (L) and then divided by the

summation of the dSpCL from all proteins in the experiment (N) to produce each proteinrsquos

specific dNSAF value

N

i

i

kk

LdSpC

LdSpCdNSAF

1

)(

)()(

The resulting data were then transformed by taking the natural log of the dNSAF value The

means for each protein were calculated using the ln(dNSAF) from N=3 biological replicates and

the whole data set was subjected to the Shapiro-Wilk test for normalityGaussian distribution

performed on the software PAST (Version 198 University of Oslo Norway Osla) The

Shapiro-Wilk test was repeated multiple times to determine a non-zero value for zero spectral

83

counts A non-zero value is required to alleviate the errors of dividing by zero which was

experimentally determined to be 043 The Gaussian data were then subjected to the t-test to

identify statistically significant changes in protein expression

RESULTS AND DISCUSSION

General workflow

Individual CSF samples were manually inspected and samples were only selected that

showed no visual blood contamination Preliminary experiments showed that the maximum

degree of blood contamination estimated from counts of red blood cells in the CSF that was not

visible to the eye was less than 005 by total mass Approximately 10 individual mouse CSF

samples were pooled to achieve the desired 100 μL volume for a single biological replicate The

CSF samples were IgY immunodepleted followed by digestion with trypsin The resulting

digested samples were then de-salted concentrated reconstituted in 30 μL of 01 formic acid

and 5 μL was loaded onto a reversed phase nanoflow HPLC column subjected to a 90 minute

gradient and infused into the amaZon ETD ion trap mass spectrometer The workflow for

mouse CSF analysis is shown in Figure 1 The remainder of the desalted sample was saved for

technical replicates

Immunodepletion for CSF

Currently there are no immunodepletion techniques specifically designed for CSF

Nonetheless the protein profiles between CSF and serum are similar enough to use currently

available immunodepletion techniques designed for serum as a starting point The smallest

commercially available IgY-14 immunodepletion kit is for 15 μL of serum which is equivalent in

protein content to ~15 mL of CSF Therefore for 100 μL CSF one fifteenth of the total IgY-14

84

beads would be used for the equivalent immunodepletion which is 66 μL of proprietary bead

slurry The potential for irreversible binding of abundant proteins to their respective IgY

antibody even after an extra stripping wash and low amounts of total beads made using 66 μL

of IgY-14 bead slurry unrealistic In practice almost doubling the amount of IgY-14 beads (100

μL) used was not sufficient and resulted in albumin and transferrin peptides to be observed in

high abundance (data not shown) The most important protein to immunodeplete is albumin and

it has been reported to be a greater percentage of total CSF protein content (~60) than serum

(~49) in humans14

The difference in albumin percentage supports the results that proprietary

blends of immunodepletion beads for high abundance proteins such as albumin cannot be

scaled down on a strict protein scale and further modifications to the serum immunodepletion

protocol need to be made

Since IgY-14 beads were developed for use with serum all of its protocols need to be

taken into account to modify the protocol for CSF Serum samples should be diluted fifty times

before mixing with the IgY-14 beads but CSF protein concentration is at least one hundred times

lower than serum Therefore CSF is below half the recommended diluted protein concentration

for IgY immunodepletion Consequently multiple steps have been devised to address this

limitation First the binding time between the proteins targeted for removal from the CSF and

IgY-14 beads was doubled during the end-over-end rotor depletion step from the recommended

15 minutes to 30 minutes Second to prevent non-specific protein binding to the antibodies the

CSF samples were diluted with an equal amount (100 μL) of 2X dilution buffer The dilution

buffer of NaCl and Tris-HCl was needed to assist in reducing non-specific binding of proteins to

the 14 antibodies and ensuring the sample is held at physiological pH In addition to these

modifications a doubling of the starting IgY-14 bead slurry to 200 μL provided the desired

85

results Overall this modified protocol results in effective depletion of CSF abundant proteins

using only one-fifth of the antibodies provided by the smallest commercially available platform

Data Analysis

Spectral counting technique for relative quantitation provides numerous benefits for the

study of mouse CSF due to its label-free advantage In contrast isotopic labeling method often

involves additional sample processing that could cause sample loss which is highly undesirable

for low protein content and low volume samples Labeling methods also require a mixing of two

sets of isotopically labeled samples which would effectively increase the sample complexity and

reduce the amount of sample that can be loaded onto the nanoLC column by half In addition

more than two sets of samples can be compared by label-free methods The use of label-free

spectral counting method does not lead to an increase in sample complexity or interference in

quantitation from peptides in the mz window selected for tandem MS Using spectral counting

for relative quantitation however is dependent on reproducible HPLC separation and careful

mass spectrometry operation to minimize technical variability during the study To address

concerns of analytical reliability and run to run deviations base peak chromatograms from two

transgenic IgY-14 immunodepleted biological replicates including two technical replicates of

each were shown to be highly reproducible (Figure 2)

Each biological sample was analyzed in triplicate with the same protocols on the amaZon

ETD with three control and three transgenic samples From the three technical replicates for

each biological replicate the spectral counts of the peptides for the proteins identified were

summed The results from these mouse CSF biological triplicates are shown in Figure 3A for

GFAP overexpressor and Figure 3B for control The summation of spectral counts for each

biological replicate was performed to remove the inherent bias related to data dependent analysis

86

for protein identification One concern in grouping technical replicates is a potential loss of

information regarding analytical variability Figure 4 provides a graphical representation of

variability of technical replicates illustrating the standard deviation of technical replicates with

error bars Figure 4 shows that a statistically changed protein creatine kinase M (A) and an

unchanged protein kininogen-1 (B) have similar variability within (technical replicates) and

between samples (biological replicates) for each protein In addition Figure 4B illustrates that

even with the variability of kininogen-1 the resulting mean shown by the dashed line of control

and transgenic samples were almost equal whereas Figure 4A shows significantly different

expression level of creatine kinase M Performing replicate analysis of each biological sample

(n=3) helps correct for systematic errors deriving from sample handling and pooling of samples

helps reduce random error during the CSF sample collection process

Protein Identification and Spectral Counting Analysis

The data for dNSAF analysis like any mass spectrometry proteomics experiment

requires multiple layers of verification to ensure reliable data Our initial protein identifications

were subjected to a database search using a decoy database from Mascot which resulted in an

average false positive rate below 1 for all the experimental data collected Representative

MSMS spectra between control and GFAP transgenic samples are illustrated in Figure 5

Overall 266 proteins were identified in a combination of control and transgenic samples

(Supplemental Table 1) A total of 349 proteins were initially identified but 83 proteins were

isoforms of previously identified proteins and automatically excluded by ProteinExtractor The

next level of quality control was to only include ln(dNSAF) values from proteins identified by 2

or more unique peptides having a Mascot score of ge300 and observed in two out of three

biological replicates These selection parameters resulted in 106 proteins remaining after

87

dNSAF analysis (Supplemental Table 2) The resulting spectral counts were then subjected to

dSpC in order to account and correct for the systematic error of peptides shared by multiple

proteins (Supplemental Table 3)

It is inevitable in large scale and complex proteomics experiments that some proteins will

be seen in some samples and not others In addition when controls were compared to transgenic

samples as shown in Figure 3C 127 proteins were unique to either control or GFAP transgenic

mice and 139 proteins were seen in both samples From dNSAF equation if the spectral count

is zero the numerator is zero and the value will not be normalized between runs In order to

circumvent the zero spectral counts in dNSAF analysis zero spectral counts must be replaced by

an experimentally determined non-zero value determined to be 043 The 043 spectral counts

for zero spectral counts was calculated by serial Shapiro-Wilk tests to determine the lowest value

(043) for zero spectral counts and maintain a Gaussian distribution for all data sets The 043

value for zero spectral counts in the current study was higher than the 016 reported value for

zero spectral counts in the original NSAF spectral counting study37

Our study may have a

higher zero spectral count value than the previous study because the spectral counting data were

an addition of three technical replicates and three times 016 is close to 043 The normalized

Gaussian data were then subjected to t-test analysis and P-values below 005 are considered as

statistically significant and are presented in Table 1 The proteins with significant up or down

regulation from Table 1 can be further evaluated as how close significant proteins were to a p-

value of 005 such as ganglioside GM2 activator serine protease inhibitor A3N and collagen

alpha-2(I) chain having t-test scores of 0045 0047 and 0043 respectively Proteins exhibiting

a P-value close to 005 were more likely to be highly variable proteins or have smaller fold

changes between control and transgenic samples and thus provide less biological relevancy to

88

future studies The modest change in antithrobin-III which is reduced by 14-fold in transgenic

is included due a low pooled standard deviation in spectral counts

Spectral counting has been analyzed with fold changes derived directly from the average

spectral counts from the technical replicates and then the average of the three biological

replicates We decided to perform additional analysis using fold changes to dig deeper into

proteins that statistically support the null hypothesis from the dNSAF analysis To only pull out

highly confident protein identifications we used the same strict cut-off of two unique peptides

identified per protein as in dNSAF analysis We only accepted proteins with greater than three-

fold change in total spectral counts listed in Table 2 Two proteins contactin-1 (cntn1) and

cathepsin B (CB) illustrate the potential biomarkers missed by dNSAF analysis Cntn1 had zero

spectral count in the transgenic sample and had an average spectral count of 41 in control

samples The lack of any spectral counts in one biological control for cntn1 resulted in a large

standard deviation in the ln(dNSAF) means for the control which resulted in the t-test supporting

the null hypothesis Another example is CB which was detected by numerous spectral counts in

every GFAP transgenic biological replicate with an average spectral count of 97 (Table 2) The

presence of CB in one biological control sample (23 average spectral counts) resulted in a high

standard deviation in the mean of the control samples These examples exhibit a limitation of

dNSAF analysis which could cause a loss of potentially useful information

Previously Identified Proteins with Expression Changes

Previously three proteins have been described as increased in CSF from individual(s)

suffering from AxD In one study a single patientrsquos CSF was found to have elevated levels of

αβ-crystallin and HSP2744

In a second study three patients were reported to have elevated

levels of GFAP with concentrations from 4760 ngL to 30000 ngL (compared to lt175 ngL for

89

controls)11

GFAP was detected in our current study however the other two proteins were not

detected One possibility is that αB-crystallin and Hsp-27 levels in mouse CSF are too low for

detection by MS analysis In addition while the transgenic mice display the hallmark

pathological feature of AxD in the form of Rosenthal fibers they do not have any evident

leukodystrophy and thus may not display the full range of changes in CSF as might be found in

human patients

Creatine Kinase M

Creatine kinase M (M-CK) is a 43 kDa protein whose role is to reversibly catalyze

phosphate transfer between ATP and energy storage compounds M-CK has been primarily

found in muscle tissue and for humans creatine kinase M is diagnostically analyzed in the blood

for myocardial infarction (heart attack) In addition M-CK is also found in Purkinje neurons of

the cerebellum45 46

A related protein creatine kinase B (B-CK) also exhibited an apparent 21

fold increase in transgenic CSF over control but this difference was not statistically different

B-CK concentration is known to be elevated in CSF following head trauma47

or cerebral

infarction48

but decreased in astrocytes in individuals affected by multiple sclerosis49

Cathepsin

The data showed multiple cathepsins were up regulated in the CSF of transgenic mice

when compared to control mice The up regulated cathepsins were S L1 and B isoforms which

are all cysteine proteases Cathepsin S (CS) was never observed in control samples but

observed with an average of 73 spectral counts per analysis Cathepsin L1 (CL1) was up

regulated by 94 times in transgenic mice These two cathepsins exhibited significant changes

using dNSAF and t-test statistics as shown in Table 1 and Cathepsin B (CB) showed a 42 fold

increase in transgenic CSF as shown in Table 2

90

Cathepsins regulate apoptosis in cells50

which is the major mechanism for elimination of

cells deemed by the organism to be dangerous damaged or expendable CL and CB are

redundant in the system as thiol proteases and inhibition of CB by CA-047 causes a diminished

apoptosis response in multiple cell lines51

Intriguingly increased levels of CB or CL are

correlated with poor prognosis for cancer patients and shorter disease-free intervals It is

believed that these proteases degrade the extracellular membrane which allows tumor cells to

invade adjacent tissue and metastasize52

With regards to AxD the up regulation of these

cathepsins may be indicative of the bodyrsquos natural defense and response to Rosenthal fibers

Thus stimulation of these cathepsins may provide a further protective stress response but the

positive correlation between these proteases and cancer highlights the multiple roles of these

proteins in pathological response Alternatively it has been shown that increased CB is involved

with the tumor necrosis factor α (TNFα) induced apoptosis cascade53

The activation of the

TNFα could produce oligodendrocyte toxicity54

with the expression of TNFα being elevated in

tissue samples from mouse models and AxD patients55

The potential for a positive or a negative

effect in increasing or decreasing cathepsins warrant future research with cathepsins and AxD

Contactin-1

Contactin-1 (Cntn1) is a 133 kDa cell surface protein that is highly glycosylated and

belongs to a family of immunoglobulin domain-containing cell adhesion molecues56

Table 2

shows that Cntn1 is down regulated in transgenic mice since spectral counts were only observed

in control mice Cntn1 is expressed in neurons and oligodendrocytes and higher levels were

observed during brain development57

In addition Cntn1 leads to activation of Notch1 which

mediates differentiation and maturation of oligodendrocyte precursor cells (OPC) Although the

mechanism leading to a reduction of Cntn1 in CSF is not known it may reflect a change in

91

astrocyte interactions with either neurons or oligodendrocytes to alter their expression of this

protein

Validation of putative biomarkers and MS proteomics data using ELISA and RNA

microarray data

To further validate the relative protein expression data obtained via MS-based spectral

counting techniques orthogonal immunological and molecular biological approaches have been

examined As GFAP was shown to be elevated in the CSF of transgenic animals we used a

well-defined GFAP ELISA protocol to test its CSF concentration CSF from 8 week old male

mice was collected from both transgenic and control animals Five samples of transgenic CSF

was prepared by pooling four to eight animals for the GFAP ELISA In the case of controls

each sample represents a single animal GFAP concentrations observed by both the MS and

ELISA showed significant increases of GFAP in the CSF when comparing transgenic to control

animals

Another validation of MS spectral counts is observed in a microarray analysis performed

on transgenic mouse olfactory bulb tissue 55

In this paper nine of the proteins found by MS

showed similar gene expression in the microarray (Table 3) It is not surprising that all the genes

observed in the microarray are not the same as the proteins observed by MS analysis Gene

expression and protein synthesis and expression are not always correlated but the similarities

and overlapping trends observed with these two assays are encouraging As shown in Table 3

gene expression analysis also revealed the up regulation of several cathepsin proteins GFAP

and down regulation of contactin-1 in CSF collected from transgenic mice corroborating the

MS-based proteomics results

92

CONCLUSIONS

In this study we have produced a panel of proteins with significant up or down regulation

in the CSF of transgenic GFAP overexpressor mice This mouse model for AxD was consistent

with the previous studies showing elevation of GFAP in CSF The development of a modified

IgY-14 immunodepletion technique for low amounts of CSF was presented This improved

protocol is useful for future investigations to deal with the unique challenges of mouse CSF

analysis Modified proteomics protocols were employed to profile mouse CSF with biological

and technical triplicates addressing the variability and providing quantitation with dNSAF

spectral counting Validation of the MS-based proteomics data were performed using both

ELISA and RNA microarray data to provide further confidence in the changes in the putative

protein biomarkers This study presents three classes of interesting targets for future study in

AxD including CK-MCK-B cathepsin S L1 and B isoforms and contactin-1

93

REFERENCES

1 Brenner M Johnson A B Boespflug-Tanguy O Rodriguez D Goldman J E

Messing A Mutations in GFAP encoding glial fibrillary acidic protein are associated with

Alexander disease Nat Genet 2001 27 (1) 117-20

2 Herndon R M Rubinstein L J Freeman J M Mathieson G Light and electron

microscopic observations on Rosenthal fibers in Alexanders disease and in multiple sclerosis J

Neuropathol Exp Neurol 1970 29 (4) 524-51

3 Alexander W S Progressive fibrinoid degeneration of fibrillary astrocytes associated

with mental retardation in a hydrocephalic infant Brain 1949 72 (3) 373-81 3 pl

4 Iwaki T Kume-Iwaki A Liem R K Goldman J E Alpha B-crystallin is expressed

in non-lenticular tissues and accumulates in Alexanders disease brain Cell 1989 57 (1) 71-8

5 Head M W Goldman J E Small heat shock proteins the cytoskeleton and inclusion

body formation Neuropathol Appl Neurobiol 2000 26 (4) 304-12

6 Messing A Daniels C M Hagemann T L Strategies for treatment in alexander

disease Neurotherapeutics 7 (4) 507-15

7 Tang G Yue Z Talloczy Z Hagemann T Cho W Messing A Sulzer D L

Goldman J E Autophagy induced by Alexander disease-mutant GFAP accumulation is

regulated by p38MAPK and mTOR signaling pathways Hum Mol Genet 2008 17 (11) 1540-

55

8 Hagemann T L Boelens W C Wawrousek E F Messing A Suppression of GFAP

toxicity by alphaB-crystallin in mouse models of Alexander disease Hum Mol Genet 2009 18

(7) 1190-9

9 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C

Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome

Res 2008 7 (1) 386-99

10 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from

patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma

biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 878 (22) 2003-12

11 Kyllerman M Rosengren L Wiklund L M Holmberg E Increased levels of GFAP

in the cerebrospinal fluid in three subtypes of genetically confirmed Alexander disease

Neuropediatrics 2005 36 (5) 319-23

12 DeMattos R B Bales K R Parsadanian M ODell M A Foss E M Paul S M

Holtzman D M Plaque-associated disruption of CSF and plasma amyloid-beta (Abeta)

equilibrium in a mouse model of Alzheimers disease J Neurochem 2002 81 (2) 229-36

13 Wong M Schlaggar B L Buller R S Storch G A Landt M Cerebrospinal fluid

protein concentration in pediatric patients defining clinically relevant reference values Arch

Pediatr Adolesc Med 2000 154 (8) 827-31

14 Roche S Gabelle A Lehmann S Clinical proteomics of the cerebrospinal fluid

Towards the discovery of new biomarkers PROTEOMICS ndash Clinical Applications 2008 2 (3)

428-436

15 Li C Lee K H Affinity depletion of albumin from human cerebrospinal fluid using

Cibacron-blue-3G-A-derivatized photopatterned copolymer in a microfluidic device Anal

Biochem 2004 333 (2) 381-8

94

16 Maccarrone G Milfay D Birg I Rosenhagen M Holsboer F Grimm R Bailey

J Zolotarjova N Turck C W Mining the human cerebrospinal fluid proteome by

immunodepletion and shotgun mass spectrometry ELECTROPHORESIS 2004 25 (14) 2402-

2412

17 Huang L Harvie G Feitelson J S Gramatikoff K Herold D A Allen D L

Amunngama R Hagler R A Pisano M R Zhang W W Fang X Immunoaffinity

separation of plasma proteins by IgY microbeads meeting the needs of proteomic sample

preparation and analysis Proteomics 2005 5 (13) 3314-28

18 Rajic A Stehmann C Autelitano D J Vrkic A K Hosking C G Rice G E Ilag

L L Protein depletion using IgY from chickens immunised with human protein cocktails Prep

Biochem Biotechnol 2009 39 (3) 221-47

19 Huang L Fang X Immunoaffinity fractionation of plasma proteins by chicken IgY

antibodies Methods Mol Biol 2008 425 41-51

20 Greunke K Braren I Alpers I Blank S Sodenkamp J Bredehorst R Spillner E

Recombinant IgY for improvement of immunoglobulin-based analytical applications Clin

Biochem 2008 41 (14-15) 1237-44

21 Xiao Y Gao X Taratula O Treado S Urbas A Holbrook R D Cavicchi R E

Avedisian C T Mitra S Savla R Wagner P D Srivastava S He H Anti-HER2 IgY

antibody-functionalized single-walled carbon nanotubes for detection and selective destruction

of breast cancer cells BMC Cancer 2009 9 351

22 Liu T Qian W J Mottaz H M Gritsenko M A Norbeck A D Moore R J

Purvine S O Camp D G 2nd Smith R D Evaluation of multiprotein immunoaffinity

subtraction for plasma proteomics and candidate biomarker discovery using mass spectrometry

Mol Cell Proteomics 2006 5 (11) 2167-74

23 Hinerfeld D Innamorati D Pirro J Tam S W SerumPlasma depletion with

chicken immunoglobulin Y antibodies for proteomic analysis from multiple Mammalian species

J Biomol Tech 2004 15 (3) 184-90

24 Metz T O Qian W J Jacobs J M Gritsenko M A Moore R J Polpitiya A D

Monroe M E Camp D G 2nd Mueller P W Smith R D Application of proteomics in

the discovery of candidate protein biomarkers in a diabetes autoantibody standardization

program sample subset J Proteome Res 2008 7 (2) 698-707

25 Ru Q C Zhu L A Silberman J Shriver C D Label-free semiquantitative peptide

feature profiling of human breast cancer and breast disease sera via two-dimensional liquid

chromatography-mass spectrometry Mol Cell Proteomics 2006 5 (6) 1095-104

26 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S

Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-

dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of

Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66

27 Rao P V Reddy A P Lu X Dasari S Krishnaprasad A Biggs E Roberts C T

Nagalla S R Proteomic identification of salivary biomarkers of type-2 diabetes J Proteome

Res 2009 8 (1) 239-45

28 Yu K H Barry C G Austin D Busch C M Sangar V Rustgi A K Blair I A

Stable isotope dilution multidimensional liquid chromatography-tandem mass spectrometry for

pancreatic cancer serum biomarker discovery J Proteome Res 2009 8 (3) 1565-76

29 Aebersold R Mann M Mass spectrometry-based proteomics Nature 2003 422

(6928) 198-207

95

30 Ong S E Blagoev B Kratchmarova I Kristensen D B Steen H Pandey A

Mann M Stable isotope labeling by amino acids in cell culture SILAC as a simple and

accurate approach to expression proteomics Mol Cell Proteomics 2002 1 (5) 376-86

31 Hsu J L Huang S Y Chow N H Chen S H Stable-isotope dimethyl labeling for

quantitative proteomics Anal Chem 2003 75 (24) 6843-52

32 Liu H Sadygov R G Yates J R 3rd A model for random sampling and estimation

of relative protein abundance in shotgun proteomics Anal Chem 2004 76 (14) 4193-201

33 Xiang F Ye H Chen R Fu Q Li L NN-dimethyl leucines as novel isobaric

tandem mass tags for quantitative proteomics and peptidomics Anal Chem 82 (7) 2817-25

34 Ross P L Huang Y N Marchese J N Williamson B Parker K Hattan S

Khainovski N Pillai S Dey S Daniels S Purkayastha S Juhasz P Martin S Bartlet-

Jones M He F Jacobson A Pappin D J Multiplexed protein quantitation in

Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents Mol Cell Proteomics

2004 3 (12) 1154-69

35 Zybailov B Coleman M K Florens L Washburn M P Correlation of relative

abundance ratios derived from peptide ion chromatograms and spectrum counting for

quantitative proteomic analysis using stable isotope labeling Anal Chem 2005 77 (19) 6218-

24

36 Old W M Meyer-Arendt K Aveline-Wolf L Pierce K G Mendoza A Sevinsky

J R Resing K A Ahn N G Comparison of label-free methods for quantifying human

proteins by shotgun proteomics Mol Cell Proteomics 2005 4 (10) 1487-502

37 Zybailov B Mosley A L Sardiu M E Coleman M K Florens L Washburn M

P Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae J

Proteome Res 2006 5 (9) 2339-47

38 Mosley A L Florens L Wen Z Washburn M P A label free quantitative

proteomic analysis of the Saccharomyces cerevisiae nucleus J Proteomics 2009 72 (1) 110-20

39 Zhang Y Wen Z Washburn M P Florens L Refinements to label free proteome

quantitation how to deal with peptides shared by multiple proteins Anal Chem 82 (6) 2272-81

40 Messing A Head M W Galles K Galbreath E J Goldman J E Brenner M

Fatal encephalopathy with astrocyte inclusions in GFAP transgenic mice Am J Pathol 1998

152 (2) 391-8

41 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

42 Elias J E Gygi S P Target-decoy search strategy for increased confidence in large-

scale protein identifications by mass spectrometry Nat Methods 2007 4 (3) 207-14

43 You J S Gelfanova V Knierman M D Witzmann F A Wang M Hale J E The

impact of blood contamination on the proteome of cerebrospinal fluid Proteomics 2005 5 (1)

290-6

44 Takanashi J Sugita K Tanabe Y Niimi H Adolescent case of Alexander disease

MR imaging and MR spectroscopy Pediatr Neurol 1998 18 (1) 67-70

45 Hemmer W Wallimann T Functional Aspects of Creatine Kinase in Brain

Developmental Neuroscience 1993 15 (3-5) 249-260

46 Hemmer W Zanolla E Furter-Graves E M Eppenberger H M Wallimann T

Creatine kinase isoenzymes in chicken cerebellum specific localization of brain-type creatine

96

kinase in Bergmann glial cells and muscle-type creatine kinase in Purkinje neurons Eur J

Neurosci 1994 6 (4) 538-49

47 Nordby H K Tveit B Ruud I Creatine kinase and lactate dehydrogenase in the

cerebrospinal fluid in patients with head injuries Acta Neurochirurgica 1975 32 (3) 209-217

48 Bell R D Alexander G M Nguyen T Albin M S Quantification of cerebral

infarct size by creatine kinase BB isoenzyme Stroke 1986 17 (2) 254-60

49 Steen C Wilczak N Hoogduin J M Koch M De Keyser J Reduced Creatine

Kinase B Activity in Multiple Sclerosis Normal Appearing White Matter PLoS ONE 5 (5)

e10811

50 Chwieralski C Welte T Buumlhling F Cathepsin-regulated apoptosis Apoptosis 2006

11 (2) 143-149

51 Droga-Mazovec G Bojic L Petelin A Ivanova S Romih R Repnik U Salvesen

G S Stoka V Turk V Turk B Cysteine cathepsins trigger caspase-dependent cell death

through cleavage of bid and antiapoptotic Bcl-2 homologues J Biol Chem 2008 283 (27)

19140-50

52 Duffy M J Proteases as prognostic markers in cancer Clin Cancer Res 1996 2 (4)

613-8

53 Guicciardi M E Deussing J Miyoshi H Bronk S F Svingen P A Peters C

Kaufmann S H Gores G J Cathepsin B contributes to TNF-alpha-mediated hepatocyte

apoptosis by promoting mitochondrial release of cytochrome c J Clin Invest 2000 106 (9)

1127-37

54 Butt A M Jenkins H G Morphological changes in oligodendrocytes in the intact

mouse optic nerve following intravitreal injection of tumour necrosis factor J Neuroimmunol

1994 51 (1) 27-33

55 Hagemann T L Gaeta S A Smith M A Johnson D A Johnson J A Messing

A Gene expression analysis in mice with elevated glial fibrillary acidic protein and Rosenthal

fibers reveals a stress response followed by glial activation and neuronal dysfunction Hum Mol

Genet 2005 14 (16) 2443-58

56 Falk J Bonnon C Girault J A Faivre-Sarrailh C F3contactin a neuronal cell

adhesion molecule implicated in axogenesis and myelination Biol Cell 2002 94 (6) 327-34

57 Eckerich C Zapf S Ulbricht U Muumlller S Fillbrandt R Westphal M Lamszus

K Contactin is expressed in human astrocytic gliomas and mediates repulsive effects Glia

2006 53 (1) 1-12

97

Table 1 Statistically changed proteins between transgenic and control mouse CSF using

dNSAF analysis

Accession Protein Pa SC

b Fold

Changec

Control

dSpCd

Transgenic

dSpCd

KCRM_MOUSE Creatine kinase M-type 0034 425 30 182 541

HEXB_MOUSE Βeta-hexosaminidase subunit β 0012 183 uarr 0 59

CMGA_MOUSE Chromogranin-A 000078 153 darr 44 0

ANT3_MOUSE Antithrombin-III 0017 176 -14 67 47

SAP3_MOUSE Ganglioside GM2 activator 0045 415 darr 28 0

SPA3N_MOUSE Serine protease inhibitor A3N 0047 170 42 10 42

CO1A2_MOUSE Collagen alpha-2(I) chain 0043 56 darr 19 0

BLVRB_MOUSE Flavin reductase 00054 204 uarr 0 12

CATS_MOUSE Cathepsin S 00032 232 uarr 0 73

GFAP_MOUSE Glial fibrillary acidic protein 0021 107 uarr 0 21

RET4_MOUSE Retinol-binding protein 4 0040 189 darr 13 0

CBPE_MOUSE Carboxypeptidase E 0043 139 darr 11 0

CATL1_MOUSE Cathepsin L1 0015 87 94 02 19

The statistics are performed using the t-test from the ln(dNSAF) Gaussian data

a P p-value of the t-test where the null hypothesis states that there was no change in expression between

control and transgenic GFAP overexpressor b SC percentage of sequence coverage of the listed protein from

sequenced tryptic peptides c Fold Change positive values are indicative of a fold change for transgenic CSF

negative values are fold changes for control CSF (ie down-regulated in transgenic CSF) uarr indicates the protein

was only detected in transgenic CSF and darr indicates the protein was only detected in control CSF d dSpC

distributive spectral counts which represent the average spectral counts observed per run analysis on the mass

spectrometer and corrected using distributive analysis for peptides shared by more than one protein

98

Table 2 Proteins showing greater than three-fold changes with at least two unique

peptides identified for each protein

Accession Protein SC ()a Fold

Change b

Control

dSpC c

Transgenic

dSpC c

MUG1_MOUSE Murinoglobulin-1 precursor 147 42 155 37

CO4B_MOUSE Complement C4-B 113 54 22 118

PRDX6_MOUSE Peroxiredoxin-6 576 46 14 64

CNTN1_MOUSE Contactin-1 65 darr 41 0

CATB_MOUSE Cathepsin B 263 42 23 97

CAH3_MOUSE Carbonic anhydrase 3 396 76 11 84

APOH_MOUSE Beta-2-glycoprotein 1 128 44 44 1

CSF1R_MOUSE Macrophage colony-stimulating

factor 1 receptor

80 44 14 62

PGK1_MOUSE Phosphoglycerate kinase 1 355 36 17 61

NDKB_MOUSE Nucleoside diphosphate kinase B 408 34 13 44

FHL1_MOUSE

Four and a half LIM domains

protein 1 243 39 13 51

NELL2_MOUSE

Protein kinase C-binding protein

NELL2 45 -43 13 03

MDHM_MOUSE

Malate dehydrogenase

mitochondrial 385 41 12 49

CSF1R_MOUSE Macrophage colony-stimulating

factor 1 receptor

80 44 14 62

a SC percentage of sequence coverage of the listed protein from sequenced tryptic peptides b Fold

Change positive values are indicative of a fold change for transgenic CSF negative values are fold changes for

control CSF and darr indicates the protein was only detected in control CSF c dSpC distributive spectral counts

which represent the average spectral counts observed per run analysis on the mass spectrometer and corrected using

distributive analysis for peptides shared by more than one protein

99

Table 3 Validation of changes in proteins revealed by MS-based spectral counting

consistent with previously published microarray data

Consistent changes in RNA and proteomic data

uarr regulated in transgenic darr regulated in transgenic

Cathepsin S Contactin-1

Cathepsin B Carboxypeptidase E

Cathepsin L1

Peroxiredoxin-6

Complement C4-B

Glial fibrillary acidic protein

Serine protease inhibitor A3N

Note Validation of putative biomarkers from the current proteomics dataset by previously

published RNA microarray data55

Both up and down regulated proteins were consistent with the

RNA microarray data

_

100

___________________________________________

SUPPLEMENTAL INFORMATION (Available upon request)

Table S1 Compilation list of proteins identified from all the control and transgenic biological

replicates

Table S2 Distributive spectral counting calculations performed for proteins observed to share

identified peptides

Table S3 Proteins with dNSAF spectral counting performed including t-test analysis for a

comparison between transgenic and control CSF

101

FIGURE LEGENDS

Figure 1 The general workflow indicating the major steps involved in sample collection sample

processing mass spectrometric data acquisition and analysis of mouse CSF samples

Figure 2 Assessment of run to run variability of the base peak chromatograms within and

between two biological and technical replicates The peak profile and intensity scale is

consistent between the four chromatograms The four panels show two biological replicates (Tg

4 and Tg5) with two technical replicates for each biological sample

Figure 3 A Venn-diagram of the biological triplicates (1 2 and 3) of transgenic mouse

CSF showing 78 proteins identified in all three experiments B Venn-diagram of the biological

triplicates (1 2 and 3) of control mouse CSF having 61 proteins identified by all three

replicates C The overlap between control and transgenic CSF proteomic analysis showing 139

proteins identified by both groups and 73 and 54 uniquely identified by respective groups

Figure 4 Assessment of technical replicate variability between biological replicates The error

bars in both A and B are the standard deviation derived from the technical triplicates for each

biological replicate Panel A shows creatine kinase M having more or equal variability in the

biological triplicates than each technical triplicate The means of the biological triplicates are

illustrated by the dashed line Panel B represents kininogen-1 which is unchanged between

control (Ctl) and transgenic (Tg) samples The biological variability coupled with the technical

replicates provides a barely noticeable difference in the pooled mean between control and

102

transgenic spectral counts The difference in means is contrasted with the three fold change

observed from creatine kinase M (A)

Figure 5 MSMS profile of the tryptic peptide LSVEALNSLTGEFK from creatine kinase M

(A) The top MSMS is from a control sample with an elution at 463 minutes (B) The bottom

MSMS is from the GFAP transgenic sample with an elution at 46 minutes These tandem MS

spectra show instrument reliability and consistent fragmentation patterns which are necessary for

spectral counting analysis

Figure 6 Validation of MS spectral counts using a GFAP ELISA GFAP content (ngL)

measured within mouse CSF from both transgenic and control animals The data represents the

average with standard deviation (n=5 each sample represents pooling of 1 to 8 animals) The

statistics are performed using a student t-test plt00001

103

Figure 1

104

Figure 2

105

Figure3

106

Figure 4

107

Figure 5

108

Figure 6

Ctl Tg

100

1000

10000

100000

Mouse CSF Sample

GF

AP

(n

gL

)

109

Table of Contents Summary

Cerebrospinal fluid (CSF) was obtained from transgenic mouse models of Alexander disease as

well as healthy controls We immunodepleted pooled CSF from mice by a modified IgY-14

protocol Shotgun proteomics utilizing trypsin digestion 1-D nanoRP separation CID tandem

mass spectrometry analysis Mascot database searching and relative quantitation via distributive

normalized spectral abundance factor resulted in the identification of 266 proteins and 27

putative biomarkers

110

Chapter 4

Genomic and proteomic profiling of rat adapted scrapie

Adapted from ldquoGenomic and proteomic profiling of rat adapted scrapierdquo Herbst A

Cunningham R Wang J Wellner D Ma D Li L Aiken J In preparation

111

Abstract

A novel model of prion disease using rats called Rat Adapted Scrapie (RAS) was

developed with the genomics from brain and proteomics from cerebrospinal fluid (CSF) profiled

The CSF proteome from control and RAS (biological N=5 technical replicates N=3) were

digested and separated using one dimensional reversed-phase nanoLC coupled to data-

dependent tandem MS acquisition on an ion trap mass spectrometer In total 512 proteins 167

non-redundant protein groups and 1032 unique peptides were identified with a 1 false

discovery rate (FDR) Of the proteins identified 21 were statistically up-regulated (plt005) and

7 statistically down-regulated in the RAS CSF We also identified 1048 genes which were

differentially regulated in rat prion disease and upon mapping these changes to mouse gene

expression however only 22 of these genes were in common with mRNAs responding to

prion infection in mice suggesting that the molecular pathology observed in mice may not be

applicable to other species The proteins are compared to the differentially regulated genes as

well as to previously published proteins showing changes consistent with other prion animal

models

112

Introduction

Prion diseases are an unusual class of fatal transmissible neurodegenerative disorders

that affect the mammalian central nervous system They are caused by the accumulation of an

abnormal conformation of the normal host encoded cellular prion protein PrPC This

conformational rearrangement of PrPC is brought about by template directed misfolding wherein

seed molecules of the abnormal isoform PrPScrapie

PrPSc

convert PrPC into new PrP

Sc molecules

Scrapie in sheep and goats is the prototypic prion disease and is so named because clinically

affected animals scrape their wool off on pen and stall walls Laboratory investigation of prion

diseases typically relies upon rodents which can be infected with natural isolates of scrapie1

albeit with some difficulty as ovid scrapie isolates need time to adapt to rodents This adaptation

is characteristic of prion disease interspecies transmissions and properly reflects the molecular

adaptation that must occur to allow interaction between exogenous foreign PrPSc

and host PrPC

molecules selecting for conformers which exhibit template directed misfolding In some cases

no conformational solution is found reflecting a species barrier to disease transmission

In recent years advances in genomics and proteomics technologies have allowed

unprecedented examination of the biomolecules that are altered upon exposure to prion agents

These studies2 3

have relied upon analysis of gene and protein expression changes in response to

prion infection with the aim of trying to identify pathways that might underlie the mechanism of

prion-induced neurotoxicity A second important aim has been to identify signature molecules

that might act as surrogate biomarkers for these diseases as there are significant analytical

challenges associated with sensitively detecting and specifically distinguishing disease-induced

conformational changes (PrPSc

) of the prion protein from normal host conformations (PrPC)

113

Mass spectrometry (MS)-based proteomics has become a promising tool for biomarker

discovery from biological fluids such as CSF blood and urine4-6

Two-dimensional gel

electrophoresis MS (2D-GE MS) and two-dimensional differential gel electrophoresis (2D-DIGE

MS) are the most widely used methods for proteomic profiling of prion infected CSF so far due

to the advantage of ready separation and quantification of proteins in complex biological samples

Studies using 2D-GE MS or 2D-DIGE MS to profile proteins in CSF have led to the

identifications of currently accepted biomarker for sCJD 14-3-3 protein and other potential

biomarkers for prion diseases7-9

However the application of this method in biomarker

discovery is limited by insufficient sensitivity and potential bias against certain classes of

proteins as gel-based separation does not work well for the low abundance proteins very basic

or acidic proteins very small or large proteins and hydrophobic proteins 10 11

In contrast to 2D-

GE or 2D-DIGE shotgun proteomics employs enzymatic digestion of the protein samples

followed by chromatographic separation prior to introduction into a mass spectrometer for

tandem MS analyses Shotgun proteomics has become increasingly popular in proteomic

research because these methods are reproducible highly automated and have a greater

likelihood of detecting low abundance proteins12 13

Due to the sample complexity in CSF and

because albumin comprises over half of the protein content in CSF removal of high-abundance

proteins including albumin is necessary to improve proteomic coverage and identify low-

abundance proteins One method is IgY immunodepletion14 15

which is performed prior to LC-

MSMS and uses species specific antibodies from chicken yolk to remove abundant proteins in

biological samples such as CSF In the present work CSF from control and rat adapted scrapie

animal models (biological N=5 technical N=3 replicates) were analyzed by LC-MSMS and we

114

indentified 512 proteins and 167 non-redundant protein isoforms (referred to as protein groups)

with 21 proteins being statistically up-regulated (p lt 005) and 7 statistically down-regulated

By and large this work has been performed using laboratory mice for the gene

expression studies and human cerebrospinal fluid for proteomics mice do not provide sufficient

volumes of CSF for proteomic studies Both approaches are exceedingly valuable as the mouse

model allows cross-sectional time course experiments to be performed including the important

pre-clinical phase of disease Critically however the relevance and generalizability of mouse

prion responses to other prion diseases especially human disease is unknown Human proteomic

studies on CSF guarantee relevant protein identifications but are limited to the clinical phase of

the disease when apparent markers may reflect gross neurodegeneration covering up subtle but

more specific responses To address these issues we have adapted mouse RML prions into rats

with the aim of expanding the knowledge of prion disease responses addressing the limitations

of mice and opening up the possibility of preclinical proteomic profiling in a laboratory rodent

In the present work CSF samples from control and rat adapted scrapie were analyzed by system

biology approaches Here we show that Rat Adapted Scrapie (RAS) is a powerful tool for an -

omics based approach to decipher the molecular impact of prion disease in vivo with

applicability to the molecular mechanisms of disease and biomarker discovery We identified

1048 genes which were differentially regulated in rat prion disease Astonishingly on the whole

mouse orthologs of these genes did not change in mouse adapted scrapie and vice versa

questioning the universality of previous mouse gene expression profiles These RAS gene

expression changes were identified in the CSF proteome where we detected 512 proteins and 167

protein groups with 21 proteins being statistically up-regulated (plt005) and 7 statistically down-

115

regulated in the CSF of prion diseased rats Many of the proteins detected have previously been

observed in human CSF from CJD patients

Materials and Methods

Ethics Statement

This study was carried out in accordance with the recommendations in the NIH Guide for Care

and Use of Laboratory Animals and the guidelines of the Canadian Council on Animal Care The

protocols used were approved by the Institutional Animal Care and Use Committees at the

University of Wisconsin and University of Alberta

Chemicals

Acetonitrile (HPLC grade) and ammonium bicarbonate (certified) were purchased from

Fisher Scientific (Fair Lawn NJ) Deionized water (182 MΩcm) was prepared with a Milli-Q

Millipore system (Billerica MA) Optima LCMS grade acetonitrile and water were purchased

from Fisher Scientific (Fair Lawn NJ) DL-Dithiothreitol (DTT) and sequencing grade modified

trypsin were purchased from Promega (Madison WI) Formic acid (ge98) was obtained from

Fluka (Buchs Switzerland) Iodoacetamide (IAA) tris(hydroxymethyl)aminomethane (Tris

ge999 ) ammonium formate (ge99995) glycine (ge985) and IgY-7 antibodies were

purchased from Sigma-Aldrich (Saint Louis MO)

Rat Transmission and Adaptation

Prion agents used were the rocky mountain lab RML strain of mouse adapted scrapie

Stetsonville transmissible mink encephalopathy16

(TME) Hyper (Hy) strain of Hamster TME 17

1st passage Skunk adapted TME prepared as described and C from genetically defined

transmissions18

116

Brains from animals clinically affected with prion disease were aseptically removed and

prepared as 10 wv homogenates in sterile water 50 microL of 10 brain homogenate was

inoculated into weanling wistar rats intracranially After one year of incubation preclinical rats

from RML infections were euthanized by CO2 inhalation and the brain excised homogenized

and re-inoculated into naive animals Subsequent serial passages were from rats clinically

affected with rat adapted scrapie

Brains from rat passages were aseptically removed and bisected sagittally Brain halves

were reserved for immunohistochemistry (IHC) in formalin or frozen for immunoblotting RNA

isolation or subsequent passage For IHC tissues were first fixed in 10 buffered formalin

followed by antigen retrieval at 121 oC 21 bar in 10mM citrate buffer for 2 min After cooling

to room temperature sections were treated with 88 formic acid for 30 min and 4M guanidine

thiocyanate for 2 h Endogenous peroxidases were inactivated by 003 hydrogen peroxide and

tissue was blocked with 1 normal mouse serum Mouse anti-PrP mAB SAF83 (Cayman

Chemical) was biotinylated and applied at a 1250 dilution in blocking buffer overnight at 4 oC

Washes were performed in 10mM PBS with 005 tween-20 Streptavidin-peroxidase

(Invitrogen) was added for Diaminobenzidine (DAB) color reaction Anti-GFAP

immunohistochemistry was performed as above except that formic acid and guanidine treatment

steps were excluded Mouse anti-GFAP mab G-A-5 (Sigma) was used at a 1400 dilution

Brain samples for immunoblot were treated with or without Proteinase K (Roche) at a

ratio of 35 microg PK 100 ug protein at 50 microg mL for 30 minutes at 37 oC Phosphotungstenic acid

enrichments were performed as described14 19

Bis-Tris SDS-PAGE was performed on 12

polyacrylamide gels (Invitrogen) and transferred to PVDF Immunoblotting was performed using

117

mABs SAF83 (Cayman Chemical) 6H4 (Prionics) or 3F10 (a kind gift from Yong-Sun Kim) all

at a 120000 dilution

Gene Expression Profiling

RNA was extracted from frozen brain halves from clinically affected and control animals with

the QIAshredder and RNeasy mini kit (Qiagen Valencia CA) in accordance with the

manufacturerrsquos recommendations Due to the relatively large mass of rat brain an initial

homogenization was performed with a needle and syringe in 5mL of buffer RLT before further

diluting an aliquot in accordance with the manufacturers protocol Total RNA was amplified and

labeled in preparation for chemical fragmentation and hybridization with the MessageAmp

Premier RNA amplification and labeling kit (Life Technologies Grand Island NY) Amplified

and labeled cRNAs were hybridized on Affymetrix (Santa Clara CA) rat genome 230 20 high

density oligonucleotide arrays in accordance with the manufacturers recommendations

Differentially Expressed Genes were identified using Arraystar 50 (DNAStar Madison WI)

Robust multi-array normalization using the quantile approach was used to normalize all

microarray data A moderated T-test with a multiple comparison adjustment20

was used to reduce

the false discovery rate yet preserve a meaningful number of genes for pathway analysis

Pathway analysis was performed using the DAVID Bioinformatics database21

Comparative

analysis of genes induced by prions in mouse22

and rat disease was performed on genes

exhibiting at least a 2-fold change in expression Orthologs of rat and mouse genes were

identified using ENSEMBLE biomart release 6823

CSF Proteomic Profiling

118

CSF was collected from rats anaesthetized with isoflorane via puncture of the cisterna

magna A volume of 100-200 microL was routinely obtained CSF was touch spun for 30s at 2000xg

on a benchtop nano centrifuge to identify any blood contamination by the presence of a red

pellet CSF with blood contamination was not used for proteomic analysis CSF was prepared

for profiling by first depleting abundant proteins with an antibody based immunopartitioning

column IgY-R7 (Sigma-Aldrich Saint Louis MO) The manufacturers recommendations were

followed with a few deviations 40 microg of protein (~100 microL CSF) was bound to 100 microL of IgY

bead slurry in total volume of 600 microL formulated in 150mM ammonium bicarbonate The flow

through was collected and denatured by the addition of 10 microL of 8 M guanidine thiocyanate and

lyophilized in a speed-vac apparatus The sample was then reconstituted in 10 microL of water and 1

microL of 050 M DTT were added and allowed to incubate at 37oC for 30 minutes After incubation

27 microL of 055 M IAA was added for carboxymethylation and the sample was allowed to

incubate in the dark for 15 minutes After that 1 microL of 050 M DTT was added and allowed to

sit at room temperature for 10 minutes To perform trypsin digestion 70 microL of 50 mM

NH4HCO3 was then added along with 025 microg trypsin Digestion was performed overnight at

37oC and quenched by addition of 5 microL of 10 formic acid The tryptic peptides were then

subjected to solid phase extraction using an Agilent OMIX Tip C18 100 microL (Santa Clara CA)

Peptides were eluted with 50 ACN in 01 formic acid concentrated and reconstituted in 30

microL H2O with 01 formic acid

RP nanoLC separation

The setup for nanoLC consisted of a nanoAcquity (Milford MA) with a 10 microL injection

loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B consisted of

ACN Injections consisted of 5 microL of prepared CSF tryptic peptides onto a Waters 5 microm

119

Symmetry C18 180 microm x 20 mm trap cartridge (Milford MA) at a flow rate of 75 microLmin for 5

minutes at 95 A 5 B Separation was performed on a Waters 17 microm BEH130 C18 75 microm x

100 mm (Milford MA) with a gradient of 5 to 15 B over 10 minutes followed by 15 to

40 B over 80 minutes at room temperature

Mass spectrometry data acquisitions

An ion trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with a Bruker CaptiveSpray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Waters Acquity console software to perform MS acquisitions for all experiments Smart

parameter setting (SPS) was set to 700 mz compound stability and trap drive level was set at

100 Optimization of the CaptiveSpray source resulted in dry gas temperature of 160oC dry

gas flow of 30 Lmin capillary voltage of -1325 V end plate offset of -500 V MSMS

fragmentation amplitude of 10V and SmartFragmentation set at 30-300

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms one spectral average rolling average 2 acquisition

range of 350-1500 mz and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS

generation the ICC target was set to 300000 maximum accumulation time 5000 ms two

spectral averages acquisition range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

120

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required for spectral counting to prevent

loss of spectra Identification of peptides were performed using Mascot24

(Version 24 Matrix

Science London UK) Database searching was performed against a forward and reversed

concatenated SwissProt Rattus rattus Mascot search parameters were as follows Allowed

missed cleavages 2 enzyme trypsin fixed modification carboxymethylation (C) variable

modifications oxidation (M) peptide tolerance plusmn12 Da maximum number of 13

C 1 MSMS

tolerance plusmn05 Da instrument type ESI-Trap Data was then transformed into Dat file formats

and transferred to ProteoIQ (NuSep Bogart GA) False positive analyses were calculated using

ProteoIQ and set at 1

Results

Development of Rat Adapted Scrapie

To develop a rat adapted strain of prion disease we introduced 6 different prion agents RML

TME Hy Skunk Adapted TME and Chronic Wasting Disease (CWD) from wild type 96G and

96S deer16-18

into 6 rats (Fig 1) Of these primary transmissions only RML induced the

accumulation of Proteinase K resistant PrP after one year of incubation as determined by western

blotting on 10 brain homogenates and PrPSc

enriched phoshotungstenic acid precipitated brain

homogenates Second passage of rat adapted RML scrapie resulted in clinically affected rats at

565 days post inoculation Serial passages of TME or CWD agents were not performed Clinical

symptoms consisted of ataxia lethargy wasting kyphosis and myoclonus Prion affected rats

121

also showed low level porphyrin staining around their head Subsequent serial passage decreased

incubation time to 215 days

Proteinase K resistant prion protein was observed from all clinically affected animals both by

immunoblot (Fig 2) and by immunohistochemistry (Fig 3) Di- and mono-glycosylated bands

were the most abundant isoforms of PrPSc

PrPSc

was extensively deposited in the cerebral cortex

hippocampus thalamus inferior colliculus and granular layer of the cerebellum GFAP

expressing activated astrocytes were found throughout the brain particularly in the white matter

of the hippocampus thalamus and cerebellum Spongiform change was a dominant feature of

clinical rat

Gene expression Profiling

In total 1048 genes were differentially regulated within a 95 confidence interval

(Supplementary Table 1) and 305 genes were differentially expressed by greater than 2-fold (Fig

4) The 1048 genes that were statistically significant were used for pathway analysis using

DAVID Pathway analysis suggested that the gene expression profile was consistent with

immune activation and maturation as well as inflammation (Supplementary Table 2) a likely

interpretation given the observable astrocytosis (Fig 3) and gliosis associated with prion disease

Other pathways highlighted by the analysis included increases in transcription of genes involved

in lysosomes and endosomes

To further probe the gene expression data we compared genes which were differentially

expressed in rat adapted scrapie against their orthologs expression in mouse scrapie and vice

versa (Figs 5 and 6) We did not observe a high degree of correlation between expression fold

changes For example GFAP a gene whose up-regulation in prion disease is well known was

122

increased 25 fold in rat adapted scrapie but up-regulated 93 fold in mouse scrapie A

qualitative analysis of expression of orthologs in prion disease suggests that many genes

deregulated at least 2-fold in mouse or rat do not have orthologs that are differentially expressed

For example the gene RNAseT2 was up-regulated greater than threefold in rat adapted scrapie

but was not significantly up-regulated in mouse Similarly three genes important in metals

homeostatsis metallothionein 1a and 2a and ceruloplasmin were up-regulated in rat 46 43 and

3 fold respectively but were not differentially expressed in mouse prion disease

CSF Proteomics

Each immunodepleted biological replicate (N=5 for each control and RAS) had technical

triplicates performed The triplicates were summed together using ProteoIQ Peptide spectral

counts were then summed for each protein and subjected to dNSAF analysis using ProteoIQ

internal algorithms Details for this method can be found elsewhere25 26

but briefly peptide

spectral counts are summed per protein (SpC) based on unique peptides and a weighted

distribution of any shared peptides with homologous proteins T-tests were used to identify

significant changes in protein expression 1032 unique peptides which identify 512 proteins and

167 protein groups were found Of these 512 proteins 437 were identified in both RAS and

control CSF samples (Fig 7) The complete list of all 512 proteins can be viewed in

Supplemental Table 3 Using the ProteoIQ the most likely isoform was selected such as 14-3-3

protein gamma

From Table 1 we observe five proteins that agree with the genomic data for up

regulations in RAS granulin macrophage colony-stimulating factor 1 receptor cathepsin D

complement factor H and ribonuclease T2 In addition serine protease inhibitor A3N was not

123

detected as up regulated in the RAS genomic data but was found to be up-regulated in previous

genomic profiling of the mouse prion model22

One interesting trend from the data in Table 1 is

that the majority of proteins found to be up-regulated in the RAS model were not detected in the

control samples The absence of the detection of those proteins such as ribonuclease T2 in the

control CSF does not necessarily suggest the absence of the protein nonetheless it is below the

detection limits for this current proteomics protocol and instrumentation

Discussion

Mice have been the preferred laboratory rodent for prion diseases research because they

can be inexpensively housed and are amenable to transgenesis which allows for short incubation

periods as well as hypothesis testing upon targeted pathways Pursuant to the determination of

the mouse genome and the development of high density transcriptional arrays for measurements

of gene expression profiling mice have been used extensively to examine the molecular

pathology of prion disease probing the impact of disease and animal strain In order to expand

upon this foundation we adapted mouse prions to infect rats We reasoned that by taking a

comparative approach to the molecular pathology of prion disease inferences could be obtained

into the variability of the molecular response to prion diseases and that understanding this

variability might suggest whether human prion disease responses are more or less similar to

mouse responses A second rationale is the desire to identify surrogate markers of prion disease

While this approach has been taken before using gene expression profiling a more direct

approach has been to use proteomic approaches on cerebral spinal fluid (CSF) identifying

proteins that are increase in abundance with disease A rat prion disease is valuable for this

because the rat proteome is established and rats allow for the collection of relatively large

volumes of CSF (~100 microL per animal) which allows for robust analytical separations enhancing

124

detection of biomarkers Finally rats unlike humans can be used in a time course study of prion

disease This allows for the identification of early transcriptional and proteomic responses to

prion infection responses which are particularly valuable for the identification of surrogate

disease biomarkers

To initiate the development of a rat prion disease we attempted to adapt six different

prion disease agents PrPres

molecules to rat via intracranial inoculation of weanling animals

(Figure 1) Of these six agents only mouse RML prions were able to surmount the species

barrier to prion disease transmission Mouse PrP differs from rat by eight amino acid changes

six in the mature protein and four in the N-terminal octa-peptide repeat region (Supplementary

Fig 1) As rat shares the most prion protein sequence homology with mouse it is perhaps not

surprising that it transmitted whereas the other did not confirming that the primary prion protein

sequence is the most important determinant for interspecies transmission We conclude that there

is a large molecular species barrier preventing conversion of rat PrPc into PrP

res

The transmission of mouse RML into rats was characterized by a shortening of the

incubation period following each passage This is indicative of agent adaption to the new host

and increases in the titer present in end-stage brain Overall our adaptation of mouse prion

disease into rats resulted in a similar agent to that observed by Kimberlin27

The differences in

incubation period at second passage are largely due to our collecting the animals at 365 days post

inoculation upon first passage whereas Kimberlin and Walker allowed the first passage animals

to reach end-stage clinical rats

Rat adapted scrapie is in many ways similar to other prion diseases The clinical period of

disease is marked by a progressive neurological dysfunction with prominent ataxia kyphosis and

125

wasting The protease resistant PrP observed is typical as is the widespread deposition of PrPSc

in

the brain Spongiosis and reactive astrogliosis are as expected of a prion disease

Gene expression profiles from rats clinically affected with prion disease revealed a strong

neuronal inflammation associated with proliferated and activated astrocytes This is perhaps best

observed through the up-regulation of GFAP GFAP positive astrocytes were omnipresent

throughout the brain and GFAP mRNA was up-regulated 25 fold The up-regulation of GFAP is

a hallmark of the molecular response to prion infection and has been routinely observed Our

comparative analysis of gene expression changes in mouse RML versus rat adapted scrapie

suggest substantial differences in gene expression in response to prion disease despite the fact

that the overall response is neuro-inflammatory This suggests that the potential overlap between

mouse expression profiles and a putative human CJD expression profile could be quite different

at the level of individual transcripts that might be expected to be changed

CSF Proteomics

CSF proteomics can be exceedingly challenging due to the small sample available large

dynamic range in proteins and abundant proteins limiting sample loading onto nanoscale

columns Dynamic range reduction in the CSF sample was achieved using a custom amount of

IgY-7 antibodies to abundant proteins such as albumin After immunodepletion the total

protein content was reduced by ~90 limiting the proteomics analysis to one dimensional

separation Label free quantitation spectral counting was performed because it requires less

protein and does not increase sample complexity The proteins identified from the affected and

control N=5 are shown in Supplemental Table 2 Consistent detection of proteins in CSF from

both control and infected rats was observed (Fig 7C) Only two proteins were identified in

126

controls that were not observed in RAS and only 10 proteins were only observed in RAS Some

of these proteins that were only identified in RAS are significantly changed (Supplemental Table

3) One concern in proteomics data is the variability from run to run and the possibility that

certain proteins are identified from different unique peptides Figure 7A shows that the vast

majority of the unique peptides detected (783) were identified with a 1 FDR in both RAS and

control CSF samples highlighting the analytical reproducibility of our methodology

Proteomic analysis of the infected rat CSF provides a reasonable approach to cross

validate biomarkers detected by gene expression profiling For example RNAseT2 a secreted

ribonuclease was up-regulated 3-fold at the mRNA level and was also enriched in CSF from

infected rats (Table 1) Similarly coordinated increases in detection of colony stimulating factor

1 receptor complement factor H granulin and cathepsin D were also observed Conversely

proteomic analysis of CSF also allows for the observation of post-transcriptional responses to

prion disease that might be absent in gene expression profiling The 14-3-3 proteins and neuron

specific enolase both known markers for CJD are only detected by proteomic analysis Thus

gene expression profiling and proteomic detection serve to increase confidence in the

observation of up-regulation enhancing the likelihood that proteins detected by both

methodologies are specific and perhaps may be more sensitive at earlier time points

Comparison to human CSF prion disease proteome

In our comparative analysis 21 proteins were found to be up-regulated while 7 proteins

down-regulated in prion infected rats Included in the up-regulated proteins were the 14-3-3

proteins epsilon and gamma 14-3-3 zetadelta was also up-regulated but not statistically

significant Enolase 2 or neuron specific enolase (NSE) was also up-regulated in CSF of infected

127

rats These proteins are all in agreement with results from previous proteomic profiling of human

CSF from patients with CJD8 9

The detection of 14-3-3 protein is included in the diagnostic

criteria approved by World Health Organization for the pre-mortem diagnosis of clinically

suspected cases of sCJD28

although its application in large-scale screening of CJD is still

debated due to high false positive rate where elevated 14-3-3 protein levels were also reported in

other conditions associated with acute neuronal damage29 30

It was suggested that other brain-

derived proteins that are also specific to CJD can be used in conjunction with 14-3-3 protein to

increase diagnosis accuracy and specificity31

NSE is present in high concentration in neurons

and in central and peripheral neuroendocrine cells NSE serves as another valuable biomarker in

diagnosis of CJD as the elevated level of NSE in CSF was found in early and middle stages of

CJD 32

Other proteins detected in CSF included cystatin C and serpina3N although both of

these were not statistically changed These proteins were both previously identified as being

putative biomarkers for CJD33 34

Utility for biomarker discovery with emphasis on onset of biomarker appearance in CSF

The investigation of the protein changes in CSF from RAS compared to control rats

provides a solid foundation when investigating potential biomarkers with prion disease onset

The cross-validation of the genomic and proteomics data further emphasizes the targets for

consideration during disease onset Biomarker discovery provides the potential to determine if

animals such as cows in bovine spongiform encephalopathy (BSE) are affected instead of

having to be euthanized when BSE is expected In addition CSF collection in mice or hamsters

Prion models is extremely difficult and limited alternatively with the advent of the RAS model

CSF can be collected in sufficient amounts for CSF proteomics CSF collection for mice or

hamsters can be ~10 microL and could be contaminated with blood further complicating proteomic

128

analysis unlike rats which over 10 times more CSF can be collected per animal35

Due to the

amount of CSF available time course studies analyzing CSF proteomics is possible in RAS due

to animal numbers that are manageable and reasonable The RAS model further allows

investigators to bypass working with highly infections CJD CSF samples to investigate the CSF

proteome changes

Conclusion

In this study we have described the gene and protein expression changes in brain and

spinal fluid from a transmission of mouse prions into rats We find that while the overall gene

expression profile in rats is similar to that in mice the specific genes that make up that profile

are different suggesting that genes that change in response to prion disease in different species

may not be as conserved Analysis of CSF from rat adapted scrapie identified similar protein

changes as known in human CJD The rat will be a useful model to identify surrogate markers

that appear prior to the onset of clinical disease and thus may be of higher specificity and

sensitivity

Supplemental Information Available Upon Request

1 Chandler R L Encephalopathy in mice produced by inoculation with scrapie brain material Lancet 1961 1 (7191) 1378-9 2 Silva C J Onisko B C Dynin I Erickson M L Requena J R Carter J M Utility of Mass Spectrometry in the Diagnosis of Prion Diseases Anal Chem 2011 3 Wei X Herbst A Ma D Aiken J Li L A quantitative proteomic approach to prion disease biomarker research delving into the glycoproteome J Proteome Res 2011 10 (6) 2687-702 4 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome Res 2008 7 (1) 386-99 5 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 2003 878 (22) 2003-12 6 Cunningham R Ma D Li L Mass spectrometry-based proteomics and peptidomics for systems biology and biomarker discovery Frontiers in Biology 2012 7 (4) 313-335

129

7 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66 8 Harrington M G Merril C R Asher D M Gajdusek D C Abnormal proteins in the cerebrospinal fluid of patients with Creutzfeldt-Jakob disease N Engl J Med 1986 315 (5) 279-83 9 Piubelli C Fiorini M Zanusso G Milli A Fasoli E Monaco S Righetti P G Searching for markers of Creutzfeldt-Jakob disease in cerebrospinal fluid by two-dimensional mapping Proteomics 2006 6 Suppl 1 S256-61 10 Lilley K S Razzaq A Dupree P Two-dimensional gel electrophoresis recent advances in sample preparation detection and quantitation Curr Opin Chem Biol 2002 6 (1) 46-50 11 Oh-Ishi M Maeda T Separation techniques for high-molecular-mass proteins J Chromatogr B Analyt Technol Biomed Life Sci 2002 771 (1-2) 49-66 12 Metz T O Qian W J Jacobs J M Gritsenko M A Moore R J Polpitiya A D Monroe M E Camp D G 2nd Mueller P W Smith R D Application of proteomics in the discovery of candidate protein biomarkers in a diabetes autoantibody standardization program sample subset J Proteome Res 2008 7 (2) 698-707 13 Aebersold R Mann M Mass spectrometry-based proteomics Nature 2003 422 (6928) 198-207 14 Huang L Harvie G Feitelson J S Gramatikoff K Herold D A Allen D L Amunngama R Hagler R A Pisano M R Zhang W W Fang X Immunoaffinity separation of plasma proteins by IgY microbeads meeting the needs of proteomic sample preparation and analysis Proteomics 2005 5 (13) 3314-28 15 Rajic A Stehmann C Autelitano D J Vrkic A K Hosking C G Rice G E Ilag L L Protein depletion using IgY from chickens immunised with human protein cocktails Prep Biochem Biotechnol 2009 39 (3) 221-47 16 Marsh R F Bessen R A Lehmann S Hartsough G R Epidemiological and experimental studies on a new incident of transmissible mink encephalopathy J Gen Virol 1991 72 ( Pt 3) 589-94 17 Bessen R A Marsh R F Biochemical and physical properties of the prion protein from two strains of the transmissible mink encephalopathy agent J Virol 1992 66 (4) 2096-101 18 Johnson C J Herbst A Duque-Velasquez C Vanderloo J P Bochsler P Chappell R McKenzie D Prion protein polymorphisms affect chronic wasting disease progression PLoS One 2011 6 (3) e17450 19 Safar J Wille H Itri V Groth D Serban H Torchia M Cohen F E Prusiner S B Eight prion strains have PrP(Sc) molecules with different conformations Nat Med 1998 4 (10) 1157-65 20 Benjamini Y Hochberg Y Controlling the False Discovery Rate A Practical and Powerful Approach to Multiple Testing Journal of the Royal Statistical Society Series B (Methodological) 1995 57 (1) 289-300 21 Huang da W Sherman B T Lempicki R A Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources Nat Protoc 2009 4 (1) 44-57 22 Moody L R Herbst A J Aiken J M Upregulation of interferon-gamma-induced genes during prion infection J Toxicol Environ Health A 2009 74 (2-4) 146-53 23 Flicek P Amode M R Barrell D Beal K Brent S Carvalho-Silva D Clapham P Coates G Fairley S Fitzgerald S Gil L Gordon L Hendrix M Hourlier T Johnson N Kahari A K Keefe D Keenan S Kinsella R Komorowska M Koscielny G Kulesha E Larsson P Longden I McLaren W Muffato M Overduin B Pignatelli M Pritchard B Riat H S Ritchie G R Ruffier M Schuster M Sobral D Tang Y A Taylor K Trevanion S Vandrovcova J White S Wilson M Wilder S P Aken B L Birney E Cunningham F Dunham I Durbin R Fernandez-Suarez X M Harrow J Herrero J

130

Hubbard T J Parker A Proctor G Spudich G Vogel J Yates A Zadissa A Searle S M Ensembl 2012 Nucleic Acids Res 2012 40 (Database issue) D84-90 24 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein identification by searching sequence databases using mass spectrometry data Electrophoresis 1999 20 (18) 3551-67 25 Zybailov B Mosley A L Sardiu M E Coleman M K Florens L Washburn M P Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae J Proteome Res 2006 5 (9) 2339-47 26 Zhang Y Wen Z Washburn M P Florens L Refinements to label free proteome quantitation how to deal with peptides shared by multiple proteins Anal Chem 2010 82 (6) 2272-81 27 Kimberlin R H Cole S Walker C A Temporary and permanent modifications to a single strain of mouse scrapie on transmission to rats and hamsters J Gen Virol 1987 68 ( Pt 7) 1875-81 28 World Health Organization Global surveillance diagnosis and therapy of human transmissible spongiform encephalopathies report from a WHO consultation 1998 World Health Organization Geneva Switzerland 9-11 February 1998 1-26 29 Bartosik-Psujek H Archelos J J Tau protein and 14-3-3 are elevated in the cerebrospinal fluid of patients with multiple sclerosis and correlate with intrathecal synthesis of IgG J Neurol 2004 251 (4) 414-20 30 Saiz A Graus F Dalmau J Pifarre A Marin C Tolosa E Detection of 14-3-3 brain protein in the cerebrospinal fluid of patients with paraneoplastic neurological disorders Ann Neurol 1999 46 (5) 774-7 31 Sanchez-Juan P Green A Ladogana A Cuadrado-Corrales N Saanchez-Valle R Mitrovaa E Stoeck K Sklaviadis T Kulczycki J Hess K Bodemer M Slivarichova D Saiz A Calero M Ingrosso L Knight R Janssens A C van Duijn C M Zerr I CSF tests in the differential diagnosis of Creutzfeldt-Jakob disease Neurology 2006 67 (4) 637-43 32 Zerr I Bodemer M Racker S Grosche S Poser S Kretzschmar H A Weber T Cerebrospinal fluid concentration of neuron-specific enolase in diagnosis of Creutzfeldt-Jakob disease Lancet 1995 345 (8965) 1609-10 33 Sanchez J C Guillaume E Lescuyer P Allard L Carrette O Scherl A Burgess J Corthals G L Burkhard P R Hochstrasser D F Cystatin C as a potential cerebrospinal fluid marker for the diagnosis of Creutzfeldt-Jakob disease Proteomics 2004 4 (8) 2229-33 34 Miele G Seeger H Marino D Eberhard R Heikenwalder M Stoeck K Basagni M Knight R Green A Chianini F Wuthrich R P Hock C Zerr I Aguzzi A Urinary alpha1-antichymotrypsin a biomarker of prion infection PLoS One 2008 3 (12) e3870 35 DeMattos R B Bales K R Parsadanian M ODell M A Foss E M Paul S M Holtzman D M Plaque-associated disruption of CSF and plasma amyloid-beta (Abeta) equilibrium in a mouse model of Alzheimers disease J Neurochem 2002 81 (2) 229-36

131

Figure 1 Interspecies transmission of prion disease to rats End-stage clinical brain homogenates

were used to passage prion disease After one year of incubation animals were euthanized to

determine the extent of PrPres

accumulation Protease resistance PrP was only observed in those

animals infected with RML scrapie prions This material was serially passaged for two more

incubations before becoming rat-adapted as indicated by the shortening of the incubation period

132

Table 1 A list of selected proteins and mRNAs up-regulated in RAS CSF and brain tissue If

the protein is detected in the RAS CSF but not the control CSF the CSF expression is reported

with a infin If there is no change or data on certain genes related to an up regulated protein nd is

noted The mouse genomic data presented here was previously published22

Gene Protein Symbol Accession CSF

Expression

Rat

GEX

Mouse

GEX

14-3-3 protein zetadelta Ywhaz NP_037143 infin nd nd

14-3-3 protein epsilon Ywhae NP_113791 infin nd nd

14-3-3 protein gamma Ywhag NP_062249 infin nd nd

serine protease inhibitor A3N Serpin A3N NP_113719 54 nd 975

enolase 2 (gamma neuronal) Eno2 NP_647541 infin nd nd

granulin GRN NP_058809 62 364 184

macrophage colony-stimulating

factor 1 receptor

Csf1r NP_001025072 infin 293 205

cathepsin D CTSD NP_599161 infin 255 299

complement factor H Cfh NP_569093 376 234 nd

ribonuclease T2 RNAset2 NP_001099680 infin 302 nd

133

Figure 2 Accumulation of PrPSc

in rat adapted scrapie First second and third passage brain

homogenates were assayed for the presence of protease resistant PrP by immunoblot PrPSc

was

observed following phosphotungstenic acid enrichment at first passage By clinical disease in 2nd

and 3rd

passage rats PrPSc

had substantially accumulated

134

Figure 3 Histological appearance of rat adapted scrapie in the hippocampus at clinical disease

Infected animals showed intense immuno-staining for deposits of PrPSc

and GFAP expressing

astrocytes Spongiform change is an abundant feature in rat adapted scrapie

135

Figure 4 Scatter plot of gene expression profiling of rat adapted scrapie The expression of

individual genes from uninfected and infected animals were plotted to display up and down

regulation The dashed green line is no change Solid green lines are 2-fold changes in gene

expression

136

Figure 5 Quantitative relationship between orthologous pairs of two-fold up-regulated genes in

mouse and rat scrapie Genes up-regulated more than two fold were mapped to their orthologs

and the fold change was plotted Expression is log2 transformed

137

Figure 6 Comparison of genes upregulated in rat and mouse prion diseases Genes up-regulated

two fold in rodent scrapie were identified and the expression of their orthologs was determined

138

Figure 7 Venn diagram comparison of the CSF proteomics data between rat adapted scrapie

(RAS) model and control rats (A) Unique peptides from tryptic peptides produced for the

proteins identified (B) The total proteins identified including all isoforms within the protein

groups (C) The protein groups comparing only the top protein hit of the protein isoforms

showing very consistent protein identifications between RAS and control

139

Chapter 5

Investigation of the differences in the phosphoproteome between starved vs

glucose fed Saccharomyces cerevisiae

Adapted from ldquoInvestigation of the differences in the phosphoproteome between starved vs

glucose fed Saccharomyces cerevisiaerdquo Cunningham R Grunwald D Wellner D Conway M

Heideman W Li L In preparation

140

Abstract

This work explores comparative proteomics between starved and glucose fed

Saccharomyces cerevisiae (bakers yeast) Yeast depends on nutrients such as glucose to

survive and need to cycle between states of growth and quiescence Kinases such as protein

kinase A (PKA) and target of rapamycin kinase (TOR) are involved in the glucose response

Thus performing a large scale phosphoproteomic MS study can be highly beneficial to map the

signaling networks and cascades involved in glucose-dependent stimulated yeast growth Yeast

cell extract was digested and phosphopeptides were enriched by immobilized metal affinity

chromatography (IMAC) followed by two dimensional high pH-low pH reversed phase (RP)-RP

separation The low pH separation was infused directly into an ion trap mass spectrometer with

neutral loss electron-transfer dissociation (ETD)-triggered fragmentation to improve

phosphopeptide identifications In total 477 unique phosphopeptides are identified with 06

false discovery rate (FDR) and 669 unique phosphopeptides identified with 54 FDR This

study includes comparative phosphoproteins for one specific motif of PKA xxxKRKRxSxxxx

which is presented and differences between starved vs glucose fed are highlighted Phosphosite

validation is performed using a localization algorithm Ascore to provide more confident and

site-specific characterization of phosphopeptides

141

Introduction

Microorganisms such as Saccharomyces cerevisiae (yeast) cycle between growth when

nutrients are available and quiescence when nutrients are scarce When glucose is limited yeast

go into growth arrest state but when glucose is added growth quickly resumes Kinases such as

protein kinase A (PKA) and target of rapamycin (TOR) regulate growth in response to nutrient

conditions and have been well studied through transcriptional control1-4

Yeast execute large

transcriptome alterations in response to changing environmental growth conditions5 6

Gene

regulation by glucose introduction in yeast has been studied including genes used for growth on

alternative carbon sources and activation of genes coding for glucose transport and protein

synthesis7-10

Response to nutrients for survival is not limited to yeast biology and indeed all

living creatures both prokaryotes and multicellular eukaryotes like mammals require nutrient

responsiveness and coordinating cellular functions to survive

With regulation of certain genes well studied by glucose introduction the mechanism and

global protein modulation caused by glucose introduction remain unknown6 Large-scale

phosphorylation analysis has been previously performed in yeast using mass spectrometry11-14

Mapping the phosphoproteome in yeast transitioning from quiescence to growth conditions to

better understand the roles of phosphorylation in orchestrating growth is needed The

phosphorylation state during growth or starvation could be used to engineer cells to drive ectopic

activity (or inhibition) to promote growth and ethanol production on non-native sugars like

xylose

It has been reported that the phosphorylation state can be affected by the introduction of

glucose to carbon-starved yeast15

and phosphorylation plays a significant role in the cell cycle

and signal transduction16

Specifically O-Phosphorylation can function as a molecular switch by

142

changing the structure of a protein via alteration of the chemical nature of an amino acid for

serine threonine or tyrosine It has been reported that up to 30 of the proteome can undergo

phophorylation17

Mass spectrometry has evolved as a powerful tool to accomplish phosphosite

mapping using shotgun proteomics With available technology tens of thousands of

phosphorylation sites can be mapped to amino acids on thousands of proteins using shotgun

proteomics18-20

Mass spectrometry can offer sensitive automated non-targeted global analysis of

phosphorylation events in proteomic samples but in any large scale phosphoproteomic

investigation enrichment of phosphoproteinspeptides is required First phosphorylation is

usually a sub-stoichiometric process where only a percentage of all protein copies are

phosphorylated21

Various enrichment methods have been used for phosphopeptide enrichment

including metal oxide affinity chromatography (MOAC)22

such as TiO223

immobilized metal

affinity chromatography (IMAC)12 24 25

electrostatic repulsion-hydrophilic interaction

chromatography (ERLIC)26

and immunoaffinity of tyrosine phosphorylation27 28

After

enrichment MS analyses of phosphorylated peptides are still challenging due to ion suppression

from non-phosphorylated peptides

Even after phosphopeptide enrichment further sample preparation is needed for large

scale proteomic experiments Additional fractionation can increase protein coverage of a

sample by over ten fold such as MudPIT29

(multidimensional protein identification technology)

In online MudPIT a sample is injected onto a strong cation exchange (SCX) column coupled to

a RP column Successive salt bumps followed by low pH gradients provide the separation of

peptides in two dimensions Since the phosphate groups on phosphopeptides have a low pKa

value due to being more acidic then their unmodified counterparts they tend to elute earlier and

143

disproportionally in ion exchange chromatography like SCX Alternatively reversed-phase

reversed-phase (RP-RP) separation has been proposed as an alternative to MudPIT for offline

two dimensional (2D) separation30

One of the caveats of 2D separation is the potential for

wasted mass spectrometry time from early and late fractions having very few peptides present

all while having too much sample for middle fractions One simple method to reduce these

ldquowasted fractionsrdquo is to combine early or late fractions with the middle fractions to reduce MS

runs with little peptide content to analyze thus shortening the overall analysis time31

In addition the labile phosphorylation group has a large propensity to undergo cleavage

during collision induced dissociation (CID) producing a neutral loss The neutral loss can

produce insufficient backbone fragment ions for MSMS identification32

A solution to neutral

loss fragmentation in phosphopeptides is MS3 using CID to produce improved backbone

fragmentation13 14 33

An alternative fragmentation method to CID for fast sampling ion traps is

electron transfer dissociation (ETD)34-36

ETD produces a more uniform back-bone cleavage

where the cation peptide receives an electron from a low affinity radical anion37

The transferred

electron induces a dissociation at the N-Cα bond to produce c- and zbull-type product ions while

retaining the labile post-translational modifications (PTM) such as phosphorylation bound to the

product ions38

The Bruker amaZon ETD ion trap mass spectrometer has the potential to trigger

ETD fragmentation after a labile PTM produces a neutral loss from CID fragmentation This

method is termed neutral loss-triggered ETD fragmentation and provides a complementary

fragmentation pathway to labile poor fragmenting phosphorylated peptides

In this work we provide a qualitative comparative list of yeast phosphopeptides observed

in glucose fed vs glucose starved conditions

144

Experimental

EXPERIMENTAL DETAILS

Chemicals

Acetonitrile (HPLC grade) ammonium hydroxide (ACS plus certified) urea (gt99)

sodium chloride (997) ammonium bicarbonate (certified) Optima LCMS grade acetonitrile

Optima LCMS grade water and EDTA disodium salt (99) were purchased from Fisher

Scientific (Fair Lawn NJ) Deionized water (182 MΩcm) was prepared with a Milli-Q

Millipore system (Billerica MA) DL-Dithiothreitol (DTT) and sequencing grade modified

trypsin were purchased from Promega (Madison WI) Formic acid (ge98) was obtained from

Fluka (Buchs Switzerland) Iodoacetamide (IAA) ammonium formate (ge99995) Trizma

hydrochloride (Tris HCl ge990) Triton X-100 (laboratory grade) and iron (III) chloride

hexahydrate (97) were purchased from Sigma-Aldrich (Saint Louis MO) Sodium dodecyl

sulfate (SDS) (998) was purchased from US Biological (Marblehead MA) Nickel

nitrilotriacetic acid (Ni-NTA) magnetic agarose beads were purchased from Qiagen (Valencia

CA) Autoclaved yeast peptone dextrose (YPD) media was prepared in 1 L of deionized water

using 10 g yeast extract 20 g Peptone from Becton Dickinson and Company (Sparks MD) and

20 g D-(+)-Glucose (ge995) purchased from Sigma-Aldrich (Saint Louis MO)

Modified Mary Miller Yeast Protein Isolation

The yeast culture was prepared and protein extraction was performed using a modified

Mary Miller protocol39

Briefly yeast strain s288c was inoculated with YPD media and shook

for 72 hours to allow the yeast to be glucose starved After the 72 hours the yeast culture was

partitioned into two flasks To one flask glucose was added at 2 of the final concentration and

allowed to incubate and shake for 10 minutes Then 100 optical density units (ODU) of the yeast

145

culture were collected from each flask Each yeast sample was spun down in a Beckman-Coulter

J6B centrifuge (Indianapolis IN) at 2500xg for three minutes The media was aspirated and the

tubes were flash frozen in liquid nitrogen and stored at -80oC The yeast pellets were thawed on

ice and resuspended in 1 mL of lysis buffer (50 mM Tris HCl 01 triton x-100 05 SDS

pH=80) and 1X protease and phosphatase inhibitor cocktail from Thermo Scientific (Rockford

IL) and transferred to an Eppendorf tube To the yeast 200 microL of glass beads were added and

amalgamated for 25 minutes at 4oC The sample was inverted and the Eppendorf tube was

pierced with a hot 23 gauge needle through the bottom and the tube was placed atop a 5 mL

culture tube and centrifuge in the Beckman-Coulter J6B at 2500xg for three minutes at 4oC to

collect the liquid containing the yeast cells while the glass beads remain trapped in the

Eppendorf tubes The protein lysate was then centrifuged at 3000xg for five minutes at 4oC and

the supernatant was collected and stored at -80oC

Preparation of tryptic digests

The Saccharomyces cerevisiae protein cell extract for fed or starved were analyzed with a

BCA protein assay kit (Thermo Scientific Rockford IL) and 2 mg of protein was added to four

parts -20oC acetone and allowed to precipitate the protein for 1 hour at -20

oC The samples were

then centrifuged at 14000xg for 5 minutes and the supernatant was removed and the protein

pellet was reconstituted in 360 microl of 8 M urea To each sample 20 microL of 050 M DTT was

added and allowed to incubate at 37oC for 45 minutes After incubation 54 microL of 055 M IAA

was added for carbamidomethyl of cysteine and the sample was allowed to incubate for 15

minutes in the dark To quench the IAA 20 microL of 050 M DTT was added and allowed to react

for 10 minutes To perform trypsin digestion 1400 microL of 50 mM NH4HCO3 was then added

along with 20 microg trypsin to each sample Digestion was performed overnight at 37oC and

146

quenched by addition of 25 microL of 10 formic acid For each sample the tryptic peptides were

then subjected to solid phase extraction using a Hypersep C18 100 mg solid phase extraction

(SPE) column (Thermo Scientific Bellefonte PA) Peptides were eluted with 50 ACN in

01 formic acid concentrated and reconstituted in 500 microL of 80 ACN and 01 formic acid

Phosphopeptide enrichment using immobilized metal affinity chromatography (IMAC)

One ml of Ni-NTA was washed three times with 800 microL of H2O and then the Ni was

removed with 800 microL of 100 mM EDTA in 50 mM ammonium bicarbonate and vortexed for 30

minutes The supernatant was removed and the NTA was washed with 800 microL of H2O three

times followed by addition of 800 microL 10 mM FeCl3 hexahydrate and vortexed for 30 minutes

The Fe-NTA was then washed three times with 800 microL of H2O and once with 80 ACN in 01

formic acid before being combined with the cell extract for phosphopeptide enrichment and

vortexed for 40 minutes The sample was washed three times with 200 microL of 80 ACN in 01

formic acid before elution with two aliquots of 200 microL of 5 ammonium hydroxide in 5050

ACNH2O with 15 minutes of vortexing The enriched phosphopeptides were then dried down

with a Thermo Scientific Savant SpeedVac (SVC100 Waltham MA) and reconstituted in 55 microL

25mM ammonium formate pH=75

First dimension neutral pH separation

Individually 50 microL of each yeast phosphopeptide enriched sample was injected onto a

Phenomenex Gemini-NX 3microm C18 110 Aring 150 x 2 mm column (Torrance CA) The Gemini

column was coupled to a Phenomenex SecurityGuard Gemini C18 2 mm guard cartridge

(Torrance CA) Mobile phase A consisted of 25 mM ammonium formate at pH=75 and mobile

phase B consisted of 25 mM ammonium formateACN with a 19 ratio respectively at pH=75

The column was operated at 50oC at a flow rate of 05 mLmin The flow profile was 2-30 B

147

over 65 minutes and then 5 min at 100 B A total of 22 fractions were collected every 3

minutes and the fractions were combined as follows 1 with 12 2 with 13 etc to 11 with 22

The combined fractions were dried down and reconstituted in 25 microL of 01 formic acid

RP nanoLC separation

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN Injections consisted of 5 microL from each fraction onto an Agilent Technologies

Zorbax 300 SB-C18 5 microm 5 x 03 mm trap cartridge (Santa Clara CA) at a flow rate of 5

microLmin for 5 minutes at 95 A 5 B Separation was performed on a Waters 3 microm Atlantis

dC18 75 microm x 150 mm (Milford MA) using a gradient from 5 to 45 mobile phase B at 250

nLmin over 90 minutes at room temperature Emitter tips were pulled from 75 microm inner

diameter 360 microm outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using

an in-house model P-2000 laser puller (Sutter Instrument Co Novato CA)

Mass spectrometry data acquisitions

An ion-trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with an on-line nanospray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Eksigent nanoLC software (Dublin CA Version 30 Beta Build 080715) for MS acquisition for

all experiments Smart parameter setting (SPS) was set to 700 mz compound stability and trap

drive level was set at 100 Optimization of the nanospray source resulted in dry gas

temperature of 125oC dry gas flow of 40 Lmin capillary voltage of -1300 V end plate offset of

-500 V MSMS fragmentation amplitude of 10V and SmartFragmentation set at 30-300

148

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions An additional mode of MSMS fragmentation electron transfer dissociation

(ETD) was triggered on the precursor ion when a neutral loss was observed in CID

fragmentation from phosphoric acid H3PO4 (Δmz of 49 and 327 for 2+ and 3+ charge states

respectively) or for metaphosphoric acid (HPO3) (Δmz of 40 and 267 for 2+ and 3+ charge

states respectively) For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms rolling average 2 acquisition range of 300-1500 mz

and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS generation the ICC target

was set to 300000 maximum accumulation time 5000 ms two spectral averages acquisition

range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required to prevent artificial data

reduction Identification of peptides were performed using Mascot40

(Version 23 Matrix

Science London UK) Database searching was performed against SwissProt Saccharomyces

cerevisiae database (version 575) Mascot search parameters were as follows Allowed missed

cleavages 2 enzyme trypsin fixed modification carboxymethylation (C) variable

modifications oxidation (M) and phosphorylation (STY) peptide tolerance plusmn12 Da maximum

number of 13

C 1 MSMS tolerance peptide decoy (Mascot) checked plusmn05 Da instrument type

149

ESI-Trap Results from the Mascot search was exported as DAT files for analysis by Scaffold 3

and Scaffold PTM

Scaffold and Ascore data processing

Scaffold 3 (Proteome Software Portland OR)was used for MSMS proteomics data

comparison and false discovery rate analysis The DAT files were loaded into Scaffold 3 and

the fractions for the two dimensional fractionation were combined The resulting biological

triplicates for starved and glucose fed yeast were filtered with a 1 false discovery rate (FDR)

on the protein level (Supplemental Table 1) Due to the inherent poor fragmentation of

phosphopeptides a lenient FDR level may still yield biologically relevant data and thus a 54

FDR was generated for phosphopeptides (Supplemental Table 2) For a more conservative list of

phosphopeptides with a 06 FDR was generated from Scaffold 3 (Supplemental Table 3) FDR

analysis is sufficient at preventing poor data from being reported but does not prevent false

phosphosite identification in phosphopeptides To provide confidence in site identification

Scaffold PTM was used to perform Ascore41

analysis Ascore uses an algorithm to score the

probability of the phosphosite from a phosphopeptide identified by a database searching

algorithm such as Mascot Ascore can be freely accessed at httpascoremedharvardedu

Cell collection RNA isolation and microarray data analysis

All experiments were performed in biological duplicates Cell samples (10 ODU) were

taken at starved and fed states spun at 5000xg for 2 minutes at 30 oC the supernatant was

removed and the pellets snap frozen using liquid nitrogen RNA was isolated using the Epicentre

MasterPure Yeast Purification Kit (Epicentre Technologies) and quality was checked using gel

electrophoresis cRNA synthesis was carried out using the GeneChipreg Expression 3

Amplification One-Cycle Target Labeling and Control Reagents kit (Affymetrix) All

150

experiments followed the manufactures instructions cRNA samples were hybridized to

GeneChip Yeast Genome 20 Arrays for 16 hours Arrays were washed stained and scanned

according the manufactures recommendations Affymetrix CEL files were RMA normalized

with R and the Bioconductor Suite Data analysis was performed within TIGR Multiexperiment

Viewer v451 in-house Perl scripting R and Bioconductor

Results

Sample preparation for shotgun proteomics

As discussed in the introduction the purpose of this study is to provide an exploratory list

of qualitative differences in the phosphoproteome between starved vs glucose fed yeast After

yeast cell lysate production a substantial amount of sample preparation is performed to enhance

the depth of the phosphoproteome coverage A workflow of the steps following cell lysis is

outlined in Figure 1A The yeast proteins are isolated via acetone precipitation followed by

digestion with trypsin to produce peptides Phosphopeptides are intermixed with the entire

tryptic peptide sample and therefore Fe-NTA IMAC is used to enrich the phosphopeptides To

improve upon the number of phosphopeptides we then performed two dimensional separation

with high pH RP followed by low pH RP C18 and infused the eluent directly onto an ion trap

mass spectrometer Figure 1B show an improved technique for the first dimension of separation

to combine the early eluting and late eluting fractions from the first phase of separation to reduce

overall MS analysis time The combination of fractions 1 and 12 2 and 13 etc essentially

improves the duty cycle of the mass spectrometer by ensuring sufficient peptide content is

injected onto a low pH nanoLC RP C18 column

ETD-triggered mass spectrometry

151

In the present study labile phosphorylation can lead to non-informative neutral loss

During MS scanning mode the instrument will choose the 6 most abundant peaks with correct

isotopic distribution and select them for MSMS CID fragmentation During CID fragmentation

it is common for labile PTMs such as phosphorylation to undergo neutral loss and thus limited

informative b and y-type ions are formed Alternatively ETD fragmentation can be used on

specific MSMS scans which produce a neutral loss indicative of phosphorylation (98Daz or

80Daz) The subsequent ETD fragmentation on these putative phosphopeptides will lead to

uniform backbone cleavage resulting in confident identification of phosphopeptides with site-

specific localization during MSMS It is important to note that CID fragmentation still produces

very informative fragmentation for phosphorylation but ETD provides an orthogonal

fragmentation pathway to further increase the phosphoproteome coverage Additionally the

duty cycle for ETD fragmentation is roughly half that of CID therefore about half as many

potential peptides would be fragmented and sequenced if the instrument was using ETD

fragmentation exclusively

Protein Data

Even with phosphopeptide enrichment it is inevitable that non-phosphopeptides will also

be identified All data were searched with Mascot and in total over 1000 proteins were identified

with a 1 FDR rate and are listed in Supplemental Table 1 The proteins listed in Supplemental

Table 1 include both phosphoproteins and non-phosphorylated proteins In addition to the

proteins identified in the fed and starved states the unique peptides and spectral counts are also

listed for reference The phosphopeptides are specifically listed with a 54 FDR rate in

Supplemental Table 2 and comparisons between starved and fed with Ascore analysis are listed

for every phosphopeptide identified A higher confidence of phosphopeptide identification is

152

sometimes required before investing in time consuming biological experiments so a list of

phosphopeptides identified in starved and fed was taken from a setting directing Scaffold to

produce a 05 FDR which resulted in the generated FDR of 06 FDR and listed in

Supplemental Table 3

A summary of specific phosphorylation sites are listed in Table 1 with a 06 FDR and

Table 2 with a 54 FDR Each table gives totals for unique phosphopeptides identified having

an Ascore localization score ge80 without Ascore and phosphorylation events on each unique

peptides As expected the majority of phosphorylation events (over 50) occurred on serine

whereas fewer phosphorylation events (~10) occurred on tyrosine In addition the vast

majority of phosphorylation events were single phosphorylation (ge65) with very few

identifications having more than two phosphosites per peptide For specific phosphopeptide

identification and subsequent phosphoprotein identification refer to Supplemental Table 2 and 3

Discussion

Transcriptional response to glucose feeding

Yeast responds to the repletion of glucose after glucose-depletion by broad

transcriptional changes to induce growth Roughly one-third of the yeast genome is altered by at

least two fold when glucose is introduced as shown in Figure 2 Figure 2 is a heat map from a

microarray analysis showing 1601 genes up regulated and 1457 genes down regulated after

addition of glucose compared to the starved state The arbitrary cut-offs for these values were as

follows two fold induction after nutrient repletion and a false discovery rate (q-value) of 001

Red color in the heat map indicates a particular gene is up-regulated in the fed state compared to

the starved state Alternatively genes coded in green are less expressed in the fed state

compared to the starved condition The intensity of the green or red colors is indicative of the

153

intensity of the fold change in gene expression These large transcriptional changes after glucose

repletion drive and complement the current phosphoproteomic study

PKA motif analysis

One benefit of a large scale phosphoproteomics experiment is to examine the different

phosphopeptides from known motifs for specific kinases PKA and TOR are responsible for the

majority of the transcriptional response and thus PKA is a good target for motif analysis Figure

3 shows the PKA motif sequence xxxRKRKxSxxxxxx with the phosphorylation occurring on

the serine Other than F16P (fructose-16-bisphosphatase) all the proteins listed are unique to the

starved or fed samples A motif sequence will inevitably show up by random chance in any

analysis For this study the control for motif analysis uses the swissprot protein list for the

entire yeast proteome for the background Compared to background this specific PKA kinase

from Figure 3 is up-regulated by 264 fold when compared to the background One interesting

protein emerged from this motif analysis in the fed sample but not the starved sample is

Ssd1which is implicated in the control of the cell cycle in G1 phase42

Ssd1 also is

phosphorylated by Cbk1 and Cbk1 has been proposed to inhibit Ssd143

and provides an

intriguing target for future studies on starved vs glucose fed yeast growth

Localization of the phosphorylation sites

When a phosphopeptide contains any number of serine threonine or tyrosine amino

acids the localization of the phosphosite can sometimes be ambiguous Database searches used

to identify peptides like Mascot do not provide any probability for localization of correct

phosphorylation sites Ascore is an algorithm that uses the same MSMS spectra as Mascot but

instead of comparing theoretical peptide MSMS to experimental spectra Ascore searches for

informative a b or y-type peptide fragments giving evidence for phosphosite The Scaffold

154

program adds a localization probability to the Ascore values and the values are listed in

Supplemental Table 2 and 3 Figure 4 has a representative Ascore mass spectra assigning the

peaks identified and providing evidence that the phosphorylation site occurs at the threonine

instead of the serine Incorporating Ascore into this study provides a layer of validation for

putative phosphosite identification

Plasma Membrane 2-ATPase

A previous study identified and localized phosphorylation sites on plasma membrane 1-

ATPase after glucose was introduced to starved yeast15

In the current study PMA2 (plasma

membrane ATPase 2) was identified in glucose fed and not starved samples The doubly

threonine phosphorylated peptide identified in fed yeast is PMA2 with amino acid sequence

IKMLTGDAVGIAKETCR (583-599) whereas the homologous peptide in PMA1 has the exact

same amino acid sequence except for the first isoleucine substituted for valine

VKMLTGDAVGIAKETCR (554-570 not observed in data) PMA2 was observed in the 06

FDR list of phosphopeptides and has an Ascore localization probably of 100 A plant study

showed that PMA2 phosphorylation level was higher in early growth phase than when in

stationary phase44

In addition PMA2 expression in yeast permits the growth of yeast and

threonine phosphorylation has been reported on Thr-95545

The identification of PMA2 in the

fed glucose cell extract provides an interesting target for future study on the molecular

mechanisms involved in regulation growth in starved vs glucose fed yeast

Conclusion

In conclusion this work provides a qualitative comparison in the phosphoproteome

between starved and glucose fed yeast A large scale IMAC enrichment of yeast cell lysate

followed by 2-D separation provided a rich source of phosphopeptides for neutral loss triggered

155

ETD analysis using mass spectrometry A specific motif for PKA is highlighted to show the

differences in proteins identified between starved vs fed conditions In total 477 unique

phosphopeptides are identified with 06 FDR and 669 unique phosphopeptides identified with

54 FDR Phosphosite validation is performed using a localization algorithm Ascore to

provide further confidence on the site-specific characterization of these phosphopeptides The

proteins Ssd1 and PMA2 emerged as potential intriguing targets for more in-depth studies on

protein phosphorylation involved in glucose response

Supplemental Tables 1 2 and 3 are available upon request

References

1 Martinez M J Roy S Archuletta A B Wentzell P D Anna-Arriola S S

Rodriguez A L Aragon A D Quinones G A Allen C Werner-Washburne M Genomic

analysis of stationary-phase and exit in Saccharomyces cerevisiae gene expression and

identification of novel essential genes Mol Biol Cell 2004 15 (12) 5295-305

2 Radonjic M Andrau J C Lijnzaad P Kemmeren P Kockelkorn T T van Leenen

D van Berkum N L Holstege F C Genome-wide analyses reveal RNA polymerase II

located upstream of genes poised for rapid response upon S cerevisiae stationary phase exit Mol

Cell 2005 18 (2) 171-83

3 Slattery M G Heideman W Coordinated regulation of growth genes in

Saccharomyces cerevisiae Cell Cycle 2007 6 (10) 1210-9

4 Wang Y Pierce M Schneper L GAtildefrac14ldal C G k e Zhang X Tavazoie S

Broach J R Ras and Gpa2 Mediate One Branch of a Redundant Glucose Signaling Pathway in

Yeast PLoS Biol 2004 2 (5) e128

5 Newcomb L L Hall D D Heideman W AZF1 is a glucose-dependent positive

regulator of CLN3 transcription in Saccharomyces cerevisiae Mol Cell Biol 2002 22 (5) 1607-

14

6 Newcomb L L Diderich J A Slattery M G Heideman W Glucose regulation of

Saccharomyces cerevisiae cell cycle genes Eukaryot Cell 2003 2 (1) 143-9

7 Carlson M Glucose repression in yeast Curr Opin Microbiol 1999 2 (2) 202-7

8 Gancedo J M Yeast carbon catabolite repression Microbiol Mol Biol Rev 1998 62

(2) 334-61

9 Johnston M Feasting fasting and fermenting Glucose sensing in yeast and other cells

Trends Genet 1999 15 (1) 29-33

156

10 Warner J R The economics of ribosome biosynthesis in yeast Trends Biochem Sci

1999 24 (11) 437-40

11 Li X Gerber S A Rudner A D Beausoleil S A Haas W Villen J Elias J E

Gygi S P Large-scale phosphorylation analysis of alpha-factor-arrested Saccharomyces

cerevisiae J Proteome Res 2007 6 (3) 1190-7

12 Ficarro S B McCleland M L Stukenberg P T Burke D J Ross M M

Shabanowitz J Hunt D F White F M Phosphoproteome analysis by mass spectrometry and

its application to Saccharomyces cerevisiae Nat Biotechnol 2002 20 (3) 301-5

13 Gruhler A Olsen J V Mohammed S Mortensen P Faergeman N J Mann M

Jensen O N Quantitative phosphoproteomics applied to the yeast pheromone signaling

pathway Mol Cell Proteomics 2005 4 (3) 310-27

14 Peng J Schwartz D Elias J E Thoreen C C Cheng D Marsischky G Roelofs

J Finley D Gygi S P A proteomics approach to understanding protein ubiquitination Nat

Biotechnol 2003 21 (8) 921-6

15 Lecchi S Nelson C J Allen K E Swaney D L Thompson K L Coon J J

Sussman M R Slayman C W Tandem phosphorylation of Ser-911 and Thr-912 at the C

terminus of yeast plasma membrane H+-ATPase leads to glucose-dependent activation J Biol

Chem 2007 282 (49) 35471-81

16 Cohen P The regulation of protein function by multisite phosphorylation--a 25 year

update Trends Biochem Sci 2000 25 (12) 596-601

17 Kalume D E Molina H Pandey A Tackling the phosphoproteome tools and

strategies Curr Opin Chem Biol 2003 7 (1) 64-9

18 Nagaraj N DSouza R C Cox J Olsen J V Mann M Feasibility of large-scale

phosphoproteomics with higher energy collisional dissociation fragmentation J Proteome Res

2010 9 (12) 6786-94

19 Olsen J V Vermeulen M Santamaria A Kumar C Miller M L Jensen L J

Gnad F Cox J Jensen T S Nigg E A Brunak S Mann M Quantitative

phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis Sci

Signal 2010 3 (104) ra3

20 Breitkopf S B Asara J M Determining In Vivo Phosphorylation Sites Using Mass

Spectrometry In Current Protocols in Molecular Biology John Wiley amp Sons Inc 2012

21 Steen H Jebanathirajah J A Rush J Morrice N Kirschner M W Phosphorylation

analysis by mass spectrometry myths facts and the consequences for qualitative and

quantitative measurements Mol Cell Proteomics 2006 5 (1) 172-81

22 Kweon H K Hakansson K Metal oxide-based enrichment combined with gas-phase

ion-electron reactions for improved mass spectrometric characterization of protein

phosphorylation J Proteome Res 2008 7 (2) 749-55

23 Larsen M R Thingholm T E Jensen O N Roepstorff P Jorgensen T J Highly

selective enrichment of phosphorylated peptides from peptide mixtures using titanium dioxide

microcolumns Mol Cell Proteomics 2005 4 (7) 873-86

24 Kokubu M Ishihama Y Sato T Nagasu T Oda Y Specificity of immobilized

metal affinity-based IMACC18 tip enrichment of phosphopeptides for protein phosphorylation

analysis Anal Chem 2005 77 (16) 5144-54

25 Swaney D L Wenger C D Thomson J A Coon J J Human embryonic stem cell

phosphoproteome revealed by electron transfer dissociation tandem mass spectrometry Proc

Natl Acad Sci U S A 2009 106 (4) 995-1000

157

26 Hao P Guo T Sze S K Simultaneous analysis of proteome phospho- and

glycoproteome of rat kidney tissue with electrostatic repulsion hydrophilic interaction

chromatography PLoS One 2011 6 (2) e16884

27 Rush J Moritz A Lee K A Guo A Goss V L Spek E J Zhang H Zha X

M Polakiewicz R D Comb M J Immunoaffinity profiling of tyrosine phosphorylation in

cancer cells Nat Biotechnol 2005 23 (1) 94-101

28 Ficarro S Chertihin O Westbrook V A White F Jayes F Kalab P Marto J A

Shabanowitz J Herr J C Hunt D F Visconti P E Phosphoproteome analysis of

capacitated human sperm Evidence of tyrosine phosphorylation of a kinase-anchoring protein 3

and valosin-containing proteinp97 during capacitation J Biol Chem 2003 278 (13) 11579-89

29 Washburn M P Wolters D Yates J R 3rd Large-scale analysis of the yeast

proteome by multidimensional protein identification technology Nat Biotechnol 2001 19 (3)

242-7

30 Dowell J A Frost D C Zhang J Li L Comparison of two-dimensional

fractionation techniques for shotgun proteomics Anal Chem 2008 80 (17) 6715-23

31 Song C Ye M Han G Jiang X Wang F Yu Z Chen R Zou H Reversed-

phase-reversed-phase liquid chromatography approach with high orthogonality for

multidimensional separation of phosphopeptides Anal Chem 2010 82 (1) 53-6

32 Palumbo A M Smith S A Kalcic C L Dantus M Stemmer P M Reid G E

Tandem mass spectrometry strategies for phosphoproteome analysis Mass Spectrom Rev 2011

30 (4) 600-25

33 Beausoleil S A Jedrychowski M Schwartz D Elias J E Villen J Li J Cohn M

A Cantley L C Gygi S P Large-scale characterization of HeLa cell nuclear

phosphoproteins Proc Natl Acad Sci U S A 2004 101 (33) 12130-5

34 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and

protein sequence analysis by electron transfer dissociation mass spectrometry Proc Natl Acad

Sci U S A 2004 101 (26) 9528-33

35 Coon J J Syka J E P Schwartz J C Shabanowitz J Hunt D F Anion

dependence in the partitioning between proton and electron transfer in ionion reactions

International Journal of Mass Spectrometry 2004 236 (1acirceuroldquo3) 33-42

36 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 2011 10 (9) 4219-29

37 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-

site identity and position on electron-transfer dissociation of polypeptide cations J Am Chem Soc

2007 129 (40) 12232-43

38 Coon J J Collisions or electrons Protein sequence analysis in the 21st century Anal

Chem 2009 81 (9) 3208-15

39 Miller M E Cross F R Distinct subcellular localization patterns contribute to

functional specificity of the Cln2 and Cln3 cyclins of Saccharomyces cerevisiae Mol Cell Biol

2000 20 (2) 542-55

40 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

158

41 Beausoleil S A Villen J Gerber S A Rush J Gygi S P A probability-based

approach for high-throughput protein phosphorylation analysis and site localization Nat

Biotechnol 2006 24 (10) 1285-92

42 Sutton A Immanuel D Arndt K T The SIT4 protein phosphatase functions in late

G1 for progression into S phase Mol Cell Biol 1991 11 (4) 2133-48

43 Jansen J M Wanless A G Seidel C W Weiss E L Cbk1 regulation of the RNA-

binding protein Ssd1 integrates cell fate with translational control Curr Biol 2009 19 (24)

2114-20

44 Kanczewska J Marco S Vandermeeren C Maudoux O Rigaud J L Boutry M

Activation of the plant plasma membrane H+-ATPase by phosphorylation and binding of 14-3-3

proteins converts a dimer into a hexamer Proc Natl Acad Sci U S A 2005 102 (33) 11675-80

45 Maudoux O Batoko H Oecking C Gevaert K Vandekerckhove J Boutry M

Morsomme P A plant plasma membrane H+-ATPase expressed in yeast is activated by

phosphorylation at its penultimate residue and binding of 14-3-3 regulatory proteins in the

absence of fusicoccin J Biol Chem 2000 275 (23) 17762-70

159

Figure 1 The general workflow indicating the major steps involved in sample collection

sample processing mass spectrometric data acquisition and analysis of comparative

phosphoproteomics for starved vs glucose fed yeast (A) The high pH RP separation

procedure for combining fractions to reduce low peptide containing fractions from the

UV-VIS trace is also shown (B)

160

Figure 2 Heat map for the global transcriptional response to glucose fed vs starved samples

S288C cells starved for glucose until growth was arrested and subsequently glucose was added

161

Samples for microarray analysis were taken at the starved state and 60 minutes after glucose was

added The heat map shows the fed log2 fold change for each gene relative to the starved state

across the genome performed in biological replicate (A) Black indicates no change in

expression level while red indicates higher expression for the fed relative to the starved state

Green indicates higher expression for the starved state compared to the fed state (Adapted from

Dr Michael Conways Thesis)

162

Figure 3 The motif for protein kinase A (PKA) that was observed for fed vs starved which is

xxxRKRKxSxxxxxx with the phosphorylation occurring on the serine The motif occurred at a

rate 264 fold higher than the yeast proteome used for background In addition one protein was

observed in both starved and fed with accession identification of F16P (Fructose-16-

bisphosphatase)

163

06 FDR phosphopeptide analysis

Table 1 The number of phosphopeptides identified with a 06 FDR rate for starved and

glucose fed yeast Only unique phosphopeptides are included in the numbers listed Ascore

localization probability ge80 and no Ascore value which totals all phosphopeptides regardless

of Ascore value even if the Ascore localization value was 0 are shown for starved or fed

samples The number of phosphorylation events on each unique peptide is also included The

vast majority of the phosphopeptides identified have a single phosphosite

Starved Fed All

Ascore ge80 score

unique

STY 164 153 317

S 87 (530) 82 (536) 169 (533)

T 60 (366) 55 (359) 115 (363)

Y 17 (104) 16 (105) 33 (104)

Unique no Ascore

STY 242 235 477

S 131 (541) 133 (566) 264 (553)

T 86 (355) 78 (332) 164 (344)

Y 25 (103) 24 (102) 49 (103)

Phosphorylation events

on each unique peptide

1 102 113 187

2 36 40 68

3 12 11 22

4 or more 8 3 11

164

54 FDR phosphopeptide analysis

Starved Fed All

Ascore ge80 score

unique

STY 217 217 434

S 115 (530) 113 (521) 228 (525)

T 78 (359) 78 (359) 156 (359)

Y 24 (111) 26 (120) 50 (115)

Unique no Ascore

STY 337 332 669

S 193 (573) 180 (542) 373 (558)

T 111 (329) 116 (349) 227 (339)

Y

Phosphorylation events

on each unique peptide

1

2

3

4 or more

33 (98)

135

56

16

11

36 (108)

169

55

14

3

69 (103)

280

100

27

13

Table 2 The number of phosphopeptides identified with a 54 FDR rate for starved and

glucose fed yeast Only unique phosphopeptides are included in the numbers listed Ascore

localization probability ge80 and no Ascore value which totals all phosphopeptides regardless

of Ascore value even if the Ascore localization value was 0 are shown for starved or fed

samples The number of phosphorylation events on each unique peptide is also included The

vast majority of the phosphopeptides identified have a single phosphosite

165

Figure 4 Ascore validation of the phosphopeptide LSLAtKK from the protein SGM1 (slow

growth on galactose and mannose protein 1) with 100 localization probability observed

in only fed but not starved yeast samples Ascorersquos algorithm identifies a b and y-type

ions and looks to identify peaks that provide evidence for a specific phosphorylation site

For this example Ascore assigns this phosphosite to threonine (T5) instead of the serine

(S2) The intense peaks at 4024 mz and 5250 mz are not identified by any a b or y-

type ions From the ladder sequence of the peptide shown numerous ions indicate the

threonine is phosphorylated while the serine is not Among these ions used for

localization are b2 y2 y5+H2O y3 y4 and y5

166

Chapter 6

Use of electron transfer dissociation for neuropeptide sequencing and

identification

Adapted from ldquoDiscovery and Characterization of the Crustacean Hyperglycemic Hormone

Precursor Related Peptides (CPRP) and Orcokinin Neuropeptides in the Sinus Glands of the Blue

Crab Callinectes sapidus Using Multiple Tandem Mass Spectrometry Techniquesrdquo Hui L

Cunningham R Zhang Z Cao W Jia C Li L J Proteome Res 2011 10(9) 4219-29

167

Abstract

The crustacean Callinectes sapidus sinus gland (SG) is a well-defined neuroendocrine site that

produces numerous hemolymph-borne agents including the most complex class of endocrine

signaling moleculesmdashneuropeptides One such peptide is the crustacean hyperglycemic hormone

(CHH) precursor-related peptide (CPRPs) which was not previously sequenced Electron

transfer dissociation (ETD) was used for sequencing of intact CPRPs due to their large size and

high charge state In addition ETD was used to sequence the sulfated peptide Cholecystokinin

CCK-like Homarus americanus using a salt adduct Collectively these two examples

demonstrated the utility of ETD in sequencing neuropeptides with large molecular weight or

with labile modifications

168

Introduction

Neuropeptides are the largest and most diverse group of endocrine signaling molecules in

the nervous system They are necessary and critical for initiation and regulation of numerous

physiological processes such as feeding reproduction and development1 2

Mass spectrometry

(MS) with unique advantages such as high sensitivity high throughput chemical specificity and

the capability of de novo sequencing with limited genomic information is well suited for the

detection and sequencing of neuropeptides in endocrine glands and simultaneously provides the

potential for information on post-translational modifications such as sulfonation3-6

The sinus glands (SG) are located between the medulla interna and medulla externa of the

eyestalk in Callinectes sapidus It is a well-known neuroendocrine site that synthesizes and

secretes peptide hormones regulating various physiological activities such as molting

hemolymph glucose levels integument color changes eye pigment movements and

hydro-mineral balance7 Our labrsquos previous neuropeptidomic studies of SG in several

crustacean species including Cancer borealis8-11

Carcinus maenas12

and Homarus americanus13

14 used multiple MS strategies combining MALDI-based high resolution accurate mass profiling

biochemical derivatization and nanoscale separation coupled to tandem MS for de novo

sequencing In the current study we explore the neuropeptidome of the SG in the blue crab

Callinectes sapidus a vital species of economic importance on the seafood market worldwide In

total 70 neuropeptides are identified including 27 novel neuropeptides and among them the

crustacean hyperglycemic hormone precursor-related peptides (CPRPs) and orcokinins represent

major neuropeptide families known in the SG

The crustacean hyperglycemic hormone (CHH) and its precursor-related peptide (CPRP) are

produced concurrently during the cleavage of CHH from the CHH preprohormone protein15

The

169

CPRP peptide is located between the signal peptide and the CHH sequence and is separated from

the CHH by a dibasic cleavage site The functions of CPRPs are currently unknown16

However

the complete characterization of CPRPs provides a foundation for future experiments elucidating

their functional roles in crustaceans The cDNA of CHH preprohormone in SG of Callinectes

sapidus has been characterized17

but the direct detection of CPRP has not been reported due to

its relatively large size and possible post-translational modifications While small fragments of

CPRPs can be identified using nanoLC-ESI-Q-TOF tandem mass spectrometry the intact

peptide is difficult to detect due to the large molecular weight of CPRPs

Peptidomics traditionally uses collision induced dissociation (CID) for tandem MS

experiments for de novo sequencing Recently an alternative peptide fragmentation method has

been developed using the transfer of an electron termed electron transfer dissociation (ETD)18 19

ETD involves a radical anion with low electron affinity to be transferred to peptide cation which

results in reduced sequence discrimination and thus provides improved sequencing for larger

peptides compared to CID20

Specifically for an ion trap ETD the fluoranthene radical anion is

allowed to react with peptide cations in the three dimensional trap The resulting dissociation

cleaves at the N-Cα bond to produce c and zbull type product ions The peptidome of model

organisms like Callinectes sapidus can be further explored and expanded utilizing ETD as a

complementary fragmentation technique to CID Previous peptidomic analysis has been

completed using ETD as an additional fragmentation method21

It was observed that

enzymatically produced peptides with a higher mz produced improved sequence coverage using

ETD This observation termed decision tree analysis determined that a charge state of ge6 all

peptides endogenous or enzymatic should be fragmented via ETD22

In the present study the

highly charged peptide CPRP with an intact molecular weight of 3837 readily produces plus six

170

charge states The higher charge state of CPRP is highly amenable to fragmentation by ETD

which produces remarkably improved fragmentation and thus increased sequence coverage when

compared to CID

Sulfonation of tyrosine is a post-translational modification (PTM) that is thought to occur on

trans-membrane spanning and secreted proteins23

Cholecystokinin-8 (CCK-8) is a sulfated

peptide which has been linked to satiety24-26

and a CCK-like peptide has been observed in

lobster27

Sulfonation is an extremely labile modification and is often lost during soft

ionization such as ESI or MALDI but can be avoided by optimizing the ionization process28

One potential strategy to identify sulfopeptides is to scan for a neutral loss of 80 Da after CID

but this method does not allow for identification of site of sulfonation and has the risk to be

mistaken for phosphorylation The sulfonation PTM is a negatively charged modification on

the peptide which allows for negative ion scanning in the mass spectrometer but provides

minimal MSMS sequence coverage during CID due to preferential loss of the sulfate group

Alternatively electron-based dissociation technique enables more rapid radical driven

fragmentation where the cleavage pattern is more uniform along the peptide backbone without

initially cleaving the labile sulfated group thus preserving the site of modification These types

of dissociation techniques only work for multiply-charged ions thus a method to install a

multiply-charged cation on the target peptide is desirable It has been shown that the electron

capture dissociation (ECD) can be used to sequence sulfated peptides when a multi-charged

cation is added to the solution29

We use a similar multi-charge cation solution technique to

sequence a sulfated peptide from Homarus americanus via ETD on an ion trap mass

spectrometer Here we presented the use of the ETD fragmentation technique for the analysis

of large peptides and peptides containing labile post-translational modification

171

Experimental Section

Chemical and materials

Acetonitrile methanol magnesium chloride (ACS grade) potassium chloride (ACS grade) and

calcium chloride (ACS grade) were purchased from Fisher Scientific (Pittsburgh PA) Formic

acid was from Sigma-Aldrich (St Louis MO) Human sulfated CCK-8 and CCK-like peptide

(E(pyro)FDEY(Sulfo)GHMRFamide) were purchased from American Peptide (Sunnyvale CA)

Fused-silica capillary with 75 μm id and 360 μm od was purchased from Polymicro

Technologies (Phoenix AZ) Millipore C18 Ziptip column was used for sample cleaning and all

water used in this study was deionized water (182 MΩcm) prepared from a Milli-Q Millipore

system (Billerica MA) The physiological saline consisted of 440 mM NaCl 11mM KCl 26

mM MgCl2 13 mM CaCl2 11 mM Trizma base and 5 mM maleic acid in pH 745

Animals and dissection

Callinectes sapidus (blue crab) were obtained from commercial food market and maintained

without food in artificial sea water at 10-12 ordmC Animals were cold-anesthetized by packing on

ice for 15-30min before dissection The optic ganglia with the SG attached were dissected in

chilled (~4ordmC) physiological saline and individual SGs were isolated from the optic ganglia by

micro-dissection and immediately placed in acidified methanol (90 methanol 9 glacial acetic

acid and 1 water) and stored at -80ordmC until tissue extraction

Tissue homogenization

Acidified methanol was used during the homogenization of SGs The homogenized SGs were

immediately centrifuged at 16100 g using an Eppendorf 5415D tabletop centrifuge (Eppendorf

172

AG) The pellet was washed using acidified methanol and combined with the supernatant and

further dried in a Savant SC 110 SpeedVac concentrator (Thermo Electron Corporation) The

resulting dried sample was re-suspended with a minimum amount (20 microL) of 01 formic acid

Fractionation of homogenates using reversed phase (RP)-HPLC

The re-suspended extracts were then vortexed and briefly centrifuged The resulting supernatants

were subsequently fractionated via high performance liquid chromatography (HPLC) HPLC

separations were performed using a Rainin Dynamax HPLC system equipped with a Dynamax

UV-D II absorbance detector (Rainin Instrument Inc Woburn MA) The mobile phases included

Solution A (deionized water containing 01 formic acid) and Solution B (acetonitrile containing

01 formic acid) About 20 μl of extract was injected onto a Macrosphere C18 column (21 mm

id x 250 mm length 5 microm particle size Alltech Assoc Inc Deerfield IL) The separation

consisted of a 120 minute gradient of 5-95 Solution B Fractions were automatically collected

every two minutes using a Rainin Dynamax FC-4 fraction collector (Rainin Instrument Inc

Woburn MA) The fractions were then concentrated using in Savant SC 110 SpeedVac

concentrator (Thermo Electron Corporation) and re-suspended with minimum amount of 01

formic acid

Nano-LC-ESI-Q-TOF MSMS

Nanoscale LC-ESI-Q-TOF MSMS was performed using a Waters nanoAcquity UPLC system

coupled to a Q-TOF Micro mass spectrometer (Waters Corp Milford MA)

Chromatographic separations were performed on a homemade C18 reversed phase capillary

column (75 microm internal diameter x100 mm length 3 microm particle size 100 Aring) The mobile phases

173

used were 01 formic acid in deionized water (A) 01 formic acid in acetonitrile (B) An

aliquot of 50 microl of a tissue extract or HPLC fraction was injected and loaded onto the trap

column (Zorbax 300SB-C18 Nano trapping column Agilent Technologies Santa Clara CA)

using 95 mobile phase A and 5 mobile phase Bat a flow rate of 10 microlmin for 10 minutes

Following this the stream select module was switched to a position at which the trap column

came in line with the analytical capillary column and a linear gradient of mobile phases A and B

was initiated For neuropeptides the linear gradient was from 5 buffer B to 45 buffer B over

90 min The nanoflow ESI source conditions were set as follows capillary voltage 3200 V

sample cone voltage 35 V extraction cone voltage 1 V source temperature 100ordmC A data

dependent acquisition was employed for the MS survey scan and the selection of three precursor

ions and subsequent MSMS of the selected parent ions The MS scan range was from mz

400-1800 and the MSMS scan was from mz 50-1800

Peptide Prediction De Novo Sequencing and Database Searching

De novo sequencing was performed using a combination of MassLynxTM

41 PepSeq software

(Waters) and manual sequencing Tandem mass spectra acquired from the Q-TOF were first

deconvoluted using MaxEnt 3 software (Waters) to convert multiply charged ions into their

singly charged forms The resulting spectra were pasted into the PepSeq window for sequencing

analysis The candidate sequences generated by the PepSeq software were compared and

evaluated for homology with previous known peptides The online program blastp (National

Center for Biotechnology Information Bethesda MD httpwwwncbinlmnihgovBLAST)

was used to search the existing NCBI crustacean protein database using the candidate peptide

sequences as queries For all searches the blastp database was set to non-redundant protein

174

sequences (ie nr) and restricted to crustacean sequences (ie taxid 6657) For each of the

proteins putatively identified via blastp the BLAST score and BLAST-generated E-value for

significant alignment are provided in the appropriate subsection of the results Peptides with

partial sequence homology were selected for further examination by comparing theoretical

MSMS fragmentation spectra generated by PepSeq with the raw MSMS spectra If the

fragmentation patterns did not match well manual sequencing was performed

NanoLC Coupled to MSMS by CID and ETD

Setup for RP nanoLC separation

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN (Fisher Scientific Optima LCMS grade Fair Lawn NJ) Injections

consisted of 5 microL of prepared tissue extract onto an Agilent Technologies Zorbax 300 SB-C18 5

microm 5 x 03 mm trap cartridge (Santa Clara CA) at a flow rate of 5 microLmin for 5 minutes at 95

A5 B Separation was performed on a Waters HPLC column with 3 microm Atlantis dC18 75 microm

x 150 mm (Milford MA) on a gradient from 5 to 45 mobile phase B at 250 nLmin over 90

minutes at room temperature Emitter tips were pulled from 75 microm inner diameter 360 microm

outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using a commercial

laser puller model P-2000 (Sutter Instrument Co Novato CA)

Mass Spectrometry Data Acquisitions for Collision Induced Dissociation (CID)

An ion-trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany) equipped

with an on-line nanospray source was used for mass spectrometry data acquisition Hystar

(Version 32 Bruker Daltonics Bremen Germany) was used to couple and control Eksigent

175

nanoLC software (Dublin CA Version 30 Beta Build 080715) to MS acquisition for all

experiments Smart parameter setting (SPS) was set to mz 700 compound stability and trap

drive level were set at 100 Optimization of the nanospray source resulted in dry gas

temperature 125oC dry gas 40 Lmin capillary voltage -1300 V end plate offset -500 V

MSMS fragmentation amplitude 10V and SmartFragmentation set at 30-300

Data was generated in data dependent mode with strict active exclusion set after two spectra and

released after one minute MSMS was obtained via CID fragmentation for the six most

abundant MS peaks with a preference for doubly charged ions and excluded singly charged ions

For MS generation the ion charge control (ICC) target was set to 200000 maximum

accumulation time 5000 ms four spectral average acquisition range of mz 300-1500 and scan

speed (enhanced resolution) of 8100 mz s-1

For MSMS generation the ICC target was set to

200000 maximum accumulation time 5000 ms three spectral averages acquisition range of

mz 100-2000 and scan speed (Ultrascan) of 32000 mz s-1

Mass Spectrometry Data Acquisitions for Electron Transfer Dissociation (ETD)

The amaZon ETD ion-trap mass spectrometer was operated in electron transfer dissociation for

MSMS fragmentation with the same optimized settings as reported for CID unless otherwise

stated Smart parameter setting (SPS) was set to 500 mz compound stability and trap drive

level were set at 100 MSMS was obtained via ETD fragmentation for the four most

abundant MS peaks with no preference for specifically charged ions except to exclude singly

charged ions The ETD reagent parameters were set to 400000 target ICC for the fluoranthene

radical anion The fluoranthene radical anion has an mz of 210 and therefore the 210 mz value

was removed from the resulting MSMS spectra and 160 mz was set for the MSMS low mz

cut-off

176

Direct Infusion of Sulfated Peptide Standards into the Mass Spectrometer using ETD and

CID Fragmentation

The sulfated peptide standards were infused into the mass spectrometer at a flow rate of 300

nlmin using a fused-silica capillary with 75 μm id and 360 μm od coupled to a custom pulled

tip The CCK-8 human peptide (20 microM) was infused in 5050 ACNH2O and detected in

negative ionization mode with an ICC of 70000 and fragmented with CID using the same

settings as the previous CID experiments Alternatively the CCK-like lobster sulfated peptide

(2 microM) was mixed with 30 microM of different salts (MgCl2 KCl or CaCl2) and 01 formic acid in

5050 ACNH2O The plus three charge of the CCK-like lobster peptide was selected for ETD

fragmentation in positive mode with the same setting as the previous ETD experiments The

data were then de novo sequenced for coverage and localization of the sulfation site

Data Analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen Germany)

Deviations in parameters from the default Protein Analysis in DataAnalysis were as follows

intensity threshold 1000 maximum number of compounds 1E7 and retention time window 05

minutes These parameter changes assisted in providing better quality spectra for sequencing

Identification of peptides was performed using Mascot (Version 23 Matrix Science London

UK) Searches were performed against a custom crustacean database none enzyme allow up to

1 missed cleavage amidated C-terminal as variable modifications peptide tolerance mass error

12 Da MSMS mass error tolerance is 06 Da

Results and Discussion

177

Identification and Characterization of Intact CPRPs Using ETD

Using ESI-Q-TOF 9 truncated CPRPs fragments were sequenced to cover the amino acid

sequence of the intact CPRP peptide RSAEGLGRMGRLLASLKSDTVTPLRGFEGETGHPLE

A representative CID MSMS spectrum of ASLKSDTVTPLR is shown in Figure 1 (a) CID

using ESI-Q-TOF mainly produces fragmentation for doubly and triply charge molecules which

is adequate to sequence peptides with relatively small molecular weight (less than 2000 Da)

However CID fragmentation is insufficient to sequence the larger intact CPRP in a complex

sample whose molecular mass is 3837 Da and therefore it is extremely difficult to directly

sequence by traditional CID ESI-Q-TOF ETD is an attractive fragmentation alternative to

sequence the highly charged CPRP peptide In ETD a protonated peptide reacts with an anion

(fluoranthene radical) where the anion transfers an electron to the peptide cation and the resulting

fragmentation occurs along the N-Cα bond by free-radical-driven-cleavage The large size of

CPRP offers multiple basic amino acid sites to sequester charge and reduce the sequence

coverage from collision induced dissociate by preventing random backbone cleavage whereas

ETD does not cleave the weakest bond and provides the alternative fragmentation pathway to

obtain superior sequence coverage to larger peptides As shown in Figure 1 (cd) the

fragmentation of ETD is highly amenable to ETD compared to the CID fragmentation in Figure

1 (b) As evident from comparison ETD offers more extensive fragmentation than CID thus

providing enhanced coverage for large intact peptides such as CPRPs Specifically we observe

125 of the b- and y- cleavages for CID as compared to an average of 535 of c- and zbull-

fragments More than a four-fold increase in fragments using ETD suggests the utility of this

relatively new tandem MS fragmentation method as complementary techniques for de novo

sequencing of large intact endogenous peptides and novel neuropeptide discovery endeavors

178

Negative Mode Sulfated Peptide Identification

An accepted method for identification and quantification for sulfated peptides is negative

ionization30

CCK-8 sulfated peptide standards show intense signal in negative ionization mode

without needing to have additives added such as salts Figure 2 shows the CID MSMS

spectrum from the negative ionization and produces an intense b6 ion at 430 mz The transition

from the doubly negatively charged MS ion of 570 mz to the b6 ion provides a selected reaction

monitoring transition for quantification studies but the sequence information is limited and the

presence of the methionine produces variable oxidation In addition Figure 2 shows the

MSMS product ions with the loss of the sulfate group thus making site-specific location of

sulfation more difficult

Investigation of Cation Salt Adducts to Produce Multiply Charged Sulfated Peptides

Sulfated peptides can be isolated in positive scanning mode but usually only in a plus one

state with low signal intensity If ETD is performed on the singly charged peptide cation a

neutral is formed and is lost in the mass spectrometer and thus no sequence information can be

obtained In order to remedy this situation a technique that adding metal salts to peptides to

increase charge state for ECD used in Fourier transform ion cyclotron resonance mass

spectrometry (FTICR-MS)29

inspired the investigation of increasing charge state of targeted

peptide via metal salt adduction followed by ETD analysis in a Bruker amaZon ion trap

Various salts were investigated including MgCl2 CaCl2 and KCl each used at a concentration of

30 microM and mixed with a sulfated peptide standard at 2 microM The addition of KCl produced

mostly doubly charged ion of the lobster CCK-like peptide standard K adduct which produced

insufficient sequence information from ETD fragmentation (data not shown) In comparison

the MgCl2 and CaCl2 produced intense triply charged peptide ions but CaCl2 produced lower

179

signal intensity compared to MgCl2 (data not shown)

Cation Assisted ETD Fragmentation of CCK-like Lobster Sulfated Peptide and Future

Directions

The addition of MgCl2 to the lobster CCK-like sulfated peptide is shown in Figure 3

Except for z1 the complete z-series is obtained including the z7 ion with and without the

sulfation PTM The spectrum is complicated by the additional Mg adducts but several peaks

are shown without any Mg adduct such as z2 z3 z4 etc Clearly the method of cation

assisted ETD fragmentation can be useful using ion trap instruments and can properly sequence

sulfated peptides that are prone to neutral loss from the labile PTM One concern about future

direction in sulfopeptide isolation in non-genome sequenced species is isolation of sulfopeptides

Currently antibodies to specific sulfopeptides is the only commonly used enrichment technique

for sulfopeptides Also since metal cations are needed for this method direct infusion into an

ESI mass spectrometer is preferred to prevent running large amounts of adduct forming salts

through the LC system With direct infusion the lack of separation confounds the problem in

sulfopeptide detection because any co-eluting peptides could have ionization efficiency and thus

reduce the signal of the sulfopeptide Once discovered and sequenced a selected reaction

monitoring (SRM) method could be developed using LC retention coupled with negative

ionization mode followed by CID MSMS or possibly MSMSMS to perform quantitative

studies for sulfopeptides

Conclusions

In this study ETD was performed to improve the sequence coverage of large endogenous

neuropeptides with higher charge and bigger size The intact CPRP from Callinectes sapidus was

identified and characterized with 68 sequence coverage by MS for the first time Cation

180

assisted ETD fragmentation was also shown to be a powerful tool in de novo sequencing of

sulfated neuropeptides These endeavors into using ETD for certain neuropeptides will assist in

future analysis of large neuropeptides and PTM containing neuropeptides

181

Reference

1 Schwartz M W Woods S C Porte D Jr Seeley R J Baskin D G Central nervous system control of

food intake Nature 2000 404 (6778) 661-71

2 Sweedler J V Li L Rubakhin S S Alexeeva V Dembrow N C Dowling O Jing J Weiss K R

Vilim F S Identification and characterization of the feeding circuit-activating peptides a novel neuropeptide

family of aplysia J Neurosci 2002 22 (17) 7797-808

3 Baggerman G Cerstiaens A De Loof A Schoofs L Peptidomics of the larval Drosophila melanogaster

central nervous system Journal of Biological Chemistry 2002 277 (43) 40368-40374

4 Desiderio D M Mass spectrometric analysis of neuropeptidergic systems in the human pituitary and

cerebrospinal fluid Journal of Chromatography B 1999 731 (1) 3-22

5 Li L J Kelley W P Billimoria C P Christie A E Pulver S R Sweedler J V Marder E Mass

spectrometric investigation of the neuropeptide complement and release in the pericardial organs of the crab Cancer

borealis Journal of Neurochemistry 2003 87 (3) 642-656

6 Li L J Moroz T P Garden R W Floyd P D Weiss K R Sweedler J V Mass spectrometric survey of

interganglionically transported peptides in Aplysia Peptides 1998 19 (8) 1425-1433

7 Strand F L Neuropeptides regulators of physiological processes 1st ed ed MIT Press Cambridge Mass

1999 p 658 p

8 Fu Q Goy M F Li L J Identification of neuropeptides from the decapod crustacean sinus glands using

nanoscale liquid chromatography tandem mass spectrometry Biochemical and Biophysical Research

Communications 2005 337 (3) 765-778

9 Fu Q Christie A E Li L J Mass spectrometric characterization of crustacean hyperglycemic hormone

precursor-related peptides (CPRPs) from the sinus gland of the crab Cancer productus Peptides 2005 26 (11)

2137-2150

10 Ma M M Chen R B Ge Y He H Marshall A G Li L J Combining Bottom-Up and Top-Down Mass

Spectrometric Strategies for De Novo Sequencing of the Crustacean Hyperglycemic Hormone from Cancer borealis

Analytical Chemistry 2009 81 (1) 240-247

11 Ma M M Sturm R M Kutz-Naber K K Fu Q Li L J Immunoaffinity-based mass spectrometric

characterization of the FMRFamide-related peptide family in the pericardial organ of Cancer borealis Biochemical

and Biophysical Research Communications 2009 390 (2) 325-330

12 Ma M M Bors E K Dickinson E S Kwiatkowski M A Sousa G L Henry R P Smith C M Towle

D W Christie A E Li L J Characterization of the Carcinus maenas neuropeptidome by mass spectrometry and

functional genomics General and Comparative Endocrinology 2009 161 (3) 320-334

13 Chen R B Jiang X Y Conaway M C P Mohtashemi I Hui L M Viner R Li L J Mass Spectral

Analysis of Neuropeptide Expression and Distribution in the Nervous System of the Lobster Homarus americanus

Journal of Proteome Research 2010 9 (2) 818-832

14 Ma M M Chen R B Sousa G L Bors E K Kwiatkowski M A Goiney C C Goy M F Christie A

E Li L J Mass spectral characterization of peptide transmittershormones in the nervous system and

neuroendocrine organs of the American lobster Homarus americanus General and Comparative Endocrinology

2008 156 (2) 395-409

15 Ollivaux C Gallois D Amiche M Boscameric M Soyez D Molecular and cellular specificity of

post-translational aminoacyl isomerization in the crustacean hyperglycaemic hormone family FEBS J 2009 276

(17) 4790-802

16 Wilcockson D C Chung J S Webster S G Is crustacean hyperglycaemic hormone precursor-related

peptide a circulating neurohormone in crabs Cell and Tissue Research 2002 307 (1) 129-138

17 Choi C Y Zheng J Watson R D Molecular cloning of a cDNA encoding a crustacean hyperglycemic

hormone from eyestalk ganglia of the blue crab Callinectes sapidus General and Comparative Endocrinology 2006

148 (3) 383-387

18 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and protein sequence analysis

by electron transfer dissociation mass spectrometry Proc Natl Acad Sci U S A 2004 101 (26) 9528-33

19 Coon J J Syka J E P Schwartz J C Shabanowitz J Hunt D F Anion dependence in the partitioning

between proton and electron transfer in ionion reactions International Journal of Mass Spectrometry 2004 236

(1-3) 33-42

20 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-site identity and

position on electron-transfer dissociation of polypeptide cations J Am Chem Soc 2007 129 (40) 12232-43

182

21 Altelaar A F Mohammed S Brans M A Adan R A Heck A J Improved identification of endogenous

peptides from murine nervous tissue by multiplexed peptide extraction methods and multiplexed mass spectrometric

analysis J Proteome Res 2009 8 (2) 870-6

22 Swaney D L McAlister G C Coon J J Decision tree-driven tandem mass spectrometry for shotgun

proteomics Nat Methods 2008 5 (11) 959-64

23 Monigatti F Hekking B Steen H Protein sulfation analysis--A primer Biochim Biophys Acta 2006 1764

(12) 1904-13

24 Chandler P C Wauford P K Oswald K D Maldonado C R Hagan M M Change in CCK-8 response

after diet-induced obesity and MC34-receptor blockade Peptides 2004 25 (2) 299-306

25 Little T J Feltrin K L Horowitz M Meyer J H Wishart J Chapman I M Feinle-Bisset C A

high-fat diet raises fasting plasma CCK but does not affect upper gut motility PYY and ghrelin or energy intake

during CCK-8 infusion in lean men Am J Physiol Regul Integr Comp Physiol 2008 294 (1) R45-51

26 Blevins J E Morton G J Williams D L Caldwell D W Bastian L S Wisse B E Schwartz M W

Baskin D G Forebrain melanocortin signaling enhances the hindbrain satiety response to CCK-8 Am J Physiol

Regul Integr Comp Physiol 2009 296 (3) R476-84

27 Turrigiano G G Selverston A I A cholecystokinin-like hormone activates a feeding-related neural circuit in

lobster Nature 1990 344 (6269) 866-8

28 Wolfender J L Chu F Ball H Wolfender F Fainzilber M Baldwin M A Burlingame A L

Identification of tyrosine sulfation in Conus pennaceus conotoxins alpha-PnIA and alpha-PnIB further investigation

of labile sulfo- and phosphopeptides by electrospray matrix-assisted laser desorptionionization (MALDI) and

atmospheric pressure MALDI mass spectrometry J Mass Spectrom 1999 34 (4) 447-54

29 Liu H Hakansson K Electron capture dissociation of tyrosine O-sulfated peptides complexed with divalent

metal cations Anal Chem 2006 78 (21) 7570-6

30 Young S A Julka S Bartley G Gilbert J R Wendelburg B M Hung S C Anderson W H

Yokoyama W H Quantification of the sulfated cholecystokinin CCK-8 in hamster plasma using

immunoprecipitation liquid chromatography-mass spectrometrymass spectrometry Anal Chem 2009 81 (21)

9120-8

183

Figure 1 Comparison of tandem MS spectra of truncated CPRP (ASLKSDTVTPL mz 128766)

by ESI-Q-TOF (a) and intact CPRP (MW 3837) by the amaZon ETD for both CID and ETD

fragmentation (a) MSn of precursor ion with mz 640 with charge state +6 by CID represent

loss of NH3 ordm represent loss of H2O (b) MS+6

of precursor ion with mz 640 with charge state +6

by ETD at z represent z+1 z represent z+2 (c) MS+5

of precursor ion with mz 768 with charge

state +5 by ETD z represent z+1 z represent z+2 For all labels in Figure 1 charge state +1 is

not specified

184

185

Figure 2 Negative CID spectrum of CCK-8 peptide standard (20 microM) via direct infusion show

the doubly charged b6 ion provides the most intense MSMS transition

186

Figure 3 ETD spectrum of E(pyro)FDEY(Sulfo)GHMRF-amide (2 microM) obtained on the

amaZon ETD via direct infusion with 30 microM MgCl2 The sulfated peptide can be identified

with a visible z-series of z2 to z9 and identified sulfate loss

187

Chapter 7

Investigation and reduction of sub-microgram peptide loss using molecular

weight cut-off fractionation prior to mass spectrometric analysis

Adapted from ldquoInvestigation and reduction of sub-microgram peptide loss using molecular

weight cut-off fractionation prior to mass spectrometric analysisrdquo Cunningham R Wang J

Wellner D Li L Journal of Mass Spectrometry In Press

188

ABSTRACT

This work investigates the introduction of methanol and a salt modifier to molecular

weight cut-off membrane-based centrifugal filters (MWCO) to enrich sub-microgram peptide

quantities Using a neuropeptide standard bradykinin sample loss was reduced over two orders

of magnitude with and without undigested protein present Additionally a bovine serum

albumin (BSA) tryptic digestion was investigated Twenty-seven tryptic peptides were identified

from MALDI mass spectra after enriching with methanol while only two tryptic peptides were

identified after the standard MWCO protocol The strategy presented here enhances recovery

from MWCO separation for sub-microg peptide samples

INTRODUCTION

Molecular weight cut-off membrane-based centrifugal filter devices (MWCO) are

commonly used to desalt and concentrate large molecular weight proteins [1] Greeing and

Simpson recently investigated various MWCO membranes for large amounts of starting material

(~6 mg) focusing on optimal conditions for the sub 25 kDa protein fraction [2] The authors

recovered 200 μg to 29 mg of protein from multiple MWCO experiments and demonstrated that

a 1090 acetonitrile (ACN)H2O elution solvent produced optimal results [3] In addition Manza

et al provided an alternative approach to isolate proteins with a 5 kDa MWCO by using

NH4HCO3 and recovering the retained proteins [4] Alternatively the elution from MWCOs can

be collected to recover only low molecular weight peptides Multiple peptidomic studies have

utilized MWCOs for peptide isolation during the first few steps of sample preparation [5 6]

When sample amount is limited or peptide content is below 1 microg sample loss is a significant

concern when using MWCOs to isolate endogenous peptides Optimized protocols have been

189

investigated using ACN [3 7] salt [4 8] SDS [5] or native sample [6 9 10] but these

experiments primarily focused on large sample amounts rather than sub-microgram peptide

quantities

MWCOs separate large molecules from small molecules The small molecule fraction

may be rich with signaling peptides (SP) cytokines and other small molecules involved in cell-

cell signaling Signaling peptides perform various functions in the body including cell growth

cell survival and hormonal signaling between organs [11] Individual SP contribute to different

aspects of behavior such as pain (enkephalins) [12] feeding (neuropeptide Y) [13] and blood

pressure (bradykinin) [14] MWCO separations can be used to enrich biologically important SP

and explore the peptide content from biological fluids with relatively low peptide content like

blood or cerebrospinal fluid (CSF) In a recent investigation the detection of neuropeptides and

standards in crustacean hemolymph was improved when methanol and protease inhibitors were

present before performing MWCO neuropeptide isolation The impact of methanol on MWCO

sample loss was not investigated in the study [15] In another study a large-scale mass

fingerprinting protocol of endogenous peptides from CSF used a combination of salts before

MWCO fractionation but the impact of adding salts was not discussed [16] The most

commonly used brand of MWCO in the publications and in peptidomic studies is Millipore

Therefore Millipore MWCOs (using regenerated cellulose as the membrane) were used in the

present study The purpose of this work is to provide an optimized sample preparation technique

for MWCO filtering to reduce sample loss and allow sub-microg detection of peptides using MALDI

mass spectrometry

MATERIALS AND METHODS

190

Materials and Chemicals

Water acetonitrile methanol (optima LCMS grade) and sodium chloride (995) were

purchased from Fisher Scientific (Fair Lawn NJ) The α-cyano-4-hydroxy-cinnamic acid (99)

formic acid (FA) (ge98) and bovine serum albumin (ge96) were purchased from Sigma-

Aldrich (St Louis MO) Amicon Ultra 05 mL 10000 MWCO centrifugal filters and ZipTips

packed with C18 reversed-phase resin were purchased from Millipore (Billerica MA) Trypsin-

digested bovine serum albumin (BSA) was purchased from Waters (Milford MA) Bradykinin

was purchased from American Peptide Company (Sunnyvale CA)

MALDI MS Instrumentation

An AutoFLEX III MALDI TOFTOF mass spectrometer (Bruker Daltonics Billerica

MA) was operated in positive ion reflectron mode The MALDI MS instrument is equipped with

a proprietary smart beam (Bruker Daltonics Billerica MA) laser operating at 200 Hz The

instrument was internally calibrated over the mass range of mz 500minus2500 using a standard

peptide mix Two thousand laser shots were collected per sample spot at an accelerating voltage

of 19 kV and a constant laser power using random shot selection The acquired data were

analyzed using FlexAnalysis software (Bruker Daltonics Billerica MA) Mass spectrometry

data acquisition was obtained by averaging 2000 laser shots

Molecular weight cut off separation procedure

The MWCO separations were performed using Amicon Ultra 05 mL 10000 MWCO

centrifugal filters (Billerica MA) Before MWCO separation three washing steps were

performed to remove contaminants from the filter The three washes were 500 μL of 5050

H2OMeOH 500 μL of H2O and 400 μL of the solution used for MWCO separation For the

191

100 H2O solution 1 microg of BSA or bradykinin was used for separation All the other MWCO

separation experiments used 500 ng of BSA or 100 ng or less of bradykinin The MWCO filter

was then centrifuged at 14000 rcf for 5 min at room temperature in an Eppendorf 5415 D

microcentrifuge (Brinkmann Instruments Inc Westbury NY) The filtrate was concentrated in a

Savant SC 110 SpeedVac concentrator (Thermo Electron Corporation West Palm Beach FL)

and acidified The resulting sample was desalted according to the manufacturer using C18

ZipTips from Millipore (Billerica MA) by washing the ZipTip with three times 100 ACN

three aqueous washes of 01 FA binding the peptides from the solution and one aqueous wash

of 01 FA Peptides were then eluted from the ZipTips using 15 microL of 50 ACN in 01 FA

Matrix deposition

Equal volumes of 05 μL from the 15 μL sample solution (including standards not subject

to MWCO filtering) and α-cyano-4-hydroxy-cinnamic acid (CHCA) matrix solution in 50

ACN50 H2O were mixed using a dried-droplet method and spotted on a MALDI target The

resulting droplets were allowed to air dry prior to mass spectrometry acquisition

RESULTS AND DISCUSSION

Analysis of two orders of magnitude increase for bradykinin standard

Bradykinin was selected to assess the potential peptide loss in the flow-through after

performing MWCO separation As shown in Figure 1 1 microg of bradykinin standard does not

produce a detectable signal by MALDI mass spectrometry analysis after a 10 kDa MWCO

separation in water (performed in triplicate) For comparison 1 ng of bradykinin standard

diluted to 15 microL produced an intense signal on the MALDI mass spectrometer suggesting

192

significant sample loss occurs when the target analyte is low in quantity (data not shown

performed in triplicate) Figure 1 shows that the addition of a salt in this case NaCl improves

the limits of detection and decreases sample loss when 7030 watermethanol was compared to

7030 aqueous 1 M NaClmethanol The reproducible results gave a relative standard deviation

(RSD) of 6 for peak intensity Figure 1 shows that even when starting with 1 μg of bradykinin

too much sample is lost during the MWCO separation in water to detect the remainder

However an intense signal is observed for 10 ng of bradykinin after the MWCO separation when

7030 aqueous 1 M NaClmethanol is used as an elution solvent Sample loss from zip-tipping

was estimated in triplicate by calculating the decreases in peak intensity When 10 ng of

bradykinin was used sample loss was ~41 In comparison the calculated yield for 10 ng of

bradykinin after the 7030 aqueous 1 M NaClmethanol MWCO separation and zip-tip desalting

showed an estimated sample loss of 63 meaning more loss can be attributed to sample clean-

up than MWCO filtration

A series of experiments were performed to determine if 7030 aqueous 1 M

NaClmethanol is an optimal solution to recover peptides during a MWCO separation (data not

shown) A 5050 aqueous 1 M NaClmethanol and a 5050 watermethanol elution were

performed in duplicate but signal intensity of the resulting bradykinin was poor and numerous

polymer peaks were detected in the flow through A lower salt concentration 01 M NaCl was

used but also produced greatly reduced signal when compared to aqueous 1 M NaCl To assess

the optimized methodrsquos compatibility with lower sample amounts the 7030 aqueous 1 M

NaClmethanol solution was added to 1 ng of bradykinin for MWCO separation but no signal

was obtained (data not shown) Using a neuropeptide standard the addition of methanol and

NaCl salt significantly improved the sample recovery in sub-microg amounts

193

BSA tryptic peptide mixture analysis

After demonstrating the importance of using an optimized solution for MWCO

separations with an individual peptide the new method was applied to 500 ng of BSA tryptic

digest to investigate its utility with more complex peptide mixtures Table 1 lists the BSA

tryptic peptides identified in the MALDI MS analysis from different solution conditions

processed by MWCO separation As shown in Table 1 a directly spotted BSA tryptic peptide

standard in the absence of any MWCO filtration enabled identification of 39 tryptic peptides by

accurate peptide mass measurements Once again when using 100 H2O for MWCO

separations the starting amount was doubled to 1 microg (also done with 500 ng data not shown)

However many tryptic peptides were not detected due to low signal intensities and non-optimal

elution conditions Instead of H2O a 1 M NaCl solution was used for the MWCO elution but

only two tryptic peptides were identified (Table 1) The addition of 30 methanol into the

sample before MWCO filtration produced the first increase in identified BSA tryptic peptides

The remaining data from Table 1 shows improved BSA tryptic peptide identifications as the

sample (elution) conditions were further optimized Figure 2 shows the actual mass spectra

associated with the three most promising elution solutions along with 100 H2O

The BSA tryptic peptide intensities are shown in Figure 2A and the most intense tryptic

peptide YLYEIAR mz 92749 was observed in the four different solutions shown in Figure 2B

but not in the 100 H2O or 100 1 M NaCl solutions (data not shown) In Figure 2A all mass

spectra are normalized to an intensity of 7 x 104 to illustrate two points First the MWCO

filtering step still produced sample loss regardless of the solvent conditions chosen Second

there is a noticeable increase in peptide peak intensity using the optimized solvent 6040

194

aqueous 1 M NaClmethanol (Figure 2A) Figure 2C displays a zoomed-in view of a BSA

tryptic peptide signal LKECC

DKPLLEK mz 153266 (

carbamidomethyl) observed only in

the optimized solvent The detection of the mz 153266 peptide in Figure 2C highlights the

potential gain in sample and detectable peptides by using an optimized saltMeOH combination

A non-optimized saltMeOH combination will still reduce sample loss but further minimizing

sample loss during sample preparation will always be desirable in any analytical protocol

MWCO composition

The purpose of this application note is to provide evidence of sub-microg sample loss during

MWCO separations of peptide samples and a solution to overcome this limitation The

explanation of why adding MeOH and NaCl to the sample solution provides a significant

reduction in sample loss is beyond the scope of this application note Regardless Supplemental

Table S1 is an expanded version of Table 1 showing the amino acid sequence hydrophobicity

calculated using GRAVY scores and pI of the identified peptides in this study No discernible

trend was obtained from the data The membrane of commonly used MWCO in peptidomics and

for this study is comprised of chemically treated (regenerated) cellulose which is a

polysaccharide containing β (1rarr4) linked D-glucose Glucose has numerous free hydroxyl

groups which could non-specifically adsorb peptides flowing through the MWCO The addition

of MeOH has the most significant effect on signal which could be due to disrupting the

interaction between peptides and hydroxyl groups from glucose NaCl has a less significant

effect on sample recovery compared to MeOH but a detectable reduction in sample loss is noted

This improvement in sample recovery could be analogous to the use of NaCl in

195

immunodepletion protocols to reduce non-specific binding which is accomplished by adding

150 mM NaCl [17]

Analysis of bradykinin in the presence of undigested BSA

When using MWCO for peptide isolation proteins are typically present in the samples

usually in larger amounts Figure 3 shows the effect that adding BSA protein to a 10 ng

bradykinin solution before MWCO fractionation has on the resulting recovery of bradykinin

Adding 10 μg of BSA to the optimized 7030 aqueous 1 M NaClmethanol solution slightly

decreased bradykininrsquos signal with a RSD of 10 More severe signal reduction occurred after

adding 100 microg BSA with a RSD of 2 (N=2) It is not unexpected that more signal reduction

due to sample loss would occur especially in the 100 microg BSA sample because the BSA protein

has a molar ratio of 160 BSA protein molecules to one bradykinin peptide Figure 3 shows the

usefulness of the MWCO method with samples containing large amounts of proteins

RecommendationConclusion

The present work provides solutions to reduce sample loss from the use of MWCO for

sub-microg peptide isolation with or without non-digested proteins in the sample Despite its

widespread utility significant sample loss often occurs during the MWCO fractionation step

which is particularly problematic when analyzing low-abundance peptides from limited starting

material This application note aims to reduce sample loss during MWCO separations

specifically for sub-microg peptide isolation If complex samples are processed with MWCO

separation the authors recommend eluting the sample with 6040 aqueous 1 M NaClmethanol

solution as a starting point to minimize sample loss This application note provides a viable

196

alternative for sub-microg peptide MWCO separation circumventing the need to increase the starting

material by minimizing sample loss from using a MWCO membrane-based centrifugal filter

device

References

[1] HM Georgiou GE Rice MS Baker Proteomic analysis of human plasma failure of

centrifugal ultrafiltration to remove albumin and other high molecular weight proteins

Proteomics 2001 1 1503

[2] DW Greening RJ Simpson Low-molecular weight plasma proteome analysis using

centrifugal ultrafiltration Methods Mol Biol 2011 278 109

[3] DW Greening RJ Simpson A centrifugal ultrafiltration strategy for isolating the low-

molecular weight (ltor=25K) component of human plasma proteome J Proteomics 2010 73

637

[4] LL Manza SL Stamer AJ Ham SG Codreanu DC Liebler Sample preparation and

digestion for proteomic analyses using spin filters Proteomics 2005 5 1742

[5] D Theodorescu D Fliser S Wittke H Mischak R Krebs M Walden M Ross E Eltze O

Bettendorf C Wulfing A Semjonow Pilot study of capillary electrophoresis coupled to mass

spectrometry as a tool to define potential prostate cancer biomarkers in urine Electrophoresis

2005 26 2797

[6] K Antwi G Hostetter MJ Demeure BA Katchman GA Decker Y Ruiz TD Sielaff LJ

Koep DF Lake Analysis of the plasma peptidome from pancreas cancer patients connects a

peptide in plasma to overexpression of the parent protein in tumors J Proteome Res 2009 8

4722

[7] LP Aristoteli MP Molloy MS Baker Evaluation of endogenous plasma peptide extraction

methods for mass spectrometric biomarker discovery J Proteome Res 2007 6 571

[8] A Zougman B Pilch A Podtelejnikov M Kiehntopf C Schnabel C Kumar M Mann

Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome Res 2008 7

386

[9] X Yuan DM Desiderio Human cerebrospinal fluid peptidomics J Mass Spectrom 2005 40

176

[10] X Zheng H Baker WS Hancock Analysis of the low molecular weight serum peptidome

using ultrafiltration and a hybrid ion trap-Fourier transform mass spectrometer J Chromatogr A

2006 1120 173

[11] L Li JV Sweedler Peptides in the brain mass spectrometry-based measurement approaches

and challenges Annu Rev Anal Chem 2008 1 451

[12] GB Stefano G Fricchione Y Goumon T Esch Pain immunity opiate and opioid

compounds and health Med Sci Monit 2005 11 MS47

[13] J Jensen Regulatory peptides and control of food intake in non-mammalian vertebrates Comp

Biochem Physiol A Mol Integr Physiol 2001 128 471

197

[14] A Kuoppala KA Lindstedt J Saarinen PT Kovanen JO Kokkonen Inactivation of

bradykinin by angiotensin-converting enzyme and by carboxypeptidase N in human plasma Am

J Physiol Heart Circ Physiol 2000 278 H1069

[15] R Chen M Ma L Hui J Zhang L Li Measurement of neuropeptides in crustacean

hemolymph via MALDI mass spectrometry J Am Soc Mass Spectrom 2009 20 708

[16] H Jahn S Wittke P Zurbig TJ Raedler S Arlt M Kellmann W Mullen M Eichenlaub H

Mischak K Wiedemann Peptide fingerprinting of Alzheimers disease in cerebrospinal fluid

identification and prospective evaluation of new synaptic biomarkers PLoS One 2011 6

e26540

[17] NA Cellar AS Karnoup DR Albers ML Langhorst SA Young Immunodepletion of high

abundance proteins coupled on-line with reversed-phase liquid chromatography a two-

dimensional LC sample enrichment and fractionation technique for mammalian proteomics J

Chromatogr B Analyt Technol Biomed Life Sci 2009 877 79

198

Table 1 Identified BSA tryptic peptides from various MWCO separation conditions

BSA tryptic

peptide (MH+)

100

H2O 1microg

100

1 M NaCl

70

H2O

80

1 M NaCl

70

1 M NaCl

60

H2O

60

1 M NaCl

5083

5453

6894

7124

8985

9275

10345

10725

11385

11636

12496

12837

13057

13997

14157

14197

14398

14636

14798

15026

15118

15328

15547

15677

15768

16399

16678

16738

17248

17408

17477

17497

18809

18890

19019

19079

20450

21139

22479

Total 39 2 2 6 8 15 15 27

199

Figure 1 Representative MALDI mass spectra after MWCO separation of a bradykinin

standard showing improvement over two orders of magnitude in detection limits Each MWCO

separation was performed at minimum in triplicate with representative spectrum selected for

each with a calculated RSD from the peak heights Three different amounts of bradykinin were

tested to assess the magnitude of sample loss under different MWCO solvent conditions The

top panel shows 1 microg of bradykinin standard after MWCO separation with 100 H2O elution

produced no signal The addition of 40 or 30 MeOH produced very low bradykinin signals

for both 100 ng (RSD of 28) and 10 ng (SNlt3 no RSD calculated) respectively In the

bottom two spectra each showed very large intensity but the 7030 aqueous 1 M NaClmethanol

10 ng bradykinin was processed with a 10kDa MWCO and zip-tipped and was reproducible with

200

a RSD of 6 The last spectrum was from 10 ng bradykinin (RSD of 3) which was diluted to

an equivalent volume as all the other experiments and directly spotted onto the MALDI plate

201

Figure 2 Representative MALDI mass spectra from MWCO separation of a BSA tryptic

peptide standard showing sample loss Stacked mass spectra from mz range 875-2150

normalized to 7 x 104 intensity representing the detection difference from a BSA tryptic peptide

standard from different MWCO separation conditions (A) It should be noted that when the

solvent for MWCO elution was 100 H2O 1 microg of BSA tryptic peptides was processed instead

of 500 ng A zoomed in view of the most abundant BSA tryptic peptide detected YLYEIAR

mz 92749 (B) Various percentages of MeOH produced significant signal but addition of a salt

(1 M NaCl) increases the signal which is closest to a directly spotted BSA tryptic peptide

standard A zoomed in view of a representative low intensity BSA tryptic peptide detected

LKECC

DKPLLEK mz 153266 (C) The optimized solution to be used for MWCO filtration

202

6040 aqueous 1 M NaClmethanol was the only procedure that enabled the detection of the

tryptic peptide in Figure 2C which was also detected in the directly spotted BSA tryptic peptide

standard All experiments were performed a minimum of two times with nearly identical results

) Carbamidomethyl amino acid modification

ordm) Tryptic peptide identified in three of the spectra in Figure 2A

dagger) Tryptic peptide identified in two of the spectra in Figure 2A

) Tryptic peptide identified in a single spectrum in Figure 2A

203

Figure 3 Representative MALDI mass spectra after MWCO separation of a bradykinin

standard with a BSA protein present showing optimized solvent conditions minimized samples

losses Each experiment was performed in duplicate Two different amounts of BSA protein

were tested to assess the magnitude of sample loss caused by the presence of a protein The top

panel shows 10 microg of BSA protein and the second panel shows 100 microg of BSA protein added

while only 10 ng of bradykinin was added Detectable sample loss was observed when the BSA

protein was added but panel two shows that the amount of BSA protein was 1 x 104 greater

(equivalent to 160 fold molar excess) than bradykinin The last two spectra were controls using

a MWCO with the optimized solution in panel 3 and panel 4 using 10 ng bradykinin which was

diluted to an equivalent volume as all the other experiments and directly spotted onto the

MALDI plate

204

Supplemental Table 1 Expanded Table 1 including grand average of hydropathicity (GRAVY)

score theoretical pI and the sequence from the underlying amino acid sequence for the peptides

identified in the BSA digest The GRAVY and pI scores were obtained from ExPASy

Bioinformatics and modifications were not taken into consideration

(httpwebexpasyorgprotparam) Peptide assignments to the recorded peaks were done by

BSA

tryptic

peptide

(MH+)

GRAVY

score

Theoretical

pI

Sequence 100

H2O

1microg

100

1 M

NaCl

70

H2O

80

1 M

NaCl

70

1 M

NaCl

60

H2O

60

1 M

NaCl

5083 NA NA FGER

5453 0900 972 VASLR

6894 0267 979 AWSVAR

7124 -0950 647 SEIAHR

8985 0529 674 LcVLHEK

9275 -0071 600 YLYEIAR

10345 -0725 674 NEcFLSHK

10725 -0211 538 SHcIAEVEK

11385 0 599 ccTESLVNR

11636 0130 453 LVNELTEFAK

12496 -1250 545 FKDLGEEHFK

12837 0264 675 HPEYAVSVLLR

13057 -0582 532 HLVDEPQNLIK

13997 0567 437 TVMENFVAFVDK

14157 0567 437 TVmENFVAFVDK

14197 0058 530 SLHTLFGDELcK

14398 -0133 875 RHPEYAVSVLLR

14636 -0515 465 TcVADESHAGcEK

14798 0292 600 LGEYGFQNALIVR

15026 -0625 409 EYEATLEEccAK

15118 0207 597 VPQVSTPTLVEVSR

15328 -0617 617 LKEccDKPLLEK

15547 -0823 441 DDPHAcYSTVFDK

15677 -0085 437 DAFLGSFLYEYSR

15768 -0985 456 LKPDPNTLcDEFK

16399 -0067 875 KVPQVSTPTLVEVSR

16678 0064 437 MPCTEDYLSLILNR

16738 -1723 550 QEPERNEcFLSHK

17248 0064 437 MPcTEDYLSLILNR

17408 0064 437 mPcTEDYLSLILNR

17477 -0914 414 YNGVFQEccQAEDK

17497 -0621 410 EccHGDLLEcADDR

18809 -0537 606 RPcFSALTPDETYVPK

18890 -0567 674 HPYFYAPELLYYANK

19019 -1275 466 NEcFLSHKDDSPDLPK

19079 0044 454 LFTFHADIcTLPDTEK

20450 -0812 839 RHPYFYAPELLYYANK

21139 -0682 480 VHKEccHGDLLEcADDR

22479 -0458 423 EccHGDLLEcADDRADLAK

Total 39 2 2 6 8 15 15 27

205

mass matching with no tandem mass spectrometry performed Lower case amino acids indicate

a modification present in the peptide of carbamidomethyl (c) or oxidation (m)

206

Chapter 8

Conclusions and Future Directions

207

Summary

Comparative shotgun proteomics investigating numerous biological changes in various

species and sample media and peptidomic method development have been reported The

developed comparative shotgun proteomics based on label-free spectral counting with nanoLC

MSMS platform have been employed in mouse CSF rat CSF yeast culture and other biological

specimens Mass spectrometry (MS)-based peptidomic enhancements using ETD and improved

sample preparation methods for molecular weight cut-offs have been reported Together these

studies add to the Li Labs capabilities in comparative shotgun proteomics and expand available

methods for peptidomic research

Immunodepletion of CSF for comparative proteomics

Chapters 3 and 4 used similar methods to generate a list of differentially expressed

proteins providing insight into how Alexander disease (AxD) perturbs the proteome and how the

new prion model rat adapted scrapie (RAS) proteome varies from control rats In the GFAP

overexpressor mouse CSF study in Chapter 3 we have produced a panel of proteins with

significant up or down regulation in the CSF of transgenic GFAP overexpressor mice MS-based

proteomic study of this mouse model for AxD was consistent with the previous studies showing

elevation of GFAP in CSF The development of a modified IgY-14 immunodepletion technique

for low amounts of CSF with recommendations for future antibody depletion techniques to deal

with the unique challenges of mouse CSF was presented Modified proteomics protocols were

employed to profile mouse CSF with biological and technical triplicates addressing the

variability and providing quantitation with dNSAF spectral counting Validation of the data was

performed using both ELISA and RNA microarray data to provide corroboration with the

208

changes in the putative biomarkers This work presents numerous interesting targets for future

study in AxD including CK-MCK-B cathepsin S L1 and B isoforms and contactin-1

Using the protocol developed in Chapter 3 we performed a similar study in RAS CSF

compared to control rat CSF Two differences in sample preparation for the rat CSF compared

to the mouse CSF are noted (1) rat IgY-7 immunodepletion was performed and (2) each rat

CSF sample was collected from one animal due to sufficient volume instead of pooling from

multiple animals for the mouse CSF sample After immunodepletion the CSF samples from

control and RAS (biological N=5 technical replicates N=3) were digested and separated using

one dimensional RP nanoLC separation We were able to identify 167 proteins with redundant

isoforms removed which was derived from total 512 proteins with a 1 FDR from all the CSF

samples Comparative analysis was performed using dNSAF spectral counting and 21 proteins

were significantly changed Our data were consistent with previous prion CSF studies showing

14-3-3 protein increased in CSF of prion infected animals Genomic analysis was also

performed and was used to cross-validate our proteomic data and numerous proteins were found

to be consistent including a novel marker related to prion disease RNAset2 (ribonuclease T2)

In summary this work provides a foundation for investigation of the perturbed proteome of a

new prion model RAS

Yeast Phosphoproteomic Comparison between Starved and Glucose Fed Conditions

This work presented a qualitative comparison of the phosphoproteome between starved

and glucose fed Saccharomyces cerevisiae (bakers yeast) A large scale IMAC enrichment of

yeast cell lysate followed by 2-D separations provided a rich source of phosphopeptides for CID

MSMS and neutral loss triggered ETD analysis using mass spectrometry A specific motif for

PKA was highlighted to show the differences in proteins identified between starved and glucose

209

fed conditions In total 477 unique phosphopeptides were identified with 06 FDR and 669

unique phosphopeptides identified with 54 FDR Phosphosite validation was performed using

a localization algorithm Ascore to provide further confidence on the site-specific

characterization of these phosphopeptides Two proteins Ssd1 and PMA2 emerged as potential

intriguing targets for more in-depth studies on protein phosphorylation involved in glucose

response

Methods for Peptide Sample Preparation and Sequencing

In this study ETD was performed to improve the sequence coverage of endogenous large

neuropeptides and labile tyrosine sulfation CCK-like neuropeptides found in the blue crab

Callinectes sapidus The intact CPRP from Callinectes sapidus was identified and characterized

with 68 sequence coverage by MS for the first time Cation assisted ETD fragmentation using

MgCl2 was also shown to be a powerful tool in de novo sequencing of sulfated neuropeptides

These endeavors into using ETD for certain neuropeptides will assist in future analysis of large

neuropeptides and PTM containing neuropeptides

In addition to ETD sequencing I presented a protocol on improving recovery of minute

quantities of peptides by adding methanol and a salt modifier (NaCl) to molecular weight cut-off

membrane-based centrifugal filters (MWCO) to enrich sub-microgram peptide quantities

Despite its widespread utility significant sample loss often occurs during the MWCO

fractionation step which is particularly problematic when analyzing low-abundance peptides

from limited starting material This work presented a method to reduce sample loss during

MWCO separations specifically for sub-microg peptide isolation Using a neuropeptide standard

bradykinin sample loss was reduced by over two orders of magnitude with and without

210

undigested protein present The presence of bovine serum albumin (BSA) undigested protein

and the bradykinin standard lends evidence that sample loss occurs because of the MWCO and

not the presence of the protein Additionally a BSA tryptic digestion is presented where twenty-

seven tryptic peptides are identified from MALDI mass spectra after enriching with methanol

while only two tryptic peptides are identified after the standard MWCO protocol

Ongoing Projects and Future Directions

CSF Projects

Rat Adapted Scrapie and Time Course Study of Rat CSF

In ongoing experiments from the work described in Chapter 4 related to rat adapted

scrapie (RAS) we are working towards more complete proteome coverage of CSF and a time

course study of RAS After the promising results of the 1-D proteomic comparison between

RAS and control CSF we want to perform IgY immunodepletion on 1 mL of rat CSF followed

by 2-D high pH-low pH RP-RP LC-MSMS IgY immunodepletion is still required and

afterwards approximately 40 microg of low abundance protein would remain Following traditional

urea tryptic digestion (see Appendix 1) and solid phase extraction (SPE) clean-up the sample

would be dried down and reconstituted in mobile phase A (25 mM ammonium formate basic

pH) for high pH RP separation Fractions would be dried and subjected to MS analysis similar to

the work described in Chapter 4 The purpose of this work would be to increase the proteome

coverage from a few hundred to a thousand CSF proteins identified A time course study of RAS

is also desirable to gain insight into disease progression Rats at different stages will be

sacrificed and CSF will be collected The goal is to perform isobaric labeling between the time

courses with spectral counting being an alternative for relative protein expression We will use

the targets identified in Chapter 4 to track certain proteins for time course analysis Overall

211

these future projects will dig deeper into the proteome and how this novel prion model RAS

perturbs the proteins expressed in rats over time

Glycoproteomics and Peptidomic Analysis of CSF Samples from Individuals with

Alzheimerrsquos Disease

Alzheimerrsquos disease is an incurable progressive neurodegenerative disorder which results

in cognitive impairment Our aim is to provide additional biomarkers andor targets for drug

treatment Currently we used 500 microL of human CSF for non-membrane glycoprotein

enrichment (see Appendix 1) followed by 2-D high-pH-low-pH RP-RP amaZon ion trap LC-

MSMS analysis The initial work was a failure due to low amount of signal and significant

sample loss (data not shown) Through a comparison with Dr Xin Weirsquos PhD thesis we

estimated roughly 20 microg of proteins was obtained after glycoprotein enrichment and 1-D analysis

produced over 100 protein identifications (data not shown) but the additional off-line 2-D

separation and sample clean up resulted in low number of protein identifications for each fraction

analyzed by MS (data not shown) We have recently obtained 1 mL of human CSF samples

from healthy controls and individuals suffering from Alzheimerrsquos disease and plan to perform

the same experiments with double the starting amount and reduce the fractions collected from 2-

D separation from 11 to 6 In addition to the glycoproteomics anything not enriched will be

subjected to MWCO separation using the method presented in Chapter 7 The obtained peptide

sample will be subjected to Q-TOF nanoLC MSE data acquisition followed by de novo

sequencing using various programs including PEAKS and Mascot Collectively we feel this

project has great potential to lead to interesting targets and further expand the proteomic

knowledge of Alzheimerrsquos disease

GFAP Knock-in Mouse CSF

212

In a direct follow-up to Chapter 3 we aim to perform comparative proteomics on control

vs glial fibrillary acidic protein (GFAP) knock-in mouse CSF samples The sample preparation

protocol as presented in Chapter 3 will be used For quantitative proteomics we plan on

performing isobaric labeling followed by performing high energy collision induced dissociation

(HCD) on the Orbitrap Elite The MS acquisition will also perform a top ten scan where the top

ten MS peaks will be selected for tandem MS analysis In addition targeted quantitation of

specific proteins using multiple reaction monitoring (MRM) can be performed on potential

biomarkers and identified in this follow-up study and the work presented in Chapter 3 Also any

CSF samples with noticeable blood content cannot be used for the exploratory proteomics

experiments but can potentially be used for the MRM analysis and should be kept for such

experiments in the future

Large Scale Proteomics

Proteomics of Mouse Amniotic Fluid for Lung Maturation

The overall goal of this project is to determine what proteins are present in amniotic fluid

when the embryo is 175 weeks compared to amniotic fluid at 155 weeks The rationale behind

why these two time points matter was investigated through a lung explant culture where amniotic

fluid was added at 155 weeks and 175 weeks Visual maturation of lungs occurred with the

175 week amniotic fluid and lung maturation genes were up regulated in the mRNA in the lung

explant culture when compared to the 155 week amniotic fluid The compound which is

causing the maturation of the lungs is unknown and search for a secreted protein might provide a

clue to this process In order to test this hypothesis we carried out discovery proteomics

experiments The workflow involves tryptic digestion SPE and 1-D low pH nanoLC separation

coupled to an amaZon ETD ion trap for tandem MS analysis The resulting mass spectrometric

213

acquisition identified less than 100 proteins with a 1 FDR (data not published) To address the

lack of depth in the proteome coverage we purchased an IgY immunodepletion column to

remove the most abundant proteins in amniotic fluid similarly to Chapter 3 and 4 but on a larger

scale One abundant protein alpha-fetoprotein is present in amniotic fluid but absent or present

in very low concentration in CSF serum or plasma Alpha-fetoprotein is similar to albumin and

thus we ran amniotic fluid on an IgY immunodepletion column and observed significant

reduction in alpha-fetoprotein amount (data not published) After immunodepletion 2-D high

pH-low pH RP-RP will be performed followed by nanoLCMSMS on the amaZon ETD ion trap

We will perform mass spectrometry experiments using the 155 week amniotic fluid and 175

week amniotic fluid which have average protein contents ~2 mgmL We anticipate a minimum

of tenfold increase in the proteome coverage of amniotic fluid and provide meaningful

hypothesis driven biological experiments from this work

Phosphoproteomics of JNK Activation

c-Jun N-terminal kinase (JNK) is an important cellular mediator of stress activated

signaling Under conditions of oxidative stress JNK is activated resulting in the downstream

phosphorylation of a large number of proteins including c-Jun However the cellular response

to JNK activation is extremely complex and JNK activation can result in extremely different

physiological outcomes For example JNK is at the crossroads of cellular death and survival

and exhibits both pro-death and pro-survival activities These highly disparate outcomes of JNK

activation are highly contextual and depend on the type of stressor and duration of stress In the

brain JNK has been shown to be activated in models of Alzheimerrsquos disease Parkinsonrsquos

disease amyotrophic lateral sclerosis and stroke It is thought that JNK is activated in these

diseases as a result of oxidative stress and it is unknown whether this activation is pro-survival or

214

pro-death To elucidate JNK signaling in the brain we chose to analyze primary cortical

astrocytes under conditions of oxidative stress Since hydrogen peroxide is a physiologically

relevant oxidant in the brain and known to activate JNK we chose to use this to treat astrocytes

and then analyze changes to the phosphoproteome by mass spectrometry By doing this we

hope to elucidate important JNK-dependent targets involved in stress signaling in the brain and

that identifying these targets could lead to the identification of novel disease mechanisms and

potentially new therapeutic targets for neurodegeneration Specifically we plan on performing

stable isotope labeling by amino acids (SILAC) of astrocytes for control hydrogen peroxide

treated and hydrogen peroxide treated with JNK inhibitor The SILAC labeled astrocyte cell

lysate will be digested enriched using Fe-NTA IMAC and separated using 2-D high pH-low pH

RP-RP and infused into the amaZon ETD ion trap The protocol is similar to the yeast

comparative phosphoproteomics from Chapter 5 After analysis we plan on processing the data

using ProteoIQ to identify phosphoproteins with significant changes

Immunoprecipitation Followed by Mass Spectrometry

Stb3 Mass Spectrometry Analysis

Stb3 (Sin3-binding protein) has previously been shown to change location depending on

the presence or absence of glucose (Heideman Lab Dr Michael Conways Thesis) An

immunoprecipitation of Stb3 was performed in parallel to the work described in Chapter 5 Two

separate experiments were performed one with wild type Stb3 and another tagged with myc for

improved protein immunoprecipitation Myc tagging is a polypeptide tag which can be

recognized by an antibody and provide a better enrichment compared to using a Stb3 antibody

alone The myc tagging was done because of the low abundance of Stb3 and the limited amount

of protein that was pulled down by the antibody for Stb3 Immunopreciptation experiments were

215

performed for both starved and glucose fed samples All samples were tryptically digested

followed by low pH RP nanoLCMSMS Both CID and ETD were used for fragmentation

analysis of each sample (n=2) The Stb3 wild type had two peptide hits in one MS run which is

not surprising due to low Stb3 antibody affinity In contrast a significant amount of Stb3 was

pulled down from Myc tagged starved and glucose fed samples For the glucose starved

samples 22 unique peptides from Stb3 were identified whereas glucose fed samples yielded 21

unique Stb3 peptides identified (unpublished data) The identified Stb3 in the myc samples

allowed us to investigate what other proteins were pulled down that are not present in the wild

type samples

From previous work by our collaborator Dr Heideman it had been suggested that Stb3

forms a complex with Sin3 and Rpd3 Our proteomic data shows multiple significant peptide

hits for Sin3 and Rpd3 in myc tagged but not in wild type where Stb3 is only observed once

with a low Mascot score When looking at the unique proteins identified in myc tagged glucose

fed sample but not included in the wild type samples the myc fed sample contained eight unique

ribosomal proteins with numerous unique peptides for each The ribosomal proteins identified in

myc fed sample supports the novel hypothesis that in the glucose fed state the complex of Stb3

Rpd3 and Sin3 interact with ribosomal proteins Other proteins unique to myc tagged glucose

starved samples are glyceraldehyde-3-phosphate dehydrogenase 2 and transcriptional regulatory

protein UME6 Also three proteins were observed in both myc fed and starved but not in the

wild type samples including transposon Ty1-DR1 Gag-Pol polyprotein(YD11B) SWIRM

domain-containing protein (FUN19) and RNA polymerase II degradation factor 1 (DEF1) Our

proteomics data for the Stb3 immunoprecipitation experiments with starved vs glucose fed

216

samples provide exciting evidence to support previous observation made by the Heideman group

and highlight the utility of MS-based approach to deciphering protein-protein interactions

Conclusions

The majority of the work described in this dissertation revolves around sample

preparation for proteomics and peptidomics with a focus on generating biologically testable

hypotheses In the area of neuropeptides CPRP was fully sequenced analytical methods were

transformed for use in an ETD ion trap and sub-microg peptides can now be analyzed by mass

spectrometry after MWCO separation In the field of comparative proteomics comparisons

between mouse CSF in GFAP overexpressors and control or rat adapted scrapie model and

control CSF yeast fed vs glucose starved cell extract have all been presented Collectively this

thesis has developed new techniques for neuropeptide sample preparation and presented

numerous comparative proteomic analyses of various biological samples and how the proteomes

are dynamically perturbed by various treatments and disease conditions

217

Appendix 1

Protocols for sample preparation for mass spectrometry based

proteomics and peptidomics

218

Small Scale Urea Tryptic Digestion (below 20 microg)

1 Concentratedilute sample to 10 μL starting volume

2 All solutions are in a 50mM NH4HCO3 buffer (395mgmL)

3 Add 1 μL of 05M Dithiothreitol (DTT) (77mg100μL) and 8 μL of urea solution

(400mg05mL) to the sample

4 Place sample in 37degC water bath for 45 minutes

5 Add 27μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

6 Quench reaction with 1μL of DTT solution and let sit for 10 minutes

7 Dilute with 70μL of NH4HCO3 solution

8 Use 1μL of trypsin (05μgμL)

9 Allow to digest overnight in 37degC water bath

10 Acidify with 10μL 10 formic acid

11 Perform solid phase extraction using tips dependent of sample amount

a Sub-5μg amounts ndash Millipore Ziptips

b 5-75μg amounts ndash Bond-Elut tips from Agilent (OMIX tips)

12 Dry down in Speedvac as needed for experiment

219

Small Scale ProteaseMAX Tryptic Digestion (below 20 microg)

1 Concentratedilute sample to 10 μL starting volume

2 All solutions are in a 50mM NH4HCO3 buffer (395mgmL)

3 Add 1 μL of 05M Dithiothreitol (DTT) (77mg100μL) and 2 μL of 1 of

ProtesaeMAX (Promega) to the sample

4 Place sample in 37degC water bath for 45 minutes

5 Add 27μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

6 Quench reaction with 1μL of DTT solution and let sit for 10 minutes

7 Dilute with 70μL of NH4HCO3 solution

8 Use 1μL of trypsin (05μgμL)

9 Add 1 μL ProteaseMAX and let sit for 3-4 hours

10 Acidify with 2μL 10 formic acid

11 Dry down in Speedvac as needed for experiment

220

Large Scale Urea Tryptic Digestion (mg of proteins)

1 All solutions are in a 50 mM NH4HCO3 buffer (395 mgmL)

2 Add 20μL of 05M Dithiothreitol (DTT) (77mg100μL) and 160μL of urea solution

(400mg05mL) to sample

3 Allow sample to denature 45-60 minutes in a 37degC water bath

4 Add 54μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

5 Quench reaction with 20μL of DTT solution

6 Dilute with 14mL of NH4HCO3 solution

7 Add 100μg of trypsin

8 Allow to digest overnight in 37degC water bath

9 Acidify sample with 100μL of 10 formic acid

10 Perform solid phase extraction with Hypersep C18 Cartridges with 100 mg of C18

bead volume (Thermo)

11 Dry down with the Speedvac as needed for experiment

221

Fe-NTA Preparation from Ni-NTA Beads

1 Centrifuge (5 min at 140 rcf) and use a magnet to keep beads in place as supernatant

is removed

2 Wash 3 times with 800μL of H2O (using same technique of centrifuging and using

magnet to keep beads in places as supernatant is removed)

3 Add 800μL of 100mM EDTA (372mgmL) in a 50mM NH4HCO3 (395mgmL)

buffered solution (pH around 8) and vortex for 30 minutes to chelate the Ni

centrifuge and remove supernatant

4 Wash 3 times with 800μL of H2O

5 Add 800μL of 10mM FeCl3 hexahydrate (27mgmL) and vortex for 30 minutes to

bind Fe to beads centrifuge and remove supernatant

6 Wash 3 times with 800μL H2O

7 Resuspend in a storage solution of 111 ACNMeOH001 Acetic Acid (1mL total)

222

Fe-NTA IMAC Phospho-enrichment

1 Use wash solution (80ACN 01Formic Acid) to rinse beads vortex 1 minute

centrifuge and remove supernatant

2 Add sample (around 500μL in 80ACN 01Formic Acid) and vortex 40 minutes to

allow sample to bind dispose of supernatant after centrifuging

3 Wash 3 times with 200μL of wash solution discard supernatant

4 Add 200μL of elution solution (5050 ACN5Ammonium Hydroxide) vortex 15

minutes and save supernatant

5 Add 200μL of elution solution vortex 10 minutes and save supernatant

6 Wash 2 time with wash solution (collect supernatant of first wash)

7 If reusing store in 1mL solution of 111 ACNMeOH001 Acetic Acid

223

High pH Off-line Separation

1) In general a minimum of 20 microg of peptides are needed to gain any benefit

from off-line 2D fractionation It is better to inject 100 microg of peptides on

column

2) Use a Gemini column or a similar column that can handle pH=10 and for this

protocol a 21 mm x 150 mm column was used

3) Prepare ldquobuffer Ardquo 25 mM NH4 formate pH=10 Also prepare ldquobuffer Brdquo

4) Dry down desired sample and reconstitute in buffer A

5) Inject based off sample loop (current Alliance HPLC has a 50 microL sample

loop)

6) Run gradient at bottom of the page collecting fractions every 3 minutes except

for the 1st minute which is the void volume

7) Optional If you want to reduce instrument time you can combine fractions 1

with 12 2 with 13 etc until 11 with 22

Time Mobile phase A Mobile phase B Flow Rate

05mlmin

0 98 2 05 mLmin

65rsquo 70 30 05 mLmin

65rsquo1rdquo 5 95 05 mLmin

70 5 95 05 mLmin

224

Non Membrane Glycoprotein Enrichment

1 For protocols involving membrane glycoprotein enrichment see Dr Xin Weirsquos

thesis

2 Add 75 microL of Wheat germ agglutinin (WGA) bound to agarose beads and 150 microL

of Concanavalin A (ConA) to a Handee spin cup (Thermo) Wash 2 times with

lectin affinity chromatography (LAC) binding buffer (015 M NaCl 002 M Tris-

HCl 1 mM CaCl2 pH=74) centrifuging at 400 g for 30 seconds

3 Place stopper on bottom of spin cup and add 200 microg of sample (500 microL for CSF)

Bring up to 300 microL using lectin LAC binding buffer

4 Incubate for 1 hour with continuous mixing at room temperature

5 Centrifuge at 400 g for 30 seconds

6 Perform two more 300 microL LAC binding washes followed by centrifugation

7 Add 300 microL LAC eluting buffer (0075 M NaCl 001 M Tris-HCl 02 M alpha-

methyl mannoside 02 M alpha-methyl glucoside and 05 M acetyl-D-

glucosamine) vortex for 10 minutes (have stopper in place while vortexing)

centrifuge and collect

7 Add another 300 microL LAC eluting buffer centrifuge and collect

225

MWCO separation for Sub-microg peptide concentrations

1 Wash molecular weight cut-off (MWCO) with 5050 waterMeOH centrifuge at

14000 g for 5 minutes for 10 kDa filter (time will vary Millipore Amicon Ultra

filters)

2 Wash with 100 water centrifuge at 14000 g for 5 minutes

3 Add methanol to the sample to get the concentration to 30 methanol and add

salt to achieve a roughly 05 M NaCl before adding the sample to the MWCO

4 Centrifuge at 14000 for 10 minutes collect flow through

226

Immunoprecipitation

Modified from Thermo Fisher Scientific Classic IP Kit

1 10 microL Polyclonal antibody added to 1 microg protein standard in a Handee spin cup

(Thermo) diluted with 1x TBS buffer to 400 microL and incubated overnight at on

shakerend-over-end rotator

2 Wash Protein GA beads with 2x 200microL TBS buffer and added to the

antibodysample for 15 hours at 4oC

3 Centrifuge at 400 g for 30 seconds and discard flow through

4 Wash 3x with 100microL TBS buffer centrifuge at 400 g for 30 seconds and discard

flow through

5 Wash with 1x conditioning solution (neutral pH buffer) centrifuge at 400 g for 30

seconds and discard flow through

6 Eluted with elution buffer (2x100microL) centrifuge at 400 g for 30 seconds and

collect flow through

227

C18 Solid Phase Extraction (SPE)

1 Determine amount of peptides for SPE ge5 microg use Ziptips from Millipore If

between 5-75 microg use OMIX from Agilent 100 microL version and if ge75 microg use SPE

cartridges such as 100 mg Hypersep from Thermo

2 Ensure no detergents are in the sample and it is acidified

3 The three SPE procedures all use the same sets of solutions only volumes vary

4 3x Wetting solution (100 ACN) for 60 microL OMIX (20 microL ziptip and 15 mL for

100 mg cartridge)

5 3x Wash solution (01 formic acid in 100 H2O) same volumes as 4

6 Draw up the sample about 10 times minimum (generally 30-40 is preferred)

without letting the bead volume dry out

7 1X Wash solution same volumes as 4

8 Back load the eluting solution 01 formic acid in 5050 H2OACN into the

Ziptips (15 microL) or OMIX (50 microL) tips For the SPE cartridge add 700 microL of

eluting solution

9 Dry down and prepare for next step in sample preparation

228

Laser Puller Programs and Protocols

1) Obtain about two feet of Fused-silica capillary with 75 μm id and 360 μm od

2) Wash with methanol and then air dry using the bomb

3) Cut into one foot or desired length

4) Use a lighter to burn the middle portion (about an inch in length) of the tubing

5) Remove the ash by rinsing with methanol and rubbing with a Kimwipe

6) Make sure the laser puller has been on for at least 30 minutes before use to allow

for the instrument to warm up

7) Place capillary in instrument with the burnedexposed portion in the center

making sure that the length of the capillary is pulled taut

8) Enter desired program (next page) and press pull

229

Laser Puller Programs

Program 99 (default lab program)

Heat Filament Velocity Delay Pull

250 0 25 150 15

240 0 25 150 15

235 0 25 150 15

245 0 25 150 15

Program 97 (developed for larger inner diameter tips)

Heat Filament Velocity Delay Pull

230 - 25 150 -

220 - 25 150 -

215 - 25 150 8

230

On column Immunodepletion (serum plasma CSF amniotic fluid)

1) Prepare buffer A ldquoDilution Bufferrdquo 10 mM Tris-HCl pH=74 150 mM NaCl

2) Prepare buffer B ldquoStripping Bufferrdquo 01 M Glycine-HCl pH=25

3) Prepare buffer C ldquoNeutralization Bufferrdquo 100 mM Tris-HCl pH=80

4) Determine loading amount (LC1=~1000 ug of serum or plasma less for CSF due

to the increased amount of albumin percentage in CSF)

5) Add Dilution buffer to sample before injection and ensure the sample is proper

pH (~7)

6) Use gradient below

Time A B C Flow Rate

(mLmin)

0rsquo 100 0 0 02

4rsquo59rdquo 100 0 0 02

5rsquo 100 0 0 05

8rsquo59rdquo 100 0 0 05

9rsquo 0 100 0 05

22rsquo 0 100 0 05

22rsquo1rdquo 0 0 100 05

39rsquo 0 0 100 05

7) Store the column in 1x dilution buffer until next use

231

Small Scale Immunodepletion (100 microL of CSF)

1) Use 75 microg (per 100 microL CSF) of Sigma Seppro IgY-14 or IgY-7 bead slurry

2) Add an equal volume of 2x dilution buffer (20 mM Tris-HCl pH=74 300 mM

NaCl) to the starting amount of CSF

3) Add to a spin cup with a filter and allow to mix for 30 minutes

4) Centrifuge at 400 g for 30 seconds and collect the flow through

5) Add 50 microL of 1x dilution buffer mix for 5 minutes centrifuge at 400 g for 30

seconds and collect the flow through

6) Use 1x stripping washes (01 M Glycine-HCl pH=25) centrifuge as before and

discard Repeat four times

7) Use 1x neutralization buffer (100 mM Tris-HCl pH=80) centrifuge as before

and discard Repeat two times

8) Store the beads in the spin column in 1x dilution buffer until next use

232

Alliance Maintenance Protocol

Seal Wash

10 methanol no acetonitrile This wash cleans behind the pump-head seals to

ensure proper lubrication Minimum once per week

1 On instrument interface navigate MenuStatus gt Diag gt Prime SealWsh gt Start

2 Press Stop after 5 minutes

Prime Injector

10 methanol for maintenance high organic solvent for dirty runs (eg 95

acetonitrile) Done before injecting any real samples to ensure no bubbles are

present in the injector Minimum once per week

1 On instrument interface navigate MenuStatus gt Diag gt Prime NdlWsh gt Start

2 After completion press Close

Purge Injector

Solvent is dependent on run Run this protocol at beginning of experiments each day

Minimum once per week for maintenance

1 On instrument interface navigate MenuStatus gt Status screen

2 Navigate Direct Function gt 4 Purge Injector gt OK

3 Set Sample loop volumes 60 leave Compression Check unchecked gt OK

Prime Solvent Pumps

Solvent is dependent on run If solvents are changed run this protocol Minimum

once per week for maintenance

1 On instrument interface navigate MenuStatus gt Status screen

2 Using arrow keys choose composition A type 100 Enter x4

3 Navigate Direct Function gt 3 Wet Prime gt OK

4 Set Flow Rate 7000 mLmin Time 100 min gt OK

5 Repeat for all changedactive solvent pumps

Condition Column

Dependent on user Use starting conditions for run

1 On instrument interface navigate MenuStatus gt Status screen

2 Using arrow keys type starting solvent compositions for run

3 Navigate Direct Function gt 6 Condition Column gt OK

4 Set Time as desired

233

Appendix 2

List of Publications and Presentations

234

PUBLICATIONS

ldquoDiscovery and characterization of the crustacean hyperglycemic hormone precursor related

peptides (CPRP) and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes

sapidus using multiple tandem mass spectrometry techniquesrdquo Hui L Cunningham R Zhang

Z Cao W Jia C Li L J Proteome Res 2011 10(9) 4219-29

ldquoInvestigation and reduction of sub-microgram peptide loss using molecular weight cut-off

fractionation prior to mass spectrometric analysisrdquo Cunningham R Wang J Wellner D Li L

Journal of Mass Spectrometry In Press

ldquoMass spectrometry-based proteomics and peptidomics for systems biology and biomarker

discoveryrdquo Cunningham R Ma D Li L Frontiers in Biology 2012 7(4) 313-35

ldquoProtein changes in immunodepleted cerebrospinal fluid from transgenic mouse models of

Alexander disease detected using mass spectrometryrdquo Cunningham R Jany P Messing A Li

L Journal of Proteome Research Submitted

ldquoInvestigation of the differences in the phosphoproteome between starved vs glucose fed

Saccharomyces cerevisiaerdquo Cunningham R Grunwald D Wellner D Conway M Heideman

W Li L In preparation

ldquoIdentification of astrocytic JNK targets by phosphoproteomics using mass spectrometryrdquo

Cunningham R Dowell J Wang J Wellner D Li L Milhelm M In preparation

ldquoGenomic and proteomic profiling of rat adapted scrapierdquo Herbst A Cunningham R Wellner

D Wang J Ma D Li L Aiken J In preparation

235

INVITED SEMINARS AND CONFERENCE PRESENTATIONS

Robert Cunningham Davenne Mayour and Shoshanna Coon ldquoMorphology and Thermal

Stability of Monolayers on Porous Siliconrdquo The 231th

ACS National Meeting 2006 Atlanta

GA

Robert Cunningham Xin Wei Paige Jany Albee Messing and Lingjun Li ldquoMass

Spectrometry-Based Analysis of Cerebrospinal Fluid Peptidome and Proteome for Biomarker

Discovery in Alexander Diseaserdquo The 57th

ASMS Conference 2009 Philadelphia PA

Robert Cunningham ldquoWhat to Expect in a PhD Graduate Schoolrdquo Invited seminar University

of Northern Iowa 2010 Cedar Falls IA

Robert Cunningham Dustin Frost Albee Messing and Lingjun Li ldquoInvestigation of an

Optimized ProteaseMAX Assisted Trypsin Digestion of Human CSF for Pseudo-SRM

Quantification of GFAP and Identification of Biomarkersrdquo The 58th

ASMS Conference 2010

Salt Lake City UT

Robert Cunningham Paige Jany Albee Messing and Lingjun Li ldquoMass Spectrometry-Based

Analysis of GFAP Overexpressor Micersquos Cerebrospinal Fluid for Protein Biomarker Discovery

in Alexander Diseaserdquo Oral presentation at Pittcon Conference and Expo 2011 2011 Atlanta

GA

Robert Cunningham Michael Conway Daniel Wellner Douglas Grunwald Warren

Heideman and Lingjun Li ldquoDevelopment of optimized phosphopeptide enrichment methods for

comparison of starved and glucose fed yeast Saccharomyces cerevisiaerdquo The 59th

ASMS

Conference 2011 Denver CO

Robert Cunningham Paige Jany Albee Messing and Lingjun Li ldquoMass Spectrometry-Based

Analysis of GFAP Overexpressor Micersquos Cerebrospinal Fluid for Protein Biomarker Discovery

in Alexander Diseaserdquo 2011 Wisconsin Human Proteomics Symposium 2011 Madison WI

Page 5: Mass Spectrometry Applications for Comparative Proteomics

iv

Mass Spectrometry Applications for Comparative Proteomics and

Peptidomic Discovery

Robert Stewart Cunningham

Under the supervision of Professor Lingjun Li

At the University of Wisconsin-Madison

Abstract

In this thesis multiple biological samples from various diseases models or

treatments are investigated using shotgun proteomics and improved methods are

developed to enable extended characterization and detection of neuropeptides In general

this thesis aims to expand upon the rapidly evolving field of mass spectrometry (MS)-

based proteomics and peptidomics by primarily enhancing small scale sample analysis

A review of the current status and progress in the field of biomarker discovery in

peptidomics and proteomics is presented To this rapidly expanding body of literature

our critical review offers new insights into MS-based biomarker studies investigating

numerous biological samples methods for post-translational modifications quantitative

proteomics and biomarker validation Methods are developed and presented including

immunodepletion for small volume cerebrospinal fluid (CSF) samples for comparison of

the CSF proteomes between an Alexander disease transgenic mouse model with

overexpression of the glial fibrillary acidic protein and a control animal This thesis also

covers the application of the small scale immunodepletion of CSF for comparative

proteomic analysis of a novel rat adapted scrapie (RAS) model for prion disease and

v

compares the RAS CSF proteome to control rat CSF using MS Large scale

phosphoproteomics of starved vs glucose fed yeast is presented to better understand the

phosphoproteome changes that occur during glucose feeding Method development for

neuropeptide analysis is expanded upon using electron transfer dissociation (ETD)

fragmentation to successfully sequence for the first time the crustacean hyperglycemic

hormone precursor-related peptide (CPRP) from the blue crab Callinectes sapidus In

addition a method for ETD sequencing of sulfonated neuropeptides using a magnesium

salt adduct in an ion trap mass spectrometer is reported This thesis also reports on a

method for sub-microg peptide isolation when using a molecular weight cut-off filtration

device to improve sample recovery by over 2 orders of magnitude All the protocols used

throughout the work are provided in an easy to use step-by-step format in the Appendix

Collectively this body of work extends the capabilities of mass spectrometry as a

bioanalytical tool for shotgun proteomics and expands upon methods for neuropeptide

discovery and analysis

1

Chapter 1

Introduction Brief Background and Research Summary

2

Abstract

Mass spectrometry based comparative proteomics and improved methods for

neuropeptide discovery have been reported in this thesis A very brief introduction of crustacean

neuropeptidomics is covered in this chapter followed by Chapter 2 which covers in great detail

comparative proteomics using mass spectrometry with an emphasis on biomarker discovery

Chapter 3 reports a novel immunodepletion technique used for comparative proteomics between

glial fibrillary acidic protein (GFAP) overexpressor and control mouse cerebrospinal fluid (CSF)

Chapter 4 involves rat CSF comparative proteomics between rat adapted scrapie and control

animals Chapter 5 details comparisons of phosphoproteomes in Saccharomyces cerevisiae

(Bakerrsquos yeast) between starved vs glucose fed conditions Chapter 6 investigates the use of

electron transfer dissociation (ETD) for kDa sized neuropeptides and neuropeptides with tyrosine

sulfonation Chapter 7 presents a molecular weight cut-off (MWCO) protocol to allow sub-microg

peptides to be recovered Chapter 8 provides a conclusion to this thesis and outlines future

directions for certain projects

3

Background

Mass spectrometry (MS) requires gas phase ions for experimental measurement and

intact large nonvolatile biomolecules proved difficult to be analyzed by electron ionization or

chemical ionization until the invention of two soft ionization techniques matrix-assisted laser

desorptionionization (MALDI)1 and electrospray ionization (ESI)

2 ESI and MALDI are the

two most common soft ionization techniques for mass spectrometry Once ionized molecules

such as peptides or proteins can be separated by their mass to charge ratios (mz) using various

mass analyzers manipulating electric andor magnetic fields ESI and MALDI mass

spectrometric techniques have become central analytical methods in biological sciences because

they are suitable for nonvolatile thermally labile analytes in femtomole quantities3 ESI allows

the coupling of high pressure liquid chromatography and the constant flow of solvent is

electrically charged at the outlet to produce a Taylor cone4 During desolvation the Raleigh

limit is reached and a coulombic explosion occurs commonly producing multiply charged ions

A modification of ESI is nanoLC-ESI which operates at nLmin flow rates to analyze low sample

amounts such as proteins or peptides in biological samples5 NanoLC-ESI uses less solvent as

the source because of the reduced flow rate of standard LCMS with an ESI source and nanoLC-

ESI is the predominate inlet for shotgun proteomic ESI MS experiments6 Conversely MALDI

can use sub-microL volume of sample and be co-crystallized with an ultraviolet absorbing organic

matrix that is obliterated using a laser at appropriate wavelength to generate gas phase ions

Alternatively MALDI has the unique capability to work with tissue samples and ionize in the

solid state instead of liquid like ESI

4

Mass analyzers require an operating pressure between 10-4

-10-10

Torr to allow proper ion

transfer and separationisolation of the ion based upon its mz Numerous mass analyzers are

currently available and each have their own strengths and weaknesses as shown in Figure 1 The

biomolecules are separated by the mass analyzers and detected without fragmentation which is

termed MS analysis Following MS detection tandem mass spectrometry (MSMS) of the

original precursor ion can be performed to provide additional structural information such as a

ladder sequence of amino acids for peptides Numerous fragmentation techniques are available

for proteomics such as collision induced dissociation (CID) electron transfer dissociation (ETD)

or high energy collision induced dissociation (HCD) Each of these fragmentation techniques

have their own benefits in proteomics experiments and are discussed in depth in Chapter 2 The

background and current status for comparative proteomics with specific emphasis on biomarker

analysis are covered in Chapter 2

Neuropeptidomic Method Development in the Crustacean Model System

Utilizing Mass Spectrometry

Chapter 6 and 7 focus on sample preparation and ETD fragmentation techniques to

characterize neuropeptides in the neuroendocrine organs in the crustacean nervous system

Neuropeptides are short chains of amino acids which are a diverse class of cell-cell signaling

molecules in the nervous system Neuropeptides have been investigated for being involved in

numerous physiological processes such as memory7 learning

8 depression

9 pain

10 reward

11

reproduction12

sleep-wake cycles13

homeostasis14

and feeding15-17

Figure 2 depicts how

neuropeptides are synthesized as pre-prohormones in the rough endoplasmic reticulum and

5

packaged in the Golgi apparatus After being packaged these pre-prohormones are processed

into bioactive peptides within the vesicle which is occurring during vesicular transport down an

axon and released into the synaptic cleft Secreted neuropeptides stimulate the post synaptic

neurons by interacting with G-protein coupled receptors at the chemical synapse

The crustacean model nervous system is well-defined neural network which has been

used in neurobiology and neuroendocrine (neurosecretory) research and thus lends itself for

studying neuromodulation18-22

Figure 3 shows the locations of several neuroendocrine organs in

the crustacean Callinectes sapidus including the sinus gland (SG) which is studied in Chapter 6

The Lingjun Li lab has focused on neuropeptide discovery and function in crustacean

neuroendocrine organs using mass spectrometry23-25

The work presented in Chapters 6 and 7

expand on sample preparation and analytical tools to further investigate the neuropeptidome

Research Overview

Comparative Proteomics of Biological Samples

Chapter 2 provides a thorough review of comparative proteomics and biomarker analysis

using mass spectrometry The scientific community has shown great interest in the field of mass

spectrometry-based proteomics and peptidomics for its applications in biology Proteomics

technologies have evolved to generate large datasets of proteins or peptides involved in various

biological and disease progression processes producing testable hypotheses for complex

biological questions This chapter provides an introduction and insight into relevant topics in

proteomics and peptidomics including biological material selection sample preparation

separation techniques peptide fragmentation post-translational modifications quantification

6

bioinformatics and biomarker discovery and validation In addition current literature and

remaining challenges and emerging technologies for proteomics and peptidomics are discussed

Chapter 3 investigates comparative proteomics between a GFAP overexpressor mouse

model and a control mouses cerebrospinal fluid (CSF) CSF is a low protein content biological

fluid with dynamic range spanning at least nine orders of magnitude in protein content and is in

direct contact with the brain but consist of very abundant proteins similar to serum which require

removal A modified IgY-14 immunodepletion treatment is presented to remove abundant

proteins such as albumin to enhance analysis of the low volumes of CSF that are obtainable

from mice These GFAP overexpressor mice are models for Alexander disease (AxD) a severe

leukodystrophy in humans From the CSF of control and transgenic mice we present the

identification of 266 proteins with relative quantification of 106 proteins Biological and

technical triplicates are performed to address animal variability as well as reproducibility in mass

spectrometric analysis Relative quantitation is performed using distributive normalized spectral

abundance factor (dNSAF) spectral counting analysis A panel of biomarker proteins with

significant changes in the CSF of GFAP transgenic mice are identified with validation from

ELISA and microarray data demonstrating the utility of our methodology and providing

interesting targets for future investigations on the molecular and pathological aspects of AxD

Chapter 4 explores the CSF proteomics of a novel prion model called rat adapted scrapie

(RAS) A similar IgY immunodepletion technique is presented as in Chapter 3 and is similarly

used to reduce the abundant proteins in CSF CSF from control and RAS (biological N=5

technical replicates N=3) were digested and separated using one dimensional reversed-phase

nanoLC separation In total 512 proteins 167 non-redundant protein groups and 1032 unique

peptides are identified with a 1 FDR Comparative analysis was done using dNSAF spectral

7

counting and 21 proteins were significantly up or down-regulated The proteins are compared to

the 1048 differentially regulated genes and additionally compared to previously published

proteins showing changes consistent with other prion animal models Of particular interest is

RAS specific proteins like RNAseT2 which is not up-regulated in mouse prion disease but is

designated as upregulated in both the genomic and proteomics data for RAS

Chapter 5 explores comparative proteomics between starved and glucose fed

Saccharomyces cerevisiae (bakers yeast) Yeast depends on nutrients such as glucose to

survive and need to cycle between states of growth and quiescence Previous work by the

Heideman lab investigated the transcriptional response to fresh glucose in yeast26

Kinases such

as protein kinase A (PKA) and target of rapamycin kinase (TOR) are involved in the glucose

response so we described a large scale phosphoproteomic MS based study in this chapter

Yeast cell extract was digested and phosphopeptides were enriched by immobilized metal

affinity chromatography (IMAC) followed by two dimensional high pH-low pH reversed phase

(RP)-RP separation The low pH separation was infused directly into an ion trap mass

spectrometer with neutral loss triggered ETDfragmentation Specifically the CID fragmentation

can cause a neutral loss of 98z from the H3PO4 and can produce an incomplete fragmentation

pattern and lead to ambiguous identification so when the neutral loss is detected ETD MSMS

fragmentation is performed The neutral loss triggered ETD fragmentation is included in this

study to improve phosphopeptide identifications In total 477 phosphopeptides are identified

with 06 FDR and 669 phosphopeptides identified with 54 FDR Motif analysis of PKA and

phosphosite validation are performed as well

8

The future of comparative proteomics investigating small sample amounts or PTMs is

promising Further advances in enrichment separations science mass spectrometry

analyzersdetectors and bioinformatics will continue to create more powerful tools that enable

digging deeper in proteomics with both small (mouse CSF) and large (cell extracts) sample

amounts

Methods for Neuropeptide Analysis Using ETD fragmentation and Sample

Preparation

Chapter 6 investigates the utility of ETD fragmentation to sequence endogenous large

neuropeptides and labile tyrosine sulfation in the crustacean Callinectes sapidus The sinus

gland (SG) of the crustacean is a well-defined neuroendocrine site that produces numerous

hemolymph-borne agents including the most complex class of endocrine signaling moleculesmdash

neuropeptides One such peptide is the crustacean hyperglycemic hormone (CHH) precursor-

related peptide (CPRPs) which was not previously sequenced Electron transfer dissociation

(ETD) is used for sequencing of intact CPRPs due to their large size and high charge state In

addition ETD is used to sequence a sulfated peptide cholecystokinin (CCK)-like peptide in the

lobster Homarus americanus using a salt adduct Collectively this chapter presents two

examples of the utility of ETD in sequencing neuropeptides with large molecular weight or with

labile modifications

Chapter 7 reports a protocol on improving recovery of minute quantities of peptides by

adding methanol and a salt modifier (NaCl) to molecular weight cut-off membrane-based

centrifugal filters (MWCO) to enrich sub-microgram peptide quantities In some biological

9

fluids such as CSF the endogenous peptide content is very low and using pure water to perform

the MWCO separation produces too much sample loss Using a neuropeptide standard

bradykinin sample loss is reduced over two orders of magnitude with and without undigested

protein present The presence of bovine serum albumin (BSA) undigested protein and the

bradykinin standard lends evidence that sample loss occurs because of the MWCO and not the

presence of the protein Additionally a BSA tryptic digestion is presented where twenty-seven

tryptic peptides are identified from MALDI mass spectra after enriching with methanol while

only two tryptic peptides are identified after the standard MWCO protocol The strategy

presented in Chapter 7 greatly enhances recovery from MWCO separation for sub-microg peptide

samples

10

References

1 Tanaka K Waki H Ido Y Akita S Yoshida Y Yoshida T Matsuo T Protein and polymer analyses up to mz 100 000 by laser ionization time-of-flight mass spectrometry Rapid Communications in Mass Spectrometry 1988 2 (8) 151-153

2 Fenn J B Mann M Meng C K Wong S F Whitehouse C M Electrospray ionization for mass spectrometry of large biomolecules Science 1989 246 (4926) 64-71

3 Domon B Aebersold R Mass spectrometry and protein analysis Science 2006 312 (5771) 212-7

4 Whitehouse C M Dreyer R N Yamashita M Fenn J B Electrospray interface for liquid chromatographs and mass spectrometers Anal Chem 1985 57 (3) 675-9

5 Wilm M Mann M Analytical properties of the nanoelectrospray ion source Anal Chem 1996 68 (1) 1-8

6 Karas M Bahr U Dulcks T Nano-electrospray ionization mass spectrometry addressing analytical problems beyond routine Fresenius J Anal Chem 2000 366 (6-7) 669-76

7 Gulpinar M Yegen B The physiology of learning and memory Role of peptides and stress Curr Protein Pept Sci 2004 5 (6) 457-473

8 Ogren S O Kuteeva E Elvander-Tottie E Hokfelt T Neuropeptides in learning and memory processes with focus on galanin In Eur J Pharmacol 2010 Vol 626 pp 9-17

9 Strand F Neuropeptides general characteristics and neuropharmaceutical potential in treating CNS disorders Prog Drug Res 2003 61 1-37

10 Li W Chang M Peng Y-L Gao Y-H Zhang J-N Han R-W Wang R Neuropeptide S produces antinociceptive effects at the supraspinal level in mice Regul Pept 2009 156 (1-3) 90-95

11 Wu C-H Tao P-L Huang E Y-K Distribution of neuropeptide FF (NPFF) receptors in correlation with morphine-induced reward in the rat brain Peptides 2010 31 (7) 1374-1382

12 Tsutsui K Bentley G E Kriegsfeld L J Osugi T Seong J Y Vaudry H Discovery and evolutionary history of gonadotrophin-inhibitory hormone and kisspeptin new key neuropeptides controlling reproduction J Neuroendocrinol 2010 22 (7) 716-727

13 Piper D C Upton N Smith M I Hunter A J The novel brain neuropeptide orexin-A modulates the sleep-wake cycle of rats Eur J Neurosci 2000 12 (2) 726-730

14 Tsuneki H Wada T Sasaoka T Role of orexin in the regulation of glucose homeostasis - Tsuneki - 2009 - Acta Physiologica - Wiley Online Library Acta Physiol 2010

15 Kageyama H Takenoya F Shiba K Shioda S Neuronal circuits involving ghrelin in the hypothalamus-mediated regulation of feeding Neuropeptides 2010 44 (2) 133-138

16 Sweedler J V Li L Rubakhin S S Alexeeva V Dembrow N C Dowling O Jing J Weiss K R Vilim F S Identification and characterization of the feeding circuit-activating peptides a novel neuropeptide family of aplysia J Neurosci 2002 22 (17) 7797-7808

11

17 Jensen J Regulatory peptides and control of food intake in non-mammalian vertebrates Comp Biochem Physiol Part A Mol Integr Physiol 2001 128 (3) 469-477

18 Billimoria C P Li L Marder E Profiling of neuropeptides released at the stomatogastric ganglion of the crab Cancer borealis with mass spectrometry J Neurochem 2005 95 (1) 191-199

19 Cape S S Rehm K J Ma M Marder E Li L Mass spectral comparison of the neuropeptide complement of the stomatogastric ganglion and brain in the adult and embryonic lobster Homarus americanus J Neurochem 2008 105 (3) 690-702

20 Cruz Bermuacutedez N D Fu Q Kutz Naber K K Christie A E Li L Marder E Mass

spectrometric characterization and physiological actions of GAHKNYLRFamide a novel FMRFamide‐like peptide from crabs of the genus Cancer J Neurochem 2006 97 (3) 784-799

21 Grashow R Brookings T Marder E Reliable neuromodulation from circuits with variable underlying structure Proc Natl Acad Sci USA 2009 106 (28) 11742-11746

22 Ma M Szabo T M Jia C Marder E Li L Mass spectrometric characterization and physiological actions of novel crustacean C-type allatostatins Peptides 2009 30 (9) 1660-1668

23 Chen R Hui L Cape S S Wang J Li L Comparative Neuropeptidomic Analysis of Food Intake via a Multi-faceted Mass Spectrometric Approach ACS Chem Neurosci 2010 1 (3) 204-214

24 Li L Sweedler J V Peptides in the brain mass spectrometry-based measurement approaches and challenges Annu Rev Anal Chem 2008 1 451-483

25 Ma M Wang J Chen R Li L Expanding the crustacean neuropeptidome using a multifaceted mass spectrometric approach J Proteome Res 2009 8 (5) 2426-2437

26 Slattery M G Heideman W Coordinated regulation of growth genes in Saccharomyces cerevisiae Cell Cycle 2007 6 (10) 1210-9

12

Figure 1 Capabilities and performances of certain mass analyzers Check marks indicate

availability check marks in parentheses indicate optional + ++ and +++ indicate possible or

moderate goodhigh and excellentvery high respectively Adapted with permission from

reference 3

13

Figure 2 Synthesis processing and release of neuropeptides (a) Cartoon showing two

interacting neurons (b) The synthesis of neuropeptides in the neuronal organelles and their

transport down the axon to the presynaptic terminal is depicted (c) Shows neuropeptide release

and interaction with the dendrite of the post-synaptic cell (Adapted from PhD thesis by Dr

Stephanie Cape)

14

Figure 3 Schematic showing location of the neuronal tissues being analyzed in the MS studies

of neuropeptides in Callinectes sapidus The SGs (sinus glands located in the eyestalks of the

crab) and the POs (pericardial organs located in the chamber surrounding the heart) release

neurohormones into the hemolymph (crab circulatory fluid) The brain is near the STNS

(stomatogastric nervous system neural network that controls the motion of the gut and foregut)

which has direct connections to the STG (stomatogastric ganglion) The STG is located in an

artery so it is continually in contact with hemolymph (Adapted from PhD thesis by Dr Robert

Sturm)

15

Chapter 2

Mass Spectrometry-based Proteomics and Peptidomics for Biomarker

Discovery and the Current State of the Field

Adapted from ldquoMass spectrometry-based proteomics and peptidomics for systems biology and

biomarker discoveryrdquo Cunningham R Ma D Li L Frontiers in Biology 2012 7(4) 313-35

16

Abstract

The scientific community has shown great interest in the field of mass spectrometry-based

proteomics and peptidomics for its applications in biology Proteomics technologies have

evolved to produce large datasets of proteins or peptides involved in various biological and

disease progression processes producing testable hypothesis for complex biological questions

This review provides an introduction and insight to relevant topics in proteomics and

peptidomics including biological material selection sample preparation separation techniques

peptide fragmentation post-translation modifications quantification bioinformatics and

biomarker discovery and validation In addition current literature and remaining challenges and

emerging technologies for proteomics and peptidomics are presented

17

Introduction

The field of proteomics has seen a huge expansion in the last two decades Multiple factors have

contributed to the rapid expansion of this field including the ever evolving mass spectrometry

instrumentation new sample preparation methods genomic sequencing of numerous model

organisms allowing database searching of proteomes improved quantitation capabilities and

availability of bioinformatic tools The ability to investigate the proteomes of numerous

biological samples and the ability to generate future hypothesis driven experiments makes

proteomics and biomarker studies exceedingly popular in biological studies today In addition

the advances in post-translational modification (PTM) analysis and quantification ability further

enhance the utility of mass spectrometry (MS)-based proteomics A subset of proteomics

research is devoted to profiling and quantifying neurologically related proteins and endogenous

peptides which has progressed rapidly in the past decade This review provides a general

overview as outlined in Figure 1 of proteomics technology including methodological and

conceptual improvements with a focus on recent studies and neurological biomarker studies

Biological Material Selection

The choice of biological matrix is an important first step in any proteomics analysis The

ease of sample collection (eg urine plasma or saliva) versus usefulness or localization of

sample (eg specific tissue or proximity fluid) needs to be evaluated early on in a study design

Plasma derived by centrifugation of blood to remove whole cells is a very popular

choice in proteomics due to the high protein content (~65 mg ml1) and the ubiquitous nature of

blood in the body and the ability to obtain large sample amounts or various time points without

the need to sacrifice the animal or to perform invasive techniques Plasma is centrifuged

18

immediately after sample collection unlike serum where coagulation needs to occur first To

obtain plasma blood is collected in a tube with an anticoagulant added (ETDA heparin or

citrate) and centrifuged but previous reports have shown variable results when heparin has been

used as an anticoagulant2 Human Proteome Organization (HUPO) specifically recommends the

anticoagulants EDTA or citrate to treat plasma3 4

One of the primary concerns with plasma is

degradation of the protein content via endogenous proteases found in the sample5 One way to

address this problem is the use of protease inhibitors In addition freezethaw cycles need to be

minimized to prevent protein degradation and variability6 7

Plasma proteomics has seen

extensive coordinated efforts to start assessing the diagnostic needs using plasma8 HUPO also

has established a public human database for plasma and serum proteomics from 35 collaborating

labratories9 Large dynamic range studies have been performed on plasma with a starting sample

amount of 2625 microl (1575 mg) resulting in 3654 proteins identified with a sub 5 false

discovery rate10

The large dynamic range spanning across eleven orders of magnitude as visualized in

Figure 2 is one of the biggest obstacles in plasma proteomics Figure 2 also shows that as lower

abundance proteins are investigated the origins of those identified proteins are more diverse than

the most abundant proteins Recent mining of the plasma proteome showed an ability to search

for disease biomarker applications across seven orders of magnitude In addition the tissue of

origin for the identified plasma proteins were identified and its origin was more diverse as the

protein concentration decreased11

Plasma has been used as a source for biomarker studies such

as colorectal cancer12 13

cardiovascular disease14

and abdominal aortic aneurysm15

Even

though the blood brain barrier prevents direct blood to brain interaction neurological disorders

such as Alzheimerrsquos disease (AD) have had their proteomes studied using plasma16

19

An alternative sample derived from blood is serum which is plasma allowed to coagulate

instead of adding anti-coagulates The time for coagulation is usually 30 minutes and during that

time significant and random degradation from endogenous proteases can occur The additional

variability caused from the coagulation process can change the concentration of multiple

potentially valuable biomarkers As biodiversity between samples or organisms is a challenging

endeavor additional sample variability due to serum generation may be undesirable but serum is

still currently being used for biomarker disease studies17

Serum has been used to compare the

proteome differences in neurological diseases such as AD Parkinsonrsquos disease and amyotrophic

lateral sclerosis and a review can be found elsewhere discussing the subject18

Cerebrospinal fluid (CSF) has a long history as a surrogate biopsy of brain or spinal cord

in evaluating diseases of the central nervous system and has been used for studies in neurological

disorders due to being a rich source of neuro-related proteins and peptides19

The protein

composition of the most abundant proteins in CSF is well defined and numerous studies exist to

broaden the proteins identified20-22

CSF has an exceedingly low protein content (~04 μgμL)

which is ~100 times lower than serum or plasma and over 60 of the total protein content in

CSF consists of a single protein albumin23-25

In addition the variable concentrations of proteins

span up to twelve orders of magnitude further complicating analysis and masking biologically

relevant proteins to any given study26

One of the highest number of identified proteins is from

Schutzer et al with 2630 non-redundant proteins from 14 mL of pooled human CSF This study

involved the removal of highly abundant proteins by performing IgY-14 immunodepletion

followed by two dimensional (2D) liquid chromatography (LC) separation27

Studies have also

been performed to characterize individual biomarkers or complex patterns of biomarkers in

various diseases in the CSF28 29

One potential pitfall of CSF proteomic analysis is

20

contamination from blood which can be identified by counting red blood cells present or

examining surrogate markers from blood contamination other than hemoglobin such as

peroxiredoxin catalase and carbonic anhydrase30

A proof of principle CSF peptidomics study

identified numerous endogenous peptides associated with the central nervous system which can

be used as a bank for neurological disorder studies31

Numerous recent reports highlighted the

utility of CSF analysis for biomarker studies in AD32 33

medulloblastoma34

both post-mortem

and ante-mortem35

Cellular lysates offer the distinct advantage to work with a cell line yeast or bacteria

with large amounts of proteins available for analysis36 37

with Saccharomyces cerevisiae being

the most common cell lysate38 39

Other cell lines are also used including HeLa40

and E coli41

The ability to obtain milligrams of proteins easily to scale up experiments without animal

sacrifice offers a clear advantage in biological sample selection Current literature supports

cellular lysate as a valued and sought after source of proteins for large scale proteomics

experiments because of the ability to assess treatments conditions and testable hypotheses42-44

Cellular lysate from rat B104 neuroblastoma cell line was used as an in vitro model for cerebral

ischemia and showed abundance changes in multiple proteins involved in various neurological

disorders45

Other Sources of Biological Samples

Urine

The urine proteome appears to be another attractive reservoir for biomarker discovery

due to the relatively low complexity compared with the plasma proteome and the noninvasive

collection of urine Urine is often considered as an ideal source to identify biomarkers for renal

21

diseases due to the fact that in healthy adults approximately 70 of the urine proteome originate

from the kidney and the urinary tract 46

thus the use of urine to identify neurological disorders is

neglected However strong evidence have shown that proteins that are associated with

neurodegenerative diseases can be excreted in the urine47-49

indicating the application of urine

proteomics could be a useful approach to the discovery of biomarkers and development of

diagnostic assays for neurodegenerative diseases However the current view of urine proteome

is still limited by factors such as sample preparation techniques and sensitivity of the mass

spectrometers There has been a tremendous drive to increase the coverage of urine proteome

In a recent study Court et al compared and evaluated several different sample preparation

methods with the objective of developing a standardized robust and scalable protocol that could

be used in biomarkers development by shotgun proteomics50

In another study Marimuthu et al

reported the largest catalog of proteins in urine identified in a single study to date The

proteomic analysis of urine samples pooled from healthy individuals was conducted by using

high-resolution Fourier transform mass spectrometry A total of 1823 proteins were identified

of which 671 proteins have not been previously reported in urine 51

Saliva

For diagnosis purposes saliva collection has the advantage of being an easy and non-

invasive technique The recent studies on saliva proteins that are critically involved in AD and

Parkinsonrsquos diseases suggested that saliva could be a potentially important sample source to

identify biomarkers for neurodegenerative diseases Bermejo-Pareja et al reported the level of

salivary Aβ42 in patients with mild AD was noticeably increased compared to a group of

controls 52

In another study Devic et al identified two of the most important Parkinsons

22

disease related proteinsmdash α-synuclein (α-Syn) and DJ-1 in human saliva 53

They observed that

salivary α-Syn levels tended to decrease while DJ-1 levels tended to increase in Parkinsons

disease The published results from this study also suggest that α-Syn might correlate with the

severity of motor symptoms in Parkinsons disease Due in part to recent advancements in MS-

based proteomics has provided promising results in utilizing saliva to explore biomarkers for

both local and systemic diseases 54 55

the further profiling of saliva proteome will provide

valuable biomarker discovery source for neurodegenerative diseases

Tissue

Compared to body fluids such as plasma serum and urine where the proteomic analysis

is complicated by the wide dynamic range of protein concentration the analysis of tissue

homogenates using the well-established and conventional proteomic analysis techniques has the

advantage of reduced dynamic range However the homogenization and extraction process may

suffer from the caveat that spatial information is lost which would be inadequate for the

detection of biomarkers whose localization and distribution play important roles in disease

development and progression Matrix-assisted laser desorptionionization (MALDI) imaging

mass spectrometry (IMS) is a method that allows the investigation of a wide range of molecules

including proteins peptides lipids drugs and metabolites directly in thin slices of tissue 56-59

Because this technology allows for identification and simultaneous localization of biomolecules

of interests in tissue sections linking the spatial expression of molecules to histopathology

MALDI-IMS has been utilized as a powerful tool for the discovery of new cancer biomarker

candidates as well as other clinical applications60 61

The utilization of MALDI-IMS for human

or animal brain tissue to identify or map the distribution of molecules related to

neurodegenerative diseases were also recently reported62 63

23

Secretome

There has been an increasing interest in the study of proteins secreted by various cells

(the secretomes) from tissue-proximal fluids or conditioned media as a potential source of

biomarkers Cell secretomes mainly comprise proteins that are secreted or are shed from the cell

surface and these proteins can play important role in both physiological processes (eg cell

signaling communication and migration) and pathological processes including tumor

angiogenesis differentiation invasion and metastasis In particular the study of cancer cell

secretomes by MS based proteomics has offered new opportunities for cancer biomarker

discovery as tumor proteins may be secreted or shed into the bloodstream and could be used as

noninvasive biomarkers The latest advances and challenges of sample preparation sample

concentration and separation techniques used specifically for secretome analysis and its clinical

applications in the discovery of disease specific biomarkers have been comprehensively

reviewed64 65

Here we only highlight the proteomic profiling of neural cells secretome that has

been applied to neurosciences for a better understanding of the roles secreted proteins play in

response to brain injury and neurological diseases The LC-MS shotgun identification of

proteins released by astrocytes has been recently reported66-68

In these studies the changes

observed in the astrocyte secretomes induced by inflammatory cytokines or cholinergic

stimulation were investigated6667

Alternatively our group performed 2D-LC separation and

included cytoplasmic protein extract from astrocytes as a control to identify cytoplasmic protein

contaminants which are not actively secreted from cells68

Sample Preparation

24

Proteomic analysis and biomarker discovery research in biological samples such as body

fluids tissues and cells are often hampered by the vast complexity and large dynamic range of

the proteins Because disease identifying biomarkers are more likely to be low-abundance

proteins it is imperative to remove the high-abundance proteins or apply enrichment techniques

to allow detection and better coverage of the low-abundance proteins for MS analysis Several

strategies including depletion and protein equalizer approach have been used during sample

preparation to reduce sample complexity69 70

and the latest advances of these methods have been

reviewed by Selvaraju et al 71

Alternatively the complexity of biological samples can be

reduced by capturing a specific subproteome that may have the biological information of interest

The latter strategy is especially useful in the biomarker discovery where the changes in the

proteome are not solely reflected through the concentration level of specific proteins but also

through changes in the post-translational modifications (PTMs) Here we will mainly discuss

the enrichment of phosphoproteinpeptides glycoproteinpeptides and sample preparation for

peptidomics and membrane proteins

Phosphoproteomics

Phosphorylation can act as a molecular switch on a protein by turning it on or off within

the cell It is thought that up to 30 of the proteins can be phosphorylated72

and it plays

significant roles in such biological processes as the cell cycle and signal transduction73

Currently tens of thousands of phosphorylation sites can be proposed using analytical methods

available today74 75

The amino acids that are targeted for phosphorylation studies are serine

threonine and tyrosine with the abundance of detection decreasing typically in that order Other

25

amino acids have been reported to be phosphorylated but traditional phosphoproteomics

experiments ignore these rare events76

In a typical large-scale phosphoproteomics experiment the sample size is usually in

milligram amounts to account for the low stoichiometry of phosphorylated proteins The large

amount of protein is then digested typically with trypsin but alternatively experiments have

been performed with Lys-C digestion to produce large enzymatic peptides The larger peptides

produced from Lys-C render higher charged peptides during electrospray ionization (ESI) and

allow improved electron-based fragmentation to determine specific sites of phosphorylation77

From the pool of peptides phosphopeptides must be enriched otherwise they will be masked by

the vast number and higher ionization efficiency of non-phosphorylated peptides The two most

common enrichment techniques are immobilized metal ion affinity chromatography (IMAC) and

metal oxide affinity chromatography (MOAC) TiO2 being the most common oxide used for this

purpose A recent study reported that phosphorylation of neuronal intermediate filament proteins

in neurofibrillary tangles are involved in Alzheimerrsquos disease78

Glycoproteomics

Protein glycosylation is one of the most common and complicated forms of PTM Types

of protein glycosylation in eukaryotes are categorized as either N-linked where glycans are

attached to asparagine residues in a consensus sequence N-X-ST (X can be any amino acid

except proline) via an N-acetylglucosamine (N-GlcNAc) residue or the O-glycosylation where

the glycans are attached to serine or threonine Glycosylation plays a fundamental role in

numerous biological processes and aberrant alterations in protein glycosylation are associated

with neurodegenerative disease states such as Creutzfeld-Jakob Disease (CJD) and AD79 80

26

Due to the low abundance of glycosylated forms of proteins compared to non-glycosylated

proteins it is essential to enrich glycoproteins or glycopeptides in complex biological samples

prior to MS analysis Two of the most common enrichment methods used in glycoproteomics are

lectin affinity chromatography (LAC) and hydrazide chemistry The detailed methodologies of

LAC hydrazide chemistry and other enrichment methods in glycoproteomics have been

extensively reviewed in the past81 82

In particular LAC is of great interest in studies of

glycosylation alterations as markers of AD and other neurodegenerative diseases due to its recent

applications in brain glycoproteomics83

Our group has utilized multi-lectin affinity

chromatography containing concanavalin A (ConA) and wheat germ agglutinin (WGA) to enrich

N-linked glycoproteins in control and prion-infected mouse plasma84

This method enabled us to

identify a low-abundance glycoprotein serum amyloid P-component (SAP) PNGase F digestion

and Western blotting validation confirmed that the glycosylated form of SAP was significantly

elevated in mice with early prion infection and it could be potentially used as a diagnostic

biomarker for prion diseases

Membrane proteins

Membrane proteins play an indispensable role in maintaining cellular integrity of their

structure and perform many important functions including signaling transduction intercellular

communication vesicle trafficking ion transport and protein translocationintegration85

However due to being relatively insoluble in water and low abundance it is challenging to

analyze membrane proteins by traditional MS-based proteomics approaches Numerous efforts

have been made to improve the solubility and enrichment of membrane proteins during sample

preparation Several comprehensive studies recently covered the commonly used technologies in

27

membrane proteomics and different strategies that circumvent technical issues specific to the

membrane 86-90

Recently Sun et al reported using 1-butyl-3-methyl imidazolium

tetrafluoroborate (BMIM BF4) an ionic liquid (IL) as a sample preparation buffer for the

analysis of integral membrane proteins (IMPs) by microcolumn reversed phase liquid

chromatography (μRPLC)-electrospray ionization tandem mass spectrometry (ESI-MSMS)

The authors compared BMIM BF4 to the other commonly used solvents such as sodium dodecyl

sulfate methanol Rapigest and urea but they found that the number of identified IMPs from rat

brain extracted by ILs was significantly increased The improved identifications could be due to

the fact that BMIM BF4 has higher thermal stability and thus offered higher solubilizing ability

for IMPs which provided better compatibility for tryptic digestion than traditionally used solvent

systems38

In addition to characterization of membrane proteome the investigation of PTMs on

membrane proteins is equally important for characterization of disease markers and drug

treatment targets Phosphorylations and glycosylations are the two most important PTMs for

membrane proteins In many membrane protein receptors the cytoplasmic domains can be

phosphorylated reversibly and function as signal transducers whereas the receptor activities of

the extracellular domains are mediated via N-linked glycosylation Wiśniewski et al provides an

informative summary on recent advances in proteomic technology for the identification and

characterization of these modifications91

Our group has pioneered the development of detergent

assisted lectin affinity chromatography (DALAC) for the enrichment of hydrophobic

glycoproteins using mouse brain extract92

We compared the binding efficiency of lectin affinity

chromatography in the presence of four commonly used detergents and determined that under

certain concentrations detergents can minimize the nonspecific bindings and facilitate the

elution of hydrophobic glycoproteins In summary NP-40 was suggested as the most suitable

28

detergent for DALAC due to the higher membrane protein recovery glycoprotein recovery and

membranous glycoprotein identifications compared to other detergents tested In a different

study on mouse brain membrane proteome Zhang et al reported an optimized protocol using

electrostatic repulsion hydrophilic interaction chromatography (ERLIC) for the simultaneous

enrichment of glyco- and phosphopeptides from mouse brain membrane protein preparation93

Using this protocol they successfully identified 544 unique glycoproteins and 922 glycosylation

sites which were significantly higher than those using the hydrazide chemistry method

Additionally a total of 383 phosphoproteins and 915 phosphorylation sites were identified

suggesting that the ERLIC separation has the potential for simultaneous analysis of both glyco-

and phosphoproteomes

Peptidomics

Peptidomics can be loosely defined as the study of the low molecular weight fraction of

proteins encompassing biologically active endogenous peptides protein fragments from

endogenous protein degradation products or other small proteins such as cytokines and signaling

peptides Studies can involve endogenous peptides94

peptidomic profiling33

and de novo

sequencing of peptides95 96

Neuropeptidomics focuses on biologically active short segments of

peptides and have been investigated in numerous species including Rattus97 98

Mus musculus99

100 Bovine taurus

101 Japanese quail diencephalon

102 and invertebrates

103-106 The isolation of

peptides is typically performed through molecular weight cut-offs from either biofluids such as

CSF plasma or tissue extracts If the protein and peptide content is high such as for tissue or cell

lysates protein precipitation can be done via high organic solvents and the resulting supernatant

can be analyzed for extracted peptides where extraction solvent and conditions could have a

29

significant effect on what endogenous peptides are extracted from tissue107

A comparative

peptidomic study of human cell lines highlights the utility of finding peptide signatures as

potential biomarkers108

A thorough review of endogenous peptides and neuropeptides is beyond

the scope of this review and an excellent review on this topic is available elsewhere109

Fractionation and Separation

The mass spectrometer has a limited duty cycle and data dependent analysis can only

scan a limited number of mz peaks at any given time In addition significant ion suppression

can occur if there is a difference in concentration between co-eluting peptides or if too many

peptides co-elute Therefore one of the biggest challenges in biomarker discovery is the

complexity of the sample and the presence of high-abundance proteins in body fluids such as

CSF serum and plasma In addition to the removal of the most abundant proteins by

immunodepletion the reduction of the complexity of the sample by further fractionation is

indispensable to facilitate the characterization of unidentified biomarkers from the low

abundance proteins Traditionally used techniques for complex protein analysis include gel

based fractionation methods such as two-dimensional gel electrophoresis (2D-GE) and its

variation two-dimensional differential gel electrophoresis (2D-DIGE) or non-gel based such as

one- or multidimensional liquid chromatography (LC) and microscale separation techniques

such as capillary electrophoresis (CE)

2D-GE MS has been widely used as a powerful tool to separate proteins and identify

differentially expressed proteins ever since 2D gels were coupled to mass spectrometry In 2D-

GE MS thousands of proteins can be separated on a single gel according to pI and molecular

weight Individual protein spots that show differences in abundance between different samples

30

can then be excised from the gel digested into peptides and analyzed by MALDI MS or by

liquid chromatography tandem mass spectrometry (LC-MSMS) for protein identification The

introduction of 2D-DIGE adds a quantitative strategy to gel electrophoresis by enabling multiple

protein extracts to be separated on the same 2D gel thus providing comparative analysis of

proteomes in complex samples In 2D-DIGE protein extracts from two different conditions and

an internal standard can be labeled with fluorescent dyes for example Cy3 Cy5 and Cy2

respectively prior to two-dimensional gel electrophoresis Compared to traditional 2D-GE 2D-

DIGE provides the clear advantage of overcoming the inter-gel variation problem 110

Proteomic

profiling of CSF by 2D-GE and 2D-DIGE has led to the identification of putative biomarkers in

multiple neurological disorders For example Brechlin et al reported an optimized 2-DIGE

protocol profiled CSF from 36 CJD patients The applicability of their approach was proven by

the detection of known CJD biomarkers such as 14-3-3 protein neuron-specific enolase lactate

dehydrogenase and other proteins that are potentially relevant to CJD 111

In another study to

identify novel CSF biomarkers for multiple sclerosis CSF from 112 multiple sclerosis patients

and control individuals were analyzed by 2D-GE MS for comparative proteomics Ten potential

multiple sclerosis biomarkers were selected for validation by immunoassay 112

These

methodologies sample preparation techniques and applications of 2D-DIGE in

neuroproteomics were reviewed by Diez et al113

Although 2D gel provides excellent resolving

power and capability to visualize abundance changes there are some limitations to the method

For example gel based separation is not suitable for low abundance proteins extremely basic or

acidic proteins very small or large proteins and hydrophobic proteins114 115

Complementary to gel-based approaches shotgun proteomics coupled to LC have

become increasingly popular in proteomic research because they are reproducible highly

31

automated and capable of detecting low abundance proteins Furthermore another advantage of

LC-MS shotgun proteomics is the suitability for isotope labeling for protein quantification which

is reviewed in a later section In shotgun proteomics a protein mixture is digested and resulting

peptides are separated by LC prior to tandem MS fragmentation to identify different proteins by

peptide sequencing The most common separation for shotgun proteomics peptidomics or top-

down proteomics experiments use low-pH reversed phase (RP) C18 C8 or C4 columns RPLC

is well established which provides high resolution desalts the sample which can interfere with

ionization and the mobile phase is compatible with ESI Nanoscale C18 columns allow for

separation and introduction of sub microgram samples If larger amounts of sample are

available two dimensional separations are usually preferred to greatly enhance the coverage of

the investigated proteome which will be discussed in depth later It is preferable to have an

orthogonal separation method and since RP separates via hydrophobicity strong cation exchange

(SCX) was the original choice due to its separation by charge MudPIT (multidimensional

protein identification technology) usually refers to the use of SCX as the first phase of separation

and is a well-established platform116

SCX has the advantage over RP separation technologies to

effectively remove interfering detergents from the sample SCX separation is not based solely

off charge and hydrophobicity contributes to elution therefore a small amount of organic

modifier usually 10-15 is added to lessen the hydrophobicity effects117

The addition of

organic modifiers needs to be minimized otherwise binding to the C18 trap cartridge or C18

column will be reduced if performed on-line SCX can be used for PTMs and offers specific

applications for proteomic studies and an excellent current review is offered on this subject

elsewhere118

An alternative MudPIT separation scheme employing high pH RPLC as the first

phase of separation and low pH RPLC in the second dimension (RP-RP) has been successfully

32

applied to the proteomic analysis of complex biological samples119 120

The advantage of using

RP as the first dimension is the higher resolution for separation and better compatibility with

down-stream MS detection by eliminating salt Song et al reported a phosphoproteome analysis

based on this 2D RP-RP coupling scheme121

Hydrophilic interaction chromatography (HILIC) employs distinct separation modality

where the retention of peptides is increased with increasing polarity122

The loading of sample is

done by high organic and eluted by increasing the percentage of the aqueous phase or polarity of

the mobile phase opposite from RPLC thus establishing orthogonality of the two separation

modes123

HILIC has quickly become a very useful method and is actively used for proteomic

experiments124

for increased sensitivity125

phosphoproteomics126

glycoproteins127

and

quantification studies128

An alternative and modification to HILIC is ERLIC which adds an

additional mode of separation by electrostatic attraction An earlier study using ERLIC

demonstrated the ability to separate phosphopeptides from non-phosphorylated peptides at

pH=2129

A recent study looking into changes in the phosphoproteome of Marekrsquos Disease

applied ERLIC to chicken embryonic fibroblast lysate identifying only 13 phosphopeptides

out of all the identified peptides Due to the lack of isolation of phosphopeptides from ERLIC

the investigators performed immobilized metal affinity chromatography (IMAC) enrichment on

the fractions increasing identification of phosphopeptides over 50 fold130

A comparative study

of ERLIC to HILIC and SCX following TiO2 phospho-enrichment reported that

SCXgtERLICgtHILIC for phosphopeptide identifications126

Recent developments in instrumentation to combine LC with ion mobility spectrometry

(IMS) and MS (LC-IMS-MS) offered more advantages than conventional LC due to the rapid

high-resolution separations of analytes based on their charge mass and shape as reflected by

33

mobility in a given buffer gas The mobility of an ion in a buffer gas is determined by the ionrsquos

charge and its collision cross-section with the buffer gas The methodologies of IMS separations

and the application of LC-IMS-MS for the proteomics analysis of complex systems including

human plasma have been reviewed by Clemmerrsquos group131-133

They proposed a method that

employs intrinsic amino acid size parameters to obtain ion mobility predictions which can be

used to rank candidate peptide ion assignments and significantly improve peptide identification

134

Although 2D gel and LC are routinely used as separation techniques in MS-based

proteomics capillary electrophoresis (CE) has received increasing attention as a promising

alternative due to the fast and high-resolution separation it offers CE has a wide variety of

operation modes among which capillary zone electrophoresis (CZE) and capillary isoelectric

focusing (CIEF) have the greatest potential applications in MS-based proteomics thus will be

highlighted here CZE separates analytes by their charge-to-size ratios in buffers under a high

electrical field and is often used as the final dimension prior to MS analysis while the separation

feature of CIEF is based on isoelectric point and this technique is more suitable to be used as the

first dimension separation Detailed description of different CEndashMS interfaces sample

preconcentration and capillary coating to minimize analyte adsorption could be found in several

reviews135-141

CE technique is complementary to conventional LC in that it is suitable for the

analysis of polar and chargeable compounds Dovichirsquos group conducted proteomic analysis of

the secreted protein fraction of Mycobacterium marinum which has intermediate protein

complexity142

The tryptic digests were either analyzed by UPLC-ESI-MSMS in triplicates or

prefactionated by RPLC followed by CZE-ESI-MSMS It was demonstrated that the two

methods identified similar numbers of peptides and proteins within similar analysis times

34

However CZE-ESI-MSMS analysis of the prefractionated sample tended to identify more

peptides that are basic and have lower mz values than those identified by UPLC-ESI-MSMS

This analysis also presented the largest number of protein identifications by using CE-MSMS

suggesting the effectiveness of prefractionation of complex samples by LC method prior to CZE-

ESI-MSMS The use of CIEF as the first dimension of separation provides both sample

concentration and excellent resolving power The combination of CIEF and RPLC separation

has been applied to the proteomic analyses where the amount of protein sample is limited and

cannot meet the requirement of minimal load amount for 2D LC-MSMS143 144

So far CE-MS

has been widely applied to the proteomic analysis of various biological samples such as urine145

146 CSF

147 blood

148 frozen tissues

149 and the formalin-fixed and paraffin-embedded (FFPE)

tissue samples150

The recent CEndashMS applications to clinical proteomics have been summarized

in several reviews135 151 152

Protein Quantification

In 2D gel electrophoresis the quantitative analysis of protein mixtures is performed on

the gel by comparing the intensity of the protein stain The development of 2D-DIGE eliminated

the gel-to-gel variation and greatly improved the quantitative capability and reliability of 2D gel

methodology110

However the accuracy of 2D gel based protein quantification suffers from the

limitations that a seemingly single gel spot often contains multiple proteins and the difficulty of

detecting proteins with extreme molecular weights and pI values as well as highly hydrophobic

proteins such as membrane proteins Therefore non-gel based shotgun proteomics technology is

more suitable for accurate and large-scale protein identification and quantification in complex

samples Briefly the quantification in non-gel based shotgun proteomics can be categorized into

35

two major approaches stable isotope labeling-based and label-free methods The common

strategies for quantitative proteomic analysis are reviewed and summarized in Table 1

Isotope labeling methods

Because stable isotope-labeled peptides have the same chemical properties as their

unlabeled counterparts the two peptides within a mixture should exhibit identical behaviors in

MS ionization The mass difference introduced by isotope labeling enables the detection of a

pair of two distinct peptide masses by MS within the mixture and allowing for the measurement

of the relative abundance differences between two peptides Depending on how isotopes are

incorporated into the protein or peptide these labeling methods can be divided into two groups

In vitro chemical derivatization techniques which incorporate a label or tag into the peptide or

protein during sample preparation metabolic labeling techniques which introduce the isotope

label directly into the organism via isotope-enriched nutrients from food or media

1 In vitro derivatization techniques

There are multiple methods to introduce heavy isotopes into proteins or peptides in vitro

The commonly used strategies include 18

O 16

O enzymatic labeling Isotope-Coded Affinity Tag

(ICAT) Tandem Mass Tags (TMTs) and Isobaric Tags for Relative and Absolute Quantification

(iTRAQ) The 18

O labeling method enzymatically cleaves the peptide bond with trypsin in the

presence of 18

O-enriched H2O and introduces 4-Da mass shift in the tryptic peptides153

The

advantages of this method include 18

O-enriched water is extremely stable tryptic peptides will

be labeled with the same mass shift secondary reactions inherent to other chemical labeling can

be avoided Conversely widespread use of 18

O-labeling has been hindered due to the difficulty

of attaining complete 18

O incorporation and the lack of robustness154 155

Currently ICAT

36

TMTs and iTRAQ methods are extensively used in quantitative proteomics In ICAT cysteine

residues are specifically derivatized with a reagent containing either zero or eight deuterium

atoms as well as a biotin group for affinity purification of cysteine-containing peptides156 157

The advantage of ICAT is that the affinity purification via biotin moiety can facilitate the

detection of low-abundance cysteine-containing peptides In addition the mass difference

introduced by labeling increases mass spectral complexity with quantification from the different

precursor masses done by MS and peptide identification being achieved through tandem MS

(MSMS) This added complexity from different peptide masses was addressed by using isobaric

labeling methods such as TMTs and iTRAQ 158 159

where the same peptides in different samples

are isobaric after tagging and appear as single mz in MS scans thus enhancing the peptide limit

of detection and reducing the MS scan complexity Isobaric labeling reagents are composed of a

primary amine reactive group and an isotopic reporter group linked by an isotopic balancer group

for the normalization of the total mass of the tags The reporter group serves for quantification

purpose since it is cleaved during collision-induced dissociation (CID) to yield a characteristic

isotope-encoded fragment Moreover isobaric labeling methods allow the comparison of

multiple samples within a single experiment Recently a 6-plex version of TMTs was

reported160

and iTRAQ enables up to eight samples to be labeled and relatively quantified in a

single experiment161

8-plex iTRAQ reagents have been used for the comparison of complicated

biological samples such as CSF in the studies of neurodegenerative diseases 162

Recently our

group developed a novel N N-dimethyl leucine (DiLeu) 4-plex isobaric tandem mass (MS2)

tagging reagents with high quantitation efficacy DiLeu has the advantage of synthetic simplicity

and greatly reduced synthesis cost compared to TMTs and iTRAQ163

Xiang et al demonstrated

that DiLeu produced comparable iTRAQ ability for protein sequence coverage (~43) and

37

quantitation accuracy (lt15) for tryptically digested proteins More importantly DiLeu

reagents could promote enhanced fragmentation of labeled peptides thus allowing more

confident peptide and protein identifications

2 In Vivo Metabolic Labeling

Metabolic processes can also be employed for the incorporation of stable-isotope labels

into the proteins or organisms by enriching culture media or food with light or heavy versions of

isotope labels (2H

13C

15N) The advantage of in vivo labeling is that metabolic labeling does

not suffer from incomplete labeling which is an inherent drawback for in vitro derivatization

techniques In addition metabolic labeling occurs from the start of the experiment and proteins

with light or heavy labels are simultaneously extracted thus reducing the error and variability of

quantification introduced during sample preparation The most widely used strategy for

metabolic labeling is known as stable-isotope labeling of amino acids in cell culture (SILAC)

which was introduced by Mann and co-workers164 165

In SILAC one cell population is grown

in normal or light media while the other is grown in heavy media enriched with a heavy

isotope-encoded (typically 13

C or 15

N) amino acid such as arginine or leucine Cells from the

two populations are then combined proteins are extracted digested and analyzed by MS The

relative protein expression differences are then determined from the extracted ion

chromatograms from both the light and heavy peptide forms SILAC has been shown to be a

powerful tool for the study of intracellular signal transduction In addition this technique has

recently been applied to the quantitative analysis of phosphotyrosine (pTyr) proteomes to

characterize pTyr-dependent signaling pathways166 167

38

Labe-free quantification

Although various isotope labeling methods have provided powerful tools for quantitative

proteomics several limitations of these approaches are noted Labeling increases the cost and

complexity of sample preparation introduces potential errors during the labeling reaction It also

requires a higher sample concentration and complicates data processing and interpretation In

addition so far only TMTs and iTRAQ allow the comparison of multiple (up to eight) samples

simultaneously The comparison of more than eight samples in a single experiment cannot be

achieved by isotope labeling In order to address these concerns there has been significant

interest in the development of label-free quantitative approaches Current label-free

quantification methods for MS-based proteomics were developed based on the observation that

the chromatographic peak area of a peptide168 169

or frequency of MSMS spectra170

correlating

to the protein or peptide concentration Therefore the two most common label-free

quantification approaches are conducted by comparing (i) area under the curve (AUC) of any

given peptides171 172

or (ii) by frequency measurements of MSMS spectra assigned to a protein

commonly referred to as spectral counting173

Several recent reviews provided detailed and

comprehensive knowledge comparing label-free methods with labeling methods data processing

and commercially available software for label-free quantitative proteomics174-177

Dissociation Techniques

The vast majority of proteomic experiments have proteins or peptides being identified by

two critical pieces of data obtained from the mass spectrometer The first is the precursor ion

identified by its mz which is informative to the mass of the peptide being analyzed The second

is the use of tandem mass spectrometry to fragment or dissociate the precursor ion and record the

39

generated fragment ion pattern to discern the amino acid sequence The three most popular

dissociation or fragmentation techniques for peptides are CID electron-transfer dissociation

(ETD) and high-energy collision dissociation (HCD) A recent study on the human plasma

proteome demonstrated that combined fragmentation techniques enhance coverage by providing

complementary information for identifications CID enabled the greatest number of protein

identifications while HCD identified an additional 25 proteins and ETD contributed an

additional 13 protein identifications178

ETDECD

Electron capture dissociation (ECD) 179

preceded ETD but ECD was developed for use

in a Penning trap for Fourier transform ion cyclotron resonance (FTICR) mass spectrometers

ECD requires the ion of interest to be in contact with near-thermal electrons and for the electron

capture event to occur on the millisecond time scale but the time scale is inadequate for electron

trapping in Paul traps or quadrupoles in the majority of mass spectrometers180

ETD involves a

radical anion like fluoranthene with low electron affinity to be transferred to peptide cation

which results in more uniform cleavage along the peptide backbone The cation accepts an

electron and the newly formed odd-electron protonated peptide undergoes fragmentation by

cleavage of the N-Cα bond which results in fragmentation ions consisting of c- and zbull-type

product ions The uniformed cleavage results in reduced sequence discrimination to labile bonds

such as PTMs and also provides improved sequencing for larger peptides compared to CID181

The realization that larger peptides produced better MSMS quality spectra compared to CID led

to a decision tree analysis strategy where peptide charge states and size determined whether the

precursor peptide would be fragmented with CID or ETD182

One of the main benefits of

ETDECD is the ability to sequence peptides with labile PTMs such as phosphorylation77 183

40

sulfation184

glycosylation185

ubiquitination186

and histone modifications187

ETD also has the

benefit of providing better sequence information on larger neuropeptides when compared to

CID188

However a thorough analysis suggested that CID still yielded more peptideprotein

identifications than ETD in large scale proteoimcs189

HCD

High energy collision dissociation (HCD)190

is an emerging fragmentation technique that

offers improved detection of small reporter ions from iTRAQ-based studies191 192

HCD is

performed at a higher energy in a collision cell instead of an ion trap like CID thus HCD does

not suffer from the low-mass cutoff limitation Furthermore HCD offers enhanced

fragmentation efficiency assisting in MSMS spectra interpretation and protein identification193

A major drawback for HCD is that the spectral acquisition times are up to two-fold longer due to

increased ion requirement for Fourier transform detection in the orbitrap194

HCD has been

reported to increase phosphopeptide identifications over CID74

but in a different study CID was

reported to offer more phosphopeptide identifications over HCD194

Work has also been done to

transfer the decision tree analysis for HCD which basically switches CID with HCD claiming

better quality data determined by higher Mascot scores with more peptide identifications195

MSE

Data dependent acquisition (DDA) is the most commonly used ion selection process in

mass spectrometers for proteomic experiments An alternative process which does not have ion

selection nor switch between MS and MSMS modes is termed MSE MS

E is a data independent

mode and does not require precursor ions of a significant intensity to be selected for MSMS

analysis196

A data independent mode decouples the mass spectrometer choosing which

precursor ions to fragment and when the ions are fragmented MSE works by a low or high

41

energy scan and no ion isolation is occurring The low energy scan is where the precursor ion is

not fragmented and the high energy scan allows fragmentation The resulting mix of precursor

and fragmentation ions is then detected simultaneously197

The data will then need to be

deconvoluted using a proprietary time-aligned algorithm that is discussed elsewhere198

The

continuous data independent acquisition allows multiple MSMS spectra to be collected during

the natural analyte peak broadening observed in chromatography which provides more data

points for AUC label-free quantification In addition lower abundance peptides can be

sequenced as more MSMS spectra are collected throughout the elution of an LC peak allowing

better signal averaging for smaller analyte peak of interest during coelution and reducing

sampling bias in typical DDA experiments where only more abundant peaks can be selected for

fragmentation

A comparison of spiked internal protein standards into a complex protein digest provided

evidence that MSE was comparable to DDA analysis in LC-MS

199 MS

E has been used for label

free proteomics of immunodepleted serum in large scale proteomics samples200

In addition

MSE was performed for the characterization of human cerebellum and primary visual cortex

proteomes Hundreds of proteins were identified including many previously reported in

neurological disorders201

MSE is quickly becoming a versatile data acquisition method recently

used in such studies as cancer cells202

schizophrenia203

and pituitary proteome discovery204

The usefulness of MSE as an unbiased data acquisition method is being assimilated into multiple

proteomics studies including studies involving neurological disorders

Data Analysis

42

One of the major bottlenecks in non-targeted proteomic experiments is how to handle the

enormous amount of data obtained Database searches biostatistical analysis de novo

sequencing PTM validation all have their place and multiple available platforms are available

If the organism being studied has had its genome sequenced databases can be created

with a list of proteins in the FASTA format to be used in database searching There are

numerous database searching algorithms for sequence identification of MSMS data including

Mascot205

Sequest206

Xtandem207

OMSSA208

and PEAKS209

These searching algorithms are

performed by matching MSMS spectra and precursor mass to sequences found within proteins

How well the actual spectra match the theoretical spectra determines a score which is unique to

the searching algorithm and usually can be extrapolated to the probability of a random hit

Recently a database has been developed for PTM analysis by the use of the program SIMS210

Specifically for phosphopeptides Ascorersquos algorithm scans the MSMS data to determine the

likelihood of correct phosphosite identification from the presence of site identifying product

ions211

If the organism that is being analyzed has not had its genome sequenced and no (or very

limited) FASTA database is available a homology search can be performed using SPIDER212

available with PEAKS software Alternatively individual MSMS spectrum can be de novo

sequenced but software is available to perform automated de novo sequencing of numerous

spectra (PEAKS208

DeNovoX and PepSeq)

For large-scale protein identifications the false discovery rate (FDR) must be established

by the searching algorithm and that is accomplished by re-searching the data with a false

database created by reversing or scrambling the amino acid sequence of the original database

used for the protein search Any hits from the false database will contribute to the FDR and this

value can be adjusted usually around 1 An additional layer of confidence in the obtained data

43

can be achieved in shotgun proteomics experiments by removing all the proteins that are

identified by only one peptide

Once a set of confident proteins or peptides have been generated from database

searching bioinformatic analysis or biostatistical analysis is needed Numerous software

packages are available for different purposes FLEXIQuant is an example for absolute

quantitation of isotopically labeled protein or peptides of interest213

FDR analysis of

phosphopeptides or other specific PTMs can be adjusted with such software as Scaffold

providing data consisting only of a specific modification214

Bioinformatic tools such as

Scaffold or ProteoIQ also include gene ontology (GO) analysis which can classify identified

proteins by three categories cellular component molecular function or biological process

Custom bioinformatics programs can also be developed and are often useful in various proteomic

studies including biomarker discovery in neurological diseases215

More detailed review of

bioinformatics in peptidomics216

and proteomics217

can be found elsewhere

Validation of Biomarkers by Targeted Proteomics

The validation of putative biomarkers identified by MS-based proteomic analysis is often

required to provide orthogonal analysis to rule out a false positive by MS and providing

additional evidence for the biomarker candidate(s) from the study for future potential clinical

assays At present antibody-based assays such as Western blotting ELISA and

immunochemistry are the most widely used methods for biomarker validation Although accurate

and well established these methods rely on protein specific antibodies for the measurement of

the putative biomarker and could be difficult for large-scale validation of all or even a subset of a

long list of putative protein biomarkers typically obtained by MS-based comparative proteomic

44

analysis Large scale validation is impractical due to the cost for each antibody the labor to

develop a publishable Western blot or ELISA and the antibody availability for certain proteins

As an alternative strategy quantitative assays based on multiple-reaction monitoring (MRM) MS

using a triple quadrupole mass spectrometer have been employed in biomarker verification

MRM is the most common use of MSMS for absolute quantitation It is a hypothesis

driven experiment where the peptide of interest and its subsequent fragmentation pattern must be

known prior to the quantitative MRM experiments MRM involves selecting a specific mz (first

quadrupole) to be isolated for fragmentation (second quadrupole) followed by one or more of

the most intense fragment ions (third quadrupole) being monitored The ability to quantitate and

thus validate the proteins or peptides as potential biomarkers is achieved by performing MRM on

isotopically labeled reference peptide for targeted peptideprotein of interest The main obstacle

for quantification of peptides is interference and ion suppression effects from co-eluting

substances Since the isotopically labeled and native peptide will co-elute the same interference

and ion suppression will occur for both peptides and thus correcting these interfering effects

Peptides need to be systematically chosen for a highly sensitive and reproducible MRM

experiment to ensure proper validation of putative biomarkers Peptides require certain intrinsic

properties which include an mz within the practical mass detection range for the instrument and

high ionization efficiency If the desired peptide to be quantified is derived from a digestion

then peptides that have detectable incomplete digestion or missed cleavage site can be a major

source of variability Peptides with a methionine and to a lesser extent tryptophan are

traditionally removed from consideration from MRM quantitative experiments due to the

variable nature of the oxidation that can occur In addition if chromatographic separation is

performed the retention behavior of the peptide must be well behaved with little tailing effects

45

eluting late causing broadening of the peak and even irreversible binding to the column As an

example hydrophilic peptides being eluted off a C18 column may exhibit the previously

described concerns and a different chromatographic separation will need to be explored for

improved limits of detection quantitation and validation To determine consistent peptide

detection or usefulness of certain peptides databases such as Proteomics Database218

PRIDE219

PeptideAtlas220

have been developed to compile proteomic data repositories from initial

discovery experiments

After the peptide is selected for analysis the proper MRM transitions need to be selected

to optimize the sensitivity and selectivity of the experiment It is common for investigators to

select two or three of the most intense transitions for the proposed experiment It is imperative

that the same instrument is used for the determination of transition ions as different mass

spectrometers may have a bias towards different fragment ions

MRM experiments are still highly popular experiments for hypothesis directed

experiments221

biomarker analysis222

and validation223

Validation of putative biomarkers is

increasingly becoming a necessary step when performing large scale non-hypothesis driven

proteomics experiments The traditional validation techniques of ELISA Western blotting and

immunohistochemistry are still used but MRM experiments are becoming an attractive

alternative for validation of putative biomarkers due to its enhanced throughput and specificity

Current work is still being performed to both expand the linear dynamic range224

and

sensitivity225

of MRM A recent endeavor to increase the sensitivity for MRM experiments was

accomplished by ldquoPulsed MRMrdquo via the use of an ion funnel trap to enhance confinement and

accumulation of ions The authors claimed an increase by 5-fold for peak amplitude and a 2-3

fold reduction in chemical background225

46

Remaining Challenges and Emerging Technologies

Large sample numbers for mass spectrometry analysis

Multiple conventional studies in proteomics have been performed on a single or a few

biological samples As bio-variability can be exceedingly high the need for larger sample sizes

is currently being investigated Prentice et al used a starting point of 3200 patient samples

from the Womenrsquos Health Institute (WHI) to probe the plasma proteome using MS for

biomarkers The study did not test the 3200 patient samples by MS because even a simple one

hour one dimensional RP analysis on a mass spectrometer would take months of instrument time

for uninterrupted analysis Instead the authors pooled 100 samples together to bring the total

number of pooled samples to 32 To provide relevant plasma biomarkers the samples were then

subjected to immunodepletion 2-D protein separation (96 fractions total) and then 1-D RPLC of

tryptic peptide separation on-line interface to a mass spectrometer The large sample cohorts

help address bio-variability that can be a concern from small sample size proteomic experiments

and provide ample sample amounts to investigate the low abundance proteins226

Hemoglobin-derived neuropeptides and non-classical neuropeptides

Neuropeptides such as neuropeptide Y and enkephalin are short chains of amino acids

that are secreted from a range of neuronal cells that signal nearby cells In contrast non-classical

neuropeptides are termed as neuropeptides or ldquomicroproteinsrdquo which are derived from

intracellular protein fragments and synthesized from the cytosol227

MS was recently used to

determine that hemopressins which are hemoglobin-derived peptides are upregulated in Cpefatfat

mice brains Gelman et al designed an MS experiment to compare hemoglobin-derived

47

peptides comparing the brain blood and heart peptidome in mice The authors provided data

that specific hemoglobin peptides were produced in the brain and were not produced in the

blood Certain alpha and beta hemoglobin peptides were also up regulated in the brain for

Cpefatfat

mice and bind to CB1 cannabinoid receptors228

As discussed earlier in the review

peptidomics and specifically neuropeptidomics are popular fields of study utilizing MS and non-

classical neuropeptides is an exciting emerging area of research that could further expand the

diversity of cell-cell signaling molecules

Ultrasensitive mass spectrometry for single cell analysis

In addition to large scale analysis MS-based proteomics and peptidomics are making

progress into ultrasensitive single cell analysis The most successful MS-based techniques for

single cell analysis was performed with MALDI and studies that have been performed on

relatively large neurons are reviewed elsewhere229

The ultrasensitive MS analysis is currently

directed towards single cell analysis of smaller cells including cancer cells The first challenge

in single cell analysis is the isolation and further sample preparation to yield relevant data

Collection and isolation of a cell type can be accomplished using antibodies for fluorescence

activated cell sorting (FACS) and immune magnetic separation FACS works by flow cytometry

sorting cells by a laser that excites a fluorescent tag that is attached to an antibody Immune

magnetic separation allows separation by antibodies with magnetic properties such as

Dynabeads230

One exciting study combining FACS and MS termed mass cytometry This

technology works by infusing a droplet into an inductively coupled plasma mass spectrometer

(ICP-MS) containing a single cell bound to antibodies chelated to transition elements allowing a

quantifying response between single cells231

Clearly the future of single cell analysis for

48

biomarker analysis and proteomics is encouraging and has the potential to be an emerging field

in MS-based proteomics and peptidomics

Laserspray ionization (LSI)

Laserspray ionization (LSI) is an exciting new method to produce multiply charged mass

spectra from MALDI that is nearly identical to ESI232-234

Recently it has been reported that LSI

can be performed in lieu of matrix to produce a total solvent-free analysis234

The benefits of

being able to generate multiply charged peptides without any solvent may offer advantages

including MS analysis of insoluble membrane proteins or hydrophobic peptides avoidance of

chemical reactions while in solvents a reduction of sample loss due to liquid sample preparation

and ability to avoid diffusion effects from tissue imaging studies234

The multiply charged peptide and protein ions produced by LSI expand the mass range

for tissue imaging analysis More importantly the multiply-charged peptide ions are amenable

for electron-based fragmentation methods such as ETD or ECD which can be employed in

conjunction with tissue imaging experiments to yield in situ sequencing and identification of

peptides of interest235

Paper spray ionization

Paper spray (PS) is an ambient ionization method which was first reported using

chromatography paper allowing detection of metabolites from dried blood spots The original

method used a cut out piece of paper with a voltage clipped on the back while applying 10 microL of

methanolH2O236

Improvements have been made to this technology to enhance analysis

efficiency with a new solvent 91 dichloromethaneisopropanol (vv) and the use of silica paper

49

over chromatography paper237

Interesting applications or modifications have been made to PS

including direct analysis of biological tissue238

and leaf spray for direct analysis of plant

materials239

but both detect metabolites instead of proteins or peptides Paper spray ionization

was previously shown to enable detection of cytochrome c and bradykinin [2-9] standards in a

proof of principle study240

Clearly the utility of PS analysis in proteomics and peptidomics is

yet to be explored

niECD

New fragmentation techniques have been investigated for their utility in proteomics and

peptidomics including a recently reported negative-ion electron capture dissociation (niECD)

Acidic peptides which usually contain PTMs such as phosphorylation or sulfonation are often

difficult to be detected as multiply charged peptides in the positive ion mode As discussed

earlier multiply charged peptides are required for ECDETD fragmentation The fragmentation

of niECD is accomplished by a multiply negatively charged peptide adding an electron The

resulting fragmentation of multiply sulfated and phosphorylated peptide and protein standards

showed no sulfate loss and preserved phosphorylation site The resulting fragmentation pattern

from niECD was also improved in the peptide anions and provides a new strategy for de novo

sequencing with PTM localization241

Conclusions and Perspectives

Proteomics methodologies have produced large datasets of proteins involved in various

biological and disease progression processes Numerous mass spectrometry-based proteomics

and peptidomics tools have been developed and are continuously being improved in both

50

chromatographic or electrophoretic separation and MS hardware and software However several

important issues that remain to be addressed rely on further technical advances in proteomics

analysis When large proteomes consisting of thousands of proteins are analyzed and quantified

dynamic range is still limited with more abundant proteins being preferentially detected

Development and optimization of chemical tagging reagents that target specific protein classes

maybe necessary to help enrich important signaling proteins and assess cellular and molecular

heterogeneity of the proteome and peptidome Furthermore a significant bottleneck in

usefulness of proteomics research is the ability to validate the results and provide clear

significant biological relevance to the results The idea of P4 medicine242 243

is an attractive

concept where the four Prsquos stand for predictive preventive personalized and participatory

Proteomics is one of the critical ldquoomicsrdquo fields and has led to the development of enabling

innovative strategies to P4 medicine244

A goal of P4 medicine is to assess both early disease

detection and disease progression in a person A simplified example of how proteomics fits into

P4 medicine is that certain brain-specific proteins could be used for diagnosis with

presymptomatic prion disease244

The concept of proteomic experiments providing an individual

biomarker is becoming more obsolete with the revised vision being a biomolecular barcode that

could potentially be ldquoscannedrdquo or be a fingerprint for a specific disease or early onset to that

disease being closer to reality An excellent review on what biomarker analysis can do for true

patients is available245

Proteomics can also generate new hypothesis that can be tested by classical biochemical

approaches If a disease has an unknown pathogenesis proteomics is a good starting point to try

to assemble putative markers that can lead to further hypothesis for evaluation If a particular

protein or PTM is associated with a disease state either qualitatively or quantitatively potential

51

treatments could target that protein of interest or investigators could monitor that protein or

PTM during potential treatments of the disease Proteomics has expanded greatly over the last

few decades with the goal of providing revealing insights to some of the most complex

biological problems currently facing the scientific community

Acknowledgements

Preparation of this manuscript was supported in part by the University of Wisconsin Graduate

School Wisconsin Alzheimerrsquos Disease Research Center Pilot Grant and a Department of

Defense Pilot Award LL acknowledges an H I Romnes Faculty Fellowship

52

Figure 1 A summary of general workflows of biomarker discovery pipeline by MS-based

proteomic approaches

Biological sample (CSF blood urine saliva cell

lysate tissue homogenates secreted proteins etc)

Protein extraction Sample pretreatment

2D-GE2D-DIGE MS 1D or 2D LC-MSMS

MALDI-IMS

Identification of

differentially

expressed proteins

Protein identification

Potential biomarkers

Biomarker validation

- Antibody based immunoassays

- MRM

Quantitative analysis

- Isotope labeling

- Label free

Identification and

localization of

differentially expressed

biomolecules

Intact tissue

Sample preparation Slice frozen tissues

thaw-mounted on plate

Apply Matrix

53

Figure 2 Tissue expression and dynamic range of the human plasma proteome A pie chart

representing the tissue of origin for the high abundance proteins shows that the majority of

proteins come from the liver (A) Conversely the lower abundance plasma proteins have a much

more diverse tissue of origin (B and C) The large dynamic range of plasma proteins is presented

and the proteins can be grouped into three categories (classical plasma proteins tissue leakage

products interleukinscytokines) (D) Adapted from Zhang et al11

and Schiess et al246

with

permission

54

55

Table 1 A summary of the common strategies applied to MS-based quantitative proteomic

analysis

Gel based Stable isotope labeling Label free

2D-GE

2D-DIGE 110

In vitro derivatization

18O

16O

153

ICAT 156

TMT 159

iTRAQ 158

Formaldehyde 247

ICPL 248

In vivo metabolic labeling

14N

15N

249

SILAC 164

AUC measurement 169 172

Spectral counting 173

AUC Area Under Curve ICAT Isotope-Coded Affinity Tag TMT Tandem Mass Tags iTRAQ Isobaric Tags for

Relative and Absolute Quantification ICPL Isotope Coded Protein Labeling SILAC Stable Isotope Labeling by

Amino Acids in Cell Culture Isobaric Tags for Relative and Absolute Quantification (iTRAQ)

56

References

1 Liu T Qian W J Mottaz H M Gritsenko M A Norbeck A D Moore R J

Purvine S O Camp D G 2nd Smith R D Evaluation of multiprotein

immunoaffinity subtraction for plasma proteomics and candidate biomarker discovery

using mass spectrometry Mol Cell Proteomics 2006 5 (11) 2167-74

2 Holten-Andersen M N Murphy G Nielsen H J Pedersen A N Christensen I J

Hoyer-Hansen G Brunner N Stephens R W Quantitation of TIMP-1 in plasma of healthy

blood donors and patients with advanced cancer Br J Cancer 1999 80 (3-4) 495-503

3 Tammen H Schulte I Hess R Menzel C Kellmann M Mohring T Schulz-

Knappe P Peptidomic analysis of human blood specimens comparison between plasma

specimens and serum by differential peptide display Proteomics 2005 5 (13) 3414-22

4 Rai A J Gelfand C A Haywood B C Warunek D J Yi J Schuchard M D

Mehigh R J Cockrill S L Scott G B Tammen H Schulz-Knappe P Speicher D W

Vitzthum F Haab B B Siest G Chan D W HUPO Plasma Proteome Project specimen

collection and handling towards the standardization of parameters for plasma proteome samples

Proteomics 2005 5 (13) 3262-77

5 Lippi G Guidi G C Mattiuzzi C Plebani M Preanalytical variability the dark side

of the moon in laboratory testing Clin Chem Lab Med 2006 44 (4) 358-65

6 Holten-Andersen M N Schrohl A S Brunner N Nielsen H J Hogdall C K

Hogdall E V Evaluation of sample handling in relation to levels of tissue inhibitor of

metalloproteinases-1 measured in blood by immunoassay Int J Biol Markers 2003 18 (3) 170-

6

7 Ytting H Christensen I J Thiel S Jensenius J C Svendsen M N Nielsen L

Lottenburger T Nielsen H J Biological variation in circulating levels of mannan-binding

lectin (MBL) and MBL-associated serine protease-2 and the influence of age gender and

physical exercise Scand J Immunol 2007 66 (4) 458-64

8 Hanash S Building a foundation for the human proteome the role of the Human

Proteome Organization J Proteome Res 2004 3 (2) 197-9

9 Omenn G S States D J Adamski M Blackwell T W Menon R Hermjakob H

Apweiler R Haab B B Simpson R J Eddes J S Kapp E A Moritz R L Chan D W

Rai A J Admon A Aebersold R Eng J Hancock W S Hefta S A Meyer H Paik Y

K Yoo J S Ping P Pounds J Adkins J Qian X Wang R Wasinger V Wu C Y

Zhao X Zeng R Archakov A Tsugita A Beer I Pandey A Pisano M Andrews P

Tammen H Speicher D W Hanash S M Overview of the HUPO Plasma Proteome Project

results from the pilot phase with 35 collaborating laboratories and multiple analytical groups

generating a core dataset of 3020 proteins and a publicly-available database Proteomics 2005 5

(13) 3226-45

10 Liu T Qian W J Gritsenko M A Xiao W Moldawer L L Kaushal A Monroe

M E Varnum S M Moore R J Purvine S O Maier R V Davis R W Tompkins R

G Camp D G 2nd Smith R D High dynamic range characterization of the trauma patient

plasma proteome Mol Cell Proteomics 2006 5 (10) 1899-913

11 Zhang Q Faca V Hanash S Mining the plasma proteome for disease applications

across seven logs of protein abundance J Proteome Res 2011 10 (1) 46-50

12 Matsubara J Honda K Ono M Sekine S Tanaka Y Kobayashi M Jung G

Sakuma T Nakamori S Sata N Nagai H Ioka T Okusaka T Kosuge T Tsuchida A

57

Shimahara M Yasunami Y Chiba T Yamada T Identification of adipophilin as a potential

plasma biomarker for colorectal cancer using label-free quantitative mass spectrometry and

protein microarray Cancer Epidemiol Biomarkers Prev 2011 20 (10) 2195-203

13 Murakoshi Y Honda K Sasazuki S Ono M Negishi A Matsubara J Sakuma

T Kuwabara H Nakamori S Sata N Nagai H Ioka T Okusaka T Kosuge T

Shimahara M Yasunami Y Ino Y Tsuchida A Aoki T Tsugane S Yamada T Plasma

biomarker discovery and validation for colorectal cancer by quantitative shotgun mass

spectrometry and protein microarray Cancer Sci 2011 102 (3) 630-8

14 Addona T A Shi X Keshishian H Mani D R Burgess M Gillette M A

Clauser K R Shen D Lewis G D Farrell L A Fifer M A Sabatine M S Gerszten R

E Carr S A A pipeline that integrates the discovery and verification of plasma protein

biomarkers reveals candidate markers for cardiovascular disease Nat Biotechnol 2011 29 (7)

635-43

15 Acosta-Martin A E Panchaud A Chwastyniak M Dupont A Juthier F Gautier

C Jude B Amouyel P Goodlett D R Pinet F Quantitative mass spectrometry analysis

using PAcIFIC for the identification of plasma diagnostic biomarkers for abdominal aortic

aneurysm PLoS One 2011 6 (12) e28698

16 Ray S Britschgi M Herbert C Takeda-Uchimura Y Boxer A Blennow K

Friedman L F Galasko D R Jutel M Karydas A Kaye J A Leszek J Miller B L

Minthon L Quinn J F Rabinovici G D Robinson W H Sabbagh M N So Y T

Sparks D L Tabaton M Tinklenberg J Yesavage J A Tibshirani R Wyss-Coray T

Classification and prediction of clinical Alzheimers diagnosis based on plasma signaling

proteins Nat Med 2007 13 (11) 1359-62

17 Lopez M F Kuppusamy R Sarracino D A Prakash A Athanas M Krastins B

Rezai T Sutton J N Peterman S Nicolaides K Mass spectrometric discovery and selective

reaction monitoring (SRM) of putative protein biomarker candidates in first trimester Trisomy 21

maternal serum J Proteome Res 2011 10 (1) 133-42

18 Sheta E A Appel S H Goldknopf I L 2D gel blood serum biomarkers reveal

differential clinical proteomics of the neurodegenerative diseases Expert Rev Proteomics 2006

3 (1) 45-62

19 Zhang J Goodlett D R Montine T J Proteomic biomarker discovery in

cerebrospinal fluid for neurodegenerative diseases J Alzheimers Dis 2005 8 (4) 377-86

20 Yuan X Desiderio D M Proteomics analysis of prefractionated human lumbar

cerebrospinal fluid Proteomics 2005 5 (2) 541-50

21 Wenner B R Lovell M A Lynn B C Proteomic analysis of human ventricular

cerebrospinal fluid from neurologically normal elderly subjects using two-dimensional LC-

MSMS J Proteome Res 2004 3 (1) 97-103

22 Maccarrone G Milfay D Birg I Rosenhagen M Holsboer F Grimm R Bailey

J Zolotarjova N Turck C W Mining the human cerebrospinal fluid proteome by

immunodepletion and shotgun mass spectrometry Electrophoresis 2004 25 (14) 2402-12

23 Yuan X Desiderio D M Proteomics analysis of human cerebrospinal fluid J

Chromatogr B Analyt Technol Biomed Life Sci 2005 815 (1-2) 179-89

24 Wong M Schlaggar B L Buller R S Storch G A Landt M Cerebrospinal fluid

protein concentration in pediatric patients defining clinically relevant reference values Arch

Pediatr Adolesc Med 2000 154 (8) 827-31

58

25 Roche S Gabelle A Lehmann S Clinical proteomics of the cerebrospinal fluid

Towards the discovery of new biomarkers Proteomics Clin Appl 2008 2 (3) 428-36

26 Rozek W Ricardo-Dukelow M Holloway S Gendelman H E Wojna V

Melendez L M Ciborowski P Cerebrospinal fluid proteomic profiling of HIV-1-infected

patients with cognitive impairment J Proteome Res 2007 6 (11) 4189-99

27 Schutzer S E Liu T Natelson B H Angel T E Schepmoes A A Purvine S O

Hixson K K Lipton M S Camp D G Coyle P K Smith R D Bergquist J

Establishing the proteome of normal human cerebrospinal fluid PLoS One 2010 5 (6) e10980

28 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C

Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome

Res 2008 7 (1) 386-99

29 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from

patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma

biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 2010 878 (22) 2003-12

30 You J S Gelfanova V Knierman M D Witzmann F A Wang M Hale J E The

impact of blood contamination on the proteome of cerebrospinal fluid Proteomics 2005 5 (1)

290-6

31 Yuan X Desiderio D M Human cerebrospinal fluid peptidomics J Mass Spectrom

2005 40 (2) 176-81

32 Ringman J M Schulman H Becker C Jones T Bai Y Immermann F Cole G

Sokolow S Gylys K Geschwind D H Cummings J L Wan H I Proteomic changes in

cerebrospinal fluid of presymptomatic and affected persons carrying familial Alzheimer disease

mutations Arch Neurol 2012 69 (1) 96-104

33 Jahn H Wittke S Zurbig P Raedler T J Arlt S Kellmann M Mullen W

Eichenlaub M Mischak H Wiedemann K Peptide fingerprinting of Alzheimers disease in

cerebrospinal fluid identification and prospective evaluation of new synaptic biomarkers PLoS

One 2011 6 (10) e26540

34 Rajagopal M U Hathout Y MacDonald T J Kieran M W Gururangan S

Blaney S M Phillips P Packer R Gordish-Dressman H Rood B R Proteomic profiling

of cerebrospinal fluid identifies prostaglandin D2 synthase as a putative biomarker for pediatric

medulloblastoma A pediatric brain tumor consortium study Proteomics 2011 11 (5) 935-43

35 Giron P Dayon L Turck N Hoogland C Sanchez J C Quantitative analysis of

human cerebrospinal fluid proteins using a combination of cysteine tagging and amine-reactive

isobaric labeling J Proteome Res 2011 10 (1) 249-58

36 Ting L Rad R Gygi S P Haas W MS3 eliminates ratio distortion in isobaric

multiplexed quantitative proteomics Nat Methods 2011 8 (11) 937-40

37 Michalski A Damoc E Hauschild J P Lange O Wieghaus A Makarov A

Nagaraj N Cox J Mann M Horning S Mass spectrometry-based proteomics using Q

Exactive a high-performance benchtop quadrupole Orbitrap mass spectrometer Mol Cell

Proteomics 2011 10 (9) M111 011015

38 Spirin V Shpunt A Seebacher J Gentzel M Shevchenko A Gygi S Sunyaev

S Assigning spectrum-specific P-values to protein identifications by mass spectrometry

Bioinformatics 2011 27 (8) 1128-34

39 Kellie J F Catherman A D Durbin K R Tran J C Tipton J D Norris J L

Witkowski C E Thomas P M Kelleher N L Robust Analysis of the Yeast Proteome under

59

50 kDa by Molecular-Mass-Based Fractionation and Top-Down Mass Spectrometry Anal Chem

2012 84 (1) 209-15

40 Wilhelm M Kirchner M Steen J A Steen H mz5 Space- and Time-efficient

Storage of Mass Spectrometry Data Sets Mol Cell Proteomics 2012 11 (1) O111 011379

41 Zhou F Sikorski T W Ficarro S B Webber J T Marto J A Online nanoflow

reversed phase-strong anion exchange-reversed phase liquid chromatography-tandem mass

spectrometry platform for efficient and in-depth proteome sequence analysis of complex

organisms Anal Chem 2011 83 (18) 6996-7005

42 Winter D Steen H Optimization of cell lysis and protein digestion protocols for the

analysis of HeLa S3 cells by LC-MSMS Proteomics 2011 11 (24) 4726-30

43 Shteynberg D Deutsch E W Lam H Eng J K Sun Z Tasman N Mendoza L

Moritz R L Aebersold R Nesvizhskii A I iProphet multi-level integrative analysis of

shotgun proteomic data improves peptide and protein identification rates and error estimates

Mol Cell Proteomics 2011 10 (12) M111 007690

44 Kellie J F Catherman A D Durbin K R Tran J C Tipton J D Norris J L

Witkowski C E Thomas P M Kelleher N L Robust Analysis of the Yeast Proteome under

50 kDa by Molecular-Mass-Based Fractionation and Top-Down Mass Spectrometry Anal Chem

84 (1) 209-15

45 Datta A Park J E Li X Zhang H Ho Z S Heese K Lim S K Tam J P Sze

S K Phenotyping of an in vitro model of ischemic penumbra by iTRAQ-based shotgun

quantitative proteomics J Proteome Res 9 (1) 472-84

46 Decramer S Gonzalez de Peredo A Breuil B Mischak H Monsarrat B Bascands

J L Schanstra J P Urine in clinical proteomics Mol Cell Proteomics 2008 7 (10) 1850-62

47 De La Monte S M Wands J R The AD7c-NTP neuronal thread protein biomarker for

detecting Alzheimers disease J Alzheimers Dis 2001 3 (3) 345-353

48 Van Dorsselaer A Carapito C Delalande F Schaeffer-Reiss C Thierse D Diemer

H McNair D S Krewski D Cashman N R Detection of prion protein in urine-derived

injectable fertility products by a targeted proteomic approach PLoS One 2011 6 (3) e17815

49 Kuwabara Y Mine K Katayama A Inagawa T Akira S Takeshita T Proteomic

analyses of recombinant human follicle-stimulating hormone and urinary-derived gonadotropin

preparations J Reprod Med 2009 54 (8) 459-66

50 Court M Selevsek N Matondo M Allory Y Garin J Masselon C D Domon B

Toward a standardized urine proteome analysis methodology Proteomics 2011 11 (6) 1160-71

51 Marimuthu A OMeally R N Chaerkady R Subbannayya Y Nanjappa V Kumar

P Kelkar D S Pinto S M Sharma R Renuse S Goel R Christopher R Delanghe B

Cole R N Harsha H C Pandey A A comprehensive map of the human urinary proteome J

Proteome Res 2011 10 (6) 2734-43

52 Bermejo-Pareja F Antequera D Vargas T Molina J A Carro E Saliva levels of

Abeta1-42 as potential biomarker of Alzheimers disease a pilot study BMC Neurol 2010 10

108

53 Devic I Hwang H Edgar J S Izutsu K Presland R Pan C Goodlett D R

Wang Y Armaly J Tumas V Zabetian C P Leverenz J B Shi M Zhang J Salivary

alpha-synuclein and DJ-1 potential biomarkers for Parkinsons disease Brain 2011 134 (Pt 7)

e178

60

54 Al-Tarawneh S K Border M B Dibble C F Bencharit S Defining salivary

biomarkers using mass spectrometry-based proteomics a systematic review OMICS 2011 15

(6) 353-61

55 Castagnola M Cabras T Vitali A Sanna M T Messana I Biotechnological

implications of the salivary proteome Trends Biotechnol 2011 29 (8) 409-18

56 Caprioli R M Farmer T B Gile J Molecular imaging of biological samples

localization of peptides and proteins using MALDI-TOF MS Anal Chem 1997 69 (23) 4751-

60

57 Stoeckli M Chaurand P Hallahan D E Caprioli R M Imaging mass spectrometry

a new technology for the analysis of protein expression in mammalian tissues Nat Med 2001 7

(4) 493-6

58 Chaurand P Schwartz S A Caprioli R M Imaging mass spectrometry a new tool to

investigate the spatial organization of peptides and proteins in mammalian tissue sections Curr

Opin Chem Biol 2002 6 (5) 676-81

59 Chaurand P Stoeckli M Caprioli R M Direct profiling of proteins in biological

tissue sections by MALDI mass spectrometry Anal Chem 1999 71 (23) 5263-70

60 Seeley E H Caprioli R M MALDI imaging mass spectrometry of human tissue

method challenges and clinical perspectives Trends Biotechnol 2011 29 (3) 136-43

61 Cazares L H Troyer D A Wang B Drake R R Semmes O J MALDI tissue

imaging from biomarker discovery to clinical applications Anal Bioanal Chem 2011 401 (1)

17-27

62 Stauber J Lemaire R Franck J Bonnel D Croix D Day R Wisztorski M

Fournier I Salzet M MALDI imaging of formalin-fixed paraffin-embedded tissues

application to model animals of Parkinson disease for biomarker hunting J Proteome Res 2008

7 (3) 969-78

63 Yuki D Sugiura Y Zaima N Akatsu H Hashizume Y Yamamoto T Fujiwara

M Sugiyama K Setou M Hydroxylated and non-hydroxylated sulfatide are distinctly

distributed in the human cerebral cortex Neuroscience 2011 193 44-53

64 Makridakis M Vlahou A Secretome proteomics for discovery of cancer biomarkers J

Proteomics 2010 73 (12) 2291-305

65 Dowling P Clynes M Conditioned media from cell lines a complementary model to

clinical specimens for the discovery of disease-specific biomarkers Proteomics 2011 11 (4)

794-804

66 Moore N H Costa L G Shaffer S A Goodlett D R Guizzetti M Shotgun

proteomics implicates extracellular matrix proteins and protease systems in neuronal

development induced by astrocyte cholinergic stimulation J Neurochem 2009 108 (4) 891-908

67 Keene S D Greco T M Parastatidis I Lee S H Hughes E G Balice-Gordon R

J Speicher D W Ischiropoulos H Mass spectrometric and computational analysis of

cytokine-induced alterations in the astrocyte secretome Proteomics 2009 9 (3) 768-82

68 Dowell J A Johnson J A Li L Identification of astrocyte secreted proteins with a

combination of shotgun proteomics and bioinformatics J Proteome Res 2009 8 (8) 4135-43

69 Ahmed F E Sample preparation and fractionation for proteome analysis and cancer

biomarker discovery by mass spectrometry J Sep Sci 2009 32 (5-6) 771-98

70 Righetti P G Fasoli E Boschetti E Combinatorial peptide ligand libraries the

conquest of the hidden proteome advances at great strides Electrophoresis 2011 32 (9) 960-6

61

71 Selvaraju S El Rassi Z Liquid-phase-based separation systems for depletion

prefractionation and enrichment of proteins in biological fluids and matrices for in-depth

proteomics analysis - An update covering the period 2008-2011 Electrophoresis 2011 33 (1)

74-88

72 Kalume D E Molina H Pandey A Tackling the phosphoproteome tools and

strategies Curr Opin Chem Biol 2003 7 (1) 64-9

73 Cohen P The regulation of protein function by multisite phosphorylation--a 25 year

update Trends Biochem Sci 2000 25 (12) 596-601

74 Nagaraj N DSouza R C Cox J Olsen J V Mann M Feasibility of large-scale

phosphoproteomics with higher energy collisional dissociation fragmentation J Proteome Res

2010 9 (12) 6786-94

75 Olsen J V Vermeulen M Santamaria A Kumar C Miller M L Jensen L J

Gnad F Cox J Jensen T S Nigg E A Brunak S Mann M Quantitative

phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis Sci

Signal 2010 3 (104) ra3

76 Wagner P D Vu N D Histidine to aspartate phosphotransferase activity of nm23

proteins phosphorylation of aldolase C on Asp-319 Biochem J 2000 346 Pt 3 623-30

77 Chi A Huttenhower C Geer L Y Coon J J Syka J E Bai D L Shabanowitz

J Burke D J Troyanskaya O G Hunt D F Analysis of phosphorylation sites on proteins

from Saccharomyces cerevisiae by electron transfer dissociation (ETD) mass spectrometry Proc

Natl Acad Sci U S A 2007 104 (7) 2193-8

78 Rudrabhatla P Jaffe H Pant H C Direct evidence of phosphorylated neuronal

intermediate filament proteins in neurofibrillary tangles (NFTs) phosphoproteomics of

Alzheimers NFTs FASEB J 25 (11) 3896-905

79 Saez-Valero J Fodero L R Sjogren M Andreasen N Amici S Gallai V

Vanderstichele H Vanmechelen E Parnetti L Blennow K Small D H Glycosylation of

acetylcholinesterase and butyrylcholinesterase changes as a function of the duration of

Alzheimers disease J Neurosci Res 2003 72 (4) 520-6

80 Silveyra M X Cuadrado-Corrales N Marcos A Barquero M S Rabano A

Calero M Saez-Valero J Altered glycosylation of acetylcholinesterase in Creutzfeldt-Jakob

disease J Neurochem 2006 96 (1) 97-104

81 Wei X Li L Comparative glycoproteomics approaches and applications Brief Funct

Genomic Proteomic 2009 8 (2) 104-13

82 Tian Y Zhang H Glycoproteomics and clinical applications Proteomics Clin Appl

2010 4 (2) 124-32

83 Butterfield D A Owen J B Lectin-affinity chromatography brain glycoproteomics

and Alzheimer disease insights into protein alterations consistent with the pathology and

progression of this dementing disorder Proteomics Clin Appl 2011 5 (1-2) 50-6

84 Wei X Herbst A Ma D Aiken J Li L A quantitative proteomic approach to prion

disease biomarker research delving into the glycoproteome J Proteome Res 2010 10 (6) 2687-

702

85 Rucevic M Hixson D Josic D Mammalian plasma membrane proteins as potential

biomarkers and drug targets Electrophoresis 2011 32 (13) 1549-64

86 Gilmore J M Washburn M P Advances in shotgun proteomics and the analysis of

membrane proteomes J Proteomics 2010 73 (11) 2078-91

62

87 Weekes M P Antrobus R Lill J R Duncan L M Hor S Lehner P J

Comparative analysis of techniques to purify plasma membrane proteins J Biomol Tech 2010

21 (3) 108-15

88 Helbig A O Heck A J Slijper M Exploring the membrane proteome--challenges

and analytical strategies J Proteomics 2010 73 (5) 868-78

89 Griffin N M Schnitzer J E Overcoming key technological challenges in using mass

spectrometry for mapping cell surfaces in tissues Mol Cell Proteomics 2011 10 (2) R110

000935

90 Groen A J Lilley K S Proteomics of total membranes and subcellular membranes

Expert Rev Proteomics 2010 7 (6) 867-78

91 Wisniewski J R Tools for phospho- and glycoproteomics of plasma membranes Amino

Acids 2011 41 (2) 223-33

92 Wei X Dulberger C Li L Characterization of murine brain membrane glycoproteins

by detergent assisted lectin affinity chromatography Anal Chem 2010 82 (15) 6329-33

93 Zhang H Guo T Li X Datta A Park J E Yang J Lim S K Tam J P Sze S

K Simultaneous characterization of glyco- and phosphoproteomes of mouse brain membrane

proteome with electrostatic repulsion hydrophilic interaction chromatography Mol Cell

Proteomics 2011 9 (4) 635-47

94 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 2011 10 (9) 4219-29

95 Ma M Chen R Ge Y He H Marshall A G Li L Combining bottom-up and top-

down mass spectrometric strategies for de novo sequencing of the crustacean hyperglycemic

hormone from Cancer borealis Anal Chem 2009 81 (1) 240-7

96 Chen R Ma M Hui L Zhang J Li L Measurement of neuropeptides in crustacean

hemolymph via MALDI mass spectrometry J Am Soc Mass Spectrom 2009 20 (4) 708-18

97 Wei H Nolkrantz K Parkin M C Chisolm C N OCallaghan J P Kennedy R

T Identification and quantification of neuropeptides in brain tissue by capillary liquid

chromatography coupled off-line to MALDI-TOF and MALDI-TOFTOF-MS Anal Chem 2006

78 (13) 4342-51

98 Dowell J A Heyden W V Li L Rat neuropeptidomics by LC-MSMS and MALDI-

FTMS Enhanced dissection and extraction techniques coupled with 2D RP-RP HPLC J

Proteome Res 2006 5 (12) 3368-75

99 Che F Y Fricker L D Quantitative peptidomics of mouse pituitary comparison of

different stable isotopic tags J Mass Spectrom 2005 40 (2) 238-49

100 Fricker L D Analysis of mouse brain peptides using mass spectrometry-based

peptidomics implications for novel functions ranging from non-classical neuropeptides to

microproteins Mol Biosyst 2010 6 (8) 1355-65

101 Colgrave M L Xi L Lehnert S A Flatscher-Bader T Wadensten H Nilsson A

Andren P E Wijffels G Neuropeptide profiling of the bovine hypothalamus thermal

stabilization is an effective tool in inhibiting post-mortem degradation Proteomics 2011 11 (7)

1264-76

102 Scholz B Alm H Mattsson A Nilsson A Kultima K Savitski M M Falth M

Skold K Brunstrom B Andren P E Dencker L Neuropeptidomic analysis of the

embryonic Japanese quail diencephalon BMC Dev Biol 2010 10 30

63

103 Hummon A B Amare A Sweedler J V Discovering new invertebrate neuropeptides

using mass spectrometry Mass Spectrom Rev 2006 25 (1) 77-98

104 Vilim F S Sasaki K Rybak J Alexeeva V Cropper E C Jing J Orekhova I

V Brezina V Price D Romanova E V Rubakhin S S Hatcher N Sweedler J V

Weiss K R Distinct mechanisms produce functionally complementary actions of neuropeptides

that are structurally related but derived from different precursors J Neurosci 2010 30 (1) 131-

47

105 Fu Q Li L De novo sequencing of neuropeptides using reductive isotopic methylation

and investigation of ESI QTOF MSMS fragmentation pattern of neuropeptides with N-terminal

dimethylation Anal Chem 2005 77 (23) 7783-95

106 Chen R Jiang X Conaway M C Mohtashemi I Hui L Viner R Li L Mass

spectral analysis of neuropeptide expression and distribution in the nervous system of the lobster

Homarus americanus J Proteome Res 2010 9 (2) 818-32

107 Altelaar A F Mohammed S Brans M A Adan R A Heck A J Improved

identification of endogenous peptides from murine nervous tissue by multiplexed peptide

extraction methods and multiplexed mass spectrometric analysis J Proteome Res 2009 8 (2)

870-6

108 Gelman J S Sironi J Castro L M Ferro E S Fricker L D Peptidomic analysis

of human cell lines J Proteome Res 2011 10 (4) 1583-92

109 Li L Sweedler J V Peptides in the brain mass spectrometry-based measurement

approaches and challenges Annu Rev Anal Chem (Palo Alto Calif) 2008 1 451-83

110 Marouga R David S Hawkins E The development of the DIGE system 2D

fluorescence difference gel analysis technology Anal Bioanal Chem 2005 382 (3) 669-78

111 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S

Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-

dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of

Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66

112 Ottervald J Franzen B Nilsson K Andersson L I Khademi M Eriksson B

Kjellstrom S Marko-Varga G Vegvari A Harris R A Laurell T Miliotis T

Matusevicius D Salter H Ferm M Olsson T Multiple sclerosis Identification and clinical

evaluation of novel CSF biomarkers J Proteomics 2010 73 (6) 1117-32

113 Diez R Herbstreith M Osorio C Alzate O 2-D Fluorescence Difference Gel

Electrophoresis (DIGE) in Neuroproteomics 2010

114 Lilley K S Razzaq A Dupree P Two-dimensional gel electrophoresis recent

advances in sample preparation detection and quantitation Curr Opin Chem Biol 2002 6 (1)

46-50

115 Oh-Ishi M Maeda T Separation techniques for high-molecular-mass proteins J

Chromatogr B Analyt Technol Biomed Life Sci 2002 771 (1-2) 49-66

116 Washburn M P Wolters D Yates J R 3rd Large-scale analysis of the yeast

proteome by multidimensional protein identification technology Nat Biotechnol 2001 19 (3)

242-7

117 Burke T W Mant C T Black J A Hodges R S Strong cation-exchange high-

performance liquid chromatography of peptides Effect of non-specific hydrophobic interactions

and linearization of peptide retention behaviour J Chromatogr 1989 476 377-89

118 Edelmann M J Strong cation exchange chromatography in analysis of posttranslational

modifications innovations and perspectives J Biomed Biotechnol 2011 2011 936508

64

119 Francois I Sandra K Sandra P Comprehensive liquid chromatography fundamental

aspects and practical considerations--a review Anal Chim Acta 2009 641 (1-2) 14-31

120 Guiochon G Marchetti N Mriziq K Shalliker R A Implementations of two-

dimensional liquid chromatography J Chromatogr A 2008 1189 (1-2) 109-68

121 Song C Ye M Han G Jiang X Wang F Yu Z Chen R Zou H Reversed-

phase-reversed-phase liquid chromatography approach with high orthogonality for

multidimensional separation of phosphopeptides Anal Chem 2010 82 (1) 53-6

122 Alpert A J Hydrophilic-interaction chromatography for the separation of peptides

nucleic acids and other polar compounds J Chromatogr 1990 499 177-96

123 Gilar M Olivova P Daly A E Gebler J C Orthogonality of separation in two-

dimensional liquid chromatography Anal Chem 2005 77 (19) 6426-34

124 Di Palma S Stange D van de Wetering M Clevers H Heck A J Mohammed S

Highly sensitive proteome analysis of FACS-sorted adult colon stem cells J Proteome Res 2011

10 (8) 3814-9

125 Di Palma S Boersema P J Heck A J Mohammed S Zwitterionic hydrophilic

interaction liquid chromatography (ZIC-HILIC and ZIC-cHILIC) provide high resolution

separation and increase sensitivity in proteome analysis Anal Chem 2011 83 (9) 3440-7

126 Zarei M Sprenger A Metzger F Gretzmeier C Dengjel J Comparison of ERLIC-

TiO2 HILIC-TiO2 and SCX-TiO2 for global phosphoproteomics approaches J Proteome Res

2011 10 (8) 3474-83

127 Neue K Mormann M Peter-Katalinic J Pohlentz G Elucidation of glycoprotein

structures by unspecific proteolysis and direct nanoESI mass spectrometric analysis of ZIC-

HILIC-enriched glycopeptides J Proteome Res 2011 10 (5) 2248-60

128 Ow S Y Salim M Noirel J Evans C Wright P C Minimising iTRAQ ratio

compression through understanding LC-MS elution dependence and high-resolution HILIC

fractionation Proteomics 2011 11 (11) 2341-6

129 Alpert A J Electrostatic repulsion hydrophilic interaction chromatography for isocratic

separation of charged solutes and selective isolation of phosphopeptides Anal Chem 2008 80

(1) 62-76

130 Chien K Y Liu H C Goshe M B Development and application of a

phosphoproteomic method using electrostatic repulsion-hydrophilic interaction chromatography

(ERLIC) IMAC and LC-MSMS analysis to study Mareks Disease Virus infection J Proteome

Res 2011 10 (9) 4041-53

131 Liu X Plasencia M Ragg S Valentine S J Clemmer D E Development of high

throughput dispersive LC-ion mobility-TOFMS techniques for analysing the human plasma

proteome Brief Funct Genomic Proteomic 2004 3 (2) 177-86

132 Valentine S J Liu X Plasencia M D Hilderbrand A E Kurulugama R T

Koeniger S L Clemmer D E Developing liquid chromatography ion mobility mass

spectometry techniques Expert Rev Proteomics 2005 2 (4) 553-65

133 Valentine S J Plasencia M D Liu X Krishnan M Naylor S Udseth H R

Smith R D Clemmer D E Toward plasma proteome profiling with ion mobility-mass

spectrometry J Proteome Res 2006 5 (11) 2977-84

134 Valentine S J Ewing M A Dilger J M Glover M S Geromanos S Hughes C

Clemmer D E Using ion mobility data to improve peptide identification intrinsic amino acid

size parameters J Proteome Res 2011 10 (5) 2318-29

65

135 Ahmed F E The role of capillary electrophoresis-mass spectrometry to proteome

analysis and biomarker discovery J Chromatogr B Analyt Technol Biomed Life Sci 2009 877

(22) 1963-81

136 Fonslow B R Yates J R 3rd Capillary electrophoresis applied to proteomic analysis

J Sep Sci 2009 32 (8) 1175-88

137 Haselberg R de Jong G J Somsen G W Capillary electrophoresis-mass

spectrometry for the analysis of intact proteins J Chromatogr A 2007 1159 (1-2) 81-109

138 Haselberg R de Jong G J Somsen G W Capillary electrophoresis-mass

spectrometry for the analysis of intact proteins 2007-2010 Electrophoresis 2011 32 (1) 66-82

139 Huck C W Bakry R Huber L A Bonn G K Progress in capillary electrophoresis

coupled to matrix-assisted laser desorptionionization - time of flight mass spectrometry

Electrophoresis 2006 27 (11) 2063-74

140 Klampfl C W CE with MS detection a rapidly developing hyphenated technique

Electrophoresis 2009 30 Suppl 1 S83-91

141 Simpson D C Smith R D Combining capillary electrophoresis with mass

spectrometry for applications in proteomics Electrophoresis 2005 26 (7-8) 1291-305

142 Li Y Champion M M Sun L Champion P A Wojcik R Dovichi N J Capillary

zone electrophoresis-electrospray ionization-tandem mass spectrometry as an alternative

proteomics platform to ultraperformance liquid chromatography-electrospray ionization-tandem

mass spectrometry for samples of intermediate complexity Anal Chem 2012 84 (3) 1617-22

143 Dai L Li C Shedden K A Lee C J Quoc H Simeone D M Lubman D M

Quantitative proteomic profiling studies of pancreatic cancer stem cells J Proteome Res 2010 9

(7) 3394-402

144 Chen J Balgley B M DeVoe D L Lee C S Capillary isoelectric focusing-based

multidimensional concentrationseparation platform for proteome analysis Anal Chem 2003 75

(13) 3145-52

145 Mischak H Delles C Klein J Schanstra J P Urinary proteomics based on capillary

electrophoresis-coupled mass spectrometry in kidney disease discovery and validation of

biomarkers and clinical application Adv Chronic Kidney Dis 2010 17 (6) 493-506

146 Albalat A Mischak H Mullen W Urine proteomics in clinical applications

technologies principal considerations and clinical implementation Prilozi 2011 32 (1) 13-44

147 Zuberovic A Hanrieder J Hellman U Bergquist J Wetterhall M Proteome

profiling of human cerebrospinal fluid exploring the potential of capillary electrophoresis with

surface modified capillaries for analysis of complex biological samples Eur J Mass Spectrom

(Chichester Eng) 2008 14 (4) 249-60

148 Johannesson N Olsson L Backstrom D Wetterhall M Danielsson R Bergquist

J Screening for biomarkers in plasma from patients with gangrenous and phlegmonous

appendicitis using CE and CEC in combination with MS Electrophoresis 2007 28 (9) 1435-43

149 Guo T Lee C S Wang W DeVoe D L Balgley B M Capillary separations

enabling tissue proteomics-based biomarker discovery Electrophoresis 2006 27 (18) 3523-32

150 Hwang S I Thumar J Lundgren D H Rezaul K Mayya V Wu L Eng J

Wright M E Han D K Direct cancer tissue proteomics a method to identify candidate

cancer biomarkers from formalin-fixed paraffin-embedded archival tissues Oncogene 2007 26

(1) 65-76

66

151 Desiderio C Rossetti D V Iavarone F Messana I Castagnola M Capillary

electrophoresis--mass spectrometry recent trends in clinical proteomics J Pharm Biomed Anal

2010 53 (5) 1161-9

152 Albalat A Mischak H Mullen W Clinical application of urinary

proteomicspeptidomics Expert Rev Proteomics 2011 8 (5) 615-29

153 Ye X Luke B Andresson T Blonder J 18O stable isotope labeling in MS-based

proteomics Brief Funct Genomic Proteomic 2009 8 (2) 136-44

154 Staes A Demol H Van Damme J Martens L Vandekerckhove J Gevaert K

Global differential non-gel proteomics by quantitative and stable labeling of tryptic peptides with

oxygen-18 J Proteome Res 2004 3 (4) 786-91

155 Jorge I Navarro P Martinez-Acedo P Nunez E Serrano H Alfranca A

Redondo J M Vazquez J Statistical model to analyze quantitative proteomics data obtained

by 18O16O labeling and linear ion trap mass spectrometry application to the study of vascular

endothelial growth factor-induced angiogenesis in endothelial cells Mol Cell Proteomics 2009

8 (5) 1130-49

156 Gygi S P Rist B Gerber S A Turecek F Gelb M H Aebersold R Quantitative

analysis of complex protein mixtures using isotope-coded affinity tags Nat Biotechnol 1999 17

(10) 994-9

157 Haqqani A S Kelly J F Stanimirovic D B Quantitative protein profiling by mass

spectrometry using isotope-coded affinity tags Methods Mol Biol 2008 439 225-40

158 Ross P L Huang Y N Marchese J N Williamson B Parker K Hattan S

Khainovski N Pillai S Dey S Daniels S Purkayastha S Juhasz P Martin S Bartlet-

Jones M He F Jacobson A Pappin D J Multiplexed protein quantitation in

Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents Mol Cell Proteomics

2004 3 (12) 1154-69

159 Thompson A Schafer J Kuhn K Kienle S Schwarz J Schmidt G Neumann T

Johnstone R Mohammed A K Hamon C Tandem mass tags a novel quantification strategy

for comparative analysis of complex protein mixtures by MSMS Anal Chem 2003 75 (8)

1895-904

160 Dayon L Hainard A Licker V Turck N Kuhn K Hochstrasser D F Burkhard

P R Sanchez J C Relative quantification of proteins in human cerebrospinal fluids by

MSMS using 6-plex isobaric tags Anal Chem 2008 80 (8) 2921-31

161 DAscenzo M Choe L Lee K H iTRAQPak an R based analysis and visualization

package for 8-plex isobaric protein expression data Brief Funct Genomic Proteomic 2008 7 (2)

127-35

162 Choe L DAscenzo M Relkin N R Pappin D Ross P Williamson B Guertin

S Pribil P Lee K H 8-plex quantitation of changes in cerebrospinal fluid protein expression

in subjects undergoing intravenous immunoglobulin treatment for Alzheimers disease

Proteomics 2007 7 (20) 3651-60

163 Xiang F Ye H Chen R Fu Q Li L NN-dimethyl leucines as novel isobaric

tandem mass tags for quantitative proteomics and peptidomics Anal Chem 2010 82 (7) 2817-

25

164 Ong S E Blagoev B Kratchmarova I Kristensen D B Steen H Pandey A

Mann M Stable isotope labeling by amino acids in cell culture SILAC as a simple and

accurate approach to expression proteomics Mol Cell Proteomics 2002 1 (5) 376-86

67

165 Ong S E Kratchmarova I Mann M Properties of 13C-substituted arginine in stable

isotope labeling by amino acids in cell culture (SILAC) J Proteome Res 2003 2 (2) 173-81

166 Pimienta G Chaerkady R Pandey A SILAC for global phosphoproteomic analysis

Methods Mol Biol 2009 527 107-16 x

167 Zhang G Neubert T A Use of stable isotope labeling by amino acids in cell culture

(SILAC) for phosphotyrosine protein identification and quantitation Methods Mol Biol 2009

527 79-92 xi

168 Bondarenko P V Chelius D Shaler T A Identification and relative quantitation of

protein mixtures by enzymatic digestion followed by capillary reversed-phase liquid

chromatography-tandem mass spectrometry Anal Chem 2002 74 (18) 4741-9

169 Chelius D Bondarenko P V Quantitative profiling of proteins in complex mixtures

using liquid chromatography and mass spectrometry J Proteome Res 2002 1 (4) 317-23

170 Liu H Sadygov R G Yates J R 3rd A model for random sampling and estimation

of relative protein abundance in shotgun proteomics Anal Chem 2004 76 (14) 4193-201

171 Wang W Zhou H Lin H Roy S Shaler T A Hill L R Norton S Kumar P

Anderle M Becker C H Quantification of proteins and metabolites by mass spectrometry

without isotopic labeling or spiked standards Anal Chem 2003 75 (18) 4818-26

172 Silva J C Denny R Dorschel C A Gorenstein M Kass I J Li G Z McKenna

T Nold M J Richardson K Young P Geromanos S Quantitative proteomic analysis by

accurate mass retention time pairs Anal Chem 2005 77 (7) 2187-200

173 Zybailov B Coleman M K Florens L Washburn M P Correlation of relative

abundance ratios derived from peptide ion chromatograms and spectrum counting for

quantitative proteomic analysis using stable isotope labeling Anal Chem 2005 77 (19) 6218-

24

174 Filiou M D Martins-de-Souza D Guest P C Bahn S Turck C W To label or not

to label Applications of quantitative proteomics in neuroscience research Proteomics 2012

175 Neilson K A Ali N A Muralidharan S Mirzaei M Mariani M Assadourian G

Lee A van Sluyter S C Haynes P A Less label more free approaches in label-free

quantitative mass spectrometry Proteomics 2011 11 (4) 535-53

176 Zhu W Smith J W Huang C M Mass spectrometry-based label-free quantitative

proteomics J Biomed Biotechnol 2010 2010 840518

177 Xie F Liu T Qian W J Petyuk V A Smith R D Liquid chromatography-mass

spectrometry-based quantitative proteomics J Biol Chem 2012 286 (29) 25443-9

178 Shen Y Tolic N Xie F Zhao R Purvine S O Schepmoes A A Moore R J

Anderson G A Smith R D Effectiveness of CID HCD and ETD with FT MSMS for

degradomic-peptidomic analysis comparison of peptide identification methods J Proteome Res

2011 10 (9) 3929-43

179 Zubarev R A Kelleher N L McLafferty F W Electron Capture Dissociation of

Multiply Charged Protein Cations A Nonergodic Process Journal of the American Chemical

Society 1998 120 (13) 3265-3266

180 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and

protein sequence analysis by electron transfer dissociation mass spectrometry Proc Natl Acad

Sci U S A 2004 101 (26) 9528-33

181 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-

site identity and position on electron-transfer dissociation of polypeptide cations J Am Chem Soc

2007 129 (40) 12232-43

68

182 Swaney D L McAlister G C Coon J J Decision tree-driven tandem mass

spectrometry for shotgun proteomics Nat Methods 2008 5 (11) 959-64

183 Zhou W Ross M M Tessitore A Ornstein D Vanmeter A Liotta L A

Petricoin E F 3rd An initial characterization of the serum phosphoproteome J Proteome Res

2009 8 (12) 5523-31

184 Liu H Hakansson K Electron capture dissociation of tyrosine O-sulfated peptides

complexed with divalent metal cations Anal Chem 2006 78 (21) 7570-6

185 Snovida S I Bodnar E D Viner R Saba J Perreault H A simple cellulose

column procedure for selective enrichment of glycopeptides and characterization by nano LC

coupled with electron-transfer and high-energy collisional-dissociation tandem mass

spectrometry Carbohydr Res 2010 345 (6) 792-801

186 Sobott F Watt S J Smith J Edelmann M J Kramer H B Kessler B M

Comparison of CID versus ETD based MSMS fragmentation for the analysis of protein

ubiquitination J Am Soc Mass Spectrom 2009 20 (9) 1652-9

187 Eliuk S M Maltby D Panning B Burlingame A L High resolution electron

transfer dissociation studies of unfractionated intact histones from murine embryonic stem cells

using on-line capillary LC separation determination of abundant histone isoforms and post-

translational modifications Mol Cell Proteomics 2010 9 (5) 824-37

188 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 10 (9) 4219-29

189 Molina H Matthiesen R Kandasamy K Pandey A Comprehensive comparison of

collision induced dissociation and electron transfer dissociation Anal Chem 2008 80 (13)

4825-35

190 Olsen J V Macek B Lange O Makarov A Horning S Mann M Higher-energy

C-trap dissociation for peptide modification analysis Nat Methods 2007 4 (9) 709-12

191 McAlister G C Phanstiel D Wenger C D Lee M V Coon J J Analysis of

tandem mass spectra by FTMS for improved large-scale proteomics with superior protein

quantification Anal Chem 2010 82 (1) 316-22

192 Dayon L Pasquarello C Hoogland C Sanchez J C Scherl A Combining low-

and high-energy tandem mass spectra for optimized peptide quantification with isobaric tags J

Proteomics 2010 73 (4) 769-77

193 Second T P Blethrow J D Schwartz J C Merrihew G E MacCoss M J

Swaney D L Russell J D Coon J J Zabrouskov V Dual-pressure linear ion trap mass

spectrometer improving the analysis of complex protein mixtures Anal Chem 2009 81 (18)

7757-65

194 Jedrychowski M P Huttlin E L Haas W Sowa M E Rad R Gygi S P

Evaluation of HCD- and CID-type fragmentation within their respective detection platforms for

murine phosphoproteomics Mol Cell Proteomics 2011 10 (12) M111 009910

195 Frese C K Altelaar A F Hennrich M L Nolting D Zeller M Griep-Raming J

Heck A J Mohammed S Improved peptide identification by targeted fragmentation using

CID HCD and ETD on an LTQ-Orbitrap Velos J Proteome Res 2011 10 (5) 2377-88

196 Chakraborty A B Berger S J Gebler J C Use of an integrated MS--multiplexed

MSMS data acquisition strategy for high-coverage peptide mapping studies Rapid Commun

Mass Spectrom 2007 21 (5) 730-44

69

197 Ramos A A Yang H Rosen L E Yao X Tandem parallel fragmentation of

peptides for mass spectrometry Anal Chem 2006 78 (18) 6391-7

198 Barbara J E Castro-Perez J M High-resolution chromatographytime-of-flight MSE

with in silico data mining is an information-rich approach to reactive metabolite screening Rapid

Commun Mass Spectrom 2011 25 (20) 3029-40

199 Geromanos S J Vissers J P Silva J C Dorschel C A Li G Z Gorenstein M

V Bateman R H Langridge J I The detection correlation and comparison of peptide

precursor and product ions from data independent LC-MS with data dependant LC-MSMS

Proteomics 2009 9 (6) 1683-95

200 Koutroukides T A Guest P C Leweke F M Bailey D M Rahmoune H Bahn

S Martins-de-Souza D Characterization of the human serum depletome by label-free shotgun

proteomics J Sep Sci 34 (13) 1621-6

201 Martins-de-Souza D Guest P C Guest F L Bauder C Rahmoune H Pietsch S

Roeber S Kretzschmar H Mann D Baborie A Bahn S Characterization of the human

primary visual cortex and cerebellum proteomes using shotgun mass spectrometry-data-

independent analyses Proteomics 12 (3) 500-4

202 Scatena R Bottoni P Pontoglio A Giardina B Revisiting the Warburg effect in

cancer cells with proteomics The emergence of new approaches to diagnosis prognosis and

therapy Proteomics Clin Appl 4 (2) 143-58

203 Herberth M Koethe D Cheng T M Krzyszton N D Schoeffmann S Guest P

C Rahmoune H Harris L W Kranaster L Leweke F M Bahn S Impaired glycolytic

response in peripheral blood mononuclear cells of first-onset antipsychotic-naive schizophrenia

patients Mol Psychiatry 16 (8) 848-59

204 Krishnamurthy D Levin Y Harris L W Umrania Y Bahn S Guest P C

Analysis of the human pituitary proteome by data independent label-free liquid chromatography

tandem mass spectrometry Proteomics 11 (3) 495-500

205 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

206 Eng J K McCormack A L Yates Iii J R An approach to correlate tandem mass

spectral data of peptides with amino acid sequences in a protein database Journal of the

American Society for Mass Spectrometry 1994 5 (11) 976-989

207 Craig R Beavis R C TANDEM matching proteins with tandem mass spectra

Bioinformatics 2004 20 (9) 1466-7

208 Geer L Y Markey S P Kowalak J A Wagner L Xu M Maynard D M Yang

X Shi W Bryant S H Open mass spectrometry search algorithm J Proteome Res 2004 3

(5) 958-64

209 Zhang J Xin L Shan B Chen W Xie M Yuen D Zhang W Zhang Z Lajoie

G A Ma B PEAKS DB De Novo sequencing assisted database search for sensitive and

accurate peptide identification Mol Cell Proteomics 2011

210 Liu J Erassov A Halina P Canete M Nguyen D V Chung C Cagney G

Ignatchenko A Fong V Emili A Sequential interval motif search unrestricted database

surveys of global MSMS data sets for detection of putative post-translational modifications

Anal Chem 2008 80 (20) 7846-54

70

211 Beausoleil S A Villen J Gerber S A Rush J Gygi S P A probability-based

approach for high-throughput protein phosphorylation analysis and site localization Nat

Biotechnol 2006 24 (10) 1285-92

212 Han Y Ma B Zhang K SPIDER software for protein identification from sequence

tags with de novo sequencing error J Bioinform Comput Biol 2005 3 (3) 697-716

213 Singh S Springer M Steen J Kirschner M W Steen H FLEXIQuant a novel tool

for the absolute quantification of proteins and the simultaneous identification and quantification

of potentially modified peptides J Proteome Res 2009 8 (5) 2201-10

214 Searle B C Scaffold a bioinformatic tool for validating MSMS-based proteomic

studies Proteomics 10 (6) 1265-9

215 Herbst A McIlwain S Schmidt J J Aiken J M Page C D Li L Prion disease

diagnosis by proteomic profiling J Proteome Res 2009 8 (2) 1030-6

216 Menschaert G Vandekerckhove T T Baggerman G Schoofs L Luyten W Van

Criekinge W Peptidomics coming of age a review of contributions from a bioinformatics

angle J Proteome Res 2010 9 (5) 2051-61

217 Kumar C Mann M Bioinformatics analysis of mass spectrometry-based proteomics

data sets FEBS Lett 2009 583 (11) 1703-12

218 Craig R Cortens J P Beavis R C Open source system for analyzing validating and

storing protein identification data J Proteome Res 2004 3 (6) 1234-42

219 Jones P Cote R G Cho S Y Klie S Martens L Quinn A F Thorneycroft D

Hermjakob H PRIDE new developments and new datasets Nucleic Acids Res 2008 36

(Database issue) D878-83

220 Deutsch E W Lam H Aebersold R PeptideAtlas a resource for target selection for

emerging targeted proteomics workflows EMBO Rep 2008 9 (5) 429-34

221 Miliotis T Ali L Palm J E Lundqvist A J Ahnoff M Andersson T B

Hilgendorf C Development of a highly sensitive method using liquid chromatography-multiple

reaction monitoring to quantify membrane P-glycoprotein in biological matrices and relationship

to transport function Drug Metab Dispos 2011 39 (12) 2440-9

222 Xiang Y Koomen J M Evaluation of Direct Infusion-Multiple Reaction Monitoring

Mass Spectrometry for Quantification of Heat Shock Proteins Anal Chem 2012

223 Ossola R Schiess R Picotti P Rinner O Reiter L Aebersold R Biomarker

validation in blood specimens by selected reaction monitoring mass spectrometry of N-

glycosites Methods Mol Biol 2011 728 179-94

224 Liu H Lam L Dasgupta P K Expanding the linear dynamic range for multiple

reaction monitoring in quantitative liquid chromatography-tandem mass spectrometry utilizing

natural isotopologue transitions Talanta 2011 87 307-10

225 Belov M E Prasad S Prior D C Danielson W F 3rd Weitz K Ibrahim Y M

Smith R D Pulsed multiple reaction monitoring approach to enhancing sensitivity of a tandem

quadrupole mass spectrometer Anal Chem 2011 83 (6) 2162-71

226 Prentice R L Paczesny S Aragaki A Amon L M Chen L Pitteri S J

McIntosh M Wang P Buson Busald T Hsia J Jackson R D Rossouw J E Manson J

E Johnson K Eaton C Hanash S M Novel proteins associated with risk for coronary heart

disease or stroke among postmenopausal women identified by in-depth plasma proteome

profiling Genome Med 2 (7) 48

227 Gelman J S Fricker L D Hemopressin and other bioactive peptides from cytosolic

proteins are these non-classical neuropeptides AAPS J 2010 12 (3) 279-89

71

228 Gelman J S Sironi J Castro L M Ferro E S Fricker L D Hemopressins and

other hemoglobin-derived peptides in mouse brain comparison between brain blood and heart

peptidome and regulation in Cpefatfat mice J Neurochem 2010 113 (4) 871-80

229 Li L Garden R W Sweedler J V Single-cell MALDI a new tool for direct peptide

profiling Trends Biotechnol 2000 18 (4) 151-60

230 Altelaar A M Heck A J Trends in ultrasensitive proteomics Curr Opin Chem Biol

231 Bandura D R Baranov V I Ornatsky O I Antonov A Kinach R Lou X

Pavlov S Vorobiev S Dick J E Tanner S D Mass cytometry technique for real time

single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass

spectrometry Anal Chem 2009 81 (16) 6813-22

232 Trimpin S Inutan E D Herath T N McEwen C N Laserspray ionization a new

atmospheric pressure MALDI method for producing highly charged gas-phase ions of peptides

and proteins directly from solid solutions Mol Cell Proteomics 2010 9 (2) 362-7

233 McEwen C N Larsen B S Trimpin S Laserspray ionization on a commercial

atmospheric pressure-MALDI mass spectrometer ion source selecting singly or multiply

charged ions Anal Chem 2010 82 (12) 4998-5001

234 Wang B Lietz C B Inutan E D Leach S M Trimpin S Producing highly

charged ions without solvent using laserspray ionization a total solvent-free analysis approach at

atmospheric pressure Anal Chem 2011 83 (11) 4076-84

235 Inutan E D Richards A L Wager-Miller J Mackie K McEwen C N Trimpin

S Laserspray ionization a new method for protein analysis directly from tissue at atmospheric

pressure with ultrahigh mass resolution and electron transfer dissociation Mol Cell Proteomics

2010 10 (2) M110 000760

236 Wang H Liu J Cooks R G Ouyang Z Paper spray for direct analysis of complex

mixtures using mass spectrometry Angew Chem Int Ed Engl 49 (5) 877-80

237 Zhang Z Xu W Manicke N E Cooks R G Ouyang Z Silica coated paper

substrate for paper-spray analysis of therapeutic drugs in dried blood spots Anal Chem 84 (2)

931-8

238 Wang H Manicke N E Yang Q Zheng L Shi R Cooks R G Ouyang Z

Direct analysis of biological tissue by paper spray mass spectrometry Anal Chem 83 (4) 1197-

201

239 Liu J Wang H Cooks R G Ouyang Z Leaf spray direct chemical analysis of plant

material and living plants by mass spectrometry Anal Chem 83 (20) 7608-13

240 Liu J Wang H Manicke N E Lin J M Cooks R G Ouyang Z Development

characterization and application of paper spray ionization Anal Chem 82 (6) 2463-71

241 Yoo H J Wang N Zhuang S Song H Hakansson K Negative-ion electron

capture dissociation radical-driven fragmentation of charge-increased gaseous peptide anions J

Am Chem Soc 2011 133 (42) 16790-3

242 Tian Q Price N D Hood L Systems cancer medicine towards realization of

predictive preventive personalized and participatory (P4) medicine J Intern Med 271 (2) 111-

21

243 Hood L Friend S H Predictive personalized preventive participatory (P4) cancer

medicine Nat Rev Clin Oncol 2011 8 (3) 184-7

244 Tian Q Price N D Hood L Systems cancer medicine towards realization of

predictive preventive personalized and participatory (P4) medicine J Intern Med 2012 271

(2) 111-21

72

245 Belda-Iniesta C de Castro J Perona R Translational proteomics what can you do for

true patients J Proteome Res 2011 10 (1) 101-4

246 Schiess R Wollscheid B Aebersold R Targeted proteomic strategy for clinical

biomarker discovery Mol Oncol 2009 3 (1) 33-44

247 Hsu J L Huang S Y Chow N H Chen S H Stable-isotope dimethyl labeling for

quantitative proteomics Anal Chem 2003 75 (24) 6843-52

248 Schmidt A Kellermann J Lottspeich F A novel strategy for quantitative proteomics

using isotope-coded protein labels Proteomics 2005 5 (1) 4-15

249 Wang Y K Ma Z Quinn D F Fu E W Inverse 15

N-metabolic labelingmass

spectrometry for comparative proteomics and rapid identification of protein markerstargets

Rapid Commun Mass Spectrom 2002 16 (14) 1389-97

73

Chapter 3

Protein changes in immunodepleted cerebrospinal fluid from transgenic

mouse models of Alexander disease detected using mass spectrometry

Adapted from ldquoProtein changes in immunodepleted cerebrospinal fluid from transgenic mouse

models of Alexander disease detected using mass spectrometryrdquo Cunningham R Jany P

Messing A Li L Submitted

74

ABSTRACT

Cerebrospinal fluid (CSF) is a low protein content biological fluid with dynamic range

spanning at least nine orders of magnitude in protein content and is in direct contact with the

brain A modified IgY-14 immunodepletion treatment was performed to enhance analysis of the

low volumes of CSF that are obtainable from mice As a model system in which to test this

approach we utilized transgenic mice that over-express the intermediate filament glial fibrillary

acidic protein (GFAP) These mice are models for Alexander disease (AxD) a severe

leukodystrophy in humans From the CSF of control and transgenic mice we report the

identification of 266 proteins with relative quantification of 106 proteins Biological and

technical triplicates were performed to address animal variability as well as reproducibility in

mass spectrometric analysis Relative quantitation was performed using distributive normalized

spectral abundance factor (dNSAF) spectral counting analysis A panel of biomarker proteins

with significant changes in the CSF of GFAP transgenic mice has been identified with validation

from ELISA and microarray data demonstrating the utility of our methodology and providing

interesting targets for future investigations on the molecular and pathological aspects of AxD

75

INTRODUCTION

Alexander disease (AxD) is a fatal neurogenetic disorder caused by heterozygous point

mutations in the coding region of GFAP (encoding glial fibrillary acidic protein)1 The hallmark

diagnostic feature of this disease is the accumulation of astrocytic cytoplasmic inclusions known

as Rosenthal fibers containing GFAP Hsp27 αB-crystallin and other components2-5

Although

several potential treatment strategies6-8

are under investigation clinical trial design is hampered

by the absence of a standardized clinical scoring system or means to quantify lesions in MRI

that could serve to monitor severity and progression of disease One solution to this problem

would be the identification of biomarkers in readily sampled body fluids as indirect indicators of

disease

Cerebrospinal fluid (CSF) has a long history as a surrogate biopsy site for brain or spinal

cord in evaluating diseases of the central nervous system The protein composition of CSF is

well defined at least for the most abundant species of proteins and numerous studies exist that

characterize individual biomarkers or complex patterns of biomarkers in various diseases9 10

GFAP itself is present in CSF (albeit at much lower levels than in brain parenchyma) and in one

study of three Alexander disease patients its levels were markedly increased11

Whether an

increase in CSF GFAP will be a consistent finding in Alexander disease or whether other useful

biomarkers for this disease could be identified through an unbiased analysis of the CSF

proteome is not yet known

The rarity of Alexander disease makes analysis of human samples difficult However

mouse models exist that replicate key features of the disease such as formation of Rosenthal

fibers Unfortunately mouse CSF poses particular problems for proteomic studies and there is

76

an urgent need for technical improvements for dealing with this fluid For instance collection

from an adult mouse typically yields ~10 μL of CSF often with some contamination by blood12

To further complicate analysis CSF has an exceedingly low protein content (~04 μgμL) with

over 60 of the total protein content consisting of a single protein albumin13 14

A number of

techniques have been developed to remove albumin from biological samples including Cibacron

Blue15

IgG immunodepletion16

and IgY immunodepletion17-19

IgY which is avian in origin

offers reduced non-specific binding and increased avidity when compared to IgG antibodies from

rabbits goats and mice20-23

One widely used IgY cocktail is IgY-14 which contains fourteen

specific antibodies specific for albumin IgG transferrin fibrinogen α1-antitrypsin IgA IgM

α2-macroglobulin haptoglobin apolipoproteins A-I A-II and B complement C3 and α1-acid

glycoprotein Since existing protocols for IgY-14 depletion are optimized for use with large

volumes of serum new protocols must be developed to permit its use with the low volumes of a

low protein fluid represented by mouse CSF

Various improvements have also taken place in the field of proteomic analysis that could

facilitate analysis of mouse CSF Data dependent tandem mass spectrometry followed by

quantification of proteins is used in standard shotgun proteomics24-29

Several methods now exist

for introducing quantitation into mass spectrometry including stable isotope labeling30-32

isobaric tandem mass tags33 34

and spectral counting35 36

Spectral counting which is a

frequency measurement that uses MSMS counts of identified peptides as the metric to enable

protein quantitation is attractive because it is label-free and requires no additional sample

preparation Finally recent advances in spectral counting has produced a data refinement

strategy termed normalized spectral abundance factor (NSAF)37 38

and further developed into

distributive NSAF (dNSAF) to address issues with peptides shared by multiple proteins39

77

To identify potential biomarkers in AxD we report a novel scaled-down version of IgY

antibody depletion strategy to reduce the complexity and remove high abundance proteins in

mouse CSF samples The generated spectral counts data were then subjected to dNSAF natural

log data transformation and t-test analysis to determine which proteins differ in abundance when

comparing GFAP transgenics and controls with multiple biological and technical replicates

EXPERIMENTAL DETAILS

Chemicals

Acetonitrile (HPLC grade) urea (gt99) sodium chloride (997) and ammonium

bicarbonate (certified) were purchased from Fisher Scientific (Fair Lawn NJ) Deionized water

(182 MΩcm) was prepared with a Milli-Q Millipore system (Billerica MA) Optima LCMS

grade acetonitrile and water were purchased from Fisher Scientific (Fair Lawn NJ) DL-

Dithiothreitol (DTT) and sequencing grade modified trypsin were purchased from Promega

(Madison WI) Formic acid (ge98) was obtained from Fluka (Buchs Switzerland)

Iodoacetamide (IAA) tris(hydroxymethyl)aminomethane (Tris ge999 ) ammonium formate

(ge99995) glycine (ge985) and IgY-14 antibodies were purchased from Sigma-Aldrich

(Saint Louis MO)

Mice

Transgenic mice over-expressing the human GFAP gene (Tg737 line) were maintained

as hemizygotes in the FVBN background and genotyped via PCR on DNA prepared from tail

samples as described previously40

The mice were housed on a 14-10 light-dark cycle with ad

libitum access to food and water All procedures were conducted using protocols approved by

the UW-Madison IACUC

78

CSF collection

CSF was collected from mice as described previously12

Briefly mice were anesthetized

with avertin (400-600 mgkg ip) A midline sagittal incision was made over the dorsal aspect

of the hindbrain and three muscle layers carefully peeled back to expose the cisterna magna The

membrane covering the cisterna magna was pierced with a 30 gauge needle and CSF was

collected immediately using a flexible plastic pipette Approximately ten microliters of CSF was

collected per animal All samples used for MS analysis showed no visible contamination of

blood

Enzyme-linked immunosorbent assay (ELISA)

A sandwich ELISA was used to quantify GFAP8 Briefly a microtiter plate was coated

with a cocktail of monoclonal antibodies (Covance SMI-26R) Plates were blocked with 5

milk before addition of sample or standards diluted in PBS with Triton-X and BSA A rabbit

polyclonal antibody (DAKO Z334) was used to detect the GFAP followed by a peroxidase

conjugated anti-rabbit IgG antibody (Sigma A6154) secondary antibody The peroxidase activity

was detected with SuperSignal ELISA Pico Chemiluminescent Substrate (PIERCE) and

quantified with a GloRunner Microplate Luminometer Values below the biological limit of

detection (16ngL) were given the value 16ngL before multiplying by the dilution factor

Immunodepletion of abundant proteins

Currently there are no commercial immunodepletion products available for use with CSF

and to address the low protein content of this fluid (~04 microgmicroL) Therefore 100 microL of

purchased IgY-14 resin was coupled with a Pierce Spin Cup using a paper filter from Thermo

Scientific (Waltham MA) CSF samples from either transgenics or controls were first pooled to

100 microL and then added to 100 microL of 20 mM Tris-HCl 300 mM pH 74 (2x dilution buffer) and

79

allowed to mix with the custom IgY-14 depletion spin column on an end-over-end rotor for 30

minutes at 4oC The sample was then centrifuged at 04 rcf for 45 seconds with an Eppendorf

Centrifuge 5415D (Hamburg Germany) The antibodies were then washed with 50 microL 1x

dilution buffer vortexed for 5 minutes centrifuged at 04 rcf for 45 seconds and the flow through

was collected for tryptic digestion The antibodies were then stripped of the bound proteins with

four 025 mL washes of 01 M glycine pH 25 neutralized with two 025 mL washes of 01 M

Tris-HCl pH 80 and washed once with 025mL of 1x dilution buffer The immunodepletion

protocol was repeated for each transgenic or control pooled 100 microL sample (N=3)

Preparation of tryptic digests

The immunodepleted pooled mouse CSF samples (200 microL total volume) were

concentrated to 10 microL using a Thermo Scientific Savant SpeedVac (SVC100 Waltham MA)

To each sample 8 microL of 133 M urea and 1 microL of 050 M DTT were added and allowed to

incubate at 37oC for one hour After incubation 27 microL of 055 M IAA was added for

carboxymethylation and the sample was allowed to incubate for 15 minutes in the dark To

quench the IAA 1 microL of 050 M DTT was added and allowed to react for 10 minutes To

perform trypsin digestion 70 microL of 50 mM NH4HCO3 was then added along with 025 microg

trypsin which had previously been dissolved in 50 mM acetic acid at a concentration of 05

microgmicroL Digestion was performed overnight at 37oC and quenched by addition of 25 microL of 10

formic acid The tryptic peptides were then subjected to solid phase extraction using a Varian

Omix Tip C18 100 microL (Palo Alto CA) Peptides were eluted with 50 ACN in 01 formic

acid concentrated and reconstituted in 30 microL H2O in 01 formic acid

RP nanoLC separation

80

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN Injections consisted of 5 microL of prepared CSF tryptic peptides onto an Agilent

Technologies Zorbax 300 SB-C18 5 microm 5x03 mm trap cartridge (Santa Clara CA) at a flow

rate of 5 microLmin for 5 minutes at 95 A 5 B Separation was performed on a Waters 3 microm

Atlantis dC18 75 microm x 150 mm (Milford MA) using a gradient from 5 to 45 mobile phase B

at 250 nLmin over 90 minutes at room temperature Emitter tips were pulled from 75 microm inner

diameter 360 microm outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using

an in-house model P-2000 laser puller (Sutter Instrument Co Novato CA)

Mass spectrometry data acquisitions

An ion trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with an on-line nanospray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Eksigent nanoLC software (Dublin CA Version 30 Beta Build 080715) to MS acquisition for

all experiments Smart parameter setting (SPS) was set to 700 mz compound stability and trap

drive level was set at 100 Optimization of the nanospray source resulted in dry gas

temperature of 125oC dry gas flow of 40 Lmin capillary voltage of -1300 V end plate offset of

-500 V MSMS fragmentation amplitude of 10V and SmartFragmentation set at 30-300

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms one spectral average rolling average 2 acquisition

81

range of 300-1500 mz and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS

generation the ICC target was set to 300000 maximum accumulation time 5000 ms two

spectral averages acquisition range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required for spectral counting to prevent

loss of spectra Identification of peptides were performed using Mascot41

(Version 23 Matrix

Science London UK) Database searching was performed against SwissProt mus musculus

(house mouse) database (version 575) False positive analyses42

were calculated using an

automatic decoy option of Mascot Results from the Mascot results were reported using

Proteinscape 21 and technical replicates were combined and reported as a protein compilation

using ProteinExtractor (Bruker Daltonics Bremen Germany)

Mascot search parameters were as follows Allowed missed cleavages 2 enzyme

trypsin variable modifications carboxymethylation (C) and oxidation (M) peptide tolerance

plusmn12 Da maximum number of 13

C 1 MSMS tolerance plusmn05 Da instrument type ESI-Trap

Reported proteins required a minimum of one peptide with a Mascot score ge300 with bold red

characterization Spectral counts were determined from the number of MSMS spectra identified

from accepted proteins A bold red peptide combines a bold peptide which represents the first

query result from a submitted MSMS spectrum with the red peptide which indicates the top

peptide for the identified protein Requiring one bold red peptide assists in removal of

homologous redundant proteins and further improves protein results In addition requiring one

82

peptide to be identified by a score gt300 removes the ability for proteins to be identified by

multiple low Mascot scoring peptides

Each immunodepleted biological replicate had technical triplicates performed and the

technical triplicates were summed together by ProteinExtractor Peptide spectral counts were

then summed for each protein and subjected to dNSAF analysis Details for this method can be

found elsewhere37 39

but briefly peptide spectral counts are summed per protein (SpC) based on

unique peptides and a weighted distribution of any shared peptides with homologous proteins

ProteinScape removed 83 homologous proteins found in the current study to bring the total

number of proteins identified to 266 but some non-unique homologous peptides which are

shared by multiple proteins are still present in the resulting 266 remaining proteins To address

these non-unique homologous peptides distributive spectral counting was performed as

described elsewhere39

The dSpC is divided by the proteinrsquos length (L) and then divided by the

summation of the dSpCL from all proteins in the experiment (N) to produce each proteinrsquos

specific dNSAF value

N

i

i

kk

LdSpC

LdSpCdNSAF

1

)(

)()(

The resulting data were then transformed by taking the natural log of the dNSAF value The

means for each protein were calculated using the ln(dNSAF) from N=3 biological replicates and

the whole data set was subjected to the Shapiro-Wilk test for normalityGaussian distribution

performed on the software PAST (Version 198 University of Oslo Norway Osla) The

Shapiro-Wilk test was repeated multiple times to determine a non-zero value for zero spectral

83

counts A non-zero value is required to alleviate the errors of dividing by zero which was

experimentally determined to be 043 The Gaussian data were then subjected to the t-test to

identify statistically significant changes in protein expression

RESULTS AND DISCUSSION

General workflow

Individual CSF samples were manually inspected and samples were only selected that

showed no visual blood contamination Preliminary experiments showed that the maximum

degree of blood contamination estimated from counts of red blood cells in the CSF that was not

visible to the eye was less than 005 by total mass Approximately 10 individual mouse CSF

samples were pooled to achieve the desired 100 μL volume for a single biological replicate The

CSF samples were IgY immunodepleted followed by digestion with trypsin The resulting

digested samples were then de-salted concentrated reconstituted in 30 μL of 01 formic acid

and 5 μL was loaded onto a reversed phase nanoflow HPLC column subjected to a 90 minute

gradient and infused into the amaZon ETD ion trap mass spectrometer The workflow for

mouse CSF analysis is shown in Figure 1 The remainder of the desalted sample was saved for

technical replicates

Immunodepletion for CSF

Currently there are no immunodepletion techniques specifically designed for CSF

Nonetheless the protein profiles between CSF and serum are similar enough to use currently

available immunodepletion techniques designed for serum as a starting point The smallest

commercially available IgY-14 immunodepletion kit is for 15 μL of serum which is equivalent in

protein content to ~15 mL of CSF Therefore for 100 μL CSF one fifteenth of the total IgY-14

84

beads would be used for the equivalent immunodepletion which is 66 μL of proprietary bead

slurry The potential for irreversible binding of abundant proteins to their respective IgY

antibody even after an extra stripping wash and low amounts of total beads made using 66 μL

of IgY-14 bead slurry unrealistic In practice almost doubling the amount of IgY-14 beads (100

μL) used was not sufficient and resulted in albumin and transferrin peptides to be observed in

high abundance (data not shown) The most important protein to immunodeplete is albumin and

it has been reported to be a greater percentage of total CSF protein content (~60) than serum

(~49) in humans14

The difference in albumin percentage supports the results that proprietary

blends of immunodepletion beads for high abundance proteins such as albumin cannot be

scaled down on a strict protein scale and further modifications to the serum immunodepletion

protocol need to be made

Since IgY-14 beads were developed for use with serum all of its protocols need to be

taken into account to modify the protocol for CSF Serum samples should be diluted fifty times

before mixing with the IgY-14 beads but CSF protein concentration is at least one hundred times

lower than serum Therefore CSF is below half the recommended diluted protein concentration

for IgY immunodepletion Consequently multiple steps have been devised to address this

limitation First the binding time between the proteins targeted for removal from the CSF and

IgY-14 beads was doubled during the end-over-end rotor depletion step from the recommended

15 minutes to 30 minutes Second to prevent non-specific protein binding to the antibodies the

CSF samples were diluted with an equal amount (100 μL) of 2X dilution buffer The dilution

buffer of NaCl and Tris-HCl was needed to assist in reducing non-specific binding of proteins to

the 14 antibodies and ensuring the sample is held at physiological pH In addition to these

modifications a doubling of the starting IgY-14 bead slurry to 200 μL provided the desired

85

results Overall this modified protocol results in effective depletion of CSF abundant proteins

using only one-fifth of the antibodies provided by the smallest commercially available platform

Data Analysis

Spectral counting technique for relative quantitation provides numerous benefits for the

study of mouse CSF due to its label-free advantage In contrast isotopic labeling method often

involves additional sample processing that could cause sample loss which is highly undesirable

for low protein content and low volume samples Labeling methods also require a mixing of two

sets of isotopically labeled samples which would effectively increase the sample complexity and

reduce the amount of sample that can be loaded onto the nanoLC column by half In addition

more than two sets of samples can be compared by label-free methods The use of label-free

spectral counting method does not lead to an increase in sample complexity or interference in

quantitation from peptides in the mz window selected for tandem MS Using spectral counting

for relative quantitation however is dependent on reproducible HPLC separation and careful

mass spectrometry operation to minimize technical variability during the study To address

concerns of analytical reliability and run to run deviations base peak chromatograms from two

transgenic IgY-14 immunodepleted biological replicates including two technical replicates of

each were shown to be highly reproducible (Figure 2)

Each biological sample was analyzed in triplicate with the same protocols on the amaZon

ETD with three control and three transgenic samples From the three technical replicates for

each biological replicate the spectral counts of the peptides for the proteins identified were

summed The results from these mouse CSF biological triplicates are shown in Figure 3A for

GFAP overexpressor and Figure 3B for control The summation of spectral counts for each

biological replicate was performed to remove the inherent bias related to data dependent analysis

86

for protein identification One concern in grouping technical replicates is a potential loss of

information regarding analytical variability Figure 4 provides a graphical representation of

variability of technical replicates illustrating the standard deviation of technical replicates with

error bars Figure 4 shows that a statistically changed protein creatine kinase M (A) and an

unchanged protein kininogen-1 (B) have similar variability within (technical replicates) and

between samples (biological replicates) for each protein In addition Figure 4B illustrates that

even with the variability of kininogen-1 the resulting mean shown by the dashed line of control

and transgenic samples were almost equal whereas Figure 4A shows significantly different

expression level of creatine kinase M Performing replicate analysis of each biological sample

(n=3) helps correct for systematic errors deriving from sample handling and pooling of samples

helps reduce random error during the CSF sample collection process

Protein Identification and Spectral Counting Analysis

The data for dNSAF analysis like any mass spectrometry proteomics experiment

requires multiple layers of verification to ensure reliable data Our initial protein identifications

were subjected to a database search using a decoy database from Mascot which resulted in an

average false positive rate below 1 for all the experimental data collected Representative

MSMS spectra between control and GFAP transgenic samples are illustrated in Figure 5

Overall 266 proteins were identified in a combination of control and transgenic samples

(Supplemental Table 1) A total of 349 proteins were initially identified but 83 proteins were

isoforms of previously identified proteins and automatically excluded by ProteinExtractor The

next level of quality control was to only include ln(dNSAF) values from proteins identified by 2

or more unique peptides having a Mascot score of ge300 and observed in two out of three

biological replicates These selection parameters resulted in 106 proteins remaining after

87

dNSAF analysis (Supplemental Table 2) The resulting spectral counts were then subjected to

dSpC in order to account and correct for the systematic error of peptides shared by multiple

proteins (Supplemental Table 3)

It is inevitable in large scale and complex proteomics experiments that some proteins will

be seen in some samples and not others In addition when controls were compared to transgenic

samples as shown in Figure 3C 127 proteins were unique to either control or GFAP transgenic

mice and 139 proteins were seen in both samples From dNSAF equation if the spectral count

is zero the numerator is zero and the value will not be normalized between runs In order to

circumvent the zero spectral counts in dNSAF analysis zero spectral counts must be replaced by

an experimentally determined non-zero value determined to be 043 The 043 spectral counts

for zero spectral counts was calculated by serial Shapiro-Wilk tests to determine the lowest value

(043) for zero spectral counts and maintain a Gaussian distribution for all data sets The 043

value for zero spectral counts in the current study was higher than the 016 reported value for

zero spectral counts in the original NSAF spectral counting study37

Our study may have a

higher zero spectral count value than the previous study because the spectral counting data were

an addition of three technical replicates and three times 016 is close to 043 The normalized

Gaussian data were then subjected to t-test analysis and P-values below 005 are considered as

statistically significant and are presented in Table 1 The proteins with significant up or down

regulation from Table 1 can be further evaluated as how close significant proteins were to a p-

value of 005 such as ganglioside GM2 activator serine protease inhibitor A3N and collagen

alpha-2(I) chain having t-test scores of 0045 0047 and 0043 respectively Proteins exhibiting

a P-value close to 005 were more likely to be highly variable proteins or have smaller fold

changes between control and transgenic samples and thus provide less biological relevancy to

88

future studies The modest change in antithrobin-III which is reduced by 14-fold in transgenic

is included due a low pooled standard deviation in spectral counts

Spectral counting has been analyzed with fold changes derived directly from the average

spectral counts from the technical replicates and then the average of the three biological

replicates We decided to perform additional analysis using fold changes to dig deeper into

proteins that statistically support the null hypothesis from the dNSAF analysis To only pull out

highly confident protein identifications we used the same strict cut-off of two unique peptides

identified per protein as in dNSAF analysis We only accepted proteins with greater than three-

fold change in total spectral counts listed in Table 2 Two proteins contactin-1 (cntn1) and

cathepsin B (CB) illustrate the potential biomarkers missed by dNSAF analysis Cntn1 had zero

spectral count in the transgenic sample and had an average spectral count of 41 in control

samples The lack of any spectral counts in one biological control for cntn1 resulted in a large

standard deviation in the ln(dNSAF) means for the control which resulted in the t-test supporting

the null hypothesis Another example is CB which was detected by numerous spectral counts in

every GFAP transgenic biological replicate with an average spectral count of 97 (Table 2) The

presence of CB in one biological control sample (23 average spectral counts) resulted in a high

standard deviation in the mean of the control samples These examples exhibit a limitation of

dNSAF analysis which could cause a loss of potentially useful information

Previously Identified Proteins with Expression Changes

Previously three proteins have been described as increased in CSF from individual(s)

suffering from AxD In one study a single patientrsquos CSF was found to have elevated levels of

αβ-crystallin and HSP2744

In a second study three patients were reported to have elevated

levels of GFAP with concentrations from 4760 ngL to 30000 ngL (compared to lt175 ngL for

89

controls)11

GFAP was detected in our current study however the other two proteins were not

detected One possibility is that αB-crystallin and Hsp-27 levels in mouse CSF are too low for

detection by MS analysis In addition while the transgenic mice display the hallmark

pathological feature of AxD in the form of Rosenthal fibers they do not have any evident

leukodystrophy and thus may not display the full range of changes in CSF as might be found in

human patients

Creatine Kinase M

Creatine kinase M (M-CK) is a 43 kDa protein whose role is to reversibly catalyze

phosphate transfer between ATP and energy storage compounds M-CK has been primarily

found in muscle tissue and for humans creatine kinase M is diagnostically analyzed in the blood

for myocardial infarction (heart attack) In addition M-CK is also found in Purkinje neurons of

the cerebellum45 46

A related protein creatine kinase B (B-CK) also exhibited an apparent 21

fold increase in transgenic CSF over control but this difference was not statistically different

B-CK concentration is known to be elevated in CSF following head trauma47

or cerebral

infarction48

but decreased in astrocytes in individuals affected by multiple sclerosis49

Cathepsin

The data showed multiple cathepsins were up regulated in the CSF of transgenic mice

when compared to control mice The up regulated cathepsins were S L1 and B isoforms which

are all cysteine proteases Cathepsin S (CS) was never observed in control samples but

observed with an average of 73 spectral counts per analysis Cathepsin L1 (CL1) was up

regulated by 94 times in transgenic mice These two cathepsins exhibited significant changes

using dNSAF and t-test statistics as shown in Table 1 and Cathepsin B (CB) showed a 42 fold

increase in transgenic CSF as shown in Table 2

90

Cathepsins regulate apoptosis in cells50

which is the major mechanism for elimination of

cells deemed by the organism to be dangerous damaged or expendable CL and CB are

redundant in the system as thiol proteases and inhibition of CB by CA-047 causes a diminished

apoptosis response in multiple cell lines51

Intriguingly increased levels of CB or CL are

correlated with poor prognosis for cancer patients and shorter disease-free intervals It is

believed that these proteases degrade the extracellular membrane which allows tumor cells to

invade adjacent tissue and metastasize52

With regards to AxD the up regulation of these

cathepsins may be indicative of the bodyrsquos natural defense and response to Rosenthal fibers

Thus stimulation of these cathepsins may provide a further protective stress response but the

positive correlation between these proteases and cancer highlights the multiple roles of these

proteins in pathological response Alternatively it has been shown that increased CB is involved

with the tumor necrosis factor α (TNFα) induced apoptosis cascade53

The activation of the

TNFα could produce oligodendrocyte toxicity54

with the expression of TNFα being elevated in

tissue samples from mouse models and AxD patients55

The potential for a positive or a negative

effect in increasing or decreasing cathepsins warrant future research with cathepsins and AxD

Contactin-1

Contactin-1 (Cntn1) is a 133 kDa cell surface protein that is highly glycosylated and

belongs to a family of immunoglobulin domain-containing cell adhesion molecues56

Table 2

shows that Cntn1 is down regulated in transgenic mice since spectral counts were only observed

in control mice Cntn1 is expressed in neurons and oligodendrocytes and higher levels were

observed during brain development57

In addition Cntn1 leads to activation of Notch1 which

mediates differentiation and maturation of oligodendrocyte precursor cells (OPC) Although the

mechanism leading to a reduction of Cntn1 in CSF is not known it may reflect a change in

91

astrocyte interactions with either neurons or oligodendrocytes to alter their expression of this

protein

Validation of putative biomarkers and MS proteomics data using ELISA and RNA

microarray data

To further validate the relative protein expression data obtained via MS-based spectral

counting techniques orthogonal immunological and molecular biological approaches have been

examined As GFAP was shown to be elevated in the CSF of transgenic animals we used a

well-defined GFAP ELISA protocol to test its CSF concentration CSF from 8 week old male

mice was collected from both transgenic and control animals Five samples of transgenic CSF

was prepared by pooling four to eight animals for the GFAP ELISA In the case of controls

each sample represents a single animal GFAP concentrations observed by both the MS and

ELISA showed significant increases of GFAP in the CSF when comparing transgenic to control

animals

Another validation of MS spectral counts is observed in a microarray analysis performed

on transgenic mouse olfactory bulb tissue 55

In this paper nine of the proteins found by MS

showed similar gene expression in the microarray (Table 3) It is not surprising that all the genes

observed in the microarray are not the same as the proteins observed by MS analysis Gene

expression and protein synthesis and expression are not always correlated but the similarities

and overlapping trends observed with these two assays are encouraging As shown in Table 3

gene expression analysis also revealed the up regulation of several cathepsin proteins GFAP

and down regulation of contactin-1 in CSF collected from transgenic mice corroborating the

MS-based proteomics results

92

CONCLUSIONS

In this study we have produced a panel of proteins with significant up or down regulation

in the CSF of transgenic GFAP overexpressor mice This mouse model for AxD was consistent

with the previous studies showing elevation of GFAP in CSF The development of a modified

IgY-14 immunodepletion technique for low amounts of CSF was presented This improved

protocol is useful for future investigations to deal with the unique challenges of mouse CSF

analysis Modified proteomics protocols were employed to profile mouse CSF with biological

and technical triplicates addressing the variability and providing quantitation with dNSAF

spectral counting Validation of the MS-based proteomics data were performed using both

ELISA and RNA microarray data to provide further confidence in the changes in the putative

protein biomarkers This study presents three classes of interesting targets for future study in

AxD including CK-MCK-B cathepsin S L1 and B isoforms and contactin-1

93

REFERENCES

1 Brenner M Johnson A B Boespflug-Tanguy O Rodriguez D Goldman J E

Messing A Mutations in GFAP encoding glial fibrillary acidic protein are associated with

Alexander disease Nat Genet 2001 27 (1) 117-20

2 Herndon R M Rubinstein L J Freeman J M Mathieson G Light and electron

microscopic observations on Rosenthal fibers in Alexanders disease and in multiple sclerosis J

Neuropathol Exp Neurol 1970 29 (4) 524-51

3 Alexander W S Progressive fibrinoid degeneration of fibrillary astrocytes associated

with mental retardation in a hydrocephalic infant Brain 1949 72 (3) 373-81 3 pl

4 Iwaki T Kume-Iwaki A Liem R K Goldman J E Alpha B-crystallin is expressed

in non-lenticular tissues and accumulates in Alexanders disease brain Cell 1989 57 (1) 71-8

5 Head M W Goldman J E Small heat shock proteins the cytoskeleton and inclusion

body formation Neuropathol Appl Neurobiol 2000 26 (4) 304-12

6 Messing A Daniels C M Hagemann T L Strategies for treatment in alexander

disease Neurotherapeutics 7 (4) 507-15

7 Tang G Yue Z Talloczy Z Hagemann T Cho W Messing A Sulzer D L

Goldman J E Autophagy induced by Alexander disease-mutant GFAP accumulation is

regulated by p38MAPK and mTOR signaling pathways Hum Mol Genet 2008 17 (11) 1540-

55

8 Hagemann T L Boelens W C Wawrousek E F Messing A Suppression of GFAP

toxicity by alphaB-crystallin in mouse models of Alexander disease Hum Mol Genet 2009 18

(7) 1190-9

9 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C

Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome

Res 2008 7 (1) 386-99

10 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from

patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma

biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 878 (22) 2003-12

11 Kyllerman M Rosengren L Wiklund L M Holmberg E Increased levels of GFAP

in the cerebrospinal fluid in three subtypes of genetically confirmed Alexander disease

Neuropediatrics 2005 36 (5) 319-23

12 DeMattos R B Bales K R Parsadanian M ODell M A Foss E M Paul S M

Holtzman D M Plaque-associated disruption of CSF and plasma amyloid-beta (Abeta)

equilibrium in a mouse model of Alzheimers disease J Neurochem 2002 81 (2) 229-36

13 Wong M Schlaggar B L Buller R S Storch G A Landt M Cerebrospinal fluid

protein concentration in pediatric patients defining clinically relevant reference values Arch

Pediatr Adolesc Med 2000 154 (8) 827-31

14 Roche S Gabelle A Lehmann S Clinical proteomics of the cerebrospinal fluid

Towards the discovery of new biomarkers PROTEOMICS ndash Clinical Applications 2008 2 (3)

428-436

15 Li C Lee K H Affinity depletion of albumin from human cerebrospinal fluid using

Cibacron-blue-3G-A-derivatized photopatterned copolymer in a microfluidic device Anal

Biochem 2004 333 (2) 381-8

94

16 Maccarrone G Milfay D Birg I Rosenhagen M Holsboer F Grimm R Bailey

J Zolotarjova N Turck C W Mining the human cerebrospinal fluid proteome by

immunodepletion and shotgun mass spectrometry ELECTROPHORESIS 2004 25 (14) 2402-

2412

17 Huang L Harvie G Feitelson J S Gramatikoff K Herold D A Allen D L

Amunngama R Hagler R A Pisano M R Zhang W W Fang X Immunoaffinity

separation of plasma proteins by IgY microbeads meeting the needs of proteomic sample

preparation and analysis Proteomics 2005 5 (13) 3314-28

18 Rajic A Stehmann C Autelitano D J Vrkic A K Hosking C G Rice G E Ilag

L L Protein depletion using IgY from chickens immunised with human protein cocktails Prep

Biochem Biotechnol 2009 39 (3) 221-47

19 Huang L Fang X Immunoaffinity fractionation of plasma proteins by chicken IgY

antibodies Methods Mol Biol 2008 425 41-51

20 Greunke K Braren I Alpers I Blank S Sodenkamp J Bredehorst R Spillner E

Recombinant IgY for improvement of immunoglobulin-based analytical applications Clin

Biochem 2008 41 (14-15) 1237-44

21 Xiao Y Gao X Taratula O Treado S Urbas A Holbrook R D Cavicchi R E

Avedisian C T Mitra S Savla R Wagner P D Srivastava S He H Anti-HER2 IgY

antibody-functionalized single-walled carbon nanotubes for detection and selective destruction

of breast cancer cells BMC Cancer 2009 9 351

22 Liu T Qian W J Mottaz H M Gritsenko M A Norbeck A D Moore R J

Purvine S O Camp D G 2nd Smith R D Evaluation of multiprotein immunoaffinity

subtraction for plasma proteomics and candidate biomarker discovery using mass spectrometry

Mol Cell Proteomics 2006 5 (11) 2167-74

23 Hinerfeld D Innamorati D Pirro J Tam S W SerumPlasma depletion with

chicken immunoglobulin Y antibodies for proteomic analysis from multiple Mammalian species

J Biomol Tech 2004 15 (3) 184-90

24 Metz T O Qian W J Jacobs J M Gritsenko M A Moore R J Polpitiya A D

Monroe M E Camp D G 2nd Mueller P W Smith R D Application of proteomics in

the discovery of candidate protein biomarkers in a diabetes autoantibody standardization

program sample subset J Proteome Res 2008 7 (2) 698-707

25 Ru Q C Zhu L A Silberman J Shriver C D Label-free semiquantitative peptide

feature profiling of human breast cancer and breast disease sera via two-dimensional liquid

chromatography-mass spectrometry Mol Cell Proteomics 2006 5 (6) 1095-104

26 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S

Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-

dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of

Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66

27 Rao P V Reddy A P Lu X Dasari S Krishnaprasad A Biggs E Roberts C T

Nagalla S R Proteomic identification of salivary biomarkers of type-2 diabetes J Proteome

Res 2009 8 (1) 239-45

28 Yu K H Barry C G Austin D Busch C M Sangar V Rustgi A K Blair I A

Stable isotope dilution multidimensional liquid chromatography-tandem mass spectrometry for

pancreatic cancer serum biomarker discovery J Proteome Res 2009 8 (3) 1565-76

29 Aebersold R Mann M Mass spectrometry-based proteomics Nature 2003 422

(6928) 198-207

95

30 Ong S E Blagoev B Kratchmarova I Kristensen D B Steen H Pandey A

Mann M Stable isotope labeling by amino acids in cell culture SILAC as a simple and

accurate approach to expression proteomics Mol Cell Proteomics 2002 1 (5) 376-86

31 Hsu J L Huang S Y Chow N H Chen S H Stable-isotope dimethyl labeling for

quantitative proteomics Anal Chem 2003 75 (24) 6843-52

32 Liu H Sadygov R G Yates J R 3rd A model for random sampling and estimation

of relative protein abundance in shotgun proteomics Anal Chem 2004 76 (14) 4193-201

33 Xiang F Ye H Chen R Fu Q Li L NN-dimethyl leucines as novel isobaric

tandem mass tags for quantitative proteomics and peptidomics Anal Chem 82 (7) 2817-25

34 Ross P L Huang Y N Marchese J N Williamson B Parker K Hattan S

Khainovski N Pillai S Dey S Daniels S Purkayastha S Juhasz P Martin S Bartlet-

Jones M He F Jacobson A Pappin D J Multiplexed protein quantitation in

Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents Mol Cell Proteomics

2004 3 (12) 1154-69

35 Zybailov B Coleman M K Florens L Washburn M P Correlation of relative

abundance ratios derived from peptide ion chromatograms and spectrum counting for

quantitative proteomic analysis using stable isotope labeling Anal Chem 2005 77 (19) 6218-

24

36 Old W M Meyer-Arendt K Aveline-Wolf L Pierce K G Mendoza A Sevinsky

J R Resing K A Ahn N G Comparison of label-free methods for quantifying human

proteins by shotgun proteomics Mol Cell Proteomics 2005 4 (10) 1487-502

37 Zybailov B Mosley A L Sardiu M E Coleman M K Florens L Washburn M

P Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae J

Proteome Res 2006 5 (9) 2339-47

38 Mosley A L Florens L Wen Z Washburn M P A label free quantitative

proteomic analysis of the Saccharomyces cerevisiae nucleus J Proteomics 2009 72 (1) 110-20

39 Zhang Y Wen Z Washburn M P Florens L Refinements to label free proteome

quantitation how to deal with peptides shared by multiple proteins Anal Chem 82 (6) 2272-81

40 Messing A Head M W Galles K Galbreath E J Goldman J E Brenner M

Fatal encephalopathy with astrocyte inclusions in GFAP transgenic mice Am J Pathol 1998

152 (2) 391-8

41 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

42 Elias J E Gygi S P Target-decoy search strategy for increased confidence in large-

scale protein identifications by mass spectrometry Nat Methods 2007 4 (3) 207-14

43 You J S Gelfanova V Knierman M D Witzmann F A Wang M Hale J E The

impact of blood contamination on the proteome of cerebrospinal fluid Proteomics 2005 5 (1)

290-6

44 Takanashi J Sugita K Tanabe Y Niimi H Adolescent case of Alexander disease

MR imaging and MR spectroscopy Pediatr Neurol 1998 18 (1) 67-70

45 Hemmer W Wallimann T Functional Aspects of Creatine Kinase in Brain

Developmental Neuroscience 1993 15 (3-5) 249-260

46 Hemmer W Zanolla E Furter-Graves E M Eppenberger H M Wallimann T

Creatine kinase isoenzymes in chicken cerebellum specific localization of brain-type creatine

96

kinase in Bergmann glial cells and muscle-type creatine kinase in Purkinje neurons Eur J

Neurosci 1994 6 (4) 538-49

47 Nordby H K Tveit B Ruud I Creatine kinase and lactate dehydrogenase in the

cerebrospinal fluid in patients with head injuries Acta Neurochirurgica 1975 32 (3) 209-217

48 Bell R D Alexander G M Nguyen T Albin M S Quantification of cerebral

infarct size by creatine kinase BB isoenzyme Stroke 1986 17 (2) 254-60

49 Steen C Wilczak N Hoogduin J M Koch M De Keyser J Reduced Creatine

Kinase B Activity in Multiple Sclerosis Normal Appearing White Matter PLoS ONE 5 (5)

e10811

50 Chwieralski C Welte T Buumlhling F Cathepsin-regulated apoptosis Apoptosis 2006

11 (2) 143-149

51 Droga-Mazovec G Bojic L Petelin A Ivanova S Romih R Repnik U Salvesen

G S Stoka V Turk V Turk B Cysteine cathepsins trigger caspase-dependent cell death

through cleavage of bid and antiapoptotic Bcl-2 homologues J Biol Chem 2008 283 (27)

19140-50

52 Duffy M J Proteases as prognostic markers in cancer Clin Cancer Res 1996 2 (4)

613-8

53 Guicciardi M E Deussing J Miyoshi H Bronk S F Svingen P A Peters C

Kaufmann S H Gores G J Cathepsin B contributes to TNF-alpha-mediated hepatocyte

apoptosis by promoting mitochondrial release of cytochrome c J Clin Invest 2000 106 (9)

1127-37

54 Butt A M Jenkins H G Morphological changes in oligodendrocytes in the intact

mouse optic nerve following intravitreal injection of tumour necrosis factor J Neuroimmunol

1994 51 (1) 27-33

55 Hagemann T L Gaeta S A Smith M A Johnson D A Johnson J A Messing

A Gene expression analysis in mice with elevated glial fibrillary acidic protein and Rosenthal

fibers reveals a stress response followed by glial activation and neuronal dysfunction Hum Mol

Genet 2005 14 (16) 2443-58

56 Falk J Bonnon C Girault J A Faivre-Sarrailh C F3contactin a neuronal cell

adhesion molecule implicated in axogenesis and myelination Biol Cell 2002 94 (6) 327-34

57 Eckerich C Zapf S Ulbricht U Muumlller S Fillbrandt R Westphal M Lamszus

K Contactin is expressed in human astrocytic gliomas and mediates repulsive effects Glia

2006 53 (1) 1-12

97

Table 1 Statistically changed proteins between transgenic and control mouse CSF using

dNSAF analysis

Accession Protein Pa SC

b Fold

Changec

Control

dSpCd

Transgenic

dSpCd

KCRM_MOUSE Creatine kinase M-type 0034 425 30 182 541

HEXB_MOUSE Βeta-hexosaminidase subunit β 0012 183 uarr 0 59

CMGA_MOUSE Chromogranin-A 000078 153 darr 44 0

ANT3_MOUSE Antithrombin-III 0017 176 -14 67 47

SAP3_MOUSE Ganglioside GM2 activator 0045 415 darr 28 0

SPA3N_MOUSE Serine protease inhibitor A3N 0047 170 42 10 42

CO1A2_MOUSE Collagen alpha-2(I) chain 0043 56 darr 19 0

BLVRB_MOUSE Flavin reductase 00054 204 uarr 0 12

CATS_MOUSE Cathepsin S 00032 232 uarr 0 73

GFAP_MOUSE Glial fibrillary acidic protein 0021 107 uarr 0 21

RET4_MOUSE Retinol-binding protein 4 0040 189 darr 13 0

CBPE_MOUSE Carboxypeptidase E 0043 139 darr 11 0

CATL1_MOUSE Cathepsin L1 0015 87 94 02 19

The statistics are performed using the t-test from the ln(dNSAF) Gaussian data

a P p-value of the t-test where the null hypothesis states that there was no change in expression between

control and transgenic GFAP overexpressor b SC percentage of sequence coverage of the listed protein from

sequenced tryptic peptides c Fold Change positive values are indicative of a fold change for transgenic CSF

negative values are fold changes for control CSF (ie down-regulated in transgenic CSF) uarr indicates the protein

was only detected in transgenic CSF and darr indicates the protein was only detected in control CSF d dSpC

distributive spectral counts which represent the average spectral counts observed per run analysis on the mass

spectrometer and corrected using distributive analysis for peptides shared by more than one protein

98

Table 2 Proteins showing greater than three-fold changes with at least two unique

peptides identified for each protein

Accession Protein SC ()a Fold

Change b

Control

dSpC c

Transgenic

dSpC c

MUG1_MOUSE Murinoglobulin-1 precursor 147 42 155 37

CO4B_MOUSE Complement C4-B 113 54 22 118

PRDX6_MOUSE Peroxiredoxin-6 576 46 14 64

CNTN1_MOUSE Contactin-1 65 darr 41 0

CATB_MOUSE Cathepsin B 263 42 23 97

CAH3_MOUSE Carbonic anhydrase 3 396 76 11 84

APOH_MOUSE Beta-2-glycoprotein 1 128 44 44 1

CSF1R_MOUSE Macrophage colony-stimulating

factor 1 receptor

80 44 14 62

PGK1_MOUSE Phosphoglycerate kinase 1 355 36 17 61

NDKB_MOUSE Nucleoside diphosphate kinase B 408 34 13 44

FHL1_MOUSE

Four and a half LIM domains

protein 1 243 39 13 51

NELL2_MOUSE

Protein kinase C-binding protein

NELL2 45 -43 13 03

MDHM_MOUSE

Malate dehydrogenase

mitochondrial 385 41 12 49

CSF1R_MOUSE Macrophage colony-stimulating

factor 1 receptor

80 44 14 62

a SC percentage of sequence coverage of the listed protein from sequenced tryptic peptides b Fold

Change positive values are indicative of a fold change for transgenic CSF negative values are fold changes for

control CSF and darr indicates the protein was only detected in control CSF c dSpC distributive spectral counts

which represent the average spectral counts observed per run analysis on the mass spectrometer and corrected using

distributive analysis for peptides shared by more than one protein

99

Table 3 Validation of changes in proteins revealed by MS-based spectral counting

consistent with previously published microarray data

Consistent changes in RNA and proteomic data

uarr regulated in transgenic darr regulated in transgenic

Cathepsin S Contactin-1

Cathepsin B Carboxypeptidase E

Cathepsin L1

Peroxiredoxin-6

Complement C4-B

Glial fibrillary acidic protein

Serine protease inhibitor A3N

Note Validation of putative biomarkers from the current proteomics dataset by previously

published RNA microarray data55

Both up and down regulated proteins were consistent with the

RNA microarray data

_

100

___________________________________________

SUPPLEMENTAL INFORMATION (Available upon request)

Table S1 Compilation list of proteins identified from all the control and transgenic biological

replicates

Table S2 Distributive spectral counting calculations performed for proteins observed to share

identified peptides

Table S3 Proteins with dNSAF spectral counting performed including t-test analysis for a

comparison between transgenic and control CSF

101

FIGURE LEGENDS

Figure 1 The general workflow indicating the major steps involved in sample collection sample

processing mass spectrometric data acquisition and analysis of mouse CSF samples

Figure 2 Assessment of run to run variability of the base peak chromatograms within and

between two biological and technical replicates The peak profile and intensity scale is

consistent between the four chromatograms The four panels show two biological replicates (Tg

4 and Tg5) with two technical replicates for each biological sample

Figure 3 A Venn-diagram of the biological triplicates (1 2 and 3) of transgenic mouse

CSF showing 78 proteins identified in all three experiments B Venn-diagram of the biological

triplicates (1 2 and 3) of control mouse CSF having 61 proteins identified by all three

replicates C The overlap between control and transgenic CSF proteomic analysis showing 139

proteins identified by both groups and 73 and 54 uniquely identified by respective groups

Figure 4 Assessment of technical replicate variability between biological replicates The error

bars in both A and B are the standard deviation derived from the technical triplicates for each

biological replicate Panel A shows creatine kinase M having more or equal variability in the

biological triplicates than each technical triplicate The means of the biological triplicates are

illustrated by the dashed line Panel B represents kininogen-1 which is unchanged between

control (Ctl) and transgenic (Tg) samples The biological variability coupled with the technical

replicates provides a barely noticeable difference in the pooled mean between control and

102

transgenic spectral counts The difference in means is contrasted with the three fold change

observed from creatine kinase M (A)

Figure 5 MSMS profile of the tryptic peptide LSVEALNSLTGEFK from creatine kinase M

(A) The top MSMS is from a control sample with an elution at 463 minutes (B) The bottom

MSMS is from the GFAP transgenic sample with an elution at 46 minutes These tandem MS

spectra show instrument reliability and consistent fragmentation patterns which are necessary for

spectral counting analysis

Figure 6 Validation of MS spectral counts using a GFAP ELISA GFAP content (ngL)

measured within mouse CSF from both transgenic and control animals The data represents the

average with standard deviation (n=5 each sample represents pooling of 1 to 8 animals) The

statistics are performed using a student t-test plt00001

103

Figure 1

104

Figure 2

105

Figure3

106

Figure 4

107

Figure 5

108

Figure 6

Ctl Tg

100

1000

10000

100000

Mouse CSF Sample

GF

AP

(n

gL

)

109

Table of Contents Summary

Cerebrospinal fluid (CSF) was obtained from transgenic mouse models of Alexander disease as

well as healthy controls We immunodepleted pooled CSF from mice by a modified IgY-14

protocol Shotgun proteomics utilizing trypsin digestion 1-D nanoRP separation CID tandem

mass spectrometry analysis Mascot database searching and relative quantitation via distributive

normalized spectral abundance factor resulted in the identification of 266 proteins and 27

putative biomarkers

110

Chapter 4

Genomic and proteomic profiling of rat adapted scrapie

Adapted from ldquoGenomic and proteomic profiling of rat adapted scrapierdquo Herbst A

Cunningham R Wang J Wellner D Ma D Li L Aiken J In preparation

111

Abstract

A novel model of prion disease using rats called Rat Adapted Scrapie (RAS) was

developed with the genomics from brain and proteomics from cerebrospinal fluid (CSF) profiled

The CSF proteome from control and RAS (biological N=5 technical replicates N=3) were

digested and separated using one dimensional reversed-phase nanoLC coupled to data-

dependent tandem MS acquisition on an ion trap mass spectrometer In total 512 proteins 167

non-redundant protein groups and 1032 unique peptides were identified with a 1 false

discovery rate (FDR) Of the proteins identified 21 were statistically up-regulated (plt005) and

7 statistically down-regulated in the RAS CSF We also identified 1048 genes which were

differentially regulated in rat prion disease and upon mapping these changes to mouse gene

expression however only 22 of these genes were in common with mRNAs responding to

prion infection in mice suggesting that the molecular pathology observed in mice may not be

applicable to other species The proteins are compared to the differentially regulated genes as

well as to previously published proteins showing changes consistent with other prion animal

models

112

Introduction

Prion diseases are an unusual class of fatal transmissible neurodegenerative disorders

that affect the mammalian central nervous system They are caused by the accumulation of an

abnormal conformation of the normal host encoded cellular prion protein PrPC This

conformational rearrangement of PrPC is brought about by template directed misfolding wherein

seed molecules of the abnormal isoform PrPScrapie

PrPSc

convert PrPC into new PrP

Sc molecules

Scrapie in sheep and goats is the prototypic prion disease and is so named because clinically

affected animals scrape their wool off on pen and stall walls Laboratory investigation of prion

diseases typically relies upon rodents which can be infected with natural isolates of scrapie1

albeit with some difficulty as ovid scrapie isolates need time to adapt to rodents This adaptation

is characteristic of prion disease interspecies transmissions and properly reflects the molecular

adaptation that must occur to allow interaction between exogenous foreign PrPSc

and host PrPC

molecules selecting for conformers which exhibit template directed misfolding In some cases

no conformational solution is found reflecting a species barrier to disease transmission

In recent years advances in genomics and proteomics technologies have allowed

unprecedented examination of the biomolecules that are altered upon exposure to prion agents

These studies2 3

have relied upon analysis of gene and protein expression changes in response to

prion infection with the aim of trying to identify pathways that might underlie the mechanism of

prion-induced neurotoxicity A second important aim has been to identify signature molecules

that might act as surrogate biomarkers for these diseases as there are significant analytical

challenges associated with sensitively detecting and specifically distinguishing disease-induced

conformational changes (PrPSc

) of the prion protein from normal host conformations (PrPC)

113

Mass spectrometry (MS)-based proteomics has become a promising tool for biomarker

discovery from biological fluids such as CSF blood and urine4-6

Two-dimensional gel

electrophoresis MS (2D-GE MS) and two-dimensional differential gel electrophoresis (2D-DIGE

MS) are the most widely used methods for proteomic profiling of prion infected CSF so far due

to the advantage of ready separation and quantification of proteins in complex biological samples

Studies using 2D-GE MS or 2D-DIGE MS to profile proteins in CSF have led to the

identifications of currently accepted biomarker for sCJD 14-3-3 protein and other potential

biomarkers for prion diseases7-9

However the application of this method in biomarker

discovery is limited by insufficient sensitivity and potential bias against certain classes of

proteins as gel-based separation does not work well for the low abundance proteins very basic

or acidic proteins very small or large proteins and hydrophobic proteins 10 11

In contrast to 2D-

GE or 2D-DIGE shotgun proteomics employs enzymatic digestion of the protein samples

followed by chromatographic separation prior to introduction into a mass spectrometer for

tandem MS analyses Shotgun proteomics has become increasingly popular in proteomic

research because these methods are reproducible highly automated and have a greater

likelihood of detecting low abundance proteins12 13

Due to the sample complexity in CSF and

because albumin comprises over half of the protein content in CSF removal of high-abundance

proteins including albumin is necessary to improve proteomic coverage and identify low-

abundance proteins One method is IgY immunodepletion14 15

which is performed prior to LC-

MSMS and uses species specific antibodies from chicken yolk to remove abundant proteins in

biological samples such as CSF In the present work CSF from control and rat adapted scrapie

animal models (biological N=5 technical N=3 replicates) were analyzed by LC-MSMS and we

114

indentified 512 proteins and 167 non-redundant protein isoforms (referred to as protein groups)

with 21 proteins being statistically up-regulated (p lt 005) and 7 statistically down-regulated

By and large this work has been performed using laboratory mice for the gene

expression studies and human cerebrospinal fluid for proteomics mice do not provide sufficient

volumes of CSF for proteomic studies Both approaches are exceedingly valuable as the mouse

model allows cross-sectional time course experiments to be performed including the important

pre-clinical phase of disease Critically however the relevance and generalizability of mouse

prion responses to other prion diseases especially human disease is unknown Human proteomic

studies on CSF guarantee relevant protein identifications but are limited to the clinical phase of

the disease when apparent markers may reflect gross neurodegeneration covering up subtle but

more specific responses To address these issues we have adapted mouse RML prions into rats

with the aim of expanding the knowledge of prion disease responses addressing the limitations

of mice and opening up the possibility of preclinical proteomic profiling in a laboratory rodent

In the present work CSF samples from control and rat adapted scrapie were analyzed by system

biology approaches Here we show that Rat Adapted Scrapie (RAS) is a powerful tool for an -

omics based approach to decipher the molecular impact of prion disease in vivo with

applicability to the molecular mechanisms of disease and biomarker discovery We identified

1048 genes which were differentially regulated in rat prion disease Astonishingly on the whole

mouse orthologs of these genes did not change in mouse adapted scrapie and vice versa

questioning the universality of previous mouse gene expression profiles These RAS gene

expression changes were identified in the CSF proteome where we detected 512 proteins and 167

protein groups with 21 proteins being statistically up-regulated (plt005) and 7 statistically down-

115

regulated in the CSF of prion diseased rats Many of the proteins detected have previously been

observed in human CSF from CJD patients

Materials and Methods

Ethics Statement

This study was carried out in accordance with the recommendations in the NIH Guide for Care

and Use of Laboratory Animals and the guidelines of the Canadian Council on Animal Care The

protocols used were approved by the Institutional Animal Care and Use Committees at the

University of Wisconsin and University of Alberta

Chemicals

Acetonitrile (HPLC grade) and ammonium bicarbonate (certified) were purchased from

Fisher Scientific (Fair Lawn NJ) Deionized water (182 MΩcm) was prepared with a Milli-Q

Millipore system (Billerica MA) Optima LCMS grade acetonitrile and water were purchased

from Fisher Scientific (Fair Lawn NJ) DL-Dithiothreitol (DTT) and sequencing grade modified

trypsin were purchased from Promega (Madison WI) Formic acid (ge98) was obtained from

Fluka (Buchs Switzerland) Iodoacetamide (IAA) tris(hydroxymethyl)aminomethane (Tris

ge999 ) ammonium formate (ge99995) glycine (ge985) and IgY-7 antibodies were

purchased from Sigma-Aldrich (Saint Louis MO)

Rat Transmission and Adaptation

Prion agents used were the rocky mountain lab RML strain of mouse adapted scrapie

Stetsonville transmissible mink encephalopathy16

(TME) Hyper (Hy) strain of Hamster TME 17

1st passage Skunk adapted TME prepared as described and C from genetically defined

transmissions18

116

Brains from animals clinically affected with prion disease were aseptically removed and

prepared as 10 wv homogenates in sterile water 50 microL of 10 brain homogenate was

inoculated into weanling wistar rats intracranially After one year of incubation preclinical rats

from RML infections were euthanized by CO2 inhalation and the brain excised homogenized

and re-inoculated into naive animals Subsequent serial passages were from rats clinically

affected with rat adapted scrapie

Brains from rat passages were aseptically removed and bisected sagittally Brain halves

were reserved for immunohistochemistry (IHC) in formalin or frozen for immunoblotting RNA

isolation or subsequent passage For IHC tissues were first fixed in 10 buffered formalin

followed by antigen retrieval at 121 oC 21 bar in 10mM citrate buffer for 2 min After cooling

to room temperature sections were treated with 88 formic acid for 30 min and 4M guanidine

thiocyanate for 2 h Endogenous peroxidases were inactivated by 003 hydrogen peroxide and

tissue was blocked with 1 normal mouse serum Mouse anti-PrP mAB SAF83 (Cayman

Chemical) was biotinylated and applied at a 1250 dilution in blocking buffer overnight at 4 oC

Washes were performed in 10mM PBS with 005 tween-20 Streptavidin-peroxidase

(Invitrogen) was added for Diaminobenzidine (DAB) color reaction Anti-GFAP

immunohistochemistry was performed as above except that formic acid and guanidine treatment

steps were excluded Mouse anti-GFAP mab G-A-5 (Sigma) was used at a 1400 dilution

Brain samples for immunoblot were treated with or without Proteinase K (Roche) at a

ratio of 35 microg PK 100 ug protein at 50 microg mL for 30 minutes at 37 oC Phosphotungstenic acid

enrichments were performed as described14 19

Bis-Tris SDS-PAGE was performed on 12

polyacrylamide gels (Invitrogen) and transferred to PVDF Immunoblotting was performed using

117

mABs SAF83 (Cayman Chemical) 6H4 (Prionics) or 3F10 (a kind gift from Yong-Sun Kim) all

at a 120000 dilution

Gene Expression Profiling

RNA was extracted from frozen brain halves from clinically affected and control animals with

the QIAshredder and RNeasy mini kit (Qiagen Valencia CA) in accordance with the

manufacturerrsquos recommendations Due to the relatively large mass of rat brain an initial

homogenization was performed with a needle and syringe in 5mL of buffer RLT before further

diluting an aliquot in accordance with the manufacturers protocol Total RNA was amplified and

labeled in preparation for chemical fragmentation and hybridization with the MessageAmp

Premier RNA amplification and labeling kit (Life Technologies Grand Island NY) Amplified

and labeled cRNAs were hybridized on Affymetrix (Santa Clara CA) rat genome 230 20 high

density oligonucleotide arrays in accordance with the manufacturers recommendations

Differentially Expressed Genes were identified using Arraystar 50 (DNAStar Madison WI)

Robust multi-array normalization using the quantile approach was used to normalize all

microarray data A moderated T-test with a multiple comparison adjustment20

was used to reduce

the false discovery rate yet preserve a meaningful number of genes for pathway analysis

Pathway analysis was performed using the DAVID Bioinformatics database21

Comparative

analysis of genes induced by prions in mouse22

and rat disease was performed on genes

exhibiting at least a 2-fold change in expression Orthologs of rat and mouse genes were

identified using ENSEMBLE biomart release 6823

CSF Proteomic Profiling

118

CSF was collected from rats anaesthetized with isoflorane via puncture of the cisterna

magna A volume of 100-200 microL was routinely obtained CSF was touch spun for 30s at 2000xg

on a benchtop nano centrifuge to identify any blood contamination by the presence of a red

pellet CSF with blood contamination was not used for proteomic analysis CSF was prepared

for profiling by first depleting abundant proteins with an antibody based immunopartitioning

column IgY-R7 (Sigma-Aldrich Saint Louis MO) The manufacturers recommendations were

followed with a few deviations 40 microg of protein (~100 microL CSF) was bound to 100 microL of IgY

bead slurry in total volume of 600 microL formulated in 150mM ammonium bicarbonate The flow

through was collected and denatured by the addition of 10 microL of 8 M guanidine thiocyanate and

lyophilized in a speed-vac apparatus The sample was then reconstituted in 10 microL of water and 1

microL of 050 M DTT were added and allowed to incubate at 37oC for 30 minutes After incubation

27 microL of 055 M IAA was added for carboxymethylation and the sample was allowed to

incubate in the dark for 15 minutes After that 1 microL of 050 M DTT was added and allowed to

sit at room temperature for 10 minutes To perform trypsin digestion 70 microL of 50 mM

NH4HCO3 was then added along with 025 microg trypsin Digestion was performed overnight at

37oC and quenched by addition of 5 microL of 10 formic acid The tryptic peptides were then

subjected to solid phase extraction using an Agilent OMIX Tip C18 100 microL (Santa Clara CA)

Peptides were eluted with 50 ACN in 01 formic acid concentrated and reconstituted in 30

microL H2O with 01 formic acid

RP nanoLC separation

The setup for nanoLC consisted of a nanoAcquity (Milford MA) with a 10 microL injection

loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B consisted of

ACN Injections consisted of 5 microL of prepared CSF tryptic peptides onto a Waters 5 microm

119

Symmetry C18 180 microm x 20 mm trap cartridge (Milford MA) at a flow rate of 75 microLmin for 5

minutes at 95 A 5 B Separation was performed on a Waters 17 microm BEH130 C18 75 microm x

100 mm (Milford MA) with a gradient of 5 to 15 B over 10 minutes followed by 15 to

40 B over 80 minutes at room temperature

Mass spectrometry data acquisitions

An ion trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with a Bruker CaptiveSpray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Waters Acquity console software to perform MS acquisitions for all experiments Smart

parameter setting (SPS) was set to 700 mz compound stability and trap drive level was set at

100 Optimization of the CaptiveSpray source resulted in dry gas temperature of 160oC dry

gas flow of 30 Lmin capillary voltage of -1325 V end plate offset of -500 V MSMS

fragmentation amplitude of 10V and SmartFragmentation set at 30-300

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms one spectral average rolling average 2 acquisition

range of 350-1500 mz and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS

generation the ICC target was set to 300000 maximum accumulation time 5000 ms two

spectral averages acquisition range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

120

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required for spectral counting to prevent

loss of spectra Identification of peptides were performed using Mascot24

(Version 24 Matrix

Science London UK) Database searching was performed against a forward and reversed

concatenated SwissProt Rattus rattus Mascot search parameters were as follows Allowed

missed cleavages 2 enzyme trypsin fixed modification carboxymethylation (C) variable

modifications oxidation (M) peptide tolerance plusmn12 Da maximum number of 13

C 1 MSMS

tolerance plusmn05 Da instrument type ESI-Trap Data was then transformed into Dat file formats

and transferred to ProteoIQ (NuSep Bogart GA) False positive analyses were calculated using

ProteoIQ and set at 1

Results

Development of Rat Adapted Scrapie

To develop a rat adapted strain of prion disease we introduced 6 different prion agents RML

TME Hy Skunk Adapted TME and Chronic Wasting Disease (CWD) from wild type 96G and

96S deer16-18

into 6 rats (Fig 1) Of these primary transmissions only RML induced the

accumulation of Proteinase K resistant PrP after one year of incubation as determined by western

blotting on 10 brain homogenates and PrPSc

enriched phoshotungstenic acid precipitated brain

homogenates Second passage of rat adapted RML scrapie resulted in clinically affected rats at

565 days post inoculation Serial passages of TME or CWD agents were not performed Clinical

symptoms consisted of ataxia lethargy wasting kyphosis and myoclonus Prion affected rats

121

also showed low level porphyrin staining around their head Subsequent serial passage decreased

incubation time to 215 days

Proteinase K resistant prion protein was observed from all clinically affected animals both by

immunoblot (Fig 2) and by immunohistochemistry (Fig 3) Di- and mono-glycosylated bands

were the most abundant isoforms of PrPSc

PrPSc

was extensively deposited in the cerebral cortex

hippocampus thalamus inferior colliculus and granular layer of the cerebellum GFAP

expressing activated astrocytes were found throughout the brain particularly in the white matter

of the hippocampus thalamus and cerebellum Spongiform change was a dominant feature of

clinical rat

Gene expression Profiling

In total 1048 genes were differentially regulated within a 95 confidence interval

(Supplementary Table 1) and 305 genes were differentially expressed by greater than 2-fold (Fig

4) The 1048 genes that were statistically significant were used for pathway analysis using

DAVID Pathway analysis suggested that the gene expression profile was consistent with

immune activation and maturation as well as inflammation (Supplementary Table 2) a likely

interpretation given the observable astrocytosis (Fig 3) and gliosis associated with prion disease

Other pathways highlighted by the analysis included increases in transcription of genes involved

in lysosomes and endosomes

To further probe the gene expression data we compared genes which were differentially

expressed in rat adapted scrapie against their orthologs expression in mouse scrapie and vice

versa (Figs 5 and 6) We did not observe a high degree of correlation between expression fold

changes For example GFAP a gene whose up-regulation in prion disease is well known was

122

increased 25 fold in rat adapted scrapie but up-regulated 93 fold in mouse scrapie A

qualitative analysis of expression of orthologs in prion disease suggests that many genes

deregulated at least 2-fold in mouse or rat do not have orthologs that are differentially expressed

For example the gene RNAseT2 was up-regulated greater than threefold in rat adapted scrapie

but was not significantly up-regulated in mouse Similarly three genes important in metals

homeostatsis metallothionein 1a and 2a and ceruloplasmin were up-regulated in rat 46 43 and

3 fold respectively but were not differentially expressed in mouse prion disease

CSF Proteomics

Each immunodepleted biological replicate (N=5 for each control and RAS) had technical

triplicates performed The triplicates were summed together using ProteoIQ Peptide spectral

counts were then summed for each protein and subjected to dNSAF analysis using ProteoIQ

internal algorithms Details for this method can be found elsewhere25 26

but briefly peptide

spectral counts are summed per protein (SpC) based on unique peptides and a weighted

distribution of any shared peptides with homologous proteins T-tests were used to identify

significant changes in protein expression 1032 unique peptides which identify 512 proteins and

167 protein groups were found Of these 512 proteins 437 were identified in both RAS and

control CSF samples (Fig 7) The complete list of all 512 proteins can be viewed in

Supplemental Table 3 Using the ProteoIQ the most likely isoform was selected such as 14-3-3

protein gamma

From Table 1 we observe five proteins that agree with the genomic data for up

regulations in RAS granulin macrophage colony-stimulating factor 1 receptor cathepsin D

complement factor H and ribonuclease T2 In addition serine protease inhibitor A3N was not

123

detected as up regulated in the RAS genomic data but was found to be up-regulated in previous

genomic profiling of the mouse prion model22

One interesting trend from the data in Table 1 is

that the majority of proteins found to be up-regulated in the RAS model were not detected in the

control samples The absence of the detection of those proteins such as ribonuclease T2 in the

control CSF does not necessarily suggest the absence of the protein nonetheless it is below the

detection limits for this current proteomics protocol and instrumentation

Discussion

Mice have been the preferred laboratory rodent for prion diseases research because they

can be inexpensively housed and are amenable to transgenesis which allows for short incubation

periods as well as hypothesis testing upon targeted pathways Pursuant to the determination of

the mouse genome and the development of high density transcriptional arrays for measurements

of gene expression profiling mice have been used extensively to examine the molecular

pathology of prion disease probing the impact of disease and animal strain In order to expand

upon this foundation we adapted mouse prions to infect rats We reasoned that by taking a

comparative approach to the molecular pathology of prion disease inferences could be obtained

into the variability of the molecular response to prion diseases and that understanding this

variability might suggest whether human prion disease responses are more or less similar to

mouse responses A second rationale is the desire to identify surrogate markers of prion disease

While this approach has been taken before using gene expression profiling a more direct

approach has been to use proteomic approaches on cerebral spinal fluid (CSF) identifying

proteins that are increase in abundance with disease A rat prion disease is valuable for this

because the rat proteome is established and rats allow for the collection of relatively large

volumes of CSF (~100 microL per animal) which allows for robust analytical separations enhancing

124

detection of biomarkers Finally rats unlike humans can be used in a time course study of prion

disease This allows for the identification of early transcriptional and proteomic responses to

prion infection responses which are particularly valuable for the identification of surrogate

disease biomarkers

To initiate the development of a rat prion disease we attempted to adapt six different

prion disease agents PrPres

molecules to rat via intracranial inoculation of weanling animals

(Figure 1) Of these six agents only mouse RML prions were able to surmount the species

barrier to prion disease transmission Mouse PrP differs from rat by eight amino acid changes

six in the mature protein and four in the N-terminal octa-peptide repeat region (Supplementary

Fig 1) As rat shares the most prion protein sequence homology with mouse it is perhaps not

surprising that it transmitted whereas the other did not confirming that the primary prion protein

sequence is the most important determinant for interspecies transmission We conclude that there

is a large molecular species barrier preventing conversion of rat PrPc into PrP

res

The transmission of mouse RML into rats was characterized by a shortening of the

incubation period following each passage This is indicative of agent adaption to the new host

and increases in the titer present in end-stage brain Overall our adaptation of mouse prion

disease into rats resulted in a similar agent to that observed by Kimberlin27

The differences in

incubation period at second passage are largely due to our collecting the animals at 365 days post

inoculation upon first passage whereas Kimberlin and Walker allowed the first passage animals

to reach end-stage clinical rats

Rat adapted scrapie is in many ways similar to other prion diseases The clinical period of

disease is marked by a progressive neurological dysfunction with prominent ataxia kyphosis and

125

wasting The protease resistant PrP observed is typical as is the widespread deposition of PrPSc

in

the brain Spongiosis and reactive astrogliosis are as expected of a prion disease

Gene expression profiles from rats clinically affected with prion disease revealed a strong

neuronal inflammation associated with proliferated and activated astrocytes This is perhaps best

observed through the up-regulation of GFAP GFAP positive astrocytes were omnipresent

throughout the brain and GFAP mRNA was up-regulated 25 fold The up-regulation of GFAP is

a hallmark of the molecular response to prion infection and has been routinely observed Our

comparative analysis of gene expression changes in mouse RML versus rat adapted scrapie

suggest substantial differences in gene expression in response to prion disease despite the fact

that the overall response is neuro-inflammatory This suggests that the potential overlap between

mouse expression profiles and a putative human CJD expression profile could be quite different

at the level of individual transcripts that might be expected to be changed

CSF Proteomics

CSF proteomics can be exceedingly challenging due to the small sample available large

dynamic range in proteins and abundant proteins limiting sample loading onto nanoscale

columns Dynamic range reduction in the CSF sample was achieved using a custom amount of

IgY-7 antibodies to abundant proteins such as albumin After immunodepletion the total

protein content was reduced by ~90 limiting the proteomics analysis to one dimensional

separation Label free quantitation spectral counting was performed because it requires less

protein and does not increase sample complexity The proteins identified from the affected and

control N=5 are shown in Supplemental Table 2 Consistent detection of proteins in CSF from

both control and infected rats was observed (Fig 7C) Only two proteins were identified in

126

controls that were not observed in RAS and only 10 proteins were only observed in RAS Some

of these proteins that were only identified in RAS are significantly changed (Supplemental Table

3) One concern in proteomics data is the variability from run to run and the possibility that

certain proteins are identified from different unique peptides Figure 7A shows that the vast

majority of the unique peptides detected (783) were identified with a 1 FDR in both RAS and

control CSF samples highlighting the analytical reproducibility of our methodology

Proteomic analysis of the infected rat CSF provides a reasonable approach to cross

validate biomarkers detected by gene expression profiling For example RNAseT2 a secreted

ribonuclease was up-regulated 3-fold at the mRNA level and was also enriched in CSF from

infected rats (Table 1) Similarly coordinated increases in detection of colony stimulating factor

1 receptor complement factor H granulin and cathepsin D were also observed Conversely

proteomic analysis of CSF also allows for the observation of post-transcriptional responses to

prion disease that might be absent in gene expression profiling The 14-3-3 proteins and neuron

specific enolase both known markers for CJD are only detected by proteomic analysis Thus

gene expression profiling and proteomic detection serve to increase confidence in the

observation of up-regulation enhancing the likelihood that proteins detected by both

methodologies are specific and perhaps may be more sensitive at earlier time points

Comparison to human CSF prion disease proteome

In our comparative analysis 21 proteins were found to be up-regulated while 7 proteins

down-regulated in prion infected rats Included in the up-regulated proteins were the 14-3-3

proteins epsilon and gamma 14-3-3 zetadelta was also up-regulated but not statistically

significant Enolase 2 or neuron specific enolase (NSE) was also up-regulated in CSF of infected

127

rats These proteins are all in agreement with results from previous proteomic profiling of human

CSF from patients with CJD8 9

The detection of 14-3-3 protein is included in the diagnostic

criteria approved by World Health Organization for the pre-mortem diagnosis of clinically

suspected cases of sCJD28

although its application in large-scale screening of CJD is still

debated due to high false positive rate where elevated 14-3-3 protein levels were also reported in

other conditions associated with acute neuronal damage29 30

It was suggested that other brain-

derived proteins that are also specific to CJD can be used in conjunction with 14-3-3 protein to

increase diagnosis accuracy and specificity31

NSE is present in high concentration in neurons

and in central and peripheral neuroendocrine cells NSE serves as another valuable biomarker in

diagnosis of CJD as the elevated level of NSE in CSF was found in early and middle stages of

CJD 32

Other proteins detected in CSF included cystatin C and serpina3N although both of

these were not statistically changed These proteins were both previously identified as being

putative biomarkers for CJD33 34

Utility for biomarker discovery with emphasis on onset of biomarker appearance in CSF

The investigation of the protein changes in CSF from RAS compared to control rats

provides a solid foundation when investigating potential biomarkers with prion disease onset

The cross-validation of the genomic and proteomics data further emphasizes the targets for

consideration during disease onset Biomarker discovery provides the potential to determine if

animals such as cows in bovine spongiform encephalopathy (BSE) are affected instead of

having to be euthanized when BSE is expected In addition CSF collection in mice or hamsters

Prion models is extremely difficult and limited alternatively with the advent of the RAS model

CSF can be collected in sufficient amounts for CSF proteomics CSF collection for mice or

hamsters can be ~10 microL and could be contaminated with blood further complicating proteomic

128

analysis unlike rats which over 10 times more CSF can be collected per animal35

Due to the

amount of CSF available time course studies analyzing CSF proteomics is possible in RAS due

to animal numbers that are manageable and reasonable The RAS model further allows

investigators to bypass working with highly infections CJD CSF samples to investigate the CSF

proteome changes

Conclusion

In this study we have described the gene and protein expression changes in brain and

spinal fluid from a transmission of mouse prions into rats We find that while the overall gene

expression profile in rats is similar to that in mice the specific genes that make up that profile

are different suggesting that genes that change in response to prion disease in different species

may not be as conserved Analysis of CSF from rat adapted scrapie identified similar protein

changes as known in human CJD The rat will be a useful model to identify surrogate markers

that appear prior to the onset of clinical disease and thus may be of higher specificity and

sensitivity

Supplemental Information Available Upon Request

1 Chandler R L Encephalopathy in mice produced by inoculation with scrapie brain material Lancet 1961 1 (7191) 1378-9 2 Silva C J Onisko B C Dynin I Erickson M L Requena J R Carter J M Utility of Mass Spectrometry in the Diagnosis of Prion Diseases Anal Chem 2011 3 Wei X Herbst A Ma D Aiken J Li L A quantitative proteomic approach to prion disease biomarker research delving into the glycoproteome J Proteome Res 2011 10 (6) 2687-702 4 Zougman A Pilch B Podtelejnikov A Kiehntopf M Schnabel C Kumar C Mann M Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome Res 2008 7 (1) 386-99 5 Sjodin M O Bergquist J Wetterhall M Mining ventricular cerebrospinal fluid from patients with traumatic brain injury using hexapeptide ligand libraries to search for trauma biomarkers J Chromatogr B Analyt Technol Biomed Life Sci 2003 878 (22) 2003-12 6 Cunningham R Ma D Li L Mass spectrometry-based proteomics and peptidomics for systems biology and biomarker discovery Frontiers in Biology 2012 7 (4) 313-335

129

7 Brechlin P Jahn O Steinacker P Cepek L Kratzin H Lehnert S Jesse S Mollenhauer B Kretzschmar H A Wiltfang J Otto M Cerebrospinal fluid-optimized two-dimensional difference gel electrophoresis (2-D DIGE) facilitates the differential diagnosis of Creutzfeldt-Jakob disease Proteomics 2008 8 (20) 4357-66 8 Harrington M G Merril C R Asher D M Gajdusek D C Abnormal proteins in the cerebrospinal fluid of patients with Creutzfeldt-Jakob disease N Engl J Med 1986 315 (5) 279-83 9 Piubelli C Fiorini M Zanusso G Milli A Fasoli E Monaco S Righetti P G Searching for markers of Creutzfeldt-Jakob disease in cerebrospinal fluid by two-dimensional mapping Proteomics 2006 6 Suppl 1 S256-61 10 Lilley K S Razzaq A Dupree P Two-dimensional gel electrophoresis recent advances in sample preparation detection and quantitation Curr Opin Chem Biol 2002 6 (1) 46-50 11 Oh-Ishi M Maeda T Separation techniques for high-molecular-mass proteins J Chromatogr B Analyt Technol Biomed Life Sci 2002 771 (1-2) 49-66 12 Metz T O Qian W J Jacobs J M Gritsenko M A Moore R J Polpitiya A D Monroe M E Camp D G 2nd Mueller P W Smith R D Application of proteomics in the discovery of candidate protein biomarkers in a diabetes autoantibody standardization program sample subset J Proteome Res 2008 7 (2) 698-707 13 Aebersold R Mann M Mass spectrometry-based proteomics Nature 2003 422 (6928) 198-207 14 Huang L Harvie G Feitelson J S Gramatikoff K Herold D A Allen D L Amunngama R Hagler R A Pisano M R Zhang W W Fang X Immunoaffinity separation of plasma proteins by IgY microbeads meeting the needs of proteomic sample preparation and analysis Proteomics 2005 5 (13) 3314-28 15 Rajic A Stehmann C Autelitano D J Vrkic A K Hosking C G Rice G E Ilag L L Protein depletion using IgY from chickens immunised with human protein cocktails Prep Biochem Biotechnol 2009 39 (3) 221-47 16 Marsh R F Bessen R A Lehmann S Hartsough G R Epidemiological and experimental studies on a new incident of transmissible mink encephalopathy J Gen Virol 1991 72 ( Pt 3) 589-94 17 Bessen R A Marsh R F Biochemical and physical properties of the prion protein from two strains of the transmissible mink encephalopathy agent J Virol 1992 66 (4) 2096-101 18 Johnson C J Herbst A Duque-Velasquez C Vanderloo J P Bochsler P Chappell R McKenzie D Prion protein polymorphisms affect chronic wasting disease progression PLoS One 2011 6 (3) e17450 19 Safar J Wille H Itri V Groth D Serban H Torchia M Cohen F E Prusiner S B Eight prion strains have PrP(Sc) molecules with different conformations Nat Med 1998 4 (10) 1157-65 20 Benjamini Y Hochberg Y Controlling the False Discovery Rate A Practical and Powerful Approach to Multiple Testing Journal of the Royal Statistical Society Series B (Methodological) 1995 57 (1) 289-300 21 Huang da W Sherman B T Lempicki R A Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources Nat Protoc 2009 4 (1) 44-57 22 Moody L R Herbst A J Aiken J M Upregulation of interferon-gamma-induced genes during prion infection J Toxicol Environ Health A 2009 74 (2-4) 146-53 23 Flicek P Amode M R Barrell D Beal K Brent S Carvalho-Silva D Clapham P Coates G Fairley S Fitzgerald S Gil L Gordon L Hendrix M Hourlier T Johnson N Kahari A K Keefe D Keenan S Kinsella R Komorowska M Koscielny G Kulesha E Larsson P Longden I McLaren W Muffato M Overduin B Pignatelli M Pritchard B Riat H S Ritchie G R Ruffier M Schuster M Sobral D Tang Y A Taylor K Trevanion S Vandrovcova J White S Wilson M Wilder S P Aken B L Birney E Cunningham F Dunham I Durbin R Fernandez-Suarez X M Harrow J Herrero J

130

Hubbard T J Parker A Proctor G Spudich G Vogel J Yates A Zadissa A Searle S M Ensembl 2012 Nucleic Acids Res 2012 40 (Database issue) D84-90 24 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein identification by searching sequence databases using mass spectrometry data Electrophoresis 1999 20 (18) 3551-67 25 Zybailov B Mosley A L Sardiu M E Coleman M K Florens L Washburn M P Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae J Proteome Res 2006 5 (9) 2339-47 26 Zhang Y Wen Z Washburn M P Florens L Refinements to label free proteome quantitation how to deal with peptides shared by multiple proteins Anal Chem 2010 82 (6) 2272-81 27 Kimberlin R H Cole S Walker C A Temporary and permanent modifications to a single strain of mouse scrapie on transmission to rats and hamsters J Gen Virol 1987 68 ( Pt 7) 1875-81 28 World Health Organization Global surveillance diagnosis and therapy of human transmissible spongiform encephalopathies report from a WHO consultation 1998 World Health Organization Geneva Switzerland 9-11 February 1998 1-26 29 Bartosik-Psujek H Archelos J J Tau protein and 14-3-3 are elevated in the cerebrospinal fluid of patients with multiple sclerosis and correlate with intrathecal synthesis of IgG J Neurol 2004 251 (4) 414-20 30 Saiz A Graus F Dalmau J Pifarre A Marin C Tolosa E Detection of 14-3-3 brain protein in the cerebrospinal fluid of patients with paraneoplastic neurological disorders Ann Neurol 1999 46 (5) 774-7 31 Sanchez-Juan P Green A Ladogana A Cuadrado-Corrales N Saanchez-Valle R Mitrovaa E Stoeck K Sklaviadis T Kulczycki J Hess K Bodemer M Slivarichova D Saiz A Calero M Ingrosso L Knight R Janssens A C van Duijn C M Zerr I CSF tests in the differential diagnosis of Creutzfeldt-Jakob disease Neurology 2006 67 (4) 637-43 32 Zerr I Bodemer M Racker S Grosche S Poser S Kretzschmar H A Weber T Cerebrospinal fluid concentration of neuron-specific enolase in diagnosis of Creutzfeldt-Jakob disease Lancet 1995 345 (8965) 1609-10 33 Sanchez J C Guillaume E Lescuyer P Allard L Carrette O Scherl A Burgess J Corthals G L Burkhard P R Hochstrasser D F Cystatin C as a potential cerebrospinal fluid marker for the diagnosis of Creutzfeldt-Jakob disease Proteomics 2004 4 (8) 2229-33 34 Miele G Seeger H Marino D Eberhard R Heikenwalder M Stoeck K Basagni M Knight R Green A Chianini F Wuthrich R P Hock C Zerr I Aguzzi A Urinary alpha1-antichymotrypsin a biomarker of prion infection PLoS One 2008 3 (12) e3870 35 DeMattos R B Bales K R Parsadanian M ODell M A Foss E M Paul S M Holtzman D M Plaque-associated disruption of CSF and plasma amyloid-beta (Abeta) equilibrium in a mouse model of Alzheimers disease J Neurochem 2002 81 (2) 229-36

131

Figure 1 Interspecies transmission of prion disease to rats End-stage clinical brain homogenates

were used to passage prion disease After one year of incubation animals were euthanized to

determine the extent of PrPres

accumulation Protease resistance PrP was only observed in those

animals infected with RML scrapie prions This material was serially passaged for two more

incubations before becoming rat-adapted as indicated by the shortening of the incubation period

132

Table 1 A list of selected proteins and mRNAs up-regulated in RAS CSF and brain tissue If

the protein is detected in the RAS CSF but not the control CSF the CSF expression is reported

with a infin If there is no change or data on certain genes related to an up regulated protein nd is

noted The mouse genomic data presented here was previously published22

Gene Protein Symbol Accession CSF

Expression

Rat

GEX

Mouse

GEX

14-3-3 protein zetadelta Ywhaz NP_037143 infin nd nd

14-3-3 protein epsilon Ywhae NP_113791 infin nd nd

14-3-3 protein gamma Ywhag NP_062249 infin nd nd

serine protease inhibitor A3N Serpin A3N NP_113719 54 nd 975

enolase 2 (gamma neuronal) Eno2 NP_647541 infin nd nd

granulin GRN NP_058809 62 364 184

macrophage colony-stimulating

factor 1 receptor

Csf1r NP_001025072 infin 293 205

cathepsin D CTSD NP_599161 infin 255 299

complement factor H Cfh NP_569093 376 234 nd

ribonuclease T2 RNAset2 NP_001099680 infin 302 nd

133

Figure 2 Accumulation of PrPSc

in rat adapted scrapie First second and third passage brain

homogenates were assayed for the presence of protease resistant PrP by immunoblot PrPSc

was

observed following phosphotungstenic acid enrichment at first passage By clinical disease in 2nd

and 3rd

passage rats PrPSc

had substantially accumulated

134

Figure 3 Histological appearance of rat adapted scrapie in the hippocampus at clinical disease

Infected animals showed intense immuno-staining for deposits of PrPSc

and GFAP expressing

astrocytes Spongiform change is an abundant feature in rat adapted scrapie

135

Figure 4 Scatter plot of gene expression profiling of rat adapted scrapie The expression of

individual genes from uninfected and infected animals were plotted to display up and down

regulation The dashed green line is no change Solid green lines are 2-fold changes in gene

expression

136

Figure 5 Quantitative relationship between orthologous pairs of two-fold up-regulated genes in

mouse and rat scrapie Genes up-regulated more than two fold were mapped to their orthologs

and the fold change was plotted Expression is log2 transformed

137

Figure 6 Comparison of genes upregulated in rat and mouse prion diseases Genes up-regulated

two fold in rodent scrapie were identified and the expression of their orthologs was determined

138

Figure 7 Venn diagram comparison of the CSF proteomics data between rat adapted scrapie

(RAS) model and control rats (A) Unique peptides from tryptic peptides produced for the

proteins identified (B) The total proteins identified including all isoforms within the protein

groups (C) The protein groups comparing only the top protein hit of the protein isoforms

showing very consistent protein identifications between RAS and control

139

Chapter 5

Investigation of the differences in the phosphoproteome between starved vs

glucose fed Saccharomyces cerevisiae

Adapted from ldquoInvestigation of the differences in the phosphoproteome between starved vs

glucose fed Saccharomyces cerevisiaerdquo Cunningham R Grunwald D Wellner D Conway M

Heideman W Li L In preparation

140

Abstract

This work explores comparative proteomics between starved and glucose fed

Saccharomyces cerevisiae (bakers yeast) Yeast depends on nutrients such as glucose to

survive and need to cycle between states of growth and quiescence Kinases such as protein

kinase A (PKA) and target of rapamycin kinase (TOR) are involved in the glucose response

Thus performing a large scale phosphoproteomic MS study can be highly beneficial to map the

signaling networks and cascades involved in glucose-dependent stimulated yeast growth Yeast

cell extract was digested and phosphopeptides were enriched by immobilized metal affinity

chromatography (IMAC) followed by two dimensional high pH-low pH reversed phase (RP)-RP

separation The low pH separation was infused directly into an ion trap mass spectrometer with

neutral loss electron-transfer dissociation (ETD)-triggered fragmentation to improve

phosphopeptide identifications In total 477 unique phosphopeptides are identified with 06

false discovery rate (FDR) and 669 unique phosphopeptides identified with 54 FDR This

study includes comparative phosphoproteins for one specific motif of PKA xxxKRKRxSxxxx

which is presented and differences between starved vs glucose fed are highlighted Phosphosite

validation is performed using a localization algorithm Ascore to provide more confident and

site-specific characterization of phosphopeptides

141

Introduction

Microorganisms such as Saccharomyces cerevisiae (yeast) cycle between growth when

nutrients are available and quiescence when nutrients are scarce When glucose is limited yeast

go into growth arrest state but when glucose is added growth quickly resumes Kinases such as

protein kinase A (PKA) and target of rapamycin (TOR) regulate growth in response to nutrient

conditions and have been well studied through transcriptional control1-4

Yeast execute large

transcriptome alterations in response to changing environmental growth conditions5 6

Gene

regulation by glucose introduction in yeast has been studied including genes used for growth on

alternative carbon sources and activation of genes coding for glucose transport and protein

synthesis7-10

Response to nutrients for survival is not limited to yeast biology and indeed all

living creatures both prokaryotes and multicellular eukaryotes like mammals require nutrient

responsiveness and coordinating cellular functions to survive

With regulation of certain genes well studied by glucose introduction the mechanism and

global protein modulation caused by glucose introduction remain unknown6 Large-scale

phosphorylation analysis has been previously performed in yeast using mass spectrometry11-14

Mapping the phosphoproteome in yeast transitioning from quiescence to growth conditions to

better understand the roles of phosphorylation in orchestrating growth is needed The

phosphorylation state during growth or starvation could be used to engineer cells to drive ectopic

activity (or inhibition) to promote growth and ethanol production on non-native sugars like

xylose

It has been reported that the phosphorylation state can be affected by the introduction of

glucose to carbon-starved yeast15

and phosphorylation plays a significant role in the cell cycle

and signal transduction16

Specifically O-Phosphorylation can function as a molecular switch by

142

changing the structure of a protein via alteration of the chemical nature of an amino acid for

serine threonine or tyrosine It has been reported that up to 30 of the proteome can undergo

phophorylation17

Mass spectrometry has evolved as a powerful tool to accomplish phosphosite

mapping using shotgun proteomics With available technology tens of thousands of

phosphorylation sites can be mapped to amino acids on thousands of proteins using shotgun

proteomics18-20

Mass spectrometry can offer sensitive automated non-targeted global analysis of

phosphorylation events in proteomic samples but in any large scale phosphoproteomic

investigation enrichment of phosphoproteinspeptides is required First phosphorylation is

usually a sub-stoichiometric process where only a percentage of all protein copies are

phosphorylated21

Various enrichment methods have been used for phosphopeptide enrichment

including metal oxide affinity chromatography (MOAC)22

such as TiO223

immobilized metal

affinity chromatography (IMAC)12 24 25

electrostatic repulsion-hydrophilic interaction

chromatography (ERLIC)26

and immunoaffinity of tyrosine phosphorylation27 28

After

enrichment MS analyses of phosphorylated peptides are still challenging due to ion suppression

from non-phosphorylated peptides

Even after phosphopeptide enrichment further sample preparation is needed for large

scale proteomic experiments Additional fractionation can increase protein coverage of a

sample by over ten fold such as MudPIT29

(multidimensional protein identification technology)

In online MudPIT a sample is injected onto a strong cation exchange (SCX) column coupled to

a RP column Successive salt bumps followed by low pH gradients provide the separation of

peptides in two dimensions Since the phosphate groups on phosphopeptides have a low pKa

value due to being more acidic then their unmodified counterparts they tend to elute earlier and

143

disproportionally in ion exchange chromatography like SCX Alternatively reversed-phase

reversed-phase (RP-RP) separation has been proposed as an alternative to MudPIT for offline

two dimensional (2D) separation30

One of the caveats of 2D separation is the potential for

wasted mass spectrometry time from early and late fractions having very few peptides present

all while having too much sample for middle fractions One simple method to reduce these

ldquowasted fractionsrdquo is to combine early or late fractions with the middle fractions to reduce MS

runs with little peptide content to analyze thus shortening the overall analysis time31

In addition the labile phosphorylation group has a large propensity to undergo cleavage

during collision induced dissociation (CID) producing a neutral loss The neutral loss can

produce insufficient backbone fragment ions for MSMS identification32

A solution to neutral

loss fragmentation in phosphopeptides is MS3 using CID to produce improved backbone

fragmentation13 14 33

An alternative fragmentation method to CID for fast sampling ion traps is

electron transfer dissociation (ETD)34-36

ETD produces a more uniform back-bone cleavage

where the cation peptide receives an electron from a low affinity radical anion37

The transferred

electron induces a dissociation at the N-Cα bond to produce c- and zbull-type product ions while

retaining the labile post-translational modifications (PTM) such as phosphorylation bound to the

product ions38

The Bruker amaZon ETD ion trap mass spectrometer has the potential to trigger

ETD fragmentation after a labile PTM produces a neutral loss from CID fragmentation This

method is termed neutral loss-triggered ETD fragmentation and provides a complementary

fragmentation pathway to labile poor fragmenting phosphorylated peptides

In this work we provide a qualitative comparative list of yeast phosphopeptides observed

in glucose fed vs glucose starved conditions

144

Experimental

EXPERIMENTAL DETAILS

Chemicals

Acetonitrile (HPLC grade) ammonium hydroxide (ACS plus certified) urea (gt99)

sodium chloride (997) ammonium bicarbonate (certified) Optima LCMS grade acetonitrile

Optima LCMS grade water and EDTA disodium salt (99) were purchased from Fisher

Scientific (Fair Lawn NJ) Deionized water (182 MΩcm) was prepared with a Milli-Q

Millipore system (Billerica MA) DL-Dithiothreitol (DTT) and sequencing grade modified

trypsin were purchased from Promega (Madison WI) Formic acid (ge98) was obtained from

Fluka (Buchs Switzerland) Iodoacetamide (IAA) ammonium formate (ge99995) Trizma

hydrochloride (Tris HCl ge990) Triton X-100 (laboratory grade) and iron (III) chloride

hexahydrate (97) were purchased from Sigma-Aldrich (Saint Louis MO) Sodium dodecyl

sulfate (SDS) (998) was purchased from US Biological (Marblehead MA) Nickel

nitrilotriacetic acid (Ni-NTA) magnetic agarose beads were purchased from Qiagen (Valencia

CA) Autoclaved yeast peptone dextrose (YPD) media was prepared in 1 L of deionized water

using 10 g yeast extract 20 g Peptone from Becton Dickinson and Company (Sparks MD) and

20 g D-(+)-Glucose (ge995) purchased from Sigma-Aldrich (Saint Louis MO)

Modified Mary Miller Yeast Protein Isolation

The yeast culture was prepared and protein extraction was performed using a modified

Mary Miller protocol39

Briefly yeast strain s288c was inoculated with YPD media and shook

for 72 hours to allow the yeast to be glucose starved After the 72 hours the yeast culture was

partitioned into two flasks To one flask glucose was added at 2 of the final concentration and

allowed to incubate and shake for 10 minutes Then 100 optical density units (ODU) of the yeast

145

culture were collected from each flask Each yeast sample was spun down in a Beckman-Coulter

J6B centrifuge (Indianapolis IN) at 2500xg for three minutes The media was aspirated and the

tubes were flash frozen in liquid nitrogen and stored at -80oC The yeast pellets were thawed on

ice and resuspended in 1 mL of lysis buffer (50 mM Tris HCl 01 triton x-100 05 SDS

pH=80) and 1X protease and phosphatase inhibitor cocktail from Thermo Scientific (Rockford

IL) and transferred to an Eppendorf tube To the yeast 200 microL of glass beads were added and

amalgamated for 25 minutes at 4oC The sample was inverted and the Eppendorf tube was

pierced with a hot 23 gauge needle through the bottom and the tube was placed atop a 5 mL

culture tube and centrifuge in the Beckman-Coulter J6B at 2500xg for three minutes at 4oC to

collect the liquid containing the yeast cells while the glass beads remain trapped in the

Eppendorf tubes The protein lysate was then centrifuged at 3000xg for five minutes at 4oC and

the supernatant was collected and stored at -80oC

Preparation of tryptic digests

The Saccharomyces cerevisiae protein cell extract for fed or starved were analyzed with a

BCA protein assay kit (Thermo Scientific Rockford IL) and 2 mg of protein was added to four

parts -20oC acetone and allowed to precipitate the protein for 1 hour at -20

oC The samples were

then centrifuged at 14000xg for 5 minutes and the supernatant was removed and the protein

pellet was reconstituted in 360 microl of 8 M urea To each sample 20 microL of 050 M DTT was

added and allowed to incubate at 37oC for 45 minutes After incubation 54 microL of 055 M IAA

was added for carbamidomethyl of cysteine and the sample was allowed to incubate for 15

minutes in the dark To quench the IAA 20 microL of 050 M DTT was added and allowed to react

for 10 minutes To perform trypsin digestion 1400 microL of 50 mM NH4HCO3 was then added

along with 20 microg trypsin to each sample Digestion was performed overnight at 37oC and

146

quenched by addition of 25 microL of 10 formic acid For each sample the tryptic peptides were

then subjected to solid phase extraction using a Hypersep C18 100 mg solid phase extraction

(SPE) column (Thermo Scientific Bellefonte PA) Peptides were eluted with 50 ACN in

01 formic acid concentrated and reconstituted in 500 microL of 80 ACN and 01 formic acid

Phosphopeptide enrichment using immobilized metal affinity chromatography (IMAC)

One ml of Ni-NTA was washed three times with 800 microL of H2O and then the Ni was

removed with 800 microL of 100 mM EDTA in 50 mM ammonium bicarbonate and vortexed for 30

minutes The supernatant was removed and the NTA was washed with 800 microL of H2O three

times followed by addition of 800 microL 10 mM FeCl3 hexahydrate and vortexed for 30 minutes

The Fe-NTA was then washed three times with 800 microL of H2O and once with 80 ACN in 01

formic acid before being combined with the cell extract for phosphopeptide enrichment and

vortexed for 40 minutes The sample was washed three times with 200 microL of 80 ACN in 01

formic acid before elution with two aliquots of 200 microL of 5 ammonium hydroxide in 5050

ACNH2O with 15 minutes of vortexing The enriched phosphopeptides were then dried down

with a Thermo Scientific Savant SpeedVac (SVC100 Waltham MA) and reconstituted in 55 microL

25mM ammonium formate pH=75

First dimension neutral pH separation

Individually 50 microL of each yeast phosphopeptide enriched sample was injected onto a

Phenomenex Gemini-NX 3microm C18 110 Aring 150 x 2 mm column (Torrance CA) The Gemini

column was coupled to a Phenomenex SecurityGuard Gemini C18 2 mm guard cartridge

(Torrance CA) Mobile phase A consisted of 25 mM ammonium formate at pH=75 and mobile

phase B consisted of 25 mM ammonium formateACN with a 19 ratio respectively at pH=75

The column was operated at 50oC at a flow rate of 05 mLmin The flow profile was 2-30 B

147

over 65 minutes and then 5 min at 100 B A total of 22 fractions were collected every 3

minutes and the fractions were combined as follows 1 with 12 2 with 13 etc to 11 with 22

The combined fractions were dried down and reconstituted in 25 microL of 01 formic acid

RP nanoLC separation

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN Injections consisted of 5 microL from each fraction onto an Agilent Technologies

Zorbax 300 SB-C18 5 microm 5 x 03 mm trap cartridge (Santa Clara CA) at a flow rate of 5

microLmin for 5 minutes at 95 A 5 B Separation was performed on a Waters 3 microm Atlantis

dC18 75 microm x 150 mm (Milford MA) using a gradient from 5 to 45 mobile phase B at 250

nLmin over 90 minutes at room temperature Emitter tips were pulled from 75 microm inner

diameter 360 microm outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using

an in-house model P-2000 laser puller (Sutter Instrument Co Novato CA)

Mass spectrometry data acquisitions

An ion-trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany)

equipped with an on-line nanospray source was used for mass spectrometry data acquisition

Hystar (Version 32 Bruker Daltonics Bremen Germany) was used to couple and control

Eksigent nanoLC software (Dublin CA Version 30 Beta Build 080715) for MS acquisition for

all experiments Smart parameter setting (SPS) was set to 700 mz compound stability and trap

drive level was set at 100 Optimization of the nanospray source resulted in dry gas

temperature of 125oC dry gas flow of 40 Lmin capillary voltage of -1300 V end plate offset of

-500 V MSMS fragmentation amplitude of 10V and SmartFragmentation set at 30-300

148

Data were generated in data dependent mode with strict active exclusion set after two

spectra and released after one minute MSMS spectra were obtained via collision induced

dissociation (CID) fragmentation for the six most abundant MS ions with a preference for doubly

charged ions An additional mode of MSMS fragmentation electron transfer dissociation

(ETD) was triggered on the precursor ion when a neutral loss was observed in CID

fragmentation from phosphoric acid H3PO4 (Δmz of 49 and 327 for 2+ and 3+ charge states

respectively) or for metaphosphoric acid (HPO3) (Δmz of 40 and 267 for 2+ and 3+ charge

states respectively) For MS generation the ion charge control (ICC) target was set to 200000

maximum accumulation time 5000 ms rolling average 2 acquisition range of 300-1500 mz

and scan speed (enhanced resolution) of 8100 mz s-1

For MSMS generation the ICC target

was set to 300000 maximum accumulation time 5000 ms two spectral averages acquisition

range of 100-2000 mz and scan speed (Ultrascan) of 32000 mz

Data analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen

Germany) Deviations in parameters from the default Protein Analysis in DataAnalysis were as

follows intensity threshold 1000 maximum number of compounds 1E9 and retention time

window 0001 minutes These parameter changes were required to prevent artificial data

reduction Identification of peptides were performed using Mascot40

(Version 23 Matrix

Science London UK) Database searching was performed against SwissProt Saccharomyces

cerevisiae database (version 575) Mascot search parameters were as follows Allowed missed

cleavages 2 enzyme trypsin fixed modification carboxymethylation (C) variable

modifications oxidation (M) and phosphorylation (STY) peptide tolerance plusmn12 Da maximum

number of 13

C 1 MSMS tolerance peptide decoy (Mascot) checked plusmn05 Da instrument type

149

ESI-Trap Results from the Mascot search was exported as DAT files for analysis by Scaffold 3

and Scaffold PTM

Scaffold and Ascore data processing

Scaffold 3 (Proteome Software Portland OR)was used for MSMS proteomics data

comparison and false discovery rate analysis The DAT files were loaded into Scaffold 3 and

the fractions for the two dimensional fractionation were combined The resulting biological

triplicates for starved and glucose fed yeast were filtered with a 1 false discovery rate (FDR)

on the protein level (Supplemental Table 1) Due to the inherent poor fragmentation of

phosphopeptides a lenient FDR level may still yield biologically relevant data and thus a 54

FDR was generated for phosphopeptides (Supplemental Table 2) For a more conservative list of

phosphopeptides with a 06 FDR was generated from Scaffold 3 (Supplemental Table 3) FDR

analysis is sufficient at preventing poor data from being reported but does not prevent false

phosphosite identification in phosphopeptides To provide confidence in site identification

Scaffold PTM was used to perform Ascore41

analysis Ascore uses an algorithm to score the

probability of the phosphosite from a phosphopeptide identified by a database searching

algorithm such as Mascot Ascore can be freely accessed at httpascoremedharvardedu

Cell collection RNA isolation and microarray data analysis

All experiments were performed in biological duplicates Cell samples (10 ODU) were

taken at starved and fed states spun at 5000xg for 2 minutes at 30 oC the supernatant was

removed and the pellets snap frozen using liquid nitrogen RNA was isolated using the Epicentre

MasterPure Yeast Purification Kit (Epicentre Technologies) and quality was checked using gel

electrophoresis cRNA synthesis was carried out using the GeneChipreg Expression 3

Amplification One-Cycle Target Labeling and Control Reagents kit (Affymetrix) All

150

experiments followed the manufactures instructions cRNA samples were hybridized to

GeneChip Yeast Genome 20 Arrays for 16 hours Arrays were washed stained and scanned

according the manufactures recommendations Affymetrix CEL files were RMA normalized

with R and the Bioconductor Suite Data analysis was performed within TIGR Multiexperiment

Viewer v451 in-house Perl scripting R and Bioconductor

Results

Sample preparation for shotgun proteomics

As discussed in the introduction the purpose of this study is to provide an exploratory list

of qualitative differences in the phosphoproteome between starved vs glucose fed yeast After

yeast cell lysate production a substantial amount of sample preparation is performed to enhance

the depth of the phosphoproteome coverage A workflow of the steps following cell lysis is

outlined in Figure 1A The yeast proteins are isolated via acetone precipitation followed by

digestion with trypsin to produce peptides Phosphopeptides are intermixed with the entire

tryptic peptide sample and therefore Fe-NTA IMAC is used to enrich the phosphopeptides To

improve upon the number of phosphopeptides we then performed two dimensional separation

with high pH RP followed by low pH RP C18 and infused the eluent directly onto an ion trap

mass spectrometer Figure 1B show an improved technique for the first dimension of separation

to combine the early eluting and late eluting fractions from the first phase of separation to reduce

overall MS analysis time The combination of fractions 1 and 12 2 and 13 etc essentially

improves the duty cycle of the mass spectrometer by ensuring sufficient peptide content is

injected onto a low pH nanoLC RP C18 column

ETD-triggered mass spectrometry

151

In the present study labile phosphorylation can lead to non-informative neutral loss

During MS scanning mode the instrument will choose the 6 most abundant peaks with correct

isotopic distribution and select them for MSMS CID fragmentation During CID fragmentation

it is common for labile PTMs such as phosphorylation to undergo neutral loss and thus limited

informative b and y-type ions are formed Alternatively ETD fragmentation can be used on

specific MSMS scans which produce a neutral loss indicative of phosphorylation (98Daz or

80Daz) The subsequent ETD fragmentation on these putative phosphopeptides will lead to

uniform backbone cleavage resulting in confident identification of phosphopeptides with site-

specific localization during MSMS It is important to note that CID fragmentation still produces

very informative fragmentation for phosphorylation but ETD provides an orthogonal

fragmentation pathway to further increase the phosphoproteome coverage Additionally the

duty cycle for ETD fragmentation is roughly half that of CID therefore about half as many

potential peptides would be fragmented and sequenced if the instrument was using ETD

fragmentation exclusively

Protein Data

Even with phosphopeptide enrichment it is inevitable that non-phosphopeptides will also

be identified All data were searched with Mascot and in total over 1000 proteins were identified

with a 1 FDR rate and are listed in Supplemental Table 1 The proteins listed in Supplemental

Table 1 include both phosphoproteins and non-phosphorylated proteins In addition to the

proteins identified in the fed and starved states the unique peptides and spectral counts are also

listed for reference The phosphopeptides are specifically listed with a 54 FDR rate in

Supplemental Table 2 and comparisons between starved and fed with Ascore analysis are listed

for every phosphopeptide identified A higher confidence of phosphopeptide identification is

152

sometimes required before investing in time consuming biological experiments so a list of

phosphopeptides identified in starved and fed was taken from a setting directing Scaffold to

produce a 05 FDR which resulted in the generated FDR of 06 FDR and listed in

Supplemental Table 3

A summary of specific phosphorylation sites are listed in Table 1 with a 06 FDR and

Table 2 with a 54 FDR Each table gives totals for unique phosphopeptides identified having

an Ascore localization score ge80 without Ascore and phosphorylation events on each unique

peptides As expected the majority of phosphorylation events (over 50) occurred on serine

whereas fewer phosphorylation events (~10) occurred on tyrosine In addition the vast

majority of phosphorylation events were single phosphorylation (ge65) with very few

identifications having more than two phosphosites per peptide For specific phosphopeptide

identification and subsequent phosphoprotein identification refer to Supplemental Table 2 and 3

Discussion

Transcriptional response to glucose feeding

Yeast responds to the repletion of glucose after glucose-depletion by broad

transcriptional changes to induce growth Roughly one-third of the yeast genome is altered by at

least two fold when glucose is introduced as shown in Figure 2 Figure 2 is a heat map from a

microarray analysis showing 1601 genes up regulated and 1457 genes down regulated after

addition of glucose compared to the starved state The arbitrary cut-offs for these values were as

follows two fold induction after nutrient repletion and a false discovery rate (q-value) of 001

Red color in the heat map indicates a particular gene is up-regulated in the fed state compared to

the starved state Alternatively genes coded in green are less expressed in the fed state

compared to the starved condition The intensity of the green or red colors is indicative of the

153

intensity of the fold change in gene expression These large transcriptional changes after glucose

repletion drive and complement the current phosphoproteomic study

PKA motif analysis

One benefit of a large scale phosphoproteomics experiment is to examine the different

phosphopeptides from known motifs for specific kinases PKA and TOR are responsible for the

majority of the transcriptional response and thus PKA is a good target for motif analysis Figure

3 shows the PKA motif sequence xxxRKRKxSxxxxxx with the phosphorylation occurring on

the serine Other than F16P (fructose-16-bisphosphatase) all the proteins listed are unique to the

starved or fed samples A motif sequence will inevitably show up by random chance in any

analysis For this study the control for motif analysis uses the swissprot protein list for the

entire yeast proteome for the background Compared to background this specific PKA kinase

from Figure 3 is up-regulated by 264 fold when compared to the background One interesting

protein emerged from this motif analysis in the fed sample but not the starved sample is

Ssd1which is implicated in the control of the cell cycle in G1 phase42

Ssd1 also is

phosphorylated by Cbk1 and Cbk1 has been proposed to inhibit Ssd143

and provides an

intriguing target for future studies on starved vs glucose fed yeast growth

Localization of the phosphorylation sites

When a phosphopeptide contains any number of serine threonine or tyrosine amino

acids the localization of the phosphosite can sometimes be ambiguous Database searches used

to identify peptides like Mascot do not provide any probability for localization of correct

phosphorylation sites Ascore is an algorithm that uses the same MSMS spectra as Mascot but

instead of comparing theoretical peptide MSMS to experimental spectra Ascore searches for

informative a b or y-type peptide fragments giving evidence for phosphosite The Scaffold

154

program adds a localization probability to the Ascore values and the values are listed in

Supplemental Table 2 and 3 Figure 4 has a representative Ascore mass spectra assigning the

peaks identified and providing evidence that the phosphorylation site occurs at the threonine

instead of the serine Incorporating Ascore into this study provides a layer of validation for

putative phosphosite identification

Plasma Membrane 2-ATPase

A previous study identified and localized phosphorylation sites on plasma membrane 1-

ATPase after glucose was introduced to starved yeast15

In the current study PMA2 (plasma

membrane ATPase 2) was identified in glucose fed and not starved samples The doubly

threonine phosphorylated peptide identified in fed yeast is PMA2 with amino acid sequence

IKMLTGDAVGIAKETCR (583-599) whereas the homologous peptide in PMA1 has the exact

same amino acid sequence except for the first isoleucine substituted for valine

VKMLTGDAVGIAKETCR (554-570 not observed in data) PMA2 was observed in the 06

FDR list of phosphopeptides and has an Ascore localization probably of 100 A plant study

showed that PMA2 phosphorylation level was higher in early growth phase than when in

stationary phase44

In addition PMA2 expression in yeast permits the growth of yeast and

threonine phosphorylation has been reported on Thr-95545

The identification of PMA2 in the

fed glucose cell extract provides an interesting target for future study on the molecular

mechanisms involved in regulation growth in starved vs glucose fed yeast

Conclusion

In conclusion this work provides a qualitative comparison in the phosphoproteome

between starved and glucose fed yeast A large scale IMAC enrichment of yeast cell lysate

followed by 2-D separation provided a rich source of phosphopeptides for neutral loss triggered

155

ETD analysis using mass spectrometry A specific motif for PKA is highlighted to show the

differences in proteins identified between starved vs fed conditions In total 477 unique

phosphopeptides are identified with 06 FDR and 669 unique phosphopeptides identified with

54 FDR Phosphosite validation is performed using a localization algorithm Ascore to

provide further confidence on the site-specific characterization of these phosphopeptides The

proteins Ssd1 and PMA2 emerged as potential intriguing targets for more in-depth studies on

protein phosphorylation involved in glucose response

Supplemental Tables 1 2 and 3 are available upon request

References

1 Martinez M J Roy S Archuletta A B Wentzell P D Anna-Arriola S S

Rodriguez A L Aragon A D Quinones G A Allen C Werner-Washburne M Genomic

analysis of stationary-phase and exit in Saccharomyces cerevisiae gene expression and

identification of novel essential genes Mol Biol Cell 2004 15 (12) 5295-305

2 Radonjic M Andrau J C Lijnzaad P Kemmeren P Kockelkorn T T van Leenen

D van Berkum N L Holstege F C Genome-wide analyses reveal RNA polymerase II

located upstream of genes poised for rapid response upon S cerevisiae stationary phase exit Mol

Cell 2005 18 (2) 171-83

3 Slattery M G Heideman W Coordinated regulation of growth genes in

Saccharomyces cerevisiae Cell Cycle 2007 6 (10) 1210-9

4 Wang Y Pierce M Schneper L GAtildefrac14ldal C G k e Zhang X Tavazoie S

Broach J R Ras and Gpa2 Mediate One Branch of a Redundant Glucose Signaling Pathway in

Yeast PLoS Biol 2004 2 (5) e128

5 Newcomb L L Hall D D Heideman W AZF1 is a glucose-dependent positive

regulator of CLN3 transcription in Saccharomyces cerevisiae Mol Cell Biol 2002 22 (5) 1607-

14

6 Newcomb L L Diderich J A Slattery M G Heideman W Glucose regulation of

Saccharomyces cerevisiae cell cycle genes Eukaryot Cell 2003 2 (1) 143-9

7 Carlson M Glucose repression in yeast Curr Opin Microbiol 1999 2 (2) 202-7

8 Gancedo J M Yeast carbon catabolite repression Microbiol Mol Biol Rev 1998 62

(2) 334-61

9 Johnston M Feasting fasting and fermenting Glucose sensing in yeast and other cells

Trends Genet 1999 15 (1) 29-33

156

10 Warner J R The economics of ribosome biosynthesis in yeast Trends Biochem Sci

1999 24 (11) 437-40

11 Li X Gerber S A Rudner A D Beausoleil S A Haas W Villen J Elias J E

Gygi S P Large-scale phosphorylation analysis of alpha-factor-arrested Saccharomyces

cerevisiae J Proteome Res 2007 6 (3) 1190-7

12 Ficarro S B McCleland M L Stukenberg P T Burke D J Ross M M

Shabanowitz J Hunt D F White F M Phosphoproteome analysis by mass spectrometry and

its application to Saccharomyces cerevisiae Nat Biotechnol 2002 20 (3) 301-5

13 Gruhler A Olsen J V Mohammed S Mortensen P Faergeman N J Mann M

Jensen O N Quantitative phosphoproteomics applied to the yeast pheromone signaling

pathway Mol Cell Proteomics 2005 4 (3) 310-27

14 Peng J Schwartz D Elias J E Thoreen C C Cheng D Marsischky G Roelofs

J Finley D Gygi S P A proteomics approach to understanding protein ubiquitination Nat

Biotechnol 2003 21 (8) 921-6

15 Lecchi S Nelson C J Allen K E Swaney D L Thompson K L Coon J J

Sussman M R Slayman C W Tandem phosphorylation of Ser-911 and Thr-912 at the C

terminus of yeast plasma membrane H+-ATPase leads to glucose-dependent activation J Biol

Chem 2007 282 (49) 35471-81

16 Cohen P The regulation of protein function by multisite phosphorylation--a 25 year

update Trends Biochem Sci 2000 25 (12) 596-601

17 Kalume D E Molina H Pandey A Tackling the phosphoproteome tools and

strategies Curr Opin Chem Biol 2003 7 (1) 64-9

18 Nagaraj N DSouza R C Cox J Olsen J V Mann M Feasibility of large-scale

phosphoproteomics with higher energy collisional dissociation fragmentation J Proteome Res

2010 9 (12) 6786-94

19 Olsen J V Vermeulen M Santamaria A Kumar C Miller M L Jensen L J

Gnad F Cox J Jensen T S Nigg E A Brunak S Mann M Quantitative

phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis Sci

Signal 2010 3 (104) ra3

20 Breitkopf S B Asara J M Determining In Vivo Phosphorylation Sites Using Mass

Spectrometry In Current Protocols in Molecular Biology John Wiley amp Sons Inc 2012

21 Steen H Jebanathirajah J A Rush J Morrice N Kirschner M W Phosphorylation

analysis by mass spectrometry myths facts and the consequences for qualitative and

quantitative measurements Mol Cell Proteomics 2006 5 (1) 172-81

22 Kweon H K Hakansson K Metal oxide-based enrichment combined with gas-phase

ion-electron reactions for improved mass spectrometric characterization of protein

phosphorylation J Proteome Res 2008 7 (2) 749-55

23 Larsen M R Thingholm T E Jensen O N Roepstorff P Jorgensen T J Highly

selective enrichment of phosphorylated peptides from peptide mixtures using titanium dioxide

microcolumns Mol Cell Proteomics 2005 4 (7) 873-86

24 Kokubu M Ishihama Y Sato T Nagasu T Oda Y Specificity of immobilized

metal affinity-based IMACC18 tip enrichment of phosphopeptides for protein phosphorylation

analysis Anal Chem 2005 77 (16) 5144-54

25 Swaney D L Wenger C D Thomson J A Coon J J Human embryonic stem cell

phosphoproteome revealed by electron transfer dissociation tandem mass spectrometry Proc

Natl Acad Sci U S A 2009 106 (4) 995-1000

157

26 Hao P Guo T Sze S K Simultaneous analysis of proteome phospho- and

glycoproteome of rat kidney tissue with electrostatic repulsion hydrophilic interaction

chromatography PLoS One 2011 6 (2) e16884

27 Rush J Moritz A Lee K A Guo A Goss V L Spek E J Zhang H Zha X

M Polakiewicz R D Comb M J Immunoaffinity profiling of tyrosine phosphorylation in

cancer cells Nat Biotechnol 2005 23 (1) 94-101

28 Ficarro S Chertihin O Westbrook V A White F Jayes F Kalab P Marto J A

Shabanowitz J Herr J C Hunt D F Visconti P E Phosphoproteome analysis of

capacitated human sperm Evidence of tyrosine phosphorylation of a kinase-anchoring protein 3

and valosin-containing proteinp97 during capacitation J Biol Chem 2003 278 (13) 11579-89

29 Washburn M P Wolters D Yates J R 3rd Large-scale analysis of the yeast

proteome by multidimensional protein identification technology Nat Biotechnol 2001 19 (3)

242-7

30 Dowell J A Frost D C Zhang J Li L Comparison of two-dimensional

fractionation techniques for shotgun proteomics Anal Chem 2008 80 (17) 6715-23

31 Song C Ye M Han G Jiang X Wang F Yu Z Chen R Zou H Reversed-

phase-reversed-phase liquid chromatography approach with high orthogonality for

multidimensional separation of phosphopeptides Anal Chem 2010 82 (1) 53-6

32 Palumbo A M Smith S A Kalcic C L Dantus M Stemmer P M Reid G E

Tandem mass spectrometry strategies for phosphoproteome analysis Mass Spectrom Rev 2011

30 (4) 600-25

33 Beausoleil S A Jedrychowski M Schwartz D Elias J E Villen J Li J Cohn M

A Cantley L C Gygi S P Large-scale characterization of HeLa cell nuclear

phosphoproteins Proc Natl Acad Sci U S A 2004 101 (33) 12130-5

34 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and

protein sequence analysis by electron transfer dissociation mass spectrometry Proc Natl Acad

Sci U S A 2004 101 (26) 9528-33

35 Coon J J Syka J E P Schwartz J C Shabanowitz J Hunt D F Anion

dependence in the partitioning between proton and electron transfer in ionion reactions

International Journal of Mass Spectrometry 2004 236 (1acirceuroldquo3) 33-42

36 Hui L Cunningham R Zhang Z Cao W Jia C Li L Discovery and

characterization of the Crustacean hyperglycemic hormone precursor related peptides (CPRP)

and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes sapidus using

multiple tandem mass spectrometry techniques J Proteome Res 2011 10 (9) 4219-29

37 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-

site identity and position on electron-transfer dissociation of polypeptide cations J Am Chem Soc

2007 129 (40) 12232-43

38 Coon J J Collisions or electrons Protein sequence analysis in the 21st century Anal

Chem 2009 81 (9) 3208-15

39 Miller M E Cross F R Distinct subcellular localization patterns contribute to

functional specificity of the Cln2 and Cln3 cyclins of Saccharomyces cerevisiae Mol Cell Biol

2000 20 (2) 542-55

40 Perkins D N Pappin D J Creasy D M Cottrell J S Probability-based protein

identification by searching sequence databases using mass spectrometry data Electrophoresis

1999 20 (18) 3551-67

158

41 Beausoleil S A Villen J Gerber S A Rush J Gygi S P A probability-based

approach for high-throughput protein phosphorylation analysis and site localization Nat

Biotechnol 2006 24 (10) 1285-92

42 Sutton A Immanuel D Arndt K T The SIT4 protein phosphatase functions in late

G1 for progression into S phase Mol Cell Biol 1991 11 (4) 2133-48

43 Jansen J M Wanless A G Seidel C W Weiss E L Cbk1 regulation of the RNA-

binding protein Ssd1 integrates cell fate with translational control Curr Biol 2009 19 (24)

2114-20

44 Kanczewska J Marco S Vandermeeren C Maudoux O Rigaud J L Boutry M

Activation of the plant plasma membrane H+-ATPase by phosphorylation and binding of 14-3-3

proteins converts a dimer into a hexamer Proc Natl Acad Sci U S A 2005 102 (33) 11675-80

45 Maudoux O Batoko H Oecking C Gevaert K Vandekerckhove J Boutry M

Morsomme P A plant plasma membrane H+-ATPase expressed in yeast is activated by

phosphorylation at its penultimate residue and binding of 14-3-3 regulatory proteins in the

absence of fusicoccin J Biol Chem 2000 275 (23) 17762-70

159

Figure 1 The general workflow indicating the major steps involved in sample collection

sample processing mass spectrometric data acquisition and analysis of comparative

phosphoproteomics for starved vs glucose fed yeast (A) The high pH RP separation

procedure for combining fractions to reduce low peptide containing fractions from the

UV-VIS trace is also shown (B)

160

Figure 2 Heat map for the global transcriptional response to glucose fed vs starved samples

S288C cells starved for glucose until growth was arrested and subsequently glucose was added

161

Samples for microarray analysis were taken at the starved state and 60 minutes after glucose was

added The heat map shows the fed log2 fold change for each gene relative to the starved state

across the genome performed in biological replicate (A) Black indicates no change in

expression level while red indicates higher expression for the fed relative to the starved state

Green indicates higher expression for the starved state compared to the fed state (Adapted from

Dr Michael Conways Thesis)

162

Figure 3 The motif for protein kinase A (PKA) that was observed for fed vs starved which is

xxxRKRKxSxxxxxx with the phosphorylation occurring on the serine The motif occurred at a

rate 264 fold higher than the yeast proteome used for background In addition one protein was

observed in both starved and fed with accession identification of F16P (Fructose-16-

bisphosphatase)

163

06 FDR phosphopeptide analysis

Table 1 The number of phosphopeptides identified with a 06 FDR rate for starved and

glucose fed yeast Only unique phosphopeptides are included in the numbers listed Ascore

localization probability ge80 and no Ascore value which totals all phosphopeptides regardless

of Ascore value even if the Ascore localization value was 0 are shown for starved or fed

samples The number of phosphorylation events on each unique peptide is also included The

vast majority of the phosphopeptides identified have a single phosphosite

Starved Fed All

Ascore ge80 score

unique

STY 164 153 317

S 87 (530) 82 (536) 169 (533)

T 60 (366) 55 (359) 115 (363)

Y 17 (104) 16 (105) 33 (104)

Unique no Ascore

STY 242 235 477

S 131 (541) 133 (566) 264 (553)

T 86 (355) 78 (332) 164 (344)

Y 25 (103) 24 (102) 49 (103)

Phosphorylation events

on each unique peptide

1 102 113 187

2 36 40 68

3 12 11 22

4 or more 8 3 11

164

54 FDR phosphopeptide analysis

Starved Fed All

Ascore ge80 score

unique

STY 217 217 434

S 115 (530) 113 (521) 228 (525)

T 78 (359) 78 (359) 156 (359)

Y 24 (111) 26 (120) 50 (115)

Unique no Ascore

STY 337 332 669

S 193 (573) 180 (542) 373 (558)

T 111 (329) 116 (349) 227 (339)

Y

Phosphorylation events

on each unique peptide

1

2

3

4 or more

33 (98)

135

56

16

11

36 (108)

169

55

14

3

69 (103)

280

100

27

13

Table 2 The number of phosphopeptides identified with a 54 FDR rate for starved and

glucose fed yeast Only unique phosphopeptides are included in the numbers listed Ascore

localization probability ge80 and no Ascore value which totals all phosphopeptides regardless

of Ascore value even if the Ascore localization value was 0 are shown for starved or fed

samples The number of phosphorylation events on each unique peptide is also included The

vast majority of the phosphopeptides identified have a single phosphosite

165

Figure 4 Ascore validation of the phosphopeptide LSLAtKK from the protein SGM1 (slow

growth on galactose and mannose protein 1) with 100 localization probability observed

in only fed but not starved yeast samples Ascorersquos algorithm identifies a b and y-type

ions and looks to identify peaks that provide evidence for a specific phosphorylation site

For this example Ascore assigns this phosphosite to threonine (T5) instead of the serine

(S2) The intense peaks at 4024 mz and 5250 mz are not identified by any a b or y-

type ions From the ladder sequence of the peptide shown numerous ions indicate the

threonine is phosphorylated while the serine is not Among these ions used for

localization are b2 y2 y5+H2O y3 y4 and y5

166

Chapter 6

Use of electron transfer dissociation for neuropeptide sequencing and

identification

Adapted from ldquoDiscovery and Characterization of the Crustacean Hyperglycemic Hormone

Precursor Related Peptides (CPRP) and Orcokinin Neuropeptides in the Sinus Glands of the Blue

Crab Callinectes sapidus Using Multiple Tandem Mass Spectrometry Techniquesrdquo Hui L

Cunningham R Zhang Z Cao W Jia C Li L J Proteome Res 2011 10(9) 4219-29

167

Abstract

The crustacean Callinectes sapidus sinus gland (SG) is a well-defined neuroendocrine site that

produces numerous hemolymph-borne agents including the most complex class of endocrine

signaling moleculesmdashneuropeptides One such peptide is the crustacean hyperglycemic hormone

(CHH) precursor-related peptide (CPRPs) which was not previously sequenced Electron

transfer dissociation (ETD) was used for sequencing of intact CPRPs due to their large size and

high charge state In addition ETD was used to sequence the sulfated peptide Cholecystokinin

CCK-like Homarus americanus using a salt adduct Collectively these two examples

demonstrated the utility of ETD in sequencing neuropeptides with large molecular weight or

with labile modifications

168

Introduction

Neuropeptides are the largest and most diverse group of endocrine signaling molecules in

the nervous system They are necessary and critical for initiation and regulation of numerous

physiological processes such as feeding reproduction and development1 2

Mass spectrometry

(MS) with unique advantages such as high sensitivity high throughput chemical specificity and

the capability of de novo sequencing with limited genomic information is well suited for the

detection and sequencing of neuropeptides in endocrine glands and simultaneously provides the

potential for information on post-translational modifications such as sulfonation3-6

The sinus glands (SG) are located between the medulla interna and medulla externa of the

eyestalk in Callinectes sapidus It is a well-known neuroendocrine site that synthesizes and

secretes peptide hormones regulating various physiological activities such as molting

hemolymph glucose levels integument color changes eye pigment movements and

hydro-mineral balance7 Our labrsquos previous neuropeptidomic studies of SG in several

crustacean species including Cancer borealis8-11

Carcinus maenas12

and Homarus americanus13

14 used multiple MS strategies combining MALDI-based high resolution accurate mass profiling

biochemical derivatization and nanoscale separation coupled to tandem MS for de novo

sequencing In the current study we explore the neuropeptidome of the SG in the blue crab

Callinectes sapidus a vital species of economic importance on the seafood market worldwide In

total 70 neuropeptides are identified including 27 novel neuropeptides and among them the

crustacean hyperglycemic hormone precursor-related peptides (CPRPs) and orcokinins represent

major neuropeptide families known in the SG

The crustacean hyperglycemic hormone (CHH) and its precursor-related peptide (CPRP) are

produced concurrently during the cleavage of CHH from the CHH preprohormone protein15

The

169

CPRP peptide is located between the signal peptide and the CHH sequence and is separated from

the CHH by a dibasic cleavage site The functions of CPRPs are currently unknown16

However

the complete characterization of CPRPs provides a foundation for future experiments elucidating

their functional roles in crustaceans The cDNA of CHH preprohormone in SG of Callinectes

sapidus has been characterized17

but the direct detection of CPRP has not been reported due to

its relatively large size and possible post-translational modifications While small fragments of

CPRPs can be identified using nanoLC-ESI-Q-TOF tandem mass spectrometry the intact

peptide is difficult to detect due to the large molecular weight of CPRPs

Peptidomics traditionally uses collision induced dissociation (CID) for tandem MS

experiments for de novo sequencing Recently an alternative peptide fragmentation method has

been developed using the transfer of an electron termed electron transfer dissociation (ETD)18 19

ETD involves a radical anion with low electron affinity to be transferred to peptide cation which

results in reduced sequence discrimination and thus provides improved sequencing for larger

peptides compared to CID20

Specifically for an ion trap ETD the fluoranthene radical anion is

allowed to react with peptide cations in the three dimensional trap The resulting dissociation

cleaves at the N-Cα bond to produce c and zbull type product ions The peptidome of model

organisms like Callinectes sapidus can be further explored and expanded utilizing ETD as a

complementary fragmentation technique to CID Previous peptidomic analysis has been

completed using ETD as an additional fragmentation method21

It was observed that

enzymatically produced peptides with a higher mz produced improved sequence coverage using

ETD This observation termed decision tree analysis determined that a charge state of ge6 all

peptides endogenous or enzymatic should be fragmented via ETD22

In the present study the

highly charged peptide CPRP with an intact molecular weight of 3837 readily produces plus six

170

charge states The higher charge state of CPRP is highly amenable to fragmentation by ETD

which produces remarkably improved fragmentation and thus increased sequence coverage when

compared to CID

Sulfonation of tyrosine is a post-translational modification (PTM) that is thought to occur on

trans-membrane spanning and secreted proteins23

Cholecystokinin-8 (CCK-8) is a sulfated

peptide which has been linked to satiety24-26

and a CCK-like peptide has been observed in

lobster27

Sulfonation is an extremely labile modification and is often lost during soft

ionization such as ESI or MALDI but can be avoided by optimizing the ionization process28

One potential strategy to identify sulfopeptides is to scan for a neutral loss of 80 Da after CID

but this method does not allow for identification of site of sulfonation and has the risk to be

mistaken for phosphorylation The sulfonation PTM is a negatively charged modification on

the peptide which allows for negative ion scanning in the mass spectrometer but provides

minimal MSMS sequence coverage during CID due to preferential loss of the sulfate group

Alternatively electron-based dissociation technique enables more rapid radical driven

fragmentation where the cleavage pattern is more uniform along the peptide backbone without

initially cleaving the labile sulfated group thus preserving the site of modification These types

of dissociation techniques only work for multiply-charged ions thus a method to install a

multiply-charged cation on the target peptide is desirable It has been shown that the electron

capture dissociation (ECD) can be used to sequence sulfated peptides when a multi-charged

cation is added to the solution29

We use a similar multi-charge cation solution technique to

sequence a sulfated peptide from Homarus americanus via ETD on an ion trap mass

spectrometer Here we presented the use of the ETD fragmentation technique for the analysis

of large peptides and peptides containing labile post-translational modification

171

Experimental Section

Chemical and materials

Acetonitrile methanol magnesium chloride (ACS grade) potassium chloride (ACS grade) and

calcium chloride (ACS grade) were purchased from Fisher Scientific (Pittsburgh PA) Formic

acid was from Sigma-Aldrich (St Louis MO) Human sulfated CCK-8 and CCK-like peptide

(E(pyro)FDEY(Sulfo)GHMRFamide) were purchased from American Peptide (Sunnyvale CA)

Fused-silica capillary with 75 μm id and 360 μm od was purchased from Polymicro

Technologies (Phoenix AZ) Millipore C18 Ziptip column was used for sample cleaning and all

water used in this study was deionized water (182 MΩcm) prepared from a Milli-Q Millipore

system (Billerica MA) The physiological saline consisted of 440 mM NaCl 11mM KCl 26

mM MgCl2 13 mM CaCl2 11 mM Trizma base and 5 mM maleic acid in pH 745

Animals and dissection

Callinectes sapidus (blue crab) were obtained from commercial food market and maintained

without food in artificial sea water at 10-12 ordmC Animals were cold-anesthetized by packing on

ice for 15-30min before dissection The optic ganglia with the SG attached were dissected in

chilled (~4ordmC) physiological saline and individual SGs were isolated from the optic ganglia by

micro-dissection and immediately placed in acidified methanol (90 methanol 9 glacial acetic

acid and 1 water) and stored at -80ordmC until tissue extraction

Tissue homogenization

Acidified methanol was used during the homogenization of SGs The homogenized SGs were

immediately centrifuged at 16100 g using an Eppendorf 5415D tabletop centrifuge (Eppendorf

172

AG) The pellet was washed using acidified methanol and combined with the supernatant and

further dried in a Savant SC 110 SpeedVac concentrator (Thermo Electron Corporation) The

resulting dried sample was re-suspended with a minimum amount (20 microL) of 01 formic acid

Fractionation of homogenates using reversed phase (RP)-HPLC

The re-suspended extracts were then vortexed and briefly centrifuged The resulting supernatants

were subsequently fractionated via high performance liquid chromatography (HPLC) HPLC

separations were performed using a Rainin Dynamax HPLC system equipped with a Dynamax

UV-D II absorbance detector (Rainin Instrument Inc Woburn MA) The mobile phases included

Solution A (deionized water containing 01 formic acid) and Solution B (acetonitrile containing

01 formic acid) About 20 μl of extract was injected onto a Macrosphere C18 column (21 mm

id x 250 mm length 5 microm particle size Alltech Assoc Inc Deerfield IL) The separation

consisted of a 120 minute gradient of 5-95 Solution B Fractions were automatically collected

every two minutes using a Rainin Dynamax FC-4 fraction collector (Rainin Instrument Inc

Woburn MA) The fractions were then concentrated using in Savant SC 110 SpeedVac

concentrator (Thermo Electron Corporation) and re-suspended with minimum amount of 01

formic acid

Nano-LC-ESI-Q-TOF MSMS

Nanoscale LC-ESI-Q-TOF MSMS was performed using a Waters nanoAcquity UPLC system

coupled to a Q-TOF Micro mass spectrometer (Waters Corp Milford MA)

Chromatographic separations were performed on a homemade C18 reversed phase capillary

column (75 microm internal diameter x100 mm length 3 microm particle size 100 Aring) The mobile phases

173

used were 01 formic acid in deionized water (A) 01 formic acid in acetonitrile (B) An

aliquot of 50 microl of a tissue extract or HPLC fraction was injected and loaded onto the trap

column (Zorbax 300SB-C18 Nano trapping column Agilent Technologies Santa Clara CA)

using 95 mobile phase A and 5 mobile phase Bat a flow rate of 10 microlmin for 10 minutes

Following this the stream select module was switched to a position at which the trap column

came in line with the analytical capillary column and a linear gradient of mobile phases A and B

was initiated For neuropeptides the linear gradient was from 5 buffer B to 45 buffer B over

90 min The nanoflow ESI source conditions were set as follows capillary voltage 3200 V

sample cone voltage 35 V extraction cone voltage 1 V source temperature 100ordmC A data

dependent acquisition was employed for the MS survey scan and the selection of three precursor

ions and subsequent MSMS of the selected parent ions The MS scan range was from mz

400-1800 and the MSMS scan was from mz 50-1800

Peptide Prediction De Novo Sequencing and Database Searching

De novo sequencing was performed using a combination of MassLynxTM

41 PepSeq software

(Waters) and manual sequencing Tandem mass spectra acquired from the Q-TOF were first

deconvoluted using MaxEnt 3 software (Waters) to convert multiply charged ions into their

singly charged forms The resulting spectra were pasted into the PepSeq window for sequencing

analysis The candidate sequences generated by the PepSeq software were compared and

evaluated for homology with previous known peptides The online program blastp (National

Center for Biotechnology Information Bethesda MD httpwwwncbinlmnihgovBLAST)

was used to search the existing NCBI crustacean protein database using the candidate peptide

sequences as queries For all searches the blastp database was set to non-redundant protein

174

sequences (ie nr) and restricted to crustacean sequences (ie taxid 6657) For each of the

proteins putatively identified via blastp the BLAST score and BLAST-generated E-value for

significant alignment are provided in the appropriate subsection of the results Peptides with

partial sequence homology were selected for further examination by comparing theoretical

MSMS fragmentation spectra generated by PepSeq with the raw MSMS spectra If the

fragmentation patterns did not match well manual sequencing was performed

NanoLC Coupled to MSMS by CID and ETD

Setup for RP nanoLC separation

The setup for nanoLC consisted of an Eksigent nanoLC Ultra (Dublin CA) with a 10 microL

injection loop Mobile phase A consisted of H2O in 01 formic acid and mobile phase B

consisted of ACN (Fisher Scientific Optima LCMS grade Fair Lawn NJ) Injections

consisted of 5 microL of prepared tissue extract onto an Agilent Technologies Zorbax 300 SB-C18 5

microm 5 x 03 mm trap cartridge (Santa Clara CA) at a flow rate of 5 microLmin for 5 minutes at 95

A5 B Separation was performed on a Waters HPLC column with 3 microm Atlantis dC18 75 microm

x 150 mm (Milford MA) on a gradient from 5 to 45 mobile phase B at 250 nLmin over 90

minutes at room temperature Emitter tips were pulled from 75 microm inner diameter 360 microm

outer diameter capillary tubing (Polymicro Technologies Phoenix AZ) using a commercial

laser puller model P-2000 (Sutter Instrument Co Novato CA)

Mass Spectrometry Data Acquisitions for Collision Induced Dissociation (CID)

An ion-trap mass spectrometer (amaZon ETD Bruker Daltonics Bremen Germany) equipped

with an on-line nanospray source was used for mass spectrometry data acquisition Hystar

(Version 32 Bruker Daltonics Bremen Germany) was used to couple and control Eksigent

175

nanoLC software (Dublin CA Version 30 Beta Build 080715) to MS acquisition for all

experiments Smart parameter setting (SPS) was set to mz 700 compound stability and trap

drive level were set at 100 Optimization of the nanospray source resulted in dry gas

temperature 125oC dry gas 40 Lmin capillary voltage -1300 V end plate offset -500 V

MSMS fragmentation amplitude 10V and SmartFragmentation set at 30-300

Data was generated in data dependent mode with strict active exclusion set after two spectra and

released after one minute MSMS was obtained via CID fragmentation for the six most

abundant MS peaks with a preference for doubly charged ions and excluded singly charged ions

For MS generation the ion charge control (ICC) target was set to 200000 maximum

accumulation time 5000 ms four spectral average acquisition range of mz 300-1500 and scan

speed (enhanced resolution) of 8100 mz s-1

For MSMS generation the ICC target was set to

200000 maximum accumulation time 5000 ms three spectral averages acquisition range of

mz 100-2000 and scan speed (Ultrascan) of 32000 mz s-1

Mass Spectrometry Data Acquisitions for Electron Transfer Dissociation (ETD)

The amaZon ETD ion-trap mass spectrometer was operated in electron transfer dissociation for

MSMS fragmentation with the same optimized settings as reported for CID unless otherwise

stated Smart parameter setting (SPS) was set to 500 mz compound stability and trap drive

level were set at 100 MSMS was obtained via ETD fragmentation for the four most

abundant MS peaks with no preference for specifically charged ions except to exclude singly

charged ions The ETD reagent parameters were set to 400000 target ICC for the fluoranthene

radical anion The fluoranthene radical anion has an mz of 210 and therefore the 210 mz value

was removed from the resulting MSMS spectra and 160 mz was set for the MSMS low mz

cut-off

176

Direct Infusion of Sulfated Peptide Standards into the Mass Spectrometer using ETD and

CID Fragmentation

The sulfated peptide standards were infused into the mass spectrometer at a flow rate of 300

nlmin using a fused-silica capillary with 75 μm id and 360 μm od coupled to a custom pulled

tip The CCK-8 human peptide (20 microM) was infused in 5050 ACNH2O and detected in

negative ionization mode with an ICC of 70000 and fragmented with CID using the same

settings as the previous CID experiments Alternatively the CCK-like lobster sulfated peptide

(2 microM) was mixed with 30 microM of different salts (MgCl2 KCl or CaCl2) and 01 formic acid in

5050 ACNH2O The plus three charge of the CCK-like lobster peptide was selected for ETD

fragmentation in positive mode with the same setting as the previous ETD experiments The

data were then de novo sequenced for coverage and localization of the sulfation site

Data Analysis

MS data were processed with DataAnalysis (Version 40 Bruker Daltonics Bremen Germany)

Deviations in parameters from the default Protein Analysis in DataAnalysis were as follows

intensity threshold 1000 maximum number of compounds 1E7 and retention time window 05

minutes These parameter changes assisted in providing better quality spectra for sequencing

Identification of peptides was performed using Mascot (Version 23 Matrix Science London

UK) Searches were performed against a custom crustacean database none enzyme allow up to

1 missed cleavage amidated C-terminal as variable modifications peptide tolerance mass error

12 Da MSMS mass error tolerance is 06 Da

Results and Discussion

177

Identification and Characterization of Intact CPRPs Using ETD

Using ESI-Q-TOF 9 truncated CPRPs fragments were sequenced to cover the amino acid

sequence of the intact CPRP peptide RSAEGLGRMGRLLASLKSDTVTPLRGFEGETGHPLE

A representative CID MSMS spectrum of ASLKSDTVTPLR is shown in Figure 1 (a) CID

using ESI-Q-TOF mainly produces fragmentation for doubly and triply charge molecules which

is adequate to sequence peptides with relatively small molecular weight (less than 2000 Da)

However CID fragmentation is insufficient to sequence the larger intact CPRP in a complex

sample whose molecular mass is 3837 Da and therefore it is extremely difficult to directly

sequence by traditional CID ESI-Q-TOF ETD is an attractive fragmentation alternative to

sequence the highly charged CPRP peptide In ETD a protonated peptide reacts with an anion

(fluoranthene radical) where the anion transfers an electron to the peptide cation and the resulting

fragmentation occurs along the N-Cα bond by free-radical-driven-cleavage The large size of

CPRP offers multiple basic amino acid sites to sequester charge and reduce the sequence

coverage from collision induced dissociate by preventing random backbone cleavage whereas

ETD does not cleave the weakest bond and provides the alternative fragmentation pathway to

obtain superior sequence coverage to larger peptides As shown in Figure 1 (cd) the

fragmentation of ETD is highly amenable to ETD compared to the CID fragmentation in Figure

1 (b) As evident from comparison ETD offers more extensive fragmentation than CID thus

providing enhanced coverage for large intact peptides such as CPRPs Specifically we observe

125 of the b- and y- cleavages for CID as compared to an average of 535 of c- and zbull-

fragments More than a four-fold increase in fragments using ETD suggests the utility of this

relatively new tandem MS fragmentation method as complementary techniques for de novo

sequencing of large intact endogenous peptides and novel neuropeptide discovery endeavors

178

Negative Mode Sulfated Peptide Identification

An accepted method for identification and quantification for sulfated peptides is negative

ionization30

CCK-8 sulfated peptide standards show intense signal in negative ionization mode

without needing to have additives added such as salts Figure 2 shows the CID MSMS

spectrum from the negative ionization and produces an intense b6 ion at 430 mz The transition

from the doubly negatively charged MS ion of 570 mz to the b6 ion provides a selected reaction

monitoring transition for quantification studies but the sequence information is limited and the

presence of the methionine produces variable oxidation In addition Figure 2 shows the

MSMS product ions with the loss of the sulfate group thus making site-specific location of

sulfation more difficult

Investigation of Cation Salt Adducts to Produce Multiply Charged Sulfated Peptides

Sulfated peptides can be isolated in positive scanning mode but usually only in a plus one

state with low signal intensity If ETD is performed on the singly charged peptide cation a

neutral is formed and is lost in the mass spectrometer and thus no sequence information can be

obtained In order to remedy this situation a technique that adding metal salts to peptides to

increase charge state for ECD used in Fourier transform ion cyclotron resonance mass

spectrometry (FTICR-MS)29

inspired the investigation of increasing charge state of targeted

peptide via metal salt adduction followed by ETD analysis in a Bruker amaZon ion trap

Various salts were investigated including MgCl2 CaCl2 and KCl each used at a concentration of

30 microM and mixed with a sulfated peptide standard at 2 microM The addition of KCl produced

mostly doubly charged ion of the lobster CCK-like peptide standard K adduct which produced

insufficient sequence information from ETD fragmentation (data not shown) In comparison

the MgCl2 and CaCl2 produced intense triply charged peptide ions but CaCl2 produced lower

179

signal intensity compared to MgCl2 (data not shown)

Cation Assisted ETD Fragmentation of CCK-like Lobster Sulfated Peptide and Future

Directions

The addition of MgCl2 to the lobster CCK-like sulfated peptide is shown in Figure 3

Except for z1 the complete z-series is obtained including the z7 ion with and without the

sulfation PTM The spectrum is complicated by the additional Mg adducts but several peaks

are shown without any Mg adduct such as z2 z3 z4 etc Clearly the method of cation

assisted ETD fragmentation can be useful using ion trap instruments and can properly sequence

sulfated peptides that are prone to neutral loss from the labile PTM One concern about future

direction in sulfopeptide isolation in non-genome sequenced species is isolation of sulfopeptides

Currently antibodies to specific sulfopeptides is the only commonly used enrichment technique

for sulfopeptides Also since metal cations are needed for this method direct infusion into an

ESI mass spectrometer is preferred to prevent running large amounts of adduct forming salts

through the LC system With direct infusion the lack of separation confounds the problem in

sulfopeptide detection because any co-eluting peptides could have ionization efficiency and thus

reduce the signal of the sulfopeptide Once discovered and sequenced a selected reaction

monitoring (SRM) method could be developed using LC retention coupled with negative

ionization mode followed by CID MSMS or possibly MSMSMS to perform quantitative

studies for sulfopeptides

Conclusions

In this study ETD was performed to improve the sequence coverage of large endogenous

neuropeptides with higher charge and bigger size The intact CPRP from Callinectes sapidus was

identified and characterized with 68 sequence coverage by MS for the first time Cation

180

assisted ETD fragmentation was also shown to be a powerful tool in de novo sequencing of

sulfated neuropeptides These endeavors into using ETD for certain neuropeptides will assist in

future analysis of large neuropeptides and PTM containing neuropeptides

181

Reference

1 Schwartz M W Woods S C Porte D Jr Seeley R J Baskin D G Central nervous system control of

food intake Nature 2000 404 (6778) 661-71

2 Sweedler J V Li L Rubakhin S S Alexeeva V Dembrow N C Dowling O Jing J Weiss K R

Vilim F S Identification and characterization of the feeding circuit-activating peptides a novel neuropeptide

family of aplysia J Neurosci 2002 22 (17) 7797-808

3 Baggerman G Cerstiaens A De Loof A Schoofs L Peptidomics of the larval Drosophila melanogaster

central nervous system Journal of Biological Chemistry 2002 277 (43) 40368-40374

4 Desiderio D M Mass spectrometric analysis of neuropeptidergic systems in the human pituitary and

cerebrospinal fluid Journal of Chromatography B 1999 731 (1) 3-22

5 Li L J Kelley W P Billimoria C P Christie A E Pulver S R Sweedler J V Marder E Mass

spectrometric investigation of the neuropeptide complement and release in the pericardial organs of the crab Cancer

borealis Journal of Neurochemistry 2003 87 (3) 642-656

6 Li L J Moroz T P Garden R W Floyd P D Weiss K R Sweedler J V Mass spectrometric survey of

interganglionically transported peptides in Aplysia Peptides 1998 19 (8) 1425-1433

7 Strand F L Neuropeptides regulators of physiological processes 1st ed ed MIT Press Cambridge Mass

1999 p 658 p

8 Fu Q Goy M F Li L J Identification of neuropeptides from the decapod crustacean sinus glands using

nanoscale liquid chromatography tandem mass spectrometry Biochemical and Biophysical Research

Communications 2005 337 (3) 765-778

9 Fu Q Christie A E Li L J Mass spectrometric characterization of crustacean hyperglycemic hormone

precursor-related peptides (CPRPs) from the sinus gland of the crab Cancer productus Peptides 2005 26 (11)

2137-2150

10 Ma M M Chen R B Ge Y He H Marshall A G Li L J Combining Bottom-Up and Top-Down Mass

Spectrometric Strategies for De Novo Sequencing of the Crustacean Hyperglycemic Hormone from Cancer borealis

Analytical Chemistry 2009 81 (1) 240-247

11 Ma M M Sturm R M Kutz-Naber K K Fu Q Li L J Immunoaffinity-based mass spectrometric

characterization of the FMRFamide-related peptide family in the pericardial organ of Cancer borealis Biochemical

and Biophysical Research Communications 2009 390 (2) 325-330

12 Ma M M Bors E K Dickinson E S Kwiatkowski M A Sousa G L Henry R P Smith C M Towle

D W Christie A E Li L J Characterization of the Carcinus maenas neuropeptidome by mass spectrometry and

functional genomics General and Comparative Endocrinology 2009 161 (3) 320-334

13 Chen R B Jiang X Y Conaway M C P Mohtashemi I Hui L M Viner R Li L J Mass Spectral

Analysis of Neuropeptide Expression and Distribution in the Nervous System of the Lobster Homarus americanus

Journal of Proteome Research 2010 9 (2) 818-832

14 Ma M M Chen R B Sousa G L Bors E K Kwiatkowski M A Goiney C C Goy M F Christie A

E Li L J Mass spectral characterization of peptide transmittershormones in the nervous system and

neuroendocrine organs of the American lobster Homarus americanus General and Comparative Endocrinology

2008 156 (2) 395-409

15 Ollivaux C Gallois D Amiche M Boscameric M Soyez D Molecular and cellular specificity of

post-translational aminoacyl isomerization in the crustacean hyperglycaemic hormone family FEBS J 2009 276

(17) 4790-802

16 Wilcockson D C Chung J S Webster S G Is crustacean hyperglycaemic hormone precursor-related

peptide a circulating neurohormone in crabs Cell and Tissue Research 2002 307 (1) 129-138

17 Choi C Y Zheng J Watson R D Molecular cloning of a cDNA encoding a crustacean hyperglycemic

hormone from eyestalk ganglia of the blue crab Callinectes sapidus General and Comparative Endocrinology 2006

148 (3) 383-387

18 Syka J E Coon J J Schroeder M J Shabanowitz J Hunt D F Peptide and protein sequence analysis

by electron transfer dissociation mass spectrometry Proc Natl Acad Sci U S A 2004 101 (26) 9528-33

19 Coon J J Syka J E P Schwartz J C Shabanowitz J Hunt D F Anion dependence in the partitioning

between proton and electron transfer in ionion reactions International Journal of Mass Spectrometry 2004 236

(1-3) 33-42

20 Xia Y Gunawardena H P Erickson D E McLuckey S A Effects of cation charge-site identity and

position on electron-transfer dissociation of polypeptide cations J Am Chem Soc 2007 129 (40) 12232-43

182

21 Altelaar A F Mohammed S Brans M A Adan R A Heck A J Improved identification of endogenous

peptides from murine nervous tissue by multiplexed peptide extraction methods and multiplexed mass spectrometric

analysis J Proteome Res 2009 8 (2) 870-6

22 Swaney D L McAlister G C Coon J J Decision tree-driven tandem mass spectrometry for shotgun

proteomics Nat Methods 2008 5 (11) 959-64

23 Monigatti F Hekking B Steen H Protein sulfation analysis--A primer Biochim Biophys Acta 2006 1764

(12) 1904-13

24 Chandler P C Wauford P K Oswald K D Maldonado C R Hagan M M Change in CCK-8 response

after diet-induced obesity and MC34-receptor blockade Peptides 2004 25 (2) 299-306

25 Little T J Feltrin K L Horowitz M Meyer J H Wishart J Chapman I M Feinle-Bisset C A

high-fat diet raises fasting plasma CCK but does not affect upper gut motility PYY and ghrelin or energy intake

during CCK-8 infusion in lean men Am J Physiol Regul Integr Comp Physiol 2008 294 (1) R45-51

26 Blevins J E Morton G J Williams D L Caldwell D W Bastian L S Wisse B E Schwartz M W

Baskin D G Forebrain melanocortin signaling enhances the hindbrain satiety response to CCK-8 Am J Physiol

Regul Integr Comp Physiol 2009 296 (3) R476-84

27 Turrigiano G G Selverston A I A cholecystokinin-like hormone activates a feeding-related neural circuit in

lobster Nature 1990 344 (6269) 866-8

28 Wolfender J L Chu F Ball H Wolfender F Fainzilber M Baldwin M A Burlingame A L

Identification of tyrosine sulfation in Conus pennaceus conotoxins alpha-PnIA and alpha-PnIB further investigation

of labile sulfo- and phosphopeptides by electrospray matrix-assisted laser desorptionionization (MALDI) and

atmospheric pressure MALDI mass spectrometry J Mass Spectrom 1999 34 (4) 447-54

29 Liu H Hakansson K Electron capture dissociation of tyrosine O-sulfated peptides complexed with divalent

metal cations Anal Chem 2006 78 (21) 7570-6

30 Young S A Julka S Bartley G Gilbert J R Wendelburg B M Hung S C Anderson W H

Yokoyama W H Quantification of the sulfated cholecystokinin CCK-8 in hamster plasma using

immunoprecipitation liquid chromatography-mass spectrometrymass spectrometry Anal Chem 2009 81 (21)

9120-8

183

Figure 1 Comparison of tandem MS spectra of truncated CPRP (ASLKSDTVTPL mz 128766)

by ESI-Q-TOF (a) and intact CPRP (MW 3837) by the amaZon ETD for both CID and ETD

fragmentation (a) MSn of precursor ion with mz 640 with charge state +6 by CID represent

loss of NH3 ordm represent loss of H2O (b) MS+6

of precursor ion with mz 640 with charge state +6

by ETD at z represent z+1 z represent z+2 (c) MS+5

of precursor ion with mz 768 with charge

state +5 by ETD z represent z+1 z represent z+2 For all labels in Figure 1 charge state +1 is

not specified

184

185

Figure 2 Negative CID spectrum of CCK-8 peptide standard (20 microM) via direct infusion show

the doubly charged b6 ion provides the most intense MSMS transition

186

Figure 3 ETD spectrum of E(pyro)FDEY(Sulfo)GHMRF-amide (2 microM) obtained on the

amaZon ETD via direct infusion with 30 microM MgCl2 The sulfated peptide can be identified

with a visible z-series of z2 to z9 and identified sulfate loss

187

Chapter 7

Investigation and reduction of sub-microgram peptide loss using molecular

weight cut-off fractionation prior to mass spectrometric analysis

Adapted from ldquoInvestigation and reduction of sub-microgram peptide loss using molecular

weight cut-off fractionation prior to mass spectrometric analysisrdquo Cunningham R Wang J

Wellner D Li L Journal of Mass Spectrometry In Press

188

ABSTRACT

This work investigates the introduction of methanol and a salt modifier to molecular

weight cut-off membrane-based centrifugal filters (MWCO) to enrich sub-microgram peptide

quantities Using a neuropeptide standard bradykinin sample loss was reduced over two orders

of magnitude with and without undigested protein present Additionally a bovine serum

albumin (BSA) tryptic digestion was investigated Twenty-seven tryptic peptides were identified

from MALDI mass spectra after enriching with methanol while only two tryptic peptides were

identified after the standard MWCO protocol The strategy presented here enhances recovery

from MWCO separation for sub-microg peptide samples

INTRODUCTION

Molecular weight cut-off membrane-based centrifugal filter devices (MWCO) are

commonly used to desalt and concentrate large molecular weight proteins [1] Greeing and

Simpson recently investigated various MWCO membranes for large amounts of starting material

(~6 mg) focusing on optimal conditions for the sub 25 kDa protein fraction [2] The authors

recovered 200 μg to 29 mg of protein from multiple MWCO experiments and demonstrated that

a 1090 acetonitrile (ACN)H2O elution solvent produced optimal results [3] In addition Manza

et al provided an alternative approach to isolate proteins with a 5 kDa MWCO by using

NH4HCO3 and recovering the retained proteins [4] Alternatively the elution from MWCOs can

be collected to recover only low molecular weight peptides Multiple peptidomic studies have

utilized MWCOs for peptide isolation during the first few steps of sample preparation [5 6]

When sample amount is limited or peptide content is below 1 microg sample loss is a significant

concern when using MWCOs to isolate endogenous peptides Optimized protocols have been

189

investigated using ACN [3 7] salt [4 8] SDS [5] or native sample [6 9 10] but these

experiments primarily focused on large sample amounts rather than sub-microgram peptide

quantities

MWCOs separate large molecules from small molecules The small molecule fraction

may be rich with signaling peptides (SP) cytokines and other small molecules involved in cell-

cell signaling Signaling peptides perform various functions in the body including cell growth

cell survival and hormonal signaling between organs [11] Individual SP contribute to different

aspects of behavior such as pain (enkephalins) [12] feeding (neuropeptide Y) [13] and blood

pressure (bradykinin) [14] MWCO separations can be used to enrich biologically important SP

and explore the peptide content from biological fluids with relatively low peptide content like

blood or cerebrospinal fluid (CSF) In a recent investigation the detection of neuropeptides and

standards in crustacean hemolymph was improved when methanol and protease inhibitors were

present before performing MWCO neuropeptide isolation The impact of methanol on MWCO

sample loss was not investigated in the study [15] In another study a large-scale mass

fingerprinting protocol of endogenous peptides from CSF used a combination of salts before

MWCO fractionation but the impact of adding salts was not discussed [16] The most

commonly used brand of MWCO in the publications and in peptidomic studies is Millipore

Therefore Millipore MWCOs (using regenerated cellulose as the membrane) were used in the

present study The purpose of this work is to provide an optimized sample preparation technique

for MWCO filtering to reduce sample loss and allow sub-microg detection of peptides using MALDI

mass spectrometry

MATERIALS AND METHODS

190

Materials and Chemicals

Water acetonitrile methanol (optima LCMS grade) and sodium chloride (995) were

purchased from Fisher Scientific (Fair Lawn NJ) The α-cyano-4-hydroxy-cinnamic acid (99)

formic acid (FA) (ge98) and bovine serum albumin (ge96) were purchased from Sigma-

Aldrich (St Louis MO) Amicon Ultra 05 mL 10000 MWCO centrifugal filters and ZipTips

packed with C18 reversed-phase resin were purchased from Millipore (Billerica MA) Trypsin-

digested bovine serum albumin (BSA) was purchased from Waters (Milford MA) Bradykinin

was purchased from American Peptide Company (Sunnyvale CA)

MALDI MS Instrumentation

An AutoFLEX III MALDI TOFTOF mass spectrometer (Bruker Daltonics Billerica

MA) was operated in positive ion reflectron mode The MALDI MS instrument is equipped with

a proprietary smart beam (Bruker Daltonics Billerica MA) laser operating at 200 Hz The

instrument was internally calibrated over the mass range of mz 500minus2500 using a standard

peptide mix Two thousand laser shots were collected per sample spot at an accelerating voltage

of 19 kV and a constant laser power using random shot selection The acquired data were

analyzed using FlexAnalysis software (Bruker Daltonics Billerica MA) Mass spectrometry

data acquisition was obtained by averaging 2000 laser shots

Molecular weight cut off separation procedure

The MWCO separations were performed using Amicon Ultra 05 mL 10000 MWCO

centrifugal filters (Billerica MA) Before MWCO separation three washing steps were

performed to remove contaminants from the filter The three washes were 500 μL of 5050

H2OMeOH 500 μL of H2O and 400 μL of the solution used for MWCO separation For the

191

100 H2O solution 1 microg of BSA or bradykinin was used for separation All the other MWCO

separation experiments used 500 ng of BSA or 100 ng or less of bradykinin The MWCO filter

was then centrifuged at 14000 rcf for 5 min at room temperature in an Eppendorf 5415 D

microcentrifuge (Brinkmann Instruments Inc Westbury NY) The filtrate was concentrated in a

Savant SC 110 SpeedVac concentrator (Thermo Electron Corporation West Palm Beach FL)

and acidified The resulting sample was desalted according to the manufacturer using C18

ZipTips from Millipore (Billerica MA) by washing the ZipTip with three times 100 ACN

three aqueous washes of 01 FA binding the peptides from the solution and one aqueous wash

of 01 FA Peptides were then eluted from the ZipTips using 15 microL of 50 ACN in 01 FA

Matrix deposition

Equal volumes of 05 μL from the 15 μL sample solution (including standards not subject

to MWCO filtering) and α-cyano-4-hydroxy-cinnamic acid (CHCA) matrix solution in 50

ACN50 H2O were mixed using a dried-droplet method and spotted on a MALDI target The

resulting droplets were allowed to air dry prior to mass spectrometry acquisition

RESULTS AND DISCUSSION

Analysis of two orders of magnitude increase for bradykinin standard

Bradykinin was selected to assess the potential peptide loss in the flow-through after

performing MWCO separation As shown in Figure 1 1 microg of bradykinin standard does not

produce a detectable signal by MALDI mass spectrometry analysis after a 10 kDa MWCO

separation in water (performed in triplicate) For comparison 1 ng of bradykinin standard

diluted to 15 microL produced an intense signal on the MALDI mass spectrometer suggesting

192

significant sample loss occurs when the target analyte is low in quantity (data not shown

performed in triplicate) Figure 1 shows that the addition of a salt in this case NaCl improves

the limits of detection and decreases sample loss when 7030 watermethanol was compared to

7030 aqueous 1 M NaClmethanol The reproducible results gave a relative standard deviation

(RSD) of 6 for peak intensity Figure 1 shows that even when starting with 1 μg of bradykinin

too much sample is lost during the MWCO separation in water to detect the remainder

However an intense signal is observed for 10 ng of bradykinin after the MWCO separation when

7030 aqueous 1 M NaClmethanol is used as an elution solvent Sample loss from zip-tipping

was estimated in triplicate by calculating the decreases in peak intensity When 10 ng of

bradykinin was used sample loss was ~41 In comparison the calculated yield for 10 ng of

bradykinin after the 7030 aqueous 1 M NaClmethanol MWCO separation and zip-tip desalting

showed an estimated sample loss of 63 meaning more loss can be attributed to sample clean-

up than MWCO filtration

A series of experiments were performed to determine if 7030 aqueous 1 M

NaClmethanol is an optimal solution to recover peptides during a MWCO separation (data not

shown) A 5050 aqueous 1 M NaClmethanol and a 5050 watermethanol elution were

performed in duplicate but signal intensity of the resulting bradykinin was poor and numerous

polymer peaks were detected in the flow through A lower salt concentration 01 M NaCl was

used but also produced greatly reduced signal when compared to aqueous 1 M NaCl To assess

the optimized methodrsquos compatibility with lower sample amounts the 7030 aqueous 1 M

NaClmethanol solution was added to 1 ng of bradykinin for MWCO separation but no signal

was obtained (data not shown) Using a neuropeptide standard the addition of methanol and

NaCl salt significantly improved the sample recovery in sub-microg amounts

193

BSA tryptic peptide mixture analysis

After demonstrating the importance of using an optimized solution for MWCO

separations with an individual peptide the new method was applied to 500 ng of BSA tryptic

digest to investigate its utility with more complex peptide mixtures Table 1 lists the BSA

tryptic peptides identified in the MALDI MS analysis from different solution conditions

processed by MWCO separation As shown in Table 1 a directly spotted BSA tryptic peptide

standard in the absence of any MWCO filtration enabled identification of 39 tryptic peptides by

accurate peptide mass measurements Once again when using 100 H2O for MWCO

separations the starting amount was doubled to 1 microg (also done with 500 ng data not shown)

However many tryptic peptides were not detected due to low signal intensities and non-optimal

elution conditions Instead of H2O a 1 M NaCl solution was used for the MWCO elution but

only two tryptic peptides were identified (Table 1) The addition of 30 methanol into the

sample before MWCO filtration produced the first increase in identified BSA tryptic peptides

The remaining data from Table 1 shows improved BSA tryptic peptide identifications as the

sample (elution) conditions were further optimized Figure 2 shows the actual mass spectra

associated with the three most promising elution solutions along with 100 H2O

The BSA tryptic peptide intensities are shown in Figure 2A and the most intense tryptic

peptide YLYEIAR mz 92749 was observed in the four different solutions shown in Figure 2B

but not in the 100 H2O or 100 1 M NaCl solutions (data not shown) In Figure 2A all mass

spectra are normalized to an intensity of 7 x 104 to illustrate two points First the MWCO

filtering step still produced sample loss regardless of the solvent conditions chosen Second

there is a noticeable increase in peptide peak intensity using the optimized solvent 6040

194

aqueous 1 M NaClmethanol (Figure 2A) Figure 2C displays a zoomed-in view of a BSA

tryptic peptide signal LKECC

DKPLLEK mz 153266 (

carbamidomethyl) observed only in

the optimized solvent The detection of the mz 153266 peptide in Figure 2C highlights the

potential gain in sample and detectable peptides by using an optimized saltMeOH combination

A non-optimized saltMeOH combination will still reduce sample loss but further minimizing

sample loss during sample preparation will always be desirable in any analytical protocol

MWCO composition

The purpose of this application note is to provide evidence of sub-microg sample loss during

MWCO separations of peptide samples and a solution to overcome this limitation The

explanation of why adding MeOH and NaCl to the sample solution provides a significant

reduction in sample loss is beyond the scope of this application note Regardless Supplemental

Table S1 is an expanded version of Table 1 showing the amino acid sequence hydrophobicity

calculated using GRAVY scores and pI of the identified peptides in this study No discernible

trend was obtained from the data The membrane of commonly used MWCO in peptidomics and

for this study is comprised of chemically treated (regenerated) cellulose which is a

polysaccharide containing β (1rarr4) linked D-glucose Glucose has numerous free hydroxyl

groups which could non-specifically adsorb peptides flowing through the MWCO The addition

of MeOH has the most significant effect on signal which could be due to disrupting the

interaction between peptides and hydroxyl groups from glucose NaCl has a less significant

effect on sample recovery compared to MeOH but a detectable reduction in sample loss is noted

This improvement in sample recovery could be analogous to the use of NaCl in

195

immunodepletion protocols to reduce non-specific binding which is accomplished by adding

150 mM NaCl [17]

Analysis of bradykinin in the presence of undigested BSA

When using MWCO for peptide isolation proteins are typically present in the samples

usually in larger amounts Figure 3 shows the effect that adding BSA protein to a 10 ng

bradykinin solution before MWCO fractionation has on the resulting recovery of bradykinin

Adding 10 μg of BSA to the optimized 7030 aqueous 1 M NaClmethanol solution slightly

decreased bradykininrsquos signal with a RSD of 10 More severe signal reduction occurred after

adding 100 microg BSA with a RSD of 2 (N=2) It is not unexpected that more signal reduction

due to sample loss would occur especially in the 100 microg BSA sample because the BSA protein

has a molar ratio of 160 BSA protein molecules to one bradykinin peptide Figure 3 shows the

usefulness of the MWCO method with samples containing large amounts of proteins

RecommendationConclusion

The present work provides solutions to reduce sample loss from the use of MWCO for

sub-microg peptide isolation with or without non-digested proteins in the sample Despite its

widespread utility significant sample loss often occurs during the MWCO fractionation step

which is particularly problematic when analyzing low-abundance peptides from limited starting

material This application note aims to reduce sample loss during MWCO separations

specifically for sub-microg peptide isolation If complex samples are processed with MWCO

separation the authors recommend eluting the sample with 6040 aqueous 1 M NaClmethanol

solution as a starting point to minimize sample loss This application note provides a viable

196

alternative for sub-microg peptide MWCO separation circumventing the need to increase the starting

material by minimizing sample loss from using a MWCO membrane-based centrifugal filter

device

References

[1] HM Georgiou GE Rice MS Baker Proteomic analysis of human plasma failure of

centrifugal ultrafiltration to remove albumin and other high molecular weight proteins

Proteomics 2001 1 1503

[2] DW Greening RJ Simpson Low-molecular weight plasma proteome analysis using

centrifugal ultrafiltration Methods Mol Biol 2011 278 109

[3] DW Greening RJ Simpson A centrifugal ultrafiltration strategy for isolating the low-

molecular weight (ltor=25K) component of human plasma proteome J Proteomics 2010 73

637

[4] LL Manza SL Stamer AJ Ham SG Codreanu DC Liebler Sample preparation and

digestion for proteomic analyses using spin filters Proteomics 2005 5 1742

[5] D Theodorescu D Fliser S Wittke H Mischak R Krebs M Walden M Ross E Eltze O

Bettendorf C Wulfing A Semjonow Pilot study of capillary electrophoresis coupled to mass

spectrometry as a tool to define potential prostate cancer biomarkers in urine Electrophoresis

2005 26 2797

[6] K Antwi G Hostetter MJ Demeure BA Katchman GA Decker Y Ruiz TD Sielaff LJ

Koep DF Lake Analysis of the plasma peptidome from pancreas cancer patients connects a

peptide in plasma to overexpression of the parent protein in tumors J Proteome Res 2009 8

4722

[7] LP Aristoteli MP Molloy MS Baker Evaluation of endogenous plasma peptide extraction

methods for mass spectrometric biomarker discovery J Proteome Res 2007 6 571

[8] A Zougman B Pilch A Podtelejnikov M Kiehntopf C Schnabel C Kumar M Mann

Integrated analysis of the cerebrospinal fluid peptidome and proteome J Proteome Res 2008 7

386

[9] X Yuan DM Desiderio Human cerebrospinal fluid peptidomics J Mass Spectrom 2005 40

176

[10] X Zheng H Baker WS Hancock Analysis of the low molecular weight serum peptidome

using ultrafiltration and a hybrid ion trap-Fourier transform mass spectrometer J Chromatogr A

2006 1120 173

[11] L Li JV Sweedler Peptides in the brain mass spectrometry-based measurement approaches

and challenges Annu Rev Anal Chem 2008 1 451

[12] GB Stefano G Fricchione Y Goumon T Esch Pain immunity opiate and opioid

compounds and health Med Sci Monit 2005 11 MS47

[13] J Jensen Regulatory peptides and control of food intake in non-mammalian vertebrates Comp

Biochem Physiol A Mol Integr Physiol 2001 128 471

197

[14] A Kuoppala KA Lindstedt J Saarinen PT Kovanen JO Kokkonen Inactivation of

bradykinin by angiotensin-converting enzyme and by carboxypeptidase N in human plasma Am

J Physiol Heart Circ Physiol 2000 278 H1069

[15] R Chen M Ma L Hui J Zhang L Li Measurement of neuropeptides in crustacean

hemolymph via MALDI mass spectrometry J Am Soc Mass Spectrom 2009 20 708

[16] H Jahn S Wittke P Zurbig TJ Raedler S Arlt M Kellmann W Mullen M Eichenlaub H

Mischak K Wiedemann Peptide fingerprinting of Alzheimers disease in cerebrospinal fluid

identification and prospective evaluation of new synaptic biomarkers PLoS One 2011 6

e26540

[17] NA Cellar AS Karnoup DR Albers ML Langhorst SA Young Immunodepletion of high

abundance proteins coupled on-line with reversed-phase liquid chromatography a two-

dimensional LC sample enrichment and fractionation technique for mammalian proteomics J

Chromatogr B Analyt Technol Biomed Life Sci 2009 877 79

198

Table 1 Identified BSA tryptic peptides from various MWCO separation conditions

BSA tryptic

peptide (MH+)

100

H2O 1microg

100

1 M NaCl

70

H2O

80

1 M NaCl

70

1 M NaCl

60

H2O

60

1 M NaCl

5083

5453

6894

7124

8985

9275

10345

10725

11385

11636

12496

12837

13057

13997

14157

14197

14398

14636

14798

15026

15118

15328

15547

15677

15768

16399

16678

16738

17248

17408

17477

17497

18809

18890

19019

19079

20450

21139

22479

Total 39 2 2 6 8 15 15 27

199

Figure 1 Representative MALDI mass spectra after MWCO separation of a bradykinin

standard showing improvement over two orders of magnitude in detection limits Each MWCO

separation was performed at minimum in triplicate with representative spectrum selected for

each with a calculated RSD from the peak heights Three different amounts of bradykinin were

tested to assess the magnitude of sample loss under different MWCO solvent conditions The

top panel shows 1 microg of bradykinin standard after MWCO separation with 100 H2O elution

produced no signal The addition of 40 or 30 MeOH produced very low bradykinin signals

for both 100 ng (RSD of 28) and 10 ng (SNlt3 no RSD calculated) respectively In the

bottom two spectra each showed very large intensity but the 7030 aqueous 1 M NaClmethanol

10 ng bradykinin was processed with a 10kDa MWCO and zip-tipped and was reproducible with

200

a RSD of 6 The last spectrum was from 10 ng bradykinin (RSD of 3) which was diluted to

an equivalent volume as all the other experiments and directly spotted onto the MALDI plate

201

Figure 2 Representative MALDI mass spectra from MWCO separation of a BSA tryptic

peptide standard showing sample loss Stacked mass spectra from mz range 875-2150

normalized to 7 x 104 intensity representing the detection difference from a BSA tryptic peptide

standard from different MWCO separation conditions (A) It should be noted that when the

solvent for MWCO elution was 100 H2O 1 microg of BSA tryptic peptides was processed instead

of 500 ng A zoomed in view of the most abundant BSA tryptic peptide detected YLYEIAR

mz 92749 (B) Various percentages of MeOH produced significant signal but addition of a salt

(1 M NaCl) increases the signal which is closest to a directly spotted BSA tryptic peptide

standard A zoomed in view of a representative low intensity BSA tryptic peptide detected

LKECC

DKPLLEK mz 153266 (C) The optimized solution to be used for MWCO filtration

202

6040 aqueous 1 M NaClmethanol was the only procedure that enabled the detection of the

tryptic peptide in Figure 2C which was also detected in the directly spotted BSA tryptic peptide

standard All experiments were performed a minimum of two times with nearly identical results

) Carbamidomethyl amino acid modification

ordm) Tryptic peptide identified in three of the spectra in Figure 2A

dagger) Tryptic peptide identified in two of the spectra in Figure 2A

) Tryptic peptide identified in a single spectrum in Figure 2A

203

Figure 3 Representative MALDI mass spectra after MWCO separation of a bradykinin

standard with a BSA protein present showing optimized solvent conditions minimized samples

losses Each experiment was performed in duplicate Two different amounts of BSA protein

were tested to assess the magnitude of sample loss caused by the presence of a protein The top

panel shows 10 microg of BSA protein and the second panel shows 100 microg of BSA protein added

while only 10 ng of bradykinin was added Detectable sample loss was observed when the BSA

protein was added but panel two shows that the amount of BSA protein was 1 x 104 greater

(equivalent to 160 fold molar excess) than bradykinin The last two spectra were controls using

a MWCO with the optimized solution in panel 3 and panel 4 using 10 ng bradykinin which was

diluted to an equivalent volume as all the other experiments and directly spotted onto the

MALDI plate

204

Supplemental Table 1 Expanded Table 1 including grand average of hydropathicity (GRAVY)

score theoretical pI and the sequence from the underlying amino acid sequence for the peptides

identified in the BSA digest The GRAVY and pI scores were obtained from ExPASy

Bioinformatics and modifications were not taken into consideration

(httpwebexpasyorgprotparam) Peptide assignments to the recorded peaks were done by

BSA

tryptic

peptide

(MH+)

GRAVY

score

Theoretical

pI

Sequence 100

H2O

1microg

100

1 M

NaCl

70

H2O

80

1 M

NaCl

70

1 M

NaCl

60

H2O

60

1 M

NaCl

5083 NA NA FGER

5453 0900 972 VASLR

6894 0267 979 AWSVAR

7124 -0950 647 SEIAHR

8985 0529 674 LcVLHEK

9275 -0071 600 YLYEIAR

10345 -0725 674 NEcFLSHK

10725 -0211 538 SHcIAEVEK

11385 0 599 ccTESLVNR

11636 0130 453 LVNELTEFAK

12496 -1250 545 FKDLGEEHFK

12837 0264 675 HPEYAVSVLLR

13057 -0582 532 HLVDEPQNLIK

13997 0567 437 TVMENFVAFVDK

14157 0567 437 TVmENFVAFVDK

14197 0058 530 SLHTLFGDELcK

14398 -0133 875 RHPEYAVSVLLR

14636 -0515 465 TcVADESHAGcEK

14798 0292 600 LGEYGFQNALIVR

15026 -0625 409 EYEATLEEccAK

15118 0207 597 VPQVSTPTLVEVSR

15328 -0617 617 LKEccDKPLLEK

15547 -0823 441 DDPHAcYSTVFDK

15677 -0085 437 DAFLGSFLYEYSR

15768 -0985 456 LKPDPNTLcDEFK

16399 -0067 875 KVPQVSTPTLVEVSR

16678 0064 437 MPCTEDYLSLILNR

16738 -1723 550 QEPERNEcFLSHK

17248 0064 437 MPcTEDYLSLILNR

17408 0064 437 mPcTEDYLSLILNR

17477 -0914 414 YNGVFQEccQAEDK

17497 -0621 410 EccHGDLLEcADDR

18809 -0537 606 RPcFSALTPDETYVPK

18890 -0567 674 HPYFYAPELLYYANK

19019 -1275 466 NEcFLSHKDDSPDLPK

19079 0044 454 LFTFHADIcTLPDTEK

20450 -0812 839 RHPYFYAPELLYYANK

21139 -0682 480 VHKEccHGDLLEcADDR

22479 -0458 423 EccHGDLLEcADDRADLAK

Total 39 2 2 6 8 15 15 27

205

mass matching with no tandem mass spectrometry performed Lower case amino acids indicate

a modification present in the peptide of carbamidomethyl (c) or oxidation (m)

206

Chapter 8

Conclusions and Future Directions

207

Summary

Comparative shotgun proteomics investigating numerous biological changes in various

species and sample media and peptidomic method development have been reported The

developed comparative shotgun proteomics based on label-free spectral counting with nanoLC

MSMS platform have been employed in mouse CSF rat CSF yeast culture and other biological

specimens Mass spectrometry (MS)-based peptidomic enhancements using ETD and improved

sample preparation methods for molecular weight cut-offs have been reported Together these

studies add to the Li Labs capabilities in comparative shotgun proteomics and expand available

methods for peptidomic research

Immunodepletion of CSF for comparative proteomics

Chapters 3 and 4 used similar methods to generate a list of differentially expressed

proteins providing insight into how Alexander disease (AxD) perturbs the proteome and how the

new prion model rat adapted scrapie (RAS) proteome varies from control rats In the GFAP

overexpressor mouse CSF study in Chapter 3 we have produced a panel of proteins with

significant up or down regulation in the CSF of transgenic GFAP overexpressor mice MS-based

proteomic study of this mouse model for AxD was consistent with the previous studies showing

elevation of GFAP in CSF The development of a modified IgY-14 immunodepletion technique

for low amounts of CSF with recommendations for future antibody depletion techniques to deal

with the unique challenges of mouse CSF was presented Modified proteomics protocols were

employed to profile mouse CSF with biological and technical triplicates addressing the

variability and providing quantitation with dNSAF spectral counting Validation of the data was

performed using both ELISA and RNA microarray data to provide corroboration with the

208

changes in the putative biomarkers This work presents numerous interesting targets for future

study in AxD including CK-MCK-B cathepsin S L1 and B isoforms and contactin-1

Using the protocol developed in Chapter 3 we performed a similar study in RAS CSF

compared to control rat CSF Two differences in sample preparation for the rat CSF compared

to the mouse CSF are noted (1) rat IgY-7 immunodepletion was performed and (2) each rat

CSF sample was collected from one animal due to sufficient volume instead of pooling from

multiple animals for the mouse CSF sample After immunodepletion the CSF samples from

control and RAS (biological N=5 technical replicates N=3) were digested and separated using

one dimensional RP nanoLC separation We were able to identify 167 proteins with redundant

isoforms removed which was derived from total 512 proteins with a 1 FDR from all the CSF

samples Comparative analysis was performed using dNSAF spectral counting and 21 proteins

were significantly changed Our data were consistent with previous prion CSF studies showing

14-3-3 protein increased in CSF of prion infected animals Genomic analysis was also

performed and was used to cross-validate our proteomic data and numerous proteins were found

to be consistent including a novel marker related to prion disease RNAset2 (ribonuclease T2)

In summary this work provides a foundation for investigation of the perturbed proteome of a

new prion model RAS

Yeast Phosphoproteomic Comparison between Starved and Glucose Fed Conditions

This work presented a qualitative comparison of the phosphoproteome between starved

and glucose fed Saccharomyces cerevisiae (bakers yeast) A large scale IMAC enrichment of

yeast cell lysate followed by 2-D separations provided a rich source of phosphopeptides for CID

MSMS and neutral loss triggered ETD analysis using mass spectrometry A specific motif for

PKA was highlighted to show the differences in proteins identified between starved and glucose

209

fed conditions In total 477 unique phosphopeptides were identified with 06 FDR and 669

unique phosphopeptides identified with 54 FDR Phosphosite validation was performed using

a localization algorithm Ascore to provide further confidence on the site-specific

characterization of these phosphopeptides Two proteins Ssd1 and PMA2 emerged as potential

intriguing targets for more in-depth studies on protein phosphorylation involved in glucose

response

Methods for Peptide Sample Preparation and Sequencing

In this study ETD was performed to improve the sequence coverage of endogenous large

neuropeptides and labile tyrosine sulfation CCK-like neuropeptides found in the blue crab

Callinectes sapidus The intact CPRP from Callinectes sapidus was identified and characterized

with 68 sequence coverage by MS for the first time Cation assisted ETD fragmentation using

MgCl2 was also shown to be a powerful tool in de novo sequencing of sulfated neuropeptides

These endeavors into using ETD for certain neuropeptides will assist in future analysis of large

neuropeptides and PTM containing neuropeptides

In addition to ETD sequencing I presented a protocol on improving recovery of minute

quantities of peptides by adding methanol and a salt modifier (NaCl) to molecular weight cut-off

membrane-based centrifugal filters (MWCO) to enrich sub-microgram peptide quantities

Despite its widespread utility significant sample loss often occurs during the MWCO

fractionation step which is particularly problematic when analyzing low-abundance peptides

from limited starting material This work presented a method to reduce sample loss during

MWCO separations specifically for sub-microg peptide isolation Using a neuropeptide standard

bradykinin sample loss was reduced by over two orders of magnitude with and without

210

undigested protein present The presence of bovine serum albumin (BSA) undigested protein

and the bradykinin standard lends evidence that sample loss occurs because of the MWCO and

not the presence of the protein Additionally a BSA tryptic digestion is presented where twenty-

seven tryptic peptides are identified from MALDI mass spectra after enriching with methanol

while only two tryptic peptides are identified after the standard MWCO protocol

Ongoing Projects and Future Directions

CSF Projects

Rat Adapted Scrapie and Time Course Study of Rat CSF

In ongoing experiments from the work described in Chapter 4 related to rat adapted

scrapie (RAS) we are working towards more complete proteome coverage of CSF and a time

course study of RAS After the promising results of the 1-D proteomic comparison between

RAS and control CSF we want to perform IgY immunodepletion on 1 mL of rat CSF followed

by 2-D high pH-low pH RP-RP LC-MSMS IgY immunodepletion is still required and

afterwards approximately 40 microg of low abundance protein would remain Following traditional

urea tryptic digestion (see Appendix 1) and solid phase extraction (SPE) clean-up the sample

would be dried down and reconstituted in mobile phase A (25 mM ammonium formate basic

pH) for high pH RP separation Fractions would be dried and subjected to MS analysis similar to

the work described in Chapter 4 The purpose of this work would be to increase the proteome

coverage from a few hundred to a thousand CSF proteins identified A time course study of RAS

is also desirable to gain insight into disease progression Rats at different stages will be

sacrificed and CSF will be collected The goal is to perform isobaric labeling between the time

courses with spectral counting being an alternative for relative protein expression We will use

the targets identified in Chapter 4 to track certain proteins for time course analysis Overall

211

these future projects will dig deeper into the proteome and how this novel prion model RAS

perturbs the proteins expressed in rats over time

Glycoproteomics and Peptidomic Analysis of CSF Samples from Individuals with

Alzheimerrsquos Disease

Alzheimerrsquos disease is an incurable progressive neurodegenerative disorder which results

in cognitive impairment Our aim is to provide additional biomarkers andor targets for drug

treatment Currently we used 500 microL of human CSF for non-membrane glycoprotein

enrichment (see Appendix 1) followed by 2-D high-pH-low-pH RP-RP amaZon ion trap LC-

MSMS analysis The initial work was a failure due to low amount of signal and significant

sample loss (data not shown) Through a comparison with Dr Xin Weirsquos PhD thesis we

estimated roughly 20 microg of proteins was obtained after glycoprotein enrichment and 1-D analysis

produced over 100 protein identifications (data not shown) but the additional off-line 2-D

separation and sample clean up resulted in low number of protein identifications for each fraction

analyzed by MS (data not shown) We have recently obtained 1 mL of human CSF samples

from healthy controls and individuals suffering from Alzheimerrsquos disease and plan to perform

the same experiments with double the starting amount and reduce the fractions collected from 2-

D separation from 11 to 6 In addition to the glycoproteomics anything not enriched will be

subjected to MWCO separation using the method presented in Chapter 7 The obtained peptide

sample will be subjected to Q-TOF nanoLC MSE data acquisition followed by de novo

sequencing using various programs including PEAKS and Mascot Collectively we feel this

project has great potential to lead to interesting targets and further expand the proteomic

knowledge of Alzheimerrsquos disease

GFAP Knock-in Mouse CSF

212

In a direct follow-up to Chapter 3 we aim to perform comparative proteomics on control

vs glial fibrillary acidic protein (GFAP) knock-in mouse CSF samples The sample preparation

protocol as presented in Chapter 3 will be used For quantitative proteomics we plan on

performing isobaric labeling followed by performing high energy collision induced dissociation

(HCD) on the Orbitrap Elite The MS acquisition will also perform a top ten scan where the top

ten MS peaks will be selected for tandem MS analysis In addition targeted quantitation of

specific proteins using multiple reaction monitoring (MRM) can be performed on potential

biomarkers and identified in this follow-up study and the work presented in Chapter 3 Also any

CSF samples with noticeable blood content cannot be used for the exploratory proteomics

experiments but can potentially be used for the MRM analysis and should be kept for such

experiments in the future

Large Scale Proteomics

Proteomics of Mouse Amniotic Fluid for Lung Maturation

The overall goal of this project is to determine what proteins are present in amniotic fluid

when the embryo is 175 weeks compared to amniotic fluid at 155 weeks The rationale behind

why these two time points matter was investigated through a lung explant culture where amniotic

fluid was added at 155 weeks and 175 weeks Visual maturation of lungs occurred with the

175 week amniotic fluid and lung maturation genes were up regulated in the mRNA in the lung

explant culture when compared to the 155 week amniotic fluid The compound which is

causing the maturation of the lungs is unknown and search for a secreted protein might provide a

clue to this process In order to test this hypothesis we carried out discovery proteomics

experiments The workflow involves tryptic digestion SPE and 1-D low pH nanoLC separation

coupled to an amaZon ETD ion trap for tandem MS analysis The resulting mass spectrometric

213

acquisition identified less than 100 proteins with a 1 FDR (data not published) To address the

lack of depth in the proteome coverage we purchased an IgY immunodepletion column to

remove the most abundant proteins in amniotic fluid similarly to Chapter 3 and 4 but on a larger

scale One abundant protein alpha-fetoprotein is present in amniotic fluid but absent or present

in very low concentration in CSF serum or plasma Alpha-fetoprotein is similar to albumin and

thus we ran amniotic fluid on an IgY immunodepletion column and observed significant

reduction in alpha-fetoprotein amount (data not published) After immunodepletion 2-D high

pH-low pH RP-RP will be performed followed by nanoLCMSMS on the amaZon ETD ion trap

We will perform mass spectrometry experiments using the 155 week amniotic fluid and 175

week amniotic fluid which have average protein contents ~2 mgmL We anticipate a minimum

of tenfold increase in the proteome coverage of amniotic fluid and provide meaningful

hypothesis driven biological experiments from this work

Phosphoproteomics of JNK Activation

c-Jun N-terminal kinase (JNK) is an important cellular mediator of stress activated

signaling Under conditions of oxidative stress JNK is activated resulting in the downstream

phosphorylation of a large number of proteins including c-Jun However the cellular response

to JNK activation is extremely complex and JNK activation can result in extremely different

physiological outcomes For example JNK is at the crossroads of cellular death and survival

and exhibits both pro-death and pro-survival activities These highly disparate outcomes of JNK

activation are highly contextual and depend on the type of stressor and duration of stress In the

brain JNK has been shown to be activated in models of Alzheimerrsquos disease Parkinsonrsquos

disease amyotrophic lateral sclerosis and stroke It is thought that JNK is activated in these

diseases as a result of oxidative stress and it is unknown whether this activation is pro-survival or

214

pro-death To elucidate JNK signaling in the brain we chose to analyze primary cortical

astrocytes under conditions of oxidative stress Since hydrogen peroxide is a physiologically

relevant oxidant in the brain and known to activate JNK we chose to use this to treat astrocytes

and then analyze changes to the phosphoproteome by mass spectrometry By doing this we

hope to elucidate important JNK-dependent targets involved in stress signaling in the brain and

that identifying these targets could lead to the identification of novel disease mechanisms and

potentially new therapeutic targets for neurodegeneration Specifically we plan on performing

stable isotope labeling by amino acids (SILAC) of astrocytes for control hydrogen peroxide

treated and hydrogen peroxide treated with JNK inhibitor The SILAC labeled astrocyte cell

lysate will be digested enriched using Fe-NTA IMAC and separated using 2-D high pH-low pH

RP-RP and infused into the amaZon ETD ion trap The protocol is similar to the yeast

comparative phosphoproteomics from Chapter 5 After analysis we plan on processing the data

using ProteoIQ to identify phosphoproteins with significant changes

Immunoprecipitation Followed by Mass Spectrometry

Stb3 Mass Spectrometry Analysis

Stb3 (Sin3-binding protein) has previously been shown to change location depending on

the presence or absence of glucose (Heideman Lab Dr Michael Conways Thesis) An

immunoprecipitation of Stb3 was performed in parallel to the work described in Chapter 5 Two

separate experiments were performed one with wild type Stb3 and another tagged with myc for

improved protein immunoprecipitation Myc tagging is a polypeptide tag which can be

recognized by an antibody and provide a better enrichment compared to using a Stb3 antibody

alone The myc tagging was done because of the low abundance of Stb3 and the limited amount

of protein that was pulled down by the antibody for Stb3 Immunopreciptation experiments were

215

performed for both starved and glucose fed samples All samples were tryptically digested

followed by low pH RP nanoLCMSMS Both CID and ETD were used for fragmentation

analysis of each sample (n=2) The Stb3 wild type had two peptide hits in one MS run which is

not surprising due to low Stb3 antibody affinity In contrast a significant amount of Stb3 was

pulled down from Myc tagged starved and glucose fed samples For the glucose starved

samples 22 unique peptides from Stb3 were identified whereas glucose fed samples yielded 21

unique Stb3 peptides identified (unpublished data) The identified Stb3 in the myc samples

allowed us to investigate what other proteins were pulled down that are not present in the wild

type samples

From previous work by our collaborator Dr Heideman it had been suggested that Stb3

forms a complex with Sin3 and Rpd3 Our proteomic data shows multiple significant peptide

hits for Sin3 and Rpd3 in myc tagged but not in wild type where Stb3 is only observed once

with a low Mascot score When looking at the unique proteins identified in myc tagged glucose

fed sample but not included in the wild type samples the myc fed sample contained eight unique

ribosomal proteins with numerous unique peptides for each The ribosomal proteins identified in

myc fed sample supports the novel hypothesis that in the glucose fed state the complex of Stb3

Rpd3 and Sin3 interact with ribosomal proteins Other proteins unique to myc tagged glucose

starved samples are glyceraldehyde-3-phosphate dehydrogenase 2 and transcriptional regulatory

protein UME6 Also three proteins were observed in both myc fed and starved but not in the

wild type samples including transposon Ty1-DR1 Gag-Pol polyprotein(YD11B) SWIRM

domain-containing protein (FUN19) and RNA polymerase II degradation factor 1 (DEF1) Our

proteomics data for the Stb3 immunoprecipitation experiments with starved vs glucose fed

216

samples provide exciting evidence to support previous observation made by the Heideman group

and highlight the utility of MS-based approach to deciphering protein-protein interactions

Conclusions

The majority of the work described in this dissertation revolves around sample

preparation for proteomics and peptidomics with a focus on generating biologically testable

hypotheses In the area of neuropeptides CPRP was fully sequenced analytical methods were

transformed for use in an ETD ion trap and sub-microg peptides can now be analyzed by mass

spectrometry after MWCO separation In the field of comparative proteomics comparisons

between mouse CSF in GFAP overexpressors and control or rat adapted scrapie model and

control CSF yeast fed vs glucose starved cell extract have all been presented Collectively this

thesis has developed new techniques for neuropeptide sample preparation and presented

numerous comparative proteomic analyses of various biological samples and how the proteomes

are dynamically perturbed by various treatments and disease conditions

217

Appendix 1

Protocols for sample preparation for mass spectrometry based

proteomics and peptidomics

218

Small Scale Urea Tryptic Digestion (below 20 microg)

1 Concentratedilute sample to 10 μL starting volume

2 All solutions are in a 50mM NH4HCO3 buffer (395mgmL)

3 Add 1 μL of 05M Dithiothreitol (DTT) (77mg100μL) and 8 μL of urea solution

(400mg05mL) to the sample

4 Place sample in 37degC water bath for 45 minutes

5 Add 27μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

6 Quench reaction with 1μL of DTT solution and let sit for 10 minutes

7 Dilute with 70μL of NH4HCO3 solution

8 Use 1μL of trypsin (05μgμL)

9 Allow to digest overnight in 37degC water bath

10 Acidify with 10μL 10 formic acid

11 Perform solid phase extraction using tips dependent of sample amount

a Sub-5μg amounts ndash Millipore Ziptips

b 5-75μg amounts ndash Bond-Elut tips from Agilent (OMIX tips)

12 Dry down in Speedvac as needed for experiment

219

Small Scale ProteaseMAX Tryptic Digestion (below 20 microg)

1 Concentratedilute sample to 10 μL starting volume

2 All solutions are in a 50mM NH4HCO3 buffer (395mgmL)

3 Add 1 μL of 05M Dithiothreitol (DTT) (77mg100μL) and 2 μL of 1 of

ProtesaeMAX (Promega) to the sample

4 Place sample in 37degC water bath for 45 minutes

5 Add 27μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

6 Quench reaction with 1μL of DTT solution and let sit for 10 minutes

7 Dilute with 70μL of NH4HCO3 solution

8 Use 1μL of trypsin (05μgμL)

9 Add 1 μL ProteaseMAX and let sit for 3-4 hours

10 Acidify with 2μL 10 formic acid

11 Dry down in Speedvac as needed for experiment

220

Large Scale Urea Tryptic Digestion (mg of proteins)

1 All solutions are in a 50 mM NH4HCO3 buffer (395 mgmL)

2 Add 20μL of 05M Dithiothreitol (DTT) (77mg100μL) and 160μL of urea solution

(400mg05mL) to sample

3 Allow sample to denature 45-60 minutes in a 37degC water bath

4 Add 54μL of Iodoacetamide (IAA) (10mg100μL) and allow to react (in the dark) for

15 minutes

5 Quench reaction with 20μL of DTT solution

6 Dilute with 14mL of NH4HCO3 solution

7 Add 100μg of trypsin

8 Allow to digest overnight in 37degC water bath

9 Acidify sample with 100μL of 10 formic acid

10 Perform solid phase extraction with Hypersep C18 Cartridges with 100 mg of C18

bead volume (Thermo)

11 Dry down with the Speedvac as needed for experiment

221

Fe-NTA Preparation from Ni-NTA Beads

1 Centrifuge (5 min at 140 rcf) and use a magnet to keep beads in place as supernatant

is removed

2 Wash 3 times with 800μL of H2O (using same technique of centrifuging and using

magnet to keep beads in places as supernatant is removed)

3 Add 800μL of 100mM EDTA (372mgmL) in a 50mM NH4HCO3 (395mgmL)

buffered solution (pH around 8) and vortex for 30 minutes to chelate the Ni

centrifuge and remove supernatant

4 Wash 3 times with 800μL of H2O

5 Add 800μL of 10mM FeCl3 hexahydrate (27mgmL) and vortex for 30 minutes to

bind Fe to beads centrifuge and remove supernatant

6 Wash 3 times with 800μL H2O

7 Resuspend in a storage solution of 111 ACNMeOH001 Acetic Acid (1mL total)

222

Fe-NTA IMAC Phospho-enrichment

1 Use wash solution (80ACN 01Formic Acid) to rinse beads vortex 1 minute

centrifuge and remove supernatant

2 Add sample (around 500μL in 80ACN 01Formic Acid) and vortex 40 minutes to

allow sample to bind dispose of supernatant after centrifuging

3 Wash 3 times with 200μL of wash solution discard supernatant

4 Add 200μL of elution solution (5050 ACN5Ammonium Hydroxide) vortex 15

minutes and save supernatant

5 Add 200μL of elution solution vortex 10 minutes and save supernatant

6 Wash 2 time with wash solution (collect supernatant of first wash)

7 If reusing store in 1mL solution of 111 ACNMeOH001 Acetic Acid

223

High pH Off-line Separation

1) In general a minimum of 20 microg of peptides are needed to gain any benefit

from off-line 2D fractionation It is better to inject 100 microg of peptides on

column

2) Use a Gemini column or a similar column that can handle pH=10 and for this

protocol a 21 mm x 150 mm column was used

3) Prepare ldquobuffer Ardquo 25 mM NH4 formate pH=10 Also prepare ldquobuffer Brdquo

4) Dry down desired sample and reconstitute in buffer A

5) Inject based off sample loop (current Alliance HPLC has a 50 microL sample

loop)

6) Run gradient at bottom of the page collecting fractions every 3 minutes except

for the 1st minute which is the void volume

7) Optional If you want to reduce instrument time you can combine fractions 1

with 12 2 with 13 etc until 11 with 22

Time Mobile phase A Mobile phase B Flow Rate

05mlmin

0 98 2 05 mLmin

65rsquo 70 30 05 mLmin

65rsquo1rdquo 5 95 05 mLmin

70 5 95 05 mLmin

224

Non Membrane Glycoprotein Enrichment

1 For protocols involving membrane glycoprotein enrichment see Dr Xin Weirsquos

thesis

2 Add 75 microL of Wheat germ agglutinin (WGA) bound to agarose beads and 150 microL

of Concanavalin A (ConA) to a Handee spin cup (Thermo) Wash 2 times with

lectin affinity chromatography (LAC) binding buffer (015 M NaCl 002 M Tris-

HCl 1 mM CaCl2 pH=74) centrifuging at 400 g for 30 seconds

3 Place stopper on bottom of spin cup and add 200 microg of sample (500 microL for CSF)

Bring up to 300 microL using lectin LAC binding buffer

4 Incubate for 1 hour with continuous mixing at room temperature

5 Centrifuge at 400 g for 30 seconds

6 Perform two more 300 microL LAC binding washes followed by centrifugation

7 Add 300 microL LAC eluting buffer (0075 M NaCl 001 M Tris-HCl 02 M alpha-

methyl mannoside 02 M alpha-methyl glucoside and 05 M acetyl-D-

glucosamine) vortex for 10 minutes (have stopper in place while vortexing)

centrifuge and collect

7 Add another 300 microL LAC eluting buffer centrifuge and collect

225

MWCO separation for Sub-microg peptide concentrations

1 Wash molecular weight cut-off (MWCO) with 5050 waterMeOH centrifuge at

14000 g for 5 minutes for 10 kDa filter (time will vary Millipore Amicon Ultra

filters)

2 Wash with 100 water centrifuge at 14000 g for 5 minutes

3 Add methanol to the sample to get the concentration to 30 methanol and add

salt to achieve a roughly 05 M NaCl before adding the sample to the MWCO

4 Centrifuge at 14000 for 10 minutes collect flow through

226

Immunoprecipitation

Modified from Thermo Fisher Scientific Classic IP Kit

1 10 microL Polyclonal antibody added to 1 microg protein standard in a Handee spin cup

(Thermo) diluted with 1x TBS buffer to 400 microL and incubated overnight at on

shakerend-over-end rotator

2 Wash Protein GA beads with 2x 200microL TBS buffer and added to the

antibodysample for 15 hours at 4oC

3 Centrifuge at 400 g for 30 seconds and discard flow through

4 Wash 3x with 100microL TBS buffer centrifuge at 400 g for 30 seconds and discard

flow through

5 Wash with 1x conditioning solution (neutral pH buffer) centrifuge at 400 g for 30

seconds and discard flow through

6 Eluted with elution buffer (2x100microL) centrifuge at 400 g for 30 seconds and

collect flow through

227

C18 Solid Phase Extraction (SPE)

1 Determine amount of peptides for SPE ge5 microg use Ziptips from Millipore If

between 5-75 microg use OMIX from Agilent 100 microL version and if ge75 microg use SPE

cartridges such as 100 mg Hypersep from Thermo

2 Ensure no detergents are in the sample and it is acidified

3 The three SPE procedures all use the same sets of solutions only volumes vary

4 3x Wetting solution (100 ACN) for 60 microL OMIX (20 microL ziptip and 15 mL for

100 mg cartridge)

5 3x Wash solution (01 formic acid in 100 H2O) same volumes as 4

6 Draw up the sample about 10 times minimum (generally 30-40 is preferred)

without letting the bead volume dry out

7 1X Wash solution same volumes as 4

8 Back load the eluting solution 01 formic acid in 5050 H2OACN into the

Ziptips (15 microL) or OMIX (50 microL) tips For the SPE cartridge add 700 microL of

eluting solution

9 Dry down and prepare for next step in sample preparation

228

Laser Puller Programs and Protocols

1) Obtain about two feet of Fused-silica capillary with 75 μm id and 360 μm od

2) Wash with methanol and then air dry using the bomb

3) Cut into one foot or desired length

4) Use a lighter to burn the middle portion (about an inch in length) of the tubing

5) Remove the ash by rinsing with methanol and rubbing with a Kimwipe

6) Make sure the laser puller has been on for at least 30 minutes before use to allow

for the instrument to warm up

7) Place capillary in instrument with the burnedexposed portion in the center

making sure that the length of the capillary is pulled taut

8) Enter desired program (next page) and press pull

229

Laser Puller Programs

Program 99 (default lab program)

Heat Filament Velocity Delay Pull

250 0 25 150 15

240 0 25 150 15

235 0 25 150 15

245 0 25 150 15

Program 97 (developed for larger inner diameter tips)

Heat Filament Velocity Delay Pull

230 - 25 150 -

220 - 25 150 -

215 - 25 150 8

230

On column Immunodepletion (serum plasma CSF amniotic fluid)

1) Prepare buffer A ldquoDilution Bufferrdquo 10 mM Tris-HCl pH=74 150 mM NaCl

2) Prepare buffer B ldquoStripping Bufferrdquo 01 M Glycine-HCl pH=25

3) Prepare buffer C ldquoNeutralization Bufferrdquo 100 mM Tris-HCl pH=80

4) Determine loading amount (LC1=~1000 ug of serum or plasma less for CSF due

to the increased amount of albumin percentage in CSF)

5) Add Dilution buffer to sample before injection and ensure the sample is proper

pH (~7)

6) Use gradient below

Time A B C Flow Rate

(mLmin)

0rsquo 100 0 0 02

4rsquo59rdquo 100 0 0 02

5rsquo 100 0 0 05

8rsquo59rdquo 100 0 0 05

9rsquo 0 100 0 05

22rsquo 0 100 0 05

22rsquo1rdquo 0 0 100 05

39rsquo 0 0 100 05

7) Store the column in 1x dilution buffer until next use

231

Small Scale Immunodepletion (100 microL of CSF)

1) Use 75 microg (per 100 microL CSF) of Sigma Seppro IgY-14 or IgY-7 bead slurry

2) Add an equal volume of 2x dilution buffer (20 mM Tris-HCl pH=74 300 mM

NaCl) to the starting amount of CSF

3) Add to a spin cup with a filter and allow to mix for 30 minutes

4) Centrifuge at 400 g for 30 seconds and collect the flow through

5) Add 50 microL of 1x dilution buffer mix for 5 minutes centrifuge at 400 g for 30

seconds and collect the flow through

6) Use 1x stripping washes (01 M Glycine-HCl pH=25) centrifuge as before and

discard Repeat four times

7) Use 1x neutralization buffer (100 mM Tris-HCl pH=80) centrifuge as before

and discard Repeat two times

8) Store the beads in the spin column in 1x dilution buffer until next use

232

Alliance Maintenance Protocol

Seal Wash

10 methanol no acetonitrile This wash cleans behind the pump-head seals to

ensure proper lubrication Minimum once per week

1 On instrument interface navigate MenuStatus gt Diag gt Prime SealWsh gt Start

2 Press Stop after 5 minutes

Prime Injector

10 methanol for maintenance high organic solvent for dirty runs (eg 95

acetonitrile) Done before injecting any real samples to ensure no bubbles are

present in the injector Minimum once per week

1 On instrument interface navigate MenuStatus gt Diag gt Prime NdlWsh gt Start

2 After completion press Close

Purge Injector

Solvent is dependent on run Run this protocol at beginning of experiments each day

Minimum once per week for maintenance

1 On instrument interface navigate MenuStatus gt Status screen

2 Navigate Direct Function gt 4 Purge Injector gt OK

3 Set Sample loop volumes 60 leave Compression Check unchecked gt OK

Prime Solvent Pumps

Solvent is dependent on run If solvents are changed run this protocol Minimum

once per week for maintenance

1 On instrument interface navigate MenuStatus gt Status screen

2 Using arrow keys choose composition A type 100 Enter x4

3 Navigate Direct Function gt 3 Wet Prime gt OK

4 Set Flow Rate 7000 mLmin Time 100 min gt OK

5 Repeat for all changedactive solvent pumps

Condition Column

Dependent on user Use starting conditions for run

1 On instrument interface navigate MenuStatus gt Status screen

2 Using arrow keys type starting solvent compositions for run

3 Navigate Direct Function gt 6 Condition Column gt OK

4 Set Time as desired

233

Appendix 2

List of Publications and Presentations

234

PUBLICATIONS

ldquoDiscovery and characterization of the crustacean hyperglycemic hormone precursor related

peptides (CPRP) and orcokinin neuropeptides in the sinus glands of the blue crab Callinectes

sapidus using multiple tandem mass spectrometry techniquesrdquo Hui L Cunningham R Zhang

Z Cao W Jia C Li L J Proteome Res 2011 10(9) 4219-29

ldquoInvestigation and reduction of sub-microgram peptide loss using molecular weight cut-off

fractionation prior to mass spectrometric analysisrdquo Cunningham R Wang J Wellner D Li L

Journal of Mass Spectrometry In Press

ldquoMass spectrometry-based proteomics and peptidomics for systems biology and biomarker

discoveryrdquo Cunningham R Ma D Li L Frontiers in Biology 2012 7(4) 313-35

ldquoProtein changes in immunodepleted cerebrospinal fluid from transgenic mouse models of

Alexander disease detected using mass spectrometryrdquo Cunningham R Jany P Messing A Li

L Journal of Proteome Research Submitted

ldquoInvestigation of the differences in the phosphoproteome between starved vs glucose fed

Saccharomyces cerevisiaerdquo Cunningham R Grunwald D Wellner D Conway M Heideman

W Li L In preparation

ldquoIdentification of astrocytic JNK targets by phosphoproteomics using mass spectrometryrdquo

Cunningham R Dowell J Wang J Wellner D Li L Milhelm M In preparation

ldquoGenomic and proteomic profiling of rat adapted scrapierdquo Herbst A Cunningham R Wellner

D Wang J Ma D Li L Aiken J In preparation

235

INVITED SEMINARS AND CONFERENCE PRESENTATIONS

Robert Cunningham Davenne Mayour and Shoshanna Coon ldquoMorphology and Thermal

Stability of Monolayers on Porous Siliconrdquo The 231th

ACS National Meeting 2006 Atlanta

GA

Robert Cunningham Xin Wei Paige Jany Albee Messing and Lingjun Li ldquoMass

Spectrometry-Based Analysis of Cerebrospinal Fluid Peptidome and Proteome for Biomarker

Discovery in Alexander Diseaserdquo The 57th

ASMS Conference 2009 Philadelphia PA

Robert Cunningham ldquoWhat to Expect in a PhD Graduate Schoolrdquo Invited seminar University

of Northern Iowa 2010 Cedar Falls IA

Robert Cunningham Dustin Frost Albee Messing and Lingjun Li ldquoInvestigation of an

Optimized ProteaseMAX Assisted Trypsin Digestion of Human CSF for Pseudo-SRM

Quantification of GFAP and Identification of Biomarkersrdquo The 58th

ASMS Conference 2010

Salt Lake City UT

Robert Cunningham Paige Jany Albee Messing and Lingjun Li ldquoMass Spectrometry-Based

Analysis of GFAP Overexpressor Micersquos Cerebrospinal Fluid for Protein Biomarker Discovery

in Alexander Diseaserdquo Oral presentation at Pittcon Conference and Expo 2011 2011 Atlanta

GA

Robert Cunningham Michael Conway Daniel Wellner Douglas Grunwald Warren

Heideman and Lingjun Li ldquoDevelopment of optimized phosphopeptide enrichment methods for

comparison of starved and glucose fed yeast Saccharomyces cerevisiaerdquo The 59th

ASMS

Conference 2011 Denver CO

Robert Cunningham Paige Jany Albee Messing and Lingjun Li ldquoMass Spectrometry-Based

Analysis of GFAP Overexpressor Micersquos Cerebrospinal Fluid for Protein Biomarker Discovery

in Alexander Diseaserdquo 2011 Wisconsin Human Proteomics Symposium 2011 Madison WI

Page 6: Mass Spectrometry Applications for Comparative Proteomics
Page 7: Mass Spectrometry Applications for Comparative Proteomics
Page 8: Mass Spectrometry Applications for Comparative Proteomics
Page 9: Mass Spectrometry Applications for Comparative Proteomics
Page 10: Mass Spectrometry Applications for Comparative Proteomics
Page 11: Mass Spectrometry Applications for Comparative Proteomics
Page 12: Mass Spectrometry Applications for Comparative Proteomics
Page 13: Mass Spectrometry Applications for Comparative Proteomics
Page 14: Mass Spectrometry Applications for Comparative Proteomics
Page 15: Mass Spectrometry Applications for Comparative Proteomics
Page 16: Mass Spectrometry Applications for Comparative Proteomics
Page 17: Mass Spectrometry Applications for Comparative Proteomics
Page 18: Mass Spectrometry Applications for Comparative Proteomics
Page 19: Mass Spectrometry Applications for Comparative Proteomics
Page 20: Mass Spectrometry Applications for Comparative Proteomics
Page 21: Mass Spectrometry Applications for Comparative Proteomics
Page 22: Mass Spectrometry Applications for Comparative Proteomics
Page 23: Mass Spectrometry Applications for Comparative Proteomics
Page 24: Mass Spectrometry Applications for Comparative Proteomics
Page 25: Mass Spectrometry Applications for Comparative Proteomics
Page 26: Mass Spectrometry Applications for Comparative Proteomics
Page 27: Mass Spectrometry Applications for Comparative Proteomics
Page 28: Mass Spectrometry Applications for Comparative Proteomics
Page 29: Mass Spectrometry Applications for Comparative Proteomics
Page 30: Mass Spectrometry Applications for Comparative Proteomics
Page 31: Mass Spectrometry Applications for Comparative Proteomics
Page 32: Mass Spectrometry Applications for Comparative Proteomics
Page 33: Mass Spectrometry Applications for Comparative Proteomics
Page 34: Mass Spectrometry Applications for Comparative Proteomics
Page 35: Mass Spectrometry Applications for Comparative Proteomics
Page 36: Mass Spectrometry Applications for Comparative Proteomics
Page 37: Mass Spectrometry Applications for Comparative Proteomics
Page 38: Mass Spectrometry Applications for Comparative Proteomics
Page 39: Mass Spectrometry Applications for Comparative Proteomics
Page 40: Mass Spectrometry Applications for Comparative Proteomics
Page 41: Mass Spectrometry Applications for Comparative Proteomics
Page 42: Mass Spectrometry Applications for Comparative Proteomics
Page 43: Mass Spectrometry Applications for Comparative Proteomics
Page 44: Mass Spectrometry Applications for Comparative Proteomics
Page 45: Mass Spectrometry Applications for Comparative Proteomics
Page 46: Mass Spectrometry Applications for Comparative Proteomics
Page 47: Mass Spectrometry Applications for Comparative Proteomics
Page 48: Mass Spectrometry Applications for Comparative Proteomics
Page 49: Mass Spectrometry Applications for Comparative Proteomics
Page 50: Mass Spectrometry Applications for Comparative Proteomics
Page 51: Mass Spectrometry Applications for Comparative Proteomics
Page 52: Mass Spectrometry Applications for Comparative Proteomics
Page 53: Mass Spectrometry Applications for Comparative Proteomics
Page 54: Mass Spectrometry Applications for Comparative Proteomics
Page 55: Mass Spectrometry Applications for Comparative Proteomics
Page 56: Mass Spectrometry Applications for Comparative Proteomics
Page 57: Mass Spectrometry Applications for Comparative Proteomics
Page 58: Mass Spectrometry Applications for Comparative Proteomics
Page 59: Mass Spectrometry Applications for Comparative Proteomics
Page 60: Mass Spectrometry Applications for Comparative Proteomics
Page 61: Mass Spectrometry Applications for Comparative Proteomics
Page 62: Mass Spectrometry Applications for Comparative Proteomics
Page 63: Mass Spectrometry Applications for Comparative Proteomics
Page 64: Mass Spectrometry Applications for Comparative Proteomics
Page 65: Mass Spectrometry Applications for Comparative Proteomics
Page 66: Mass Spectrometry Applications for Comparative Proteomics
Page 67: Mass Spectrometry Applications for Comparative Proteomics
Page 68: Mass Spectrometry Applications for Comparative Proteomics
Page 69: Mass Spectrometry Applications for Comparative Proteomics
Page 70: Mass Spectrometry Applications for Comparative Proteomics
Page 71: Mass Spectrometry Applications for Comparative Proteomics
Page 72: Mass Spectrometry Applications for Comparative Proteomics
Page 73: Mass Spectrometry Applications for Comparative Proteomics
Page 74: Mass Spectrometry Applications for Comparative Proteomics
Page 75: Mass Spectrometry Applications for Comparative Proteomics
Page 76: Mass Spectrometry Applications for Comparative Proteomics
Page 77: Mass Spectrometry Applications for Comparative Proteomics
Page 78: Mass Spectrometry Applications for Comparative Proteomics
Page 79: Mass Spectrometry Applications for Comparative Proteomics
Page 80: Mass Spectrometry Applications for Comparative Proteomics
Page 81: Mass Spectrometry Applications for Comparative Proteomics
Page 82: Mass Spectrometry Applications for Comparative Proteomics
Page 83: Mass Spectrometry Applications for Comparative Proteomics
Page 84: Mass Spectrometry Applications for Comparative Proteomics
Page 85: Mass Spectrometry Applications for Comparative Proteomics
Page 86: Mass Spectrometry Applications for Comparative Proteomics
Page 87: Mass Spectrometry Applications for Comparative Proteomics
Page 88: Mass Spectrometry Applications for Comparative Proteomics
Page 89: Mass Spectrometry Applications for Comparative Proteomics
Page 90: Mass Spectrometry Applications for Comparative Proteomics
Page 91: Mass Spectrometry Applications for Comparative Proteomics
Page 92: Mass Spectrometry Applications for Comparative Proteomics
Page 93: Mass Spectrometry Applications for Comparative Proteomics
Page 94: Mass Spectrometry Applications for Comparative Proteomics
Page 95: Mass Spectrometry Applications for Comparative Proteomics
Page 96: Mass Spectrometry Applications for Comparative Proteomics
Page 97: Mass Spectrometry Applications for Comparative Proteomics
Page 98: Mass Spectrometry Applications for Comparative Proteomics
Page 99: Mass Spectrometry Applications for Comparative Proteomics
Page 100: Mass Spectrometry Applications for Comparative Proteomics
Page 101: Mass Spectrometry Applications for Comparative Proteomics
Page 102: Mass Spectrometry Applications for Comparative Proteomics
Page 103: Mass Spectrometry Applications for Comparative Proteomics
Page 104: Mass Spectrometry Applications for Comparative Proteomics
Page 105: Mass Spectrometry Applications for Comparative Proteomics
Page 106: Mass Spectrometry Applications for Comparative Proteomics
Page 107: Mass Spectrometry Applications for Comparative Proteomics
Page 108: Mass Spectrometry Applications for Comparative Proteomics
Page 109: Mass Spectrometry Applications for Comparative Proteomics
Page 110: Mass Spectrometry Applications for Comparative Proteomics
Page 111: Mass Spectrometry Applications for Comparative Proteomics
Page 112: Mass Spectrometry Applications for Comparative Proteomics
Page 113: Mass Spectrometry Applications for Comparative Proteomics
Page 114: Mass Spectrometry Applications for Comparative Proteomics
Page 115: Mass Spectrometry Applications for Comparative Proteomics
Page 116: Mass Spectrometry Applications for Comparative Proteomics
Page 117: Mass Spectrometry Applications for Comparative Proteomics
Page 118: Mass Spectrometry Applications for Comparative Proteomics
Page 119: Mass Spectrometry Applications for Comparative Proteomics
Page 120: Mass Spectrometry Applications for Comparative Proteomics
Page 121: Mass Spectrometry Applications for Comparative Proteomics
Page 122: Mass Spectrometry Applications for Comparative Proteomics
Page 123: Mass Spectrometry Applications for Comparative Proteomics
Page 124: Mass Spectrometry Applications for Comparative Proteomics
Page 125: Mass Spectrometry Applications for Comparative Proteomics
Page 126: Mass Spectrometry Applications for Comparative Proteomics
Page 127: Mass Spectrometry Applications for Comparative Proteomics
Page 128: Mass Spectrometry Applications for Comparative Proteomics
Page 129: Mass Spectrometry Applications for Comparative Proteomics
Page 130: Mass Spectrometry Applications for Comparative Proteomics
Page 131: Mass Spectrometry Applications for Comparative Proteomics
Page 132: Mass Spectrometry Applications for Comparative Proteomics
Page 133: Mass Spectrometry Applications for Comparative Proteomics
Page 134: Mass Spectrometry Applications for Comparative Proteomics
Page 135: Mass Spectrometry Applications for Comparative Proteomics
Page 136: Mass Spectrometry Applications for Comparative Proteomics
Page 137: Mass Spectrometry Applications for Comparative Proteomics
Page 138: Mass Spectrometry Applications for Comparative Proteomics
Page 139: Mass Spectrometry Applications for Comparative Proteomics
Page 140: Mass Spectrometry Applications for Comparative Proteomics
Page 141: Mass Spectrometry Applications for Comparative Proteomics
Page 142: Mass Spectrometry Applications for Comparative Proteomics
Page 143: Mass Spectrometry Applications for Comparative Proteomics
Page 144: Mass Spectrometry Applications for Comparative Proteomics
Page 145: Mass Spectrometry Applications for Comparative Proteomics
Page 146: Mass Spectrometry Applications for Comparative Proteomics
Page 147: Mass Spectrometry Applications for Comparative Proteomics
Page 148: Mass Spectrometry Applications for Comparative Proteomics
Page 149: Mass Spectrometry Applications for Comparative Proteomics
Page 150: Mass Spectrometry Applications for Comparative Proteomics
Page 151: Mass Spectrometry Applications for Comparative Proteomics
Page 152: Mass Spectrometry Applications for Comparative Proteomics
Page 153: Mass Spectrometry Applications for Comparative Proteomics
Page 154: Mass Spectrometry Applications for Comparative Proteomics
Page 155: Mass Spectrometry Applications for Comparative Proteomics
Page 156: Mass Spectrometry Applications for Comparative Proteomics
Page 157: Mass Spectrometry Applications for Comparative Proteomics
Page 158: Mass Spectrometry Applications for Comparative Proteomics
Page 159: Mass Spectrometry Applications for Comparative Proteomics
Page 160: Mass Spectrometry Applications for Comparative Proteomics
Page 161: Mass Spectrometry Applications for Comparative Proteomics
Page 162: Mass Spectrometry Applications for Comparative Proteomics
Page 163: Mass Spectrometry Applications for Comparative Proteomics
Page 164: Mass Spectrometry Applications for Comparative Proteomics
Page 165: Mass Spectrometry Applications for Comparative Proteomics
Page 166: Mass Spectrometry Applications for Comparative Proteomics
Page 167: Mass Spectrometry Applications for Comparative Proteomics
Page 168: Mass Spectrometry Applications for Comparative Proteomics
Page 169: Mass Spectrometry Applications for Comparative Proteomics
Page 170: Mass Spectrometry Applications for Comparative Proteomics
Page 171: Mass Spectrometry Applications for Comparative Proteomics
Page 172: Mass Spectrometry Applications for Comparative Proteomics
Page 173: Mass Spectrometry Applications for Comparative Proteomics
Page 174: Mass Spectrometry Applications for Comparative Proteomics
Page 175: Mass Spectrometry Applications for Comparative Proteomics
Page 176: Mass Spectrometry Applications for Comparative Proteomics
Page 177: Mass Spectrometry Applications for Comparative Proteomics
Page 178: Mass Spectrometry Applications for Comparative Proteomics
Page 179: Mass Spectrometry Applications for Comparative Proteomics
Page 180: Mass Spectrometry Applications for Comparative Proteomics
Page 181: Mass Spectrometry Applications for Comparative Proteomics
Page 182: Mass Spectrometry Applications for Comparative Proteomics
Page 183: Mass Spectrometry Applications for Comparative Proteomics
Page 184: Mass Spectrometry Applications for Comparative Proteomics
Page 185: Mass Spectrometry Applications for Comparative Proteomics
Page 186: Mass Spectrometry Applications for Comparative Proteomics
Page 187: Mass Spectrometry Applications for Comparative Proteomics
Page 188: Mass Spectrometry Applications for Comparative Proteomics
Page 189: Mass Spectrometry Applications for Comparative Proteomics
Page 190: Mass Spectrometry Applications for Comparative Proteomics
Page 191: Mass Spectrometry Applications for Comparative Proteomics
Page 192: Mass Spectrometry Applications for Comparative Proteomics
Page 193: Mass Spectrometry Applications for Comparative Proteomics
Page 194: Mass Spectrometry Applications for Comparative Proteomics
Page 195: Mass Spectrometry Applications for Comparative Proteomics
Page 196: Mass Spectrometry Applications for Comparative Proteomics
Page 197: Mass Spectrometry Applications for Comparative Proteomics
Page 198: Mass Spectrometry Applications for Comparative Proteomics
Page 199: Mass Spectrometry Applications for Comparative Proteomics
Page 200: Mass Spectrometry Applications for Comparative Proteomics
Page 201: Mass Spectrometry Applications for Comparative Proteomics
Page 202: Mass Spectrometry Applications for Comparative Proteomics
Page 203: Mass Spectrometry Applications for Comparative Proteomics
Page 204: Mass Spectrometry Applications for Comparative Proteomics
Page 205: Mass Spectrometry Applications for Comparative Proteomics
Page 206: Mass Spectrometry Applications for Comparative Proteomics
Page 207: Mass Spectrometry Applications for Comparative Proteomics
Page 208: Mass Spectrometry Applications for Comparative Proteomics
Page 209: Mass Spectrometry Applications for Comparative Proteomics
Page 210: Mass Spectrometry Applications for Comparative Proteomics
Page 211: Mass Spectrometry Applications for Comparative Proteomics
Page 212: Mass Spectrometry Applications for Comparative Proteomics
Page 213: Mass Spectrometry Applications for Comparative Proteomics
Page 214: Mass Spectrometry Applications for Comparative Proteomics
Page 215: Mass Spectrometry Applications for Comparative Proteomics
Page 216: Mass Spectrometry Applications for Comparative Proteomics
Page 217: Mass Spectrometry Applications for Comparative Proteomics
Page 218: Mass Spectrometry Applications for Comparative Proteomics
Page 219: Mass Spectrometry Applications for Comparative Proteomics
Page 220: Mass Spectrometry Applications for Comparative Proteomics
Page 221: Mass Spectrometry Applications for Comparative Proteomics
Page 222: Mass Spectrometry Applications for Comparative Proteomics
Page 223: Mass Spectrometry Applications for Comparative Proteomics
Page 224: Mass Spectrometry Applications for Comparative Proteomics
Page 225: Mass Spectrometry Applications for Comparative Proteomics
Page 226: Mass Spectrometry Applications for Comparative Proteomics
Page 227: Mass Spectrometry Applications for Comparative Proteomics
Page 228: Mass Spectrometry Applications for Comparative Proteomics
Page 229: Mass Spectrometry Applications for Comparative Proteomics
Page 230: Mass Spectrometry Applications for Comparative Proteomics
Page 231: Mass Spectrometry Applications for Comparative Proteomics
Page 232: Mass Spectrometry Applications for Comparative Proteomics
Page 233: Mass Spectrometry Applications for Comparative Proteomics
Page 234: Mass Spectrometry Applications for Comparative Proteomics
Page 235: Mass Spectrometry Applications for Comparative Proteomics
Page 236: Mass Spectrometry Applications for Comparative Proteomics
Page 237: Mass Spectrometry Applications for Comparative Proteomics
Page 238: Mass Spectrometry Applications for Comparative Proteomics
Page 239: Mass Spectrometry Applications for Comparative Proteomics
Page 240: Mass Spectrometry Applications for Comparative Proteomics
Page 241: Mass Spectrometry Applications for Comparative Proteomics