scaling discovery proteomics to large lung cancer...
TRANSCRIPT
H. LEE MOFFITT CANCER CENTER & RESEARCH INSTITUTE, AN NCI COMPREHENSIVE CANCER CENTER – Tampa, FL
1-888-MOFFITT (1-888-663-3488) www.MOFFITT.org
© 2010 H. Lee Moffitt Cancer Center and Research Institute, Inc.
Introduction
Proteome profiling experiments must satisfy two primary requirements: maximizing depth of protein sampling as well as scaling to clinical cohorts. The approach presented here integrates discovery experiments used to create reference databases and rapid data independent acquisition (DIA) methods to examine large clinical cohorts using < 3 hours of instrument time per individual patient’s resected tumor specimen.1-2 The combination facilitates data mining for protein identification and relative quantitation across larger sample sets within 1 month.
Scaling Discovery Proteomics to Large Lung Cancer Cohorts using Data Independent Acquisition
Bin Fang,1 Melissa A. Hoffman,1 Amol Prakash,2 Scott M. Peterman,3 Paul A. Stewart,1 Guolin Zhang,1 Richard Z. Liu,1 Matthew A. Smith,1 Joseph Johnson,1 Steven A. Eschrich,1 Eric B. Haura,1 & John M. Koomen1
1. H. Lee Moffitt Cancer Center, Tampa, FL 2. OPTYS Tech 3. Thermo Fisher Scientific
Experimental Details
Comparison of Discovery Methods
Sample Preparation: FFPE Tissue was processed using the method reported by Sprung et a l[3]. Twelve fractions from basic pH reversed phased liquid chromatography (bRPLC) were analyzed using LC-MS/MS. Unfractionated digests were analyzed using pSMART and hybrid DIA acquisition methods.
Spectral Library Construction: Sequest and Mascot searches were used for spectral library generation (Proteome Discoverer 1.4, Thermo Fisher Scientific).
DIA and Data Processing: Samples were analyzed with MS1 followed by narrow window DIA (n = 18) in each cycle. The data were matched to the spectral libraries and qualitative and quantitative data processing was performed using Pinnacle (OPTYS Tech). In addition, hybrid acquisition with DIA and targeted MS2 is examined using cancer biomarkers.
References
Acknowledgments This work used Analytic Microscopy, Proteomics, and Tissue Cores funded in part by Moffitt’s Cancer Center Support Grant (NCI P30-CA076292). Project funding provided by the American Lung Association (Lung Cancer Discovery Award LCD-257857-N), the NCI (R21-CA169980), Moffitt Lung Cancer Center of Excellence, and Moffitt’s Innovation and Technology Pilot Project Funding.
1. Prakash A, et al. J Proteome Res. 2014, 13, 5415-30. 2. Gillet LC., et al. Mol Cell Proteomics 2012, 11, O111.016717 3. Sprung RW Jr, et al. Mol Cell Proteomics 2009, 8, 1988-98. 4. Stewart et al. PLoS ONE 2015, 10, e0142162.
Conclusions 1. DIA balances a sufficient depth of coverage with the ability to analyze large cohorts of patients (n =
100-400) within 1-2 months on a single instrument. 2. Hybrid data acquisition / processing method developed, which includes spectral library generation,
scans combining MS1, asymmetric narrow window DIA, targeted MS2 scans, spectral library matching and dual MS1/MS2 quantitation using Pinnacle software.
3. Applicable to Tumor Microarray Specimens.
m/z m/z
16 Highest Intensity Peptides for MS/MS
DDA
DIA
?
MS/MS across m/z Range with User-Defined Windows
m/z m/z
Undersampling: What Peptides/Proteins are Missed?
Maximize ID and Quantification from Single Sample
Lung Tissue Sample Type # Squamous Cell Carcinoma (SCC) 50 SCC Adjacent Lung Tissue 50 Adenocarcinoma (ADC) 46 ADC Adjacent Lung Tissue 13
ADJ Lung- Pools
SCC- Pool
ADC-Pool
812
491
3,661 41%
1,494
1,173 322
868
CTR
L1
CTR
L2
CTR
L3
CTR
L4
AD
C1
A
DC
2
SCC
1
SCC
2
High
Low
TMT (6)
DIA (1)
Label Free (24)
DIA TMT Label Free
Ctrl Tumor Ctrl Tumor Ctrl Tumor
Selected Cancer-Related Proteins Not Observed in DIA, but in Spectral Library: 4E-BP1, AKT(PKB), ASK1 (MAP3K5), Bcl-2, Bcl-XL, Bcl-XS, Bim, Collagen IV, COX-2 (PTGS2), c-Src, Frizzled, GIPC, GSK3 alpha/beta, GYS1, HGF receptor (Met), IBP, IGF-1 receptor, IGF-2, I-kB, IKK-beta, IKK-gamma, Insulin receptor, IRAK1/2, IRAK4, K-RAS, Matrilysin (MMP-7), MEK1/2, MEK3(MAP2K3), MEK6(MAP2K6), MELC, Midkine, MMP-1, MNK1, MRLC, MyD88, Neurofibromin, NF-κB, N-WASP, p130CAS, PARD6, PDK (PDPK1), PI3K cat class IA, PI3K cat class IA (p110-alpha), PI3K cat class IB (p110-gamma), PI3K reg class IA, PI3K reg class IA (p85), PI3K reg class IA (p85-alpha), PI3K reg class IA (p85-beta), PKC-zeta, PLAUR (uPAR), PP2A catalytic, PTEN, RHEB2, Shc, SMAD2, SP1, TAB1, Talin, TGF-beta 1, Tryptase, VCAM1, VEGF-A, WNT, XIAP
Prot
eins
in D
IA
Proteins in Spectral Library
Top 50 Enriched Pathways y = 0.69x + 4.5
R² = 0.80
0
20
40
60
0 20 40 60 80 100 Apoptosis Signaling, GeneGO, Metacore
Proteins Peptides
1
9
17
0 50 100 150 200 Qua
ntifi
catio
ns
(x 1
03 )
Sorted Sample Order
ADC Adj Ctrl
ADC ADC Pool SCC SCC
Pool SCC Adj Ctrl
AD
C
Ctrl
(A)
SCC
C
trl (S
) Po
ol (A
)
Pool
(S)
0
3E3
6E3
Qua
ntifi
able
Pro
tein
s
AD
C
Ctrl
(A)
SCC
C
trl (S
) Po
ol (A
)
Pool
(S) 0
1E4
2E4
Qua
ntifi
able
Pep
tides
y = 0.2462x + 1030.3 R² = 0.9644
0
3E3
6E3
0 1E4 2E4 Qua
ntifi
able
Pro
tein
s
Quantifiable Peptides
m/z 0
50
100
Inte
nsity
486.76 483.76
485.5 SIS Endogenous
R² = 0.998 0
50
100
150
0 20 40 60 80 100 Amount (fmol)
L/H
Rat
io
ERBB2_ELVSEFSR Heavy: 1.25 fmol/injection
Light: 0.04 to 80 fmol CV < 5.5% across range
MS
Targeted MS2 m/z 485.5
DIA m/z 487.50
28.6 28.8 29.0 29.2 29.4 29.6 29.8
Time (min)
0
50
100
0
50
100
Rel
ativ
e Ab
unda
nce
0
50
100
450
1440
485.5
Time (s)
m/z
0 19.2
NL: 6.01E6
NL: 9.02E6
NL: 4.43E6
7
8
9
10
11
12
18
13
14
15
16
17 3,605
3,585
3,097 3,144
3,576
3,598
1mm 0
200
0 120 Area (mm2)
Prot
ein
(µg)
DIA LC-MS/MS proteome profiling data were acquired from lung squamous cell carcinoma, adenocarcinoma, and adjacent lung tissue. DIA analysis of cores from a tissue microarray produced quantitative data for > 3,000 proteins each, indicating that the technology can be applied to minimal amounts of formalin fixed paraffin embedded tumor tissue.
LB-140
Tissue Microarray Analysis
Molecular Classification using Lung Cancer Proteomes
Targeted Quantification in DIA
Pilot Project Comparing Squamous Tumors to Adjacent Lung Tissue using Label-Free, TMT, and DIA Discovery Proteomics Enables Investigation of Detectable Biology.
Adenocarcinoma (EGFR WT, KRAS WT) Squamous Cell Carcinoma
Assessment of 159 Total Tumor and Lung Tissue Proteomes using 200 DIA-LC-MS/MS Analyses over 25 Days Enables Molecular Classification of Tumors.
Protein Quantification from Single TMA Core Sections Builds on Whole Slide Imaging (Aperio), Histological Maps (Genie) , and DIA-LC-MS/MS Analysis.
Targeted Peptide-Based Quantitative Assays for Known Biomarkers (e.g. HER2) can be Built into DIA Proteome Analysis Experiments.