a high-throughput platform for proteome ......phospho-proteome profiling of matching tumor and...

1
INTRODUCTION METHODS & RESULTS CONCLUSIONS A HIGH-THROUGHPUT PLATFORM FOR PROTEOME AND PHOSPHO-PROTEOME PROFILING OF TUMOR TISSUES Jakob Vowinckel 1 , Karel Novy 1 , Thomas Corwin 2 , Tobias Treiber 1 , Vito Dozio 1 , Roland Bruderer 1 , Lukas Reiter 1 , Eike-Christin von Leitner 2 , Oliver Rinner 1 , Claudia Escher 1 1) Biognosys AG, Wagistrasse 21, 8952 Schlieren, Switzerland 2) Indivumed GmbH, Falkenried 88, 20251 Hamburg, Germany Precision oncology requires a detailed molecular understanding of tumor biology. Phenotype and underlying cellular functions are best characterized by the study of the proteome. However, MS-based proteome profiling is underrepresented in precision medicine compared to DNA/RNA sequencing techniques. Limitations in instrument stability, reproducibility, sample throughput and data analysis have prevented large-scale proteome characterization experiments. Recent developments in data- independent acquisition (DIA) LC-MS/MS and robust chromatographic separation now present the opportunity to make proteomics available to routine analysis. Here we present a workflow that is capable of routine profiling 850 whole proteome (WP) or 650 phospho-proteome (PP) tumor samples per month and platform with an average depth of 6,000 proteins (WP) or 30,000 phospho-peptides (PP). Figure 1: Sample Preparation Workflow Diagram (A) Sample preparation of high quality fresh- frozen IndivuType samples in batches of 92 samples (+ 3 workflow QC and 1 blank) using partial robotic automation. (B) Automated QC validation and DIA data analysis by a workflow involving directDIA™ search using SpectroMine™ software and pre-set thresholds, and subsequent data analysis of individual raw files with tissue- specific spectral libraries using Spectronaut™. Figure 2: Reproducibility of Data Acquisition and Analysis Over 43 batches, median protein group coefficients of variation (% CV) are 12.5 % for WP platform QC (A), 12.0 % for WP workflow QC (B), and median phospho-peptide % CV are 19.7 % (C). In 26 NSCLC batches (1,755 samples) on average 5,903 protein groups and 28’819 phospho-peptides are quantified with peptide and protein FDR < 0.01, and site localization probability > 0.75 (D). Figure 3: NSCLC Tumor Biology Hierarchical clustering (A) and principal component analysis (B) of 7,346 protein intensity values mainly reveals co-clustering according to tissue type in 1,755 lung samples. Six known markers of lung cancer (C) show robust change between normal and tumor tissue. EGFR phosphorylation status on three known C-terminal sites (D) are consistently elevated in tumor compared to normal tissue. Jakob Vowinckel, PhD Senior Scientific Project Manager [email protected] www.biognosys.com An optimized, semi-automated workflow enables high throughput deep proteome and phospho-proteome profiling of matching tumor and normal tissue samples from Indivumed’s high quality collection of fresh- frozen bio-specimens. Rigorous quality control during sample collection, sample processing, data acquisition and analysis allows reproducible generation of data sets consisting of thousands of samples. In NSCLC patient lung tissue on average 5,903 protein groups and 28,819 phospho- peptides are quantified in each sample, with up to 7,346 protein groups quantified. Observed protein expression and phosphorylation status of known surrogates are in accordance with established knowledge and provide valuable insights to previously unknown markers relevant for tumor biology IndivuType BIOBANK PARTNER HOSPITAL TUMOR RESECTION PePtide desalting tryPtic digest samPle lysis SAMPLE COLLECTION SAMPLE STORAGE adjacent normal tissue bead shaking 500 µg 30 µg liquid handling robot HLB Oasis tumor tissue 3x24 samples 92 samples 3 Workflow QC 1 blank 96 samples BIOGNOSYS FACILITY 96 PP samPles / day monthly shipment 100-1000 s/mth 5-10 mg DIA RAW FILE directDIA™ SEARCH tissue-sPecific sPectral library Post-analysis dia data analysis QC RESULT Qc validation dia lc-ms/ms MICRO-FLOW LC DIA MS/MS Waters ACQUITY M-Class 45 - 60 min gradient 5 µL flow rate 1.5 µM CSH-C18 beads Thermo Scientific Q Exactive HF-X mass spectrometer EASY-Spray ion source 1 full range MS1 scan 50 non-linear DIA segments file watcher HPRP DDA data directDIA™ spectra protein inference cross-run normalization batch correction exploratory data analysis statistical analysis tissue-specific analysis of individual DIA raw files search result database raw file database peptide intensity database evaluation QC report run repetition A A B A B C D platform QC workflow QC workflow QC Principal component 2 B Principal component 3 C log2 protein intensity log2 peptide intensity EGFR (S 1064 ) EGFR (Y 1110 ) n.d. EGFR (Y 1197 ) D 10’ 96 WP samPles / day KingFisher™ Flex robot MagReSyn™ Ti-IMAC magnetic beads 96-well format PhosPho-PePtide enrichment

Upload: others

Post on 11-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A HIGH-THROUGHPUT PLATFORM FOR PROTEOME ......phospho-proteome profiling of matching tumor and normal tissue samples from Indivumed’s high quality collection of fresh-frozen bio-specimens

INTRODUCTION

METHODS & RESULTS

CONCLUSIONS

A HIGH-THROUGHPUT PLATFORM FOR PROTEOME AND PHOSPHO-PROTEOME PROFILING OF TUMOR TISSUES

Jakob Vowinckel1, Karel Novy1, Thomas Corwin2, Tobias Treiber1, Vito Dozio1, Roland Bruderer1, Lukas Reiter1, Eike-Christin von Leitner2, Oliver Rinner1, Claudia Escher1

1) Biognosys AG, Wagistrasse 21, 8952 Schlieren, Switzerland 2) Indivumed GmbH, Falkenried 88, 20251 Hamburg, Germany

Precision oncology requires a detailed molecular understanding of tumor biology. Phenotype and underlying cellular functions are best characterized by the study of the proteome. However, MS-based proteome profiling is underrepresented in precision medicine compared to DNA/RNA sequencing techniques. Limitations in instrument stability, reproducibility, sample throughput and data analysis have prevented large-scale proteome characterization

experiments. Recent developments in data-independent acquisition (DIA) LC-MS/MS and robust chromatographic separation now present the opportunity to make proteomics available to routine analysis. Here we present a workflow that is capable of routine profiling 850 whole proteome (WP) or 650 phospho-proteome (PP) tumor samples per month and platform with an average depth of 6,000 proteins (WP) or 30,000 phospho-peptides (PP).

Figure 1: Sample Preparation Workflow Diagram(A) Sample preparation of high quality fresh-frozen IndivuType samples in batches of 92 samples (+ 3 workflow QC and 1 blank) using partial robotic automation. (B) Automated QC validation and DIA data analysis by a workflow involving directDIA™ search using SpectroMine™ software and pre-set thresholds, and subsequent data analysis of individual raw files with tissue-specific spectral libraries using Spectronaut™.

Figure 2: Reproducibility of Data Acquisition and Analysis Over 43 batches, median protein group coefficients of variation (% CV) are 12.5 % for WP platform QC (A), 12.0 % for WP workflow QC (B), and median phospho-peptide % CV are 19.7 % (C). In 26 NSCLC batches (1,755 samples) on average 5,903 protein groups and 28’819 phospho-peptides are quantified with peptide and protein FDR < 0.01, and site localization probability > 0.75 (D).

Figure 3: NSCLC Tumor Biology Hierarchical clustering (A) and principal component analysis (B) of 7,346 protein intensity values mainly reveals co-clustering according to tissue type in 1,755 lung samples. Six known markers of lung cancer (C) show robust change between normal and tumor tissue. EGFR phosphorylation status on three known C-terminal sites (D) are consistently elevated in tumor compared to normal tissue.

Jakob Vowinckel, PhDSenior Scientific Project Manager

[email protected]

• An optimized, semi-automated workflow enables high throughput deep proteome and phospho-proteome profiling of matching tumor and normal tissue samples from Indivumed’s high quality collection of fresh-frozen bio-specimens.

• Rigorous quality control during sample collection, sample processing, data

acquisition and analysis allows reproducible generation of data sets consisting of thousands of samples.

• In NSCLC patient lung tissue on average 5,903 protein groups and 28,819 phospho-peptides are quantified in each sample, with up to 7,346 protein groups quantified.

• Observed protein expression and phosphorylation status of known surrogates are in accordance with established knowledge and provide valuable insights to previously unknown markers relevant for tumor biology

IndivuType BIOBANK

PARTNER HOSPITALTUMOR RESECTION

PePtide desalting

tryPtic digest

samPle lysis

SAMPLE COLLECTION

SAMPLE STORAGE

adjacentnormaltissue

beadshaking

500 µg

30 µg

liquid handling

robot

HLB Oasis

tumortissue

3x24samples

92 samples3 Workflow QC1 blank

96 samples

BIOGNOSYS FACILITY

96 PP samPles / day

monthlyshipment

100-1000 s/mth

5-10 mg

DIA RAW FILE

directDIA™ SEARCHtissue-sPecific

sPectral libraryPost-analysis

dia data analysis

QC RESULT

Qc validationdia lc-ms/ms

MICRO-FLOW LC

DIA MS/MS

Waters ACQUITY M-Class45 - 60 min gradient5 µL flow rate1.5 µM CSH-C18 beads

Thermo Scientific Q Exactive HF-X mass spectrometer

EASY-Spray ion source1 full range MS1 scan50 non-linear DIA segments

filewatcher

HPRP DDA datadirectDIA™ spectra

protein inferencecross-run normalizationbatch correctionexploratory data analysisstatistical analysis

tissue-specific analysisof individual DIA raw files

search result database

raw filedatabase

peptide intensitydatabase

evaluationQC report

run repetition

AA

B A

B C D platform QCworkflow QC

workflow QC

Principal component 2

B

Prin

cipa

l com

pone

nt 3

C

log2

pro

tein

inte

nsity

log2

pep

tide

inte

nsity

EGFR (S1064)

EGFR (Y1110)

n.d.

EGFR (Y1197)

D

10’

96 WP samPles / day

KingFisher™Flex robot

MagReSyn™ Ti-IMAC magnetic beads

96-wellformat

PhosPho-PePtideenrichment