ampullary cancers harbor elf3 tumor suppressor gene

29
Article Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene Mutations and Exhibit Frequent WNT Dysregulation Graphical Abstract Highlights d Three periampullary tumor types share a common molecular blueprint d Frequent WNT pathway disruption can be a potential therapeutic target d ELF3 is a frequently mutated tumor suppressor gene of periampullary tumors Authors Marie-Claude Gingras, Kyle R. Covington, David K. Chang, ..., Andrew V. Biankin, David A. Wheeler, Richard A. Gibbs Correspondence [email protected] (M.-C.G.), [email protected] (D.A.W.) In Brief Gingras et al. compare the genomic profiles of ampullary, distal bile duct, and duodenal adenocarcinomas. They find disruption of the WNT pathway, which could be used as a tumor subclassification for targeted therapy, a high frequency of ELF3 inactivating mutations, and microsatellite instability. Gingras et al., 2016, Cell Reports 14, 907–919 February 2, 2016 ª2016 The Authors http://dx.doi.org/10.1016/j.celrep.2015.12.005

Upload: others

Post on 02-Aug-2022

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Article

Ampullary Cancers Harbor

ELF3 Tumor SuppressorGene Mutations and Exhibit Frequent WNTDysregulation

Graphical Abstract

Highlights

d Three periampullary tumor types share a common molecular

blueprint

d Frequent WNT pathway disruption can be a potential

therapeutic target

d ELF3 is a frequently mutated tumor suppressor gene of

periampullary tumors

Gingras et al., 2016, Cell Reports 14, 907–919February 2, 2016 ª2016 The Authorshttp://dx.doi.org/10.1016/j.celrep.2015.12.005

Authors

Marie-ClaudeGingras, Kyle R. Covington,

David K. Chang, ..., Andrew V. Biankin,

David A. Wheeler, Richard A. Gibbs

[email protected] (M.-C.G.),[email protected] (D.A.W.)

In Brief

Gingras et al. compare the genomic

profiles of ampullary, distal bile duct, and

duodenal adenocarcinomas. They find

disruption of the WNT pathway, which

could be used as a tumor

subclassification for targeted therapy, a

high frequency of ELF3 inactivating

mutations, and microsatellite instability.

Page 2: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Cell Reports

Article

Ampullary Cancers Harbor ELF3 Tumor SuppressorGeneMutations andExhibit FrequentWNTDysregulationMarie-Claude Gingras,1,2,3,* Kyle R. Covington,1 David K. Chang,4,5,6,7 Lawrence A. Donehower,1,8 Anthony J. Gill,6,9,10

Michael M. Ittmann,3,11,12 Chad J. Creighton,1,3 Amber L. Johns,6 Eve Shinbrot,1 Ninad Dewal,1 William E. Fisher,2,3,13

Australian Pancreatic Cancer Genome Initiative, Christian Pilarsky,14 Robert Gr€utzmann,15 Michael J. Overman,16

Nigel B. Jamieson,4,5,17 George Van Buren II,2,3,13 Jennifer Drummond,1 Kimberly Walker,1 Oliver A. Hampton,1 Liu Xi,1

Donna M. Muzny,1 Harsha Doddapaneni,1 Sandra L. Lee,1 Michelle Bellair,1 Jianhong Hu,1 Yi Han,1 Huyen H. Dinh,1

Mike Dahdouli,1 Jaswinder S. Samra,10,18 Peter Bailey,4 Nicola Waddell,19,20 John V. Pearson,19,20 Ivon Harliwong,19

Huamin Wang,21 Daniela Aust,22 Karin A. Oien,4,23 Ralph H. Hruban,24 Sally E. Hodges,2,13 Amy McElhany,2,13

Charupong Saengboonmee,1,25 Fraser R. Duthie,4,23 Sean M. Grimmond,4,19 Andrew V. Biankin,4,5,6,7

David A. Wheeler,1,3,* and Richard A. Gibbs11Department of Molecular and Human Genetics, Human Genome Sequencing Center, Baylor College of Medicine, Houston,

TX 77030, USA2Michael DeBakey Department of Surgery, Baylor College of Medicine, Houston, TX 77030, USA3Dan L. Duncan Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA4Wolfson Wohl Cancer Research Centre, Institute for Cancer Sciences, University of Glasgow, Garscube Estate, Bearsden,

Glasgow G61 1BD, UK5West of Scotland Pancreatic Unit, Glasgow Royal Infirmary, Glasgow G31 2ER, UK6The Kinghorn Cancer Centre and the Cancer Research Program Garvan Institute of Medical Research, Darlinghurst, Sydney, NSW 2010,Australia7South Western Sydney Clinical School, Faculty of Medicine, University of New South Wales, Liverpool, NSW 2170, Australia8Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, TX 77030, USA9Department of Anatomical Pathology, Royal North Shore Hospital, St Leonards, Sydney, NSW 2065, Australia10Sydney Medical School, University of Sydney, Sydney, NSW 2006, Australia11Department of Pathology and Immunology, Baylor College of Medicine, Houston, TX 77030, USA12Michael E. DeBakey Department of Veterans Affairs Medical Center, Houston, TX 77030, USA13The Elkins Pancreas Center at Baylor College of Medicine, Houston, TX 77030, USA14Department of Surgery, TU Dresden, 01307 Dresden, Germany15Department of Surgery, Universitatsklinikum Erlangen, 91054 Erlangen, Germany16Department of Gastrointestinal Medical Oncology, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA17Academic Unit of Surgery, Institute of Cancer Sciences, Glasgow Royal Infirmary, Level 2, New Lister Building, University of Glasgow,

Glasgow G31 2ER, UK18Department of Surgery, Royal North Shore Hospital, St Leonards, Sydney, NSW 2065, Australia19Queensland Centre for Medical Genomics, Institute for Molecular Bioscience, The University of Queensland, St Lucia, Brisbane, QLD 4072,Australia20QIMR Berghofer Medical Research Institute, Herston, Brisbane, QLD 4006, Australia21Department of Pathology, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA22Department of Pathology, TU Dresden, 01307 Dresden, Germany23Department of Pathology, Southern General Hospital, Greater Glasgow and Clyde NHS, Glasgow G51 4TF, UK24Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, the Johns Hopkins University School of Medicine,

Baltimore, MD 21231, USA25Department of Biochemistry and Liver Fluke and Cholangiocarcinoma Research Center, Faculty of Medicine, Khon Kaen University,

Khon Kaen 40002, Thailand

*Correspondence: [email protected] (M.-C.G.), [email protected] (D.A.W.)

http://dx.doi.org/10.1016/j.celrep.2015.12.005This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

SUMMARY

The ampulla of Vater is a complex cellular environ-ment from which adenocarcinomas arise to form agroup of histopathologically heterogenous tumors.To evaluate the molecular features of these tumors,98 ampullary adenocarcinomas were evaluated andcompared to 44 distal bile duct and 18 duodenal ad-enocarcinomas. Genomic analyses revealed muta-tions in the WNT signaling pathway among half of

C

the patients and in all three adenocarcinomas irre-spective of their origin and histological morphology.These tumors were characterized by a high fre-quency of inactivating mutations of ELF3, a highrate of microsatellite instability, and common focaldeletions and amplifications, suggesting commonattributes in the molecular pathogenesis are at playin these tumors. The high frequency ofWNT pathwayactivating mutation, coupled with small-molecule in-hibitors of b-catenin in clinical trials, suggests future

ell Reports 14, 907–919, February 2, 2016 ª2016 The Authors 907

Page 3: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

treatment decisions for these patientsmay be guidedby genomic analysis.

INTRODUCTION

Though the pancreas, bile duct, and intestinal duodenum share

common embryologic origins in the ventral endoderm, the

adenocarcinomas arising in this region presumably originate

from different epithelial cellular constituents present at the site

(Zaret and Grompe, 2008). These tumors have been described

in many different ways: intra-ampullary, periampullary, intra-

ampullary papillary-tubular neoplasm, ampullary/ductal, periam-

pullary-duodenal, and ampullary/not otherwise specified. The tu-

mors clearly separated from the ampulla of Vater and localized in

the bile duct, duodenum, or pancreatic duct have been identified

as distal cholangiocarcinomas or distal bile duct (CAC), duodenal

(DUOAC), or pancreatic ductal (PDAC) adenocarcinomas.

As recommended in the AJCC seventh edition 2009 staging

system (Edge et al., 2009), the current subtype classification of

ampullary adenocarcinoma (AMPAC) is based on the anatomical

location from which the tumor is thought to arise (Edge et al.,

2009), sometimes supplemented by histopathology and expres-

sion of differential markers (Adsay et al., 2012; Chang et al.,

2013; Ehehalt et al., 2011; Morini et al., 2013). This classification

is subjective and prone to inter-observer variability and can

significantly impact treatment selection and therapeutic devel-

opment (Amptoulach et al., 2011; Heinrich and Clavien, 2010;

Romiti et al., 2012; Westgaard et al., 2013). Current treatment

approaches do not distinguish patients based on subtypes, yet

tumors may arise from at least the three epithelia that converge

at that site, and somemay arise from the ampulla itself, where lit-

tle is known of the specialized epithelium that may be present.

Malignancies that arise from different cellular origins often

have vastly differing sensitivities to therapeutics. Post hoc ana-

lyses of clinical trials using histopathological criteria have not

discerned such a difference and likely represent the inaccuracy

of such a classifier. However, as most therapeutic development

is focused on agents that target specific molecular mechanisms,

a molecular characterization that would allow selection of pa-

tients for specific therapies would facilitate therapeutic develop-

ment with the aim of improving outcomes and alleviate the

impact of an inaccurate subjective classification.

For this study, we have assembled a large cohort of AMPAC

with nearby DUOAC and CAC for comparison. We show that

tumors from the duodenum, ampulla of Vater, and distal bile

duct exhibit a common spectrum of features irrespective of

their morphology, marker expression, and cellular origin. Here,

we use the term ‘‘periampullary tumors’’ in this study to refer to

the three tumor types of AMPAC, DUOAC, and CAC collectively,

as defined by the AJCC seventh edition 2009 staging system

(Edge et al., 2009), excluding cases that clearly arise from the

pancreas (pancreatic adenocarcinoma [PDAC]).

RESULTS

In order to develop a molecular taxonomy for periampullary can-

cers and define subtypes with clinical relevance, we performed

908 Cell Reports 14, 907–919, February 2, 2016 ª2016 The Authors

exome sequencing and copy-number analysis of 160 cancers

arising in the periampullary region, 62 of these clearly arising

from either the bile duct (n = 44) or the duodenum (n = 18) and

98 for which the epithelium of origin could not be clearly defined

morphologically (AMPAC). Mutations were validated by deep

and ultra-deep sequencing on a limited target region consisting

of 71 recurrently mutated genes. RNA sequencing (RNA-seq)

was performed on 30 patients: a 28-patient subset of the 98

ampullary tumors and a two-patient subset of the 18 duodenal

tumors.

Clinical Characteristics and SubtypingThe clinical characteristics of our patient cohort are described in

Table S1A. In this study, the anatomical primary site of origin of

all tumors was defined using the AJCC seventh edition 2009

staging system (Edge et al., 2009). In addition, the tumors were

also classified independently by cellular morphology and immu-

nohistochemistry (IHC) staining (see Experimental Procedures)

into intestinal, pancreatobiliary, or mixed subtypes (Table S1B).

Since treatment may be determined based on subtypes defined

by the combination of morphology and IHC even if these mea-

sures are somewhat subjective, it was an important objective

of our study to assess the reliability and meaning of these sub-

types. Subtyping according to IHC, the AMPAC tumors were

51% pancreatobiliary and 34% intestinal, with the remainder

mixed. CAC was dominated by the pancreatobiliary subtype,

86% as expected; however, 11% of CAC exhibited an intestinal

phenotype. In DUOAC, the intestinal subtype was 44%, with

22% pancreatobiliary and the remainder mixed.

By histological morphology, a smaller proportion of each

tumor type was classified as pancreatobiliary (AMPAC, 37%;

CAC, 77%; and DUOAC, 6%). The two methods of classifica-

tion yielded concordant subtypes only 62% of the time for

AMPAC tumors, 77% of the time in CAC, and 53% of the time

in DUOAC. Although the two methods often disagreed, all three

tumor types included in their numbers concordant cases of

all three subtypes. Thus, tumors originating in each organ site

in the periampullary region may be classified as any of the

three subtypes, though this classification system is rarely

applied to DUOAC or CAC tumors. These tumors were analyzed

by genomic methods to further characterize their molecular

properties.

Mutation AnalysisExomes were sequenced to an average of 120-fold coverage re-

sulting in 28,795 mutations across 152 patients. Eight additional

patients were sequenced with targeted custom sequencing

and were included in the study (see Supplemental Experimental

Procedures, ‘‘Sequencing design and Mutation analysis,’’ and

Tables S1C–S1E). Microsatellite instable phenotypes were

observed in 12 patients representing each organ cohort (Fig-

ure 1), accounting for 18,572 of the whole-exome sequencing

(WES) discovery set. Using a method we developed based on

the enzyme slippage of the homopolymer region (E.S., unpub-

lished data), we identified two other patients among the targeted

sequencing set (Figure S1A).

Excluding microsatellite instable (MSI) tumors and correc-

ting for tumor purity, the median mutation rate did not vary

Page 4: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

A B

C D

Figure 1. Mutation Frequencies and MSI

Characteristics

(A) Mutation frequencies for all patients by

anatomical site (D, DUOAC; A, AMPAC; C, CAC)

and subtype (I, intestinal; M, mixed; PB, pan-

creatobiliary). Black dots, microsatellite stable

(MSS); red dots, microsatellite instable (MSI).

(B) Germline mutations in MMR gene associated

with Lynch syndrome were detected in 66% of the

MSI samples. Survival (m) is in months; black tile,

patient died of disease; white tiles, patient alive;

Lynch Mutation Freq, frequency each gene is

observed in Lynch syndrome patients; blue tiles,

missense mutations; green, frameshift mutations;

red, nonsense mutations; ‘‘L’’ = known Lynch

syndrome mutation; ‘‘d’’ deleterious mutation by

PolyPhen2.

(C) Kaplan-Meier plot for survival based on MSI

status in AMPAC (log rank p = 0.04, n = 96).

(D) Kaplan-Meier plot for survival based on MSI

status in all periampullary tumors (p = 0.0028,

n = 160).

See also Figures S1A–S1C and Tables S1A–S1E.

significantly across the AMPAC, CAC, and DUOAC (3.8, 4.6, and

4.7 per Mb, respectively) but was clearly distinct from the MSI

mutation rate (68, 127, and 108 per Mb, respectively) (Figures

1A, S1B, and S1C). Two-thirds of the hypermutated WES sam-

ples had germline mutation in genes associated with Lynch syn-

drome. Interestingly, PMS2, a gene that accounts for less than

5% of Lynch syndrome patients overall (Thompson et al.,

2004) (OMIM #600259), was mutated in one half of our MSI pa-

tients (Figure 1B). Although MSI was more common in DUOAC

than CAC patients, every morphologic category harbored at

least one PMS2 germline mutation in this study. Leaving aside

germline contribution, the overall frequency of MSI in AMPAC

was 3%. MSI appeared to confer a survival advantage in

AMPAC, as it does in other gastrointestinal (GI) cancers, as all

six AMPAC patients were alive ranging from 2 to 8 years after

diagnosis (p = 0.04 with a lack of negative event) (Figure 1C).

Taking all three anatomical sites into consideration, MSI have

better survival, p < 0.0021 (Figure 1D).

Non-negative matrix factorization was used to evaluate the

mutation signatures associated with periampullary tumors. We

identified five prominent signatures, out of 21 observed (Figures

2A, S2A, and S2B; Table S2). The most common signature was

C > T at CpG islands (#6). Indeed, this signature is most common

across all tumor types. A few CAC and AMPAC tumors had a

strong T > G, > C signature (#7) associated with the digestive

track tumors and consistent with DNA damage and exposure

to arsenic (Martinez et al., 2013). A C > G signature (#4) charac-

teristic of DNA damage by APOBEC enzymes was also present

in a few patients (Roberts et al., 2013).

Cell Reports 14, 907–919

We observed signature #1 at greater

than 20% of the total signature in 9.6%

of our entire tumor set (6% AMPAC and

21% CAC). Signature #1 is characterized

by AC, AT > AN and is enriched in non-

transcribed regions of the genome in

samples from several cancer types (PDAC, medulloblastoma,

breast tumor, AML, and CLL). However, signature #1 was also

observed in the coding region of 18 out of 486 hepatocellular car-

cinoma (4%) and 31 out of 450 colorectal carcinoma (CRC) (7%)

(Lawrence et al., 2013; Totoki et al., 2014; K.R.C., unpublished

data). Whereas none of the known signatures have yet been

associated with a difference in outcome, signature #1 was asso-

ciated with poor outcomes in our study set (multivariate Cox pro-

portional hazards p = 0.02) (Figure 2B).

The analysis of the periampullary tumors, excluding MSI pa-

tients, revealed 19 genes mutated significantly above back-

groundusingMutSig-CV (Lawrence et al., 2013) (Figure 3A; Table

S3A). Considering the ratio of inactivating tomissensemutations,

an additional three genes were brought in to the significantly

mutated gene list (Table S3B) including PBRM1, RECQL4, and

KDM6A. Gene expression data confirmed that the variants

harboring missensemutation in the driver genes were expressed

between 85% and 88% of the time (Table S3C).

Most interestingly, ELF3 a transcriptional regulator of TGFBR2

was mutated in 10.6% of the periampullary tumors with predom-

inantly inactivating frameshift or nonsense mutations (Figure 3B).

This mutation frequency is three times higher than in any other

cancer (Table S3D) (Cerami et al., 2012; Gao et al., 2013; Law-

rence et al., 2014) (http://www.cBioPortal.org). In agreement

with our finding, ELF3 mutations were found in 9.5% of extrahe-

pathic CAC in a recent study of 74 samples with four inactivating

mutations out of seven (Nakamura et al., 2015). ELF3 mutation

occurred 71%of the timewithWNTpathwaymutations in all three

periampullary groups (Figure S3A). (chi-square test, p = 0.02).

, February 2, 2016 ª2016 The Authors 909

Page 5: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

A

B

Figure 2. Mutation Signature in Periampullary Tumors

(A) Heatmap of five dominant mutation signatures from NMF analysis of mutation spectrum for each subject. Intensity indicates the proportion of mutations for

that subject attributed to the indicated signature. Subjects are sorted first by signature 1, then signature 6 from the highest to the lowest value. Only signatures

with high penetrance are shown.

(B) Kaplan-Meier curve of survival in this cohort stratified by signature 1 levels (high, red line: signature 1 component >10% of all mutations; low, black line:

otherwise, multivariate Cox proportional hazards p = 0.001).

See also Figures S2A and S2B and Table S2.

Considering the 44 CAC alone, four genes were significantly

mutated in this cancer: TP53, KRAS, SMAD4, and CDKN2A

with the highest mutation incidence in TP53. Whereas intrahe-

patic CAC tumors frequently harbor BAP1, IDH1, and IDH2

(Nakamura et al., 2015), these were absent with the exception

of a single IDH1 hotspot mutation in the periampullary CAC.

This is in agreement with Nakamura et al. (2015), where no

IDH1 mutations could be detected among 74 extrahepatic tu-

mors (compared to a 5% mutation rate in intrahepatic tumor)

and a less than 3% BAP1 mutation rate was found in extrahe-

910 Cell Reports 14, 907–919, February 2, 2016 ª2016 The Authors

patic tumors (compared to a 12.4%mutation rate in intrahepatic

tumor).

Alteration of Key Signaling PathwaysThe significantlymutated genes defined five pathways in periam-

pullary tumors: TP53/cell division, RAS/PI3K, WNT, TGF-b,

and chromatin remodeling pathways. We combined the point

mutations and copy-number alterations (CNA) changes at

the gene level within these five pathways to assess the impact

of these pathways among the three anatomical sites (Figures

Page 6: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Figure 3. Significantly Mutated Genes in

Non-MSI Periampullary Tumors

(A) Significantly mutated genes are displayed by

FDR value (MutSigCV). Genes with FDR < 0.1 are

located in the left panel, genes with FDR > 0.1 but

significantly inactivated are in the middle panel,

and genes slightly under the significant threshold

of the significantly mutated gene (SMG) list are in

the right panel. The amount of samples for each

tumor type is stacked.

(B) ELF3 inactivating mutations were distributed

along the entire gene characteristic of a tumor

suppressor (q < 1.6 3 10�11). All the mutations

found in the study are represented in the figure,

each mutation being found in one patient.

See also Tables S3A and 3B–S3D.

4A and 4B). The similarities and differences in gene mutations

per tumor types and subtypes are illustrated in Figures S3A

and S3B.

TheWNT pathway was mutated in 46% of patients overall but

was clearly differentially mutated across the three tumor types,

being more frequently mutated in DUOAC (72%) than in AMPAC

(49%) or CAC (30%) (chi-square p < 0.05) (Tables S4A and S4B).

This predominance of WNT pathway mutation in DUOAC was

due mainly to more frequent mutations of APC and SOX9.

Whereas the TP53, RAS, TGF-b signaling and chromatin remod-

eling pathways are deregulated in many tumor types, the WNT

pathway deregulation is reported only in gastrointestinal tumors

(Biankin et al., 2012; Cancer Genome Atlas, 2012). We reasoned

that grouping the patients by our histological classification might

enrichWNTmutation in the intestinal subtype relative to the pan-

creatobiliary subtype. As expected, the intestinal subtype had

67% WNT pathway alterations compared to pancreatobiliary

with 30% WNT alterations, very close to the WNT frequency

based on anatomical site (Figures 4B, S3A, and S3B; Tables

S4A and S4B). Although we observe a gradient ofWNT pathway

disruption in tumors as their anatomical site moves away from

the GI tract, WNT mutation is still frequent in CAC, or ‘‘pancrea-

tobiliary’’ subtype tumors.

TGFBR2 was also more frequently mutated in DUOAC than

AMPAC and CAC, but this may have been secondary to MSI,

which was in higher proportion in DUOAC. TGFBR2 harbors an

A homopolymer run of eight bases that is a frequent target of

Cell Reports 14, 907–919

mutation in MSI patients, and 5 of the 12

TGFBR2 mutations were at this site.

Interestingly, SMAD4, a gene frequently

mutated in PDAC, was the most

commonly mutated gene of the TGF-b

pathway in AMPAC and CAC, the tissue

sites in closest proximity to the pancreas.

Mutant KRAS was the major RAS

signaling oncogene in all three tumor

types. Overall, the RTK/RAS/PI3K path-

way was activated in all periampullary pa-

tients at a statistically similar rate ranging

from 84% to 94% among the three tumor

types (Tables S4A and S4B).

Alterations in the SWI/SNF chromatin remodeling pathway

were observed most frequently in ARID1A and ARID2. Overall,

mutations in the SWI/SNF complex were equally frequent in

the three tumor types.

Pathway Mutation Correlates with Disease OutcomeMultivariate analyses on the periampullary tumors as a group

showed mutations in the TGF-b pathway are associated with

better overall survival (multivariate Cox proportional hazard

p = 0.0059, HR = 0.42) independent of stage, gender, subtype,

and MSI status (multivariate Cox proportional hazard p =

0.029). Mutations in the PI3K pathway were also associated

with better overall survival (multivariate-Cox proportional haz-

ards p = 0.036, HR = 0.43) (Figure S3C). Mutations in TP53,

KRAS, WNT, and chromatin remodeling pathways showed no

significant association with outcomes in multivariate modeling.

Interestingly, TGF-b pathway mutations were also negatively

associated with mutation signature 1 (multivariate ANOVA p =

0.02), possibly explaining the association with outcomes. How-

ever, the contribution of signature 1 to outcomes was still signif-

icant when considering TGF-b pathway mutations in the model,

indicating that these two effects are not entirely redundant.

RNA ExpressionRNA expression was analyzed in 28 AMPAC and 2 DUOAC. Due

to the high frequency of mutation in WNT and the current devel-

opment of therapeutic agents targeting b-catenin, we evaluated

, February 2, 2016 ª2016 The Authors 911

Page 7: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Figure 4. Major Altered Pathways in Periampullary Tumors

(A) Frequency of changes defined by somatic mutations or copy-number loss or gain is expressed as a percentage of cases for each gene. Inactivation (blue) or

activation (red) is graded in intensity by percent of patients affected.

(B) Genetic alterations in the significantly mutated genes grouped by pathway are illustrated for each patient. Note WNT and PI3K signaling pathways could be

found in the three tumor types and in each of their subtypes, including the pancreatobiliary subtype.

See also Figures S3A and S3B, wherein mutations in each gene are grouped by tumor type and subtype, and Figure S3C and Tables S4A and S4B.

912 Cell Reports 14, 907–919, February 2, 2016 ª2016 The Authors

Page 8: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Figure 5. Relative RNA Expression of WNT

Antagonist, Agonist, and Target Genes

Tumors were split between those with and without

WNT canonical pathway mutations as shown in

the mutation panel. The level of RNA expression

for each gene can be visualized in the heat map

and the average expression of all the genes is

summarized in the lower panel. See also Figure S4

for fusions.

the expression data using a previously developedWNT signature

that included WNT antagonist, WNT agonist, and WNT target

genes (Donehower et al., 2013). An increase in expression in

these three gene groups as a result of the WNT pathway dereg-

ulation was noticed in colorectal cancer (Donehower et al.,

2013). This could be explained by the fact that CTNNB1 activa-

tion resulted in an increased expression of targeted genes and

the unrestricted WNT signaling set up a negative feedback

loop of the WNT antagonist genes attempting to shut down

signaling. In this study, we divided the patients into WNT

mutated and those without (Figure 5, mutation panel). We then

looked at the relative RNA expression in the two tumor groups

forWNT antagonists, WNT agonists, andWNT targets (Figure 5,

middle panel). The tumors with WNT mutations trend signifi-

cantly toward higher overall WNT gene expression (p < 0.001)

(Figure 5, lower panel). The WNT gene expression profile was

also increased in some of the WNT non-mutated patients, indi-

cating that some other mechanism affecting the WNT pathway

might be at play.

Fusion analysis identified two noticeable non-recurrent fu-

sions: SLC45A3-ELK4 used as a prognosis marker in prostate

Cell Reports 14, 907–919

cancer, where its expression is elevated

(Kumar-Sinha et al., 2012; Ren et al.,

2014), and a LINE-MET fusion in a patient

without any KRAS or TP53 driving

mutations and a high MET expression

(Figure S4; Table S5). LINE element

insertions are found in PDAC, colon,

hepatocellular, esophageal, and lung

carcinoma (Paterson et al., 2015; Rodi�c

et al., 2015).

Copy-Number AlterationThe majority of CNAs involved entire

chromosomes or chromosome arms as

opposed to focal events, which are com-

mon in gastrointestinal tumors. Arm-level

deletions outnumbered amplifications

across all tumors (Figure 6A). The three

tumor types shared four arm-level ampli-

fications and nine arm-level deletions.

AMPAC shared amplification of 1q and

deletion of 1p and 8p CAC (Table S6A).

AMPAC shared no events specifically

with DUOAC, making AMPAC marginally

more similar to CAC in its CNA pattern.

AMPAC also had two unique amplica-

tions on 5p and 6p, whereas 3q amplification was unique to

CAC, and 6p was unique to DUOAC.

A combined GISTIC analysis revealed as expected a focal

deletion of 9p23.1, involving CDKN2A (Table S6B). A focal

deletion in chromosome 9 removed the promotor and 50 end of

KDM4C (Figure 6B). Although present in every tumor type, it

was only statistically significant in AMPAC (Table S6B). This

deletion resulted in a significant decrease in expression of

KDM4C as well as the upstream UHRF2 (Figure 6B, inserts).

Interestingly, overexpression of both genes has been associated

with a pro-growth effect on colon cancer cells (KDM4C) and a

much lower disease-free survival and overall survival in patients

with colon cancer (UHRF2) (Lu et al., 2014; Kim et al., 2014).

KDM4C forms also complexes with b-catenin (Kim et al., 2014;

Yamamoto et al., 2013).

DISCUSSION

This study compares the genetic constitution of ampullary can-

cer with two nearby tumor types with pathologic classification.

Historically, ampullary cancers have been classified as intestinal

, February 2, 2016 ª2016 The Authors 913

Page 9: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Figure 6. Copy-Number Alteration

(A) Nexus GISTIC analysis of copy-number alteration by anatomical site. Upper blue panel shows copy-number gains and the lower red panel shows copy-

number losses for each tumor type. Blue arrows demark changes characteristic of a given anatomical site.

(legend continued on next page)

914 Cell Reports 14, 907–919, February 2, 2016 ª2016 The Authors

Page 10: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

or pancreatobiliary subtypes based on immunohistochemistry

and/or cellular morphology. The genomic analysis mirrored

these results by the finding that some ampullary tumors exhibit

properties of intestinal tumors such as microsatellite instability,

ELF3 mutation, and disruption of WNT signaling.

We found that the classification approaches of the three peri-

ampullary tumors are often discordant with one another. No

unique molecular characteristics were specifically associated

with one tumor subtype or one tumor type. Interestingly patients

from each tumor type and subtype exhibited alterations in WNT

pathway genes, including nearly one-fourth of the CAC tumors

and one-fourth of the pancreatobiliary tumors. Other studies

using subtype classification different from ours report WNT

pathway mutations in AMPAC pancreatobiliary (PB) subtype

(Achille et al., 1996; Hechtman et al., 2015). Transcriptional

changes in AMPAC tumors in WNT signaling genes was

increased, as expected, in tumors with WNT mutation, reinforc-

ing a molecular dichotomy. With half of the patients across the

three tumor types harboring WNT mutation, this could impact

greatly the choice of treatment since several WNT pathway tar-

geted therapies are in development. Ampullary, duodenal and

distal bile duct adenocarcinoma could be regarded as a WNT

± entity from the perspective of treatment. Thus, the molecular

data suggest that clinical testing for WNT signaling status might

be beneficial to patients in the near future, making this a stepping

stone to personalized medicine.

The identification of ELF3 as a significantly mutated gene with

an inactivating mutation pattern is also of interest. It was re-

ported at lower frequency in bladder and biliary tract cancers,

but not in any other cancer so far. ELF3 encodes an ETS-domain

transcription factor. By interacting with promoter regions, ELF3

is implicated in the regulation of several genes during epithelial

cell differentiation (Oliver et al., 2012). One of the genes transac-

tivated by ELF3 is TGFBR2, a prime initiator of TGF-b signaling, a

pathway with a dual role in tumorigenesis, suppressing tumor

progression at early stages but enhancing invasion and metas-

tasis at later stages (Roberts and Wakefield, 2003). The tumor-

suppressor antiproliferative function of ELF3 was previously

noted in studies on colorectal, prostate, and oral squamous can-

cer cells (Iwai et al., 2008; Lee et al., 2003, 2008; Shatnawi et al.,

2014) andmore recently in biliary tract cancer cell line (Nakamura

et al., 2015). Such studies showed that ELF3 directly binds to the

promoter region of EGR1 (Lee et al., 2008) and TGFBR2 (Lee

et al., 2003), increasing the transcription of these two tumor

suppressor genes in CRC, whereas ELF3 binding to androgen

receptor (AR) (Shatnawi et al., 2014) and matrix metalloprotei-

nase-9 (MMP9) (Iwai et al., 2008) promoters suppressed the

transcriptional activity of these tumor growth- and invasive-

ness-promoting genes in prostate and squamous cancers,

respectively. However, recent observations also suggested an

oncogenic functional role in CRC development when ELF3 is

amplified, and its upregulated expression correlated with cancer

progression and decreased patient survival (Wang et al., 2014).

(B) Focal deletion in the promotor region and at the 50 end of KMD4C impacts its an

(Integrative Genomics Viewer, Broad Institute). Deletions are in blue, and amplifica

groups: samples with (1) or without (2) focal deletion. The color ladder on the righ

See also Tables S6A and S6B.

C

A WNT-independent CTNNB1 transactivation facilitating tumor

development was also reported (Wang et al., 2014). Such dual

function has also been observed in breast and prostate cancer

(Longoni et al., 2013; Oliver et al., 2012; Shatnawi et al., 2014).

It could be argued that when ELF3 inactivation occurs early in tu-

mor development, it provides a moderate growth advantage by

suppressing TGF-b signaling. The fact that we found ELF3muta-

tion in a duodenal adenoma with intraepithelial neoplasia and

dysplasia components (DUOAC 707) and that 75%of the tumors

with ELF3mutation were lower-grade tumors (stage I or II) could

support this hypothesis. The ELF3 functional switch might

depend on tumor stage and expression of other factors and/or

be associated with its expression level, some genes being trans-

activated only when ELF3 is overexpressed. In any case, ELF3 is

implicated in the development of periampullary tumors, and its

exact functional role during periampullary tumor development

will need to be investigated further.

EXPERIMENTAL PROCEDURES

Clinical Data

A total of 160 tumors (98 AMPAC, 44 CAC, and 18 DUOAC) were collected by

the different groups participating in this study: Australian Pancreatic Cancer

Genome Initiative (APGI), Baylor College of Medicine Elkins Pancreatic Center

(BCM) as a member of The Cancer Research Banking, MD Anderson Cancer

Center (MDA), and Technical University of Dresden (TUD). Ethical approval

was obtained from each of these institution’s research ethic boards. All

patients underwent surgical pancreatoduodenectomy with curative intent

without known residual disease. Clinical data variables including race, sex,

age, familial history, operative procedure, pathological findings, and survival

from the date of initial surgery to the date of death or last follow up are pre-

sented in Table S1A.

Tumor Classification

A section of the tumor was fixed in formaldehyde and embedded in paraffin

(FFPE). H&E-stained and immunohistochemistry slides from the FFPE tissue

were examined by the pathologists from the original site of collection to

confirm diagnosis of the specific tumor section and to grade expression of

subtype markers. All slides were then centrally reviewed by a single patholo-

gist (A.G.) who was blinded to all clinical, molecular, and pathological data

at the time of review and scoring. The distinction between pancreatic, biliary,

ampullary or intestinal carcinomawas based on the anatomical site fromwhich

the carcinoma was thought to arise using the guidelines recommended in the

AJCC seventh edition 2009 staging manual (Edge et al., 2009).

Histology and Morphology

Tumors were classified as pancreatobiliary, intestinal, or mixed morphological

subtype based on the cellular morphology. A morphology similar to colorectal

adenocarcinoma (tall often pseudostratified columnar epithelium with oval

nuclei forming elongated glands) was defined as intestinal type. Morphology

similar to pancreatobiliary carcinoma (small solid nest of cells with rounded

nuclei surrounded by desmoplastic stroma and forming simple or branching

rounded glands) was defined as pancreatobiliary type. Mixed histological

types contained a mixture of both intestinal and pancreatobiliary types with

80% or less of the cells with either morphology. Grade of differentiation was

also noted aswell as presence of adenoma, signet-ring cells, ormucinous cells

(Table S1B).

dUHRF2 expression. Human Omni 2.5 SNP array results were analyzed in IGV

tions are in red. Gene expressionwas analyzed by dividing the samples into two

t indicates the tumor type (pink, AMPAC; purple, CAC; and orange, DUOAC).

ell Reports 14, 907–919, February 2, 2016 ª2016 The Authors 915

Page 11: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Immunohistochemistry Staining

FFPE sections were stained with antibodies against MUC1 and CDX2 (see

Supplemental Experimental Procedures). This two-antibody panel has previ-

ously been validated by our group to predict prognosis in ampullary carcinoma

(Chang et al., 2013), and the methods we used were the same as employed in

that study. Briefly, expression was evaluated by estimating the percentage of

positively stained carcinoma cells and the intensity of the staining (0 absent, 1+

weak, 2+, 3+ strong). H scores were calculated for bothmarkers bymultiplying

the percentage of stained cells by the intensity of the staining. The ratio of the

CDX2/MUC1 H score defined the subtypes: a ratio of 2 and above and smaller

than 0.5 were considered intestinal and pancreatobiliary, respectively. Inter-

mediate values were associated to a mixed subtype (Table S1B).

Nucleic Acid Isolation

Samples were retrieved and had full face sectioning performed in OCT embed-

ding media to verify the presence of carcinoma in the sample to be sequenced

and to estimate the percentage of malignant epithelial nuclei in the sample

relative to stromal nuclei. Macrodissection was performed if possible to excise

areas of non-malignant tissue. DNA and RNA extraction was performed at the

center of collection following their own protocol with all samples being tracked

using unique identifiers though out the process (see Supplemental Methods).

DNAwas shipped and quantified at BCM-Human Genome Sequencing Center

(HGSC) using the PicoGreen DNA Assay.

SNP Array Assays

SNP arrays were processed at the HGSC for each sample using the Illumina

Infinium LCG Assay according to the manufacturer’s guidelines. Specifically,

assays were performed with Human Omni2.5-8 BeadChips (Illumina, catalog

no.WG-311-2513), interrogating 2.5million SNP loci with aMAF detection limit

of 1% (see Supplemental Experimental Procedures). SNP calls were collected

using Illumina’s GenomeStudio software (version 2011.1) in which standard

SNP clustering and genotyping were performed with the default settings rec-

ommended by the manufacturer. Data from samples that met a minimum SNP

call rate of 0.9 were considered passing and were included in subsequent an-

alyses. Results were analyzed on Nexus (BioDiscovery).

Sequencing

Library preparation, whole (Bainbridge et al., 2011) and targeted exome cap-

ture, and regular and ultra-deep sequencing on HiSeq 2000 platform are

detailed in Supplemental Experimental Procedures. In brief, 152 samples

were whole-exome sequenced and their mutations validated with a custom

design targeted exome capture. The targeted capture consisted of a panel

of 71 genes covering 0.25 Mb, and the probes were designed by Nimblegen

(genes are listed in Supplemental Experimental Procedures). These genes

were selected on the basis they were significantly mutated and/or had

high impact in the development of AMPAC, PDAC, DUOAC, CAC, and other

pancreatic tumor types. The selective targeted capture was also used in dis-

covery on eight samples received at a later date (six samples) or of low purity

(two samples with <10% tumor). The mutations identified with the targeted

capture were validated with ultra-deep (single-molecule reconstruction)

sequencing.

Data Analysis

Primary Data Analysis

Initial sequence analysis was performed using the HGSC Mercury analysis

pipeline (https://www.hgsc.bcm.edu/software/mercury). First, the primary

analysis software on the instrument produces .bcl files that are transferred

off-instrument into the HGSC analysis infrastructure by the HiSeq Real-time

Analysis module. Once the run is complete and all .bcl files are transferred,

Mercury runs the vendor’s primary analysis software (CASAVA), which demul-

tiplexes pooled samples and generates sequence reads and base-call

confidence values (qualities). The next step is the mapping of reads to the

GRCh37 Human reference genome (http://www.ncbi.nlm.nih.gov/projects/

genome/assembly/grc/human/) using the Burrows-Wheeler aligner (Li and

Durbin, 2009) (BWA; http://bio-bwa.sourceforge.net/) and producing a BAM

(Li et al., 2009) (binary alignment/map) file. The third step involves quality reca-

libration (using GATK; DePristo et al., 2011; https://www.broadinstitute.org/

916 Cell Reports 14, 907–919, February 2, 2016 ª2016 The Authors

gatk/) and, where necessary, the merging of separate sequence-event

BAMs into a single sample-level BAM. BAM sorting, duplicate-read marking,

and realignment to improve in/del discovery all occur at this step.

Cancer Data Analysis

Primary BAM files were separately run through Atlas-SNP (Shen et al., 2010),

Atlas-Indel, and PInDel (Ye et al., 2009). Data were aggregated for each tumor/

normal pair, and variants were cross-checked for each tissue pair. Variant

annotation was performed using Annovar (Wang et al., 2010), COSMIC (Forbes

et al., 2011), and dbSNP (Sherry et al., 2001). Variant filtering was performed to

remove low-quality variants. Cohort-level data processing was performed to

remove additional false somatic calls by filtering against a cohort of normal

tissues.

Ultra-deep Sequencing Analysis

Duplicate reads were aggregated and consensus variants were defined as the

variant being present in 90% of the reads contributing to both halves of the

duplex molecule. Subsequent filtering was employed to remove variants in

which there was either mapping error (tested using BLAST) or sequence error

non-consensus block rate of 50% (Altschul et al., 1990). Variants detected in

this way were annotated using ANNOVAR, COSMIC, and dbSNP annotations

(Forbes et al., 2011; Sherry et al., 2001; Wang et al., 2010).

Mutational Signature

Mutation signatureswere generated froma set of over 6,000 somaticmutations

across a range of cancer types using non-smooth non-negative matrix factor-

ization (nsNMF) (K.R.C., unpublished data; Pascual-Montano et al., 2006). The

solution resulted in 21distinctmutational signatures, similar to those previously

reported (Alexandrov et al., 2013; Gaujoux and Seoighe, 2010), many of which

could be correlated with previously published mutational modes, including

APOBEC, UV radiation exposure, POLE hypermutation (Lawrence et al.,

2013), and CpG island mutation. Mutations for this cohort where compared

against the solved NMF to generate a mutational decomposition for each of

the tumor samples. Samples were aggregated and compared using hierarchi-

cal clustering and other correlative statistics to clinical covariates.

Tumor Purity and Normalization Mutation Rate

Tumor purity was estimated using ASCAT and the tumor variant allelic fraction

of driver genes. The average of both analyses was plotted against the number

of mutations in each tumor, and the slope value was used to approximate the

number of mutations that would have been identified in 100% tumor cellularity

(Figure S1A).

Significantly Altered Genes

Several approaches were taken to dissect genes and pathways which

were mutated more often than by chance in this dataset. We used the final

MAF file (Tables S6A and 6B) to calculate significantly mutated genes using

MutSig-CV and an inactivation bias test (Lawrence et al., 2013; Totoki et al.,

2014).

Microsatellite Instability Coefficient

For details, see Supplemental Experimental Procedures.

Multivariable Cox Analysis

Cox proportional hazards analysis was performed using the survival (Ther-

neau, 2000) package in R (R Development Core Team, 2008). We included

age at diagnosis, gender, stage, grade, tumor type, histologic subtype (IHC),

and mutation status (WNT, TGFB, TP53, KRAS, PI3K, and chromatin remodel-

ing) in the multivariate Cox analysis. Country of origin and ethnicity were not

included as covariates since they had no associated effect with survival.

RNA-Seq

Total RNA was prepared using the AllPrep RNA/DNA isolation kit (QIAGEN).

RNA integrity was confirmed (RIN > 7.0) on a Bioanalyzer (Agilent). RNA-seq

libraries were prepared using the TruSeq Stranded total RNA LT library kit

(Illumina) following the manufacturer’s instructions. 100-base-pair-end

sequencing was then performed to a minimum depth of 50 million reads of

each sample on an Illumina HiSeq2000 sequencer.

Transcript Expression Analysis

Gene Expression of the WNT Pathway

The profiles in the RNA-seq dataset were quantile normalized, and log-trans-

formed expression values were then centered to SDs from the median across

Page 12: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

sample profiles. Tumors were split into two groups (those withWNT canonical

pathway mutations and those without such mutations) and were scored for

relative activity in theWNT pathway. The gene signature score within each tu-

mor profile was defined as the average of the centered values for the WNT

signature genes.

Fusion Analysis

The deFuse software version 0.6.1 (McPherson et al., 2011) with default set-

tings was used to detect fusion genes. The deFuse results were further filtered

by removing identified read through fusions, selecting coding regions, select-

ing in-frame (open reading frame) genes and selecting samples with a deFuse

confidence score of >80%. This filtering resulted in a list of candidate fusion

genes. To characterize these candidate fusion genes, we took each spanning

junction read and using the BLAT tool in UCSC genome browser examined

where the reads mapped. The fusions that mapped with 100% identity to

each part of the identified fusion (gene1 or gene2) were selected for further

analysis. This filter removed genes that mapped to multiple locations. Next,

each RNA BAM from candidate fusion genes was examined in IGV, looking

for stacked soft clipped reads, changes in coverage, at the identified fusion

breakpoints. The sequence of each soft clipped read was brought into the

UCSC genome browser and mapped using BLAT. Only fusions that had reads

that matched (100%) the identified fusion genes were considered further.

ACCESSION NUMBERS

The accession number for the sequence data reported in this paper is dbGAP:

PRJNA280134.

SUPPLEMENTAL INFORMATION

Supplemental Information includes Supplemental Experimental Procedures,

four figures, and six tables and can be found with this article online at http://

dx.doi.org/10.1016/j.celrep.2015.12.005.

CONSORTIA

Themembers of Australian Pancreatic Cancer Genome Initiative are Andrew V.

Biankin, Amber L. Johns, Amanda Mawson, David K. Chang, Christopher J.

Scarlett, Mary-Anne L. Brancato, Sarah J. Rowe, SkyeH. Simpson,MonaMar-

tyn-Smith, Michelle T. Thomas, Lorraine A. Chantrill, Venessa T. Chin, Angela

Chou, Mark J. Cowley, Jeremy L. Humphris, Marc D. Jones, R. Scott Mead,

Adnan M. Nagrial, Marina Pajic, Jessica Pettit, Mark Pinese, Ilse Rooman,

Jianmin Wu, Jiang Tao, Renee DiPietro, Clare Watson, Angela Steinmann,

Hong Ching Lee, Rachel Wong, Andreia V. Pinho, Marc Giry-Laterriere, Roger

J. Daly, Elizabeth A. Musgrove, and Robert L. Sutherland (Garvan Institute of

Medical Research, Australia); Sean M. Grimmond, Nicola Waddell, Karin S.

Kassahn, David K. Miller, Peter J. Wilson, Ann-Marie Patch, Sarah Song,

Ivon Harliwong, Senel Idrisoglu, Craig Nourse, Ehsan Nourbakhsh, Suzanne

Manning, Shivangi Wani, Milena Gongora, Matthew Anderson, Oliver Holmes,

Conrad Leonard, Darrin Taylor, ScottWood, Christina Xu, Katia Nones, J. Lynn

Fink, Angelika Christ, Tim Bruxner, Nicole Cloonan, Felicity Newell, John V.

Pearson, Peter Bailey, Michael Quinn, Shivashankar Nagaraj, Stephen Kazak-

off, Nick Waddell, Keerthana Krisnan, Kelly Quek, and David Wood (Queens-

land Centre for Medical Genomics and Institute for Molecular Biosciences,

Australia); Jaswinder S. Samra, Anthony J. Gill, Nick Pavlakis, Alex Guminski,

and Christopher Toon (Royal North Shore Hospital, Australia); Ray Asghari,

Neil D. Merrett, Darren Pavey, and Amitabha Das (Bankstown Hospital,

Australia); Peter H. Cosman, Kasim Ismail, and Chelsie O’Connnor (Liverpool

Hospital, Australia); Vincent W. Lam, DuncanMcLeod, Henry C. Pleass, Arthur

Richardson, and Virginia James (Westmead Hospital, Australia); James G.

Kench, Caroline L. Cooper, David Joseph, Charbel Sandroussi, Michael Craw-

ford, and James Gallagher (Royal Prince Alfred Hospital, Australia); Michael

Texler, Cindy Forest, Andrew Laycock, Krishna P. Epari, Mo Ballal, David R.

Fletcher, and Sanjay Mukhedkar (Fremantle Hospital, Australia); Nigel A.

Spry, Bastiaan DeBoer, and Ming Chai (Sir Charles Gairdner Hospital,

Australia); Nikolajs Zeps, Maria Beilin, and Kynan Feeney (St John of God

Healthcare, Australia); NanQNguyen, AndrewR. Ruszkiewicz, ChrisWorthley,

C

Chuan P. Tan, and Tamara Debrencini (Royal Adelaide Hospital, Australia);

John Chen, Mark E. Brooke-Smith, and Virginia Papangelis (Flinders Medical

Centre, Australia); Henry Tang and Andrew P. Barbour (Greenslopes Private

Hospital, Australia); Andrew D. Clouston and Patrick Martin (Envoi Pathology,

Australia); Thomas J. O’Rourke, Amy Chiang, Jonathan W. Fawcett, Kellee

Slater, Shinn Yeung, Michael Hatzifotis, and Peter Hodgkinson (Princess Alex-

andria Hospital, Australia); Christopher Christophi, Mehrdad Nikfarjam, and

Angela Mountain; Victorian Cancer Biobank (Australia); James R. Eshleman,

Ralph H. Hruban, Anirban Maitra, Christine A. Iacobuzio-Donahue, Richard

D. Schulick, Christopher L. Wolfgang, Richard A. Morgan, and Mary Hodgin

(Johns Hopkins Medical Institutes, USA); Aldo Scarpa, Rita T. Lawlor, Stefania

Beghelli, Vincenzo Corbo,Maria Scardoni, andClaudio Bassi (ARC-Net Centre

for AppliedResearch onCancer, Italy); Margaret A. Tempero (University of Cal-

ifornia, San Francisco, USA); David K. Chang (Garvan Institute of Medical

Research, Australia; University of Glasgow, UK; and Greater Glasgow and

Clyde National Health Service, UK); Sean M. Grimmond and Craig Nourse

(University of Glasgow, UK; Queensland Centre for Medical Genomics and

Institute for Molecular Biosciences, Australia); Elizabeth A. Musgrove (Univer-

sity of Glasgow, UK); Marc D. Jones (Garvan Institute of Medical Research,

Australia; and, University of Glasgow, UK); and Nigel B. Jamieson, Fraser R.

Duthie, and Janet S. Graham (University of Glasgow, UK; andGreater Glasgow

and Clyde National Health Service, UK).

AUTHOR CONTRIBUTIONS

The research network comprises Baylor College of Medicine Human Genome

Sequencing Center, Australian Pancreatic Cancer Genome Initiative, MD An-

derson Cancer Center, TU Dresden, and Glasgow as part of the International

Cancer Genome Consortium study. Each center contributed biospecimens

collected at affiliated hospitals and processed at each biospecimen core

resource centre. Data generation and analyses were performed by the Human

Genome Sequencing Center. Investigator contributions are as follows: project

leader: M.C.G., C.P., R.G., M.J.O., S.M.G., A.V.B., D.A.W., and R.A.G.; writing

team: M.C.G. and D.A.W.; bioinformatics/databases: J.D., K.W., O.A.H., L.X.,

M.D., and P.B.; sequencing: D.M.M, H.D., S.L.L., M.B., J.H., Y.H., and H.H.D.;

data analysis: M.C.G., K.R.C., D.K.C., L.A.D., C.J.C, E.S., N.D., J.S.S., J.V.P.,

and C.S.; surgery: D.K.C, W.E.F, R.G., N.B.J., and G.V.B.; sample collection

and clinical annotation: A.L.J., C.P., S.E.H., and A.M.; sample processing

and quality control: M.C.G., A.L.J., C.P., R.G., S.L.L., and H.W.; and pathology

assessment: A.J.G., M.M.I., H.W., D.A., K.O., R.H.H., and F.R.D.

ACKNOWLEDGMENTS

We acknowledge the following funding support: HGSC-BCM: NHGRI U54

HG003273 and CPRIT grant RP101353-P7 (Tumor Banking for Genomic

Research and Clinical Translation Site 1); D.W.A. CPRIT grant RPI21018;

M.M.I. and C.J.C.: Dan L. Duncan Cancer Center NIH P30 Cancer Center

support grant (P30 CA125123) supporting the BCM Human Tissue Acquisi-

tion and Pathology Core and the Biostatistics and Bioinformatics Core;

M.J.O.: Kavanagh Family Foundation. Australian Pancreatic Cancer

Genome Initiative: National Health and Medical Research Council of

Australia (NHMRC; 631701, 535903, 427601); Queensland Government

(NIRAP); University of Queensland; Institute for Molecular Bioscience; Can-

cer Research UK (C596/A18076, C29717/A17263); University of Glasgow;

Australian Government: Department of Innovation, Industry, Science and

Research (DIISR); Australian Cancer Research Foundation (ACRF); Cancer

Council NSW: (SRP06-01, SRP11-01. ICGC); Cancer Institute NSW: (10/

ECF/2-26; 06/ECF/1-24; 09/CDF/2-40; 07/CDF/1-03; 10/CRF/1-01, 08/

RSA/1-15, 07/CDF/1-28, 10/CDF/2-26,10/FRL/2-03, 06/RSA/1-05, 09/RIG/

1-02, 10/TPG/1-04, 11/REG/1-10, 11/CDF/3-26); Garvan Institute of Medical

Research; Avner Nahmani Pancreatic Cancer Research Foundation; Howat

Foundation; R.T. Hall Trust; Petre Foundation; Philip Hemstritch Foundation;

Gastroenterological Society of Australia (GESA); American Association for

Cancer Research (AACR) Landon Foundation – INNOVATOR Award; Royal

Australasian College of Surgeons (RACS); Royal Australasian College of

Physicians (RACP); Royal College of Pathologists of Australasia (RCPA); Ital-

ian Ministry of Research (Cancer Genome Project FIRB RBAP10AHJB);

ell Reports 14, 907–919, February 2, 2016 ª2016 The Authors 917

Page 13: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Associazione Italiana Ricerca Cancro (12182); Fondazione Italiana Malattie

Pancreas – Ministero Salute (CUP_J33G13000210001); Wilhelm Sander Stif-

tung 2009.039.2; NIH grant P50 CA62924. See the Supplemental Informa-

tion for further acknowledgments.

Received: April 14, 2015

Revised: October 30, 2015

Accepted: November 19, 2015

Published: January 21, 2016

REFERENCES

Achille, A., Scupoli, M.T., Magalini, A.R., Zamboni, G., Romanelli, M.G., Orlan-

dini, S., Biasi, M.O., Lemoine, N.R., Accolla, R.S., and Scarpa, A. (1996). APC

gene mutations and allelic losses in sporadic ampullary tumours: evidence of

genetic difference from tumours associated with familial adenomatous polyp-

osis. Int. J. Cancer 68, 305–312.

Adsay, V., Ohike, N., Tajiri, T., Kim, G.E., Krasinskas, A., Balci, S., Bagci, P.,

Basturk, O., Bandyopadhyay, S., Jang, K.T., et al. (2012). Ampullary region

carcinomas: definition and site specific classification with delineation of four

clinicopathologically and prognostically distinct subsets in an analysis of 249

cases. Am. J. Surg. Pathol. 36, 1592–1608.

Alexandrov, L.B., Nik-Zainal, S., Wedge, D.C., Aparicio, S.A., Behjati, S., Bian-

kin, A.V., Bignell, G.R., Bolli, N., Borg, A., Børresen-Dale, A.L., et al.; Australian

Pancreatic Cancer Genome Initiative; ICGC Breast Cancer Consortium; ICGC

MMML-Seq Consortium; ICGC PedBrain (2013). Signatures of mutational pro-

cesses in human cancer. Nature 500, 415–421.

Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. (1990). Basic

local alignment search tool. J. Mol. Biol. 215, 403–410.

Amptoulach, S., Josefsson, A., Kavantzas, N., and Kalaitzakis, E. (2011).

Adenocarcinoma of the ampulla of Vater: does the histopathologic type mat-

ter? Scand. J. Gastroenterol. 46, 1401–1403.

Bainbridge, M.N., Wang, M., Wu, Y., Newsham, I., Muzny, D.M., Jefferies, J.L.,

Albert, T.J., Burgess, D.L., and Gibbs, R.A. (2011). Targeted enrichment

beyond the consensus coding DNA sequence exome reveals exons with

higher variant densities. Genome Biol. 12, R68.

Biankin, A.V., Waddell, N., Kassahn, K.S., Gingras, M.C., Muthuswamy, L.B.,

Johns, A.L., Miller, D.K., Wilson, P.J., Patch, A.M., Wu, J., et al.; Australian

Pancreatic Cancer Genome Initiative (2012). Pancreatic cancer genomes

reveal aberrations in axon guidance pathway genes. Nature 491, 399–405.

Cancer Genome Atlas, N.; Cancer Genome Atlas Network (2012). Comprehen-

sive molecular characterization of human colon and rectal cancer. Nature 487,

330–337.

Cerami, E., Gao, J., Dogrusoz, U., Gross, B.E., Sumer, S.O., Aksoy, B.A., Ja-

cobsen, A., Byrne, C.J., Heuer, M.L., Larsson, E., et al. (2012). The cBio cancer

genomics portal: an open platform for exploring multidimensional cancer

genomics data. Cancer Discov. 2, 401–404.

Chang, D.K., Jamieson, N.B., Johns, A.L., Scarlett, C.J., Pajic, M., Chou, A.,

Pinese, M., Humphris, J.L., Jones, M.D., Toon, C., et al. (2013). Histomolecular

phenotypes and outcome in adenocarcinoma of the ampulla of vater. J. Clin.

Oncol. 31, 1348–1356.

DePristo, M.A., Banks, E., Poplin, R., Garimella, K.V., Maguire, J.R., Hartl, C.,

Philippakis, A.A., del Angel, G., Rivas, M.A., Hanna, M., et al. (2011). A frame-

work for variation discovery and genotyping using next-generation DNA

sequencing data. Nat. Genet. 43, 491–498.

Donehower, L.A., Creighton, C.J., Schultz, N., Shinbrot, E., Chang, K., Gunar-

atne, P.H., Muzny, D., Sander, C., Hamilton, S.R., Gibbs, R.A., andWheeler, D.

(2013). MLH1-silenced and non-silenced subgroups of hypermutated

colorectal carcinomas have distinct mutational landscapes. J. Pathol. 229,

99–110.

Edge, S.E., Byrd, D.R., and Compton, C.C. (2009). AJCC Cancer Staging

Manual, Seventh Edition (New York: Springer).

Ehehalt, F., R€ummele, P., Kersting, S., Lang-Schwarz, C., R€uckert, F., Hart-

mann, A., Dietmaier, W., Terracciano, L., Aust, D.E., Jahnke, B., et al. (2011).

918 Cell Reports 14, 907–919, February 2, 2016 ª2016 The Authors

Hepatocyte nuclear factor (HNF) 4a expression distinguishes ampullary can-

cer subtypes and prognosis after resection. Ann. Surg. 254, 302–310.

Forbes, S.A., Bindal, N., Bamford, S., Cole, C., Kok, C.Y., Beare, D., Jia, M.,

Shepherd, R., Leung, K., Menzies, A., et al. (2011). COSMIC: mining complete

cancer genomes in the Catalogue of Somatic Mutations in Cancer. Nucleic

Acids Res. 39, D945–D950.

Gao, J., Aksoy, B.A., Dogrusoz, U., Dresdner, G., Gross, B., Sumer, S.O., Sun,

Y., Jacobsen, A., Sinha, R., Larsson, E., et al. (2013). Integrative analysis of

complex cancer genomics and clinical profiles using the cBioPortal. Sci.

Signal. 6, pl1.

Gaujoux, R., and Seoighe, C. (2010). A flexible R package for nonnegative ma-

trix factorization. BMC Bioinformatics 11, 367.

Hechtman, J.F., Liu, W., Sadowska, J., Zhen, L., Borsu, L., Arcila, M.E., Won,

H.H., Shah, R.H., Berger, M.F., Vakiani, E., et al. (2015). Sequencing of 279

cancer genes in ampullary carcinoma reveals trends relating to histologic sub-

types and frequent amplification and overexpression of ERBB2 (HER2). Mod.

Pathol. 28, 1123–1129.

Heinrich, S., and Clavien, P.A. (2010). Ampullary cancer. Curr. Opin. Gastroen-

terol. 26, 280–285.

Iwai, S., Amekawa, S., Yomogida, K., Sumi, T., Nakazawa, M., Yura, Y.,

Nishimune, Y., and Nozaki, M. (2008). ESE-1 inhibits the invasion of oral

squamous cell carcinoma in conjunction with MMP-9 suppression. Oral Dis.

14, 144–149.

Kim, T.D., Fuchs, J.R., Schwartz, E., Abdelhamid, D., Etter, J., Berry, W.L., Li,

C., Ihnat, M.A., Li, P.K., and Janknecht, R. (2014). Pro-growth role of the

JMJD2C histone demethylase in HCT-116 colon cancer cells and identification

of curcuminoids as JMJD2 inhibitors. Am. J. Transl. Res. 6, 236–247.

Kumar-Sinha, C., Kalyana-Sundaram, S., and Chinnaiyan, A.M. (2012).

SLC45A3-ELK4 chimera in prostate cancer: spotlight on cis-splicing. Cancer

Discov. 2, 582–585.

Lawrence, M.S., Stojanov, P., Polak, P., Kryukov, G.V., Cibulskis, K., Siva-

chenko, A., Carter, S.L., Stewart, C., Mermel, C.H., Roberts, S.A., et al.

(2013). Mutational heterogeneity in cancer and the search for new cancer-

associated genes. Nature 499, 214–218.

Lawrence, M.S., Stojanov, P., Mermel, C.H., Robinson, J.T., Garraway, L.A.,

Golub, T.R., Meyerson, M., Gabriel, S.B., Lander, E.S., and Getz, G. (2014).

Discovery and saturation analysis of cancer genes across 21 tumour types.

Nature 505, 495–501.

Lee, H.J., Chang, J.H., Kim, Y.S., Kim, S.J., and Yang, H.K. (2003). Effect of

ets-related transcription factor (ERT) on transforming growth factor (TGF)-

beta type II receptor gene expression in human cancer cell lines. J. Exp.

Clin. Cancer Res. 22, 477–480.

Lee, S.H., Bahn, J.H., Choi, C.K., Whitlock, N.C., English, A.E., Safe, S., and

Baek, S.J. (2008). ESE-1/EGR-1 pathway plays a role in tolfenamic acid-

induced apoptosis in colorectal cancer cells. Mol. Cancer Ther. 7, 3739–3750.

Li, H., and Durbin, R. (2009). Fast and accurate short read alignment with Bur-

rows-Wheeler transform. Bioinformatics 25, 1754–1760.

Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G.,

Abecasis, G., and Durbin, R.; 1000 Genome Project Data Processing Sub-

group (2009). The Sequence Alignment/Map format and SAMtools. Bioinfor-

matics 25, 2078–2079.

Longoni, N., Sarti, M., Albino, D., Civenni, G., Malek, A., Ortelli, E., Pinton, S.,

Mello-Grand, M., Ostano, P., D’Ambrosio, G., et al. (2013). ETS transcription

factor ESE1/ELF3 orchestrates a positive feedback loop that constitutively

activates NF-kB and drives prostate cancer progression. Cancer Res. 73,

4533–4547.

Lu, S., Yan, D., Wu, Z., Jiang, T., Chen, J., Yuan, L., Lin, J., Peng, Z., and Tang,

H. (2014). Ubiquitin-like with PHD and ring finger domains 2 is a predictor of

survival and a potential therapeutic target in colon cancer. Oncol. Rep. 31,

1802–1810.

Martinez, V.D., Thu, K.L., Vucic, E.A., Hubaux, R., Adonis, M., Gil, L.,

MacAulay, C., Lam, S., and Lam, W.L. (2013). Whole-genome sequencing

Page 14: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

analysis identifies a distinctive mutational spectrum in an arsenic-related lung

tumor. J. Thorac. Oncol. 8, 1451–1455.

McPherson, A., Hormozdiari, F., Zayed, A., Giuliany, R., Ha, G., Sun, M.G.,

Griffith, M., Heravi Moussavi, A., Senz, J., Melnyk, N., et al. (2011). deFuse:

an algorithm for gene fusion discovery in tumor RNA-Seq data. PLoS Comput.

Biol. 7, e1001138.

Morini, S., Perrone, G., Borzomati, D., Vincenzi, B., Rabitti, C., Righi, D., Castri,

F., Manazza, A.D., Santini, D., Tonini, G., et al. (2013). Carcinoma of the

ampulla of Vater: morphological and immunophenotypical classification pre-

dicts overall survival. Pancreas 42, 60–66.

Nakamura, H., Arai, Y., Totoki, Y., Shirota, T., Elzawahry, A., Kato, M., Hama,

N., Hosoda, F., Urushidate, T., Ohashi, S., et al. (2015). Genomic spectra of

biliary tract cancer. Nat. Genet. 47, 1003–1010.

Oliver, J.R., Kushwah, R., and Hu, J. (2012). Multiple roles of the epithelium-

specific ETS transcription factor, ESE-1, in development and disease. Lab.

Invest. 92, 320–330.

Pascual-Montano, A., Carazo, J.M., Kochi, K., Lehmann, D., and Pascual-Mar-

qui, R.D. (2006). Nonsmooth nonnegative matrix factorization (nsNMF). IEEE

Trans. Pattern Anal. Mach. Intell. 28, 403–415.

Paterson, A.L., Weaver, J.M., Eldridge, M.D., Tavare, S., Fitzgerald, R.C., and

Edwards, P.A.; OCCAMs Consortium (2015). Mobile element insertions are

frequent in oesophageal adenocarcinomas and can mislead paired-end

sequencing analysis. BMC Genomics 16, 473.

R Development Core Team (2008). R: A Language and Environment for Statis-

tical Computing (R Foundation for Statistical Computing).

Ren, G., Zhang, Y., Mao, X., Liu, X., Mercer, E., Marzec, J., Ding, D., Jiao, Y.,

Qiu, Q., Sun, Y., et al. (2014). Transcription-mediated chimeric RNAs in pros-

tate cancer: time to revisit old hypothesis? Omics 18, 615–624.

Roberts, A.B., and Wakefield, L.M. (2003). The two faces of transforming

growth factor beta in carcinogenesis. Proc. Natl. Acad. Sci. USA 100, 8621–

8623.

Roberts, S.A., Lawrence, M.S., Klimczak, L.J., Grimm, S.A., Fargo, D., Stoja-

nov, P., Kiezun, A., Kryukov, G.V., Carter, S.L., Saksena, G., et al. (2013). An

APOBEC cytidine deaminase mutagenesis pattern is widespread in human

cancers. Nat. Genet. 45, 970–976.

Rodi�c, N., Steranka, J.P., Makohon-Moore, A., Moyer, A., Shen, P., Sharma,

R., Kohutek, Z.A., Huang, C.R., Ahn, D., Mita, P., et al. (2015). Retrotransposon

insertions in the clonal evolution of pancreatic ductal adenocarcinoma. Nat.

Med. 21, 1060–1064.

Romiti, A., Barucca, V., Zullo, A., Sarcina, I., Di Rocco, R., D’Antonio, C., La-

torre, M., and Marchetti, P. (2012). Tumors of ampulla of Vater: A case series

and review of chemotherapy options. World J. Gastrointest. Oncol. 4, 60–67.

C

Shatnawi, A., Norris, J.D., Chaveroux, C., Jasper, J.S., Sherk, A.B.,

McDonnell, D.P., and Giguere, V. (2014). ELF3 is a repressor of androgen

receptor action in prostate cancer cells. Oncogene 33, 862–871.

Shen, Y., Wan, Z., Coarfa, C., Drabek, R., Chen, L., Ostrowski, E.A., Liu, Y.,

Weinstock, G.M., Wheeler, D.A., Gibbs, R.A., and Yu, F. (2010). A SNP

discovery method to assess variant allele probability from next-generation

resequencing data. Genome Res. 20, 273–280.

Sherry, S.T., Ward, M.H., Kholodov, M., Baker, J., Phan, L., Smigielski, E.M.,

and Sirotkin, K. (2001). dbSNP: the NCBI database of genetic variation.

Nucleic Acids Res. 29, 308–311.

Therneau, T.M. (2000). Modeling Survival Data: Extending the Cox Model

(Springer-Verlag).

Thompson, E., Meldrum, C.J., Crooks, R., McPhillips, M., Thomas, L., Spigel-

man, A.D., and Scott, R.J. (2004). Hereditary non-polyposis colorectal cancer

and the role of hPMS2 and hEXO1 mutations. Clin. Genet. 65, 215–225.

Totoki, Y., Tatsuno, K., Covington, K.R., Ueda, H., Creighton, C.J., Kato, M.,

Tsuji, S., Donehower, L.A., Slagle, B.L., Nakamura, H., et al. (2014). Trans-

ancestry mutational landscape of hepatocellular carcinoma genomes. Nat.

Genet. 46, 1267–1273.

Wang, K., Li, M., and Hakonarson, H. (2010). ANNOVAR: functional annotation

of genetic variants from high-throughput sequencing data. Nucleic Acids Res.

38, e164.

Wang, J.L., Chen, Z.F., Chen, H.M., Wang, M.Y., Kong, X., Wang, Y.C., Sun,

T.T., Hong, J., Zou, W., Xu, J., and Fang, J.Y. (2014). Elf3 drives b-catenin

transactivation and associates with poor prognosis in colorectal cancer. Cell

Death Dis. 5, e1263.

Westgaard, A., Pomianowska, E., Clausen, O.P., and Gladhaug, I.P. (2013).

Intestinal-type and pancreatobiliary-type adenocarcinomas: how does ampul-

lary carcinoma differ from other periampullary malignancies? Ann. Surg.

Oncol. 20, 430–439.

Yamamoto, S., Tateishi, K., Kudo, Y., Yamamoto, K., Isagawa, T., Nagae, G.,

Nakatsuka, T., Asaoka, Y., Ijichi, H., Hirata, Y., et al. (2013). Histone demethy-

lase KDM4C regulates sphere formation by mediating the cross talk between

Wnt and Notch pathways in colonic cancer cells. Carcinogenesis 34, 2380–

2388.

Ye, K., Schulz, M.H., Long, Q., Apweiler, R., and Ning, Z. (2009). Pindel: a

pattern growth approach to detect break points of large deletions andmedium

sized insertions from paired-end short reads. Bioinformatics 25, 2865–2871.

Zaret, K.S., andGrompe,M. (2008). Generation and regeneration of cells of the

liver and pancreas. Science 322, 1490–1494.

ell Reports 14, 907–919, February 2, 2016 ª2016 The Authors 919

Page 15: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Cell Reports

Supplemental Information

Ampullary Cancers Harbor the Tumor Suppressor

Gene ELF3 and Exhibit Frequent WNT Dysregulation

Marie-Claude Gingras, Kyle R. Covington, David K. Chang, Lawrence A. Donehower,

Anthony J. Gill, Michael M. Ittmann, Chad J. Creighton, Amber L. Johns, Eve Shinbrot,

Ninad Dewal, William E. Fisher, Australian Pancreatic Cancer Genome Initiative,

Christian Pilarsky, Robert Grützmann, Michael J. Overman, Nigel B. Jamieson, George

Van Buren II, Jennifer Drummond, Kimberly Walker, Oliver A. Hampton, Liu Xi, Donna

M. Muzny, Harsha Doddapaneni, Sandra L. Lee, Michelle Bellair, Jianhong Hu, Yi Han,

Huyen H. Dinh, Mike Dahdouli, Jaswinder S. Samra, Peter Bailey, Nicola Waddell, John

V. Pearson, Ivon Harliwong, Huamin Wang, Daniela Aust, Karin A. Oien, Ralph H.

Hruban, Sally E. Hodges, Amy McElhany, Charupong Saengboonmee, Fraser R. Duthie,

Sean M. Grimmond, Andrew V. Biankin, David A. Wheeler, and Richard A. Gibbs

Page 16: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

A

B C

Figure S1. Related to Figure 1. A. Microsatellite instability coefficient. MSI tumors can be identified by the

frequency of INDEL mutations in homopolymer regions of specific genes (See Supplemental method). * patient

sequenced with targeted sequencing only. B. Correction factor of cellular purity. The average purity was plotted

against the number of mutations in each tumor samples. The normalized number of mutations for a 100% tumor

cellularity was estimated with the slope value. C. Mutation frequency. Unnormalized versus normalized mutation

frequencies for all patients by anatomical site and subtype. Black dots, non MSI; red dots, MSI; red diamonds, non

MSI average.

Supplemental Figures

Page 17: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

A

B

Figure S2. Related to Figure 2. A. Mutation signature in periampullary tumors. Heat map of the 21 mutation signatures

from non-negative factorization analysis of mutation spectrum for each subject. Intensity indicates the proportion of

mutations for that subject attributed to the indicated signature. Subjects are sorted by hierarchical clustering. B. Mutation

signature comparison. Non-negative factorization analysis of mutation spectrum of the 5 dominant mutation signatures for

each subject in tumor type (A: AMPAC; C: CAC; D: DUOAC) and subtype (I: intestinal; M mixed; PB: pancreatobiliary).

Page 18: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Figure S3. Related to Figure 4. Somatic mutations in major altered pathways in the tumors grouped by types

(A) and subtypes (B). Genetic alterations in the significantly mutated genes grouped by pathway are illustrated for

each patient. Note PI3K and WNT signaling pathways could be found in each tumor type and subtypes, including the

pancreatobilliary subtype. C. Pathway alteration effect on survival. Kaplan-Meier curve of survival among patient

with or without TGF-β and PI3K altered pathway. (mutated, red; non mutated, black)

TGF-B pathway mutations C

Subtype IHC markers

I I I I I I I I I I I I I I

Tumor type

Subtype morphology

APC

CTNNB1

SOX9

AXIN1

FBXW7

TGFBR2

ACVR1B

ELF3

SMAD4

SMAD3

PIK3CA

PIK3R1

PTEN

ERBB2

GNAS

ARID1A

ARID2

swi/snf ARID1B

SMARCA4

SMARCA2

PBRM1

KRAS

NF1

TP53

ATM

CDKN2A

PancreatobiliaryIntestinal Mixed

KRAS

signaling

TP53

signaling

TGFB

signaling

PIK

signaling

WNT

signaling

Tumor type

I I I I I I I I I I I I I I

Subtype morphology

Subtype IHC markers

APC

CTNNB1

SOX9

AXIN1

FBXW7

TGFBR2

ACVR1B

ELF3

SMAD4

SMAD3

PIK3CA

PIK3R1

PTEN

ERBB2

GNAS

ARID1A

ARID2

ARID1B

SMARCA4

SMARCA2

PBRM1

KRAS

NF1

TP53

ATM

CDKN2A

CAC

KRAS

signaling

TP53

signaling

DUOAC

WNT

signaling

TGFB

signaling

PIK

signaling

swi/snf

AMPACA

B

PI3K pathway mutations

I MSI Adenoma Purity<10% No driver mutation

Duodenal Ampullary Cholangiocarcinoma

Intestinal Mixed Pancreatobiliary

Nonsense INDEL Splice site Missense

Inactivated gene smg gene

Page 19: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Figure S4. Related to Figure 5. ELK4 and MET fusions. The exonic regions of each gene are represented and

numbered with the protein domain illustrated behind. FAM227B had a LINE element located in the intron that was

present in the fusion RNA.

Page 20: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Table S2. Related to Figure 2. Most prevalent Mutation Signature. Signatures present over 20%

of the total signature were considered as prevalent.

Related to Figure 1 and uploaded as an Excel file:

Table S1a. Summary of clinico-pathological variables and outcome listed by tumor type.

Table S1b. Tumor General and Subtyping information.

Table S1c. WES discovery and validation. 152 samples mutation file (maf).

Table S1d. Custom design targeted capture sequencing discovery and validation. Eight samples

mutation file (maf).

Table S1e. Mutation file (maf) of false negative undetected with the primary discovery method

Dominant ConcomitantDUOAC

n=14

CAC

n=38

AMPAC

n=38

1 _ _ 2 1

1 6 _ 1 2

4 _ _ 1 1

4 6 _ _ 1

5 _ _ _ 1

6 _ 11 19 58

6 1 _ 5 2

6 4 _ _ 1

6 5 _ 2 _

6 7 _ 2 _

6 12 _ _ 1

6 14 _ _ 1

6 18 1 _ _

7 _ _ 2 2

7 6 _ _ 1

3 _ _ _ 1

8 _ _ _ 1

2 4 9

Signature # Number of samples

no dominant sig

Supplemental Tables

Page 21: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Table S3b. Related to Figure 3. Significantly inactivated genes in periampullary MSS tumors

gene smg qinactivating

Mutstotal Muts chisq.fdr

geometric

mean

SMAD4 0 13 31 0.0001 0

KRAS 0 0 88 0.31 0

APC 6.25E-11 42 46 3.00E-60 1.37E-35

CDKN2A 8.38E-12 17 23 4.64E-17 1.97E-14

ELF3 1.59E-11 14 16 3.24E-17 2.27E-14

TP53 2.79E-12 41 104 4.93E-16 3.71E-14

ARID2 1.47E-06 12 15 7.23E-13 1.03E-09

SOX9 4.90E-06 9 9 1.57E-12 2.77E-09

ARID1A 1.61E-04 11 16 3.10E-09 7.06E-07

TGFBR2 1.61E-04 7 9 3.23E-06 2.28E-05

PIK3CA 1.73E-09 0 25 0.31 2.31E-05

PBRM1 1 7 8 1.82E-07 0.0004

ACVR1B 1.14E-05 7 15 0.02 0.0005

CTNNB1 6.98E-07 1 18 0.79 0.0007

PTEN 1.21E-05 3 10 0.31 0.002

ATM 1.15E-02 7 13 0.003 0.006

RECQL4 3.89E-01 6 8 0.0001 0.007

KDM6A 1 4 4 0.0005 0.02

AXIN1 2.93E-02 4 6 0.07 0.05

KIAA0040 2.01E-02 2 3 0.31 0.08

FBXW7 2.93E-02 4 11 0.31 0.10

TCEAL4 8.38E-02 0 7 0.79 0.26

Related to Figure 3 and uploaded as an Excel file:

Table S3a. MutSig generated smg list

Page 22: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Table S3c. Related to Figure 3. Missense variant expression

Pathway Gene name Total

with INDEL,

nonsense, or

splice

mutations

with

Missense

mutations

with both

mutation type

with

expressed

Missense

mutation

CTNNB1 3 0 3 0 3

APC2 1 0 1 0 0

FBXW7 3 2 2 1 2

FZD10 1 0 1 0 0

KRAS 17 0 17 0 16

NF1 4 1 3 0 3

PIK3CA 6 0 6 0 4

PIK3R1 3 2 1 0 1

PTEN 3 1 2 0 1

ERBB2 3 0 3 0 2

GNAS 2 0 2 0 1

TGFBR2 4 4 1 1 1

ACVR1B 2 0 2 0 1

SMAD4 7 2 6 1 6

TP53 18 9 10 1 9

ATM 4 3 2 1 2

CDKN2A 5 4 1 0 1

ARID2 4 3 2 1 2

SMARCA4 3 1 2 0 2

Sample Total and % expression: 67 57 85%

Without low cell. samples 65 57 88%

Sample with low cellularity

swi/snf

Sample number

WNT

signaling

RTK

signaling

TGFB

signaling

TP53

signaling

Page 23: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Table S3d. Related to Figure 3. Percentage of ELF3 inactivating mutations in different

tumor type

Table S5. Related to Figure 5. Fusion

Tumor type (studiesa) Count % Count %

Periampullary 160 17 10.6 14 8.75b

Bladder (BGI, MSKCC, TCGA, TCGA pub) 570 60 10.5 28 4.91b

Stomach (Pfizer, TCGA pub) 388 15 3.9 8 2.06

Cholangiocarcinoma (JHU, NCCS) 55 2 3.6 2 3.64

Cervical (TCGA) 194 7 3.6 2 1.03

pRCC (TCGA) 111 3 2.7 1 0.9

Colorectal (TCGA, TCGA pub, Genentech) 767 20 2.6 15 1.96

Lung adeno (Broad, TCGA PUB) 411 8 1.9 3 0.73

Melanoma (Broad, TCGA, Yale) 539 6 1.1 1 0.18

Esophagus (Broad, TCGA) 319 3 0.9 1 0.31

Pancreasc

457 3 0.7 1 0.22

Breast (BCCRC, TCGA, TCGA pub) 1,625 6 0.4 1 0.06

a data compiled by MSKCC cBio Portal http://www.cbioportal.org/

b significantly mutated

c Grimmond, unpublished

Non silent

mutation

Inactivating

mutationNumber

cases

Gene 1 Gene 2 SampleGene

chr 1

Genomic

break pos 1

Gene

chr 2

Genomic

break pos 2orf

read

throughprobability

SLC45A3 ELK4 AMP-2618 1 205630989 1 205593019 Y Y 0.90

FAM227B MET AMP-2117 15 49688526 7 116371722 Y N 0.76

Related to Figure 4 and uploaded as an Excel file:

Table S4a. Percentage mutation and chromosomal alteration

Table S4b. Percentage alteration (mutation and CNV combined) and statistical value (chi

square test)

Page 24: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Table S6a. Related to Figure 6a. Arm-level copy number alterations by anatomical site

ARM DUOAC AMPAC CAC

1p del + +

1q amp + +

3p del + + +

3q amp +

4p-q del + + +

5p amp +

5q del + + +

6p amp +

6p del +

6q del + + +

7p-q amp + + +

8p del + +

8q amp + + +

9p-q del + + +

13q amp + + +

14p-q del +

17q del + + +

18p-q del + + +

20p-q amp + + +

21p-q del + + +

22p-q del + + +

Related to Figure 4 and uploaded as an Excel file:

Table S6b. GISTIC analysis of CNA from Illumina Human Omni 2.5 SNP array

Page 25: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Supplemental Acknowledgements

Australian Pancreatic Cancer Genome Initiative: We would like to thank C. Axford, M.-A. Brancato, S. Rowe, M.

Thomas, S. Simpson and G. Hammond for central coordination of the Australian Pancreatic Cancer Genome

Initiative, data management and quality control; M. Martyn-Smith, L. Braatvedt, H. Tang, V. Papangelis and M.

Beilin for biospecimen acquisition; and Deborah Gwynne for support at the Queensland Centre for Medical

Genomics. We also thank M. Hodgins, M. Debeljak and D. Trusty for technical assistance at Johns Hopkins

University. N. Sperandio and D. Filippini for technical assistance at Verona University. Robert L. Sutherland –

author deceased during study.

Supplemental Experimental Procedures

Immunohistochemistry Staining

Primary antibodies and dilutions were: MUC1 1:200 (Ma695, Novocastra) and CDX2 1:100 (CDX2-88, Biocare

Medical, Concord, CA, USA). All slides were processed using an automated staining system – Leica BondIII

autostainer (Leica Microsystems, Wetzlar, Germany) according to the manufacturer’s instructions using the

manufacturer’s retrieval solutions. For MUC1 and CDX2 heat induced epitope retrieval was performed for 30

minutes in the alkaline retrieval solution ER2 (AR9640, Vision BioSystems.

DNA isolation

APGI and TUD: Nucleic acids were extracted using the Qiagen Allprep® Kit (Qiagen) in accordance with the

manufacturer’s instructions with purification of DNA and RNA from the same sample. BCM: Blood was collected

and isolated using the PaxGene collection and isolation system (Qiagen). Tumors were flashed frozen and DNA

isolated with the Puregene isolation kit (Qiagen). MDA: DNA was isolated using TRIZOL and QIAmp spin column

(Qiagen).

SNP Array Assays

In brief, 200 ng of genomic DNA was denatured with NaOH, followed by isothermal whole genome amplification at

37oC for 16-24 hours. The amplified DNA was enzymatically fragmented and hybridized to the BeadChip for 16-24

hours at 48oC (with 8 samples processed in parallel for each BeadChip). After a series of washing steps to remove

unhybridized and non-specifically hybridized DNA fragments, allele-specific single-base extension reactions were

performed to incorporate labeled nucleotides into the bead-bound primers. A multi-layer staining process was

conducted to amplify signals from the labeled extended primers, and then the coated beads were imaged with the

Illumina iScan system.

Sequencing design

A total of 160 samples were used in the study. Whole exome (WES) and custom design targeted capture sequencing

methods were both used in the discovery phase based on the tumor purity or the time the samples were received.

WES was performed on 152 samples and targeted capture on 8 samples. Validation of the WES identified driver

mutations was done with targeted capture and the targeted capture discovery was validated with ultra-deep

sequencing.

Whole Exome Sequencing

Library Preparation

DNA samples were constructed into Illumina paired-end pre-capture libraries according to the manufacturer’s

protocol (Illumina Multiplexing_SamplePrep_Guide_1005361_D) with modifications as described in the BCM-

HGSC Illumina Barcoded Paired-End Capture Library Preparation protocol. Libraries were prepared using Beckman

robotic workstations (Biomek NXp and FXp models). The complete protocol and oligonucleotide sequences are

Page 26: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

accessible from the HGSC website (https://hgsc.bcm.edu/sites/default/files/documents/Illumina_Barcoded_Paired-

End_Capture_Library_Preparation.pdf). Briefly, 1 ug of DNA in 100ul volume was sheared into fragments of

approximately 300-400 base pairs in a Covaris plate with E210 system (Covaris, Inc. Woburn, MA) followed by

end-repair, A-tailing and ligation of the Illumina multiplexing PE adaptors. Pre-capture Ligation Mediated-PCR

(LM-PCR) was performed for 7 cycles of amplification using the Phusion High-Fidelity PCR Master Mix (NEB,

Cat. no. M0531L). Universal primer IMUX-P1.0 and a pre-capture barcoded primer IBC were used in the PCR

amplification. In total, a set of 12 such barcoded primers were used on these samples. Purification was performed

with Agencourt AMPure XP beads after enzymatic reactions. Following the final XP beads purification,

quantification and size distribution of the pre-capture LM-PCR product was determined using the LabChip GX

electrophoresis system (PerkinElmer).

Exome Capture

For exome capture, four pre-capture libraries were pooled together (~300 ng/sample, 1.2 ug/pool). These pooled

libraries were then hybridized in solution to the HGSC VCRome 2.1 design (Bainbridge et al., 2011) (42Mb,

NimbleGen) according to the manufacturer’s protocol NimbleGen SeqCap EZ Exome Library SR User’s Guide

(Version 2.2) with minor revisions. Human COT1 DNA and full-length Illumina adaptor-specific blocking

oligonucleotides were added into the hybridization to block repetitive genomic sequences and the adaptor sequences.

Post-capture LM-PCR amplification was performed using the Phusion High-Fidelity PCR Master Mix with 14

cycles of amplification. After the final AMPure XP bead purification, quantity and size of the capture library was

analyzed using the Agilent Bioanalyzer 2100 DNA Chip 7500. The efficiency of the capture was evaluated by

performing a qPCR-based quality check on the four standard NimbleGen internal controls. Successful enrichment of

the capture libraries was estimated to range from a 6 to 9 of ΔCt value over the non-enriched samples.

Sequencing

Library templates were prepared for sequencing using Illumina’s cBot cluster generation system with TruSeq PE

Cluster Generation Kits (Cat. no. PE-401-3001). Briefly, these libraries were denatured with sodium hydroxide and

diluted to 6-9 pM in hybridization buffer in order to achieve a load density of ~800K clusters/mm2. Each library

pool was loaded in a single lane of a HiSeq flow cell, and each lane was spiked with 2% phiX control library for run

quality control. The sample libraries then underwent bridge amplification to form clonal clusters, followed by

hybridization with the sequencing primer. Sequencing runs were performed in paired-end mode using the Illumina

HiSeq 2000 platform. Using the TruSeq SBS Kits (Cat. no. FC-401-3001), sequencing-by-synthesis reactions were

extended for 101 cycles from each end, with an additional 7 cycles for the index read. Sequencing runs generated

approximately 300-400 million successful reads on each lane of a flow cell, yielding 7-13 Gb per sample. With these

sequencing yields, samples achieved an average of 92% of the targeted exome bases covered to a depth of 20X or

greater.

Targeted capture Sequencing (Custom Design Capture)

Illumina Library Preparation & Sequence Capture

Deep sequencing

DNA samples were constructed into Illumina paired-end pre-capture libraries according to the manufacturer’s

protocol (Illumina Multiplexing_SamplePrep_Guide_1005361_D) with the below described modifications. Libraries

were prepared using Beckman robotic workstations (Biomek FXp model).

Briefly, 0.5 ug of DNA in 70 ul volume was sheared into fragments of approximately 200-300 base pairs in a

Covaris plate with E210 system (Covaris, Inc. Woburn, MA) followed by end-repair (NEBNext End-Repair

Module; Cat. No. E6050L), A-tailing (NEBNext® dA-Tailing Module; Cat. No. E6053L) and ligation of the

Illumina multiplexing PE adaptors with barcode sequences using the ExpressLink™ T4 DNA Ligase (A custom

product from LifeTech). In total, a set of 12 such barcodes were used on these samples. Pre-capture Ligation

Mediated-PCR (LM-PCR) was performed for 6-8 cycles using the Library Amplification Readymix containing

KAPA HiFi DNA Polymerase (Kapa Biosystems, Inc., Cat # KK2612). Universal primer IMUX-P1.0 and IMUX-

P3.0 were used in the PCR amplification. Purification was performed with Agencourt AMPure XP beads after

enzymatic reactions. Following the final XP beads purification, quantification and size distribution of the pre-

Page 27: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

capture LM-PCR product was determined using the LabChip GX electrophoresis system (PerkinElmer) and gel

analysis using AlphaView SA Version 3.4 software.

For the hybridization step, 12 such six pre-capture libraries were pooled together (~125 ng/sample for a total 1.5 ug

per pool). These pooled libraries were then hybridized in solution to the custom capture design (255.48 Kb) as

described earlier for exome library enrichment with the exception that for post-capture LM-PCR amplification,

KAPA HiFi DNA Polymerase (Kapa Biosystems, Inc., Cat # KK2612) with 12 cycles of amplification. Post

enrichment steps were similar to the exome library enrichment.

The custom capture was designed to cover the following genes and regions. (SMG are indicated in red):

Exonic regions

Gene Chr

Gene Chr

Gene Chr

APC chr5

ARID1A chr1

ATXN1 chr6

CTNNB1 chr3 ARID2 chr12

ATRX chrX

SOX9 chr17 ARID1B chr6

DAXX chr6

AXIN1 chr16 SMARCA4 chr19

MEN1 chr11

FBXW7 chr4 SMARCA2 chr9

MTOR chr1

FZD10 chr12 PBRM1 chr3

SETD2 chr3

TGFBR2 chr3 MAP2K1 chr15

TSC2 chr16

ACVR1B chr12 MAP2K4 chr17

APC2 chr19

ELF3 chr1 FBN2 chr5

CDKN1B chr12

SMAD4 chr18 FBN3 chr19

CPA1 chr7

SMAD3 chr15 KDM6A chrX

DCC chr18

KRAS chr12 MLL3 chr7

DMKN chr19

NF1 chr17 RNF43 chr17

ERBB3 chr12

PIK3CA chr3 SF3B1 chr2

ERBB4 chr2

PIK3R1 chr5 ALK chr2

MAGEA3 chrX

PTEN chr10 BRCA1 chr17

MAGEA6 chrX

ERBB2 chr17 BRCA2 chr13

ROBO3 chr11

GNAS chr20 BAP1 chr3

IRF2BP2 chr1

TP53 chr17 IDH1 chr2

CDC27 chr17

ATM chr11 IDH2 chr15

CDKN2A chr9 NRAS chr1

Selected regions

Gene Chr Region covered

ACVR2A chr2 homopolymer (chr2:148683686-148683693)

AIM2 chr1 homopolymer (chr1:159032487-159032496)

ATR chr3 homopolymer (chr3:142274740-142274749; chr3:142217556-142217563)

MLH3 chr14 homopolymer (chr14:75514604-75514612)

MSH3 chr5 homopolymer (chr5:79970915-79970922)

MSH6 chr2 homopolymer (chr2:48030640-48030647; chr2:48027196-48027202)

PMS2 chr7 homopolymer (chr7:6027157-6027164)

POLE chr12 exonuclease (chr12:133257855-133248899)

Page 28: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Ultra-deep sequencing (single molecule reconstruction)

DNA (90-500 ng) was sheared into 200-300 base pairs followed by end-repair and A-tailing similar to the above

described deep sequencing libraries but during the ligation step, six different barcodes were ligated to each sample.

These barcodes are 9-bp in length and a total of 48 such barcodes were used in this study. The post ligated DNA was

then purified with Agencourt AMPure XP beads, quantified using Qubit® 2.0 Fluorometer and 10 ng of this DNA

was amplified using KAPA HiFi DNA Polymerase (Kapa Biosystems, Inc., Cat # KK2612) for 22 cycles.

For the hybridization step, 8 such pre-capture libraries were pooled together (~188 ng/sample for a total 1.5 ug per

pool). These pooled libraries were then hybridized in solution to the custom capture design (255.48 Kb as described

earlier for exome library enrichment with the exception that for post-capture LM-PCR amplification, KAPA HiFi

DNA Polymerase (Kapa Biosystems, Inc., Cat # KK2612) with 12 cycles of amplification. Post enrichment steps

were similar to the exome library enrichment.

Sequencing Statistical analysis

Sequencing data were analyzed to assure pertinent representation. The whole exome sequencing (WES) as well as

the custom design targeted capture (CDC) met high quality standard.

Mutation analysis

The distribution of mutations identified among the 160 WES and targeted capture samples (see Sequencing design)

is listed in following table.

Data analysis

Microsatellite instability coefficient

MSI is characterized by the high frequency of INDEL mutation in homopolymer regions. We identified 9 genes with

such mutations in periampullary tumors (see Table below). To calculate a MSI coefficient, the insertion or deletion

at a site were counted in the normal and the tumor of each sample; the allelic fraction of each event was calculated

and the normal value subtracted from the tumor value. The sum from the allelic fraction of all genes represented the

homopolymer instability coefficient (see figure S1a).

Total MB Total ReadsBase 20+

cov

Avg

Coverage

Normal 9,810 96,666,088 93.4% 123

Tumor 9,645 95,026,884 92.0% 120

Normal 3,736 36,994,079 99.98 3,872

Tumor 3,677 36,410,698 99.96 3,748

Average value

CDC: 160

cases

WES: 152

cases

Patient

numberTotal

Mis

sense

Non

senseINDEL

Splice

SiteSilent

Non

coding

RNA

AMPAC 92 6767 4236 325 452 104 1647 3

MSI AMPAC 6 7388 3868 208 1356 177 1760 19

CAC 42 2379 1568 104 115 32 558 2

MSI CAC 2 5244 3044 167 593 98 1335 7

DUOAC 12 1122 710 61 39 17 295 0

MSI DUOAC 6 5974 2852 153 1445 241 1263 20

Total 160 28,874 16,278 1,018 4,000 669 6,858 51

Page 29: Ampullary Cancers Harbor ELF3 Tumor Suppressor Gene

Gene Homopolymer Frameshift location

MLH3 T 9 Chr 14: 75514604-75514612

MSH3 A 8 Chr 5: 79970915-79970922

MSH6 C 8 Chr 2: 48030640-48030647

ACVR2A A 8 Chr 2: 148683686-148683693

AIM2 A 10 Chr 1: 159032487-159032496

ATR A 10 Chr 3: 142274740-142274749

MLL3 T 9 Chr 7: 151874148-151874156

RNF43 C 7 Chr 17: 56435161-56435167

TGFBR2 A 10 Chr 3: 30691872-30691881