ars.els-cdn.com · web viewsupplementary information . elucidation of the complete biosynthetic...

81
Supplementary information Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of Panax notoginseng using a synthetic biology platform Dong Wang 1 †, Jinhe Wang 1,2 †, Yusong Shi 1 , Rongsheng Li 1,3 , Feiyu Fan 1 ,Ying Huang 1 , Weixian Li 1,4 , Ning Chen 2 , Luqi Huang 5 , Zhubo Dai 1 * and Xueli Zhang 1 * 1 Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, China. 2 College of Biotechnology, Tianjin University of Science and Technology, China. 3 School of Pharmacy and Yunnan Key Laboratory of Natural Medicine Pharmacology, Kunming Medical University, China. 4 College of Pharmaceutical Science, Yunnan University of Chinese Medicine, China. 5 State Key Laboratory Breeding Base of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, China. †These authors contribute equally to this work. *Corresponding author at: 32 West 7th Ave, Tianjin Airport Economic Park, Tianjin, 300308, China. Tel and Fax: 86-22-84861946. Email address: [email protected]. *Corresponding author at: 32 West 7th Ave, Tianjin Airport Economic Park, Tianjin, 300308, China. Tel and Fax: 86-22-84861983. Email address: [email protected]. 1

Upload: others

Post on 06-Aug-2020

22 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Supplementary information

Elucidation of the complete biosynthetic pathway of the main triterpene

glycosylation products of Panax notoginseng using a synthetic biology platform

Dong Wang1†, Jinhe Wang1,2†, Yusong Shi1, Rongsheng Li1,3, Feiyu Fan1,Ying Huang1, Weixian Li1,4, Ning Chen2,

Luqi Huang5, Zhubo Dai1* and Xueli Zhang1*

1Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese

Academy of Sciences, China. 2 College of Biotechnology, Tianjin University of Science and Technology, China. 3

School of Pharmacy and Yunnan Key Laboratory of Natural Medicine Pharmacology, Kunming Medical

University, China. 4 College of Pharmaceutical Science, Yunnan University of Chinese Medicine, China. 5 State

Key Laboratory Breeding Base of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China

Academy of Chinese Medical Sciences, China.

†These authors contribute equally to this work.

*Corresponding author at: 32 West 7th Ave, Tianjin Airport Economic Park, Tianjin, 300308, China. Tel and Fax:

86-22-84861946. Email address: [email protected].

*Corresponding author at: 32 West 7th Ave, Tianjin Airport Economic Park, Tianjin, 300308, China. Tel and Fax:

86-22-84861983. Email address: [email protected].

List:

1. Supplementary Methods: P2-P9

2. Supplementary Figures: P10-11

3. Supplementary Tables: P12-P31

4. Supplementary Sequence 1: P32-P63

5. Supplementary Sequences 2: P64-P72

6. Supplementary Sequences 3: P73-P78

1

Page 2: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Supplementary Methods

Gene mining

Total RNA from control and drug treatment samples were isolated using TRIzol reagent (Invitrogen,

Grand Island, USA) according to the manufacturer’s protocol and used for high-throughput RNA

sequencing. Two 150-bp paired-end RNA-seq libraries were commercially generated in Novogene

Biotechnology Co. Ltd (Tianjin, China) using Illumina’s HiSeq X Ten platform (Illumina, San Diego,

USA). 7.3 and 5.3 million raw reads from control and drug treatment samples were filtered using

Trimmomatic (v0.35)1 to remove adapters, NGS QC Toolkit (v2.3.3)2 program to remove low quality reads

and discard reads based upon the threshold of length. Then filtered clean reads from two samples were

pooled together and a total of 12454362 reads were used to create a de-novo transcriptome assembly using

the Trinity pipeline (v2.4.0)3 with the default parameters, including a fixed k-mer size of 25 nt and

min_kmer_cov 2 options. Only assembled transcripts longer than 600 bp were kept. In addition, a set of

published data from NCBI SRA(Sequence Read Archive)database were also re-analyzed in this work

according to the above process, including SRR5282580, SRR5286139, SRR5288612, SRR5288616, and

SRR5288620. For estimating transcript abundance, HQ reads were then mapped back to the assembled

transcriptome using Bowtie2 (v2.3.2)4, and mapping statistics (fragments per kilobase transcript length per

million fragments mapped, FPKM) were estimated by RSEM5. Annotation of the reference transcriptome

was conducted on a local server against the Swissprot database and Pfam database using Blastx with an e-

value of 10-5.

Plasmid construction

1. Construction of pRS425-LEU2-TEF1-Pn1-31

The Pn1-31 gene was amplified from the cDNA of Panax Notoginseng (using primer sets SexAI-Pn1-

31/Pn1-31-AscI). The TEF1 promoter (PTEF1) and CYC1 terminator (TCYC1) were amplified from the

genomic DNA of S. cerevisiae BY4742 (using primer sets SacII-TEF1/SEXAI-TEF1 and

AscI-CYC1t/CYC1t-SacII). These amplified DNA fragments were digested with SexAI/AscI, SexAI and

AscI, respectively. These digested DNA fragments were ligated with T4 ligase and used as templates for

2

Page 3: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

PCR amplification of the PTEF1-Pn1-31-TCYC1 cassette (using primer sets SacII-TEF1/CYC1t-SacII). The

amplified product was cloned into SacII site of plasmid pRS425, resulting in pRS425-LEU2- TEF1-Pn1-31.

2. Construction of pRS425-LEU2-TEF1-Pn3-29

The Pn3-29 gene was amplified from the cDNA of Panax Notoginseng (using primer sets PacI-Pn3-

29/Pn3-29-AscI). The TEF1 promoter (PTEF1) and CYC1 terminator (TCYC1) were amplified from the

genomic DNA of S. cerevisiae BY4742 (using primer sets SacII-TEF1/PacI-TEF1 and AscI-CYC1t/CYC1t-

SacII). These amplified DNA fragments were digested with PacI/AscI, PacI and AscI, respectively. These

digested DNA fragments were ligated with T4 ligase and used as templates for PCR amplification of the

PTEF1-Pn3-29-TCYC1 cassette (using primer sets SacII-TEF1/CYC1t-SacII). The amplified product was cloned

into SacII site of plasmid pRS425, resulting in pRS425-LEU2-TEF1-Pn3-29.

3. Construction of pRS425-LEU2-TEF1-Pn1-2, Pn1-3, Pn1-4, Pn1-6, Pn1-12, Pn1-14,

Pn1-15, Pn1-16, Pn1-17, Pn1-20, Pn1-21, Pn1-23, Pn1-24, Pn1-25 Pn1-28, Pn1-29, Pn1-30,

Pn1-31, Pn1-34, Pn1-37, Pn1-39, Pn1-45, Pn1-46, Pn1-47, Pn1-48, Pn1-49, Pn1-50, Pn1-51,

Pn1-52, Pn1-53, Pn1-54, Pn1-55, Pn1-57, Pn1-58, Pn1-61, Pn1-63, Pn1-66, Pn2-7, Pn2-9,

Pn2-19, Pn2-20, Pn2-21, Pn2-30, Pn2-31, Pn2-33, Pn2-35, Pn2-47, Pn3-1, Pn3-4, Pn3-5, Pn3-

6, Pn3-7, Pn3-8, Pn3-9, Pn3-11, Pn3-13, Pn3-15, Pn3-21, Pn3-24, Pn3-26, Pn3-27, Pn3-28,

Pn3-31, Pn3-32,Pn-3-32-i5, Pn3-33, Pn3-35, Pn3-36, Pn3-38, Pn3-39, Pn3-40, Pn3-41, Pn3-

42, Pn3-43, Pn3-44, Pn3-47, Pn3-54, Pn3-55, Pn3-60, Pn3-67, Pn3-68, Pn3-74, Pn3-75, Pn3-

76, Pn3-79, Pn3-87, Pn3-88, Pn3-89, Pn3-91, Pn3-97, Pn3-98, Pn3-103, Pn3-107

The Pn1-2, Pn1-3, Pn1-4, Pn1-6, Pn1-12, Pn1-14, Pn1-15, Pn1-16, Pn1-17, Pn1-20, Pn1-

21, Pn1-23, Pn1-24, Pn1-25 Pn1-28, Pn1-29, Pn1-30, Pn1-31, Pn1-34, Pn1-37, Pn1-39, Pn1-45,

Pn1-46, Pn1-47, Pn1-48, Pn1-49, Pn1-50, Pn1-51, Pn1-52, Pn1-53, Pn1-54, Pn1-55, Pn1-57,

Pn1-58, Pn1-61, Pn1-63, Pn1-66, Pn2-7, Pn2-9, Pn2-19, Pn2-20, Pn2-21, Pn2-30, Pn2-31, Pn2-

33, Pn2-35, Pn2-47, Pn3-1, Pn3-4, Pn3-5, Pn3-6, Pn3-7, Pn3-8, Pn3-9, Pn3-11, Pn3-13, Pn3-15,

Pn3-21, Pn3-24, Pn3-26, Pn3-27, Pn3-28, Pn3-31, Pn3-32,Pn3-32-i5, Pn3-33, Pn3-35, Pn3-36,

Pn3-38, Pn3-39, Pn3-40, Pn3-41, Pn3-42, Pn3-43, Pn3-44, Pn3-47, Pn3-54, Pn3-55, Pn3-60,

Pn3-67, Pn3-68, Pn3-74, Pn3-75, Pn3-76, Pn3-79, Pn3-87, Pn3-88, Pn3-89, Pn3-91, Pn3-97,

Pn3-98, Pn3-103, Pn3-107 genes were amplified from the cDNA of Panax Notoginseng (using

primer sets in Table S2), respectively, and cloned into plasmid pRS425-LEU2-TEF1 -Pn1-31-CYC1

3

Page 4: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

at SexAI and AscI sites, resulting in plasmids pRS425-LEU2-TEF1-Pn1-2, Pn1-3, Pn1-4, Pn1-6,

Pn1-12, Pn1-14, Pn1-15, Pn1-16, Pn1-17, Pn1-20, Pn1-21, Pn1-23, Pn1-24, Pn1-25 Pn1-28, Pn1-

29, Pn1-30, Pn1-31, Pn1-34, Pn1-37, Pn1-39, Pn1-45, Pn1-46, Pn1-47, Pn1-48, Pn1-49, Pn1-50,

Pn1-51, Pn1-52, Pn1-53, Pn1-54, Pn1-55, Pn1-57, Pn1-58, Pn1-61, Pn1-63, Pn1-66, Pn2-7, Pn2-

9, Pn2-19, Pn2-20, Pn2-21, Pn2-30, Pn2-31, Pn2-33, Pn2-35, Pn2-47, Pn3-1, Pn3-4, Pn3-5, Pn3-

6, Pn3-7, Pn3-8, Pn3-9, Pn3-11, Pn3-13, Pn3-15, Pn3-21, Pn3-24, Pn3-26, Pn3-27, Pn3-28, Pn3-

31, Pn3-32,Pn-3-32-i5, Pn3-33, Pn3-35, Pn3-36, Pn3-38, Pn3-39, Pn3-40, Pn3-41, Pn3-42, Pn3-

43, Pn3-44, Pn3-47, Pn3-54, Pn3-55, Pn3-60, Pn3-67, Pn3-68, Pn3-74, Pn3-75, Pn3-76, Pn3-79,

Pn3-87, Pn3-88, Pn3-89, Pn3-91, Pn3-97, Pn3-98, Pn3-103, Pn3-107.

4. Construction of pRS425-LEU2-TEF1-Pn2-28, Pn2-39, Pn3-19, Pn3-25, Pn3-106

The Pn2-28, Pn2-39, Pn3-19, Pn3-25, Pn3-106 genes were amplified from the cDNA of

Panax notoginseng (using primer sets in Table S2), respectively, and cloned into plasmid

pRS425-LEU2-TEF1-Pn3-29-CYC1 at PacI and AscI sites, resulting in plasmids pRS425-LEU2-

TEF1-Pn2-28, Pn2-39, Pn3-19, Pn3-25, Pn3-106.

5. Construction of pM3-Pn1-31 and pM3-SynAtUGD1

Pn1-31 and p-SynAtUGD1 were digested with SexAI and AscI, respectively, and cloned into

plasmid pM3-tHMG1 at SexAI and AscI sites, resulting in plasmids pM3-Pn1-31 and pM3-

SynAtUGD1.

6. Construction of pM13-Pn3-29

Pn3-29 was digested with PacI and AscI, and cloned into plasmid pM13-PgPPDS6 at PacI

and AscI sites, resulting in plasmid pM13-Pn3-29.

7. Construction of pM9-Pn1-31 and pM9-SynAtUXS3

Pn1-31 and p-SynAtUXS3 were digested with SexAI and AscI, respectively, and cloned

into plasmid pM9-ERG127 at SexAI and AscI sites, resulting in plasmids pM9-Pn1-31 and pM9-

SynAtUXS3.

8. Construction of pM16-Pn3-31 and pM16-SynUGTPg101

Pn3-31 and p-SynUGTPg101 were digested with SexAI and AscI, respectively, and cloned

into plasmid pM16-IDI17 at SexAI and AscI sites, resulting in plasmids pM16-Pn3-31 and pM16-

SynUGTPg101.

9. Construction of pRS425-LEU2-TEF1-SynPn3-29

4

Page 5: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

p-SynPn3-29 was digested with SexAI and AscI, respectively, and cloned into plasmid pRS425-

LEU2-TEF1-Pn1-31 at SexAI and AscI sites, resulting in plasmids pRS425-LEU2 -TEF1-

SynPn3-29.

10. Construction of p-TRP1

TRP1 was amplified from S. cerevisiae S288C genome using primer set BSP-TRP1-F/BSP-

TRP1-R, and cloned into plasmid pEASY-Blunt Simple, resulting in plasmids pTRP1.

11. Construction of pLPPgRNA

pLPPgRNA was amplified from p425-SNR52p-gRNA.CAN1.Y-SUP4t (purchased from

Addgene) using primer set 43803-up/43803-LPP-down, and digested with DpnI , resulting in

pLPPgRNA.

Strain construction

Transformation of S. cerevisiae strains was performed as described previously 6-8

1. BY-GM

Strain BY-GM was constructed by integrating the PGM1, PGM2, UGP1 and SynPgPPDS

genes into the EGH1 site of strain BY-T38. Seven DNA fragments were amplified from pM2-

PGM1 (using primer set 1-M-pEASY-PGK1-F/s-4G-1-M-ADHT-FBA1-R), pM8-PGM2 (using

primer set s-4G-4-M-ADH1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-R), pM4-UGP1 (using primer set

S-4G-3M-TDH2t-TDH3-F/3G-3-M-TPI1t-TEF1-R), pM3-SynPgPPDS (using primer set 3G-2-

M-TPI1t-TEF1-F/M-CYC1t-pEASY-R), S. cerevisiae genome (using primer sets

EGH1-UP-F/EGH1-UP-R and EGH1-DOWN-F/EGH1-DOWN-R) and p-TRP1 (using primer set

EGH1-UP-Marker-50bp-F/Only-PGK1-R) were transformed into strain BY-T3 followed by

selection on SD-HIS-TRP plate. The strain was verified by PCR analysis.

2. PPD-PPT-GM

Strain PPD-PPT-GM was constructed by integrating the SynPgDDS, SynPgPPTS, AtCPR1

and SynPgPPDS genes into the GAL7 site of strain BY-GM. Six DNA fragments amplified from

pM2-SynPgDDS (using primer set 1-M-pEASY-PGK1-F/s-4G-1-M-ADHT-FBA1-R), pM8-

SynPgPPTS (using primer set s-4G-4-M-ADH1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-R), pM4-

AtCPR1 (using primer set S-4G-3M-TDH2t-TDH3-F/3G-3-M-TPI1t-TEF1-R), pM3-

SynPgPPDS (using primer set 3G-2-M-TPI1t-TEF1-F/M-CYC1t-pEASY-R), and pGAL7-URA3

5

Page 6: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

(using primer sets X1-M-pEASY-r-t-F/GAL7-interg-2 and GAL7-interg-1/X2-M-pEASY-r-t-R)

were transformed into strain BY-GM followed by selection on SD-HIS-TRP-URA plate. The

strain was verified by PCR analysis.

3. PPD-PPT-GM-UGTs

Strain PPD-PPT-GM-UGTs were constructed by transforming plasmids pRS425-LEU2 -

TEF1-UGTs into strain PPD-PPT-GM, respectively, followed by selection on SD-HIS-TRP -

URA-LEU plate. The strains were verified by PCR analysis.

4. Rh2-GM

Strain Rh2-GM was constructed by integrating the PGM1, PGM2, UGP1 and Pn1-31 genes

into the EGH1 site of strain BY-PPT 9. Seven DNA fragments amplified from pM2-PGM1 (using

primer set 1-M-pEASY-PGK1-F/s-4G-1-M-ADHT-FBA1-R), pM8-PGM2 (using primer set s-

4G-4-M-ADH1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-R), pM4-UGP1 (using primer set S-4G-3M-

TDH2t-TDH3-F/3G-3-M-TPI1t-TEF1-R), pM3-Pn1-31 (using primer set 3G-2-M-TPI1t-TEF1-

F/M-CYC1t-pEASY-R), S. cerevisiae genome (using primer sets EGH1-UP-F/EGH1-UP-R and

EGH1-DOWN-F/EGH1-DOWN-R) and p-TRP1 (using primer set

EGH1-UP-Marker-50bp-F/Only-PGK1-R) were transformed into strain BY-PPT followed by

selection on SD-HIS-TRP-URA plate. The strain was verified by PCR analysis.

5. Rh2-GM-UGTs

Strain Rh2-GM-UGTs were constructed by transforming plasmids pRS425-LEU2 -TEF1-

UGTs into strain Rh2-GM, respectively, followed by selection on SD-HIS-TRP -URA-LEU plate.

The strains were verified by PCR analysis.

6. Rd-GM

Strain Rd-GM was constructed by integrating the PGM1, Pn1-31 Pn3-31, PGM2, UGP1 and

Pn3-29 genes into the EGH1 site of strain BY-PPT 9. Nine DNA fragments amplified from pM2-

PGM1 (using primer set 1-M-pEASY-PGK1-F/s-7G-1-M-ADHT-PDC1-R), pM-9-Pn1-31 (using

primer set s-7G-1-M-ADH1t-PDC1-F/3G-1-M-ADH2t-ENO2-R), pM16-Pn3-31 (using primer

set 3G-2-M-ADH2t-ENO2-F/6G-2-M-PDC1t-FBA1-R), pM8-PGM2 (using primer set 6G-2-M-

PDC1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-R), pM4-UGP1 (using primer set S-4G-3M-TDH2t-

TDH3-F/3G-3-M-TPI1t-TEF1-R), pM3-Pn3-29 (using primer set 3G-2-M-TPI1t-TEF1-F/M-

CYC1t-pEASY-R), S. cerevisiae genome (using primer sets EGH1-UP-F/EGH1-UP-R and

6

Page 7: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

EGH1-DOWN-F/EGH1-DOWN-R) and p-TRP1 (using primer set

EGH1-UP-Marker-50bp-F/Only-PGK1-R) were transformed into strain BY-PPT followed by

selection on SD-HIS-TRP-URA plate. The strain was verified by PCR analysis.

7. Rd-GM-UGTs

Strain Rd-GM-UGTs were constructed by transforming plasmids pRS425-LEU2 -TEF1-

UGTs into strain Rd-GM, respectively, followed by selection on SD-HIS-TRP-URA-LEU plate.

The strains were verified by PCR analysis.

8. Rg1-XM

Strain Rg1-XM was constructed by integrating the PGM1, SynAtUXS3, SynUGTPg101,

PGM2, UGP1 and SynAtUGD1 genes into the EGH1 site of strain BY-PPT 9. Nine DNA

fragments amplified from pM2-PGM1 (using primer set 1-M-pEASY-PGK1-F/s-7G-1-M-ADHT-

PDC1-R), pM-9-SynAtUXS3 (using primer set s-7G-1-M-ADH1t-PDC1-F/3G-1-M-ADH2t-

ENO2-R), pM16-SynUGTPg101 (using primer set 3G-2-M-ADH2t-ENO2-F/6G-2-M-PDC1t-

FBA1-R), pM8-PGM2 (using primer set 6G-2-M-PDC1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-R),

pM4-UGP1 (using primer set S-4G-3M-TDH2t-TDH3-F/3G-3-M-TPI1t-TEF1-R), pM3-

SynAtUGD1 (using primer set 3G-2-M-TPI1t-TEF1-F/M-CYC1t-pEASY-R), S. cerevisiae

genome (using primer sets EGH1-UP-F/EGH1-UP-R and EGH1-DOWN-F/EGH1-DOWN-R)

and p-TRP1 (using primer set EGH1-UP-Marker-50bp-F/Only-PGK1-R) were transformed into

strain BY-PPT followed by selection on SD-HIS-TRP-URA plate. The strain was verified by PCR

analysis.

9. Rg1-XM-UGTs

Strain Rg1-XM-UGTs were constructed by transforming plasmids pRS425-LEU2-TEF1-

UGTs into strain Rg1-XM, respectively, followed by selection on SD-HIS-TRP-URA-LEU plate.

The strains were verified by PCR analysis.

10. YS01-CK

Strain YS01-CK was constructed by integrating the PGM1, PGM2, UGP1 and SynPn3-29

genes into the LPP site of strain YS01 (lab collection) with CRISPR-Cas9 method. Four DNA

fragments amplified from pM2-PGM1 (using primer set LPP-50-PGK1-F/s-4G-1-M-ADHT-

FBA1-R), pM8-PGM2 (using primer set s-4G-4-M-ADH1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-

R), pM4-UGP1 (using primer set S-4G-3M-TDH2t-TDH3-F/3G-3-M-TPI1t-TEF1-R), pRS425-

7

Page 8: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

LEU2-TEF1-SynPn3-29 (using primer set 3G-2-M-TPI1t-TEF1-F/LPP-50-CYC1t-R) and a

gRNA plasmid pLPPgRNA were transformed into strain YS01 followed by selection on SD-TRP-

URA plate. The strain was verified by PCR analysis.

11. YS01-CK2

Strain YS01-CK2 was constructed by recovering URA3, HIS3, TRP1, LEU2 markers. URA3,

HIS3, TRP1 genes amplified from S. cerevisiae S288C genome using primer sets M-URA-F/M-

URA-R, M-HIS3-ORF-F/M-HIS3-ORF-R, Trp-up-F/Trp-down-R, together with pRS425-LEU2-

TEF1-SynPn3-29, were transformed into strain YS01-CK (removing Cas9 and gRNA plasmids)

followed by selection on SD-TRP-URA-HIS-LEU plate. The strain was verified by PCR analysis.

Chemical analysis

LC-MS analysis was done according to the literature6, 8, with the following adaptations, for

the determination of CK, F1, Rh2, Rg3, F2, Rd, Rb1, Rg1 and R1. Acetone and methanol (1:1)

extracts (20 ul) were analysed by LC-MS using an Agilent 1200 HPLC system coupled to a

Bruker-micrOTOF-II with an electrospray ionisation (ESI) interface. Data acquisition and

processing were performed with the MicrOTOF control version 3.0/Data Analysis Version 4.0

software.

For chromatographic separation of CK, F1, Rh2, Rg3, F2, Rd, Rb1, a Waters Symmetry

C18® column (250 mm×4.6mm, 5 μm) was used. The mobile phase consisted of 0.1% formic

acid and 10% methanol in water (A) and 0.1% formic acid in acetonitrile (B). The column

was equilibrated with 80% A, then the sample was injected, the mobile phase A reached 5%

within 65 minutes, keeped 5% for 5 minutes, later returned to 80% within 5 minutes and hold

80% for 5 minutes. The solvent flow rate was 0.8 mL/min and the column temperature was

set at 30 °C. Optimised MS operating conditions were as follows: all spectra were obtained in the

positive ion mode over an m/z range of 100-1200; dry gas flow, 6.0 L/min; dry temperature, 180

°C; nebuliser pressure, 1 bar; and probe voltage, -4.5 kV. Crystallized CK, F1, Rh2, Rg3, F2, Rd

and Rb1 (all purchased from shyuanye, China) were used as standards for analysis.

For chromatographic separation of Rg1 and R1, an Ultimate® PG-C18 column (250

mm×4.6mm, 5 μm) was used. The mobile phase consisted of 0.1% formic acid in acetonitrile

(A) and 0.1% formic acid in water (B). The column was equilibrated with 19% A, then the

sample was injected, keeping 19% A for 12 minutes; the mobile phase A reached 26% within 20

minutes, and increased to 90% in 2 minutes, keeping 90% for 10 minutes, later returned to 19%

within 2 minutes and hold 19% for 4 minutes. The solvent flow rate was 1 mL/min and the

column temperature was set at 25 °C. Optimised MS operating conditions were as follows: all

spectra were obtained in the positive ion mode over an m/z range of 100-1200; dry gas flow, 6.0

8

Page 9: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

L/min; dry temperature, 180 °C; nebuliser pressure, 1 bar; and probe voltage, -4.5 kV.

Crystallized Rg1 and R1 (all purchased from shyuanye, China) were used as standards for

analysis.

Supplementary Methods Reference

1. Bolger AM, Lohse M, & Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina

sequence data. Bioinformatics 30(15):2114-2120.

2. Patel RK & Jain M (2012) NGS QC Toolkit: a toolkit for quality control of next

generation sequencing data. PLoS One 7(2):e30619.

3. Grabherr MG, et al. (2011) Full-length transcriptome assembly from RNA-Seq data

without a reference genome. Nat Biotechnol 29(7):644-652.

4. Langmead B & Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat

Methods 9(4):357-359.

5. Li B & Dewey CN (2011) RSEM: accurate transcript quantification from RNA-Seq data

with or without a reference genome. BMC Bioinformatics 12:323.

6. Dai Z, et al. (2013) Metabolic engineering of Saccharomyces cerevisiae for production of

ginsenosides. Metab Eng 20:146-156.

7. Lin, T. et al. (2016 )Construction of cell factories for production of lupeol in Saccharomyces

cerevisiae. China Journal of Chinese Materia Medica,41 (6):1008-1015.

8. Dai Z, et al. (2014) Producing aglycons of ginsenosides in bakers' yeast. Sci Rep 4:3698.

9. Wang, J. et al. (2019 ) Optimization of UDP-glucose supply module and production of

ginsenoside F1 in Saccharomyces cerevisiae. China Journal of Chinese Materia

Medica,44 (21):4596-4604.

9

Page 10: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Figure S1 : A synthetic biology platform was used to identify UGTs enzyme

catalyzing of triterpenoids.

A: High throughput strategy of A:

B: High throughput strategy of B:

10

Page 11: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Figure S2:Comparison of production of protopanaxadiol (PPD) using the P. ginseng CPR and

Vitis vinifera CPR.

We transformed a codon-optimized version of the dammarenediol-II synthase (SynPgDDS), protopanaxadiol synthase (SynPgPPDS) and cytochrome P450 reductase (PgCPR, KF486915) genes from P. ginseng into the S. cerevisiae strain BY-T3, resulting in strain PgCPR. For the constructon of strain VvCPR, the dammarenediol-II synthase (SynPgDDS) and protopanaxadiol synthase (SynPgPPDS) genes from P. ginseng, as well as the Vitis vinifera CPR (VvCPR, XP_002270732) gene were also transformed into the S. cerevisiae strain BY-T3, resulting in strain VvCPR. This strains was cultivated in SD-URA-TRP medium with 20g/L glucose for 5 days. Three repeats were performed and the error bars represent standard deviation.

11

Page 12: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Supplementary Table

Table S1. The therapeutic potential of ginsenosides. Ginsenosides Activities/applications ReferenceF1 Skin whitening

AntioxidantsHan, J. et al.1

Lee, E.H. et al.2

Rg1 AntioxidantsNeuroprotection

Liu, Z.Q. et al.3

Liu, Z.J. et al.4

R1 Antioxidants Liu, Z.Q. et al.3

CK Antidiabetic effectsInhibition of various cancer cell growth

Yoon, S.H. et al.5

Musende, A.G. et al.6

Rh2 Antidiabetic effectsProoxidativeInhibition of various cancer cell growth

Lee, W.K. et al.7

Liu, Z.Q. et al. 3

Musende, A.G. et al.6

Rg3 ProoxidativeInhibition of various cancer cell growthNeuroprotection

Liu, Z.Q. et al. 3

Musende, A.G. et al.6

Tian, J. et al.8

Rd AntioxidantsNeuroprotection

Liu, Z.Q. et al.3

Ye, R.D. et al.9

Rb1 Antidiabetic effectsAntioxidantsNeuroprotection

Shang, W. et al.10

Liu, Z.Q. et al. 3

Chen, X.C. et al.11

Re Antidiabetic effectsAntioxidantsNeuroprotection

Xie, J.T. et al.12

Liu, Z.Q. et al.3

Chen, L.M. et al.13

Rb2 Antidiabetic effectsAntioxidants

Yokozawa, T. et al.14

Chae, S. et al.15

Rh1 Inhibition of various cancer cell growth Lyu X, et al.16

F2 Inhibition of various cancer cell growth Shin JY. et al.17

Supplementary Table S1 reference:1. Han, J.; Lee, E.; Kim, E.; Yeom, M.H.; Kwon, O.; Yoon, T.H.; Lee, T.R.; Kim, K. Role of epidermal γδ T-

cell-derived interleukin 13 in the skin-whitening effect of Ginsenoside F1. Exp. Dermatol. 2014, 23, 860–862.

2. Lee, E.H.; Cho, S.Y.; Kim, S.J.; Shin, E.S.; Chang, H.K.; Kim, D.H.; Yeom, M.H.; Woe, K.S.; Lee, J.; Sim,

Y.C.; et al. Ginsenoside F1 protects human HaCaT keratinocytes from ultraviolet-B-induced apoptosis by

maintaining constant levels of Bcl-2. J. Investig. Dermatol. 2003, 121, 607–613.

3. Liu, Z.Q., Luo, X.Y., Liu, G.Z., Chen, Y.P., Wang, Z.C., Sun, Y.X., 2003. In vitro study of the relationship

between the structure of ginsenoside and its antioxidative or prooxidative activity in free radical induced

hemolysis of human erythrocytes. J. Agric. Food Chem. 51, 2555–2558.

4. Liu, Z.J., Zhao, M., Zhang, Y., Xue, J.F., Chen, N.H., 2010. Ginsenoside Rg1 promotes glutamate release

via a calcium/calmodulin-depe ndent protein kinase II-dependent signaling pathway. Brain Res. 1333, 1–8.

5. Yoon, S.H., Han, E.J., Sung, J.H., Chung, S.H., 2007. Anti-diabetic effects of compound K versus

metformin versus compound K-metformin combination therapy in diabetic db/db mice. Biol. Pharm. Bull. 30,

12

Page 13: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

2196–2200.

6. Musende, A.G., Eberding, A., Wood, C., Adomat, H., Fazli, L., Hurtado-Coll, A., Jia, W., Bally, M.B.,

Guns, E.T., 2009. Pre-clinical evaluation of Rh2 in PC-3 human xenograft model for prostate cancer in vivo:

formulation, pharmacokinetics, biodistribution and efficacy. Cancer Chemother. Pharmacol. 64, 1085–1095.

7. Lee, W.K., Kao, S.T., Liu, I.M., Cheng, J.T., 2006. Increase of insulin secretion by ginsenoside Rh2 to

lower plasma glucose in Wistar rats. Clin. Exp. Pharmacol. Physiol. 33, 27–32.

8. Tian, J., Fu, F., Geng, M., Jiang, M., Yang, J., Jiang, W., Wang, C., Liu, K., 2005. Neuroprotective effect of

20(S)-ginsenoside Rg3 on cerebral ischemia in rats. Neurosci. Lett. 374, 92–97

9. Ye, R.D., Li, N.L., Han, J.L., Kong, X.W., Cao, R., Rao, Z.R., Zhao, G., 2009. Neuroprotective effects of

ginsenoside Rd against oxygen–glucose deprivation in cultured hippocampal neurons. Neurosci. Res. 64, 306–

310.

10. Shang, W., Yang, Y., Jiang, B., Jin, H., Zhou, L., Liu, S., Chen, M., 2007. Ginsenoside Rb1 promotes

adipogenesis in 3T3-L1 cells by enhancing PPARgamma2 and C/EBPalpha gene expression. Life Sci. 80, 618–

625.

11. Chen, X.C., Huang, T.W., Zhang, J., Song, J.Q., Chen, L.M., Zhu, Y.G., 2008d. Involvement of calpain and

p25 of CDK5 pathway in ginsenoside Rb1’s attenuation of beta-amyloid peptide (25-35)-induced tau

hyperphosphorylation in cortical neurons. Brain Res. 1200, 99–106.

12. Xie, J.T., Mehendale, S.R., Li, X., Quigg, R., Wang, X., Wang, C.Z., Wu, J.A., Aung, H.H., Rue, P.A., Bell,

G.I., Yuan, C.S., 2005b. Anti-diabetic effect of ginsenoside Re in ob/ob mice. Biochim. Biophys. Acta 1740, 319–

325.

13. Chen, L.M., Zhou, X.M., Cao, Y.L., Hu, W.X., 2008. Neuroprotection of ginsenoside Re in cerebral

ischemia-reperfusion injury in rats. J. Asian Nat. Prod. Res. 10, 439–445.

14. Yokozawa, T., Yasui, T., Oura, H., 1993. Stimulation of RNA-polymerase activity by ginsenoside-Rb2 in

diabetic rats. Phytother. Res. 7, 240–243.

15. Chae, S., Kang, K.A., Youn, U., Park, J.S., Hyun, J.W., 2010. A comparative study of the potential

antioxidant activities of ginsenosides. J. Food Biochem. 34, 31–43.

16. Lyu X, Xu X, Song A, Guo J, Zhang Y, Zhang Y. 2019.Ginsenoside Rh1 inhibits colorectal cancer cell

migration and invasion in vitro and tumor growth in vivo. Oncol Lett. 18(4):4160-4166.

17. Shin JY, Lee JM, Shin HS, Park SY, Yang JE, Cho SK, Yi TH. 2012. Anti-cancer effect of ginsenoside

f2 against glioblastoma multiforme in xenograft model in sd rats. J Ginseng Res. 36(1):86-92.

13

Page 14: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Table S2. Primers used in this work

Primer name Target Gene SEQUENCE(5' TO 3')

Pac-pPDC1 pPDC1 GCGTTAATTAACATGCGACTGGGTGAGCATATGTTC

pPDC1-SexA GCGACCWGGTTTTGATTGATTTGACTGTGTTATTT

ASC-tADH2 tADH2 GCGGCGCGCCGCGGATCTCTTATGTCTTTACGATT

tADH2-Pme1 GCGGTTTAAACTAGAATTATATAACTTGATGAGATG

Pac-pENO2 pENO2 GCGTTAATTAAAATCCTACTCTTGCCGTTGCCATCC

pENO2-SexA GCGACCWGGTTATTATTGTATGTTATAGTATTAGTTG

Sac11-TEF1 pTEF1 GCGCCGCGGAGTGATCCCCCACACACCATAGCTT

TEF1-SexA1 TGGCGACCWGGTTTTGTAATTAAAACTTAGATTAGA

TEF1-Pac1pTEF1

(Pac1)TGGCGTTAATTAATTTGTAATTAAAACTTAGATTAGA

Asc1-CYC1t CYC1t GCGGCGCGCCCCGCTGATCCTAGAGGGCCGCATCA

CYC1t-Sac11 GCGCCGCGGGCGCGTTGGCCGATTCATTAATGCA

SexA1-Pn1-2 Pn1-2 GCGACCTGGTATGTTTACAACAAAAATGCCAACAC

Pn1-2-Asc1 GCGGCGCGCCTTATTGTTTAGATTTCACACCCATT

SexA1-Pn1-3 Pn1-3 GCGACCTGGTATGGATTCTCCCTTGTCATCCTCAC

Pn1-3-Asc1 GCGGCGCGCCTTATGCTTTCTCCCTTTTCTCTCTG

SexA1-Pn1-4 Pn1-4 GCGACCTGGTATGGATTCTCCCTTGTCATCCTCAC

Pn1-4-Asc1 GCGGCGCGCCTTATGCTTTCTCCCTTTTCTCTCTG

SexA1-Pn1-6 Pn1-6 GCGACCTGGTATGGAAAGAAAAGCAATGGAATATTC

Pn1-6-Asc1 GCGGCGCGCCTTACTTGATAAAATCAACTAAAGCATC

SexA1-Pn1-12 Pn1-12 GCGACCTGGTATGGAGCAAAATCAGAAAATGGCTAG

Pn1-12-Asc1 GCGGCGCGCCTTATCTCCTAGAGAAAGATAAAATGC

SexA1-Pn1-14 Pn1-14 GCGACCTGGTATGGCCGCCAATGACAAATTACAC

Pn1-14-Asc1 GCGGCGCGCCTCATTTTGTGAGGATTTTATGACTTTC

SexA1-Pn1-15 Pn1-15 GCGACCTGGTATGGAGGTTGTGGTTGATGAAG

14

Page 15: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Pn1-15-Asc1 GCGGCGCGCCCTATTTTTTCCACGACTCAACAACAC

SexA1-Pn1-16 Pn1-16 GCGACCTGGTATGGACTCACAAGTCTCATCACGTCC

Pn1-16-Asc1 GCGGCGCGCCTTACTGATCTGATCGTTCCTCTCTC

SexA1-Pn1-17 Pn1-17 GCGACCTGGTATGAAACAACTAAGAGAGTTGCTAAC

Pn1-17-Asc1 GCGGCGCGCCTTATTTCATCCAATCAGTAATATTGC

SexA1-Pn1-20 Pn1-18 GCGACCTGGTATGGCAGTAGCCGGCGCTGGTGCAC

Pn1-20-Asc1 GCGGCGCGCCTCATTTTATTATATGTTTTTTGGGG

SexA1-Pn1-21 Pn1-21 GCGACCTGGTATGAATGCACCAAGTTTTCACATAGC

Pn1-21-Asc1 GCGGCGCGCCTCATCCCAATAACTGTTGCAGTTTC

SexA1-Pn1-23 Pn1-23 GCGACCTGGTATGAATGCACCAAGTTTTCACATAG

Pn1-23-Asc1 GCGGCGCGCCTCATCCCAATAACTGTTGCAGTTTC

SexA1-Pn1-24 Pn1-24 GCGACCTGGTATGGAGGTTGTGGTTGATGAAGCC

Pn1-24-Asc1 GCGGCGCGCCCTATTTTTTCCACGACTCAACAAC

SexA1-Pn1-25 Pn1-25 GCGACCTGGTATGGATTCACAATCAAACCAACTTCAC

Pn1-25-Asc1 GCGGCGCGCCTTAATTAGAGCATTTACGTTCGCTTGG

SexA1-Pn1-28 Pn1-28 GCGACCTGGTATGGGATTAAATTCAGCTAATAAGG

Pn1-28-Asc1 GCGGCGCGCCTCACATATTGCAGTGGAGAGCTTCC

SexA1-Pn1-29 Pn1-29 GCGACCTGGTATGGGTAATCCAAAACCTCATGTAG

Pn1-29-Asc1 GCGGCGCGCCTCAACCTTCATAATGAAGAGCCTCC

SexA1-Pn1-30 Pn1-30 GCGACCTGGTATGGCGAAGAATCAGCAGAAAGTTC

Pn1-30-Asc1 GCGGCGCGCCCTATCTGCCAAGAATTTTGATTTTG

SexA1-Pn1-31 Pn1-31 GCGACCTGGTATGGACAGAGAAATGTTGAGCAAAAC

Pn1-31-Asc1 GCGGCGCGCCTCAGGAGGACACAAGCTTTGAAATG

SexA1-Pn1-34 Pn1-34 GCGACCTGGTATGTCCGGTTCCTCCGGTAAAGGCG

Pn1-34-Asc1 GCGGCGCGCCCTATCGATCATTTTTGAGTTGAAGTTG

SexA1-Pn1-37 Pn1-37 GCGACCTGGTATGGAGAAGAATCACTGTGACAACC

Pn1-37-Asc1 GCGGCGCGCCTTATCTACGAATGTGAGCAATGAAGTC

SexA1-Pn1-39 Pn1-39 GCGACCTGGTATGGAAAATAACCACGTTCTTCATG

Pn1-39-Asc1 GCGGCGCGCCTTAACTCATCAATTGGGATTTCCTCA

SexA1-Pn1-45 Pn1-45 GCGACCTGGTATGGAGAAGCAAACATTGGCAAAC

Pn1-45-Asc1 GCGGCGCGCCTCAAGTGGTGGTGGATAAAACCTTTG

15

Page 16: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

SexA1-Pn1-47 Pn1-47 GCGACCTGGTATGGCAGAAACTCAGAAGCCACATG

Pn1-47-Asc1 GCGGCGCGCCTTACTTCATGGGGGAGAGAATCAC

SexA1-Pn1-48 Pn1-48 GCGACCTGGTATGTCAGATTTCGGTGACCCAAATTG

Pn1-48-Asc1 GCGGCGCGCCTCAAGTGACCCGGGTATTCTTCCAC

SexA1-Pn1-49 Pn1-49 GCGACCTGGTATGGGTACAAGGCAATCCCAGCAAA

Pn1-49-Asc1 GCGGCGCGCCTTAATTACCCATGACATCCTCAATA

SexA1-Pn1-50 Pn1-50 GCGACCTGGTATGGGAGCAGAGCTCATCTTCATCC

Pn1-50-Asc1 GCGGCGCGCCCTAACCGACCTGATTATATTTTGCG

SexA1-Pn1-51 Pn1-51 GCGACCTGGTATGGGAGCAGAGCTCATCTTCATCC

Pn1-51-Asc1GCGGCGCGCCTCATTGGCAGCAGTGAATCGTCACAA

C

SexA1-Pn1-52 Pn1-52 GCGACCTGGTATGACAACAGTTGAGGAAGTATTTGTG

Pn1-52-Asc1 GCGGCGCGCCTCATAATAACACCCCCCTAAAGGC

SexA1-Pn1-53 Pn1-53 GCGACCTGGTATGACAACAGTTGAGGAAGTATTTG

Pn1-53-Asc1 GCGGCGCGCCTCATAATAACACCCCCCTAAAGGC

SexA1-Pn1-54 Pn1-54 GCGACCTGGTATGGAGATTAACCGGCATAGGAAGC

Pn1-54-Asc1 GCGGCGCGCCTTATTTTGTATGATTTTCAAGATAC

SexA1-Pn1-55 Pn1-55 GCGACCTGGTATGGAACATGAAGATCAAAAGCC

Pn1-55-Asc1 GCGGCGCGCCTTAATTATTTGCCGTTGGGCAAAC

SexA1-Pn1-57 Pn1-57 GCGACCTGGTATGGAGTCTCCGAATAGACCTCATG

Pn1-57-Asc1 GCGGCGCGCCTTAAGGTTTGCTAATATTTTTTCCAAC

SexA1-Pn1-58 Pn1-58 GCGACCTGGTATGGCAACTGAAGACCCTAAACTCC

Pn1-58-Asc1 GCGGCGCGCCTTATCCATTTTTTGATTTCTCAAAAG

SexA1-Pn1-61 Pn1-61 GCGACCTGGTATGAATACCCTACTCTTGTTTCATA

Pn1-61-Asc1 GCGGCGCGCCTTACACTAAATCCTTGATCTTACTC

SexA1-Pn1-63 Pn1-63 GCGACCTGGTATGGAGAAATTGGAACTGATTTTCAT

Pn1-63-Asc1 GCGGCGCGCCTCACACTTTCATATCCTCAATTAAG

SexA1-Pn1-66 Pn1-66 CGACCTGGTATGGATTATCAAGAAAAAATTCTCCCTC

Pn1-66-Asc1 GCGGCGCGCCTTAGGTATGTGGTGTTGACAAGGTC

SexA1-Pn2-7 Pn2-7 GCGACCTGGTATGGCAACCCAAAAATGTTTCCGTG

Pn2-7-Asc1 GCGGCGCGCCTCATACAACAATCCGTTTTTTCTTG

16

Page 17: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

SexA1-Pn2-9 Pn2-9 GCGACCTGGTATGGATACCAACTCAAATAGCTTC

Pn2-9-Asc1 GCGGCGCGCCTTAATTAATATCTCTATACTTGCCC

SexA1-Pn2-19 Pn2-19 GCGACCTGGTATGGCCACTGAGGAAGCTAGTTTG

Pn2-19 -Asc1 GCGGCGCGCCTTACTCATCCAAATTCTTGTTGCATA

SexA1-Pn2-20 Pn2-20 GCGACCTGGTATGAGAGTACTTCCAATTCTTCTAC

Pn2-20-Asc1 GCGGCGCGCCCTAAGTACATGCATGCTTAAGTTCA

SexA1-Pn2-21 Pn2-21 GCGACCTGGTATGGATACCGAAAAGCTTCATGTAG

Pn2-21-Asc1 GCGGCGCGCCTTAAATTAATTTTTTTAACCTCCTC

Pac1-Pn2-28 Pn2-28 GCGTTAATTAAATGGAAAATCAAGGACTACGA

Pn2-28-Asc1 GCGGCGCGCCTCACTTTGACAATTGGAATGACGAG

SexA1-Pn2-30 Pn2-30 GCGACCTGGTATGTACGAGTCGACGACAAAGATG

Pn2-30-Asc1 GCGGCGCGCCCTATAACACATCTCTCACTAACTTGTC

SexA1-Pn2-31 Pn2-31 GCGACCTGGTATGTCAAAAATGGAACAAACACCC

Pn2-31-Asc1 GCGGCGCGCCTCATGAGGCTATAGGTGCTATTTTA

SexA1-Pn2-33 Pn2-33 GCGACCTGGTATGGAAAATCGACAAATTCTCCCTC

Pn2-33-Asc1 GCGGCGCGCCTTACGCCTGTGATGTTGACAAG

SexA1-Pn2-35 Pn2-35 GCGACCTGGTATGGCTCAACAAACAATCCCAC

Pn2-35-Asc1 GCGGCGCGCCCTAGGGTGTGATGCCACCCAAAG

SexA1-Pn2-47 Pn2-47 GCGACCTGGTATGGGAGCGGAGCTCATCTTCATC

Pn2-47-Asc1 GCGGCGCGCCTTAATTACCCATGACATCCTCAATA

SexA1-Pn3-1 Pn3-1 GCGACCTGGTATGAAGAACCCAGAGCTCATATTC

Pn3-1-Asc1 GCGGCGCGCCTCAAATAATTTCCTCCATTATACTTC

SexA1-Pn3-4 Pn3-4 GCGACCTGGTATGCTCCCCTGGTTAGCTTTTGGCC

Pn3-4-Asc1 GCGGCGCGCCCTATTTTTCAGAATTTTTTCCGGTG

SexA1-Pn3-5 Pn3-5 GCGACCTGGTATGGAGAAGCAAGGAGAAAAGAA

Pn3-5-Asc1 GCGGCGCGCCTTATAATGACATAATATAACTAACC

SexA1-Pn3-6 Pn3-6 GCGACCTGGTATGGCTGAACAAACAATCCCACCTC

Pn3-6-Asc1 GCGGCGCGCCCTAGGCTCTGATGCCACCCACAGTC

SexA1-Pn3-7 Pn3-7 GCGACCTGGTATGAAGAGGGCCAAGATAATAATGG

Pn3-7-Asc1 GCGGCGCGCCTCACGAATTAAGATTCTGTTGAGATAC

SexA1-Pn3-8 Pn3-8 GCGACCTGGTATGGAAATGAAGAGCTATGGAGCTC

17

Page 18: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Pn3-8-Asc1 GCGGCGCGCCCTATATATGTGAAGAGCTTCTTGTAC

SexA1-Pn3-9 Pn3-9 GCGACCTGGTATGGCCGCCGGCAATGATAACTTAC

Pn3-9-Asc1 GCGGCGCGCCTTATTTTGTATGATTTTCAAGATAC

SexA1-Pn3-11 Pn3-11 GCGACCTGGTATGGGGTCCTTCGGAAAAGCAGAA

Pn3-11-Asc1 GCGGCGCGCCCTACTCTGAAATATTATTAGTCAAG

SexA1-Pn3-13 Pn3-13 GCGACCTGGTATGGATATCGAGAAAGGTAGAATC

Pn3-13-Asc1 GCGGCGCGCCTTAATATTGTGCGTCTTTCTTCATC

SexA1-Pn3-15 Pn3-15 GCGACCTGGTATGGGAGCGGAGCTCATCTTCATCC

Pn3-15-Asc1 GCGGCGCGCCTTAATTACCCATGACATCCTCAATAA

Pac11-Pn3-19 Pn3-19 GCGTTAATTAAATGAGGAATATGAAGACTGCAGAAC

Pn3-19-Asc1 GCGGCGCGCCTTACATAATAATTTCCTCAATTAATC

SexA1-Pn3-21 Pn3-21 GCGACCTGGTATGTTCCCTTGGCTTGCCTTTGGTC

Pn3-21-Asc1 GCGGCGCGCCTCATATTAATTCCCTCTTGATCTTTC

SexA1-Pn3-24 Pn3-24 GCGACCTGGTATGGGTGGTCAGCTTCACGTCTTC

Pn3-24-Asc1 GCGGCGCGCCTCAAGAATGATAAAAACTCAATTC

Pac1-Pn3-25 Pn3-25 GCGTTAATTAAATGTCCCAAAGTCCAGCAATGTCC

Pn3-25-Asc1 GCGGCGCGCCTTATAACTCTTCTTGATTGTTCTTC

SexA1-Pn3-26 Pn3-26 GCGACCTGGTATGTTCCCTTGGCTTGCCTTTGGTC

Pn3-26-Asc1 GCGGCGCGCCTCATATTAATTCCCTCTTGATCTTTC

SexA1-Pn3-27 Pn3-27 GCGACCTGGTATGGAAAGAAAAGCAATGGAATAT

Pn3-27-Asc1 GCGGCGCGCCTTACTTGATAAAATCAACTAAAGC

SexA1-Pn3-28 Pn3-28 GCGACCTGGTATGGGCTCCCTTCCTAAAGTAAC

Pn3-28-Asc1 GCGGCGCGCCCTACTTTGCTAACAACACCTGATCC

Pac1-Pn3-29 Pn3-29 GCGTTAATTAAATGAAGTCAGAATTGATATTCTTGC

Pn3-29-Asc1 GCGGCGCGCCTTACATAATTTCCTCAAATAGCTTC

SexA1-Pn3-31 Pn3-31 GCGACCTGGTATGGATAACCAAAAAGGTAGAATC

Pn3-31-Asc1 GCGGCGCGCCCTATTGTGCATCTTTCTTCTTCTTAC

SexA1-Pn3-32 Pn3-32 GCGACCTGGTATGGATATCGAGAAAGGTAGAATC

Pn3-32-Asc1 GCGGCGCGCCTTAATATTGTGCGTCTTTCTTCATC

SexA1 Pn3-32-i5 Pn3-32-i5 GCGACCTGGTATGGATAACCAAGAAGCTAGAATCAG

Pn3-32-i5-Asc1 GCGGCGCGCCCTATTGTGCATCTTTCTTCTTCTTAC

18

Page 19: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

SexA1-Pn3-33 Pn3-33 GCGACCTGGTATGGGTACAAATCCAGCTAAAAAG

Pn3-33-Asc1 GCGGCGCGCCTCATCCTGCAAGGAGAACTTCCTTA

SexA1-Pn3-35 Pn3-35 GCGACCTGGTATGAAAAGCAAGAATCACTTCCTC

Pn3-35-Asc1 GCGGCGCGCCTTACATTTCCAAAAACTGTTTTAC

SexA1-Pn3-36 Pn3-36 GCGACCTGGTATGGCATCCGATCTCTATGAGCAC

Pn3-36-Asc1 GCGGCGCGCCTTAAATTAACTTCTGCATCCTAGTG

SexA1-Pn3-38 Pn3-38 GCGACCTGGTATGAATCCGGTCGCCGCCGGTGAA

Pn3-38-Asc1 GCGGCGCGCCTCAGCATTGTAAAATATTCCTTGTA

SexA1-Pn3-39 Pn3-39 GCGACCTGGTATGGAAAACACACCCCATATAGCC

Pn3-39-Asc1 GCGGCGCGCCTTAGTTTTTATTCCACTTGCCAGCC

SexA1-Pn3-40 Pn3-40 GCGACCTGGTATGGATTCACCGTCAGACCAGCTTC

Pn3-40-Asc1 GCGGCGCGCCTTAGCTTTTTTTTTTCCTTTGTTTC

SexA1-Pn3-41 Pn3-41 GCGACCTGGTATGGAAGAGAGAAAAGAAAAGATG

Pn3-41-Asc1 GCGGCGCGCCTCAAGTATTCATTCTGCATTTCTGC

SexA1-Pn3-42 Pn3-42 GCGACCTGGTATGGAGAAAAAGGACTCAACTCG

Pn3-42-Asc1 GCGGCGCGCCTCATCTCTCCACACCCATCAATTTA

SexA1-Pn3-43 Pn3-43 GCGACCTGGTATGGGGAGTTCAGCGGAGACACATG

Pn3-43-Asc1 GCGGCGCGCCCTAAGTTGTAATCACTTCTAGTAGG

SexA1-Pn3-44 Pn3-44 GCGACCTGGTATGGAAGAATTTCCAATTCCAGTAG

Pn3-44-Asc1 GCGGCGCGCCTTAATCATTCCTCGGCTGGAGAGCTTC

SexA1-Pn3-47 Pn3-47 GCGACCTGGTATGATGGGCCACGCAACACAGGCC

Pn3-47-Asc1 GCGGCGCGCCTCACATTTTCAGATCCTCAATTAAG

SexA1-Pn3-54 Pn3-54 GCGACCTGGTATGAGCTCTGAAGGTCTGGTTCATG

Pn3-54-Asc1 GCGGCGCGCCTCATTGAGTAATATGCATGGTTCTC

SexA1-Pn3-55 Pn3-55 GCGACCTGGTATGGCTTCGGAAGGAAAGAAGAAAG

Pn3-55-Asc1 GCGGCGCGCCTTAAGAATATCCTTTGATCTCATCA

SexA1-Pn3-60 Pn3-60 GCGACCTGGTATGGATTATGTTCCGGCAGCTAATC

Pn3-60-Asc1 GCGGCGCGCCTTATGATTTGCGTAGTAACACCTCTTCC

SexA1-Pn3-67 Pn3-67 GCGACCTGGTATGGATCAACATACACTCCCACCTC

Pn3-67-Asc1 GCGGCGCGCCTTATTTATTCATTGATGGTGGAGG

SexA1-Pn3-68 Pn3-68 GCGACCTGGTATGGAAAGAAAAGCAATGGAATAT

19

Page 20: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Pn3-68-Asc1 GCGGCGCGCCTTACTTGATAAAATCAACTAAAGCA

SexA1-Pn3-74 Pn3-74 GCGACCTGGTATGGAAGAGAGAAAAGAAAAGATGAT

Pn3-74-Asc1 GCGGCGCGCCCTATTCATCCTCCCCTCTTACCTTG

SexA1-Pn3-75 Pn3-75 GCGACCTGGTATGGATACCAATGAAAAAACCAGAA

Pn3-75-Asc1 GCGGCGCGCCTTAGGACCATTCCTTCCCCTGGC

SexA1-Pn3-76 Pn3-76 GCGACCTGGTATGGAGAATGAGAAAACTTATAAAGC

Pn3-76-Asc1 GCGGCGCGCCTTAGAGTGCCAAAATCCGAGAAATA

SexA1-Pn3-79 Pn3-79GCGACCTGGTATGGAAAATCGACAAATTCTCCCTCCT

C

Pn3-79-Asc1GCGGCGCGCCTTACGCCTGTGATGTTGACAAGTTTAT

C

SexA1-Pn3-87 Pn3-87 GCGACCTGGTATGGCTTCCCAAATTGAAAAGCTC

Pn3-87-Asc1 GCGGCGCGCCTCATTGACCAGTTATTTGTTCAATG

SexA1-Pn3-88 Pn3-88 GCGACCTGGTATGGCTATTCTCCAAACCCAAGAC

Pn3-88-Asc1 GCGGCGCGCCTCATTTCAATTTCAGTTGTTCCACGAC

SexA1-Pn3-89 Pn3-89 GCGACCTGGTATGTTACCCATTCTTGACTTGACG

Pn3-89-Asc1 GCGGCGCGCCTCAATCATTTTTGAGTTGAAGTTGAGC

SexA1-Pn3-91 Pn3-91 GCGACCTGGTATGAAGAAATCTGCAGAGGTTGTAT

Pn3-91-Asc1 GCGGCGCGCCTCAAATATCTAACAAAAGTTGTTG

SexA1-Pn3-97 Pn3-97 GCGACCTGGTATGGAGCAAAATCAGAAAATGGCTAG

Pn3-97-Asc1 GCGGCGCGCCTTATCTCCTAGAGAAAGATAAAATGC

SexA1-Pn3-98 Pn3-98 GCGACCTGGTATGGGTGGTCAGCTTCACGTCTTC

Pn3-98-Asc1 GCGGCGCGCCATGGGTGGTCAGCTTCACGTCTTC

SexA1-Pn3-103 Pn3-103 GCGACCTGGTATGGATTCTCCCTTGTCATCCTCAC

Pn3-103-Asc1 GCGGCGCGCCTTATGCTTTCTCCCTTTTCTCTCTGAG

Pac1-Pn3-106 Pn3-106 GCGTTAATTAAATGCTGCTGCTGCTACTACTACTACT

Pn3-106-Asc1 GCGGCGCGCCTTATCTAGATATGTGTACAATAAATG

SexA1-Pn3-107 Pn3-107 GCGACCTGGTATGTCTACCTCCATCTCTATATCTTCC

Pn3-107-Asc1 GCGGCGCGCCATGTCTACCTCCATCTCTATATCTTCC

BSP-TRP1-F TRP1 TGGCGTCCGGATACAATCTTGATCCGGAGCT

BSP-TRP1-R TGGCGTCCGGACACAAACAATACTTAAATAAATAC

43803-up LPP gRNA GATCATTTATCTTTCACTGC

20

Page 21: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

43803-LPP1-downCGCAGTGAAAGATAAATGATCAGCATAAATGCTGCCC

TAACGTTTTAGAGCTAGAAATAGCAAG

21

Page 22: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Table S3. Plasmids used in this work

Name Description Source

pRS425 2 MICRON, LEU2Sikorski et al.1

{Sikorski, 1989 #215}

pEASY-Blunt Simple Cloning vector for blunt ligation, Amp, Km TransGen Biotech

pUC57 Cloning vector with multiple cloning sites, Amp GenScript

p-SynAtUGD1 Cloning SynAtUGD1 gene into pUC57 This study

p-SynAtUXS3 Cloning SynAtUXS3 gene into pUC57 This study

p-SynPn3-29 Cloning SynPn3-29 gene into pUC57 This study

p-SynUGTpg101 Cloning SynUGTpg101 gene into pUC57 This study

pM3-SynAtUGD1 Containing PTEF1-SynAtUGD1-TCYC1 cassette This study

pM9-SynAtUXS3 Containing PPDC1-SynAtUXS3-TADH2 cassette This study

pRS425-LEU2-TEF1-SynPn3-29 Containing PTEF1-SynPn3-29-TCYC1 cassette This study

p-TRP1 Cloning TRP1 gene into pEASY-Blunt Simple This study

pGAL7-URA3 Cloning GAL7 and URA3 genes into pEASY-Blunt Simple This study

pM2-SynPgDDS Containing PPGK1-SynPgDDS-TADH1 cassette Dai et al.2

pM8-SynPgPPTS Containing PFBA1-SynPgPPTS-TTDH2 cassette Dai et al.2

22

Page 23: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

pM4-AtCPR1 Containing PTDH3-AtCPR1-TTPI1 cassette Dai et al.2

pM3-SynPgPPDS Containing PTEF1-SynPgPPDS-TCYC1 cassette Dai et al.2

pM2-PGM1 Containing PPGK1-PGM1-TADH1 cassette Dai et al.2

pM8-PGM2 Containing PFBA1-PGM2-TTDH2 cassette Dai et al.2

pM4-UGP1 Containing PTDH3-UGP1-TTPI1 cassette Dai et al.2

pM3-Pn1-31 Containing PTEF1-Pn1-31-TCYC1 cassette This study

pM9-Pn1-31 Containing PPDC1-Pn1-31-TADH2 cassette This study

pM13-Pn3-29 Containing PTEF1-Pn3-29-TCYC1 cassette This study

pM16-SynUGTPg101 Containing PENO2-SynUGTPg101-TPDC1 cassette This study

pM16-Pn3-31 Containing PENO2-Pn3-31-TPDC1 cassette This study

pRS425-LEU2-TEF1-UGTs

(The UGT gene library)Containing PTEF1-Pn1-2, Pn1-3, Pn1-4, Pn1-6, Pn1-12, Pn1-14, Pn1-15, Pn1-16, Pn1-17,

Pn1-20, Pn1-21, Pn1-23, Pn1-24, Pn1-25, Pn1-28, Pn1-29, Pn1-30, Pn1-31, Pn1-34, Pn1-

37, Pn1-39, Pn1-45, Pn1-46, Pn1-47, Pn1-48, Pn1-49, Pn1-50, Pn1-51, Pn1-52, Pn1-53,

Pn1-54, Pn1-55, Pn1-57, Pn1-58, Pn1-61, Pn1-63, Pn1-66, Pn2-7, Pn2-9, Pn2-19, Pn2-20,

Pn2-21, Pn2-30, Pn2-31, Pn2-33, Pn2-35, Pn2-47.Pn3-1, Pn3-4, Pn3-5, Pn3-6, Pn3-7, Pn3-

8, Pn3-9, Pn3-11, Pn3-13, Pn3-15, Pn3-21, Pn3-24, Pn3-26, Pn3-27, Pn3-28, Pn3-31, Pn3-

32, Pn-3-32-i5, Pn3-33, Pn3-35, Pn3-36, Pn3-38, Pn3-39, Pn3-40, Pn3-41, Pn3-42, Pn3-43,

Pn3-44, Pn3-47, Pn3-54, Pn3-55, Pn3-60, Pn3-67, Pn3-68, Pn3-74, Pn3-75, Pn3-76, Pn3-

This study

23

Page 24: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

79, Pn3-87, Pn3-88, Pn3-89, Pn3-91, Pn3-97, Pn3-98, Pn3-103, Pn3-107, Pn2-28, Pn2-39,

Pn3-19, Pn3-25, Pn3-29, Pn3-106-TCYC1 cassette, respectively, 2 micron, LEU2 marker

pLPPgRNA Containing LPP site gRNA, 2 micron, URA3 marker This study

24

Page 25: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Table S4. Strains used in this work

Name Description Source

BY-T3 BY4742-TRP, δDNA::PPGK1-tHMG1-TADH1-PTEF1-LYS2-TCYC1, TRP1::HIS3-PPGK1-ERG20-TADH2-PTDH3-ERG1-TTPI1-PTEF1-ERG9-TCYC1 Dai et al.3

BY-PPT BY-T3, GAL7::URA3:PPGK1-SynPgDDS-TADH1-PFBA1-SynPgPPTS-TTDH2-PTDH3-AtCPR1-TTPL1-PTEF1-SynPgPPDS-TCYC1 Wang et al.5

BY-GM BY-T3, EGH1::TRP1: PPGK1-PGM1-TADH1-PFBA1-PGM2-TTDH2-PTDH3-UGP1-TTPL1-PTEF1-SynPgPPDS-TCYC1 This study

PPD-PPT-GM BY-GM, GAL7::URA3:PPGK1-SynPgDDS-TADH1-PFBA1-SynPgPPTS-TTDH2-PTDH3-AtCPR1-TTPL1-PTEF1-SynPgPPDS-TCYC1 This study

PPD-PPT-GM-UGTs PPD-PPT-GM, harboring pRS425-LEU2-TEF1-UGTs, respectively. This study

Rh2-GM BY-PPT, EGH1::TRP1:PPGK1-PGM1-TADH1-PFBA1-PGM2-TTDH2-PTDH3-UGP1-TTPL1-PTEF1-Pn1-31-TCYC1 This study

Rh2-GM-UGTs Rh2-GM, harboring pRS425-LEU2-TEF1-UGTs, respectively. This study

Rd-GM BY-PPT, EGH1::TRP1:PPGK1-PGM1-TADH1- PPDC1-Pn1-31-TADH2-PENO2-Pn3-31-TPDC1-PFBA1-PGM2-TTDH2-PTDH3-UGP1-TTPL1-PTEF1-Pn3-29-TCYC1 This study

Rd-GM-UGTs Rd-GM, harboring pRS425-LEU2-TEF1-UGTs, respectively. This study

Rg1-XM BY-PPT, EGH1::TRP1:PPGK1-PGM1-TADH1- PPDC1-SynAtUXS3-TADH2-PENO2-SynUGTPg101-TPDC1-PFBA1-PGM2-TTDH2-PTDH3-UGP1-TTPL1-PTEF1-SynAtUGD1-TCYC1 This study

Rg1-XM-UGTs Rg1-XM, harboring pRS425-LEU2-TEF1-UGTs, respectively. This study

YS01 CEN.PK2-1D,

YJL064W::PPGK1-tHMG1-TADH1-PPDC1-ERG12-TADH2-PPYK1-ERG12-TPGI1-PTEF2-SynHMGR-N-TENO2-PENO2-IDI1-TPDC1-PFBA1-ERG13-TTDH2-PTDH3 -ERG8-TTPL1-PTEF1-ERG10-TCYC1,

NDT80::PPGK1-tHMG1-TADH1-PTDH3-SynHMGR-N-TTPL1-PTEF1-IDI1-TCYC1,

RHR2::PPGK1-ADH2-TADH1-PFBA1-SynPTA-TTDH2 -PTDH3-SynPK-TTPL1-PTEF1-SynADA-TCYC1,

This study

25

Page 26: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

GAL7::PPGK1-VvCPR-TADH1-PFBA1-SynPgDDS-TTDH2-PTDH3-SynPgPPDS-TTPL1-PTEF1-PgPPDS-TCYC1, EGH1::PPGK1-VvCPR-TADH1-PFBA1-SynPgDDS-TTDH2-PTDH3-SynPgPPDS-TTPL1-

PTEF1-PgPPDS-TCYC1, YPL062W::PPGK1-AtSQS2-TADH1-PTDH3-AtSQE2-TTPL1-PTEF1-SmFPS-TCYC1,

ALD3::PCCW12-SynPgPPDS-TCYC1-PCWP2- SynPgDDS-TCWP2, p414-TEF1p-Cas9-CYC1t

YS01-CK YS01, LPP1::PPGK1-PGM1-TADH1-PFBA1-PGM2-TTDH2-PTDH3-UGP1-TTPL1-PTEF1-SynPn3-29-TCYC1, p414-TEF1p-Cas9-CYC1t, pLPPgRNA

YS01-CK2 YS01-CK , containing pRS425-LEU2-TEF1-Synpn3-29 plasmids, recovering URA3, HIS3, TRP1 Marker

26

Page 27: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Table S5. Primers used in DNA assembly

Primer name Sequence (5' to 3')

1-M-pEASY-PGK1-F CTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGCCTTAATTAAACGCACAGATATTATAAC

s-7G-1-M-ADHT-PDC1-R CCTTGCTTGTTTATCTTGCACATCACATCAGCGGAACATATGCTCACCCAGTCGCATGTCGGCATGCCGGTAGAGGTGTGGTCAATAAG

s-7G-1-M-ADH1t-PDC1-F GGTCAGGTTGCTTTCTCAGGTATAGCATGAGGTCGCTCTTATTGACCACACCTCTACCGGCATGCCGACATGCGACTGGGTGAGCATATGTT

CCG

3G-1-M-ADH2t-ENO2-R TGTTAATCCACCTTCTAGCTCATTTTGGATGGCAACGGCAAGAGTAGGATTTAGAATTATATAACTTGATGAGATGAG

3G-2-M-ADH2t-ENO2-F GGTAATTCTTCTGTCATTTACTCATCTCATCTCATCAAGTTATATAATTCTAAATCCTACTCTTGCCGTTGCCATCC

6G-2-M-PDC1t-FBA1-R GTTATTTACAGAAGTTGGAAGGCTGGTATTGTTGTTCAAGCCAGCGGTGCCAGTTGGATCGGCAGTTTTGAATTGAGTAACCA

6G-2-M-PDC1t-FBA1-F AGTGTAATAGAAGACTAGACACCTCGATACAAATAATGGTTACTCAATTCAAAACTGCCGATCCAACTGGCACCGCTGGCTTGA

s-4G-4-M-TDH2t-TDH3-R GGCCTCCGCGTCATTAAACTTCTTGTTGTTGACGCTAACATTCAACGCTAGTATGGCGAAAAGCCAATTAGTGTGATAC

s-4G-1-M-ADHT-FBA1-R CCTCCGCGTCATTAAACTTCTTGTTGTTGACGCTAACATCAACGCTAGTATTCGGCATGCCGGTAGAGGTGTGG

27

Page 28: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

s-4G-4-M-ADH1t-FBA1-F CTCAGGTATAGCATGAGGTCGCTCTTATTGACCACACCTCTACCGGCATGCCGAGATCCAACTGGCACCGCTGGCTTGA

S-4G-3-M-TDH2t-TDH3-F GGCATCACGGATTTTCGATAAAGCACTTAGTATCACACTAATTGGCTTTTCGCCATACTAGCGTTGAATGTTAGCGTCAAC

3G-3-M-TPL1t-TEF1-R AGGAGTAGAAACATTTTGAAGCTATGGTGTGTGGGGGATCACTTTAATTAATCTATATAACAGTTGAAATTTGGA

3G-2-M-TPLt-TEF1-F GTCATTTTCGCGTTGAGAAGATGTTCTTATCCAAATTTCAACTGTTATATAGATTAATTAAAGTGATCCCCCACAC

2-M-CYC1t-pEASY-R CGTATTACAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGCGTTGGCCGATTCATTAATGC

X1-M-pEASY- r-t-F CTTGCAAATGCCTATTGTGCAGATGTTATAATATCTGTGCGTTTAATTAAGGCTCGTATGTTGTGTGGAATTGT

GAL7-interg-1 ATATTTGAAAGGCTTATGATTTTCTCTTGC

GAL7-interg-2 GTCCAAATATTTGAAAACAAAGGTACAGC

X2-M-pEASY-r-t-R CGAAGGCTTTAATTTGCAAGCTGCGGCCCTGCATTAATGAATCGGCCAACGCGCCAGGGTTTTCCCAGTCACGACGTTG

X1-Xp-pEASY-M13R-F CCGACTGGAAAGCGGGCAGTGAGCG

2-Xp-pEASY-M13F-R CCATTCAGGCTGCGCAACTGTTGGGA

Xp-M-pEASY-M13R-F GTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGTCACACAGGAAACAGCTATGACC

Xp-M-pEASY-M13F-R CTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGACGACGTTGTAAAACGACGGCCAGT

EGH1-UP-F GGCAACGAGATCCAATTGCGTGGTGTC

EGH1-UP-R CTTAGGGTAGGACTCTTCATCTTTGCC

28

Page 29: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

EGH1-Down-F GCATTAGAAGGAAGTAATCTTTCGTAC

EGH1-Down-R GCAAATTTCAATGAACTGATGGCCAGG

EGH1-up-Marker-50bp-F GACCCCAAAACGGGCAGGGAAGTTGGCAAAGATGAAGAGTCCTACCCTAAGCACACAGGAAACAGCTATGACC

Only-pPGK1-R CACGAGGTTCTACTAAACTAAACCACC

LPP-50-PGK1-F GAGAAACTCCTACATCAACGCCTAAGGAAACTCGTCATATTCTACCAAGGACGCACAGATATTATAACATC

LPP-50-CYC1t-R TGAAGCTTGATCGGCTTTTATTCTTTCCTGATAGGACTCTGTAAGTGTTAGCGCGTTGGCCGATTCATTAATG

M-URA-F ATGTCGAAAGCTACATATAAGGAACGTG

M-URA-R TTAGTTTTGCTGGCCGCATCTT

M-HIS3-ORF-F AGGCAAGATAAACGAAGGCAAAG

M-HIS3-ORF-R CTACATAAGAACACCTTTGGTGG

LEU2-up-F TGCTTGCATCACAATACTTG

LEU2-down-R CTTACCTGTATTCCTTTACA

TRP1-up-F GCTTGTCACCTTACGTACAA

TRP1-down-R ACCCCCTGCGATGTATATTT

29

Page 30: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Table S6. The pair-wise amino acid identities of the important UGTs.

Percent identity (%)Pn1-31

Pn3-29 Pn3-31

Pn3-32 Pn3-32-i5 PgUGT74AE2 PgUGT94Q2 PgUGT71A29 PgUGTPg100 PgUGTPg101 BsYjiC

Pn1-31 100 23.2 22.9 23.1 22.1 96.2 22.9 21.6 21.4 23.0 20.1Pn3-29 - 100 27.3 26.4 25.9 23.4 28.1 95.4 85.9 98.9 15.3Pn3-31 - - 100 90.7 93.4 23.1 97.5 27.8 28.4 28.1 19.5Pn3-32 - - - 100 88.6 23.5 89.4 27.4 27.9 27.4 20.1Pn3-32-i5 - - - - 100 22.3 92.3 26.4 26.8 26.7 19.5PgUGT74AE2 - - - - - 100 23.1 21.8 21.7 23.2 19.6PgUGT94Q2 - - - - - - 100 27.8 28.2 28.1 19.8PgUGT71A29 - - - - - - - 100 83.0 95.8 15.0PgUGTPg100 - - - - - - - - 100 86.0 14.8PgUGTPg101 - - - - - - - - - 100 15.8BsYjiC - - - - - - - - - - 100

Supplementary Table S3-S6 Reference:1. Sikorski, R.S.& Hieter, P(1989). A system of shuttle vectors and yeast host strains designed for efficient manipulation of DNA in Saccharomyces cerevisiae.

Genetics 122:19-27.

2. Dai Z, et al. (2013) Metabolic engineering of Saccharomyces cerevisiae for production of ginsenosides. Metab Eng 20:146-156.

3. Dai Z, et al. (2014) Producing aglycons of ginsenosides in bakers' yeast. Sci Rep 4:3698.

30

Page 31: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

4. Lin, T. et al. (2016 ) Construction of cell factories for production of lupeol in Saccharomyces cerevisiae. China Journal of Chinese Materia Medica,41 (6):1008-

1015.

5. Wang, J. et al. (2019 ) Optimization of UDP-glucose supply module and production of ginsenoside F1 in Saccharomyces cerevisiae. China Journal of Chinese

Materia Medica,44 (21):4596-4604.

31

Page 32: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Supplementary Sequences 1

>Pn1-31

MDREMLSKTHIMFIPFPAQGHMSPMMQFVKRLAWKGVRITIVLPAEIRDSMQINNSLINT

ECISFDFDKDDEMPYSMRAYMGVVKLKVTNKLSDLLEKQKTNGYPVNLLVVDSLYPSRV

EMCHQLGVKGAPFFTHSCAVGAIYYNARLGKLKIPPEEGLTSVSLPSIPLLGRNDLPIIRTG

TFPDLFEHLGNQFSDLDKADWIFFNTFDKLENEEAKWLSSQWPITSIGPLIPSMYLDKQLP

NDKDNDINFYKADVGSCIKWLDAKDPGSVVYASFGSVKHNLGDDYMDEVAWGLLHSK

YHFIWVVIESERTKLSSDFLAEAEEKGLIVSWCPQLEVLSHKSIGSFMTHCGWNSTVEALS

LGVPMVAVPQQFDQPVNAKYIVDVWRIGVQVPIGENGVLLRGEVANCIKDVMEGEIGDE

LRGNALKWKGLAVEAMEKGGSSDKNIDEFISKLVSS.

>Pn1-50

MGAELIFIPFPGAGHLISVVEIAKLLISRAEWLSITILVMKLSLDTGVAGYTQSLQKDAHNR

IVFADLPPDESVISMIKTIKTSDRLSIFRVFIESQKTQVRDAVKVIVSQSESNKLAGFVIDMF

CTPMIDVANEFGVPSYVFFTSGAAYLGLQFYHLSLDDEHKQDVIQYKDSDVELSVPYFVN

PVPAKVLPSVMLIKEGSTMIQSISRRFKETKAILVNTFLELEPHAIKCLADNGKIPPVYHVG

PIVNLKSKEGTTQNHNSEDGIISWLDNQPPLSVVFLCFGSLGSFDKEQVREIAYGLEHSGQ

RFLWSLRPPPQPTELFGLPKEYENHNEVLPEGFIKRTTGIGKVIGWAPQVVVLSHPAVGGF

VSHCGWNSTLESIWCGVPMATWPIHAEQQLNAFELVKELGMAVDIKMDYRKDSSIATEP

LVVTADEIEKGIKCLMNAASEMRKKVKEMKEKSRMAMVDGGSSHISLGHFIEDVMGNIQ

ERAKYNQVG.

>Pn3-13

MDIEKGRISIVMLPFLAHGHISPFFELAKHLSKRNCNIFLCSTPINLSSIKNRVSDKDSSASIK

LVELHLPSSPDPPPQYHTTNGLPSHLMVPLKNAFETVGPTFSEILKTLDPDLLIYDFNPSWA

PEIALSHNIPAVYFLTSAAATSSVALRALKNPGEKYPFPDFYDNSNITPEPPSADKMKLFHD

FVACFKRSCDIILIKSFRELEGKYIDLLSTLSKKTLVPVGPLVQDPLGHDEDPKTGHLINWL

DKRAESTVVFVCFGSEYFPSNEELEEVAIGLEISMVNFILAVRFLEGEKKGVLPEGFVQRV

GDRGLVVEGWAPQARILGHSSTGGFVSHCGWSSIMESVKFGVPVIAMARHLDQPLNAKL

AAEVGVGMEVVRDENGKYKREAIAEVIRKVVMEKNGEVIRRKARELSEKMKETGEQEI

GRAVEELVQICKMKKDAQY.

32

Page 33: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

>Pn3-29

MKSELIFLPAPAIGHLVGMVEMAKLFISRHENLSVTVLIAKFYMDTGVDNYNKSLLTKPTP

RLTIVNLPESDPQNYMLKPRHAIFPSVIETQKTHVRDIILGMTQSESTRVVGLLADLLFINI

MDIANEFNVPTYVYSPAGAGHLGLAFHLQTLNDKKQDVTEFRHSDTELLVPSFANPVPAE

VLPSMYVDKEGGYDYLFSLFRRCRESKAIIINTFEELEPYAINSLRMDSMIPPIYPVGPILNL

NGDGQNSDEAAVILGWLDDQPPSSVVFLCFGSYGTFQENQVKEIAMGLERSGHRFLWSL

RPSIPKGETKLQLKYSNLEEILPVGFLDRTSCVGKVIGWAPQVAVLGHEAVGGFLSHCGW

NSTLESVWCGVPVATWPMYGEQQLNAFEMVKELGIAVEIEVDYKNEYFNMTNDFIVRAE

EIETKIKKLMMDEKNSEIRKKVKEMKEKSRLAMSENGSSYNSLAKLFEEIM.

>Pn3-31

MDNQKGRISIALLPFLAHGHISPFFELAKQLAKRNCNVFLCSTPINLSSIKNKDSSASVKLV

ELHLPSSPDLPPHYHTTNGLPSHLMVPLRNAFETVGPTFSEILKTLNPDLLIYDFNPSWAPE

IASSHNIPAVYFLTTAAASSSIGLHAFKNPGEKYPFPDFYDNSNITPEPPSADNMKLLHDFIA

CFERSCDIILIKSFRELEGKYIDLLSTLSDKTLVPVGPLVQDPMGHNEDPKTEQIINWLDKR

AESTVVFVCFGSEYFLSNEELEEVAIGLEISMVNFIWAVRLIEGEKKGVLPEGFVQRVGDR

GLVVEGWAPQARILGHSSTGGFVSHCGWSSIAESMKFGVPVIAMARHLDQPLNAKLAAE

VGVGMEVVRDDNGKYKREGIAEVIRKVVVEKSGEVIRRKARELSEKMKEKGEQEIDRA

VEELVQICKKKKDAQ.

>Pn3-32

MDIEKGRISIVMLPFLAHGHISPFFELAKHLSKRNCNIFLCSTPINLSSIKNRVSDKDSSASIK

LVELHLPSSPDLPPQYHTTNGLPSHLMVPLKNAFETVGPTFSEILKTLDPDLLIYDFNPSWA

PEIALSHNIPAVYFLTSAAATSSVALRALKNPGEKYPFPDFYDNSNITPEPPSADKMKLFHD

FVACFKRSCDIILIKSFRELEGKYIDLLSTLSKKTLVPVGPLVQDPLGHDEDPKTGHLINWL

DKRAESTVVFVCFGSEYFPSNEELEEVAIGLEISMVNFILAVRFLEGEKKGVLPEGFVQRV

GDRGLVVEGWAPQARILGHSSTGGFVSHCGWSSIMESVKFGVPVIAMARHLDQPLNAKL

AAEVGVGMEVVRDENGKYKREAIAEVIRKVVMEKNGEVIRRKARELSEKMKETGEQEI

GRAVEELVQICKMKKDAQY.

>Pn3-32-i5(Xyl)

MDNQEARISIVMLPFLAHGHISPFFELAKHLSKRNCNIFLCSTPINLSSIKNRVSDKDSSASI

KLVELHLPSSPDLPPHYHTTNGLPSHLMVPLRNAFETAGPTFSEILKTLNPDLLIYDFNPSW

33

Page 34: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

APEIASSHNIPAVCFIIGGAASSSMSLHSFKNPGEKYPFLDFNENSNITPEPPSADNMKLFLD

FMTCFERSCDIILIKSFRELEGKYFDFFSTLSDKTVVPVGPLVQDPMGHNEDPKTEQFINW

LDKRAESTVVFVCFGSEYFLSNEELEEVAIGLEISMVNFIWAVRLIEGEKKGVLPEGFVQR

VGDRGLVVEGWAPQARILGHSSTGGFVSHCGWSSIAESMKFGVPVIAMARHLDQPLNAK

LAAEVGVGMEVVRDDNGKYKREGIAEVIRKVVVEKSGEVIRRKARELSEKMKEKGEQEI

DRAVEELVQICKKKKDAQ.

>Pn1-2

MFTTKMPTQKYSTQPPHVLIFPLPLQGAINSMLKLAELLCLSDINITFLITDNNHTRLLRYT

DIPSRFNKYPGFRLEKIPDGLPEDHPRGSRYIEMLESMKTNTKPIFKEMLTSGRLGDKLRP

VTCIIADGVMGFTCDVANEVGIPIIYTRTLSASCLWAFFCLPNLIETGELPLNGDDMDTPIK

SVPGTENFLRRRDLPNFCRSGDLSNPSIQLFKSEYKENSRADGLILNTFEDLEGPILSNIRTV

CPNLYTIGPLHSHLKTKLAAKSISSAISTNSLRKEDRSCIKWLDSQPPKSVIYVSFGSIAVMT

RNQLMEFWYGLVNSGCKFLWVIRPDSVAGDGEIPAEVLKGTKERGYIVGWAPQEEVLGH

SSVGGFLTHSGWNSTLESVMEGVPMICWPYFMDQQVNSRFVEKFWKIGLDMKDTCDRV

IVEKMVKDLMVERRDEFMKSADRMAKMGRKCLSEGGSSYCNLDRLIEDIKLMGVKSKQ

.

>Pn1-3

MDSPLSSSRPLEIYVLPFLSPGHMIPLSEIARLFAARGEHVTIITTPSNVALTPDKEQNASIRI

HTIPFPAKEVGLPDGLENFFSVKDIDTAAKVFTGMQLLRNSIEEYVTEHRPDCLVSDMFFP

WTADLAIRLDIPRLVFNATCMFSQVLKDAVRRPDSPHLTVKSDYDPFVIAGLPHPITMTRA

ELPDYVRTPNGYTKMMEEWKEAELKSYGVLVNNYYEFDSAYTDFYQKMVAPTQKIINV

GPAALIHRSGNEKVKRGHKTVVGEHECLSWLKSKDPNSVLYVCFGSGCIFPDAQLMEIAC

GLMAAGHDFMWVVLGKDDEKKEDEVNWLPLDFDEKMIKTNKGTIVKGWAPQVLILDH

PSVGGFLSHCGWNSVIEAVSAGIPMATWPLYAEHFYNEKLLTQVLGIGVEVGAEEWNLW

VDAGKKVIKREKIERAVNKIMDGGDGSKDMRRKTKEMGERAKKAVGEGGSSQRNVTVL

IEDLRKLREKREKA.

>Pn1-4

MDSPLSSSRPLEIYVLPFLSPGHMIPLSEIARLFAARGEHVTIITTPSNVALTPDKEQNASIRI

HTIPFPAKEVGLPDGLENFFSVKDIDTAAKVFTGMQLLRNSIEEYVTEHRPDCLVSDMFFP

WTADLAIRLDIPRLVFNATCMFSQVSDMPPFHLFFLSQVLKDAVRRPDSPHLTVKSDYDPF

34

Page 35: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

VIAGLPHPITMTRAELPDYVRTPNGYTKMMEEWKEAELKSYGVLVNNYYEFDSAYTDFY

QKMVAPTQKIINVGPAALIHRSGNEKVKRGHKTVVGEHECLSWLKSKDPNSVLYVCFGS

GCIFPDAQLMEIACGLMAAGHDFMWVVLGKDDEKKEDEVNWLPLDFDEKMIKTNKGTI

VKGWAPQVLILDHPSVGGFLSHCGWNSVIEAVSAGIPMATWPLYAEHFYNEKLLTQVLGI

GVEVGAEEWNLWVDAGKKVIKREKIERAVNKIMDGGDGSKDMRRKTKEMGERAKKAV

GEGGSSQRNVTVLIEDLRKLREKREKA.

>Pn1-6

MERKAMEYSGSEILVLPFYGQGHLFPSMELCIHFSSRNYSPTLIIPSYLSSSIPDSLRHHPFIQ

ILQIAVDKSESPPPAEKAAGGRFQHQHQQLGQGIERLLSARSGDTNQTRPVCVVFDVMMS

WSKEIFAKFGFPAISFFTSGACSAAMVYAGWKARVDEMKPGDIRVLPGLPQDMALAYSD

LDRRPPRHGGGGGNGSPLGPPNGDRLNGPPSRGMGFGPPGPGQKPRWLDEVEGSVALLI

NTCDDLEKSFINYVATQIGKPVWGVGPLLPEKYWKSAGSLLRDGEIRSSGRQSSYTEDEV

TQWLDSKSCGSVIYISFGSEVGPTVEEYEQLANALGELNQPFIWVLQPNSGKPGPPSTLFG

VQTGPDVVAEGGYYPHSLDKKVGNRGLIIRGWAPQLLILSHPSTGGFLSHCGWNSTVEAI

GRGVPILAWPIRGDQYYNAKLVVSHLKVGCMVSSGGDSAKGMKKEDIIQGIERLMGDEE

VHKRARILQGKLESGFPASSVAALDALVDFIK.

>Pn1-12

MEQNQKMASPQEHIIMLPFFAHGHLFPFLALAKQIQERTGFTITLVSTPLNILYLKSTNTQN

PQIHLVPLPFNSSHHNLPPNTETTESLPLSQVITLFHASSSLESPFRRFISDVTIRDGKSPICIIS

DVFMGWANEVAKSLDIVNVSFSTCGAYGSAAYVSVWQKLPHRFLENDNDEFCLPWFPER

CRVTRSHLHQFVRVADGNDEWSKFFQQQTTFSLGSFGWLCNTVQEIEPLGLEVLKNCTK

LPIWCIGPLLPQRMLESSSNPGTFGKRAGKEPGLSPEECLKWLDLFPKSSVLYISFGSQNTI

RPTQMMELAKGLEESGQPFIWAIRPPIGFNLKENFRDEWLPPGFEEQMIHSKLGLLVHKW

APQLEILSHKSTGAFLSHCGWNSTLESLSQGVPIIGWPLAAEQVYNSKMMEEEMGVGIEL

TRGLESSIVKEDVKRVIEIVMGKNGKGEEMRKKACEIGELIRVAAREENDVKGSSLQAMD

DFVASILSFSRR.

>Pn1-14

MAANDKLHIVMFPWLAFGHILPYLKLAKLIAKKGHKISFISTPRNIDRLPKIPPNLAPHIDL

VKFPLPSIPNLPENAEATIDVPFNKVKYLKIAYDQLQQPLTQFLVSNSPNWILFDIIAYWVG

PIASKLNVRSSFFSIFSASIMGYFGPPSVLMHSDEDRRKPEDYSIKPKWVRFETTVAMSLH

35

Page 36: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

QMQSTFVNFTEDDTENVPDQYRFAASIRDCEMVAVRSSSVFEPEWLELLDEIYQKPVVQV

GLLPTDTTDTDTGGKSNDSWRDIKDWLDKQEKVSVIYVAFGAETKPNQDELTELALGLE

LSGLPFIWALRKQRGSADPEPTELPEGFEERTRGHGLFYTSWVPQTKILSHDSVGGLLIHS

GWSSVIEAVQFGRALVLLPLLADQELIAKLVEEKKMGYLIPRNERDGWFSRDSVAESVRL

VMVDEEGKIYRDNVKEMKGVFGDMDKQDEYVDNLLSYLESHKILTK.

>Pn1-15

MEVVVDEAIKPHVAILPSPGMGHVIPLLEFAKRLVIIHNFHVSFLVITTEASTAQNNLLHSS

ALPADLHVIHLPPVDVSNVISDDMPIVTRISLICQESLKPLRSILIDLNRLKALIIDIFVTQTF

MVCQELSIPVYTFFTSTTQVLALSLYLPILDQEVKGEYIDLPEPIQLPGCKPIRIDDLLDQIR

DRKINEYNWFLLHVSRLPMAAGILVNTWEHLESTSTWLNALKQDPFVTNIPTPPIYPIGPL

VKHNDLVAQSDAYIMSWLDNQPRDSVLFVALGSGGTLRSEQLSELAWGLEASKQRFILVA

QIPTDLCAFATFFNVGSDGNDPVAYLPEGFVRRTEGVGLVVPSWAPQVAVLCHEATGGFLS

HCGWNSALESMVHGVPMIAWPLYAEQKMNATMLTEEVGVAVKPAARDGERVIRREEIER

VVRLVMESEEGKILRRRAKELQESAEKALVSGGSSYESLTRVVESWKK.

>Pn1-16

MDSQVSSRPLEIFVIPYLTPGHLIPLSELACLFAVRGEHVTIITTTTKAAVIQKTINKFSSSGH

PVSLHPIPFPSKEAGLPEGLELEQANDVETAGKFILGLNLMQPIIEDFVVMRRPDCIIADKF

YPWTSDLAARLSIPRLVFDPYSIFAKSLHEALLNPNSPHLTVESDYDPFVIPDFPHRITMTRS

QLPDSRFAQLFKVFREAEVNSYGLLANSVSELDSVYTEYYSKKMGHKVFHIGPASLIHQS

ANDKVERSHKSAVSERQCLSWLDSKKPDSVVYICFGSGCVLPDAQLMEIGYALELAGSD

FIWVVAGKNKDDEDEEKWLPRGFNERVVKKGKGLVVKGWAPQLLILDHPSTGGFLTHC

GWNSAIEAMIAGVPLISWPFICDNFFNEKFFTQVLGIGVEVGALDWKLFSEVGTKVINRE

KIEKAVRKLMDGAGEDEGKEMRKKTRELGEKATRAVKEAGSSHTNLTLLIEELKQLRIER

EERSDQ.

>Pn1-17

MKQLRELLTSNTSKNCKKSSEEQLEMSIPHVLAVPFPAQGHVIPLMELVQHFVKKGFKVT

FVNTDFNHKRVINALLEKDRNVGDMIHMVSLPDGLDPGEDRNDIGKLSEAVPRVMPEKL

EKLIKNINATDNNKITCLVADENMGWALKVAEKMGIRRVAFWPASAAVLGSIFSIPKLIKD

GIINNDGTIIKNQMIDVSPNMPARNTHFAWACFGDLKTQKMFFELTVNNNEFVKLADRIIC

NSAYELEPAAFTLIPNMLPIGPLLASNRHGNQAGNFWPEDSTCLTWLDQQPVCSVIYVAF

36

Page 37: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

GSFTVFDQSQFQELALGLELTNKLFLWVVRPDMTNEMNEAYPKGFTDRIGTRGRMVGW

APQEKVLRHPSVACFLSHCGWNSTMEGVSNGVPFLCWPYFTDQFLNQSYICDVWKVGL

GFNKDESGIIRQGEIKNKVEHLLGDRKFRARALDLKEKVVNSVKGHGCSHKNLSNITDW

MK.

>Pn1-20

MAVAGAGAHILVFPYPAQGHMISLLDLTHQLAVRGLTITILVTPKNLSFLDPLLSKHPSIKPL

VLPFPVSPSIPAGVENLKDLPVGSFSVMMAALGELYNPMLNWFESHPSPPVAIISDMFLGW

THRLACQLSIRRFVFCPSGALAMSVIFALWRDMRQRNDPGDENELISFPEIPNSPVYPWSE

LSPVYRSFVAGGPRSEFLKDAFLGNIASWGIVINSFRELERVYLDYLKESLGHDRVWSVGP

LLPPEIDRVSRGGSSSVLASEITSWLDKFDDQTVVYVCFGSLAVLTNKQMEELALGLEKS

GVNFLWSSKIPTEGHVEGEYGMVPLGFQERVAGRGLVIKGWVPQVSILSHRAVGAFLTHC

GWNSVLESIVAGVPMLAWPMGADQFVNTDLLDELKVGIRVCKGANMIPDSDELARLVA

KGVNNEERGERIARAKELSKAALVSTKMGGSSYKSLDELVRHLSQDQDFLACDPKKHIIK

.

>Pn1-21

MWSFGKAENSRPHAVCVPFPTQGHINPTLKLAKLFHHRGFHITFVNTEFNHQVLLKTRGP

HALDRLPTFRFETIPDGLPPSNARATEDIPLLFDSTSKNCLAPFRELLLKLNNPLSSPDIPPV

TCIVSDFVMGFTLKAAEELGIPKVLFWTASACSFMCFLHYGQLIEKGFTPLKDSSYLTNG

YLDTVVDWIPGMKGIRLKDLPSFIRTTNPNDPMINFGLADTDKYYKSSPQIILNTYDTLEH

EVLDALSSFLPSNIYSIGPIHHQLNNIHKSSTNSIELNLWKDEQQCLAWLDSKEPNSVVYV

NFGSTTIMSPDQLVEFSWGLANSNQTFLWIIRPDLVNGDTTVLPPDSVEAIKERGLFASWC

SQEQVLRHPSIGGFLTHNGWNSTLESITCGVPMLCWPFFADQQTNCWYCCTRWGIGMEI

NSDAKRDEVESLVKELMVGVKGKKMKEKAMEWKKLAEGAITPPAGSSYINFEKMVND

VLLTNNISE.

>Pn1-23

MNAPSFHIAMFPWFALGHLTPFLHLSNKLAKQGHRVSFLIPTKTQPKLQSFNLHPDLITFIP

ITVPHVDGLPRGSETTSDVPFPLQTLLVTAMDYTEDHVECLLYDLKVDVVLFDFAHWIPG

LARRLGIKSIHYCIISPATIGYTLSPERHLNVDKITEADLMKPPANYPGSNITLHAHEARAFA

SRRVMKFGNNTLFNDRQFISLSQCDALGFRTCREIEGPYCDYLESQFGKPVLLSGPVIPEPP

TSPLEEKWAKWLSKFASGSVIYCAFGSECILKMYQFQELLYGLELTGMPFLAALKPPAGA

37

Page 38: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

ESIEEALPDKFEERVKGRGVVHEGWVQQQLILGHPSVGCFITHCGSGSLAEALVNKCQLV

LLPQVGDQFINARMMSQNLKVGVEVEKGEEDGVFTRESVCKAVGNVAQEDNQVGKEVR

TNHARLRDFLLDKDLESSYIHSFSKKLQQLLG.

>Pn1-24

MDPPTFHIAMYPWFALGHLTPFLQLSNKLAKKGHKISFILPNRTQLKLQTFNLLPDLITFVP

ITVPHVDGLPHGTETTSDVPAPLHSLVMTAMDRTQKNIEILLHDLNVDIVFFDFTHWMPG

LARRLGIKSLFYSITCPGTTAFALSLAPRAHENNAKEADDFPLLSIKLHLHEARGFAAGSTE

NFGRGIKIFDRLLISLKECDALGFRSYRELDGPSCDFLESYFQKPVLLSGPLIPEPPNSTLDE

KWVKWLAKFGPGSVIYCAFGSECTLKKDQFQELLLGFMQMGMPFLAVLRPPLGVESVE

AAIPQGFNEGVGERGVVHEGWIQQQLILEHPSVGCFVTHCGVASLVEALMNKCELVLLP

QAAGDQIIQARILSTILKVGIEVEKGEEDGLFTRDSLCKAVRTVMEKDNEVGKEVRSNRS

KIREFLSNKELESSYINSFCEKIENLVLGRSS.

>Pn1-25

MDSQSNQLHFVMIPLMSPGHIIPMVDMAKLVAQRFVTVTIVTTPLNATRFGAVIARAIESG

LPIKFLQLRFPAVEAGLPEGCESIDNLPSFNLSRNFFAAVYMLQEPLQKMLEEIKPSPSCILS

DKHVTWTADTAKKFQIPWIIFDGMSCFTQICTHNLYTSRVHENVADSERFAVPNLPHHIEF

TKAQLPGLFNPRSPSAAAFHGRIRAAEVGAYGVVINSFEELERDYVNEFKKMRGDKVWC

VGPLSLCNKDKLDKAQRGNKASIDQNKILQWLDSWEPGSVVYACLGSLSSLAPPQLIELA

LGLEASEHPFVWVVREGNKQKEIENWILKEGFEERTKGRYLLIRGWAPQVLILSHPAIGGF

LTHCGWNSTIEGICAGVPMITWPLFAEQFFNEKLVVQVLDTGVSVGARAVVDLAEEDKN

GVTVRSEEVKKAIKRIMFGGEEWVERRERARELAKLAVRVVEEGGSSYLNMTLLIQDIM

QQTNTRLPSERKCSN.

>Pn1-28

MGLNSANKAHAVCVPFPAQGHVNPVMQLAKLLHSRGFYITFVNTEFNHRRLIRSKGYEW

VNGFDDFKFETIPEGLPPSDHDATQDPSALFDSIPKNCLAPFRDLLAKLNSSAEVPPVTYIV

ADGIMSFAVEAAKEIGVPEVQFWTATACGFMGYLHYPELMKRGIVPFKDENFMIDGTLDT

KVNWIPGMIRNLRLKDLPAFMRTTDPNNVLFTYARDEALNCFKASAMIFNTFEPLEHEVL

QALSSMFPHIYPIGPLSLLLKDLPNSPLYSFRSSFWKEDTKCLEWLDKREPNSVVYVNYGS

LTLMTDQHLKEFAWGLANSKHPFLWIVRPDIVMGDSAILSEDFLEDIKDRGLLASWCPQD

EVLSHESICVFLTHCGWNSTLESICEGLPIICWPFFAEQQTNCLYSCSDDQWGIGVEVNQD

38

Page 39: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

VKRGEIEALVKEMMEGDKGKKLKKNALEWKKKAEEATSVGGSSYSSFDRFIKEALHCN

M.

>Pn1-29

MGNPKPHVVCIPYPSQGHVTPMMRLAKLLHSRGFHITFVNTEFNHKRLVRAKGPQSVKG

FSDFRFEAIPDGLPPSDRDATQDIPKLCDSTRKNCLVPFKELLGKLNSSPELPRVTCIISDGV

MSFGIKAAEEMGIPEVQFWTSPACAMMCFLHYREFINRGICPFKDDNYLTDGTLDKPIDW

IAGMKNIRYKDVPSFIRTTDPNDIMFDFIGEEAQNCLKAPAIIFNTFDAFEHQVLQAYASKF

NYRNIYTIGPLPLLGRHVPESPVNSLNSSLWKPDSKCLEWLDKKEKDSVLYINYGSITTMT

EQHLIEFAWGIANSKHPFLWIVRPDIVTGDSDSASLPREFLEETKEIGMLATWCAQDQVLA

HPAVGAFLTHCGWNSMMETVCEGVPVIGWPFFSDQQTNCRYSCTEWGIGMEVNEDVRR

EEVELLVKEMMGGEKGKELRRKAKEWKMLAEEATNVGGSSYQNFDRFIKEALHYEG.

>Pn1-30

MAKNQQKVHAIMIPYPFQGHVIPFVHLAIKLASRGFTITFVNTHSIHQQISKAQSTDAGDIF

AGARESGLDIRYATVSDGLPLEFDRSLNHDQHQESLLQVISAHVDELVGKLVCSGNPPVSC

LIADTFFVWPSSIASKYNLVNVSFWTEPALVFTLYYHLDLLRQNGHFASSDNRLDTIGYIP

GVQAIEPRDLMSYLQASDITTVVHRIIYEAFEDVKRADFVICNTVQELEHETLSALNQKQP

IYAIGPIFPNEFKKTKIATSLWSESDCSQWLNSKANGSVLYVSFGSYAHVSKHDIVEIAHGL

SVSEVDFVWVLRPDIVSSDDTDILPVGFEDSVKERGLIVSWCHQIGVISHPAIGGFLTHCG

WNSTMESMWCGVPLICFPLLTDQFTNRKLVVDDWKLGINLCDQKVITRKEVSEKIDSLM

RGKDLNIKLRNEMKKVRITLESACVTDGSSEKNFENFISDVKIKIKILGR.

>Pn1-34

MSGSSGKGAHVLVFPYPAQGHMLPILDLTHQLALHGLTITILVTPKNLPILNPLLSAHPSSSI

QTLLFPFPHRPSLLETENIKDIGNSGNVLIINALSALSGPIIHWFNTHPSPPVALLSDFFLGW

THHLAHQIGIPRICFYSSGAFLTSILACLWDDDIDTLVSLDVVDFPDLPHAPSFPKEHLPTVF

LTYRKSNPMWELVKDGMIANSASWGCVFNTFDAIEGEYLEYSGKKMGHERVFAVGPLSL

LGGPDHTTRGSTSRSSGSDNTNILAWLDDVYPDGSVLYVCFGSQKLLKRVQMEALSSAL

EQCGIKFIWVVKSPTAQQVADGYGFVSDEFEKRVSGRGLIIKEWAPQVSILNHRAVGGFLS

HCGWNSVLEAIVSGVTILGWPMEADQFVNAKLLVDYMGVAIRVCEGPDIVPNSFELAKK

VAESMNAEITEKMRAKELKVKALEGVKHCGSSSRDLDGLVKELAQLQLKNDR.

>Pn1-37

39

Page 40: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

MEKNHCDNQSSGLKLAPPVTVIMVPFPAQGHLNQLLHLSRLISSYNIPVHFAGTTAHSRQ

AKLRVHGWDPLSIANIHFHEFSTPAIPSPPPKPNNSGKLPPHLEPLFDGLLLLREPVAQLLFA

LSYTTQRIVVIHDLAMASVVQDVVLIPYAESYSFQPVSAFFISSYKGENMREKFGIEDDIV

KDLPPFESILSSEVMEFSKKLQVHHKFNSGNLYNSSNVIEAQFLDILKKVHVSSDTDIKQW

AIGPFNPVVIYDSDPNQAHKCLKWLDKQAPNSVMFLSFGTTTTLSDEQILELAIGLENSEQ

KFIWVLRDADRVDIFAGDVRRAELPKGYEERLEGKGMVVRDWAPQLEILAHPSTGGFMS

HCGWNSCLESITMGVPIAAWPMHSDQPANAVLITKVLKIGVVVKDWALGDHELVKSSTV

ENAVRRLMASIEGDEIRRRAAQMGAAVRSSVAEGGVARVEMDDFIAHIRR.

>Pn1-39

MENNHVLHVVMFPWLAIGHLIPFLQLSKILAQKGHRVSFISTPRNIERLPKIPPDLSPMINL

VTFPFPKVHNLPDEAESSMDIPHHKAQFLKIAFDLLKSPLVDFLKNTRLKPDWIVYDYAS

HWLPSIAGELGILRGYFSLFSAACMAFLGPPSLLVSGEDGRLKGEDFTIVPKWIPFQSDVV

YRLHEVIKYVDGSAGNESGASDSVRFGASVGGSDLVLFRTCVEFEPEWFNLVCQLYRKPV

VPVGVLPPDNDDFGNEEKWVFIKEWLDKQMVNSVVYVALGSEAILSRDELSELALGLEQ

CGLPFFWVLRMPPGSTQDESQLLLPDGFVERVKSRGVVHQGWVPQVKILSHSAVGGFLT

HCGWNSVVEALGYGRVLILFPVMNDQGLNARLLKGKKVGVEIPRNERDGAFTSDSVAD

SVRIAMVSQEGESVRATAREMRGLFGHGIRNDNCIDNFVSHLVKMRKSQLMS.

>Pn1-45

MEKQTLANKPHFMTIPYPYQGHMNPMLQFSKRLASKGVQITILATNDVKTSKLAHTSSIN

IEYISYEIEQGDEIPKSVAAGLGYINHKVLKSVPGIIEKHKASGSPIKVIVYDSLIHGALELA

HKLGLRGASLFTQTCAVCSVYYHVQRGSLALPLEGHTVSLPSIPVLEIGDLPSLVQAHDRL

YPGLLEVIKKQFVDLEKADWIFFNVFDKLEEEISKWLSSQWPIKTIGPSIPSMFLDKRLPD

DRDYGLNLFKPDAEACIKWLNAKATGSVVYVSFGSVANLAQNHMDELAWGLLNSNINF

LWVVRETEQNKLPGNFNADASEKGLVVSWCPQLEVLAHKAVGSFMTHCGWNSTLEALS

LGVPMLVMPQWGDQTTNAKYVVDVWQTGVWVKARDKEIFTREDIANRIKDVMDGAK

GEELRANAIKWKELAVEAMSEGGSSDKNIDEFISKVLSTTT.

>Pn1-46

MEKQTLANKPHFMTIPYPYQGHMNPMLQFSKRLASKGVQITILATNDVKTSKLAHTSSIN

IEYISYEIEQGDEIPKSVAAGLGYINHKVLKSVPGIIEKHKASGSPIKVIVYDSLIHGALELA

HKLGLRGASLFTQTCAVCSVYYHVQRGSLALPLEGHTVSLPSIPVLEIGDLPSLVQAHDRL

40

Page 41: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

YPGLLEVIKKQFVDLEKADWIFFNVFDKLEEEISKWLSSQWPIKTIGPSIPSMFLDKRLPD

DRDYGLNLFKPDAEACIKWLNAKATGSVVYVSFGSIANLGQDHMDEVAWGLLNSSSNFL

WVVRETEQNKLPGNFKAEASEKGLVVSWCPQLEVLAHKAVRSFMTHCGWNSTLESLSL

GVPMLVIPQWTDQTTNAKYVVDVWRTGVWVKARDKEIFTREDIVNCIKAVMDGAKGV

ELRANAIKWKELTVEAMSEGGSSDKNIDQFISKVISTTTCSFQADISCI.

>Pn1-47

MAETQKPHVVCMPYPAQGHITPMLKLAMLLHHRGFYITFVHTEFNYNRLLKSGGPKSIS

GSPTFRFETIPDGLPPPENPDATQNIIELCISTSKNCSIPFRELLNKLNNSSDVPPVSCIVSDAI

MAFSVEVSEELGIPNVFFWTVNAFTLMTYLHYSRLRELHKDVLNGSENGYLNYVIDWIP

GTGSIRMRDSSSLIWSPDLPDSFVLYCIQEISRTYKASAIILNTFDELECEVLKPLSSMLNRV

YSIGPIHNLSKSVIPDNHTKYLRSNLWKEDSGCIQWLDSKQPGSVVYINFGSITVMSPQHL

VEFAWGLANSMQNFLWIIRPDLVMGDVAVLPPEFEMQTKQRGLLASWCDQEQVLNHAS

VGGFLTHCGWNSTLESLSAGVPMICWPFFADQLTNCYCICKLWGVGLEIDSDVKRDGVE

SVVKELIEGEKGKEMKKRVVEWKNKAKSATSPYMGSSYLDIDKMVNEVILSPMK.

>Pn1-48

MSDFGDPNCIAHVALIPSSGMGHLTPFLRLAVSLMSHDVRVTFITAHSTVSLAESQTLSRFF

STYPKITPAQLPLIHFDDPSEDPFYIQYEAIRRSSHLLLPILSSILPHLSALITDMSLASTVIPV

TKALSLPNYVLFTSSAQMSTLFLSYHTIFGSRDSEENNLFTIPGLQPLPKSWIPPPLLSDKD

SLFRTQLMDNGKKMLQADGILINTFESFEQEPLAALNDGKVVTGLPPVTPIGPFAPCDFER

GQPLEWLEGQPMGSVVFVSFGSRTAISREQIRELGNGLLRSECRFVWVVKENKVDREDD

KELGELVGKDFLAKVKDKGLVVNHWVNQGEILRHPAVGGFLSHCGWNSVTEAMWHGV

PMLAWPQHGDQKMNAYVVSRAGLGIWVENWGWGGGEMVVKGEEISKKVREVMDNEL

LRVQVAQIRERARAAVADGGSSNKRVVELIQMWKNTRVT.

>Pn1-49

MGTRQSQQMYDISFLALHTQVKTMGAELIFIPSPGVGHLVSTVEIAKLLISRDERLSITVLV

MKFPHDTGVTAYTRSLQKDAPNRIVFVDLPQNESLISTPKSLFTSFIESQKSPVRDSVRQIV

SRSDSNKLAGFVIDMFCTPMIDVANEFGVPTYVFFTSGAAFLGLQFYHLSLSDEHNQDLA

EYKDTDVELSIPSFINPVPAKVLPSVILNKEGSTMLQSISRRFKEAKAILVNTFAELEPHAIK

SLGDNCKIPPIYHVGPIINLKNKEGTTQNHSSEDGIISWLDNQPPSSVVFLCFGSFGSFDEG

QVREIAHGLEQSGQRFLWSLRRRPEKMELPKDYENPEQVLPEGFIERTSGMGKVIGWAPQ

41

Page 42: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

TAILSHPAVGGFVSHCGWNSTLESIWCGVPMATWPIYAEQQINAFELVKELGMAVEIKME

YREDYFFAPENNVVVTADQIEKGMRCLMMDGETEMRKKVEEMKEKSRMAMVEGGSSY

ISLRHFIEDVMGN.

>Pn1-51

MGAELIFIPFPGAGHLISVVEIAKLLISRAEWLSITILVMKLSLDTGVAGYTQSLQKDAHNR

IVFADLPPDESVISMIKTIKTSDRLSIFRVFIESQKTQVRDAVKVIVSQSESNKLAGFVIDMF

CTPMIDVANEFGVPSYVFFTSGAAYLGLQFYHLSLDDEHKQDVIQYKDSDVELSVPYFVN

PVPAKVLPSVMLIKEGSTMIQSISRRFKEAKAILVNTFLELEPHAIKCLADNGKIPPVYHVG

PIVNLKSKEGTTQNHNSEDGIISWLDNQPPLSVVFLCFGSLGSFDKEQVREIAYGLEHSGQ

RFLWSLRPPPQPTELFGLPKEYENHNEVLPEGFIKRTTGIGKVIGWAPQVVVLSHPAVGGF

VSHCGWNSTLESIWCGVPMATWPIHAEQQLNAFELVKELGMAVDIKMDYRKDSSIATEP

LVVTIHCCQ.

>Pn1-52

MTTVEEVFVVTSCGSGHLFPCIELCKHLSSRNFKTTLVLPSNLSSSLPSSFLHHHPLLSTAEI

AAASGGPPMLGSDPARHQMGLDLEAHLTRRSTESDSARPICAIVDFQMGWTKHVFWKFS

VPVISFFTFGACAAAIEWGAWKADAGKLNVGEVLNIPNLPNEMTVTFSDLKGRATGPSQ

AGRPGPSNTSLARLGGGGPPKPGDRPPWVPLIEDSIGLMFNTCDDLERPFIDYMANQMA

MPVWGVGPLLPDKYWQSSDRLISDRQIRGPKHPVNYNEDEVIQWLDSKPRGSVLYVSFG

SEVGPTSEECSQLASALEETTHPFIWVIQQKPGLPKSNANANANANENVGVGVGGGLIIK

GWAPQLLILSHPSTGGFLSHCGWNSTVEAIGRGVPFLAWPIRGDQIYNAKLVVNYLKVGY

MAMAEGGDDTLGISKKELILQGIDKLMSDEDVHKRAMAIRAKFHASFPKSSGEALDAFR

GVLL.

>Pn1-53

MTTVEEVFVVTSCGSGHLFPCIELCKHLSSRNFKTTLVLPSNLSSSLPSSFLHHHPLLSTAEI

AAASGGPPMLGSDPARHQMGLDLEAHLTRRSTESDSARPICAIVDFQMGWTKHVFWKFS

VPVISFFTFGACAAAIEWGAWKADAGKLNVGEVLNIPNLPNEMTVTFSDLKGRATGPSQ

AGRPGPSNTSLARLGGGGPPKPGDRPPWVPLIEDSIGLMFNTCDDLERPFIDYMANQMA

MPVWGVGPLLPDKYWQSSDRLISDRQIRGPKHPVNYNEDEVIQWLDSKPRGSVLYVSFG

SEVGPTSEECSQLASALEETTHPFIWVIQQKPGLPKSNANANANANANDNVGVGVGVGG

GLIIKGWAPQLLILSHPSTGGFLSHCGWNSTVEAIGRGVPFLAWPIRGDQIYNAKLVVNYL

42

Page 43: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

KVGYMAMAEGGDDTLGISKKELILQGIDKLMSDEDVHKRAMAIRAKFHASFPKSSGEAL

DAFRGVLL.

>Pn1-54

MEINRHRKLTPMAAGNDNLHVVMFPWLAFGHLIPFLELSKSIAQKGHKVSFISTPRNIDR

LPAVPTNLAPLITFVKLKLPRVHKLRQDAEATTDIRTDDIPYLKTAYDGLEPGLTRFLETESP

DWIIYDFSPHWLPPVAARLGVSRAFFSIFNAWFIAFSGSSSEAMINGSDQRIRPEDFTVPPS

WIPFPNNVAYRLHEINWVMGSSTDNESGVSDFYRAGSVISGSEVVFVRYCNEFEPEWIKL

LEELHKKPVIPLGLMPPSAQDRVGDQNEAEWLTIKEWLDSQDKGSVVYVALGSEATLSQ

NELTELGIGLELSGLPFFWVLRKPPGSEESNPIELPDGFLERIRDRGIVWMSWVPQLKILGH

ESVGGFLSHLGWSSIIEGLMFGRPLVMLPFLVDQGLNARVLVDNKVGIEVPRNETDGLFT

RDSVAESLRLVMVEEEGRIYRERTKELSGIFGDKELHNRYMDTSIKYLENHTK.

>Pn1-55

MEHEDQKPHAIFVTYPLQGHVNPSVHLAIKLAQRGFTITFINTHYVHLKRSEARSTTGGG

DEDEDIFAGARNSGLDIRYVTISDGLPVGFDRSLNHDQFMACLLHVFSAHVEEAVEKLIM

KSGPPVNCLIADTFFVWPSKLAKKFGILYISYWTETALVFTLYYHVDLLRLNSHFGCIDAR

EDAITYVPGVQYIEPKDMTSYLQDIDTNTVVHQIIFNAFKDVRGADFVLCNTVHELEPDTI

SALQSKIPFLAIGPIIPFGLNKNPVPTSLWAETDCSQWLDSRPHGSVLYVSFGSYAHLTKND

LVEIGNGLLDSKLSFVWVIRPDIVSSDDPEPLPDGFREEMGERGIIITWCNQIQVLGHPAIG

GFLSHCGWNSVLESIWSEVPVLCFPLNTDQFTNRKLVVDEWKVGINLCDRKPVAKFEVS

EKINRLMSGKSGDDFRKKVRQVKKLLENALGSTDGSSEKNMKQFISELKVKIYSKFRNA

RSIVCPTANN.

>Pn1-57

MESPNRPHAVCIPLPAQGHINPMLKLAKLLHSKGFYITFVHTEYNYNRVVKSRGPDSVHG

PNDFGFETVSDGLPWTNQRRIIDLPDLCVSMPIHLLQSFRGLITKLNANSSSNIIPPVSCIVS

DGVMSFTMEVAEEFGIPEIFLFTPSACAMLGYLHFDELIERGYFPLKDESCLSNGYLDTEID

WIPAMPGIKLKHLPTFIRTTDKNDIMFNYNLESIQNALKTKTLILNTFDELEQQVLDAIKIK

FPSLYTVGPLSLLHQQHCQTKLAKSENESNLWEEDMSCLEWLDKKAPKSVVYVNYGSFV

IMTREQLSEFAWGLANSNYTFLWVIRPDLVLDGGDDIIISKDFMEKIGDRGMLVGWCPQE

EILRHPSVGGFLTHCGWNSVLESICEGVPMICWPFFADQQLNCVYLCREWGIGLEIDSDV

RREKVERLVKELMEGEKGNVKREKALEWKERAENATKNGGSSYINLDLLVMHLKQESV

43

Page 44: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

GKNISKP.

>Pn1-58

MATEDPKLHVLILPYVTPSHMMPLVEIGRLIAARGVNITIIATPHNANLFRSSVDQDINSGH

QISIHELKFPSAEVGLPEGIENLSAITSTDMSAKVFEGIMRLRKPMEDLIRNLSPDCIFSDMF

YPWTVELAEELKIPRLMFYPSSFFYYCLSHSLKLYAPHDQKVQSDAESFLIPHLPDTIEMK

RSQLQDHVKGKSRLGVFMDAIKNSELKTYGIVHMTFYELEPAYADHYIKIKPAKFWGILP

LFQFFKGLKAPRSNDSQHNCLSWLDTQKPNSVVLLSFGSLVRFPDAQLTEIALALEASTHS

FIWVVRKSEANRENQEKSWLPAGFEERMMEGNKGMMVRGWAPQVKILAHPATGAFMT

HCGWNSVLEAVAAGVPLITWPLFAEQFYNEKAINEVLKIGVGVGAEVWNPTFEITCPPVG

RDKIEKALSKLMGGSEESQKIRQKAKEMAAMAKGAVEEGGSSYNNITALIEELKACAFE

KSKNG.

>Pn1-61

MNTLLLFHISFEQISKVLYLTFVSKASMAAKPFHIMMFPWLAFGHMLPYLELTKKFAQNG

IKVSFVSTLRNLQRLPPIPPNLANEIKLVEIPLPSVEGLPENCEATMDLQKEQVQHLKKAYD

MLPAAFEKLLQKDLPDLLFVDFSPYWIPEVAEKFGVQSAFFSVYTAATLAYLGPPNVLKS

GENRRSPQEWFTFPSLVTHISDYEHSRLKKAHFPDASGISTGQRLGRIVERCDFVAIRSCKE

FEGEYIDLVQQLYQKPVLPIGVLPPVLTDNKALDAINSNWSNNFKWLDEQRHKSVIFVGF

GSEYKMPIEQVHELAFGLELSNLPFLWILRKPDAIHSNALLPSGFDIRTSNRGQVCLGWAP

QSKILAHPSIGGCLFHSGWGTIIESLSFGHPLILMPMVADQGLNAKLLVEKGLGYEVPRNE

DNSFSRDMVAKSLKIVMTTQEGNPLRFKAGQTQTVFGNQHLHENYINKLIQYLGKFKRS

EDGVHKLLKKKSKIKDLV.

>Pn1-63

MEKLELIFIPTPMMGHATQALHLANLMINRFDHLSITVLIMKLPVDPISTNFMESLGSPAAT

SSSNVNSEVDRIRFHHLPPSKITSDCCYRTPGVLLDLVIEDHKDHVRNYLVTRKSTPAAAL

VVDMFCTTMMDVGNQFGIPTYVFFTSGAAFLQLLFHLQILHDVDNGDVSELINSDTELVI

SGFVNPIPPGVLPFVLIDKYQWSTRFLKYARACRKANGIIVNTFVELESHALNSLHGDGAT

KSPPIYPVGPIINHAQMNLADDRDHDIMRWLDDQPRSSVVFLCFGSMGGFDMEQVREIA

NGIELSGYRFLWSLRHPAEKGKSLFPTDYSCIEQIFLPDKFFERTAHVGKVIGWGPQLKILA

HEAVGGFVSHCGWNSILESLWHGVPIATWPIYSEQQLNAFEMVRELGLSVEIKLDYHNY

MDNDMNKVLVRAEEIERGIRSVMDGGNEVREKVKRMRDKSRMSMEEGGSSYKSLELLI

44

Page 45: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

EDMKV.

>Pn1-66

MDYQEKILPPHVLLFPLPLQGPVNCMLKLAEFFSLAGLHVTFLNTHHIHHRLLKHAHIQS

RFAPYPKFRFETISDGLPEDHQRSGDRFFDIVDGLQLVTEPLFKEMLASGSLSSKSGTPITCI

IADGFFTFALDVAKEVGIPLFYFQTISPCALWVHLCLPKLIELGEVPFKGNDLDAKITCVPG

MEGFLRRRDLPSFCRNNDLNDPIMQHVLKEDQELPRAQGLILNTFEDLEKPILTHIRTICPN

LYPIGPIHTLHKSILSANKSPPQVNSSNSLWEDDRSCMTWLDAQPPNSVIYVSVGSLAVMT

RDQLMEIWHGLVNSGKRFLWVRRLGSIAREDDQKYETPTELSDATKERGCIVSWVQQEE

VLGHPSIGAFLTHSGWNSTLESIVEGVPMICWPYFVDQQVNSRFVGEVWRIGVDMKDSC

DRGIIERMIRDVMEVRKDEFQESAKVMAKLARQSVCEGESSYYSLDRLIKDIRKMTLSTP

HT.

>Pn2-7

MATQKCFRVLMFPWLAHGHISPFLELARKLIEKGNFFIYFCSTPINLISIKKKLSGDDHQNY

TKSIQLVEHNLPTLPQLPPHYHTTDGLPPDLNSTLRKAFEMSKLSFPNTLNTLKPDLLICD

DLFQWPEIVASSHDVPVVRFQTCSVTAGSFIAHTFRNPDVTYPFPSIYLHEYEIDQIRRCVD

AVFESGKEESGNLLVVNTSKAIEEKYFDYYSLLRGNTKIMPVGPLIQQAPNGDEDMKVIE

WLDKKDPCSTVFVSFGSEYFMQKEEVEEMAHGLELSNVNFIWVFRAPVGAEKVKLPLG

FVERVGGRGIVMEGWAPQARILGHSSIGGFVSHCGWNSVLESINFGVPIIGMPIKFEQPMN

ARLVSELGVCVEIVGDETRRFGREEVVNVIKKVVGGKIGDDLRRKVKELGATIKEKQEEE

MDDVLDELVQICNKKKRIVV.

>Pn2-9

MDTNSNSFRVVMFPWLAYGHVSPFLELAKKLSTKNFSVYFCSTPITLKPIKNKISNYKSIEL

VEYPLESTPEFPPHLHTSNGLPPHLMPTLKKYFENASHNFSQIIKTLSPHLVIYDYLMPSW

VPKFASSHQIPAVHFHIFGVANLAYFTCLVRDIPGFSFQSKTVCLKPSEIMKLVQAPRDNVE

AEENELSDCIIGSTEMFLIKSNREIEGKYLDFAADLFKKKIVPVGPLFQEISVNNQENDEEIF

RWLNKKEEFSTVYVSFGTESYLSKKGMEELANGLELSKVNFIWVIKFPEGEKINAAEALP

EGFLERVGEKGMIVERWVPQAKILGHKSIGGFVSHCGWSSVMESASVGVPIIALPMHHD

QPVNARLVVEVGFGLEVEKDENVEFWREEVARVVKEVVIEKSGVELRKKAKELSEQMK

AKGEEEVDLAIKELKTLCENNLGKYRDIN.

>Pn2-19

45

Page 46: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

MATEEASLRVMMFPWLGYGHVSPFLELAKKLSAKNFHIYFCSTPINLNSIKNRIDSFSSSIE

LVEFHLPSSPELPPHYHTTNGLPHHLHKTLLQAFNMSKPNFSDILNNLKPNLLIYDTYQPW

VPEIASSHHIPAVNFHCTGAASSCFFYSNFKLQGKGLQFNFPAIYLRESEIRKMIASAPYDT

EAAEDPIYTCVEKSLDFVLVKSCRAIEDKYINFFSQLLNKKMVTVGPLAQSGEEEEEKDS

VIMEWLSQKGKSSTVFVSFGSENYLCNKKMEELAYGLELSNVNFIWVIRFPSGGEKIKIG

DVLPEGYLERVKERGLVVEGWAPQAKILGHSSTGGFVSHCGWSSVTESLSYGVPVIAIPM

NFDQPLNARLMVELGAGLEVLKDEDLEFDREEVARVIKEVVVEKSGEGIRCKAKELSEEI

RMKEEGEMCGVVEELEKIICNKNLDE.

>Pn2-20

MRVLPILLPAQTTTTTTMDSSSQLQQLHFVLIPLMSPGHLMPIVDMARLFAQHGVIVTIVS

TPLNTKRFKTIVDRAIDSGLQIRIIDLYFPAAEACLPQGCENMDSISRNLIKNFFMASSMLQ

QPFDQLFDQLSPRPSCIISGKNQAWTVETARKFNIPRLFFDGMGCFSFSCTHNLKMSEEFQ

RVTSKFETFLVPGLPHEIELTKAQLPEALNPGGSGDLIDVRNKMTAAESIADGIIVNSFEEL

EPEYVEMYTRAKGGNIWCIGPVSASNKLILDKAERGSFAPTENEIQCLEWLDLQEPNSVV

YACLGSISGLTASQLVELGLGLEASKRPFIWVIRGGERSKELERWIKQERFEERTKGRGLLV

RGWAPQLLILSHSSTGGFLTHCGWNSTLEGVSAGKPIIACPLFAEQFYNEKLVVKVLGTG

ASVGVEAAVTWGMEDQFGLVMKRENVEKAIQEVMDKGVEAEERRKRAREFGDMAKR

AIEEGGSSYLNIRSLIQHVKEKNELKHACT.

>Pn2-21

MDTEKLHVAIVASPGLGHLTPVLLLGNRLATHHNIHVTVIVVTTQLSTAESQILHPSTAPKL

LNIVQVPHVDISGLIDANTKVVTQLSIMMRETRPRIRSAISAMNHRPDALIVDLFGSELLPI

AEEFCMPRYIFVTSTAWFVALTTYCQVLDKEIQGEYVDQKEPLKIPGCRPVHPEDVVDPM

LDRSDQQYREYIRHGVEYSMFDGILMNTWEDLEPTTIQALRNDEILQTVVKVPVYPIGPL

STPVKPASQKSELIEWLDTQPSDSVMYVSFGSGGTLPAEQVNELARGLEQSQQRFIWVVR

PPIESHADGTFFTSGNGPDGTPAYLPERFLSRTHKLGKVVPLWAPQMEILSHPSVGVYLSH

CGWNSALESISNGVPMVAWPLYAEQRLNATLLTEDLGVAWRPTVLPGKKVVEREEIEKM

VRNVMQYREEGKGAWERVKEVKCSGDRALSKGGSSYNSLCEVIKDCERRLKKLI.

>Pn2-28

MENQGLRKCLHVVLVPAPLQGHISPMLQLGDVLHSMGFSIIVAHTIFNSPNPSNHPEFIFLP

ISDNLSDHDTSPGNRLALFKSINKNCEEPLRQSLAHIMHQQELGDRVVCIIYDQIMYFSEA

46

Page 47: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

VASHLKLPCMNFRTISASFALAYKEIPRLLAEGYIPRQNSMLQDLVPGLFPLRFTDMPTDIG

SLEDVIENMELSINTRNYSAIIWNTVDHLEQTALAQFLQPRPVPLFSVGPLHKMASPSSTS

LLEVDTSCISWLDKQAPRSVIYVSIGSLAFMEEKELTEMAWGLASSGQPFLWVVRTGSEC

DSEWKLLPKGFKETIGERGCIVKWAPQRKVLAHDAVGGFWSHCGWNSTLESLSEGVPMI

CWPCLGDQKVNARYISHEWKVGQQFEHKLERNAIKQAIRGLMVNKEGENTRQNAMDI

MEKIWFSMNKGGSSYNSLNGLAEFISSFQLSK.

>Pn2-30

MYESTTKMGFAASDQKPHAVCIPFPAQGHINPMLKLSMVLHYRGFHITFVNTDFNHNRLL

KSRGSNSLDGLPDFRFHSIPDGLPPTEADVTQDIPTLCFSTARNCLVPFRELLHKLDQNSSS

LNSVDPPVTCIISDAVMTFTLIAAEEIGVPCVSFRTTTACCFMLNKQYSHLKEKVLLPLKD

ARYLTNEHLNMTVDWIPGIKNICLKDFPSFVQTTNLNDKMVEFTIGEAERASTASAVIFNT

FHELECDVLQALSSMCSPIYSIGPVRLLINKLPLSKLKPIGSNLWKEDTECLKWLDSKQLN

SVIYINYGSITVMTKEKLVEFAWGLANSKHNFLWVIRPDLIVGETAILPPEFYEEIKERGLIA

GWCPQEEVLDHVAVGGFLTHCGWNSMIESLSCGVPMICWPFFADQPTNCKLACSDWGV

GMEIGKNVRRDELEMHVRELMGGEKGKAMRKKAMEWKKKAEEATGEFGSSSLNLDKL

VRDVL.

>Pn2-31

MSKMEQTPHIAVVPSPGMGHLIPLIEFAKRLVNFHNVSVTVIVPTIGTPPKVETAVLESLPK

AINYVHLAPVSFEDLPRDSKVETTIFLAVTRSLSSLRGTIKLLVARTALVALISDPFGTDAFS

VAKEFDISPYLFFTSNAMNFAFSYMLPKFDETMSCEFRELPDPVIIPGCIPVHGGDLMDPV

QDRTNEVYKLLLHHTKQFSFAEGVLLNSFIELEEGAIKALQGKEPGKLPVYPIGPLIQTGSS

DEADPSECMKWLDNQPSGSVLFISFGSGGTLSYDQLIELALGLETSGQKFLWVARAPNDN

SSNAAFFSVESQNDPLSFLPKGFLNRTKGQGIVVSSWAPQTKILAHGSTGGFLSHCGWNS

TLESVVHGVPLIAWPLYAEQKMNAIMLAKGLKVALRPKVNQNGIVERDEIGKVVKSLIEG

EEGKKVRSRMKEVKDAATKVLREDGSSTKTLSELVKKWKNKIAPIAS.

>Pn2-33

MENRQILPPHVLIFPLPLQGPVNCMLKLAELFSLAELRVTFLNTHHIHHRLLKHAHVESRF

GRYPEFRFETISDGLPADHQRSGDRFIEVINGIEAVTKPLLKEMLASGCLSSKSGRPVTCIIA

DGFLSFVLDIAKEVGVPLLNFDTISPCALWTYLCIPKLIEVDELPFKGNDLDAKITSVPGME

NFLRRRDLPSFCRNNDLNDPIMQHVLKEDQELPRAQGLILNTFEDLEKPILTHIRTICPNLY

47

Page 48: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

PIGPIHTLHKSILSANKSPPQVNSSNSLWEDDRSCMTWLDAQPPNSVIYVSVGSLAVMTRD

QLMEIWHGLVNSGKRFLWVRRLGSIAREDDQKYETPTELSDATKERGCIVSWVQQEEVL

GHPSIGAFLTHSGWNSTLESIVEGVPMICWPYFVDQQVNSRFVGEVWRIGVDMKDSCDR

GIIERMIRDVMEVRKDEFQESANVMGKLARQSVCEGGSSYYGLDRLIKNIKKISLSTSQA.

>Pn2-35

MAQQTIPPHVLIFPFPLQGHVTTMLKLSELFCLAGIHVTYLISTQNYIRLVRCTNVLSRLSP

YPGFRFQVLPESVSDGNAQSMDLVMNLFESLKTAKTFVKDMLIGEAQKPVTCIITDGVMS

FTLDLGEEIGIPIIYFRTISACSFWAYFCMPKLVEAGELPLKGDDMDVLIKNVPGMEGFLRQ

RDLPSLFRVSELNNSAFKMWSTETHQTIRAQALILNTFEDLEGPILSQIRTQCPNTYTIGPL

HAHLKARLDGKTASQLASSNSLWEEDKSCIKWLDNQPLKSVVYVSFGSIAMVTREQLVE

FWHGLVNSEIRFLWVIRPDSVTDEEMGSEIQAELEKATKGRGYMVEWVPQEAVLAHQAV

GGFLTHSGWNSTLESIFEGVPMICWPYFADQQINSRFVGEVWKVGLDIKDTCDRVVIEKA

VRDMMEVRKEEFVESADQIAEMARKSVGEGGSSYCNLNRLIEDIMLMTLGGITP.

>Pn2-39

MSMSNKHGPGQVIVVMVPFPAQGHLNQLLHLSRLISAYDIPVHYVGTTIHNRQAKLRVH

GWDPLSVSNIHFHDFQTPSFPSPPPNPNSPLGHLLPSFELASSHLRKPVASLLQQLSSAAKT

RLVIIHDSLMSSVVQDLVLVPNAEAYTFHSVSAFTVFLHLWENLGRPFAVQSDILKDLPSL

DGCFTSDFANFISSQHDCMKFNSGKIYNTCKLIETPYLDLLAKEQISKNKKQWPLGPFNPV

SVSQNQTPNLQHQCLKWLDKQATSSVIFVSFGTTTSLRDQQIQELALGLEKGAQNFIWVL

RDADKGDVFGGEVRRAKLPKGYEERMKGKGMIVRDWAPQLEILAHPSTAGFLSHCGWN

SCMESITMGVPILAWPMHSDQPRNTVLITKLLRIGLVVKNWSLRDELVVAETVGDVIKKL

MTSQEGGEIRRRAEKMGGDVRMSVAGGGVSRLELDSFIVHISR

>Pn2-47

MGAELIFIPSPGVGHLVSTVEIAKLLISRDERLSITVLVMKFPHDTGVTAYTRSLQKDAPNR

IVFVDLPQNESLISTPKSLFTSFIESQKSPVRDSVRQIVSRSDSNKLAGFVIDMFCTPMIDVA

NEFGVPTYVFFTSGAAFLGLQFYHLSLSDEHNQDLAEYKDTDVELSIPSFINPVPAKVLPS

VILNKEGSTMLQSISRRFKEAKAILVNTFAELEPHAIKSLGDNCKIPPIYHVGPIINLKNKEG

TTQNHSSEDGIISWLDNQPPSSVVFLCFGSFGSFDEGQVREIAHGLEQSGQRFLWSLRRRP

EKMELPKDYENPEQVLPEGFIERTSGMGKVIGWAPQTAILSHPAVGGFVSHCGWNSTLESI

WCGVPMATWPIYAEQQINAFELVKELGMAVEIKMEYREDYFFAPENNVVVTADQIEKGM

48

Page 49: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

RCLMMDGETEMRKKVEEMKEKSRMAMVEGGSSYISLRHFIEDVMGN.

>Pn3-1

MKNPELIFMPAPFMGHLPSMLEMAKILVTRYETLSITVLLIKLPFDTGLSAYIQSLSSTPIPR

LTLVELPQSDPSTYMSKPPNTILYSLIEMQKIYVRDIVQDIFTSESTRLAGFFIDMFCVTMID

LAAEFKVPTYVFFAASAAFLGLMFHVQTLSDEYGKDVTELKDSDAELLVPSFRNPFPCKL

LPSMVLDNQGGCDVLLSMAKRFRETKGIIVNTFMELESYAIESLIASDRKIPPVYPVGPVL

NSAGVGYNSDETAVIIQWLDDQPVSSVVFLCFGSMGSISNDQVKEIAFALEHSNHRFLWS

LKPPLSKGAAKPSQDTSNLKDILPVGFLERTAGIGKVIGWAPQVAVLSHVAVGGFVSHCG

WNSILESVWFGVPLATWPLYAEQQLNAFEMVIEYGMALDIKMDYRYDGNMKNCVIVKA

EEIESKIRELMMDINRNELRKTVKDVQEKSRNAILEKGSSYVSIGSIMEEII.

>Pn3-4

MLPWLAFGHMIPFFHLSLALAKQGIRISFVSTPKNITRLPKVPAELAHLINLVKLPLPNVAT

SNYILPNDAEATVDLLPENVQYLKIAYDLLAQPFKQFVSGQLPDWIIVDLIPHWAVDVAQE

CGVPLIFFSAYNAAALAFVGPPEFLTGDAQKRVRPSPESLTISPEWISFRSSVAFRMYEAIGF

HPGLYGENASEIKDAQRVASVIQGCYAMAIRSCPEVEGEYLNLLGNIFGKPVIPVGLLPPAP

PGERENVVPWNLMFNWLDKQKARSVVFVGFGSECKLSKDQVNEIAYGLEMSELFFIWV

LQKPDWTSHDVTTATPPGFTQRTAEKGVVHVGWAPQKEILAHPSIGGSLFHCGWGSVIET

LQYGHCLVALPFVFDQGLTARMLVDKGLAIEVERREDGSFSRDDIAKSLRQAMVSNEGE

NLRNNAKEVALVFGDQKLHQEHYIGEFAQFLKNGIVDIDGNTGKNSEK.

>Pn3-5

MEKQGEKKASEQSTGGRLLLFPPPLQGHISPMLQLANILHSKGFKITIIHTNFNSPNKSNYP

HFTFEPISESLVENEFEGSSTADFGNVLILIKLLNNTCVDPFRDCIVRLLSEEPILCLITDALL

VFTQAVADKLKVPRIVLRTGSLSSFHVFHAHPLLREKGYFSMEDYKSEAPVAEIPPLKVKD

IPMVFTHEEESLHEIIEAMMKATKAASGLIWNSFEELEQSSLIGIRQDFPIPNFTIGPFHKYF

VASSSSLFTQDQTSISWLDLQKPESVLYVSFGSILTIDKSDFLEMAWGLANSMQKFLWVVR

PGVIRGSEWLEPLPDGLLEAVHERGHIVKWAPQQQVLAHPATACFWTHNGWNSTLESICE

GVPMICSPCFGDQPVNARYVEGVWKVGVVLENGLERREIESGIRRLMVDEEGKEMKRRT

KCLKEKVNLCLKQGGSSNQSLQNLVSYIMSL.

>Pn3-6

MAEQTIPPHVLIFPFPLQGHVTTMLKLSELFCLAGIHVTYLISTQNYIRLVRCTNVLSRLSP

49

Page 50: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

YPGFRFQVLPESVSDGNAQSMDLVMNLFESLKTAKTFVKDMLIGEAQKPVTCIITDGVMS

FTLDLGEEIGIPIIYFRTISACSFWAYFCMPKLVEAGELPLKGDDMDVLIKNVPGMECFLRQ

RDLPSLCRVSDLNNSAFTMISSETHQTVRARALILNTFEDLEGPILSQIRTEFPNTYTIGPLH

AHLKARLDGKTASQLASSNSLWEEDKSCIKWLDNQPLKSVVYVSFGSIAMVTREQLVEF

WHGLVNSGIRFLWVIRPDSVTDEEMGSEIQAELEKATKARGYMVEWAPQEEVLAHQAVG

GFLTHSGWNSTLESIFEGVPMICWPYFADQQINSRFAGEVWKVGLDIKDTCDRVVIEKAV

RDMMEVRKEEFVESADQIAEMARKSVGEGGSSYCNLNRLIEDIMLMTVGGIRA.

>Pn3-7

MKRAKIIMVPYPAQGHVTPMLKLASGLASRGFEPVLVTPEFIHRSIAPQIGGRSEISCTKIP

DELDEGIRRDFFAIEMAMENNMPVHLERIVQKLVDEDGGRVACFVVDLLASWAIKVGCD

RGIPVAGFWPGMLEAYHLITAIPDMIQTGIISETGIPQYQGPVFFKSLKPMLSTEDLPWLIGT

SVERKSRFKFWTKTLDRSKALQWLLVNSYPDHDEDDDDDEIKTQQMIMQVTNYDSQGD

SPHILPVGPLSNDYARMKNASFWEEDVSCLDWLGKHKDGCVVYISFGSWVSPIEEGKVK

SLALALEASMRPFIWVLGCNWRQGLPSGYMERVWKRGKIVSWAPQMEVLQHKAVGCY

LTHCGWNSTMEAIQCRKRMLCYPVAGDQFLNCAYIVKVWRIGVKLSVFGQRDLEDGIER

VMEDDEMSNRLMRLNERVMGKEANSRMMDNLATFTDFVSQQNLNS.

>Pn3-8

MEMKSYGAHVLALSYPSQGHINPMLQFCKRLVSKGLKATLLITNFISTSMNPKSTSVRLD

TISDGYDEGGFMQAANVEAYLETLQSAGSKSLSELLNKYKKSGIPIDCVIYDAFCPWALD

VTKEAGLVGAAFFTQPCAVNYIYYYAYHGLLTLPVDSGSLPVSIPGLPLLELQDLPSFIYVH

GSYPAYFKLVLNQFLKVEKADYIFVNTFYKLEAEMVDTMSKLIPLITIGPTIPSFFLDNEDE

NDKEYGLNLFHLGSSNPIITWLNTKPKGSVVYVGFGSMANLSEDQMEELAWGLKESNY

NFLWVVRDSEEAAKLPQNFAQKMAGKGLLVTWSPQLEVLSNEAVGCFFSHSGWNSTIEA

LSLGVPMVVMPQWTDQTTNAKLVQDVWKVGIRVRVDEKGVVGREEIEYCIREVMEGNR

GKEMKDNAMKWKGLAKEAVSEGGTTDKNIDQFVSKLSTRSSSHI.

>Pn3-9

MAAGNDNLHVVMFPWLAFGHLIPFLELSKSIAQKGHKVSFISTPRNIDRLPAVPTNLAPLI

TFVKLKLPRVHKLRQDAEATTDIRTDDIPYLKTAYDGLEPGLTRFLETESPDWIIYDFSPHW

LPPVAARLGVSRAFFSIFNAWFIAFSGSSSEAMINGSDQRIRPEDFTVPPSWIPFPNNVAYRL

HEINWVMGSSTDNESGVSDFYRAGSVISGSEVVFVRYCNEFEPEWIKLLEELHKKPVIPL

50

Page 51: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

GLMPPSAQDRVGDQNEAEWLTIKEWLDSQDKGSVVYVALGSEATLSQNELTELGIGLELS

GLPFFWVLRKPPGSEESNPIELPDGFLERIRDRGIVWMSWVPQLKILGHESVGGFLSHLG

WSSIIEGLMFGRPLVMLPFLVDQGLNARVLVDNKVGIEVPRNETDGLFTRDSVAESLRLV

MVEEEGRIYRERTKELSGIFGDKELHNRYMDTSIKYLENHTK.

>Pn3-11

MGSFGKAENSRPHAVCVPFPTQGHINPTLKLAKLFHHRGFHITFVNTEFNHQVLLKTRGP

HALDRLPTFRFETIPDGLPPSNARATEDIPLLFDSTSKNCLAPFRELLLKLNNPLSSPDIPPV

TCIVSDFVMGFTLKAAEELGIPKVLFWTASACSFMCFLHYGQLIEKGFTPLKDSSYLTNG

YLDTVVDWIPGMKGIRLKDLPSFIRTTNPNDPMINFGLADTDKYYKSSPQIILNTYDTLEH

EVLDALSSFLPSNIYSIGPIHHQLNNIHKSSTNSIELNLWKDEQQCLAWLDSKEPNSVVYV

NFGSTTIMSPDQLVEFSWGLANSNQTFLWIIRPDLVNGDTTVLPPDSVEAIKERGLFASWC

SQEQVLRHPSIGGFLTHNGWNSTLESITCGVPMLCWPFFADQQTNCWYCCTRWGIGMEI

NSDAKRDEVESLVKELMVGVKGKKMKEKAMEWKKLAEGAITPPAGSSYINFEKMVND

VLLTNNISE.

>Pn3-15

MGAELIFIPSPGVGHLVSTVEIAKLLISRDERLSITVLVMKFPHDTGVTAYTRSLQKDAPNR

IVFVDLPQNESLISTPKSLFTSFIESQKSPVRDSVRQIVSRSDSNKLAGFVIDMFCTPMIDVA

NEFGVPTYVFFTSGAAFLGLQFYHLSLSDEHNQDLAEYKDTDVELSIPSFINPVPAKVLPS

VILNKEGSTMLQSISRRFKETKAILVNTFAELEPHAIKSLGDNCKIPPIYHVGPIINLKNKEG

TTQNHSSEDGIISWLDNQPPSSVVFLCFGSFGSFDEGQVREIAHGLEQSGQRFLWSLRRRP

EKMELPKDYENPEQVLPEGFIERTSGMGKVIGWAPQTAILSHPAVGGFVSHCGWNSTLESI

WCGVPMATWPIYAEQQINAFELVKELGMAVEIKMEYREDYFFAPENNVVVTADQIEKGM

RCLMMDGETEMRKKVEEMKEKSRMAMVEGGSSYISLRHFIEDVMGN.

>Pn3-19

MRNMKTAELVFIPAWSISHFVPMMEMAKLFVARYEQLSVTILFMKYPVEIGLGSYIESLLS

TTIPHFRFLEVPQTDPNTYMSKPPHTRFSAFIETQKAHVRDIVFDIARSETTQFKGCIIDLFC

ITLADIADELNVPSYVFSAPSASFLGLMFHLHSLVDEHNIDITEYKDSDAELFVPSYTNPVP

AKVLPSVLLDKEGGSALFVSIARRLRETKGIVVNTFMELESHALKSLAVDSKIPVFYPVGP

VLNLTGEGKNSEEKTIILRWLDEQPPSSVVFMCFGSFGSFQEDQVKEIALALERSKHHFLW

SLRPPRPKDKTKVPLEYSNLEKVIPPGFLERTAGIGKVIGWAPQVSVLSHQAVGGFVSHCG

51

Page 52: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

WNSILESLWFGVPMGTWPLYAEQQMNAFKMVNELGIAVDIKIDYRNDHNMKTEVIVRAE

EIESKIMEMMMDENRSERMKVKEMKEKSRLAVMEKGSSYAAIGRLIEEIIM.

>Pn3-21

MFPWLAFGHMIPFLELAKLIAHKGHKISFISTPKNIDRLPKQLPPNLSSLITFVKLPLPHVQ

NLPENAEATSDVPLDKVKYLKIAYDGLQQSITEFIQNSAPDLILFDFASYWLGPLASKLNIC

SAFFSIFTAAALGFIGSAESMISGDYRKKPEDFTVLPKWVTFESTVKFKLFEVLRNRDNEN

VSDLYRMGATIKGCDFVAIRSCYELEPEWLRLLEDLNEKRFIPVGQLPTTSFENGDDVKDV

AWREMKGWLDMQPKGTVVYVAFGSEAKPSQTELTEIALGLELSELPFFWVLRTQRGKV

DTEVIELPQGFEERTKGRGVVCTSWAPQLKILSHDSVGGFLTHSGWTSVVEAIQFEKALIL

LTFLADQGINARVLEKKKMAYSIPRDDGDGSFTRDSVAESLKLVIIEKEGQIYRDKVKEMK

KVFVDKNRQEEYVNNLLGYLQTTRKIKRELI.

>Pn3-24

MGGQLHVFFFPMLAQGHMIPMLDIAKLFASRGVNATIITTPLNAPVFSKAVEKTKQLGFE

LSLRIIKFPAVPGLPEGCENADQLTSPEMFPTFFNATILLKEQVEQILQEYRPNCLVADMFFP

WATDVAAKLDIPILVFHGTSFFALCATEQVRLHKPFNNLLSDSEPFVLPNLPHDVKLTRSQL

SPHVRAAVETEFSKMHIQIRESEQRSYGVIVNSFYKLEPDYADHYRDVLKRKAWHIGPVS

LCNRGIEDKAQRGKKAAIDEDECLEWLDSKKPNSVIYVCFGSVCNFPSSQLYELAMGLE

NSGQEFIWAVRKGKNEEENEKWMPEGFEERIKGKGLIIRGWAPQVLILDHEAIGGFVTHC

GWNSSLEGITAGVPLVTWPSFADQFFNEKLVTEILRIGVAVGAKEWAIGGGRGGVKRDAIE

TAVKRLMVGEEAEERRNRAKALKEMAIKAVEEGGSSYLDLDALIKELSFYHS.

>Pn3-25

MSQSPAMSKIQNQLHFVLVPLLAQGHMIPMIDMARLLAQHGVVVSLVTTPHNASRFASTI

HRARDSGLKIQLIQIPFPWQEVGLPPGCENLDSVPSRDLIGNFFSALNKLQQPLEQHLQEL

MPPPSCVISDKYLSWTTKTAEKFHVPRLVFHGMCCFSLLSSHNIRLYNAHLSVTSDSQPFV

VPGMPQRVEITKAQLPGAFVTLPDLDDIRDQMREAESSAYGVVVNSFCELEQGCFEEYK

KAIAKKVWCIGPVSLCNKDNLDKFERGNKASIDETHCTEWLDSMKPKSVIYACLGSQCR

LVPAQLMELGLALESSKHPFIWVIKEGERFQELEKWLVEEEFEERNKRRGLLIKGWAPQV

LILSHPAIKAFLTHCGWNSTIEGVCSGVPMITWPMFAEQFFNEKLIVDILRIGIKVGVQVSV

RWGEEEKIGVLVKREQIQKAIETIMNGGGEEGRIRKRVTKLSKVGARAMEDGGSSHFNIS

LLIQDIWKQKNNQEEL.

52

Page 53: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

>Pn3-26

MFPWLAFGHMIPFLELAKLIAHKGHKISFISTPKNIDRLPKQLPPNLSSLITFVKLPLPHVQ

NLPENAEATSDVPLDKVKYLKIAYDGLQQSITEFIQNSAPDLILFDFASYWLGPLASKLNIC

SAFFSIFTAAALGFIGSAESMISGDYRKKPEDFTVLPKWVTFESTVKFKLFEVLRNRDNEN

VSDLYRMGATIKGCDFVAIRSCYELEPEWLRLLEDLNEKRFIPVGQLPPTLFESGDDVKDV

AWREMKGWLDMQVKGRVVYVAFGSEAKLSQTELTEIALGLELSELPFFWVLRKQRGKE

DTEVIELPEGFEERTKGRGVVCTSWAPQLKILSHDSVGGFLSHAGWGSVVEAIQFEKALIL

LTFLADQGLNARFLEEKRMAYSIPRDDGDGSFRRDSVADSLKLVIIEKEGQIYRDKVKEM

KNVFVDKNRQEEYIDNLLGYLEANRKIKRELI.

>Pn3-27

MERKAMEYSGGEILVLPFYGQGHLFPSMELCIHFSSRNYSPTLIIPSYLSSSIPDSLRHHPFI

QILQIAVDKSESPPPAEQAAGGRFQHQHQQLGQGIERLLSARSGDTNQTRPVCVVFDVM

MSWSKEIFAKFGFPAISFFTSGACSAAMVYAGWKARVDEMKPGDIRVLPGLPQDMALAY

SDLDRRPPRHGGGGGNGSPLGPPNGDRLNGPPSRGMGFGPPGPGQKPRWLDEVEGSVAL

LINTCDDLEKSFINYVATQIGKPVWGVGPLLPEKYWKSAGSLLRDGEIRSSGRQSSYTEDE

VTQWLDSKSCGSVIYISFGSEVGPTVEEYEQLANALGELNQPFIWVLQPNSGKPGPPSTLF

GVQTGPDVVAEGGYYPHSLDKKVGNRGLIIRGWAPQLLILSHPSTGGFLSHCGWNSTVE

AIGRGVPILAWPIRGDQYYNAKLVVSHLKVGCMVSSGGDSAKGMKKEDIIQGIERLMGD

EEVHKRARILQGKLESGFPASSVAALDALVDFIK.

>n3-28

MGSLPKVTKPHAVCIPYPAQGHINPMLKLAKLLHQKGFHISFVNTEFNHNRLLRSRGPAV

LDGLPDFRFYAIPDGLPPSDPDATQDIPSLCKYTPTHSLLPFCSLISKLNDTSVSDVPPVTCI

VSDGVMSFTLKAAEKFGLPEVLFWTTSACGLLGYLQYRHLVERGYTPLKDMSYVTNGY

LDTKIEWVPGMKDIKLRDFPSFIRTTDPNDTMLNFLQTESAALPRARALILNTFDALEQDV

LDAISAIQPRIYTVGPLQLMVNQIQDHRLKSMSSNLWKEEAGCIDWLDTKEANSVVYVN

FGSITVMTAKQLTEFAWGLANSKKHFLWIVRPDIVVGDSAMLPAEFITDTENRGMLASWC

PQEEVLKHPAIGGFLTHNGWNSTVESISGGVPVICWPFFAEQQTNCRYSCVEWGIGMEID

NEVKRDEVEVLVRELMDGEKGKEMRRNALGWKKKAEEAAGHADGSSRVNLDKLVDQ

VLLAK.

53

Page 54: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

>Pn3-33

MGTNPAKKAHAVCVPFPAQGHVSPVMQLAKLLHSRGFYITFVNTEFNHRRLLRSKGSAW

VNGFDHFMFETIPEGLPPSNRDATQDPAALFDSIPKNCLAPFRDLLAKLNSSPEVPPVTCIV

ADGIMSFVIEAAKEIGVPEVQFWTASACGFLGYFHYPELMKRGIVPLKDENFMIDGTLDT

QVDWIPGMIRNLRLKDLPTFLRTTDPNNVLFTYLRDEVQNCFKASATIFNIFEPLEHEALQ

ALSSMFPHIYPIGPLSMLLKNLPNSPLYSFRSNLWIEDTKCLEWLDKWEPNSVVYVNYGS

LTIMSDQHLKEFAWGLANSKHPFLWIVRPDIVMGDSAILSEDFFEEIKGRGLLASWCPQDE

VLSRESVGVFLTHCGWNSTLESICEGVPTICWPFFAEQQTNCRYSCSDDQWGIGVEVNQD

VKRDEIDALVREMMEGDKGKKLKEKALAWKKKAEEATRIGGSSYNSFDRFIKEVLLAG.

>Pn3-35

MKSKNHFLLLCLPVQSHINPTLQLAKILTRSGANVTYATTTTGLGRLGALPTIDGLSYATFS

DGKEHNGTLILDDYLAVLRRVAPQSVTKLLQDLSTKGTPVTFIVYTVVLPWVAEVARDM

HLPSAFLFIQCATAFSIFHRFFNSQDGLHDGVHDISPDVSVQLPGLPLFRCKDIPDFVFRHN

QYYSSMVPVLQEHIQTLEKDPNPFVLVNTFDALEGDIIKSFPNMKLLAIGPLLPSAFSDGN

DLDDKSFGGTLFQNPNNYLTWLDSKPDQSVIYASFGSIMQLKETQKEEILQGLMTTNRPF

LWVIRDINEGEVKSMKLNNGIADELGFIVPWCSQVEVLCHRSIGCFVTHCGWNSTVESIT

GGVPVVGCPHFSEQQTNIKMVEEVWGNGIRVRENDDGVFGREEIRRCLDIVMGEEEKGK

EIRRNAMKWKCLAMEAVKAGGSSHNNVKQFLEM.

>Pn3-36

MASDLYEHQTNDRKKSQVVVVMVPFPAQGHLNQLLHLSRLISAYNIPVHYVGTTTHSCQ

AKSRIQGWDPLSITNIHFHEFPTPPFFTPSPNPNTSTKFPAHLQPSFEASSHLRDPVASLVHA

LAPTAHKIVVIHDAFMSSIIQDVVTISNAELYCFQSGSAFNFLALKWESIGRPSSLKLETLK

NLPSMDGCFTQEFKEFATCQIRLMDKFSSGTIHDVSKVIEGEYFDLLTKEEFSGNIKQWAL

GPFNPVSISDHKPHNCLNWLDKQDPNSVIYVSFGTTTSLTDEQIHELAIGLEESEQKFIWV

LREADKGDIFEGNYARRAELPKGYEQRIQGKGMILRDWVPQLEILEHASTGGFMSHCGW

NSCMESMSMGVAMAAWPMHSEQPRNAMLITDVLKIGTLVRDWERRDELVTSLSVEKAV

RRLMVSKEGDEMRMRAAEIGGTVRRSVAEGGVTRAEFDSFISRITRMQKLI.

>Pn3-38

MENTPHIAIFPSPGMGHLIPLVQFAKRFISQHKFTATLIVPTDGPLLKAQKTFLDALPAGIAY

VVLPPVNLDDLPSDVNIETQIAITVARSLPSLRDVFKSLLATKKLVALVVDLFGTDAFDVA

54

Page 55: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

NEFKVSPYIFFPSTAMSLSFFLHLPKLDEAESCEYRDLSGPIQLPGCIPIHGRDLLDPVQDR

KNDAYKWILHHTKRYRLAEGIMVNSFKDLEDGAIKALQEESDIPPVYPIGPLIQIDSSSAA

DGSECLRWLDNQPRGSVLFISFGSGGTHSSDQINELALGLEMSEQRFLWVVRSPNNETAN

ANFFNVESQKDPFDFLPKGFLERTKGNGLVVPSWAPQAQILRHHSTGGFLTHCGWNSILE

TVVNGIPLIAWPLYAEQKMNAVMLSEGLKVALRPQVDENGLVGRAKIAKVVKCLMEGEE

GKRLRNRMRDLKDAAAKVLSDDGSSTKSMAELAGKWNKN.

>Pn3-39

MENTPHIAIFPSPGMGHLIPLVQFAKRFISQHKFTATLIVPTDGPLLKAQKTFLDALPAGIAY

VVLPPVNLDDLPSDVNIETQIAITVARSLPSLRDVFKSLLATKKLVALVVDLFGTDAFDVA

NEFKVSPYIFFPSTAMSLSFFLHLPKLDEAESCEYRDLSGPIQLPGCIPIHGRDLLDPVQDR

KNDAYKWILHHTKRYRLAEGIMVNSFKDLEDGAIKALQEESDIPPVYPIGPLIQIDSSSAA

DGSECLRWLDNQPRGSVLFISFGSGGTHSSDQINELALGLEMSEQRFLWVVRSPNNETAN

ANFFNVESQKDPFDFLPKGFLERTKGNGLVVPSWAPQAQILRHHSTGGFLTHCGWNSILE

TVVNGIPLIAWPLYAEQKMNAVMLSEGLKVALRPQVDENGLVGRAKIAKVVKCLMEGEE

GKRLRNRMRDLKDAAAKVLSDDGSSTKSMAELAGKWNKN.

>Pn3-40

MEKKDSTRPHVLIFPLPLQGPVNSMFKLAELLCLAGIHITFLVTDYIHTRLLRYTDIHSRFN

RYPGFRLETISDGLREDHPRGDRIMELFDSLKTNTKPLFKELLTTGSLGSDKGRPITCIIAD

GIMGFTCDVANEIGIPIIYVRTISACCLWVFFCLPKLIEAGELPFNGDDLDTPIKSVPGTENS

LRIRDLPSFCRSGDICHPSIQLYKTERQELRRAYGLILNTFEDLEGSILSQMRTVCPNLYPIG

PLHTHLKTKLAAESTSPAISSNSLWKEDKSCITWLDSQPPKSVIYVSFGSLAIMTRDQLME

FWHGLVNSGCKFLWVIRPDSIAGDSEIPAEVAEGTKERGYIVGWAPQEEVLAHPSVGGFLT

HSGWNSTLESVIEGKPMICWPYFMDQQVNSRFVEEFWKLGLDMKDTCDRVIFEKMVKD

LMVERRDEFMKSADQMARLGKKCLSEGGSSYCNLVRLIEDIKLMGVER.

>Pn3-41

MEERKEKMIVIFPYMAQGHIIPFLSLALQIEKKGYQITFVNTPLNIKNLKQSLPLNSSIRLLE

IPFNSSDHRLPPETENTDSIPFSLTLTLLEASVSLKPAFRNLISDLVRGGAPPLAVIADIFFGW

TAEVAHEFGIFHTIFSSTGGFGMACYYSVWMNLPHNYTDSVEFTLPDFPEAGLIHRTQLSA

NVLAADGTDPSSKIIQLLLSSWVDSDGILFNTIEEIDKIGLYYFRRKLSLPVWPIGPILLSVD

SRARSNKVCGISSESCINWLDSKPQNSVLYISFGSQHTISASQMMQLAKALDSIDINFIWV

55

Page 56: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

VRPPLGFDMNLEFDAVEWLPEGFLKRIEEQNRGLIIVKWAPQVEILLHKAVAAFLSHCGW

NSVLESISAGVPLIGWPMGAEQFYNVKYLEEEVGVCMEVARGTNFEVRNEDIVKKIGIV

MGENGKGKEIREKACEVKKMIENGGRDEEGYKGSSVRAMEEFLNVAAFFGKEKKVRGQ

KCRMNT.

>Pn3-42

MEKKDSTRPHVLIFPLPLQGPVNSMFKLAELLCLAGIHITFLVTDYIHTRLLRYTDIHSRFN

RYPGFRLETISDGLREDHPRGDRIMELFDSLKTNTKPLFKELLTTGSLGSDKGRPITCIIAD

GIMGFTCDVANEIGIPIIYVRTISACCLWVFFCLPKLIEAGELPFNGDDLDTPIKSVPGTENS

LRIRDLPSFCRSGDICHPSIQLYKTERQELRRAYGLILNTFEDLEGSILSQMRTVCPNLYPIG

PLHTHLKTKLAAESTSPAISSNSLWKEDKSCITWLDSQPPKSVIYVSFGSLAIMTRDQLME

FWHGLVNSGCKFLWVIRPDSIAGDSEIPAEVAEGTKERGYIVGWAPQEEVLAHPSVGGFLT

HSGWNSTLESVIEGKPMICWPYFMDQQVNSRFVEEFWKLGLDMKDTCDRVIFEKMVKD

LMVERRDEFMKSADQMARLGKKCLSEGGSSYCNLVRLIEDIKLMGVER.

>Pn3-43

MGSSAETHVGVLAFPFATHAGLLLGLVCRLAAAAPNVNFSFFSTAVSNRSLFSDPNSPYSN

VIPYDVSDGVPEGYVFSGKPQEDINLFLTVASDEFRRGLEKASVDSGRKITCLVADAFLWF

SGDLAEQIGVPWVPLWTSGACSLSIHVYTDLIRQTVGLGAGIEGRMDEILTFIPGFSELRLG

DLPGGVLFGNLESPFSIMLHKMGQTLPRAAAVPINSFEELDPDLMKDIKSKFKKILNVGPF

NLTSPPQSSKSDEHGCIPWLDNKNPKSVAYIAFGTVATPPPNELVSLAEALEESGTPFLWSL

KENLKNHLPKGFLERNSKSGKIVAWAPQIQVLSHDAVGVVITHGGWNSVVESLAAGVPVI

CRPFFGDHHINTWMVENVWKIGVRIEGGVFTRNGTMNALEQVLSREKGKKLKEQITVFK

ELALKAVGPNGTSTLNFKRLLEVITT.

>Pn3-44

MEEFPIPVVMNSNSAQKPHIVCVPFPAQGHIIPMMQLAKLLHSRGFHITFVNSEHNQRRLI

RSKGPEWVAGFTDFRFETIPEGLPRIDPSKSHFPVAVFESVPKTCLAPFRDLLIKLKSTAEVP

AITSIIADGVMGFTIKAGIEFGIPVVQFWTASACGFMGYFQYPELIKRGIFPFKDQSFRSDG

SLDAKVDWIPGMRNIRLKDIPSFVQTTDPNDISFNFLREEVQNCLKASALIFNTFDAFEHE

VLQAISSMFPHIYTVGPIPLLSKGIPESQLQSFRPSLWIEDTKCLEWLDKREPNSVVYVNYG

SITKMTDHHLKEFAWGLANSKHPFLWIVRDNIIMGDSAILSEDFLEATKDRSLLTSWCQQ

DQVLSHSSIGVYLTHCGWNSTLETICAGVPIICWPFFAEQQTNCRYSCTDQWGIGVEVNH

56

Page 57: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

DVKRDEIEALVKEMMEGDKGKQLKKKALEWKKMAEEATSVGGSSYSNFDRLIEEALQP

RND.

>Pn3-47

MMGHATQALHLANLMINRFDHLSITVLIMKLPVDPISTNFMESLGSPAATSSSNVNSEVD

RIRFHHLPPSKITSDCCYRTPGVLLDLVIEDHKDHVRNYLVTRKSTPAAALVVDMFCTTM

MDVGNQFGIPTYVFFTSGAAFLQLLFHLQILHDVDNGDVSELINSDTELVISGFVNPIPPGV

LPFVLIDKYQWSTRFLKYARACRKANGIIVNTFVELESHALNSLHGDGATKSPPIYPVGPII

NHAQMNLADDRDHDIMRWLDDQPRSSVVFLCFGSMGGFDMEQVREIANGIELSGYRFL

WSLRHPAEKGKSLFPTDYSCIEQIFLPDKFFERTAHVGKVIGWGPQLKILAHEAVGGFVSH

CGWNSILESLWHGVPIATWPIYSEQQLNAFEMVRELGLSVEIKLDYHNYMDNDMNKVLV

RAEEIERGIRSVMDGGNEVREKVKRMRDKSRMSMEEGGSSYKSLELLIEDLKM.

>Pn3-54

MSSEGLVHVLLVSFPGQGHVNPLLRLGNLLAAKGLLVTFSTPQNVGKVLRKANKMIDKL

TPIGEYGGMIRFEFFEDGLDEDDLRRNDLNFYMPQLQEFGSKAVVSIIKKHEQQGRPISCII

NNLFIPWVSDLAETLNIPSAVLWVQSCACFSAYYHFHHKLLPFPTDTDLKINVQLPCMPLL

KYDEIPSFLHPSDPFLVLGKLILGQFKNLSKPFCVLMETFQELEDELLTYMSNYCNIKPIGP

LFINPITDPTSTVRADFIKADDCVDWLDSKEANSSVVYISFGSVIYLKQEQLDEIANGLLNS

GVCFLWVLRPAHDSSVEPAVLPDGFLEKAGDKGKVVQWCSQEKVLSHRAVACFVTHCG

WNSSLEALSCGVPVVAFPAWGDQVTNAKYLVDVLNVGVRLSRGDAEKRIIGREEVEKRI

REVTSGPKAVEMKENALKWKKAAKEAVAEGGSSDRNLQDFVDEIKRKRTMHITQ.

>Pn3-55

MASEGKKKETHVLMVAFAAHGHMNPMLRLGKRLVSKGINVTFAVNEYGRDLIVNSPAFA

GIHLEFYSDGLSLDADRKTSMDLFMESICKYGPVSISALIQTIGVEKFSCIINNPFVPWVAD

VAAELEIPCAMLWIQPCMLYAIYYRFYNNLNLFPTSTNPNMSVELPGFPLLASEDLPSFILP

SNTSGPFPKLLAELFQNMNKIKWVLGNSFYELEKDIIDSMKNVHPIRPVGPLVPSTLLLGE

NDQNLHKDKCIEWLDHQKNSSVIYISFGTIIEFSTKEMEVVAKCLKKTKLKFLWAVKKGE

ELLPLGFPEEIKDQGLIVKWSPQVEVLGHPSVACFLTHCGWNSLIETITAGVPVIAYPKWT

DQPTNAKLIDILHVGVRLRPGQDGELSGEELEKCIEEIMVGPSSEEYKKNAAELKQEARE

TMAHGGSSDRNIQWFVDEIKGYS.

>Pn3-60

57

Page 58: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

MDYVPAANHHKPHAVFVAYPAQGHINPLLKLAILIHHKGFHITFVNTEDTHNRLLQSSGPE

SLDGLPDFRFETIPDGLPPTEADATQDTKQVRLSTKQNCLAPFRKLVQKLNENCSENVPPV

CCIIFDALMSFAITAAEEIGVPSVCVRTSAACCFMLYKHYHLLTQKDEISSTSGHLDAEIDYI

PGIKSMRLKDLPTFIQTVSQNDEMLNFILGEVKRASEASAIIFNTYNALEFDVLRALSSMS

PPIYSIGPLPMLVNQENNKFKKSIGSNLWKEETTNCIQWLNSKKPNSVIYVNYGSLTVMTR

EILVEFAWGLANSKHNFLWIIRPDLVKGETEILPLEFVEDTKERGMIAGWCSQEEVLGHSS

VGGFLTHCGWNSMIESLSSGVPMICWPFFADQPTNCKLACKDWGVGMEIDKNVSRDEV

QMRVSELMEGGKGVEMRKKAMEWKKKAEEATGVDGSSSWNLDKLVEEVLLRKS.

>Pn3-67

MDQHTLPPHVLIFPLPAQGHVNSMLKLAELLCSAGIHVTFLVSAQNHTRLLRYTDVQSRF

SPYSGFHFETLPEGIYDGQVNYNGLIMNLFDSLKEIAEPFLKDFLSARKSSSPVTCIISDGLF

SYVLDLAEELAIPVIYFRTISACAFWAYFCIPQLIEAGELPLKGDDMDVPISSVPGMESYLR

RRDLPSFCRADLTDGSFQMVLNQTQQTPRAQALILNTFEDLEGAILSQVGAKCPNIYTVG

PLHAHLKARLPTKRTSSNSLREEDESCLTWLDDQPLNSVIYVSFGSITTMTRDQHVEFWH

GLVNSGKRFLWVVRPDSLASKEVKREIWGELEEATKARGYIVGWAAQEEVLAHRAVGGF

LTHCGWNSTLESIVAGVPMLCWPFLADQQINSRFVGEVWKIGMDMKDSCDRVVIEKMV

RDLMEVRRDEFVGRASEMANLARDSLRVGGTSYCNLDRLVKDIKLMVVPPPSMNK.

>Pn3-68

MERKAMEYSGSEILVLPFYGQGHLFPSMELCIHFSSRNYSPTLIIPSYLSSSIPDSLRHHPFIQ

ILQIAVDKSESPPPAEQAAGGRFQHQHQQLGQGIERLLSARSGDTNQTRPVCVVFDVMMS

WSKEIFAKFGFPAISFFTSGACSAAMVYAGWKARVDEMKPGDIRVLPGLPQDMALAYSD

LDRRPPRHGGGGGNGSPLGPPNGDRLNGPPSRGMGFGPPGPGQKPRWLDEVEGSVALLI

NTCDDLEKSFINYVATQIGKPVWGVGPLLPEKYWKSAGSLLRDGEIRSSGRQSSYTEDEV

TQWLDSKSCGSVIYISFGSEVGPTVEEYEQLANALGELNQPFIWVLQPNSGKPGPPSTLFG

VQTGPDVVAEGGYYPHSLDKKVGNRGLIIRGWAPQLLILSHPSTGGFLSHCGWNSTVEAI

GRGVPILAWPIRGDQYYNAKLVVSHLKVGCMVSSGGDSAKGMKKEDIIQGIERLMGDEE

VHKRARILQGKLESGFPASSVAALDALVDFIK.

>Pn3-74

MEERKEKMIVIFPYMAQGHIIPFLSLALQIEKKGYQITFVNTPLNIKNLKQSLPLNSSIRLLE

IPFNSSDHRLPPETENTDSIPFSLTLTLLEASVSLKPAFRNLISDLVRGGAPPLAVIADIFFGW

58

Page 59: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

TAEVAHEFGIFHTIFSSTGGFGMACYYSVWMNLPHNYTDSVEFTLPDFPEAGLIHRTQLSA

NVLAADGTDPSSKIIQLLLSSWVDSDGILFNTIEEIDKIGLYYFRRKLSLPVWPIGPILLSVD

SRARSNKVCGISSESCINWLDSKPQNSVLYISFGSQHTISASQMMQLAKALDSIEINFIWVV

RPPLEFDINLGFDTEEWLPEGFIKRVEDQNKGLIIVKWAPQVEILSHRAVAAFLSHCGWNS

VLESLSRGVPLIGWPMGAEQFYNVKYLEEEVGVCVEVARGTKFEVRSEDIVEKIGIVMRE

NGKGKEMREKADQLKKMIENGGRDEEGYKGSSVKAMEEFLSVAAFFGKDKVRGEDE.

>Pn3-75

MDTNEKTRIKVVMLPWLAYGHISPYLELAKKLSKRDFYVYFCSTSINLSSIRKKLAVDDH

DAIQLIEFQLTSQTELPPHHHTTKGLPPHLIPDLIKALGMSGPNVINILNTVNPDLIIYDVFQ

LWVPAFAASLQIPAVHFQVVGALSTAAVYRFKVDSSIPVPCSRIFLDDTNIRKSPDYDSSSG

ENSGIVDLTFGTAIQSSDIILIKSSREFDEKNIEYYSLLMDKKIVPTGPLVQVNTCVAVHTEN

EKDDIMDWLSKKEESSTVYVSFGSECYLSEPRIRELAHGLELSNVNFIWVISFPEGDEEMC

NTSIEDVLPEGFLDRVKDRGVIVSWAPQERILGHGGLGGFVSHCGWGSVVEAMSYGVPII

AMPAQYEQPLHAMFVEEVGVGVEVLKDESGEFRRDEIAKAIKKVVVEKNGEGVRKKAR

EMGKAIKKRGEEEVECVVEELTKLCKKYQKVAAGQGKEWS.

>Pn3-76

MENEKTYKAHIMVLAYHGQGHINPMVQFSKRLASKGMKITVTTTLSNIKAMKKASSSVI

FESVYDDAAEGGVGAPGGFQGFLDRFEASGSRNLAQLIKKQENSGYPIKCLVYDANIHW

ASNIAKQFAIPGAAFFTQSCAAIASYYPMHCDLSDKSLPFPVFSMPGLPPPKLPYLPSLGAV

TGQYSPIIRFICKQFDNIENAEWVLFNSFDKLEEEVVKWMSNLWTVRNIGPTVPSVYLDN

RVENDNDYGFNLFKPSTEVCMQWLNTKETGSVVYVSFGSAASLSAEQMAEMAEALKQS

RHSFLWVVKPTEIKLPTNFVEETSEKGLVVAWCPQLEVLAHHAVGCFISHCGWNSTVEAI

SFGVPVVAMPQFLDQMTNAYFVEKVWGIGIQPKENEENVTSAEEIGRCINGVMNGKEIK

KKAKQWKELAKEAIDENGSSDKSIDEIISRILAL.

>Pn3-79

MENRQILPPHVLIFPLPLQGPVNCMLKLAELFSLAELRVTFLNTHHIHHRLLKHAHVESRF

GRYPEFRFETISDGLPADHQRSGDRFIEVINGIEAVTKPLLKEMLASGCLSSKSGRPVTCIIA

DGFLSFVLDIAKEVGVPLLNFDTISPCALWTYLCIPKLIEVDELPFKGNDLDAKITSVPGME

NFLRRRDLPSFCRNNDLNDPIMQHVLKEDQELPRAQGLILNTFEDLEKPILTHIRTICPNLY

PIGPIHTLHKSILSANKSPPQVNSSNSLWEDDRSCMTWLDAQPPNSVIYVSVGSLAVMTRD

59

Page 60: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

QLMEIWHGLVNSGKRFLWVRRLGSIAREDDQKYETPTELSDATKERGCIVSWVQQEEVL

GHPSIGAFLTHSGWNSTLESIVEGVPMICWPYFVDQQVNSRFVGEVWRIGVDMKDSCDR

GIIERMIRDVMEVRKDEFQESANVMGKLARQSVCEGGSSYYGLDRLIKNIKKINLSTSQA.

>Pn3-87

MASQIEKLHFILIPFMSQSHILPLMDFAKVLAQRHLTVTIITTPLNAISVKSIIDHAQKTHLKI

GLDTVPFPSQEAGLPQGCESMETLPSHDMAKEFFVGCKMLQEPIQKLLAELQPRPSCIIST

NSLPWTGKVASNLGIPRYAFQTIACFSLLMSHNVGRMTLVQTAISDTEPFVLQGMPDKIVL

TKSQIPQIVTRTGEEDRKGIIDQMIEAERLTRGMVVNSFEEMEPKYVEAFKNMGKKIWCI

GPVSLCNKEMSDKLERGGNKDSIDQSLCLKWLDSMESNSVIYACFGSMGIIPSSQIIETGL

GLEASNCPFIWIIRKRDLSAKVEKWLEDENFEERVKGRGLIIRGWAPQVLILSHPSVGGFIT

HCGWNSTLEAVCAGVPMITWPMFSEQFYNEKLVVNVLKIGVRVGVEVAMKTEEEDKVY

VRREQVKEAIEQLMDEEEKGERIKRAKELSEMATKATEEGGSSFLNITMLIQDIIEQITGQ.

>Pn3-88

MAILQTQDLHFILIPFMSPSHIIPLMDLAKLLTQRPQLTVTVITTPLNAITIQSIIDRAQKSTH

VKIGLSTVPFPAQEAGLPQGLESMEALPSPDTWKPFFIACTMLQQPIQDLLEELQPRPSCIIS

SNVLPWTVKVASNLKIPRYDFQTVCCFSLLLGQTVMRMNVDETVGSDSEPFVVPNMPDE

IVITKSQIPQETERTGGGDKKGIVDQMREAELLTRGMVVNSFYEMESKYVDAHMNMGR

KVWCIGPVSLCNKDMWDKLERGKKASIDDEDVCMKWLDSMEPSSVIYACFGSLGLGRIS

EAQIVEIGLGLEASNRPFIWTIKKKDLSSRVEKWLEEEKFEERVKGRGLIMRGWAPQVLIL

SHPSVGGFITHCGWNSTLEAVSSGVPMITWPMFAEQFYNEKLIVNVLKIGVRTGVEAAM

KMGDVGYNEVYVSTDQLKKAIKEVMDSEEKGKRIRRRARELRDMAKKATEEGGSSYLN

TTMLIQDVVEQLKLK.

>Pn3-89

MLPILDLTHQLAVHGLTITILVTPKNLPILNPLLSAHPSSSIQTLLFPFPHRPSLLETENIKDIG

NSGNVLIINALSALSGPIIHWFNTHPSPPVALLSDFFLGWTHHLAHQIGIPRICFYSSGAFLT

SILACLWDDDIDTLVSLDVVDFPDLPHAPSFPKEHLPTVFLTYRKSNPMWELVKDGMIAN

SASWGCVFNTFDAIEGEYLEYSRKKMGHERVFAVGPLSLLGGPDHTTRGSTSRSSGSHNT

NILAWLDDVYPDGSVLYVCFGSQKLLKRVQMEALSSALEQCGIKFIWVVKSPTAQQVAD

GYGFVSDEFEKRVSGRGLIIKEWAPQVSILNHRAVGGFLSHCGWNSVLEAIVSGVTILGW

PMEADQFVNAKLLVDYMGVAIRVCEGPDIVPNSFELAKKVAESMNAEITEKMRAKELKV

60

Page 61: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

KALEGVKHCGSSSRDLDGLVKELAQLQLKND.

>Pn3-91

MKKSAEVVFISAPAIGNLVPAVEFATHLTKTDPRLSATILVITTSHRPLVTTYIESRAAVNAG

KVRFIHLPPVEPPSPDQYQSSIAFMCLLIEKHKPHVKDALANLMQTQSESENSVVPVAGIF

VDMFCMPIMDVAAELGVPCYLFFASPATFLGFMLHLPSLHGQLASEFDDSATVLTIPAFTN

PVPIPVLPTFFLRRNDEDGCSCFLRNACRYKETKGIVVNTFKELEPCALDSLSADFADLPQ

VYPVGPVVDHCGPAGWNSDQSHENITKWLDKQPPSSVVFLCFGSMGSLSPAQVKNIAVG

LERSGYRFVWQLREPARAKSCLPSNYTNLEGVLPDGFLDRTAEFGLVCGWVPQVTILAH

KAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFEMVRELELSVEIRLDYRDGVS

DLVCAEEVEKGVRILMGNEGGEVREKVKKMREMSKNALIENGSSFVGLGLLIQQLLLDI.

>Pn3-97

MEQNQKMASPQEHIIMLPFFAHGHLTPFLALAKQIQERTGFTITLVSTPLNILYLKSTNTQN

PQIHLVPLPFNSSHHNLPPNTETTESLPLSQVITLFHASSSLESPFRRFISDVTIRDGKSPICIIS

DVFMGWANEVAKSLDIVNVSFSTCGAYGSAAYVSVWQKLPHRFLENDNDEFCLPWFPER

CRISRSHLHQFVRVADGNDEWSKFFQQQTTFSLGSFGWLCNTVQEIEPLGLEVLKNCTKL

PIWCIGPLLPQRMLESSSNPGTFGKRAGKEPGLSPEECLKWLDLFPKSSVLYISFGSQNTIR

PTQMMELAKGLEESGQPFIWAIRPPIGFNLKENFRDEWLPPGFEEQMIHSKLGLLVHKWA

PQLEILSHKSTGAFLSHCGWNSTLESLSQGVPIIGWPLAAEQVYNSKMMEEEMGVGIELT

RGLESSIVKEDVKRVIEIVMGKNGKGEEMRKKACEIGELIRVAAREENDVKGSSLQAMDD

FVASILSFSRR.

>Pn3-98

MGGQLHVFFFPMLAQGHMIPMLDIAKLFASRGVNATIITTPLNAPVFSKAVEKTKQLGFE

LSLRIIKFPAVPGLPEGCENADQLTSPEMFPTFFNATILLKEQVEQILQEYRPNCLVADMFFP

WATDVAAKLDIPILVFHGTSFFALCATEQVRLHKPFNNLLSDSEPFVLPNLPHDVKLTRSQL

SPHVRAAVETEFSKMHIQIRESEQRSYGVIVNSFYKLEPDYADHYRDVLKRKAWHIGPVS

LCNRGIEDKAQRGKKAAIDEDECLEWLDSKKPNSVIYVCFGSICNFPSSQLYELAMGLVD

SGQEFIWVVSKGKNEEENEKWMPERFEERMKGNGLIIRGWAPQVLILDHEAIGGFVTRC

GWNSTLEEISAGVPLVTWPLYADQFYNEKLVIEILRIGVAVGAKVWVAGDGSGSVKRDAV

EMAVKRLMAGEESEERRNRAKALKEMAIKAVEEGGSSYLDLDALIKELSFYHS.

>Pn3-103

61

Page 62: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

MDSPLSSSRPLEIYVLPFLSPGHMIPLSEIARLFAARGEHVTIITTPSNVALTPDKEQNASIRI

HTIPFPAKEVGLPDGLENFFSVKDIDTAAKVFTGMQLLRNSIEEYVTEHRPDCLVSDMFFP

WTADLAIRLDIPRLVFNATCMFSQVLKDAVRRPDSPHLTVKSDYDPFVIAGLPHPITMTRA

ELPDYVRTPNGYTKMMEEWKEAELKSYGVLVNNYYEFDSAYTDFYQKMVAPTQKIINV

GPAALIHRSGNEKVKRGHKTVVGEHECLSWLKSKDPNSVLYVCFGSGCIFPDAQLMEIAC

GLMAAGHHFMWVVLGKDDEKKEDEVNWLPLDFDEKMIKTNKGMIVKGWAPQVLILD

HPSVGGFLSHCGWNSVIEAVSAGIPMATWPLYAEHFYNEKLLTQVLGIGVEVGAEEWNL

WVDAGKKVIKREKIERAVNKIMDGGDGSKDMRRKTKEMGERAKKAVGEGGSSQRNVT

VLIEDLRKLREKREKA.

>Pn3-106

MLLLLLLLLITIAHALITPLTLNPFTMSMSNKHGPGQVIVVMVPFPAQGHLNQLLHLSRLIS

AYDIPVHYVGTTIHNRQAKLRVHGWDPLSVSNIHFHDFQTPSFPSPPPNPNSPLGHLLPSFE

LASSHLRKPVASLLQQLSSAAKTRLVIIHDSLMSSVVQDLVLVPNAEAYTFHSVSAFTVFL

HLWENLGRPFAVQSDILKDLPSLDGCFTSDFANFISSQHDCMKFNSGKIYNTCKLIETPYL

DLLAKEQISKNKKQWPLGPFNPVSVSQNQTPNLQHQCLKWLDKQATSSVIFVSFGTTTSL

RDQQIQELALGLEKGAQNFIWVLRDADKGDVFGGEVRRAKLPKGYEERMKGKGMIVR

DWAPQLEILAHPSTAGFLSHCGWNSCMESITMGVPILAWPMHSDQPRNTVLITKLLRIGL

VVKNWSLRDELVVAETVGDVIKKLMTSQEGGEIRRRAEKMGGDVRMSVAGGGVSRLEL

DSFIVHISR.

>Pn3-107

MSTSISISSKRNMQINNDTIVLYPSPGIGHLVSMVELAKLILTHHPDSFSRIIILITTAPNFNTG

STGPYISRVSAITPSITFHHLPEPATSLPPNPGASVEAISYEFLRLNNPNVHQALQSISSTKSL

KIKAFIIDFFCTPAFEVSSSLNIPTYYFFTSSASALSFLLYLPIIHETTTTSLKDLNAFLDIPGSP

PIFSSELAKPLQDRNKVEYKYFIQTALEMAKSFGIIVNTFQSLEPRAITAISDGLCIPNAPTPP

IYCIGPLIATNNSGSCDAHECLIWLNSQPSRTVVFLCFGSLGLFKEEQLKEMARGLEKSGH

RFLWVVRSPAAADNQDQPQEPDLNALLPQGFLDRTKGRGLVVKSWAPQIAVLNHESVGG

FVTHCGWNSILEGVSAGVPMIGWPLYAEQGMNKVMLVEELKLGLGLGESGEGRFVSGA

ELEKQVRELMESKSGEMVRRRVMEMSDAAKTALGKDDGSSRIALANLIAKWK.

62

Page 63: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Supplementary Sequences 2

>Pn1-31ATGGACAGAGAAATGTTGAGCAAAACTCACATTATGTTCATCCCATTCCCAGCTCAAGGCCACATGAGCCCAATGATGCAATTCGTCAAGCGTTTAGCCTGGAAAGGCGTGCGAATCACGATAGTTCTTCCGGCTGAGATTCGAGATTCTATGCAAATAAACAACTCATTGATCAACACTGAGTGCATCTCCTTTGATTTTGATAAAGATGATGAGATGCCATACAGCATGCGGGCTTATATGGGAGTTGTAAAGCTCAAGGTCACAAATAAACTGAGTGACCTACTCGAGAAGCAAAAAACAAATGGCTACCCTGTTAATTTGCTAGTGGTCGATTCATTATATCCATCTCGGGTAGAAATGTGCCACCAACTTGGGGTAAAAGGAGCTCCATTTTTCACTCACTCTTGTGCTGTTGGTGCCATTTATTATAATGCTCGCTTAGGGAAATTGAAGATACCTCCTGAGGAAGGGTTGACTTCTGTTTCATTGCCTTCAATTCCATTGTTGGGGAGAAATGATTTGCCAATTATTCGGACTGGCACCTTTCCTGATCTCTTTGAGCATTTGGGGAATCAGTTTTCAGATCTTGATAAAGCGGATTGGATCTTTTTCAATACTTTTGATAAGCTTGAAAATGAGGAAGCAAAATGGCTATCTAGCCAATGGCCAATTACATCCATCGGACCATTAATCCCTTCAATGTACTTAGACAAACAATTACCAAATGACAAAGACAATGACATTAATTTCTACAAGGCAGACGTCGGATCGTGCATCAAGTGGCTAGACGCCAAAGACCCTGGCTCGGTAGTCTACGCCTCATTCGGGAGCGTGAAGCACAACCTCGGCGATGACTACATGGACGAAGTAGCATGGGGCTTGTTACACAGCAAATATCACTTCATATGGGTTGTTATAGAATCCGAACGTACAAAGCTCTCTAGCGATTTCTTGGCAGAGGCAGAGGAAAAAGGCCTAATAGTGAGTTGGTGCCCTCAACTCGAAGTTTTGTCACATAAATCTATAGGTAGTTTTATGACTCATTGTGGTTGGAACTCGACGGTTGAGGCATTGAGTTTGGGCGTGCCAATGGTGGCAGTGCCACAACAGTTTGATCAGCCTGTTAATGCCAAGTATATCGTGGATGTATGGCGAATTGGGGTTCAGGTTCCGATTGGTGAAAATGGGGTTCTTTTGAGGGGAGAAGTTGCTAACTGTATAAAGGATGTTATGGAGGGGGAAATAGGGGATGAGCTTAGAGGGAATGCTTTGAAATGGAAGGGGTTGGCTGTGGAGGCAATGGAGAAAGGGGGTAGCTCTGATAAGAATATTGATGAGTTCATTTCAAAGCTTGTGTCCTCCTGA>Pn3-29ATGAAGTCAGAATTGATATTCTTGCCCGCCCCGGCCATCGGACACCTCGTGGGAATGGTGGAGATGGCTAAACTCTTCATCAGTCGACATGAAAACCTCTCGGTCACCGTCCTCATCGCGAAATTCTACATGGATACGGGGGTAGACAACTACAATAAATCACTCTTAACAAAGCCTACCCCGCGTCTCACAATTGTAAATCTCCCGGAAAGCGACCCCCAAAACTATATGCTCAAACCACGCCACGCCATCTTTCCTAGCGTCATCGAGACTCAGAAGACACACGTGCGAGACATAATATTAGGCATGACTCAGTCCGAGTCGACTCGGGTCGTTGGTTTGCTGGCTGACCTTTTGTTCATCAACATTATGGACATTGCCAATGAGTTCAATGTTCCAACTTATGTATACTCCCCTGCCGGAGCAGGTCATCTTGGCCTCGCGTTCCATCTCCAGACACTCAACGACAAAAAGCAAGATGTGACCGAGTTCAGGCACTCGGACACTGAGTTATTGGTACCGAGTTTTGCAAACCCGGTTCCCGCCGAGGTCTTGCCGTCGATGTATGTGGATAAAGAAGGTGGGTATGATTATTTGTTTTCATTGTTCCGGAGGTGCAGAGAGTCAAAGGCAATTATTATTAACACGTTTGAGGAGCTGGAACCCTATGCGATCAATTCCCTCCGGATGGATAGTATGATCCCTCCGATCTACCCGGTGGGACCCATACTAAATCTCAACGGTGATGGCCAAAACTCCGATGAGGCTGCTGTGATCCTTGGTTGGTTAGACGATCAACCACCTTCATCTGTGGTGTTTTtGTG

63

Page 64: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

CTTTGGTAGCTATGGAACCTTTCAaGAAAaCCAgGTGAAGGAGATTGCAATGGGTCTAGAGCGCAGTGGGCATCGCTTCTTGTGGTCCTTGCGTCCGTCTATCCCTAAAGGCGAGACAAAGCTTCAGCTTAAATACTCAAATTTGGAAGAAATTCTCCCAGTCGGATTCTTGGACAGGACATCATGCGTCGGAAAAGTTATTGGATGGGCCCCGCAAGTGGCGGTGCTCGGACACGAGGCAGTCGGAGGGTTCCTGTCTCATTGTGGTTGGAATTCGACATTAGAGAGTGTGTGGTGTGGCGTGCCCGTCGCAACATGGCCAATGTACGGCGAGCAACAACTCAATGCTTTTGAGATGGTTAAGGAGTTGGGTATTGCGGTGGAAATTGAGGTGGACTATAAGAATGAATATTTTAACATGACGAATGATTTTATTGTTAGGGCAGAAGAAATTGAGACGAAAATAAAGAAGTTGATGATGGATGAAAAGAATAGTGAAATAAGGAAGAAGGTAAAGGAAATGAAAGAAAAGAGTAGGCTTGCAATGTCTGAGAATGGATCATCTTATAATTCCTTGGCGAAGCTATTTGAGGAAATTATGTAA>Pn3-31ATGGATAACCAAAAAGGTAGAATCAGTATAGCGTTGCTACCATTTTTAGCCCATGGTCACATATCTCCCTTCTTTGAGCTAGCCAAACAACTAGCAAAAAGAAATTGCAATGTTTTCCTCTGTTCTACCCCAATCAATCTTAGCTCCATCAAGAATAAGGATTCCTCTGCTTCTGTAAAACTAGTTGAGCTTCATCTTCCATCTTCCCCTGATCTTCCTCCTCACTATCACACCACAAATGGCCTCCCTTCCCATCTCATGGTCCCACTCAGAAACGCCTTTGAAACAGTAGGCCCCACCTTCTCTGAAATCCTTAAAACCTTAAACCCTGATTTGCTTATTTATGATTTCAATCCCTCATGGGCACCGGAGATCGCTTCGTCTCACAATATTCCGGCAGTTTATTTCCTAACCACGGCAGCAGCCAGCTCTTCCATTGGCCTACATGCTTTCAAAAACCCAGGTGAAAAATACCCATTTCCAGATTTTTATGATAACAGTAATATTACCCCTGAACCACCTTCTGCAGATAACATGAAGCTACTTCATGATTTTATCGCTTGTTTCGAACGATCTTGCGATATTATTTTGATTAAGAGTTTTAGAGAACTAGAAGGGAAATATATTGATTTGCTTTCCACTTTATCTGATAAAACTTTGGTTCCTGTTGGTCCACTCGTTCAAGATCCTATGGGCCATAATGAAGATCCAAAAaCAGAGCAGATTATAAACTGGCTTGACaAAAGGGCTGAATCTACAGTGGTGTTTGTCTGCTTTGGAAGTGAGTATTTTCTCTCCAATGAGGAATTGGAAGAAGTAGCAATTGGGCTAGAGATTAGCATGGTTAATTTCATATGGGCTGTGAGATTAATTGAAGGAGAGAAAAAAGGGGTTTTACCAGAGGGGTTTGTTCAAAGGGTAGGAGACAGAGGATTGGTTGTGGAGGGGTGGGCTCCACAGGCAAGAATTTTAGGACATTCAAGCACCGGTGGGTTTGTGAGCCATTGTGGGTGGAGTTCTATTGCGGAGAGTATGAAGTTTGGGGTTCCAGTAATTGCCATGGCTAGGCATCTTGATCAGCCTTTGAATGCTAAGCTGGCGGCGGAGGTTGGTGTGGGCATGGAGGTTGTGAGAGATGATAATGGGAAATATAAGAGGGAAGGGATTGCAGAGGTAATAAGAAAAGTCGTTGTGGAGAAAAGTGGGGAGGTTATCAGGAGGAAAGCAAGGGAGTTGAGTGAGAAAATGAAAGAGAAAGGAGAGCAAGAGATTGATAGGGCAGTGGAGGAGCTAGTACAAATTTGTAAGAAGAAGAAAGATGCACAATAG>Pn3-32ATGGATATCGAGAAAGGTAGAATCAGTATAGTTATGCTACCATTTTTAGCCCATGGTCACATATCTCCATTCTTTGAGCTAGCCAAGCATCTCTCAAAAAGAAATTGCAATATATTCCTCTGTTCTACCCCAATCAATCTTAGCTCCATCAAGAACAGAGTATCTGATAAGGATTCCTCTGCTTCAATAAAACTAGTAGAGCTTCATCTTCCATCTTCCCCTGATCCTCCTCCTCAGTACCACACCACAAATGGCCTCCCTTCCCATCTCATGGTCCCACTCAAAAACGCCTTTGAAACAGTAGGCCCCACCTTCTCTGAAATCCTTAAAACCTTAGACCCTGATTTGCTTATTTATGATTTCAATCCCTCATGGGCACCGGAGATCGCTTTGTCTCACAATATTCCGGCAGTTTATTTCCTAACCTCGGCAGCAGCCACCTCTTCCGTGGCCCTACGTGCTTTGAAAAACCCAGG

64

Page 65: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

TGAAAAATACCCATTTCCAGATTTTTATGATAACAGTAATATTACCCCTGAACCACCTTCTGCAGATAAAATGAAGCTATTTCATGATTTTGTTGCTTGTTTCAAACGATCTTGCGACATTATTTTGATTAAGAGTTTTAGAGAACTAGAAGGGAAATATATTGATTTGCTTTCCACTTTATCTAAGAAAACTTTGGTTCCTGTTGGTCCACTCGTTCAAGATCCTTTGGGACATGATGAAGATCCAAAAACAGGGCATCTTATAAACTGGCTTGACAAAAGGGCTGAATCTACAGTGGTGTTTGTCTGCTTTGGAAGTGAGTATTTTCCCTCCAATGAGGAATTGGAAGAAGTAGCAATTGGGCTAGAGATTAGCATGGTTAATTTCATATTGGCTGTGAGATTTCTTGAAGGAGAGAAAAAAGGGGTTTTACCAGAGGGGTTTGTTCAAAGGGTAGGAGACAGAGGATTGGTTGTGGAGGGGTGGGCTCCACAGGCAAGAATTTTAGGACATTCAAGCACCGGTGGGTTTGTGAGCCATTGTGGGTGGAGTTCTATTATGGAGAGTGTGAAGTTTGGGGTTCCAGTAATTGCCATGGCCAGGCATCTTGATCAGCCTTTGAATGCTAAGCTGGCGGCGGAGGTCGGTGTGGGCATGGAGGTTGTGAGAGATGAAAATGGGAAGTATAAGAGAGAAGCGATTGCAGAGGTAATAAGAAAAGTCGTGATGGAGAAAAATGGGGAGGTTATCAGGAGGAAAGCAAGGGAATTGAGTGAGAAAATGAAAGAGACAGGAGAGCAAGAGATTGGTAGGGCAGTGGAGGAGCTAGTACAAATTTGTAAGATGAAGAAAGACGCACAATATTAA>Pn3-32-i5ATGGATAACCAAGAAGCTAGAATCAGTATAGTTATGCTGCCATTTTTAGCCCATGGCCACATTTCTCCATTCTTTGAGCTAGCCAAGCATCTCTCAAAAAGAAATTGCAATATATTCCTCTGTTCTACCCCAATCAATCTTAGCTCCATCAAGAACAGAGTATCTGATAAGGATTCCTCTGCTTCTATAAAACTAGTAGAGCTTCATCTTCCATCTTCCCCTGATCTTCCTCCTCACTACCACACCACAAATGGCCTCCCTTCCCATCTCATGGTCCCACTCAGAAACGCCTTTGAAACAGCAGGCCCCACCTTCTCTGAAATCCTTAAAACCTTAAACCCCGATTTGCTTATTTATGATTTCAATCCCTCATGGGCACCGGAGATCGCTTCGTCTCACAATATTCCGGCAGTTTGTTTCATAATTGGGGGAGCAGCCTCCTCTTCCATGAGCCTACATAGTTTCAAAAACCCAGGTGAAAAATACCCATTTCTAGATTTTAATGAGAACAGTAATATTACCCCTGAACCACCTTCAGCAGATAACATGAAGCTATTTCTTGATTTTATGACTTGTTTCGAACGATCTTGCGACATTATTTTGATTAAGAGTTTTAGAGAACTAGAAGGGAAATATTTTGATTTTTTTTCCACTTTATCTGATAAAACTGTGGTTCCTGTTGGTCCACTCGTTCAAGATCCTATGGGCCATAATGAAGATCCAAAAACAGAGCAGTTTATAAACTGGCTTGACAAAAGGGCTGAATCTACAGTGGTGTTTGTCTGCTTTGGAAGTGAGTATTTTCTCTCCAATGAGGAATTGGAAGAAGTAGCAATTGGGCTAGAGATTAGCATGGTTAATTTCATATGGGCTGTGAGATTAATTGAAGGAGAGAAAAAAGGGGTTTTACCAGAGGGGTTTGTTCAAAGGGTAGGAGACAGAGGATTGGTTGTGGAGGGGTGGGCTCCACAGGCAAGAATTTTAGGACATTCAAGCACCGGTGGGTTTGTGAGCCATTGTGGGTGGAGTTCTATTGCGGAGAGTATGAAGTTTGGGGTTCCAGTAATTGCCATGGCTAGGCATCTTGATCAGCCTTTGAATGCTAAGCTGGCGGCGGAGGTTGGTGTGGGCATGGAGGTTGTGAGAGATGATAATGGGAAATATAAGAGGGAAGGGATTGCAGAGGTAATAAGAAAAGTCGTTGTGGAGAAAAGTGGGGAGGTTATCAGGAGGAAAGCAAGGGAGTTGAGTGAGAAAATGAAAGAGAAAGGAGAGCAAGAGATTGATAGGGCAGTGGAGGAGCTAGTACAAATTTGTAAGAAGAAGAAAGATGCACAATAG

65

Page 66: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

>PGM1ATGTCACTTCTAATAGATTCTGTACCAACAGTTGCTTATAAGGACCAAAAACCGGGTACTTCAGGTTTACGTAAGAAGACCAAGGTTTTCATGGATGAGCCTCATTATACTGAGAACTTCATTCAAGCAACAATGCAATCTATCCCTAATGGCTCAGAGGGAACCACTTTAGTTGTTGGAGGAGATGGTCGTTTCTACAACGATGTTATCATGAACAAGATTGCCGCAGTAGGTGCTGCAAACGGTGTCAGAAAGTTAGTCATTGGTCAAGGCGGTTTACTTTCAACACCAGCTGCTTCTCATATAATTAGAACATACGAGGAAAAGTGTACCGGTGGTGGTATCATATTAACTGCCTCACACAACCCAGGCGGTCCAGAGAATGATTTAGGTATCAAGTATAATTTACCTAATGGTGGGCCAGCTCCAGAGAGTGTCACTAACGCTATCTGGGAAGCGTCTAAAAAATTAACTCACTATAAAATTATAAAGAACTTCCCCAAGTTGAATTTGAACAAGCTTGGTAAAAACCAAAAATATGGCCCATTGTTAGTGGACATAATTGATCCTGCCAAAGCATACGTTCAATTTCTGAAGGAAATTTTTGATTTTGACTTAATTAAAAGCTTCTTAGCGAAACAGCGCAAAGACAAAGGGTGGAAGTTGTTGTTTGACTCCTTAAATGGTATTACAGGACCATATGGTAAGGCTATATTTGTTGATGAATTTGGTTTACCGGCAGAGGAAGTTCTTCAAAATTGGCACCCTTTACCTGATTTCGGCGGTTTACATCCCGATCCGAATCTAACCTATGCACGAACTCTTGTTGACAGGGTTGACCGCGAAAAAATTGCCTTTGGAGCAGCCTCCGATGGTGATGGTGATAGGAATATGATTTACGGTTATGGCCCTGCTTTCGTTTCGCCAGGTGATTCTGTTGCCATTATTGCCGAATATGCACCCGAAATTCCATACTTCGCCAAACAAGGTATTTATGGCTTGGCACGTTCATTTCCTACATCCTCAGCCATTGATCGTGTTGCAGCAAAAAAGGGATTAAGATGTTACGAAGTTCCAACCGGCTGGAAATTCTTCTGTGCCTTATTTGATGCTAAAAAGCTATCAATCTGTGGTGAAGAATCCTTCGGTACAGGTTCCAATCATATCAGAGAAAAGGACGGTCTATGGGCCATTATTGCTTGGTTAAATATCTTGGCTATCTACCATAGGCGTAACCCTGAAAAGGAAGCTTCGATCAAAACTATTCAGGACGAATTTTGGAACGAGTATGGCCGTACTTTCTTCACAAGATACGATTACGAACATATCGAATGCGAGCAGGCCGAAAAAGTTGTAGCTCTTTTGAGTGAATTTGTATCAAGGCCAAACGTTTGTGGCTCCCACTTCCCAGCTGATGAGTCTTTAACCGTTATCGATTGTGGTGATTTTTCGTATAGAGATCTAGATGGCTCCATCTCTGAAAATCAAGGCCTTTTCGTAAAGTTTTCGAATGGGACTAAATTTGTTTTGAGGTTATCCGGCACAGGCAGTTCTGGTGCAACAATAAGATTATACGTAGAAAAGTATACTGATAAAAAGGAGAACTATGGCCAAACAGCTGACGTCTTCTTGAAACCCGTCATCAACTCCATTGTAAAATTCTTAAGATTTAAAGAAATTTTAGGAACAGACGAACCAACAGTCCGCACATAG>PGM2ATGTCATTTCAAATTGAAACGGTTCCCACCAAACCATATGAAGACCAAAAGCCTGGTACCTCTGGTTTGCGTAAGAAGACAAAGGTGTTTAAAGACGAACCTAACTACACAGAAAATTTCATTCAATCGATCATGGAAGCTATTCCAGAGGGTTCTAAAGGTGCCACTCTTGTTGTCGGTGGTGATGGGCGTTACTACAATGATGTCATTCTTCATAAGATTGCCGCTATCGGTGCTGCCAACGGTATTAAAAAGTTAGTTATTGGCCAGCATGGTCTTCTGTCTACGCCAGCCGCTTCTCACATCATGAGAACCTACGAGGAAAAATGTACTGGTGGTATTATCTTAACCGCCTCACATAATCCAGGTGGTCCAGAAAATGACATGGGTATTAAGTATAACTTATCCAATGGGGGTCCTGCTCCTGAATCCGTCACAAATGCTATTTGGGAGATTTCCAAAAAGCTTACCAGCTATAAGATTATCAAAGACTTCCCAGAACTAGACTTGGGTACGATAGGCAAGAACAAGAAATACGGTCCATTACTCGTTGACATTATCGATATTACAAAAGATTATGTCAACTTCTTGAAGGAAATCTTCGATTTCGACTTAATCAAGAAATTCATCGATAATCAACGTTCTACTAAGAATTGGAAGTTACTGTTTGACAGTATGAACGGTGTAAC

66

Page 67: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

TGGACCATACGGTAAGGCTATTTTCGTTGATGAATTTGGTTTACCGGCGGATGAGGTTTTACAAAACTGGCATCCTTCTCCGGATTTTGGTGGTATGCATCCAGATCCAAACTTAACTTATGCCAGTTCGTTAGTGAAAAGAGTAGATCGTGAAAAGATTGAGTTTGGTGCTGCATCCGATGGTGATGGTGATAGAAATATGATTTACGGTTACGGCCCATCTTTCGTTTCTCCAGGTGACTCCGTCGCAATTATTGCCGAATATGCAGCTGAAATCCCATATTTCGCCAAGCAAGGTATATATGGTCTGGCCCGTTCATTCCCTACCTCAGGAGCCATAGACCGTGTTGCCAAGGCCCATGGTCTAAACTGTTATGAGGTCCCAACTGGCTGGAAATTTTTCTGTGCTTTGTTCGACGCTAAAAAATTATCTATTTGTGGTGAAGAATCGTTTGGTACTGGTTCCAACCACGTAAGGGAAAAGGACGGTGTTTGGGCCATTATGGCGTGGTTGAACATCTTGGCCATTTACAACAAGCATCATCCGGAGAACGAAGCTTCTATTAAGACGATACAGAATGAATTCTGGGCAAAGTACGGCCGTACTTTCTTCACTCGTTATGATTTTGAAAAAGTTGAAACAGAAAAAGCTAACAAGATTGTCGATCAATTGAGAGCATATGTTACCAAATCGGGTGTTGTTAATTCCGCCTTCCCAGCCGATGAGTCTCTTAAGGTCACCGATTGTGGTGATTTTTCATACACAGATTTGGACGGTTCTGTTTCTGACCATCAAGGTTTATATGTCAAGCTTTCCAATGGTGCAAGATTCGTTCTAAGATTGTCAGGTACAGGTTCTTCAGGTGCTACCATTAGATTGTACATTGAAAAATACTGCGATGATAAATCACAATACCAAAAGACAGCTGAAGAATACTTGAAGCCAATTATTAACTCGGTCATCAAGTTCTTGAACTTTAAACAAGTTTTAGGAACTGAAGAACCAACGGTTCGTACTTAA>UGP1ATGTCCACTAAGAAGCACACCAAAACACATTCCACTTATGCATTCGAGAGCAACACAAACAGCGTTGCTGCCTCACAAATGAGAAACGCCTTAAACAAGTTGGCGGACTCTAGTAAACTTGACGATGCTGCTCGCGCTAAGTTTGAGAACGAACTGGATTCGTTTTTCACGCTTTTCAGGAGATATTTGGTAGAGAAGTCTTCTAGAACCACCTTGGAATGGGACAAGATCAAGTCTCCCAACCCGGATGAAGTGGTTAAGTATGAAATTATTTCTCAGCAGCCCGAGAATGTCTCAAACCTTTCCAAATTGGCTGTTTTGAAGTTGAACGGTGGGCTGGGTACCTCCATGGGCTGCGTTGGCCCTAAATCTGTTATTGAAGTGAGAGAGGGAAACACCTTTTTGGATTTGTCTGTTCGTCAAATTGAATACTTGAACAGACAGTACGATAGCGACGTGCCATTGTTATTGATGAATTCTTTCAACACTGACAAGGATACGGAACACTTGATTAAGAAGTATTCCGCTAACAGAATCAGAATCAGATCTTTCAATCAATCCAGGTTCCCAAGAGTCTACAAGGATTCTTTATTGCCTGTCCCCACCGAATACGATTCTCCACTGGATGCTTGGTATCCACCAGGTCACGGTGATTTGTTTGAATCTTTACACGTATCTGGTGAACTGGATGCCTTAATTGCCCAAGGAAGAGAAATATTATTTGTTTCTAACGGTGACAACTTGGGTGCTACCGTCGACTTAAAAATTTTAAACCACATGATCGAGACTGGTGCCGAATATATAATGGAATTGACTGATAAGACCAGAGCCGATGTTAAAGGTGGTACTTTGATTTCTTACGATGGTCAAGTCCGTTTATTGGAAGTCGCCCAAGTTCCAAAAGAACACATTGACGAATTCAAAAATATCAGAAAGTTTACCAACTTCAACACGAATAACTTATGGATCAATCTGAAAGCAGTAAAGAGGTTGATCGAATCGAGCAATTTGGAGATGGAAATCATTCCAAACCAAAAAACTATAACAAGAGACGGTCATGAAATTAATGTCTTACAATTAGAAACCGCTTGTGGTGCTGCTATCAGGCATTTTGATGGTGCTCACGGTGTTGTCGTTCCAAGATCAAGATTCTTGCCTGTCAAGACCTGTTCCGATTTGTTGCTGGTTAAATCAGATCTATTCCGTCTGGAACACGGTTCTTTGAAGTTAGACCCATCCCGTTTTGGTCCAAACCCATTAATCAAGTTGGGCTCGCATTTCAAAAAGGTTTCTGGTTTTAACGCAAGAATCCCTCACATCCCAAAAATCGTCGAGCTAGATCATTTGACCATCACTGGTAACGTCTTTTTAGGTAAAGATGTCACTTTGAGGGGTACTGTCATCATCGTTTGCTCCGACGGTCATAAAATCGATATTCCAAACGGCTCCATATTGGAAAAT

67

Page 68: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

GTTGTCGTTACTGGTAATTTGCAAATCTTGGAACATTGA>SynAtUXS3 (a codon-optimized version of the AtUXS3 gene)ATGGCTGCAACTTCTGAAAAGCAAAACACTACAAAACCACCACCATCTCCATCACCATTGAGAAACTCAAAGTTCTGTCAACCAAACATGAGAATTTTAATTTCTGGTGGTGCTGGTTTTATTGGTTCACATTTGGTTGATAAATTGATGGAAAACGAAAAGAATGAAGTTGTTGTTGCAGATAACTACTTCACTGGTTCTAAGGAAAATTTGAAGAAATGGATCGGTCATCCAAGATTCGAATTGATCAGACATGATGTTACAGAACCATTGTTGATCGAAGTTGATAGAATCTATCATTTGGCTTGTCCAGCATCACCAATTTTCTATAAGTACAACCCAGTTAAGACTATTAAAACAAATGTTATTGGTACATTGAACATGTTGGGTTTGGCTAAGAGAGTTGGTGCAAGAATTTTGTTGACTTCTACATCAGAAGTTTATGGTGACCCATTGATTCATCCACAACCAGAATCTTACTGGGGTAATGTTAATCCAATTGGTGTTAGATCATGTTATGATGAAGGTAAAAGAGTTGCTGAAACTTTGATGTTCGATTACCATAGACAACATGGTATCGAAATCAGAATCGCAAGAATTTTCAATACATACGGTCCAAGAATGAACATCGATGATGGTAGAGTTGTTTCTAACTTCATCGCTCAAGCATTGAGAGGTGAAGCATTGACTGTTCAAAAGCCAGGTACTCAAACAAGATCTTTTTGTTACGTTTCAGATATGGTTGATGGTTTGATCAGATTGATGGAGGGTAACGATACAGGTCCAATTAATATCGGTAATCCTGGTGAATTCACTATGGTTGAATTGGCTGAAACAGTTAAGGAATTGATTAATCCATCTATCGAAATTAAAATGGTTGAAAATACTCCAGATGATCCAAGACAAAGAAAGCCAGATATCTCAAAGGCAAAGGAAGTTTTGGGTTGGGAACCAAAAGTTAAATTGAGAGAAGGTTTGCCATTGATGGAAGAAGATTTCAGATTGAGATTAAATGTTCCAAGAAATTAA>SynAtUGD1 (a codon-optimized version of the AtUGD1 gene)ATGGTTAAAATTTGTTGTATTGGTGCTGGTTATGTTGGTGGTCCAACTATGGCTGTTATGGCATTGAAATGTCCAGAAATCGAAGTTGTTGTTGTTGATATCTCTGAACCAAGAATTAATGCTTGGAACTCAGATAGATTGCCAATCTATGAACCAGGATTAGAAGATGTTGTTAAGCAATGTAGAGGTAAAAATTTGTTTTTCTCTACTGATGTTGAAAAGCATGTTTTCGAATCTGATATCGTTTTTGTTTCAGTTAATACTCCAACAAAAACTCAAGGTTTGGGTGCTGGTAAAGCTGCAGATTTGACATATTGGGAATCTGCTGCAAGAATGATTGCAGATGTTTCAAAGTCTTCAAAGATCGTTGTTGAAAAATCAACTGTTCCAGTTAGAACAGCTGAAGCAATTGAAAAGATTTTGACTCATAACTCTAAGGGTATCGAATTCCAAATCTTGTCAAATCCAGAATTTTTAGCTGAAGGTACTGCAATTAAAGATTTGTACAACCCAGATAGAGTTTTAATTGGTGGTAGAGATACAGCTGCAGGTCAAAAGGCTATTAAAGCATTGAGAGATGTTTACGCTCATTGGGTTCCAGTTGAACAAATCATCTGTACAAATTTGTGGTCTGCAGAATTGTCAAAGTTGGCTGCAAACGCATTTTTGGCACAAAGAATCTCTTCAGTTAATGCTATGTCTGCATTATGTGAAGCTACTGGTGCAGATGTTACACAAGTTGCTCATGCAGTTGGTACAGATACTAGAATCGGTCCAAAGTTCTTGAATGCTTCTGTTGGTTTCGGTGGTTCATGTTTCCAAAAGGATATCTTGAATTTGATCTATATCTGTGAATGTAACGGTTTGCCAGAAGCTGCAAACTACTGGAAGCAAGTTGTTAAGGTTAACGATTACCAAAAGATTAGATTCGCTAACAGAGTTGTTTCTTCAATGTTCAACACAGTTTCTGGTAAAAAGATTGCTATCTTGGGTTTCGCTTTTAAGAAAGATACAGGTGACACTAGAGAAACACCAGCTATTGATGTTTGTAACAGATTGGTTGCTGATAAGGCAAAGTTGTCTATCTATGATCCACAAGTTTTGGAAGAACAAATCAGAAGAGATTTGTCAATGGCTAGATTTGATTGGGATCATCCAGTTCCATTGCAACAAATTAAAGCAGAAGGTATCTCTGAACAAGTTAACGTTGTTTCAGATGCTTACGAAGCAACTAAAGATGCTCATGGTTTGTGTGTTTTGACAGAATGGGATGAATTCAAATCTTTGGATTTCAAGAAAATTTTCGATAACATGCAAAAACCAGCTTTCGTTTTCGATGGTAGAAACGT

68

Page 69: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

TGTTGATGCTGTTAAGTTGAGAGAAATCGGTTTTATTGTTTACTCTATCGGTAAACCATTGGATTCATGGTTGAAGGATATGCCAGCTGTTGCATAA>SynUGTpg101 (a codon-optimized version of the UGTpg101 gene)ATGAAGTCCGAATTGATTTTTTTGCCAGCTCCAGCTATTGGTCATTTGGTTGGTATGGTTGAAATGGCCAAGTTGTTCATTTCTAGGCACGAAAACTTGTCCGTTACCGTTTTGATTGCTAAGTTCTACATGGATACCGGTGTTGACAATTACAACAAGTCCTTGTTGACTAACCCAACTCCAAGATTGACTATCGTTAACTTGCCAGAAACTGACCCACAAAACTATATGTTGAAACCTAGACATGCCATCTTTCCATCCGTTATTGAAACTCAAAAGACCCACGTTAGAGACATCATTTCTGGTATGACTCAATCCGAATCTACCAGAGTTGTTGGTTTGTTGGCTGATTTGTTGTTCATCAACATTATGGATATCGCCAACGAATTCAACGTTCCAACTTATGTTTATTCTCCAGCTGGTGCTGGTCACTTGGGTTTAGCTTTTCACTTGCAAACTCTGAACGATAAGAAGCAAGACGTTACCGAATTCAGAAACTCTGATACCGAATTATTGGTTCCCTCATTTGCTAATCCAGTTCCAGCTGAAGTTTTGCCATCTATGTATGTTGACAAAGAAGGTGGTTACGACTACCTGTTTTCTTTGTTCAGAAGATGCAGAGAATCCAAGGCCATTATTATCAACACCTTCGAAGAATTGGAACCCTACGCTATTAACTCCTTGAGAATGGATTCTATGATCCCACCAATCTATCCAGTTGGTCCAATTTTGAATTTGAACGGTGATGGTCAAAACTCCGATGAAGCTGCTGTTATTTTAGGTTGGTTGGATGATCAACCACCATCCTCTGTTGTTTTTTTGTGTTTTGGTTCCTACGGCACCTTCCAAGAAAATCAAGTAAAAGAAATCGCCATGGGTCTAGAAAGATCTGGTCATAGATTTTTGTGGTCTTTGAGGCCATCTATTCCAAAGGGTGAAACTAAGTTGCAGTTGAAGTACTCTAACCTGGAAGAAATTTTGCCAGTTGGTTTCTTGGATAGAACCTCTTGTGTTGGTAAAGTTATTGGTTGGGCTCCACAAGTTGCTGTTTTGGGTCATGAAGCTGTTGGTGGTTTTTTGTCTCATTGTGGTTGGAACTCTACCTTGGAATCTGTTTGGTGTGGTGTTCCAGTTGCTACTTGGCCAATGTATGGTGAACAGCAATTGAATGCTTTCGAAATGGTCAAAGAATTGGGTATCGCCGTTGAAATCGAAGTTGATTACAAGAACGAATACTTCAACATGAACAACGACTTCATCGTTAGAGCCGAAGAAATCGAAACGAAGATCAAAAAGTTGATGATGGACGAGAAGAACTCCGAGATTCGTAAGAAAGTCAAAGAGATGAAGGAAAAGTCCAGATTGGCTATGTCTGAAAACGGTTCTTCTTACAACTCTTTGGCCAAGCTGTTTGAAGAGATCATGTGA>SynPn3-29 (a codon-optimized version of the Pn3-29 gene)ATGAAAAGTGAGCTAATTTTTCTACCAGCTCCAGCTATTGGTCATCTAGTTGGTATGGTTGAAATGGCAAAGTTATTCATTTCTAGGCACGAGAATTTGAGTGTTACTGTTCTAATTGCTAAGTTTTATATGGACACTGGTGTTGATAATTATAACAAGAGTCTATTGACTAAACCAACTCCAAGATTAACTATAGTTAACTTGCCAGAGTCTGATCCACAGAATTACATGTTAAAGCCTAGACATGCTATTTTCCCAAGTGTTATTGAAACACAAAAGACTCATGTTCGTGATATTATTTTGGGTATGACACAATCTGAATCAACACGTGTTGTAGGATTACTAGCAGATCTTCTATTCATTAATATAATGGATATAGCTAACGAATTCAACGTACCTACATACGTTTATAGTCCAGCTGGTGCTGGACACCTAGGTTTAGCTTTTCACTTGCAAACTCTAAATGATAAGAAACAGGACGTTACTGAATTCAGACATTCAGATACAGAACTTTTAGTTCCATCATTCGCTAATCCAGTACCAGCTGAAGTTCTTCCATCTATGTACGTTGACAAGGAAGGTGGTTACGACTACTTATTCAGTCTTTTTAGACGTTGTAGGGAAAGTAAAGCTATAATAATCAATACTTTCGAAGAATTGGAGCCATACGCTATTAACAGTTTGCGTATGGACTCTATGATTCCACCAATATACCCAGTTGGTCCAATTTTGAACCTAAATGGAGACGGTCAGAATAGTGACGAAGCAGCAGTTATTTTAGGATGGCTAGATGACCAGCCTCCAAGTTCAGTTGTTTTCCTTTGTTTCGGATCATACGGTACTTTCCAGGAGAATCAAGTTAAAGAAATAGCTATGGGATTAGA

69

Page 70: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

AAGGTCCGGTCACAGGTTTTTATGGTCACTAAGACCATCCATTCCAAAGGGTGAAACTAAATTACAATTGAAGTATAGTAACTTAGAGGAGATATTGCCTGTTGGTTTCTTAGATCGAACTTCTTGTGTTGGTAAAGTAATAGGTTGGGCTCCACAGGTTGCTGTTCTAGGTCATGAAGCTGTTGGTGGTTTCTTATCACACTGCGGATGGAACTCTACTTTGGAATCCGTTTGGTGCGGTGTTCCAGTTGCTACTTGGCCTATGTATGGTGAACAGCAGTTAAACGCATTCGAAATGGTAAAAGAATTAGGAATAGCTGTTGAGATAGAAGTTGATTACAAAAACGAGTACTTCAATATGACTAACGACTTCATAGTAAGAGCTGAGGAGATAGAAACTAAGATTAAAAAGTTGATGATGGACGAGAAAAACTCAGAGATTCGAAAGAAAGTTAAAGAGATGAAGGAGAAATCTCGTCTAGCTATGTCAGAAAACGGTTCCAGTTACAACTCTCTAGCTAAATTGTTCGAAGAGATAATGTGA

70

Page 71: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

Supplementary Sequences 3

>SlGAME1(Gal)

MAGENLHVLFLPFLSAGHFIPLVNAARLFASRGVKATILTTPHNALLFRSTIEDDVRISGFPI

SIVTIKFPSAEVGLPEGIESFNSATSPEMPHKVFYALYLLQKPMEDKIRELHPDCIFSDMYY

PWTVDLAEELHIPRILYNLSAYMCYSIMHNLKLYRPHKQPDLDESQSFVVPGLPDEIKFKL

SQLTEDLRKPEDQKSVFDELLEKVRVSEERSYGIVHDTFYELEPAYINYYQKLKKPKWWH

FGPLSHFASKIRSKELITEHNNNDIVVDWLNAQKPKSVLYVSFGSMARFPENQLNEIAQAL

HASNVPFIFVLRPNEETASWLPVGNFEDKTKKGLFIVGWVPQLTIMEHPATGGFMTHCGT

NSVLEANTFGVPMITWPLYADQFYNEKVVEVNGLGIKIGIDVWNDGIEITGPVIGSAKIRE

AIERLMSSNDSEEIMNIRDRVMAMSKMAQDATDEGGSSWNNLTALIQHIKNFNLN

>SlGAME2(Xyl)

MAMEENEQSAIAHVVFIPYAMTSHITPLVHIARLLAFHGLKVTIIAPPHNALLFQSSVDRD

CLFWGSNISVRTIQFPSEEIGLPVGIENFIASPSMEIVGKVHYGFLLLQKPMEQMIRELNPN

CIISDMFFPWTVDLAEELQIPRFSFQPGTFVHQCAWVFIRELKPYENHVSFSIPGLPLDIQM

KVSEIEDFLKGETEYRKTVEDVLQAEIRSHGIIHNTCSELEPGFAQLYEKARGVKGWHIGP

VALFINNYEAENSCCDPWKGYGDCFDWLENQQSKSVLFVCFGSMIRFSDDQLKEMAVGL

KAANCPTIWVFKEQDKNGFCSKRLKEMKGENMFIIEGWAPQVSILKHGAIGGFLTHCGW

NSILESLSVGVPLITWPLFSDNFYTDKLLEKLGLAIGIGADVWNPGFILSCPSLSGEKIELAV

KRLINNSEESRNIRENAKLMAKKLKVATEEGGSSHAQLMGLIHEIKRCALKKSSL

>SlGAME18

MGNMRIVMLPYLAYGHITPFLELAKKLSNRGFSIHICSSPINLSFIKAKIPEKYSSSIHLVEL

HLPNLPELPPHHHTTNGLPNHLKQTLFKTLKMTKPQLHQILSDLKPDFFIYDIMLLWSAVV

ASSLNIPSLRFYTVNAAIFSYFFHFYFNPGEEFPFPALYMRDYELAKMTHEVADDAEVEVD

RDKVTESDKFVLVHSTKSIDGKYMDYLCGTGQAKVVPIGTESPEDGVGDVDKIDIELVK

WLEKKTEHSTVYVSFGSEYFLSKEEMEEVAYGLEVSGVDFIWVVRYQKGEQLELPQGFK

ERIGDRGRIIEGWAPQQRILKHSSIGGFVTHCGWNSTLESIEFGVPIIAMPMLYDQPLNARL

MVENGVAVEVPRDEKGNLDRVNIAEKIKHVIRDETGENLRKKMNNLGENVRSQREEEM

DGVVKVIQLLIDEKKGTL

>SlGAME17

71

Page 72: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

MDNKDDVAVVIAPLHLQSHLRQLLHFACRISSYGLPVYYLGLAASNREARQHSTTLNPCD

IEKIHFHDLQIPNEDPPPDIPMHIWHASMRTREPIASFLGDISSKARRIVVVHDLLMSYNVQ

DISSYPNGESYVFHCLPIFDMYCSHYAPAAGFPIPLEEALLKRLPSNDGCYNPEDVEHHAK

YLNHCMGKNAGDIFNTSQVIDGTAIVDSMANLATMQNKKLWALGPILLTQDHDKVQTK

HFCLDWLNKQPPKSVIYVSFGTSTSFSAEQIKELAIGLELSKQKFIWVLRNADVGDPVNN

KCEENRSGTLELPEGFEERVKGVGLVVREWAPQQEIFAHPSTGGFMSHCGWTSCLESIIEG

VPIAAWPIQFDQPKNAFLVAEVLKIGLHVREWEQRNELVTASTVQNIVCKLMASEEGDMI

RKRAQELRTAVKRSTEEGGVSRMELDSFIAHITR

>StSGT3(Rha)

MAMEQNEETAMPHVVFIPYAMTSHITPLVHIARLFALHGLKVTIIAPQHNALLFQSSVDRD

RLFSGSNITVRTIQFPSEEVGLPVGIENFIASPSMEIVGKVHYGFILLQKIMEQLIREINPNCI

VSDMFFPWTVDLAEEMQIPRFSFQPATSIHQCAWVLIREFKPYKNVASDSERFLIPGLPLDI

KMKVSEIEDFLKEETEYTKTVDDVLQAEVRSHGIIHNTCSELEPGVAQLYEKARGVKGW

HIGPLALFINKYEAEISSKQISNSNINSCSDPWKGYGDCFNWLENQQPNSVLFVCFGSMIR

FSDDQLKEMAVGLKAANCPTIWVFREQDKNEVDEKDEHSDWSRNGFKEMIGEKMFIIQG

WAPQQLILKHRAIGGFLTHCGWNSILESLAIGVPLITWPLFSDNFYTDKLLETLGLAIGIGA

DVWNPGFILSCPPLSGEKIELAVKRLMNNSEESRKIRENAKLMAKKLKSATEEGGSSHSQ

LIGLIEEIKRCAFKKSS

>StSGT1(Gal)

MVATCNNGEILHVLFLPFLSAGHFIPLVNAARLFASRGVKATILTTPHNALLFRSTIDDDVR

ISGFPISIVTIKFPSAEVGLPEGIESFNSATSPEMPHKIFYALSLLQKPMEDKIRELRPDCIFSD

MYFPWTVDIADELHIPRILYNLSAYMCYSIMHNLKVYRPHKQPNLDESQSFVVPGLPDEI

KFKLSQLTDDLRKSDDQKTVFDELLEQVEDSEERSYGIVHDTFYELEPAYVDYYQKLKKP

KCWHFGPLSHFASKIRSKELISEHNNNEIVIDWLNAQKPKSVLYVSFGSMARFPESQLNEI

AQALDASNVPFIFVLRPNEETASWLPVGNLEDKTKKGLYIKGWVPQLTIMEHSATGGFMT

HCGTNSVLEAITFGVPMITWPLYADQFYNEKVVEVRGLGIKIGIDVWNEGIEITGPVIESA

KIREAIERLMISNGSEEIINIRDRVMAMSKMAQNATNEGGSSWNNLTALIQHIKNYNLN

>GmUGT73F4(Xyl)

MDLQQRPLKLHFIPYLSPGHVIPLCGIATLFASRGQHVTVITTPYYAQILRKSSPSLQLHVV

DFPAKDVGLPDGVEIKSAVTDLADTAKFYQAAMLLRRPISHFMDQHPPDCIVADTMYSW

72

Page 73: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

ADDVANNLRIPRLAFNGYPLFSGAAMKCVISHPELHSDTGPFVIPDFPHRVTMPSRPPKMA

TAFMDHLLKIELKSHGLIVNSFAELDGEECIQHYEKSTGHKAWHLGPACLVGKRDQERGE

KSVVSQNECLTWLDPKPTNSVVYVSFGSVCHFPDKQLYEIACALEQSGKSFIWIVPEKKG

KEYENESEEEKEKWLPKGFEERNREKGMIVKGWAPQLLILAHPAVGGFLSHCGWNSSLE

AVTAGVPMITWPVMADQFYNEKLITEVRGIGVEVGATEWRLVGYGEREKLVTRDTIETAI

KRLMGGGDEAQNIRRRSEELAEKAKQSLQEGGSSHNRLTTLIADLMRLRDSKSAT

>BvUGT73C10

MVSEITHKSYPLHFVLFPFMAQGHMIPMVDIARLLAQRGVKITIVTTPHNAARFENVLSR

AIESGLPISIVQVKLPSQEAGLPEGNETFDSLVSTKLLVPFFKAVNMLEEPVQKLFEEMSPQ

PSCIISDFCLPYTSKIAKKFNIPKILFHGMCCFCLLCMHVLRKNREILENLKSDKEHFVVPY

FPDRVEFTRPQVPLATYVPGEWHEIKEDMVEADKTSYGVIVNTYQELEPAYANGYKEARS

GKAWTIGPVSLCNKVGADKAERGNKADIDQDECLKWLDSKEEGSVLYVCLGSICSLPLS

QLKELGLGLEESQRPFIWVVRGWEKNKELLEWFSESGFEERVKDRGLLIKGWSPQMLIL

AHHSVGGFLTHCGWNSTLEGITSGVPLLTWPLFGDQFCNQKLVVQVLKVGVSAGVEEVT

NWGEEEKIGVLVDKEGVKKAVEELMGESDDAKEIRKRVKELGQLAHKAVEEGGSSHSNI

TSLLEDIMQLAQPNN

>BvUGT73C11

MVSEITHKSYPLHFVLFPFMAQGHMIPMVDIARLLAQRGVKITIVTTPHNAARFENVLSR

AIESGLPISIVQVKLPSQEAGLPEGNETFDSLVSMELLVPFFKAVNMLEEPVQKLFEEMSPQ

PSCIISDFCLPYTSKIAKKFNIPKILFHGMCCFCLLCMHVLRKNREILENLKSDKEHFVVPY

FPDRVEFTRPQVPMATYVPGEWHEIKEDIVEADKTSYGVIVNTYQELEPAYANDYKEARS

GKAWTIGPVSLCNKVGADKAERGNKADIDQDECLKWLDSKEEGSVLYVCLGSICSLPLS

QLKELGLGLEESQRPFIWVVRGWEKNKELLEWFSESGFEERVKDRGLLIKGWSPQMLIL

AHHSVGGFLTHCGWNSTLEGITSGIPLLTWPLFGDQFCNQKLVVQVLKVGVSAGVEEVT

NWGEEEKIGVLVDKEGVKKAVEELMGESDDAKERRKRVKELGQLAQKAVEEGGSSHSN

ITSLLEDIMQLAQSNN

>BvUGT73C12

MVSEITHKSYPLHFVLFPFMAQGHMIPMVDIARLLAQRGVKITIVTTPHNAARFKNVLSR

AIESGLPISIVQVKLPSQEAGLPEGNETLDSLVSMELMIHFLKAVNMLEEPVQKLFEEMSP

QPSCIISDFCLPYTSKIAKKFNIPKILFHGMCCFCLLCMHILRKNREIVENLKSDKEHFVVP

73

Page 74: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

YFPDRVEFTRPQVPVATYVPGDWHEITEDMVEADKTSYGVIVNTYQELEPAYANDYKEA

RSGKAWTIGPVSLCNKVGADKAERGNKADIDQDECLKWLNSKEEGSVLYVCLGSICNLP

LSQLKELGLGLEESQRPFIWVIRGWEKNKELHEWFSESGFEERIKDRGLLIKGWAPQMLIL

SHHSVGGFLTHCGWNSTLEGLTAGLPLLTWPLFADQFCNEKLAVQVLKAGVSAGVDQPM

KWGEEEKIGVLVDKEGVKKAVEELMGESDDAKEIRRRAKELGELAHKAVEEGGSSHSNI

TSLLEDIMQLAQSNN

>BvUGT73C13

MVSEITHKSYPLHFVLFPFMAQGHMIPMVDIARLLAQRGVKITIVTTPHNAARFENVLNR

AIESGLPISIVQVKLPSQEAGLPEGNETFDSLVSMELLVPFFKSVNMLEEPVQKLFEEMSPQ

PSCIISDFCLPYTSKIAKKFNIPKILFHGMCCFCLLCMHVLRKNHEIVENLKSDKEHFVVPY

FPDRVEFTRPQVPVATYVPGDWHEITGDMVEADKTSYGVIVNTCQELEPAYANDYKEARS

GKAWTIGPVSLCNKVGADKAERGNKADIDQDECLKWLNSKEEGSVLYVCLGSICNLPLS

QLKELGLGLEESQRPFIWVIRGWEKNKELLEWFSESGFEERIKDRGLLIKGWAPQMLILSH

HSVGGFLTHCGWNSTLEGLTAGLPLLTWPLFADQFCNEKLAVQVLKAGVSAGVDQPMK

WGEEEKIGVLVDKEGVKKAVEELMGESDDAKEIRRRAKELGELAHKAVEEGGSSHSNIT

SLLEDIMQLAQSNN

>PgUGT74AE2

MLSKTHIMFIPFPAQGHMSPMMQFAKRLAWKGVRITIVLPAQIRDSMQITNSLINTECISFD

FDKDDGMPYSMQAYMGVVKLKVTNKLSDLLEKQKTNGYPVNLLVVDSLYPSRVEMCH

QLGVKGAPFFTHSCAVGAIYYNAHLGKLKIPPEEGLTSVSLPSIPLLGRDDLPIIRTGTFPDL

FEHLGNQFSDLDKADWIFFNTFDKLENEEAKWLSSQWPITSIGPLIPSMYLDKQLPNDKG

NGINLYKADVGSCIKWLDAKDPGSVVYASFGSVKHNFGDDYMDEVAWGLLHSKYNFIW

VVIEPERTKLSSDFLAEAEEKGLIVSWCPQLEVLSHKSIGSFMTHCGWNSTVEALSLGVP

MVAVPQQFDQPVNAKYIVDVWQIGVRVPIGEDGVVLRGEVANCIKDVMEGEIGDELRGN

ALKWKGLAVEAMEKGGSSDKNIDEFISKLVSS

>PgUGT94Q2

MDNQNGRISIALLPFLAHGHISPFFELAKQLAKRNCNVFLCSTPINLSSIKDKDSSASIKLV

ELHLPSSPDLPPHYHTTNGLPSHLMLPLRNAFETAGPTFSEILKTLNPDLLIYDFNPSWAPEI

ASSHNIPAVYFLTTAAASSSIGLHAFKNPGEKYPFPDFYDNSNITPEPPSADNMKLLHDFIA

CFERSCDIILIKSFRELEGKYIDLLSTLSDKTLVPVGPLVQDPMGHNEDPKTEQIINWLDKR

74

Page 75: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

AESTVVFVCFGSEYFLSNEELEEVAIGLEISTVNFIWAVRLIEGEKKGILPEGFVQRVGDRG

LVVEGWAPQARILGHSSTGGFVSHCGWSSIAESMKFGVPVIAMARHLDQPLNGKLAAEV

GVGMEVVRDENGKYKREGIAEVIRKVVVEKSGEVIRRKARELSEKMKEKGEQEIDRALE

ELVQICKKKKDEQ

>PgUGT71A29

MKSELISLAA.AIGRLVGMVEMAKLFISRHENLSVTVLIAKFYMDTGVDNYNKSLLTNPTP

RLTIVNLPESDPQNYMLKPRHAIFPSVIETQKTHVRDIISGMTQSESTRVVGLLADLLFINI

MDIANEFNVPTYVYSPAGAGHLGLAFHLQTVNDKKQDVTEFRNSDTELLVPSFANPVPAE

VLPSMYVDKEGGYDYLFSLFRKCRESKAIIINTFEELEPYAINSLRMDSMIPPIYPVGPILNL

NGDGQNADEAAVILGWLDDQPPSSVVFLCFGSYGTFQENNQVKEIAMGLERSGHRFLWS

LRSSIPKGETKLQLKYSNLKEILPVGFLDRTSCVGKVIGWAPQVAVLGHESVGGFLSHCG

WVSTLESVWCGPVATWPMYGEQQLNAFEMVKELGIAVEIEVDYKKDYFNMKNDFIVRA

KEIETKIKKLMMDENNSEIRKKVLEMKEKSRAASSENGSSYNSLAKLFEEIM

>SgUGT74AC1

MEKGDTHILVFPFPSQGHINPLLQLSKRLIAKGIKVSLVTTLHVSNHLQLQGAYSNSVKIE

VISDGSEDRLETDTMRQTLDRFRQKMTKNLEDFLQKAMVSSNPPKFILYDSTMPWVLEV

AKEFGLDRAPFYTQSCALNSINYHVLHGQLKLPPETPTISLPSMPLLRPSDLPAYDFDPAST

DTIIDLLTSQYSNIQDANLLFCNTFDKLEGEIIQWMETLGRPVKTVGPTVPSAYLDKRVEN

DKHYGLSLFKPNEDVCLKWLDSKPSGSVLYVSYGSLVEMGEEQLKELALGIKETGKFFL

WVVRDTEAEKLPPNFVESVAEKGLVVSWCSQLEVLAHPSVGCFFTHCGWNSTLEALCLG

VPVVAFPQWADQVTNAKFLEDVWKVGKRVKRNEQRLASKEEVRSCIWEVMEGERASEF

KSNSMEWKKWAKEAVDEGGSSDKNIEEFVAMLKQT

>BsYjiC

MKKYHISMINIPAYGHVNPTLALVEKLCEKGHRVTYATTEEFAPAVQQAGGEALIYHTSLN

IDPKQIREMMEKNDAPLSLLKESLSILPQLEELYKGDQPDLIIYDFVALAGKLFAEKLNVPV

IKLCSSYAQNESFQLGNEDMLKKIKEAEAEFKAYLEQEKLPAVSFEQLAVPEALNIVFMPK

SFQIQHETFDDRFCFVGPSLGERKEQEGLLIDKDDRPLMLISLGTAFNAWPEFYKMCIKAF

RDSSWQVIMSVGKTIDPESLEDIPANFTIRQSVPQLEVLEKADLFISHGGMNSTMEAMNA

GVPLVVIPQMYEQELTANRVDELGLGVYLPKEEVTVSSLQEAVQAVSSDQELLTRVKNMQ

KDVKEAGGAERAAAEIEAFMKKSAVPQ

75

Page 76: ars.els-cdn.com · Web viewSupplementary information . Elucidation of the complete biosynthetic pathway of the main triterpene glycosylation products of . Panax notoginseng . using

>PgUGTpg100

MKSELIFLPVPAFGHLVGMVEMAKLFISRHENLSVTVLISKFFIDTGIDNYNKSLLAKPTPR

LTIINLPEIDPQKYLLKPRCAIFPSLIENQKTHVRDVMSRMTQSESTRVVGLLADILFVDIFD

IADEFNVPTYVYSPAGAGFLGLAFHLQTLNDDKKQDVTEFRNSDTELLVPSFANPVPAEFL

PSIFLEKDGRHDVLLSLYWRCREAKGIIVNTFEELEPYAINSLRMDSMIPPIYPVGPILNLN

GEGQNSDEAAVILGWLDDQPPSSVVFLCFGSFGSFPENQVKEIAMGLERSGHRFLWSLRP

CISEGETTLQLKYSNLELPAGFLDRTSCVGKVIGWAPQMAILAHEAVGGFVSHCGWNSVL

ESVWYGMPVATWPMYGEQQLNAFEMVKELGLAVEIEVDYRNEYNKSDFIVKADEIETKI

KKLMMDGKNSKIRKKVKEMKEKSRVAMSENGSSYTSLAKLFEEIM

>PgUGTpg101

MKSELIFLPAPAIGHLVGMVEMAKLFISRHENLSVTVLIAKFYMDTGVDNYNKSLLTNPTP

RLTIVNLPETDPQNYMLKPRHAIFPSVIETQKTHVRDIISGMTQSESTRVVGLLADLLFINI

MDIANEFNVPTYVYSPAGAGHLGLAFHLQTLNDKKQDVTEFRNSDTELLVPSFANPVPAE

VLPSMYVDKEGGYDYLFSLFRRCRESKAIIINTFEELEPYAINSLRMDSMIPPIYPVGPILNL

NGDGQNSDEAAVILGWLDDQPPSSVVFLCFGSYGTFQENQVKEIAMGLERSGHRFLWSL

RPSIPKGETKLQLKYSNLEEILPVGFLDRTSCVGKVIGWAPQVAVLGHEAVGGFLSHCGW

NSTLESVWCGVPVATWPMYGEQQLNAFEMVKELGIAVEIEVDYKNEYFNMNNDFIVRA

EEIETKIKKLMMDEKNSEIRKKVKEMKEKSRLAMSENGSSYNSLAKLFEEIM

76