Transcript
Page 1: Supplementary Table 1 Description of the ... - Nature Research1 Supplementary Table 1 Description of the 1000 Genomes Project Phase 3 representing 2504 individuals from 26 different

1

SupplementaryTable1Descriptionofthe1000GenomesProjectPhase3representing2504individualsfrom26differentglobalpopulationsthatareassignedtofivesuper-populations

Numberofindividuals

Populationcode

Populationdescription

Super-population

Super-populationcode

103 CHB HanChineseinBeijing,China EastAsian EAS

104 JPT JapaneseinTokyo,Japan EastAsian EAS

105 CHS SouthernHanChinese EastAsian EAS

93 CDX DaiinXishuangbanna,China EastAsian EAS

99 KHV KinhinHoChiMinhCity,Vietnam EastAsian EAS

99 CEU UtahResidents(CEPH) European EUR

107 TSI ToscaniinItalia European EUR

99 FIN FinnishinFinland European EUR

91 GBR BritishinEnglandandScotland European EUR

107 IBS IberianpopulationinSpain European EUR

108 YRI YorubainIbadan,Nigera African AFR

99 LWK LuhyainWebuye,Kenya African AFR

113 GWD GambiansfromTheGambia African AFR

85 MSL MendeinSierraLeone African AFR

99 ESN EsaninNigeria African AFR

61 ASW AfricanAmericansinSWUSA African AFR

96 ACB AfricanCaribbeansinBarbados African AFR

64 MXL MexicanAncestryfromLosAngeles AdmixedAmerican AMR

104 PUR PuertoRicansfromPuertoRico AdmixedAmerican AMR

94 CLM ColombiansfromMedellin,Colombia AdmixedAmerican AMR

85 PEL PeruviansfromLima,Peru AdmixedAmerican AMR

103 GIH GujaratiIndianfromTexas SouthAsian SAS

96 PJL PunjabifromLahore,Pakistan SouthAsian SAS

86 BEB BengalifromBangladesh SouthAsian SAS

102 STU SriLankanTamilfromtheUK SouthAsian SAS

102 ITU IndianTelugufromtheUK SouthAsian SASDatageneratedfrom:ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/release/20130502/

http://www.1000genomes.org/category/frequently-asked-questions/population

Page 2: Supplementary Table 1 Description of the ... - Nature Research1 Supplementary Table 1 Description of the 1000 Genomes Project Phase 3 representing 2504 individuals from 26 different

2

SupplementaryTable2:pleaseseeseparatesupplementaryattachmentforthesedataandrelatedkeythatprovidescolumnheadingdescriptions.SupplementaryTable3:pleaseseeseparatesupplementaryattachmentforthesedataandrelatedkeythatprovidescolumnheadingdescriptions.

SupplementaryTable4.Functionalannotationsofhighlydifferentiatedpharmacogenomicvariantsinthe17pharmacogenesthatcontainedvariantswerecommoninonepopulation(>5%),butrare(0.05%)intheglobalcohort.

ID Population Gene Annotation CADD

rs35865660 GWD ACE Missense 0.14

rs185351296 PEL ACE Intron 8.93

rs75067113 ESN ACE Synonymous 16.31

rs4979 LWK ACE Synonymous 14.05

rs139471271 LWK ACYP2 Intron 1.05

rs376175333 GIH ALK Missense 21.50

rs6003592 ESN BCR Intron 6.64

rs4986854 JPT BRCA1 Missense 0.023

rs113857788 KHV CFTR Missense 26.50

rs116593005 LWK CRHR1 Intron 14.21

rs28399501 FIN CYP2B6 3’UTR 3.47

rs181297724 FIN CYP2C19 Missenseandspliceregion 22.90

rs11572079 FIN CYP2C8 Spliceregion 0.13

rs141759372 ESN GSTT1 Missense 27.00

rs142766358 LWK HAS3 Missense 14.10

rs17181024 ESN OPRM1 Intron 6.20

rs200451188 ESN RYR1 Synonymous 11.70

rs200023171 GIH RYR1 Spliceregion 2.45

rs17215493 GWD SCN5A Synonymous 2.90

rs140504750 ASW SULT1A1 Intron 3.97

rs186462665 FIN TXNRD2 Synonymous 10.98

Potentiallydeleteriousvariants(CADDscores≥15)aredisplayedinbold.Independentsignals(r2<0.4)withthelargestallelefrequencydifferencesbetweentheremaining

populationsforeachgeneregionarereported.

SupplementaryTable5:pleaseseeseparatesupplementaryattachmentforthesedataandrelatedkeythatprovidescolumnheadingdescriptions.

Page 3: Supplementary Table 1 Description of the ... - Nature Research1 Supplementary Table 1 Description of the 1000 Genomes Project Phase 3 representing 2504 individuals from 26 different

3

SupplementaryFigure1Principalcomponentanalysis(PCA)ofpharmacogenomicvariantsstratifiedindividualsintocontinentalsuperpopulations,withtheadmixedAmericanindividualsseparatingalongclinesbetweenthemainancestralclusters.Principalcomponent1separatestheAfricanindividualsfromtherestofthecontinentalpopulations,whileprincipalcomponent2distinguishesEuropean,SouthandEastAsianpopulationgroups.

SupplementaryFigure2WeirandCockerham’sfixationindex(FST)statisticsx100forsynonymouspharmacogenomicvariantsinthe1000GenomesProjectindividuals.Populationstendedtobethemostsimilaronthecontinentallevel(indicatedbylightershadesofblue)andtheadmixedAmericanpopulationsshowingdifferingdegreesofdifferentiationfromtheEuropeanpopulations.

Page 4: Supplementary Table 1 Description of the ... - Nature Research1 Supplementary Table 1 Description of the 1000 Genomes Project Phase 3 representing 2504 individuals from 26 different

4

SupplementaryFigure3Thenumberofpolymorphicsitesineachoftheindividual1000GenomesProjectindividualpopulationsstratifiedbysingletonsites.Africanpopulationshadthehighestnumberofpolymorphicsitesintheirpharmacogenes.

SupplementaryFigure4Themeannumberofpharmacogenomicsingletonsperindividualineachofthe1000GenomesPopulations.Individualscarriedbetween1and3uniquepharmacogenomicvariantsonaverageandthisclassofvariationmadeup52.9%ofthedataset.

Page 5: Supplementary Table 1 Description of the ... - Nature Research1 Supplementary Table 1 Description of the 1000 Genomes Project Phase 3 representing 2504 individuals from 26 different

5

SupplementaryFigure5Allelefrequenciesofpharmacogenomicvariantsinthe23pharmacogenesthatdisplayedatleastonehighlydifferentiatedsignal(pairwiseFST>0.5foroneormoresuper-populationcomparison).ThehighestmeanFSTvariantforindependentsignal(r

2<0.4)eachgeneregionisreported.

SupplementaryFigure6Minorallelefrequenciesofhighlydifferentiatedpharmacogenomicvariantsinthe17pharmacogenesthatcontainedvariantswerecommoninonepopulation(>5%),butrare(0.05%)intheglobalcohort.Thedashedlinerepresentsthe5%minorallelefrequency.Independentsignals(r2<0.4)withthelargestallelefrequencydifferencesbetweentheremainingpopulationsforeachgeneregionarereported.

Page 6: Supplementary Table 1 Description of the ... - Nature Research1 Supplementary Table 1 Description of the 1000 Genomes Project Phase 3 representing 2504 individuals from 26 different

6

SupplementaryFigure7Meansequencingcoverageforeachofthe1000GenomesProjectindividualsforpharmacogenesthatareknowntocarryhighclinical

evidencevariants(i.e.PharmGKB1A/B)separatedbysuper-population.

Page 7: Supplementary Table 1 Description of the ... - Nature Research1 Supplementary Table 1 Description of the 1000 Genomes Project Phase 3 representing 2504 individuals from 26 different

7

SupplementaryFigure8Proportionofdifferentqualityvariantsinthetenpharmacogenesthatwereflaggedaspotentiallyproblematicforshortreadsequencingdata.HighQCvariantshadSVMscores>0.3,whilemarginalvariantshadscoresbetween0and0.3.Failvariantswerenotincludedinthefinalreleaseofthe1000GenomesProjectandhadSVMscores<0.

Page 8: Supplementary Table 1 Description of the ... - Nature Research1 Supplementary Table 1 Description of the 1000 Genomes Project Phase 3 representing 2504 individuals from 26 different

8

SupplementaryFigure9Astrongcorrelationinallelefrequencieswereobservedbetweenoverlappingpharmacogenomicvariantsinthe1000GenomesProjectandthosegeneratedbyexternalprojects:AComparisonbetweenglobalallelefrequencyinthe1000GenomesProjectandsequencingdatageneratedthroughtheExomeAggregationConsortium(10871variants)B-FComparisonbetweensuper-populationallelefrequenciesofthe1000

GenomesProjectandtheHumanGenomeDiversityProject(136array-genotypedvariants).


Top Related