data management and preliminary analysis of snps

25
DATA MANAGEMENT AND PRELIMINARY ANALYSIS OF SNPS Samantha Cravens

Upload: kevlyn

Post on 23-Feb-2016

81 views

Category:

Documents


0 download

DESCRIPTION

Data management and preliminary analysis of SNPS. Samantha Cravens. Overview. AHB Data Manage measured genotypic data (SNPs) Merge two batches Label SNPs Preliminary analyses of SNPs Focus on dopaminergic system DRD2 . AHB (Alcohol health and behavior). - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Data management and preliminary analysis of SNPS

DATA MANAGEMENT

AND PRELIMINARY ANALYSIS OF

SNPSSamantha Cravens

Page 2: Data management and preliminary analysis of SNPS

OVERVIEW AHB Data

Manage measured genotypic data (SNPs) Merge two batches Label SNPs

Preliminary analyses of SNPs Focus on dopaminergic system

DRD2

Page 3: Data management and preliminary analysis of SNPS

AHB (ALCOHOL HEALTH AND BEHAVIOR)

Prospective study of 489 college students enrolling as first-time freshman in 1987

Approximately half with positive history of paternal alcohol; half with no family history

Equal numbers of men and women Extensive assessments at years 1 (age 18), 2 (19), 3 (20), 4 (21), 7 (25), 11 (29), and 17 (35) Measured genotypic data collected in two batches

n = 203 n = 71

Thus, the first step was to merge genetic data on both batches and link to AHB measures!

Page 4: Data management and preliminary analysis of SNPS

DATA MANAGEMENT:BACKGROUND DNA

Double helix comprised of 2 anti-parallel strands

Four nucleotides: A, C, G, T A corresponds with T G corresponds with C

SNPsSingle Nucleotide PolymorphismsRefer to variability for a given nucleotide for

different people: Andrew: G A C T G Amelia: G A T T G

This would be considered a C/T SNP

Page 5: Data management and preliminary analysis of SNPS

DATA MANAGEMENT ISSUES

Non A/T, C/G SNPs

PROBLEM!!!

Batch 1 (with top and bottom strand for each participant): ID Top Strand Bottom

Strand Moe A T Larry C G

Batch 2 (with top and bottom strand for each participant): ID Top Strand Bottom

Strand Curly A T Shemp C G

Top Strand Used for Both Batches: Type Freq %

A 2 50 C 2 50

Top Strand Used for One Batch, Bottom Strand Used for the Other: Type Freq %

A 1 25 C 1 25 G 1 25 T 1 25

Page 6: Data management and preliminary analysis of SNPS

DATA MANAGEMENT ISSUES (CONTINUED)

A/T, C/G SNPs

(A NOT AS OBVIOUS) PROBLEM!!

Batch 1 (with top and bottom strand for each participant): ID Top Strand Bottom

Strand Moe A T Larry T A

Batch 2 (with top and bottom strand for each participant): ID Top Strand Bottom

Strand Curly A T Shemp T A

Top Strands Used for Both Batches: Type Freq %

A 2 50 T 2 50

Top Strand Used for One Batch, Bottom Strand Used for the Other: Type Freq %

A 2 50 T 2 50

Page 7: Data management and preliminary analysis of SNPS

MERGING AND RECODING SNPS

Of the total 1208 SNPs in both batches, we identified and recoded:494 problematic Non A/T, C/G SNPs (41%)185 potentially problematic A/T, C/G SNPs

(15%) 6 SNPs “too close to call”

Page 8: Data management and preliminary analysis of SNPS

OVERVIEW AHB Data

Manage measured genotypic data Merge two batches Label SNPs

Preliminary analyses of SNPs Focus on dopaminergic system

DRD2

Page 9: Data management and preliminary analysis of SNPS

DATA MANAGEMENT:LABELING Why Label the SNPs?

Easier to find and use Ability to organize by gene, chromosome, etc.

How we labeled the SNPs: Used data from 3 different sites to identify:

Type of SNP (A/G, C/T, etc.) Chromosomal location Gene association (i.e., DRD2, GABRA, etc.) Function within gene (intron, untranslated region, etc.)

END RESULT: Accurate & Informative labels for all SNPs!

Page 10: Data management and preliminary analysis of SNPS

OVERVIEW AHB Data

Manage measured genotypic data Merge two batches Label SNPs

Preliminary analyses of SNPs Focus on dopaminergic system

DRD2

Page 11: Data management and preliminary analysis of SNPS

DATA ANALYSIS:FIRST STEPS Literature Review

Identify SNPs in systems relevant to substance use and substance use disorders (SUDs) Focus on dopaminergic system

Relevant to SUDs and related phenotypes Coding genotypes

Recode alleles to AA (homozygous major allele) Aa (heterozygous allele) aa (homozygous minor allele)

Also recoded based on previous studies E.g., AA vs. a*

Page 12: Data management and preliminary analysis of SNPS

DATA ANALYSIS: INITIAL FINDINGS Examined relation between SNPs, SUD

phenotypesE.g., Tobacco Dependence, DUDs, AUDs

DSM-III diagnoses to maintain consistency across waves

Excluding non-Caucasians (n=459)Adjusting for sexDue to small n of AHB for genotype data—

focus on effect size

Page 13: Data management and preliminary analysis of SNPS

DATA ANALYSIS: INITIAL FINDINGS Numerous SNPs showed associations

with SUDs, related phenotypes Exemplar SNP finding for today’s talk:

rs4648317G>A SNP within DRD2

Other DRD2 SNPs linked to AUDs (Munafo, Matheson, & Flint, 2007)

Associated with nicotine dependence in adolescents (Laucht et al., 2008) Minor allele carriers had higher dependence

scores

Page 14: Data management and preliminary analysis of SNPS

ASSOCIATION BETWEEN RS4648317 (GG = 0, A* = 1)AND CATEGORICALTOBACCO/DRUG/ALCOHOL PHENOTYPES

Tobacco Dependence Drug Use Disorder Alcohol Use Disorder

AgeOdds Ratio 95% CI

Odds Ratio 95% CI

Odds Ratio 95% CI

18 0.85 0.29-

2.47 4.03 1.27-12.8 1.05 0.53-2.07

19 1.24 0.47-

3.24 2.31 0.78-6.85 1.41 0.73-2.71

20 2.76 1.23-

6.19 5.27 1.89-14.7 1.38 0.70-2.72

21 2.02 0.94-

4.35 3.05 1.05-8.89 1.18 0.58-2.40

25 1.43 0.64-

3.18 2.78 0.85-9.16 1.31 0.59-2.93

29 1.01 0.44-

2.34 1.20 0.43-3.38 2.59 1.06-6.37

35 0.74 0.28-

1.95 1.81 0.39-8.35 2.07 0.96-4.45

Page 15: Data management and preliminary analysis of SNPS

Tobacco Dependence Inconsistent evidence of higher rates

of dependence among minor allele carriers

Tobacco Dependence Drug Use Disorder Alcohol Use Disorder

AgeOdds Ratio 95% CI

Odds Ratio 95% CI

Odds Ratio 95% CI

18 0.85 0.29-

2.47 4.03 1.27-12.8 1.05 0.53-2.07

19 1.24 0.47-

3.24 2.31 0.78-6.85 1.41 0.73-2.71

20 2.76 1.23-

6.19 5.27 1.89-14.7 1.38 0.70-2.72

21 2.02 0.94-

4.35 3.05 1.05-8.89 1.18 0.58-2.40

25 1.43 0.64-

3.18 2.78 0.85-9.16 1.31 0.59-2.93

29 1.01 0.44-

2.34 1.20 0.43-3.38 2.59 1.06-6.37

35 0.74 0.28-

1.95 1.81 0.39-8.35 2.07 0.96-4.45

Page 16: Data management and preliminary analysis of SNPS

Drug Use Disorder Consistent evidence of higher rates of

dependence among minor allele carriers, esp. in early twenties

427% Higher Odds for DUD at age 20

Tobacco Dependence Drug Use Disorder Alcohol Use Disorder

AgeOdds Ratio 95% CI

Odds Ratio 95% CI

Odds Ratio 95% CI

18 0.85 0.29-

2.47 4.03 1.27-12.8 1.05 0.53-2.07

19 1.24 0.47-

3.24 2.31 0.78-6.85 1.41 0.73-2.71

20 2.76 1.23-

6.19 5.27 1.89-14.7 1.38 0.70-2.72

21 2.02 0.94-

4.35 3.05 1.05-8.89 1.18 0.58-2.40

25 1.43 0.64-

3.18 2.78 0.85-9.16 1.31 0.59-2.93

29 1.01 0.44-

2.34 1.20 0.43-3.38 2.59 1.06-6.37

35 0.74 0.28-

1.95 1.81 0.39-8.35 2.07 0.96-4.45

Page 17: Data management and preliminary analysis of SNPS

Alcohol Use Disorder Evidence of higher rates of AUD among

minor allele carriers, esp. during late twenties, thirties

200%+ Higher Odds of AUD at Age 29Tobacco Dependence Drug Use Disorder Alcohol Use Disorder

AgeOdds Ratio 95% CI

Odds Ratio 95% CI

Odds Ratio 95% CI

18 0.85 0.29-

2.47 4.03 1.27-12.8 1.05 0.53-2.07

19 1.24 0.47-

3.24 2.31 0.78-6.85 1.41 0.73-2.71

20 2.76 1.23-

6.19 5.27 1.89-14.7 1.38 0.70-2.72

21 2.02 0.94-

4.35 3.05 1.05-8.89 1.18 0.58-2.40

25 1.43 0.64-

3.18 2.78 0.85-9.16 1.31 0.59-2.93

29 1.01 0.44-

2.34 1.20 0.43-3.38 2.59 1.06-6.37

35 0.74 0.28-

1.95 1.81 0.39-8.35 2.07 0.96-4.45

Page 18: Data management and preliminary analysis of SNPS

A DEVELOPMENTAL PERSPECTIVE: GENOTYPE X TIME INTERACTION

18 19 20 21 25 29 35-5%

0%

5%

10%

15%

20%

25%

30% Genotype = 0 (GG) (n=166)Genotype = 1 (A*) (n=72)

%Diagnosing

AGE

At age 18,when AUDs are more common, no difference between GG, A* allele carriers

Page 19: Data management and preliminary analysis of SNPS

A DEVELOPMENTAL PERSPECTIVE: GENOTYPE X TIME INTERACTION

18 19 20 21 25 29 35-5%

0%

5%

10%

15%

20%

25%

30% Genotype = 0 (GG) (n=166)Genotype = 1 (A*) (n=72)

%Diagnosing

AGE

During young adulthood, when AUDs are less common, sizeable difference in AUD prevalence by genotype

Page 20: Data management and preliminary analysis of SNPS

A DEVELOPMENTAL PERSPECTIVE: GENOTYPE X TIME INTERACTION

18 19 20 21 25 29 35-5%

0%

5%

10%

15%

20%

25%

30% Genotype = 0 (GG) (n=166)Genotype = 1 (A*) (n=72)

%Diagnosing

AGE

Group with risk allele show shallower decreases in AUD from 18-35

FINDINGS HIGHLIGHT IMPORTANCE OF DEVELOPMENTAL PERSPECTIVE!

Page 21: Data management and preliminary analysis of SNPS

RS4648317 AND OTHER ALCOHOL PHENOTYPES No significant association between

genotype andMax quantity, max drinks in a 24 hour

periodAverage quantity of drinking across time

However, significant associations between genotype andMax frequency, max quantity/frequency Average frequency, quantity/frequency

across time

Page 22: Data management and preliminary analysis of SNPS

RS4648317 AND OTHER SUD PHENOTYPES: SUMMARY Carriers of minor allele demonstrated:

Inconsistently higher rates of tobacco dependence

Consistently higher rates of drug use disorders

Higher rates of AUDs in young adulthood, shallower decreases in AUDs across time

Higher maximum and average frequency, quantity/frequency

Page 23: Data management and preliminary analysis of SNPS

FUTURE PLANS Continue to examine relation between

SNPs, SUD phenotypes in AHB Further refine understanding of potential link

between rs4648317 and relevant phenotypes E.g., Is relationship moderated by environmental

influences? Examine SNPs relevant to other systems

Serotonin GABA

Also examine the interplay of genetic and environmental factors on alcohol-related phenotypes Personality Drinking motives

Page 24: Data management and preliminary analysis of SNPS

THANK YOU! MARC ARTSS administrators and

coordinators Genetic Experts

Arpana AgrawalSean Kristjansson Ian Gizer

Dr. Sher’s LabPhil KennyMargie NicholasAlvaro AmeliaRachel SimonAngi Gail

Page 25: Data management and preliminary analysis of SNPS

AND A SPECIAL THANKS TO ANDREW AND MILES!