Download - Edinburgh Research Explorer · 2019. 7. 17. · UKB participants were given the option of supplying an e-mail address for receiving newsletters and invitations for online follow-up

Edinburgh Research Explorer

Factors associated with sharing e-mail information and mentalhealth survey participation in large population cohorts

Citation for published version:Adams, MJ, Hill, WD, Howard, DM, Dashti, HS, Davis, KAS, Campbell, A, Clarke, T-K, Deary, IJ, Hayward,C, Porteous, D, Hotopf, M & McIntosh, AM 2019, 'Factors associated with sharing e-mail information andmental health survey participation in large population cohorts', International Journal of Epidemiology.https://doi.org/10.1093/ije/dyz134

Digital Object Identifier (DOI):10.1093/ije/dyz134

Link:Link to publication record in Edinburgh Research Explorer

Document Version:Publisher's PDF, also known as Version of record

Published In:International Journal of Epidemiology

General rightsCopyright for the publications made accessible via the Edinburgh Research Explorer is retained by the author(s)and / or other copyright owners and it is a condition of accessing these publications that users recognise andabide by the legal requirements associated with these rights.

Take down policyThe University of Edinburgh has made every reasonable effort to ensure that Edinburgh Research Explorercontent complies with UK legislation. If you believe that the public display of this file breaches copyright pleasecontact [email protected] providing details, and we will remove access to the work immediately andinvestigate your claim.

Download date: 15. Jun. 2021

https://doi.org/10.1093/ije/dyz134https://doi.org/10.1093/ije/dyz134https://www-ed.elsevierpure.com/en/publications/0e152d5f-867e-4399-9c97-e74adbaf83f5

Original article

Factors associated with sharing e-mail

information and mental health survey

participation in large population cohorts

Mark J Adams,1* W David Hill,2,3 David M Howard,1,4 Hassan S Dashti,5

Katrina A S Davis,4,6,7 Archie Campbell,8,9 Toni-Kim Clarke,1

Ian J Deary,2,3 Caroline Hayward,10 David Porteous,2,8

Matthew Hotopf4,6,7 and Andrew M McIntosh1,2

1Division of Psychiatry, Royal Edinburgh Hospital, 2Centre for Cognitive Ageing and Cognitive

Epidemiology, 3Department of Psychology, University of Edinburgh, Edinburgh, UK, 4Institute of

Psychiatry, Psychology and Neuroscience, King’s College London, London, UK, 5Center for Genomic

Medicine, Massachusetts General Hospital, Boston, MA, USA, 6South London and Maudsley NHS

Foundation Trust, London, UK, 7NIHR Biomedical Research Centre, London, UK, 8Centre for Genomic

and Experimental Medicine, Institute of Genetics & Molecular Medicine, 9Usher Institute for

Population Health Sciences and Informatics, and 10MRC Human Genetics Unit, Institute of Genetics

and Molecular Medicine, University of Edinburgh, Edinburgh, UK

*Corresponding author. Division of Psychiatry, Kennedy Tower, Royal Edinburgh Hospital, University of Edinburgh,

Edinburgh EH10 5HF, UK. E-mail: [email protected]

Editorial decision 2 June 2019; Accepted 17 June 2019

Abstract

Background: People who opt to participate in scientific studies tend to be healthier,

wealthier and more educated than the broader population. Although selection bias does

not always pose a problem for analysing the relationships between exposures and dis-

eases or other outcomes, it can lead to biased effect size estimates. Biased estimates

may weaken the utility of genetic findings because the goal is often to make inferences in

a new sample (such as in polygenic risk score analysis).

Methods: We used data from UK Biobank, Generation Scotland and Partners Biobank

and conducted phenotypic and genome-wide association analyses on two phenotypes

that reflected mental health data availability: (i) whether participants were contactable by

e-mail for follow-up; and (ii) whether participants responded to follow-up surveys of

mental health.

Results: In UK Biobank, we identified nine genetic loci associated (P

Conclusions: Re-contact availability and follow-up participation can act as further genetic

filters for data on mental health phenotypes.

Key words: Selection bias, cohort studies, mental health, follow-up studies, genome-wide association study, UK

Biobank, Generation Scotland, Partners Biobank

Introduction

Selection bias in epidemiological and cohort studies occurs

when characteristics of individuals that influence their likeli-

hood of becoming or remaining as study participants are

also related to exposure to risk factors or to outcomes of

interest.1 Selection bias can be introduced at many stages of

a study, including at recruitment, at follow up, during re-

cord linkage or in non-response to questionnaires or tasks

and has the potential to lead to misestimates of phenotypic

and genetic associations.2 For example, a longitudinal study

of psychiatric traits identified several characteristics related

to loss-to-follow-up including: age; education; ancestry; geo-

graphical location; and the presence, severity and comorbid-

ity of anxiety and depression.3 There are several methods

for handling selection bias if and when it needs to be taken

into consideration. When all variables that influence selec-

tion and attrition are known, then bias can potentially be re-

duced or eliminated by conditioning on known variables or

including them as predictors.4 In longitudinal studies, tech-

niques such as inverse probability weighting, where observa-

tions that are similar to those that were lost to follow-up

contribute proportionally more to the analysis, can be used

to correct for selection bias.5 Given the importance of selec-

tion bias on inference, it is crucial to fully characterize it in

any given study population.

Initial ascertainment and re-contact have been demon-

strated to have a genetic basis. For example, individuals

who had a high genetic risk of schizophrenia (calculated

from polygenic risk scores) were less likely to complete

follow-up questionnaires or attend additional data collec-

tion sessions,6 and genetic propensity for other traits have

similar effects.7 Participation in large cohort studies is al-

ready known to have a ‘healthy volunteer’ effect,8 so we

sought to characterize the phenotypic and genetic correlates

of participation in follow-up studies focused on assessing

mental health traits. To this end, we analysed re-contact

and participation in three studies: the Mental Health

Questionnaire (MHQ) online follow-up in UK Biobank9

(N¼ 371 417–373 478), the Stratifying Resilience andDepression Longitudinally (STRADL) study in Generation

Scotland10 (N¼ 19 994) and the Partners Biobank11

(N¼ 15 925). We conducted phenotypic and genome-wideassociation analyses in UK Biobank to determine how par-

ticipants who completed the MHQ differed from the rest of

the sample. We also analysed factors related to whether UK

Biobank participants were contactable by e-mail, as e-mail

invitations were the primary method of recruitment into the

MHQ follow-up. We used participation in the STRADL

questionnaire follow-up in Generation Scotland and a

health information survey follow-up questionnaire in the

Partners Biobank as replication data sets for genetic

findings.

Conducting genetic analyses of selection bias and loss-

to-follow-up can complement and add to existing knowl-

edge gained by comparing biobank cohorts with national

statistics and published disease incidences and by compar-

ing follow-up responders and non-responders on key char-

acteristics. A participant’s decision to continue to engage

in a research study is likely to be multifactorial. Genetic

analyses are a pragmatic first step in indicating what the

many contributing factors are, since genome-wide associa-

tion summary statistics can be efficiently compared with

those from hundreds of other studies.

Genetic analyses can be revealing in other ways. First, ge-

netic and environmental factors may have different magni-

tudes or directions of association with follow-up participation.

Thus, genetic studies of follow-up samples may differ in the

degree to which they are susceptible to selection bias,

Key Messages

• Large cohort studies show a ‘healthy volunteer bias’ and this type of selection bias has a polygenic basis.

• Participants who take part in follow-up studies of mental health differ from participants who do not, and tend to be

healthier, better educated and to have a family history of dementia and/or depression.

• Genetic factors that positively associate with follow-up survey participation are positively related to cognitive function

and psychological well-being and negatively related to psychiatric disorders.

2 International Journal of Epidemiology, 2019, Vol. 0, No. 0

Dow

nloaded from https://academ

ic.oup.com/ije/advance-article-abstract/doi/10.1093/ije/dyz134/5526901 by Edinburgh U

niversity user on 17 July 2019

compared with phenotypic studies. Second, a genetic study

makes it possible to evaluate selection bias from traits that are

only measured in a follow-up sample. For example, the

Mental Health Questionnaire in UK Biobank includes evalua-

tions of depression, anxiety, addiction and trauma that were

not measured at baseline (so it is not possible to directly com-

pare responders and non-responders on these traits).

Comparisons between responders and non-responders can

even be made for traits that are rare or not even measured in

the Biobank. Genetic analyses can be correlated with external

genome-wide summary statistics to elucidate the role of liabil-

ity to disorders that are rare in most biobank samples, such as

anorexia and schizophrenia. Finally, genetic summary statistics

for follow-up response in a large sample in UK Biobank can

become the basis for the analysis of selection bias in other ge-

netic cohorts.

Methods

Samples

UK Biobank (UKB) is a population-based study of health

in middle-aged and older individuals (N¼ 502 616).Eligible participants were aged 40 to 69 and recruited from

22 assessment centres in the UK. UK Biobank received

ethical approval from the Research Ethics Committee (ref-

erence 11/NW/0382). The present study was conducted

under UK Biobank application 4844.

Generation Scotland: Scottish Family Health Study (GS:

SFHS) is a family-based cohort (N¼ 24 091) recruitedthrough general practitioners in Scotland.12,13 Eligible par-

ticipants were aged 18 years or older who were able to re-

cruit one or more family members into the study. GS:

SFHS received ethical approval from the Tayside Research

Ethics Committee (reference 05/S1401/89).

Partners Biobank is a hospital-based cohort study from

the Partners HealthCare hospitals with electronic medical

records and genetic data supplemented with electronic

health and lifestyle surveys.11 Recruitment started in 2010

(N¼ 78 726 in 2018) and is ongoing, participating acrossseveral clinics including Brigham and Women’s Hospital

and Massachusetts General Hospital. All participants pro-

vided consent upon enrolment. The current analysis was

restricted to adults aged 18 years or older and of European

ancestry14 with high-quality genotyping data at the time of

analysis.

Re-contact and participation measures

During recruitment and baseline assessment (2006–10),

UKB participants were given the option of supplying an

e-mail address for receiving newsletters and invitations for

online follow-up assessments. Of the 317 785 participants

who supplied an e-mail address, 294 738 provided a usable

one but the remaining 23 047 either provided a syntacti-

cally incorrect or non-existent e-mail address or asked that

their e-mail address be withdrawn. An e-mail address was

not provided by 184 831 UKB participants during baseline

assessment. This variable is called ‘e-mail access’ in the UK

Biobank documentation (field 20005), but we refer to this

phenotype as ‘e-mail contact’. Although additional UK

Biobank participants have subsequently provided an e-mail

address for re-contact, here we analyse the baseline avail-

ability of e-mail contact so that it can be related to other

baseline factors that were captured contemporaneously.

Starting in 2016, UKB participants who had provided

e-mail contact were sent an invitation to an online Mental

Health Questionnaire (MHQ) entitled ‘thoughts and feel-

ings.’9 Participants who had not started the questionnaire

or had only partially completed it were sent reminder

e-mails after 2 weeks and again after 4 months.

Participants also received information about the MHQ in a

postal newsletter with instructions on how to participate.

From data supplied by UK Biobank on 12 June 2018,

157 396 participants had completed the MHQ. Responses

to the MHQ were submitted between July 2016 and July

2017. Mean time between baseline assessment and MHQ

follow-up was 7.5 years (range 5.9–11.2 years). We refer to

this phenotype as ‘MHQ data’.

In 2015, GS: SFHS participants were sent a questionnaire

package by post as part of the Stratifying Resilience and

Depression Longitudinally (STRADL) project, with the aim

of studying psychological resilience.10 Participants were eligi-

ble for follow-up if they had consented to re-contact and if

they had a Community Health Index (CHI) number. Of the

21 525 eligible participants, 9618 responded to the question-

naire, from which we coded a ‘STRADL data’ phenotype.

In the Partners Biobank, following enrolment, partici-

pants were invited to complete the Partners Biobank

Health Information Survey, an optional online lifestyle,

environment and family history survey.14 Of the 15 925

participants of European ancestry with genetic data at the

time of analysis, 6639 responded to the questionnaire.

Phenotype analysis

Demographic and health differences between responders

and non-responders to the STRADL survey have been ana-

lysed previously and showed that, among other differences,

participants who were women, non-smokers or who had

low levels of psychological distress were more likely to re-

spond. We thus first conducted a similar analysis in UK

Biobank. We ran logistic regressions for e-mail contact and

MHQ data using R 3.5.0.15 We examined associations

International Journal of Epidemiology, 2019, Vol. 0, No. 0 3

Dow




with age at initial assessment, sex, geographical region, ed-

ucational qualification, smoking, alcohol consumption,

number of diagnoses in linked electronic health records

and family history of dementia and depression (see

Supplementary Information, available as Supplementary

data at IJE online, for regression input coding).

Genome-wide association, linkage disequilibrium

score analysis, and replication analysis

We conducted genome-wide association studies (GWAS) on

the UKB e-mail contact and MHQ data phenotypes and

conducted gene-based association and gene-set analyses (see

Supplementary Information, available as Supplementary

data at IJE online). We calculated a genomic control factor

(kGC)16 for each set of GWAS results, which measures the

inflation in test statistics above what would be expected by

chance. Inflation in test statistics can caused both by a large

number of genetic variants having an association with each

trait (polygenicity) and by confounding factors, including

population stratification and relatedness within the sample.

We used linkage disequilibrium (LD) score regression17 to

distinguish polygenicity from confounding. LD score regres-

sion exploits the increase in association test statistics for ge-

netic loci that are closely linked in the region surrounding

each causal genetic variant (indicating polygenicity) to dis-

tinguish from confounding, which is expected to inflate test

statistics evenly across the whole genome. The intercept

from an LD score regression quantifies the test statistic infla-

tion from confounding factors, where an intercept estimate

close to 1.0 indicates no confounding. We also used LD

score regression to estimate the proportion of variance in

these traits attributable to common genetic variants [also re-

ferred to as single nucleotide polymorphism (SNP)

heritability] and calculated genetic correlations with 235

traits using LD Hub. We used false discovery rate to correct

for multiple testing. To test for possible effects of mortality

on loss-to-follow-up, we used the death register to identify

participants whose death occurred before the MHQ assess-

ment (N¼10 623). We then ran a GWAS on MHQ datawith these participants removed.

In the replication data sets (Generation Scotland and

Partners Biobank) we first tested for replication of inde-

pendent SNPs (r2 ¼ 0.1, 250 kb window) after Bonferronicorrection. We calculated the expected power of replica-

tion using the Genetic Association Study power calcula-

tor.18 Following that, we tested for replication of direction

of effect by performing a binomial test for the number of

SNPs with the same direction of effect between the UK

Biobank and Partners association results. We also calcu-

lated LD score genetic correlations17 between the UK

Biobank and Generation Scotland summary statistics, to

estimate genome-wide similarity in phenotypes between

these studies.

Results

Phenotypic associations of e-mail contact and

mental health follow-up (MHQ) data in UK

Biobank

We conducted logistic regressions on e-mail contact (valid

e-mail address provided vs no valid e-mail address pro-

vided) and MHQ participation (those that had completed

the MHQ vs those that had not completed the MHQ) in

UK Biobank, examining the effects of age, sex, geographi-

cal region, educational attainment, alcohol consumption,

smoking status and personal and family history of disease.

We retained participants with complete data for analysis

(N¼ 373 478). Odds ratios from the logistic regressionsare listed in Table 1. Women in UK Biobank were less

likely to have provided an e-mail address but more likely

to take part in the MHQ. There was regional variation in

e-mail contact and MHQ data. Individuals who attended

assessment centres in Greater London and the South West

of England were the most likely to have provided an e-mail

address, whereas individuals from assessment centres in

the North East of England and Scotland were the least

likely. Individuals with greater educational attainment,

those who were not current smokers, those with a fewer

number of hospital diagnoses and those with a family his-

tory of dementia or severe depression were more likely to

have e-mail contact and to have MHQ data.

Genome-wide association analysis of e-mail

contact and MHQ data in UK Biobank

After filtering UK Biobank individuals to a White, British,

unrelated sample, the sample size was N¼ 371 417 for theGWAS of e-mail contact and N¼ 371 428 for the GWAS ofMHQ data. After clumping, there were nine loci

(P� 5� 10�8) for e-mail contact (Figure 1, Table 2;Supplementary Table S1, available as Supplementary data at

IJE online) and 25 for MHQ participation (Figure 2, Table 3;

Supplementary Table S11, available as Supplementary data at

IJE online). The kGC was 1.29 for e-mail contact and 1.37 for

MHQ data. The LD score intercept for e-mail contact and for

MHQ data in UK Biobank was 1.013 [standard error (SE)

0.008] and 1.020 (SE 0.008). respectively. This yielded infla-

tion ratios indicating that only 3.7% (SE 0.025) and 4.3%

(SE 0.020) of the inflation in test statistics for e-mail contact

and MHQ data was caused by confounding factors, and thus

most of the inflation in test statistics was attributed to a large

number of genetic loci influencing both traits (polygenicity).


Dow




https://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-data

Loci discovery and annotation of the e-mail

contact and MHQ phenotypes

The nine loci associated with e-mail contact were found to

contain an over-representation of SNPs found in ncRNA

intronic regions (57.5%), as well as SNPs found in intronic

regions (28.4%) (Supplementary Figure S1 and Supplementary

Table S1, available as Supplementary data at IJE online).

Evidence was also found that these loci contained regula-

tory regions of the genome, indicated by 32.0% of the

SNPs in the genomic loci having RegulomeDB (RDB) less

Table 1. Logistic regression on e-mail contact and MHQ data in UK Biobank (N¼373 478). Regression coefficients are expressedas odds ratios for increased probability of having e-mail contact and increased probability of having MHQ data

E-mail contact MHQ data

Variable N OR (SE) 95% CI OR (SE) 95% CI

Age (SD) 373 478 0.85 (0.004) 0.846–0.861 1.01 (0.004) 0.998–1.014

Sex Female 211 768 1 � 1 �Male 161 710 1.11 (0.010) 1.093–1.131 0.90 (0.008) 0.883–0.914

Region East Midlands 25 307 1 � 1 �Greater London 50 795 1.85 (0.032) 1.785–1.909 1.13 (0.022) 1.088–1.173

North East 27 594 0.49 (0.008) 0.470–0.501 0.87 (0.018) 0.835–0.904

North West 54 053 0.81 (0.013) 0.781–0.833 0.84 (0.012) 0.817–0.866

Scotland 27 557 0.42 (0.009) 0.405–0.439 0.83 (0.017) 0.800–0.866

South East 34 114 0.84 (0.016) 0.805–0.867 1.13 (0.020) 1.088–1.165

South West 33 410 1.13 (0.021) 1.087–1.171 1.08 (0.020) 1.042–1.121

Wales 15 741 0.58 (0.013) 0.558–0.611 0.83 (0.020) 0.796–0.873

West Midlands 33 042 0.63 (0.011) 0.606–0.649 0.83 (0.016) 0.799–0.862

Yorkshire 71 865 1.00 (0.016) 0.967–1.028 0.93 (0.014) 0.900–0.957

Qualifications None 53 654 1 � 1 �GCSE 124 377 2.35 (0.028) 2.297–2.408 2.29 (0.029) 2.230–2.342

A Levels 44 132 3.43 (0.048) 3.338–3.525 3.53 (0.057) 3.421–3.642

Other 19 583 2.53 (0.042) 2.451–2.616 2.72 (0.052) 2.620–2.823

College/university 131 732 4.27 (0.054) 4.163–4.375 4.43 (0.056) 4.322–4.541

Smoking Never 210 858 1 � 1 �Previous 126 802 1.13 (0.009) 1.116–1.152 1.06 (0.008) 1.042–1.074

Current 35 818 0.71 (0.009) 0.689–0.723 0.73 (0.010) 0.706–0.744

Alcohol Units/week (SD) 373 478 1.05 (0.004) 1.038–1.053 1.03 (0.005) 1.021–1.039

Anthropometry Body mass index (SD) 373 478 0.95 (0.004) 0.940–0.953 0.88 (0.004) 0.877–0.893

Diagnoses, yes (vs no)

Mental disorder 24 668 0.75 (0.011) 0.729–0.774 0.68 (0.012) 0.654–0.701

Injury 59 706 0.90 (0.007) 0.881–0.909 0.83 (0.009) 0.815–0.851

Other disease 278 019 0.95 (0.009) 0.929–0.963 0.91 (0.009) 0.889–0.923

Family history, yes (vs no)

Alzheimer’s/dementia 52 238 1.18 (0.013) 1.157–1.208 1.22 (0.013) 1.198–1.250

Severe depression 54 651 1.04 (0.011) 1.022–1.066 1.11 (0.012) 1.084–1.131

Figure 1. Manhattan plot of e-mail contact in UK Biobank.


Dow




https://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-data

than 2, indicating that genetic variation in these loci is

likely to affect gene expression. Finally, 77.6% of the SNPs

within the independent genomic loci had a minimum chro-

matin state of

Collectively these mapping strategies identified 70 unique

genes, of which 21 were implicated by two mapping strate-

gies and 10 were implicated by all three. A total of five

genes, TNNI3K, LRRIQ3, NEGR1, FPGT and FPGT-

TNNI3K, were implicated using all three methods and

showed evidence of a chromatin interaction between two in-

dependent genomic risk loci (Supplementary Table S4,

available as Supplementary data at IJE online). Gene-based

statistics derived in MAGMA indicated a role for 72 genes

(Supplementary Table S5, available as Supplementary data

at IJE online), four of which overlapped with genes impli-

cated by all three mapping strategies (Supplementary Figure

S3, available as Supplementary data at IJE online).

For the MHQ data phenotype, positional mapping impli-

cated 42 genes, with eQTL mapping indicating a role for 86

genes. Chromatin interaction mapping annotated a total of

124 genes (Supplementary Tables S14 and S15, Supplementary

Figure S6a–m, available as Supplementary data at IJE online).

Across these three mapping strategies, 181 unique genes were

identified, with 46 of these being implicated by two mapping

strategies and 25 being implicated by all three. MAGMA was

also used and indicated a role for 81 genes (Supplementary

Figure S4 and Supplementary Table S15, available as

Supplementary data at IJE online). Fifteen of these 81

genes overlapped with those identified using the three map-

ping strategies.

Gene-set and gene property analysis

The presynaptic membrane gene-set was enriched for the e-

mail contact phenotype (P¼5.19� 10�7) (SupplementaryTable S6, available as Supplementary data at IJE online).

Gene property analysis showed a relationship between expres-

sion in the Epstein-Barr virus (EBV)-transformed lymphocyte

cells (P¼ 9.24� 10�4) and for gene expression in the earlymid-prenatal time of life (P¼ 0.004) (Supplementary TablesS9 and S10, available as Supplementary data at IJE online).

For the MHQ data phenotype none of the gene sets were

enriched (Supplementary Table S16, available as

Supplementary data at IJE online). However, gene property

analysis indicated a relationship between gene expression in

the brain and the MHQ phenotype (P¼ 2.64� 10�4)(Supplementary Table S17, available as Supplementary data

at IJE online). When examining the specific tissue gene

groupings, this relationship was driven by expression

change in the cerebellar hemisphere (P¼ 8.52� 10�6) and

Table 3. Top lead SNPs associated with MHQ data. Direction of effects are listed for the UK Biobank discovery sample and the

Generation Scotland and Partners Biobank replication samples as either positive (þ) or negative (�)

Chr SNP Location (bp) A1/A2 Freq. OR (SE) P-value Direction

1 rs7542974 72, 544, 704 A/G 0.25 1.032 (0.006) 3.8 � 10�8 þþþ1 rs485929 74, 678, 285 G/A 0.39 1.028 (0.005) 3.7 � 10�8 þ�þ1 rs532246 84, 411, 238 G/A 0.74 0.968 (0.005) 7.0 � 10�9 �þ�1 rs2789111 243, 346, 404 C/T 0.38 0.968 (0.005) 1.5 � 10�10 ��þ2 rs35028061 49, 479, 987 GT/G 0.38 1.029 (0.005) 1.9 � 10�8 þ��3 rs9917656 48, 581, 513 C/T 0.30 1.03 (0.006) 3.2 � 10�8 þþ�3 rs13082026 52, 962, 681 T/C 0.44 0.972 (0.005) 2.4 � 10�8 ��þ4 rs57692580 106, 214, 476 A/T 0.39 0.973 (0.005) 2.8 � 10�8 �þþ5 rs34635 60, 513, 501 G/A 0.42 0.972 (0.005) 1.2 � 10�8 ��5 rs146681214 133, 867, 867 AC/A 0.18 1.039 (0.007) 3.6 � 10�9 þþþ5 rs2336897 167, 050, 276 T/C 0.69 1.031 (0.005) 5.2 � 10�9 þþ�6 rs3993747 31, 580, 507 G/A 0.35 0.969 (0.005) 9.5 � 10�10 ��6 rs59732267 98, 432, 302 CA/C 0.52 0.972 (0.005) 2.5 � 10�8 ��8 rs28716319 83, 269, 854 G/A 0.28 1.031 (0.005) 2.7 � 10�8 þ�þ8 rs13262595 143, 316, 970 G/A 0.56 1.03 (0.005) 1.0 � 10�9 þþþ9 rs6474966 15, 757, 537 A/G 0.46 1.028 (0.005) 2.8 � 10�8 þþþ9 rs11793831 23, 362, 311 T/G 0.42 1.027 (0.005) 4.3 � 10�8 þ�þ11 rs1984389 31, 740, 989 C/A 0.54 0.973 (0.005) 2.4 � 10�8 ��11 rs10791143 131, 278, 676 G/A 0.62 1.034 (0.005) 1.5 � 10�11 þþþ16 rs4616299 7, 657, 432 G/A 0.40 0.972 (0.005) 1.2 � 10�8 ��17 rs56058331 56, 427, 128 A/G 0.42 1.029 (0.005) 1.0 � 10�8 þþþ18 rs1261078 52, 866, 791 G/A 0.05 0.927 (0.010) 5.6 � 10�12 �þ�19 rs34232444 4, 965, 404 C/T 0.35 1.029 (0.005) 2.5 � 10�8 þþ�19 rs3746187 18, 279, 816 G/A 0.40 0.968 (0.005) 9.8 � 10�11 ��19 rs429358 45, 411, 941 C/T 0.15 0.942 (0.006) 4.6 � 10�19 ��

A1, effect allele; A2, non-effect allele; Chr, chromosome; Freq., frequency of effect allele; OR, odds ratio; SE, standard error.


Dow




https://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-data

the cerebellum (P¼1.27� 10�5) (Supplementary TableS18, available as Supplementary data at IJE online). A rela-

tionship between gene expression in the early prenatal life-

span (P¼ 0.002) and the early mid-prenatal lifespan wasalso found (P¼ 5.33�10�4) (Supplementary Table S19,available as Supplementary data at IJE online).

LD score regression analysis

We used LD score regression to estimate SNP heritability

from the GWAS results. Heritability on the liability scale for

e-mail contact was 0.073 (0.004 SE) and for MHQ data

was 0.099 (0.004 SE). The genetic correlation between

e-mail contact and MHQ data was 0.822 (0.020 SE).

We used LD Hub19 to estimate genetic correlations

with a large number of other traits. Both e-mail contact

and having MHQ data were genetically correlated with a

broad spectrum of traits. Results for an illustrative set of

traits are plotted in Figure 3 and the results for all traits are

listed in Supplementary Table S21, available as

Supplementary data at IJE online. For most anthropomet-

ric, behavioural, cognitive, psychiatric, health-related and

life history traits the direction of the genetic correlations

with e-mail contact and MHQ participation was the same.

In general, genetic factors associated with providing an

e-mail address for re-contact to UK Biobank and taking

part in the MHQ were also associated with better health,

higher intelligence, lower burden of psychiatric disorders

and a slower life-history (e.g. later age at menarche, age at

first birth and age at menopause). Both e-mail contact and

MHQ participation were not genetically correlated with

any traits categorized as bone, kidney, uric acid and metals

(transferrin/ferritin). Additionally, e-mail contact was not

genetically correlated with glycaemic traits and MHQ data

availability was not genetically correlated with hormone or

metabolite phenotypes.

Effect of mortality on MHQ genetic associations

To test for the role of mortality on our findings, we re-ran

the genome-wide association analysis of MHQ data avail-

ability after removing participants whose dates of death

occurred before the MHQ assessment. The overall infla-

tion in association test statistics including and excluding

deceased participants was identical (mean v2 ¼ 1.438) andthe genetic correlation between the two sets of summary

statistics was 0.9996 (SE ¼ 0.0002). We compared the topindependent associated SNPs in the GWAS in the larger

sample with those that excluded deaths (Supplementary

Table S24 and Figure S7, available as Supplementary data

at IJE online). Although there were three SNPs that no lon-

ger passed the criterion for genome-wide significance, there

was no appreciable change in the effect sizes estimates for

any of the SNPs

Replication in generation Scotland and partners

Biobank

We examined whether any of the associations results for

the e-mail and MHQ data phenotypes replicated in an in-

dependent sample, using whether members of Generation

Scotland participated in the STRADL follow-up of mental

health. At an alpha criterion of 0.05/34 and an average ge-

notype relative risk of 1.015, there was 4% power to repli-

cate in Generation Scotland and 2% power in Partners

Biobank, and replicating the UK Biobank findings requires

approximately 200 000 cases and controls to achieve 90%

power.18 None of the independent SNPs in the UKB

GWASs replicated in Generation Scotland after Bonferroni

correction (34 tests) (Supplementary Tables S22 and S23,

available as Supplementary data at IJE online). We ob-

served replication evidence for one independent SNP

(rs9917656, 6.2� 10–4) in Partners Biobank afterBonferroni correction (Supplementary Tables S22 and S23,

available as Supplementary data at IJE online). Between

UK Biobank and Partners Biobank, more of the SNPs for

survey participation had the same direction of effect than

expected (20/25, exact binomial test P-value ¼ 0.002).Furthermore, the STRADL data phenotype was moder-

ately genetically correlated with both UKB e-mail contact

(rg ¼ 0.430, SE ¼ 0.112, P¼ 0.0001) and UKB MHQ data(rg ¼ 0.619, SE ¼ 0.130, P¼ 1.98� 10–6) and had an SNPheritability on the liability scale of 0.112 (SE 0.0408).

Discussion

Using data from UK Biobank, we found that individuals

who provided an e-mail address for re-contact and who

participated in follow-up surveys of mental health differed

from those who did not with regards to demographic, psy-

chological, health, lifestyle and genetic factors. The UK

Biobank sample differs from the UK population,20 and our

results show that ascertainment processes also exert an ef-

fect on follow up assessments. Most of the phenotypic and

genetic associations were in the same direction. These

results were not due to population stratification, as only

4% of the inflation in GWAS statistics could be attributed

to factors other than polygenic heritability. Having greater

educational attainment, being a non-smoker or a former

smoker, having fewer hospital diagnoses of illness or injury

and having a family history of dementia or a family history

of serious depression all predicted greater likelihood of

providing e-mail contact information. Furthermore, those

variables were also associated with providing responses to


Dow




https://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-data

the online Mental Health Questionnaire (MHQ).

Importantly for understanding the composition of the

MHQ subset, having an inpatient diagnosis of a mental

disorder was associated with lower participation rates in

the MHQ [OR ¼ 0.68, 95% confidence interval (CI) ¼0.65-0.70], and this was a larger effect size than other

hospital diagnoses, specifically injury (OR ¼ 0.83) and

non-psychiatric disorders (OR ¼ 0.91). A few effects wentin the opposite direction between the e-mail contact and

MHQ data variables, with men and younger individuals

more likely to provide an e-mail address to UK Biobank,

whereas women were more likely to provide MHQ data.

E-mail contact and MHQ data availability had SNP

heritabilities of 7.3% and 9.9%, respectively. We

Figure 3. LD score genetic correlations (rg) with e-mail contact (triangle) and MHQ data (circle), with 95% confidence intervals.


Dow




identified nine independent SNPs associated with e-mail

contact and 25 for MHQ data, more than for many GWAS

studies of disease traits in the same sample. Loci for both

phenotypes were mostly located within regulatory regions.

Of particular interest was the association of MHQ data

availability with the apolipoprotein E (APOE) e4 genotype

that is a major risk factor for Alzheimer’s disease.21 One

SNP associated with MHQ data replicated in the Partners

Biobank sample. The SNP, rs9917656, is in an intron in

the 6-phosphofructo-2-kinase/fructose-2, 6-biphosphatase

4 (PFKFB4), a signally enzyme involved in switching be-

tween different forms of carbohydrate metabolism.22

However, several other genes are implicated in this locus

by positional mapping (genomic locus 6 in Supplementary

Table S13, available as Supplementary data at IJE online).

Given the effect sizes found in the discovery sample, both

Generation Scotland and Partners were underpowered for

replicating association results. However, the consistent

directions of effect in the Partners cohort and the strong ge-

netic correlation between STRADL participation and the

e-mail contact and MHQ data phenotypes, suggest that

similar genetic factors are driving participation in follow-

up studies.

E-mail contact and MHQ data shared similar genetic

correlations with other traits. There were strong genetic

correlations between e-mail contact and indicators of cog-

nitive ability (college completion, rg ¼ 0.76; intelligence,rg ¼ 0.73). Contact and data availability were also geneti-cally associated with a lower burden of genetic risk for

mental illness and lower BMI. These results were in the

same direction as the phenotypic analysis. The negative ge-

netic correlation with schizophrenia matches results from

follow-up participation in the ALSPAC cohort using poly-

genic risk scores,6 and suggests that this association is not

specific to schizophrenia.

The similarity in the results for phenotypic and genetic

factors associated with e-mail contact and MHQ data

shows that the availability of an individual to be contacted

by e-mail, and their choice to participate, both act as a fil-

ter for selection into the subsample of UK Biobank with

Mental Health Questionnaire data. Notably, self-reports

of a family history of dementia and a family history of se-

vere depression were more common in e-mail providers

and MHQ completers, but individual genetic associations

with both these disorders showed negative correlations.

Individuals who reported dementia or severe depression in

their family were therefore more likely to be MHQ partici-

pants, even though having a personal genetic predisposi-

tion to these disorders may also decrease their likelihood

of participating. Knowledge of family history may be a

strong motivational factor for participating in follow-up

surveys of mental health.

Our sample was large enough that we were able to iden-

tify specific genetic loci that were related to participation in

follow-up studies of mental health. We were also able to ana-

lyse the genetics of one particular factor (the availability of e-

mail contact for receiving invitations) that is heavily involved

in the specific process of follow-up participation. However, a

limitation of our analysis is that information on e-mail con-

tact was available for participants at baseline only, and thus

did not distinguish the entire subset of participants who

would have received an e-mail invitation. Another limitation

is that information from electronic health records only cov-

ered hospital admissions and thus would underestimate asso-

ciations with milder health conditions. Our study also does

not address factors that would differentially influence partici-

pation of individuals of non-European ancestry.

Individuals in large epidemiological cohorts who partici-

pate in follow-up surveys differ in their patterns of pheno-

typic and genetic association with traits of interest from

those who do not. Because most factors had a consistent re-

lationship with the two-step selection process (contactability

by e-mail and opting to participate in follow-up), it is likely

that these same factors may also differentiate people who

choose to become part of the cohort in the first place from

other people in the larger population. These factors are very

likely to bias the selection of individuals for inclusion in

population-based studies towards those with positive family

histories but lower personal genetic risk of mental health

conditions such as depression and dementia. Analysing vari-

ables within a follow-up study may have the effect inducing

statistical dependence or attenuating estimates of the rela-

tionships among variables.2 Figure 4a illustrates a

hypothesized causal model where a polygenic risk score

(PRS) influences a phenotype or outcome Y via an interme-

diate phenotype X. This model could be tested by d-separa-

tion23: if the model is true, then regressing Y on X will result

in conditional independence of PRS and Y. Figure 4b illus-

trates a scenario analysing the effect of the PRS where par-

ticipation in follow-up is a collider for the two phenotypes

when they do not have a causal relationship with each

other. Analysing data only within the follow-up sample cre-

ates non-independence between the X and Y traits and thus

between PRS and Y. Even when one trait causes the other,

conditioning on follow-up participation can bias the esti-

mate of PRS on the downstream trait (Figure 4c). A scenario

where only one of the traits causes follow-up would not re-

sult in biased estimates of the effects of PRS (Figure 4d).

Going forward, studies should evaluate (e.g. using simu-

lations2) the particular effects that selection and attrition

might have on effect estimates and, where available, check

results from follow-up assessments against those from base-

line data, even in the cases where the follow-up data provide

better or more comprehensive measures of phenotypes of


Dow




https://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttps://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-data

interest. Because continued participation in large cohorts

studies recapitulates the ‘healthy volunteer’ effect, compar-

ing responders and non-responders in follow-up surveys

may be a useful way to analyse how selection bias may in-

fluence the generalizability and accuracy of findings.

Data availability

GWAS summary statistics for the UK Biobank e-mail con-

tact and Mental Health Questionnaire data are available

from [https://doi.org/10.7488/ds/2554]. Underlying study

data are available to bona fide researchers from UK

Biobank [https://www.ukbiobank.ac.uk/], Generation

Scotland [http://www.generationscotland.org/] and

Partners HealthCare Biobank [https://biobank.partners.

org].

Supplementary Data

Supplementary data are available at IJE online.

Funding

M.J.A. and A.M.Mc. are supported by MRC Mental Health Data

Pathfinder award (reference MC_PC_17209) and the Wellcome

Trust Strategic Award ‘STratifying Resilience and Depression

Longitudinally’ (STRADL) (reference 104036/Z/14/Z). Analysis

conducted under UK Biobank application 4844. W.D.H. is sup-

ported by a grant from Age UK (Disconnected Mind Project). I.J.D.

is supported by the Centre for Cognitive Ageing and Cognitive

Epidemiology, which is funded by the Medical Research Council

and the Biotechnology and Biological Sciences Research Council

(reference MR/K026992/1). D.M.H. is supported by a Sir Henry

Wellcome Postdoctoral Fellowship (reference 213674/Z/18/Z) and a

2018 NARSAD Young Investigator Grant from the Brain &

Behavior Research Foundation (ref: 27404). K.A.S.D. and M.H. are

supported by NIHR Biomedical Research Centre at South London

and Maudsley NHS Foundation Trust and King’s College London.

Generation Scotland received core support from the Chief Scientist

Office of the Scottish Government Health Directorates (CZD/16/6)

and the Scottish Funding Council (HR03006). Genotyping of the

GS: SFHS samples was carried out by the Genetics Core Laboratory

at the Wellcome Trust Edinburgh Clinical Research Facility,

University of Edinburgh, Scotland and was funded by the Medical

Research Council UK and the Wellcome Trust [Wellcome Trust

Strategic Award ‘Stratifying Resilience and Depression

Longitudinally’ (STRADL), reference 104036/Z/14/Z].

AcknowledgementsWe thank the participants of UK Biobank, Generation Scotland and

Partners Biobank. This work has made use of the resources provided

by the Edinburgh Compute and Data Facility (ECDF) [http://www.

ecdf.ed.ac.uk/].

Conflict of interest: I.J.D. was a participant in UK Biobank. The

other authors declare no conflicts of interest.

References

1. Hernán MA, Hernández-Dı́az S, Robins JM. A structural ap-

proach to selection bias. Epidemiology 2004;15:615–25.

2. Munafò MR, Tilling K, Taylor AE, Evans DM, Davey Smith G.

Collider scope: when selection bias can substantially influence

observed associations. Int J Epidemiol 2018;47:226–35.

3. Lamers F, Hoogendoorn AW, Smit JH et al. Sociodemographic

and psychiatric determinants of attrition in the Netherlands

Study of Depression and Anxiety (NESDA). Compr Psychiatry

2012;53:63–70.

4. Gelman A, Hill J. Data Analysis Using Regression and

Multilevel/Hierarchical Models. New York, NY: Cambridge

University Press, 2007.

5. Robins JM, Hernán MÁ, Brumback B. Marginal structural mod-

els and causal inference in epidemiology. Epidemiology 2000;

11:550–60.

6. Martin J, Tilling K, Hubbard L et al. Association of genetic risk

for schizophrenia with nonparticipation over time in a

population-based cohort study. Am J Epidemiol 2016;183:

1149–58.

7. Taylor AE, Jones HJ, Sallis H et al. Exploring the association of

genetic factors with participation in the Avon Longitudinal Study

of Parents and Children. Int J Epidemiol 2018;47:1207–16.

8. Rothman KJ, Gallacher JE, Hatch EE. Why representativeness

should be avoided. Int J Epidemiol 2013;42:1012–14.

9. Davis KAS, Coleman JRI, Adams M et al. Mental health in UK

Biobank: Development, implementation and results from an

Figure 4. Possible effects of selection bias on polygenic risk score analy-

ses in follow-up studies. (A) Causal model to be tested where PRS

causes phenotype Y via phenotype X. (B) Worst-case scenario where

PRS influences X but not Y and both phenotypes cause follow-up partic-

ipation. Analysing only follow-up participants is the same as condition-

ing on F, which induces a correlation between PRS and Y. (C) More

likely scenario, where both X and Y cause follow-up participation.

Conditioning on F attenuates estimates of the relationship between PRS

and Y. (D) Ideal scenario where X causes follow-up participation, but

Y does not. Conditioning on F has no impact on the dependence of Y on

PRS. PRS, polygenic risk score; X and Y, phenotypes of interest; F, se-

lection into follow-up; directional solid line, true causal association;

dashed line, induced or attenuated statistical dependence.


Dow




https://doi.org/10.7488/ds/2554https://www.ukbiobank.ac.uk/]http://www.generationscotland.org/https://biobank.partners.org]https://biobank.partners.org]https://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dyz134#supplementary-datahttp://www.ecdf.ed.ac.uk/http://www.ecdf.ed.ac.uk/

online questionnaire completed by 157 366 participants.

BJPsych Open 2018;4:83–90.

10. Navrady LB, Wolters MK, MacIntyre DJ et al. Cohort Profile:

Stratifying Resilience and Depression Longitudinally (STRADL):

a questionnaire follow-up of Generation Scotland: Scottish Family

Health Study (GS: SFHS). Int J Epidemiol 2018;47:13–14g.

11. Karlson E, Boutin N, Hoffnagle A, Allen N. Building the

Partners HealthCare Biobank at Partners Personalized Medicine:

informed consent, return of research results, recruitment lessons

and operational considerations. J Pers Med 2016;6:2.

12. Smith BH, Campbell H, Blackwood D et al. Generation

Scotland: The Scottish Family Health Study; a new resource for

researching genes and heritability. BMC Med Genet 2006;7:74.

13. Smith BH, Campbell A, Linksted P et al. Cohort Profile:

Generation Scotland: Scottish Family Health Study (GS: SFHS).

The study, its participants and their potential for genetic research

on health and illness. Int J Epidemiol 2013;42:689–700.

14. Dashti HS, Redline S, Saxena R. Polygenic risk score identifies

associations between sleep duration and diseases determined

from an electronic medical record biobank. Sleep 2018;42:1–10.

15. R Development Core Team. R: A Language and Environment

for Statistical Computing. 3.5. Vienna: R Foundation for

Statistical Computing, 2018.

16. Devlin B, Roeder K. Genomic control for association studies.

Biometrics 1999;55:997–1004.

17. Bulik-Sullivan BK, Loh P-R, Finucane HK et al. LD Score regres-

sion distinguishes confounding from polygenicity in genome-

wide association studies. Nat Genet 2015;47:291–95.

18. Johnson JL, Abecasis GR. GAS Power Calculator: Web-based

power calculator for genetic association studies. bioRxiv 2017.

doi: 10.1101/164343.

19. Zheng J, Erzurumluoglu AM, Elsworth BL et al. LD Hub: A cen-

tralized database and web interface to perform LD score regres-

sion that maximizes the potential of summary level GWAS data

for SNP heritability and genetic correlation analysis.

Bioinformatics 2017;33:272–79.

20. Fry A, Littlejohns TJ, Sudlow C et al. Comparison of sociodemo-

graphic and health-related characteristics of UK Biobank partici-

pants with those of the general population. Am J Epidemiol

2017;186:1026–34.

21. Coon KD, Myers AJ, Craig DW et al. A high-density whole-ge-

nome association study reveals that APOE is the major suscepti-

bility gene for sporadic late-onset Alzheimer’s disease. J Clin

Psychiatry 2007;68:613–18.

22. Pilkis SJ, Claus TH, Kurland IJ, Lange AJ. 6-phosphofructo-2-ki-

nase/fructose-2, 6-bisphosphatase: A metabolic signaling en-

zyme. Annu Rev Biochem 1995;64:799–835.

23. Verma T, Pearl J. Equivalence and synthesis of causal models. In:

Schachter R, Levitt TS, Kanal LN (eds). Uncertainty in Artificial

Systems. Amsterdam: Elsevier, 1991.


Dow




dyz134-TF1dyz134-TF2

Download - Edinburgh Research Explorer · 2019. 7. 17. · UKB participants were given the option of supplying an e-mail address for receiving newsletters and invitations for online follow-up

Top Related