lecture 8 short-term selection response r = h 2 s
TRANSCRIPT
Lecture 8Short-Term Selection
Response
R = h2 S
Applications of Artificial Selection
• Applications in agriculture and forestry • Creation of model systems of human diseases
and disorders• Construction of genetically divergent lines for
QTL mapping and gene expression (microarray) analysis
• Inferences about numbers of loci, effects and frequencies
• Evolutionary inferences: correlated characters, effects on fitness, long-term response, effect of mutations
Response to Selection
• Selection can change the distribution of phenotypes, and we typically measure this by changes in mean– This is a within-generation change
• Selection can also change the distribution of breeding values– This is the response to selection, the change
in the trait in the next generation (the between-generation change)
The Selection Differential and the Response to Selection
• The selection differential S measures the within-generation change in the mean– S = * -
• The response R is the between-generation change in the mean– R(t) = (t+1) - (t)
S
o
*p
R
( ) A Parental Generation
( ) B Offspring Generation
Truncation selection uppermost fraction
p chosen
Within-generationchange
Between-generationchange
The Breeders’ Equation: Translating S into RyO=πP+h2µPf+Pm2°πP∂( )
Recall the regression of offspring value on midparent value
Averaging over the selected midparents, E[ (Pf + Pm)/2 ] = *,
E[ yo - ] = h2 ( - ) = h2 S
Likewise, averaging over the regression gives
Since E[ yo - ] is the change in the offspring mean, it represents the response to selection, giving:
R = h2 S The Breeders’ Equation
• Note that no matter how strong S, if h2 is small, the response is small
• S is a measure of selection, R the actual response. One can get lots of selection but no response
• If offspring are asexual clones of their parents, the breeders’ equation becomes – R = H2 S
• If males and females subjected to differing amounts of selection,– S = (Sf + Sm)/2– An Example: Selection on seed number in plants -- pollination
(males) is random, so that S = Sf/2
Break for Problem 1
Response over multiple generations
• Strictly speaking, the breeders’ equation only holds for predicting a single generation of response from an unselected base population
• Practically speaking, the breeders’ equation is usually pretty good for 5-10 generations
• The validity for an initial h2 predicting response over several generations depends on:– The reliability of the initial h2 estimate– Absence of environmental change between
generations– The absence of genetic change between the
generation in which h2 was estimated and the generation in which selection is applied
(A)
S S
(B)
(C)
S
50% selectedVp = 4, S = 1.6
20% selectedVp = 4, S = 2.8
20% selectedVp = 1, S = 1.4
The selection differential is a function of boththe phenotypic variance and the fraction selected
The Selection Intensity, i
As the previous example shows, populations with thesame selection differential (S) may experience verydifferent amounts of selection
The selection intensity i provided a suitable measurefor comparisons between populations,i=SpVP=Sæp
Selection Differential Under
Truncation SelectionE[zjz>T]=Z1tzp(z)ºTdz=π+æ¢pTºTMean of truncated distribution
Change in the Variance"1+pT¢(z°π)=æºT°µpTºT∂2#æ2Height of the unit normal density functionat the truncation point T
P ( z > T)
Phenotypic standard deviationsHence, S = pT/ T
Likewise, i = S/ = pT/ Ti'0:8+0:41lnµ1p°1∂( )
pT=¡∑T°πæ∏( )
Selection Intensity Versions of the Breeders’ EquationR=h2S=h2Sæpæp=ih2æp
We can also write this as Since h2P = (2A/2
P) P = A(A/P) h A
R = i h A
Since h = correlation between phenotypic and breedingvalues, h = rPA
R = i rPAA
Response = Intensity * Accuracy * spread in Va
When we select an individual solely on their phenotype,the accuracy (correlation) between BV and phenotype is h
Accuracy of selectionMore generally, we can express the breedersequation as
R = i ruAA
Where we select individuals based on the index u(for example, the mean of n of their sibs).
rua = the accuracy of using the measure u topredict an individual's breeding value = correlation between u and an individual's BV
Example: Estimating an individual’s BV withprogeny testing. For n progeny, ruA
2 = n/(n+a), where a = (4-h2)/h2
Mathematica program
Overlapping Generations
Ry =im + if
Lm + Lf
h2p
Lx = Generation interval for sex x = Average age of parents when progeny are born
The yearly rate of response is
Trade-offs: Generation interval vs. selection intensity:If younger animals are used (decreasing L), i is also lower,as more of the newborn animals are needed as replacements
Generalized Breeder’s Equation
Ry =im + if
Lm + Lf
ruAA
Tradeoff between generation length L and accuracy r
The longer we wait to replace an individual, the moreaccurate the selection (i.e., we have time for progenytesting and using the values of its relatives)
Finite sample correction for iWhen a finite number of individuals form the next generation, previous expressions overestimate i
Suppose M adults measured, of which N are selectedfor p = N/M
E[i] depends on the expected value of the order statistics
Rank the M phenotypes as z1,M > z2,M . .. > zM,M
zk,M = kth order statistic in a sample of ME(i)=1æ√1NNXk=1E(zk;M)°π!=1NNXk=1E(z0k;M)(Standardized order statistics z0k;M=(zk;M°π)=æ
1.1.01
p = N/M, the proportion selected
Exp
ecte
d S
elec
tion
inte
nsi
ty
M = 10
M = 20
M = 50
M = 100
M = infinity
Finite population size results in infinite M expression overestimating the actual selection intensity, although the difference is small unless N is very small.
Burrows' approximation (normally-distributed trait)E(i(M;N))'i°∑1°p2p(M+1)∏1iBulmer’s approximation:
uses infinite population result, with p replaced byep=N+1=2M+N=(2M)
Selection on Threshold Traits
Assume some underlying continuous value z, the liability, maps to a discrete trait.
z < T character state zero (i.e. no disease)
z > T character state one (i.e. disease)
Alternative (but essentially equivalent model) is aprobit (or logistic) model, when p(z) = Prob(state one | z)
Liability scale
Mean liability before selection
Mean liability after selection(but before reproduction)
Selection differentialon liability scale
qt* - qt is the selection differential on the phenotypic scale
Mean liability in next generationFrequency of character state onin next generation
Frequency of trait
Steps in Predicting Response to Threshold Selection
i) Compute initial mean 0
We can choose a scale where the liabilityz has variance of one and a threshold T = 0
Hence, z - 0 is a unit normal random variableP(trait) = P(z > 0) = P(z - > - ) = P(U > - )
U is a unit normalDefine z[q] = P(U < z[q] ) = q. P(U > z[1-q] ) = q
For example, suppose 5% of the pop shows thetrait. P(U > 1.645) = 0.05, hence = -1.645
General result: = - z[1-q]
ii) The frequency qt+1 of the trait in the next generation is just
qt+1 = P(U > - t+1 ) = P(U > - [h2S + t ] ) = P(U > - h2S - z[1-q] )
St=π°πt='(πt)qtpt°qt1°qt*-
- -
iii) Hence, we need to compute S, the selection differential on liability
Let pt = fraction of individuals chosen ingeneration t that display the traitπt=(1°pt)E(zjz<0;πt)+ptE(zjz∏0;πt)- >
This fraction does not display the trait, hencez < 0
This fraction display the trait, hence z > 0When z is normally distributed, this reduces to
Height of the unit normal density functionAt the point tHence, we start at some initial value given h2 and
0, and iterative to obtain selection response
25201510500.00
0.25
0.50
0.75
1.00
1.25
1.50
1.75
2.00
2.25
0
10
20
30
40
50
60
70
80
90
100
Generation
Sel
ecti
on d
iffe
ren
tial
S
q, F
req
uen
cy o
f ch
arac
terS q
Initial frequency of q = 0.05. Selection only on adultsshowing the trait
15 Minute Break
Permanent Versus Transient Response
Considering epistasis and shared environmental values,the single-generation response follows from the midparent-offspring regressionR=h2S+Sæ2zµæ2AA2+æ2AAA4+¢¢¢+æ(Esire;Eo)+æ(Edam;Eo)∂( )
Breeder’s EquationResponse from epistasisResponse from shared
environmental effects
Permanent component of responseTransient component of response --- contributesto short-term response. Decays away to zeroover the long-term
Response with EpistasisR=Sµh2+æ2AA2æ2z∂( )R(1+ø)=Sµh2+(1°c)øæ2AA2æ2z∂( )
The response after one generation of selection froman unselected base population with A x A epistasis is
The contribution to response from this single generationafter generations of no selection is
c is the average (pairwise) recombination between lociinvolved in A x A
Contribution to response from epistasis decays to zero aslinkage disequilibrium decays to zero
Response from additive effects (h2 S) is due to changes in allele frequencies and hence is permanent. Contribution from A x A due to linkage disequilibrium
R(t+ø)=th2S+(1°c)øRAA(t)Why unselected base population? If history of previousselection, linkage disequilibrium may be present andthe mean can change as the disequilibrium decays
More generally, for t generation of selection followed by generations of no selection (but recombination)
Maternal Effects:Falconer’s dilution model
z = G + m zdam + e
Direct genetic effect on characterG = A + D + I. E[A] = (Asire + Adam)/2
Maternal effect passed from dam to offspring is justA fraction m of the dam’s phenotypic value
m can be negative --- results in the potential for a reversed response
The presence of the maternal effects means that responseis not necessarily linear and time lags can occur in response
Parent-offspring regression under the dilution model
In terms of parental breeding values,E(zojAdam;Asire;zdam)=Adam2+Asire2+mzdamRegression of BV on phenotypeA=πA+bAz(z°πz)+e
With no maternal effects, baz = h2With maternal effects, a covariance between BV and maternal effect arises, withæA;M=mæ2A=(2°m)The resulting slope becomes bAz = h2 2/(2-m)
The response thus becomes¢πz=Sdamµh22°m+m∂+Ssireh22°m( )
109876543210
-0.15
-0.10
-0.05
0.00
0.05
0.10
Generation
Cu
mu
lati
ve R
esp
onse
to
Sel
ecti
on
(i
n t
erm
s of
S)
Response to a single generation of selection
Reversed response in 1st generation largely due to negative maternal correlation masking genetic gain
Recovery of genetic response after initial maternal correlation decays
h2 = 0.11, m = -0.13 (litter size in mice)
20151050
-1.0
-0.5
0.0
0.5
1.0
1.5
Generation
Cu
mu
lati
ve R
esp
onse
(in
un
its
of S
)
m = -0.25
m = -0.5
m = -0.75
h2 = 0.35
Selection occurs for 10 generations and then stops
Gene frequency changes under selection
Genotype A1A1 A1A2 A2A2
Fitnesses 1 1+s 1+2sAdditive fitnesses
Let q = freq(A2). The change in q from onegeneration of selection is:¢q=sq(1°q)1+2sq'sq(1°q)whenj2sqj<<1
In finite population, genetic drift can overpowerselection. In particular, when 4Nejsj<<1drift overpowers the effects of selection
Strength of selection on a QTL
Genotype A1A1 A1A2 A2A2
Contribution toCharacter
0 a 2a
Have to translate from the effects on a trait under selection to fitnesses on an underlying locus (or QTL)
Suppose the contributions to the trait are additive:
For a trait under selection (with intensity i) andphenotypic variance P
2, the induced fitnessesare additive with s = i (a / P )4Nejsj=4NejaijæP<<1Thus, drift overpowers
selection on the QTL when
More generally
Genotype A1A1 A1A2 A2A2
Contribution to trait
0 a(1+k) 2a
Fitness 1 1+s(1+h) 1+2s¢q'2q(1°q)[1+h(1°2q)]
Change in allele frequency:
s = i (a / P )
Selection coefficients for a QTL
h = k
15 Minute Break
Changes in the Variance under Selection
The infinitesimal model --- each locus has a very smalleffect on the trait.
Under the infinitesimal, require many generations for significant change in allele frequencies
However, can have significant change in geneticvariances due to selection creating linkage disequilibrium
Under linkage equilibrium, freq(AB gamete) = freq(A)freq(B)
With positive linkage disequilibrium, f(AB) > f(A)f(B), so that AB gametes are more frequentWith negative linkage disequilibrium, f(AB) < f(A)f(B), so that AB gametes are less frequent
±æ2z=æ2z°æ2z*
VA(1)=Va+dVP(1)=VA(1)+VD+VE=VP+dh2(1)=VA(1)VP(1)=Va+dVP+dChanges in VA with disequilibrium
Under the infinitesimal model, disequilibrium onlychanges the additive variance.Starting from an unselected base population, a single generation of selection generates a disequilibriumcontribution d to the additive variance
Additive genetic variance afterone generation of selection
Additive genetic variance in theunselected base population. Oftencalled the additive genic variance
disequilibrium
Changes in VA changes the phenotypic variance
Changes in VA and VP change the heritabilityd=æ2O°æ2P=h42±æ2zThe amount disequilibrium generated by a single generation of selection is
Within-generation changein the varianceA decrease in the variance
generates d < 0 and hencenegative disequilibrium
An increase in the variancegenerates d > 0 and hencepositive disequilibrium
A “Breeders’ Equation” for Changes in Variance
d(0) = 0 (starting with an unselected base population)h2(t)=VA(t)VP(t)=Va+d(t)VP+d(t)d(t+1)=d(t)2+h4(t)2±æz(t)Decay in previous disequilibrium from recombinationNew disequilibrium generated byselection that is passed onto thenext generationæ2z(t)=(1°k)æ2z(t)Many forms of selection (e.g., truncation) satisfyd(t+1)=d(t)2°kh4(t)2æ2z(t) k > 0. Within-generation
reduction in variance.negative disequilibrium, d < 0
k < 0. Within-generationincrease in variance.positive disequilibrium, d > 0
d(t+1) - d(t) measures the response in selectionon the variance (akin to R measuring the mean)
Within-generationchange in the variance-- akin to S
Break for Problem 2