så används statistiska metoder i jordbruksförsök
DESCRIPTION
Så används statistiska metoder i jordbruksförsök. Svenska statistikfrämjandets vårkonferens den 23 mars 2012 i Alnarp Johannes Forkman, Fältforsk, SLU . Agricultural field experiments. Experimental treatments Varieties Weed control treatments Plant protection treatments Tillage methods - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/1.jpg)
Så används statistiska metoder i jordbruksförsökSvenska statistikfrämjandets vårkonferens den 23 mars 2012 i AlnarpJohannes Forkman, Fältforsk, SLU
![Page 2: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/2.jpg)
Agricultural field experimentsExperimental treatments
• Varieties• Weed control treatments• Plant protection treatments• Tillage methods• Fertilizers
![Page 3: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/3.jpg)
Experimental designAllocate Treatments A and B to eight plots...
A A A A B B B B
A B A B A B A B
A B B A A B B A
Option 1:
Option 2:
Option 3:
![Page 4: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/4.jpg)
Systematic error
• The plots differ...• The treatments are not compared on equal terms.• There will be a systematic error in the comparison
of A and B.
![Page 5: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/5.jpg)
Randomise the treatments. This procedure transforms the systematic error into a random error.
R. A. Fisher
![Page 6: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/6.jpg)
ExampleTreatment Yield (kg/ha) Mean (kg/ha)A 8165A 7792A 8397A 7764 8029.3B 8483B 8602B 8641B 8783 8627.2
The difference is 598
![Page 7: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/7.jpg)
Randomisation test
• The observed difference is 598 kg/ha.
• There are 8!/(4! 4!) = 70 possible random arrangements.
• The two most extreme differences are 598 and -598.
• P-value = 2/70 = 0.029
![Page 8: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/8.jpg)
t-test
6339164
598
415298
493438
598 ..
t
Compare with a t-distribution with 6 degrees of freedom
P-value = 0.011
![Page 9: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/9.jpg)
The randomisation model
is the number of available plots.
![Page 10: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/10.jpg)
The approximate model
When is infinitely large
.
For statistical tests, we assume further that
𝐲=𝐗𝛃+𝐞 , E (𝐲 )=𝐗𝛃
![Page 11: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/11.jpg)
A crucial assumption
Unit-Treatment additivity:
• Variances and covariances do not depend on treatment
![Page 12: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/12.jpg)
Heterogeneity
A B
![Page 13: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/13.jpg)
Inference about what??• Randomisation model: The average if the
treatment was given to all plots of the experiment.• The approximate model: The average if the
treatment was given to infinitely many plots?
Sample Population
![Page 14: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/14.jpg)
Variance in a difference
When then
.
When then
.
![Page 15: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/15.jpg)
Independent errors• Randomisation gives approximately independent
error terms• Information about plot position was ignored• This information can be utilized
B A B A A B B A
![Page 16: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/16.jpg)
Tobler’s law of geography
“Everything is related to everything else, but near things are more related than distant things.”
Waldo Tobler
![Page 17: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/17.jpg)
Random fieldsThe random function Z(s) is a
• stochastic process if the plots belong to a space in one dimension
• random field, if the plots belong to a space in two or more dimensions
![Page 18: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/18.jpg)
Semivariogram
|| h ||
(||
h ||
)
practical range
95%sill
![Page 19: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/19.jpg)
Spatial modelling• Can improve precision.• Still rare in analysis of agricultural field experiments.• There are many possible spatial models and
methods.• Can be used whether or not the treatments were
randomized...• Which is the best design for spatial analysis?
![Page 20: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/20.jpg)
G H D E F B C A
A C G B D E H F
E F C D G A H B
I
II
III
Gradient
Randomised block design
![Page 21: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/21.jpg)
Incomplete block design
1
2
III IV II I
I J K L P O M N F E H G A D C B
V VIII VII VI
M I A E L P D H G O C K J B N F
Strata
• Replicates• Blocks• Plots
![Page 22: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/22.jpg)
Ofullständiga block
![Page 23: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/23.jpg)
D A C B B C A D C B D A B D C A C B A D B C A D
1 3 2 2 1 3
Replicate I Replicate II
Strata• Replicates• Plots• Subplots
Split-plot design
![Page 24: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/24.jpg)
D A C B B C A D C B D A B D C A C B A D B C A D
1 3 2 2 1 3 Replicate I Replicate II
D A C B B C A D C B D A B D C A C B A D B C A D
D A C B B C A D C B D A B D C A C B A D B C A D
D A C B B C A D C B D A B D C A C B A D B C A D
D A C B B C A D C B D A B D C A C B A D B C A D
1a
1b
2a
2b
3
Comparison
![Page 25: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/25.jpg)
sown conventionally sown with no tillage
cultivar 2cultivar 1cultivar 3
Mo applied Mo applied
Each replicate:
A design with several strata
Bailey, R. A. (2008). Design of comparative experiments. Cambridge University Press.
![Page 26: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/26.jpg)
![Page 27: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/27.jpg)
The linear mixed model
y = Xb + Zu + e
X: design matrix for fixed effects (treatments)Z: design matrix for random effects (strata)
u is N(0, G) e is N(0, R)
![Page 28: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/28.jpg)
Bates about error strata“Those who long ago took courses in "analysis of variance" or "experimental design" that concentrated on designs for agricultural experiments would have learned methods for estimating variance components based on observed and expected mean squares and methods of testing based on "error strata". (If you weren't forced to learn this, consider yourself lucky.) It is therefore natural to expect that the F statistics created from an lmer model (and also those created by SAS PROC MIXED) are based on error strata but that is not the case.”
![Page 29: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/29.jpg)
Approximate t and F-tests
The number of degrees of freedom is an issue.SAS: the Satterthwaite or the Kenward & Roger method.
𝑡= 𝑳 ��√v a r (𝑳 ��)
when L is one-dimensional, and
otherwise.𝐹=(𝐋 �� ) ′ ( var (𝐋 �� ))−1 (𝐋 �� )
rank ( var (𝐋 �� ))
![Page 30: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/30.jpg)
Likelihood ratio test
Full model (FM): p parameters
Reduced model (RM): q parameters
is asymptotically c2 with p – q degrees of freedom.
)()(log
yy|L|L
RM
FM2
![Page 31: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/31.jpg)
Bayesian analysis
y = Xb + Zu + eu is N(0, G) e is N(0, R)
G is diag(Φ) R is diag(σ2)
Independent priori distributions: p(b), p(Φ)Sampling from the posterior distribution: p(b,Φ | y)
![Page 32: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/32.jpg)
P-values in agricultural research
• Only discuss statistically significant results
• Do not discuss biologically insignificant results (although they are statistically significant).
• “Limit statements about significance to those which have a direct bearing on the aims of the research”. (Onofri et al., Weed Science, 2009)
![Page 33: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/33.jpg)
Shrinkage estimators
Galwey (2006). Introduction to mixed modelling. Wiley.
![Page 34: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/34.jpg)
Fixed or random varieties?
Fixed varieties (BLUE)• Few varieties • Estimation of differences
Random varieties (BLUP)• Many varieties
• Ranking of varieties
![Page 35: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/35.jpg)
Conclusions based on a simulation studyi. Modelling treatment as random is efficient for
small block experiments.ii. A model with normally distributed random effects
performs well, even if the effects are not normally distributed.
iii. Bayesian methods can be recommended for inference about treatment differences.
![Page 36: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/36.jpg)
Summary• Fisher’s ideas about randomisation and blocking
are still predominant.• Strong focus on p-values.• Linear mixed models are used extensively.• Spatial and Bayesian methods are used less often.• The question is what is random and fixed, and how
to calculate p-values.
![Page 37: Så används statistiska metoder i jordbruksförsök](https://reader035.vdocuments.us/reader035/viewer/2022062520/56816222550346895dd24c79/html5/thumbnails/37.jpg)
Tack för uppmärksamheten!