elementary statistics for foresters
DESCRIPTION
Elementary statistics for foresters. Lecture 2 Socrates/Erasmus Program @ WAU Spring semester 2005/2006. Descriptive statistics. Descriptive statistics. Data grouping (frequency distribution) Graphical data presentation (histogram, polygon, cumulative histogram, cumulative histogram) - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/1.jpg)
Elementary statistics for foresters
Lecture 2
Socrates/Erasmus Program @ WAU
Spring semester 2005/2006
![Page 2: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/2.jpg)
Descriptive statistics
![Page 3: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/3.jpg)
Descriptive statistics
• Data grouping (frequency distribution)
• Graphical data presentation (histogram, polygon, cumulative histogram, cumulative histogram)
• Measures of location (mean, quadratic mean, weighted mean, median, mode)
• Measures of dispersion (range, variance, standard deviation, coefficient of variation)
• Measures of asymmetry
![Page 4: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/4.jpg)
Descriptive statistics
• Descriptive statistics are used to summarize or describe characteristics of a known set of data.
• Used if we want to describe or summarize data in a clear and concise way using graphical and/or numerical methods.
![Page 5: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/5.jpg)
Descriptive statistics
• For example: we can consider everybody in the class as a group to be described. Each person can be a source of data for such an analysis.
• A characteristic of this data may be for example age, weight, height, sex, country of origin, etc.
![Page 6: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/6.jpg)
Descriptive statistics
• Closer-to-forestry example: we can consider all pine stands in central Poland as a group to be characterized.
• Each stand can be described by its area, age, site index, average height, QMD, volume per hectare, volume increment per hectare per year, amount of carbon sequestered, species composition, damage index, ...
![Page 7: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/7.jpg)
Frequency distribution
xi nini pi
pi
468
10121416
238273452421
23105178223247249250
0,0920,3280,2920,1800,0960,0080,004
0,0920,4200,7120,8920,9880,9961,000
250 1,000
![Page 8: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/8.jpg)
Frequency distribution
• Frequency distribution is an ordered statistical material (measurements) in classes (bins) built according to the investigated variable values
![Page 9: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/9.jpg)
Frequency distribution
• How to build it? – determine classes (values/mid-points and class
limits), depending on variable type– classify each unit/measurement to the
appropriate class– sum units in each class
![Page 10: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/10.jpg)
Frequency distribution
• Practical issues:– number of classes should be between 6 and 16– classes should have identical widths– middle-class values/class mid-points should be
chosen in such a way, that they are easy to manipulate
![Page 11: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/11.jpg)
Frequency distribution
xi nini pi
pi
468
10121416
238273452421
23105178223247249250
0,0920,3280,2920,1800,0960,0080,004
0,0920,4200,7120,8920,9880,9961,000
250 1,000
![Page 12: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/12.jpg)
Graphical description of data
• Pictures are very informative and can tell the entire story about the data.
• We can use different plots for different sorts of variables. We can use for example bar plots (histograms), pie charts, box plots, ... .
![Page 13: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/13.jpg)
Graphical description of data
Histogram for dk
dk
freq
uenc
y
0 3 6 9 12 15 180
20
40
60
80
100
![Page 14: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/14.jpg)
Graphical description of data
polygon
dk
freq
uenc
y
0 3 6 9 12 15 180
20
40
60
80
100
![Page 15: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/15.jpg)
Graphical description of data
cumulative histogram
dk
freq
uenc
y
0 3 6 9 12 15 180
50
100
150
200
250
![Page 16: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/16.jpg)
Graphical description of data
![Page 17: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/17.jpg)
Numerical data description
![Page 18: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/18.jpg)
Sums and their properties
ncc
xccx
yxyx
xx
ii
iiii
i
ni
ii
)(
1
1.
2.
3.
![Page 19: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/19.jpg)
Measures of location
• Arithmetic mean
• Quadratic mean
• Weighted mean
• Median
• Mode
• other
![Page 20: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/20.jpg)
Arithmetic mean
![Page 21: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/21.jpg)
Quadratic mean
![Page 22: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/22.jpg)
Properties of the mean
Weighted mean
...
![Page 23: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/23.jpg)
Median
• If observations of a variable are ordered by value, the median value corresponds to the middle observation in that ordered list.
• The median value corresponds to a cumulative percentage of 50% (i.e., 50% of the values are below the median and 50% of the values are above the median).
![Page 24: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/24.jpg)
Median
• The position of the median is calculated by the following formula:
![Page 25: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/25.jpg)
Median
• How to calculate it?
• If the detailed values are available, sort the data file and find an appropriate value
• If the frequecy distribution is available, use the following formula:
![Page 26: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/26.jpg)
Mode
• The mode is the most frequently observed data value.
• There may be no mode if no value appears more than any other.
• There may also be two (bimodal), three (trimodal), or more modes (multimodal).
• In the case of grouped frequency distributions, the modal class is the class with the largest frequency.
![Page 27: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/27.jpg)
Mode
• If there is no exact mode available in the data file, you can calculate its value by using:– an approximate Pearson formula
– by using an interpolation
![Page 28: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/28.jpg)
Relationship between measures
x
f(x)
μμe
μo
![Page 29: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/29.jpg)
Relationship between measuresf(x)
μo μe μ
c3c
![Page 30: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/30.jpg)
Relationship between measures
μ μe μo
c
3c
x
f(x)
![Page 31: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/31.jpg)
Sample calculations
![Page 32: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/32.jpg)
Sample calculations
![Page 33: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/33.jpg)
![Page 34: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/34.jpg)
Measures of dispersion
• Range
• Variance
• Standard deviation
• Coefficient of variation
![Page 35: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/35.jpg)
Range and variance
• Range is a difference between the lowest and the highest value in the data set
• Variance– average squared differences between data
values and arithmetic mean
N
xi
2
2
![Page 36: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/36.jpg)
Variance
N
xi
2
2
NN
xx ii
2
2
2
1
2
2
n
xxs i
1
2
2
2
nn
xx
s
ii
![Page 37: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/37.jpg)
22222 22 Nxxxxx iiiii
N
x
N
xx
N
xNx
N
xx ii
ii
ii
i
22
2
2
2
2 22
N
xx ii
2
2
NN
xnxn iiii
2
2
2
1
2
2
2
nn
xx
s
ii
22
2
22
2 kw
iiii
N
xn
N
xn
![Page 38: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/38.jpg)
Variance
min2 ix
222xcx c
02 c
![Page 39: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/39.jpg)
Standard deviation and coefficient of variation
2
100
w %
![Page 40: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/40.jpg)
Sample calculations
1950iixn 165962iixn250N
544,5250
1386
250
1521016596
250250
380250016596
250250
195016596
2
2
35,2544,5 %1,30%10080,7
35,2%100
w
![Page 41: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/41.jpg)
Measures of asymmetry
• Skewness: is a measure of the degree of asymmetry of a distribution.
• If the left tail is more pronounced than the right tail, the function has negative skewness.
• If the reverse is true, it has positive skewness.
• If the two are equal, it has zero skewness.
![Page 42: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/42.jpg)
Skewness
![Page 43: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/43.jpg)
Skewness
• Skewness can be calculated as a distance between mean and mode expressed in standard deviations:
oas
![Page 44: Elementary statistics for foresters](https://reader035.vdocuments.us/reader035/viewer/2022062423/56814e45550346895dbbc1b4/html5/thumbnails/44.jpg)
Acknowledgements
• This presentation was made thanks to the support and contribution of dr Lech Wróblewski