descriptive statistics - ftmsย ยท 2019-02-20ย ยท interquartile range ... semi-interquartile range...
TRANSCRIPT
DESCRIPTIVE
STATISTICS
Ms Nurazrin Jupri
SKEWNESS
Skewness measures the lack of symmetry in a data
distribution.
The skewed portion is the long and thin part of the curve.
A skewed distribution: the data are sparse at one end of
distribution but piled up at the other end.
Ms Nurazrin Jupri
SKEWNESS IN RELATION TO
BETWEEN MEAN, MEDIAN & MODE
Mode : the highest point of the curve
Median : the middle value
Mean : located somewhere towards the tail of the
distribution
Affected by all values, including extreme values
Bell-shaped / normal distribution has NO SKEWNESS
Mean = Median = Mode
Ms Nurazrin Jupri
MODE < MEDIAN < MEAN
Positively skewed
Skewed to the right
Ms Nurazrin Jupri
MODE = MEDIAN = MEAN
Symmetrical
Zero-Skewness
Evenly or normally distributed
Ms Nurazrin Jupri
MEAN < MEDIAN < MODE
Negatively skewed
Skewed to the left
Ms Nurazrin Jupri
MEASURE OF SKEWNESS
To determine the difference between the mean and the mode of
the distribution
Mean โ Mode = +ve distribution is right or positively
skewed
Mean โ Mode = -ve distribution is left or negatively
skewed
Mean โ Mode = 0 distribution is symmetrical
๐ท๐๐๐๐๐๐ ๐๐๐๐๐๐๐๐๐๐๐ ๐๐ ๐๐๐๐๐๐๐๐ =๐(๐๐๐๐ โ ๐๐๐ ๐๐๐)
๐๐๐๐๐ ๐๐๐ ๐ ๐๐๐๐๐๐๐๐
Ms Nurazrin Jupri
EXERCISE 1
1. What is the relationship between mean, median and
mode?
Find the mean median and mode of:
1, 2, 2, 3, 3, 3, 4, 4, 4, 4, 4, 4, 5, 5, 5, 6, 6, 7
โข Mean is 4.
โข Median is 4.
โข Mode is 4.
Ms Nurazrin Jupri
EXERCISE 1 (CONT.)
What is the relationship between mean, median
and mode?
Find the mean, median and mode of:
0, 5, 10, 20, 40, 45, 45, 50, 50, 50, 60, 60, 60, 60, 60, 60, 70,
70, 70, 70, 70, 70, 70, 70
โข The mean is 51.5.
โข The median is 60.
โข The mode is 70.
Ms Nurazrin Jupri
EXERCISE 1 (CONT.)
What is the relationship between mean, median
and mode?
โข Find the mean, median, and mode of:
20, 20, 20, 20, 20, 20, 20, 20, 30, 30, 30, 30, 30, 30, 45, 45, 45, 50, 50, 60,
70, 90
โข The mean is 36.1.
โข The median is 30.
โข The mode is 20.
Ms Nurazrin Jupri
QUARTILE
Normally used to describe positional values of large sets
of numerical data.
First quartile (Q1)
Second quartile (Q2)
Third quartile (Q3)
Ms Nurazrin Jupri
FIRST QUARTILE (Q1)
Is a positional value where :
25% of the observations are smaller
75% of the observation are larger
Step 1: Find first quartile position
๐ธ๐ =๐ + ๐
๐
Step 2: Arrange data
Step 3: Find first quartile value which correspond with first
quartile position.
Ms Nurazrin Jupri
THIRD QUARTILE (Q3)
Is a positional value where :
75% of the observations are smaller
25% of the observation are larger
Step 1: Find third quartile position
๐ธ๐ =๐(๐ + ๐)
๐
Step 2: Arrange data
Step 3: Find third quartile value which correspond with
third quartile position.
Ms Nurazrin Jupri
EXAMPLE 1
The 3 year annual returns of 14 low-risk funds are given as
follows.
9.77 11.35 12.46 13.80 15.47 17.48 18.37
18.47 18.61 20.72 21.49 22.47 31.50 38.16
Find the first and third quartile.
๐1(๐๐๐ ๐๐ก๐๐๐) =14 + 1
4= 3.75
Approximately, the forth position of data : 13.80
๐3(๐๐๐ ๐๐ก๐๐๐) =3(14 + 1)
4= 11.25
Approximately, the eleventh position of data : 21.49
Ms Nurazrin Jupri
QUARTILES FOR GROUPED DATA
Step 1: Obtain the cumulative frequencies
Step 2: Identify the first and third quartile position by using
formula quartile position.
Step 3: Identify the first and third quartile classes.
โข Quartile position
โข Cumulative frequencies
Step 4: Find the first and third quartile by using formula.
๐ธ๐ =๐
๐ ๐ธ๐ =
๐๐
๐
Ms Nurazrin Jupri
QUARTILES FORMULA
First Quartile
Third Quartile
LCB = Lower class boundary
n = Number of observations
CF = Cumulative frequency before the quartile class
f = Frequency for quartile class
C = Class size
๐ธ๐ = ๐ณ๐ช๐ฉ๐ + (๐ช๐)
๐๐ โ ๐ช๐ญ๐
๐๐
๐ธ๐ = ๐ณ๐ช๐ฉ๐ + (๐ช๐)
๐๐๐ โ ๐ช๐ญ๐
๐๐
Ms Nurazrin Jupri
EXAMPLE 2
Table shows the distribution of test scores obtained by 42
students in Statistics class. Calculate Q1 and Q3.
Scores obtained Number of students
80-90 1
90-100 2
100-110 5
110-120 10
120-130 15
130-140 7
140-150 2
Total 42
Ms Nurazrin Jupri
EXAMPLE 2 (CONT.)
Scores obtained Number of students Cumulative frequency
80-90 1 1
90-100 2 3
100-110 5 8
110-120 10 18
120-130 15 33
130-140 7 40
140-150 2 42
๐ธ๐ =๐(๐๐)
๐= ๐๐. ๐ ๐ธ๐ =
๐๐
๐= ๐๐. ๐
Ms Nurazrin Jupri
EXAMPLE 2 (CONT.)
Scores obtained Number of students Cumulative frequency
80-90 1 1
90-100 2 3
100-110 5 8
110-120 10 18
120-130 15 33
130-140 7 40
140-150 2 42
๐ธ๐ = ๐ณ๐ช๐ฉ๐ + (๐ช๐)
๐๐ โ ๐ช๐ญ๐
๐๐
= 110 + 120 โ 11010.5โ8
10
= 110 + 2.5
= 112.50
๐ธ๐ = ๐ณ๐ช๐ฉ๐ + (๐ช๐)
๐๐๐ โ ๐ช๐ญ๐
๐๐
= 120 + 130 โ 12031.5 โ 18
15
= 120 + 9
= 129
Ms Nurazrin Jupri
INTERQUARTILE RANGE
The difference between the third and first quartiles in a set
of data.
One of dispersion measurement
๐ฐ๐๐๐๐๐๐๐๐๐๐๐๐ ๐๐๐๐๐ = ๐ธ๐ โ ๐ธ๐
Ms Nurazrin Jupri
SEMI-INTERQUARTILE RANGE
Known as Quartile Deviation
One of dispersion measurement
๐ธ๐๐๐๐๐๐๐ ๐ซ๐๐๐๐๐๐๐๐ = ๐ธ๐ โ ๐ธ๐
๐
Ms Nurazrin Jupri
EXAMPLE 3
Refer to example 2 , find the interquartile and semi
interquartile range.
๐ฐ๐๐๐๐๐๐๐๐๐๐๐๐ ๐๐๐๐๐ = ๐๐๐ โ ๐๐๐. ๐ = ๐๐. ๐
๐บ๐๐๐ ๐๐๐๐๐๐๐๐๐๐๐๐๐ ๐๐๐๐๐ =๐๐. ๐
๐= ๐๐. ๐๐
Ms Nurazrin Jupri
COEFFICIENT OF VARIATION
Used while comparing distributions of different means and
variances
Gives the ratio of standard deviation to mean expressed
as percent.
๐ช๐ฝ =๐๐๐๐๐ ๐๐๐ ๐ ๐๐๐๐๐๐๐๐
๐๐๐๐ร ๐๐๐
Ms Nurazrin Jupri
EXAMPLE 4
Typist Ani can type 40 words per minutes with standard
deviation of 5 while typist Jura can type 160 words per
minutes with standard deviation of 10. which typist is more
consistent in her work?
Standard deviation of Jura is twice than Ani
Ani can type four times the speed of Jura
๐ช๐ฝ๐จ๐๐ =๐
๐๐ร ๐๐๐ = ๐๐. ๐%
๐ช๐ฝ๐ฑ๐๐๐ =๐๐
๐๐๐ร ๐๐๐ = ๐. ๐๐%
It shows that the typing ability of typist Jura is more consistent than
typist Ani
Ms Nurazrin Jupri
EXERCISE 2
The investments of Karu and Kamal are given as below:
Whose investment is considered to be more consistent?
Karu Kamal
Profit (RM) 250 250
Standard Deviation 8.16 238.05
Ms Nurazrin Jupri