1
SA’ADATU RIMI COLLEGE OF EDUCATION, KUMBOTSO
SCHOOL OF SCIENCE EDUCATION
DEPARTMENT OF BIOLOGY
BIO 212 RESEARCH METHODS AND BIOMETRY
Meaning, purpose and relevance of research methods and biometry
Research method refers to the techniques for investing phenomenon through
experimentation. An experiment is carried out in order to investigate a
phenomenon which can lead to the acquisition of new knowledge or it can be
carried out to confirm or to correct a previous finding.
Biometry, Biometrics or Biostatistics is the application of statistics to a wide
range of topics in biology and its related discipline particularly agriculture and
medicine to distinguish it from statistics used in other field of human endeavour
e.g. economics business, education etc. The subject encompasses the design
layout, conduct of experiments, collection, compilation and analysis of
numerical data as well as interpretation of the results and drawing valid
conclusion for a particular situation time.
Biometry or Biostatistics is relevant to the field of Biology Agriculture and
Medicine; where the subjects of inquiries are variable in nature and the extent of
variation coupled with the relative importance of the different causes of the
variations are the primary concerns. Three, eminent scientist Sir R.A. Fisher,
S.D. Wright and J.B.S. Haldane, were the founding fathers of Biometry.
TYPES OF RESEARCH
Generally, research is divided into two main types these are: -
2
1. Qualitative Research
2. Quantitative Research
Qualitative Research: is a type of research that aim to investigate a question
without attempting to quantifiable measure variable or look to potential
relationship between variables. Example of qualitative research include case
study.
Quantitative Research: is a systematic empirical investigation of quantitative
properties and phenomena and their relationships, there by asking a narrow
question and collecting numerical data to analyse using a statistical methods.
Example include: experimental, correlational and survey.
Choice of Research Topic
The most successful research topics are narrowly focused and carefully defined.
Before a researcher chose a research topic, several factors are taken into
consideration. Some of them have to do with your particular interests,
capabilities and motivation. Other focused on areas that will be of great interest
to both the academic and private sectors.
The chemistry professor and author, Robert Smith in his book Graduate
Research: A guide for students in sciences (1984) listed some point to be
consider in finding and developing a research topic:
a) Can it be enthusiastically pursued?
b) Can interest be sustained by it?
c) Is the problem solvable?
d) Is it worth doing?
e) Will it lead to other research problems?
f) Is it manageable in size?
3
g) If the problem is solved, will the result be reviewed well by scholars in
your field?
h) Are you, or will you become, competent to solve it?
i) What is the potential for making an original contribution to the literature
in the field?
Hypothesis: (Types source, Formulation)
Hypothesis is one of the concepts frequently used by scientist and other
researchers in their scientific writings and works.
Hypothesis is a proposed explanation for a phenomenon. It is simply defined as
the educated guess that is based upon observation of an event or phenomenon. It
is a possible explanation of an event.
That is, it provides an explanation as to why or how an observed event happen
or what makes it happen but which has not been proved. Therefore it is simply
based on intuition or reasoning. Most hypothesis can be supported or rejected
by experimentation or continued observation.
Types of Hypothesis
Types of hypothesis include: - Null hypothesis, alternate hypothesis and
scientific hypothesis. Null and alternate hypothesis are the two types of
hypothesis found in statistics hypothesis testing. One is often just the
contradiction of the other. Scientific hypothesis is not commonly known as an
educated guess, it is a scientific theory that has not yet been proven to be true.
Null hypothesis always predicts the absence of relation between two variable
e.g. “there is no relationship between education and income” while.
The alternate hypothesis states an actual expectation such as “Higher levels of
education increase the likelihood of earning a higher income”.
4
Sources of Hypothesis
Hypothesis can be derived from many sources:
a) Theory: Theory on the subject can act as source of hypothesis e.g. providing
employment opportunity is an indicator of social responsibility of a
government enterprises from the above many hypothesis can be deduced.
i. Public enterprise has greater social concern than the enterprises
ii. People’s perception of government enterprise is a social concern.
b) Observation: People’s behaviour is observed. In this method we use
observed behaviour to infer the attitudes.
c) Past experience: Here researcher goes by past experience to formulate the
hypothesis.
d) Case studies: Case studies published can be used as a source for hypothesis.
Formulating Hypothesis
Hypothesis can be explained as a statement that can be used to predict the
outcome of future observations. It stated that you predict will happen based on
changes in variables. Variables are things that change. A hypothesis is stated in
order to be tested by an experiment. It can come as answers or explanations to
the question you have raised since from the beginning. It can also be a statement
that explains or describe the way you think the observation you made are
brought about, that is the mechanism underlying the phenomenon.
Once you have identified your research question, it is time to formulate your
hypothesis. An experiment is carried out to test the validity of the hypothesis
formulated through acceptance or rejection. Hypothesis are form in order to be
discredited and when they are not rejected they become accepted.
5
Formulating a valid hypothesis takes practice but it is important to the success
of any experiment. The following steps need to be consider when formulating a
hypothesis.
1. Great your hypothesis in a form of a question. E.g. “Does smoothing
cause lung cancer?”
2. Formulate the hypothesis by making it a conditional statement e.g.
“smoking may cause lung cancer”.
3. Write a formalized hypothesis e.g. “It smoking cause lung cancer, then
individuals who smoke have a higher frequencies is consider as the most
useful.
4. Make sure that your hypothesis contains variable the researcher is always
in control of the independent variable in the experiment. The dependent
variables are mostly observed in the context of the experiment. For an
experiment to be valid, it must contain at least two variables.
5. Make sure that your hypothesis include subject group. A subject group
defines who or what the researcher is studying in our example the subject
group are the smokers.
6. Include a treatment or exposure in the experiment. A treatment is literally
what is being done to the subject group. In our example, the exposure is
smoke or smoking.
7. Prepare for an outcome measures, which is a measurement concerned
with how the treatment is going to be assessed. In our example the
outcome of smoking is the frequency of smoke developing cancer in
subject population.
8. Understanding your control group. Control group is a group similar to the
subject group but does not receive the treatment i.e. is a population that
the subject group is compared to. In our example, the control group is
non-smokers.
6
DATA COLLECTION (TYPES AND SOURCES)
Data (Singular Datum) is a numerical statement of fact in a specific field of
enquiry. Simply, data means numbers and these numbers are obtained from
research carried out in a specific area of investigation. For example a research
may be interested in finding out how many students are admitted into the
various departments in S.R.C.O.E Kano annually for the last five years (2008 –
2013). This will involve checking the records at student’s affairs and counting
numbers of students admitted into each department during the stated period. The
number counted for each department is then counted. This record forms your
data. The research may also involve distribution of questionnaires and getting
responses from the respondents. Example ministry of health is interested in
knowing how many people use treated nets in an area that is known to have a
higher prevalence rate of malaria in an attempt to popularize its use. The
researcher may have to cover the area with questionnaires, and use that to count
how many people use the nets, how many people use other methods of
prevention and how many people do not use any method. This also form a data
from which you can carry out further analysis. So also study may involve taking
measurement on blood pressure, parasite count, height weight, yield
temperature humidity etc. the measurements records from these parameters all
comprise data.
Therefore, data may be collected by two methods, counting or taking
measurements. Data collected from counting (e.g. number of students in
SRCOE) is taken in whole number (1,2,5,8,11 and so on) and is referred to as
DISCRETE or DISCONTINUOUS data. On the other hand data collected from
taking measurements (such as blood pressure is termed as Continuous data (2.3,
2.34, 2.345) depending on the accuracy of the measuring device. It is very
important to know the kind of data one is collecting whether Discrete or
7
Continuous because it is an important determinant of the kind of statistical test
to be used for data analysis.
Population Sample and Sampling Techniques
It is difficult to collect information from the whole population due to problems
associated with natural causes of variation in Biological materials as well as
limitation of resources and time. Consequently in an investigation, information
is collected from samples.
At the time of collecting data, sampling must be made randomly without bias or
proportionately as with allocation of treatments. The size of the sample, should
also be large enough to be a good representative of the whole population. This
is to enable the researcher collect adequate information he is after. Size of the
sample can be determined by the external of the variation you are seeking in
your experiment. It is important to decide on correct sample size before the start
of the experiment.
Population Sample and Sampling Techniques
Due to limitation of resources and time as well as variation in Biological
materials, it is difficult to collect information from the whole population during
research. Therefore during investigation, information or fact are collected from
a samples. By definition, sample in defined as the time representation of entire
population, when collecting information, sampling must be made randomly in
thought bias in the size of sample must be large enough to be a good
representation of the whole population. Sampling size must be determine before
the commencement of the experiment or investigation.
Sampling Techniques
This refers to the techniques of taking a portion of the population as a
representative of that population. The researcher has to decide on whether every
8
member of the population will be studied i.e. sample. In most areas, sampling is
done when the researcher wants to generalize the findings of his/her research to
the entire population: the following techniques are employed when sampling.
a) Simple random sampling
b) Systematic sampling
c) Stratified sampling
d) Cluster sampling
• Simple Random Sampling
This is a method of selecting a sample from a population so that all members of
the population have an equal chances of being selected. This indicates that no
member of the population has been omitted deliberately except by chance. This
is most applied when the population is homogenous. For example, suppose a
researcher wants to choose 10 students at random out of a population of 50
within a given class each students has one chance of being included in the
sample.
Sampling Techniques
For some members of the population to be selected and included in a study
sample, rules and methods, have to be established. This is in order to avoid bias
on the part of the researcher. A sample should be a true representative of the
population under investigation and to achieve that it has to be taken randomly.
Random sampling ensures that every member of the population has an equal
chance of being selected as a sample. The following techniques are employed
when sampling.
a. Random Sampling: Simple random sampling is best applied where the
distribution of the organisms is homogenous, that is they are evenly
distributed. In simple random sampling, every member has an equal chance
9
of being selected and included in the sample. Randomness can be achieved
through the use of random member tables.
Stratified Sampling: Stratified sampling is applied when you are dealing with
a heterogeneous population. It involves grouping men or mutually exclusive
groups before sampling. In stratified sampling, an individual’s first of all
identifies the strata of interest and divides the population into sub-group or
strata depending on the number and types of sub-group that exist in the
population and the objectives of the study. For example stratification may
include variables of the study. For example stratification may include variables
of gender i.e. male and female, income tribe, religion etc.
b. After the researcher has divide the population into strata sub-groups, he then
uses simple random sampling to select appropriate sample size from each
stratum or sub-group, according to desire characteristic of the variables such
as sex, location academic ability etc. The idea behind using stratified
sampling techniques is to examine each sub-group separately.
c. Systematic Sampling: Systematic sampling techniques is best applied to a
homogenous population. It involves selecting samples at regular intervals or
fixed distances that are equally spaced. This is a method of selecting a
sample at fixed intervals from a population that is systematically arrange or
in an Alphabetical order or in some natural sequence. It is employed when
all the population or subjects are put in order for example systematic
sampling can be used in selecting 100 people out of 300.
Cluster Sampling: In cluster sampling, the population or habitat or field, is
divided into smaller groups or clusters and some of these cluster are randomly
selected to form the sample. In each cluster selected, every member would be
taken to be part of the sample and included in the study. In cluster sampling,
each cluster chosen must be a representative of the population. When the
population to be sampled is vast and spread over a wide geographical areas a
10
cluster sampling is used. In this type of sampling, the population is first divided
into sample units or cluster. For example states within the country, local
government area, within the state etc. Therefore some of these clusters are
randomly selected to form the sample. In each cluster selected, every member
would be taken to be part of the sample and included in the study.
• Data Presentation
Raw data, that is data recorded directly from the laboratory or field in the record
book is unorganized and because of that it make it difficult to comprehend and
draw information from it. Therefore, raw data collected from experiment have
to be properly organized summarized and present. Presented and statistical
analysed for any meaningful interpretation to be made out of it.
The first step in organizing data is the preparation of an ordered array. His
explain arranging the data in an ascending order. An ordered array enables one
to determine quickly the value of the smallest measurements and that of the
highest as well as other facts or information that may be need from the given
data. Consider the following example.
Example 1.0: Number of pods produced per plant by certain species of legume
in Kadawa research station.
69 74 71 74 71 71
69 72 72 78 72 68
70 67 72 70 75 72
71 70 74 73 75 75
75 74 74 73 74 72
67 76 75 75 68 78
11
69 68 75 78 69 78
73 69 75 68 71 78
The fact that the above data is unorganized make it difficult to draw information
from it, and so an ordered array can be prepared as follows: -
Table 1.0
67 67 68 68 68 68
69 69 69 69 69 70
70 70 71 71 71 71
71 72 72 72 72 72
72 73 73 73 74 74
74 74 74 74 75 75
75 75 75 75 75 75
76 78 78 78 78 78
From this ordered array one can easily find out the minimum number of
pods/plant (67) or the maximum (78).
However an ordered array may not be adequate in offering all the information
that are needed from the data obtained. Therefore further summary may be
required and one way is by tabulating your data according to frequencies.
• The Frequency Distribution
Classifying data according to frequencies enables further summary as well as
facilitates the computation of various descriptive measures like the percentages,
12
average and so on. To get the frequency of number, the first step is to get the
range of the figures in the table and then you take a tally and record the score of
each number that is the frequency of occurrence of each number. In example
1.0, the range is from 67 to 78 and the frequencies are obtained as followed:
Table 1.2: A tally of frequency of occurrence number of pods/plant
No. of Pods/plant Tally Frequency
67
68
69
70
71
72
73
74
75
76
77
78
II
III
IIII
IIII
IIII I
III
III
IIII I
IIII III
I
0
IIII
2
4
5
4
6
4
3
6
8
1
0
5
TOTAL 48
This frequency can then be presented in a table which is referred to as frequency
distribution table.
Table 1.3: Frequency distribution of the number of pods produced plant
No. of Pod/Plant (x) Frequency (f) Relative frequency (%)
67
68
2
4
2/48 x 100 = 4.17%
4/48 x 100 = 8.33%
13
69
70
71
72
73
74
75
76
77
78
5
3
5
6
3
6
8
1
0
5
5/48 x 100 = 10.42%
3/48 x 100 = 6.25%
5/48 x 100 = 10.42%
6/48 x 100 = 12.50%
3/48 x 100 = 6.25%
6/48 x 100 = 12.50%
8/48 x 100 = 16.67%
1/48 x 100 = 2.08%
0/48 x 100 = 0.00%
5/48 x 100 = 10.42%
Total 48 100%
• Cumulative Frequency
This is usually derived from the frequency distribution table by summing up the
observed frequencies. It gives way of obtaining information regarding the
frequency or relative frequency of value with two or more contiguous
observation or group. From example 1.0 the following cumulative frequency
distribution can be obtained.
Table 1.4: Cumulative frequency of number of pods/plant
No. of Pods/Plant
(x)
Frequency (f) Cumulative
frequency
Cumulative relative
frequency
67
68
69
70
71
72
2
4
5
3
5
6
2
6
11
14
19
25
4.1
12.5
22.9
29.2
39.6
52.0
14
73
74
75
76
77
78
3
6
8
1
0
5
28
34
42
43
43
48
58.3
70.8
87.5
89.6
89.6
100.0
The frequency distribution for group data
Another method of summarising a data is by classifying it into groups or classes
from which the various descriptive measures can be easily calculated. To group
data the first step is to select a class interval such that each value is placed in
only one interval bearing in mind that there can be not less than 6 intervals and
not more than 15. There are two methods of classification; exclusive which is
best for describe data and inclusive which is ideal for continuous data.
Example 2.0 the weight of all NCE II research method and biometry students
are measured and the results is expressed to the nearest kilogram (kg) as
follows: 100, 95, 85, 75, 80, 78, 90, 93, 109, 104, 88, ...... (the dots represent
value not shown) assuming we have 100 such measurement. The measurement
can be seen to range from lowest value of 75kg to a higher value of 109kg. If
we collect together the value in groups such that each value is placed in only
one class interval a frequency distribution.
Table 2.0: Frequency distribution of weight of NCE II Research method
and Biometry students.
Class interval of weight of students (kg) Frequency (f)
75 – 79
80 – 84
2
2
15
85 – 89
90 – 94
95 – 99
100 – 104
105 – 109
7
11
28
30
20
Total 100
Histograms: This is a graph of frequency or relative frequency distribution
plotted in a form of rectangle or bars continuous data is best represented in the
form of histogram.
Bar-
chart:
This represent the frequency distribution of describe data and because the data
is discontinuous, bar and columns are form, and exclusive type of classification
is used.
Example: Let us assume we have 37 observations on number of flowers
produced per plant as follows: -
Table 3.0: Frequency distribution of number of flowers/plant
16
Class interval of no. of flowers/plant Frequency (f)
1 – 5
6 – 10
11 – 15
16 – 20
21 – 25
26 – 30
3
5
8
4
7
10
Total 37
A bar chart can be drawn from the above illustration
Line Graphs: These changes in one variable in relation to another variable. In
this case we have;
Y = vertical axis as dependent variable. Dependent variable is the one which is
not under the control of the experimenter, it changes in response to changes in
the independent variable.
X = horizontal axis, and is the independent variable, which ............
Example: Measurement on height of maize plant were taken and the data
presented in the table below:
Table 4.0: Height of maize plant at different number of days after planting.
Time in days x (Independent variable) Heights (cm) y (Dependent variable)
20
40
60
80
25
50
75
100
17
100 125
A line graphs of maize plants height could be drawn as follows:
Data may be organized and summarized through the following processes.
1. Organize the data by putting them in tables such as frequency tables, and
other forms of summary tables.
2. Illustrate the data using graphs such as histograms, bar charts line graphs etc.
3. Analyze the data using mathematical approach.
Once the data is summarized an appropriate statistical test can be carried out in
order to draw conclusion about the investigation.
Example: The number of flowers produced per plants by rose plants in garden
are as follows:
18
6 3 6 7 8 4 6 7 8 4
9 8 9 5 7 8 9 5 7 8
8 5 7 8 10 6 7 8 6 7
The data above can be organized by constructing
a) A frequency distribution table
b) Relative frequency distribution
MEASURES OF CENTRAL TENDENCY
They are also known as measures of location. There are three most commonly
used measures and these are; mean, median and mode. These measurement
provide a general idea of the position of the center of a distribution.
The Mean
The most common of the measures of central tendency is the average or in
mathematics term, the arithmetic mean or simply the mean. This is achieve by
adding up all the observations and dividing by the total number of
measurements obtain; thus:
Mean = Sum of observation
Total number of observation
The mean is donated by X (x-bar). The general formula for the calculation of
sample mean can be given as:
X = ∑𝑥
𝑛
Where X represents the variable
∑ = summation sign n = number of observation
∑𝑥 = x1 + x2, x3 ............... xn
19
Example 3.0: To compute the mean of the number pods produced plant of 48
plants represented in table 1.0, which is a set of unsummarised measurements,
we simply add them all up and divide by the total number.
X = ∑𝑥
𝑛 = 69 + 74 + 71 .................. 71 + 78
48
= 3477
48 = 72.438
X = 72.44
Median
To calculate the median, the data has to be arrange in an array, in an increasing
order of size. When this is done then the median is the value which divide a set
of values into two equal parts. When the numbers are odd, the median is the
central or middle value, but when they are even, then it became mean of the two
central figures. In others words median can be defined as:
n = 1
2
Therefore, if we have 15 observations, the median is the (15+1
2) = 8th ordered
observation. If we have 16 observation, then it the (15+1
2) = 8.5 ordered
observation and this implies that the value is half way between the 8th and the
9th observation. You therefore take the average of the 8th and 9th observation as
your median.
Example 4.0: Taking the data presented in table 1.0 for illustration, since the
value are already arranged in a sequence and it is an even number, the median is
therefore the:
𝑛+1
2 =
48+1
2 =
49
2 = 24.5th value
Therefore we take the two middle values which are 72 and 72.
20
The median therefore is 72+72
2 = 72
The Mode
It is defined as the most commonly occurring value in a data. Thus if all the
figures are different e.g. 5, 6, 11, 15, 8 then there is no mode. Alternatively it is
possible to get more than one mode e.g. 5, 5, 6, 8, 8, 9; here 5 and 8 are the
modes.
Example 5.0: The mode of the data presented in table 1.0 is 75. This is because
it occurs most frequently (8 times).
MEASURES OF DISPERSION
The observation making up a set of data are scattered about the mean. This
scatter or dispersion reveals the degree of variability present in a set of
measurements. If all values are the same, then there is in dispersion in the data.
If on the other hand, the value are not these then there is dispersion, and the
amount of dispersion depend on spread of the figures, that is how close together
or how scattered, they are:
The main measures of dispersion encountered in biological estimates include
the followings:
a) Mean deviation
b) Variance
c) Standard deviation
The Mean Deviation
Dispersion can also be assessed by calculating deviation from the mean since
the mean is a very useful measure of central tendency. The sum of the deviation
from the mean is always zero, so one has to ignore the negative sign, since it is
not used in Biology. The contributions an observation (x) to scatter is equivalent
to its distance from the mean, i.e. X – X. Deviation from the mean (MD) can be
calculated using this equation.
MD = ∑(X – X) Where X = observation
21
n X = mean
n = No. observation
Example 6.0: Let us select a range of 5 figures from the observation on number
of pods produced/plant in table 1.0. These are 67, 70, 75, 78, 68. To calculate
Mean Deviation (MD), we to first of all calculate the mean: as thus:
X = ∑X = 67 + 70 + 75 + 78 + 68 = 358
n 5 5 = 71.6
MD = (67 – 71.6) + (70 – 71.6) + (75 – 71.6) + (78 – 71.6) + (68 – 71.6)
5
= (-4.6) + (-1.6) + (3.4) + (6.4) + (-3.6)
5
Ignoring the minus signs
MD = 19.6 = 3.92
5
The Variance
As ready mentioned that the amount of variation in a given data depends on
how close or scattered the values are from the mean. The closer they are to the
mean the less the dispersion while the further they are from the mean, the
greater the variation. Therefore it is common to measure dispersion relative to
the scattered of the value about their mean. One measure through which this is
achieved is through the computation of the VARIANCE. Variance is the sum of
the squares (S2) of the deviations of the values from the means divided by the
size of the sample (n) minus 1. If it is a population, you divide by the size of
population (N). The procedure for the computation of variance is therefore, to
compute the mean, then subtract the mean from each of the values, square the
resulting differences, sum up the squared differences and divide by the size of
the sample or population. The variance of a sample can be represented as
follows:
22
𝑆2 = ∑(𝑥−𝑥)2
𝑛−1.........................................(1)
When you have a large data the computation of sum of squares from the above
equation can be tedious, and so an alternative formula, which may be less
familiar can be adopted. This is
𝑆2 = ∑𝑋2−(£𝑥)2
𝑛 (𝑛−1)......................................(2)
Example 7.0: Let us compute the variance of the sample discussed in example
6.0. we substitute the value in the first equation.
𝑆2 = (67 – 71.6)2 + (70 – 71.6)2 + (78 – 71.6)2 + (68 – 71.6)2
5 – 1
𝑆2 = (-4.6)2 + (-1.6)2 + (3.4)2 + (6.4)2 + (-3.6)2
4
𝑆2 = 89.2
4 = 22.3
The fact that n – 1 is used in the division instead of n as would have been
expected is due to a concept known as degree of freedom (df).
The Standard Deviation
This is the positive square root of the variance and therefore has the same unit
of the original measurement unlike the variance. Standard deviation of the
sample is given as:
𝑆 = √∑(𝑥−𝑥)2
𝑛−1
Or the alternative formula for a large data;
𝑆 = √∑𝑥2−(∑𝑥)2
𝑛(𝑛−1)
Example 8.0: To compute the standard deviation of the sample discussed in
example 7.0. Since the variance (S2) of the same sample has already been
calculate in example 7.0 and it is 22.3 it follows that the standard deviation (S)
is.
23
𝑆 = √22.3
ᵟ = 4.722
Mean, Variance and Standard Deviation for a Grouped Data
You may wish to compute the mean and variance from a frequency distribution
table. In this case you assume that all values falling in a particular class interval
are placed at the mid-point of the class interval. To determine the mid-point of a
class interval, the average of the lower and upper and limits of the class interval
is calculated and that taken as the mid-point (X). It is good to prepare a
worktable as follow for convenience and ease in calculation.
1. The Mean
To complete the mean, the mid-point (x) of each class is multiplied by its
corresponding frequency (f) and divided by the sum of the frequencies (∑f).
This is represented as follows:
X = ∑𝑓𝑥
∑𝑓 = ....................................... for group data
MD = ∑(𝑋−𝑋)𝑓
∑𝑓.................................. for group data
Variance for grouped data
𝑆2 = ∑(𝑥−𝑥)2𝑓
∑𝑓−1
Standard deviation for grouped data
𝑆 = √∑(𝑥−𝑥)2𝑓
∑𝑓−1
Example 8.0: Let us compute the mean, mean deviation, variance and standard
deviation of the weight of NCE II research Method and Biometry students
presented in the frequency distribution table (2.0). The work table is shown as
follows: -
Table 5.0: Workable for calculating the mean, mean deviation, variance and
standard deviation from frequency data of table 2.0.
24
Class
Interval
Frequency
(f)
Mid-
point (x)
fx x-x (x-x)2 (x-x)2 f
75 – 79
80 – 84
85 – 89
90 – 94
95 – 99
100 – 104
105 – 109
2 3
2 2
7 5
11 12
28 30
30 32
20 21
77
82
87
92
97
102
107
154
164
609
1012
2716
3060
2140
-21.5
-16.5
-11.5
-6.5
-1.5
3.5
8.5
462.3
272.3
32.3
42.3
2.3
12.3
72.3
924.6
544.6
926.1
465.3
64.4
369
1446
Total 100 9855 4740
1. X = ∑𝑓𝑥
∑𝑓 =
9853
100 = 98.5
2. MD = ∑(𝑥−𝑥)𝑓
∑𝑓 =
545.0
100 = 5.5
3. 𝑆2 = ∑(𝑥−𝑥)2𝑓
∑𝑓−1 =
4740
100−1 =
4740
99 = 47.88
4. ᵟ = √
∑(𝑥−𝑥)2𝑓
∑𝑓−1 = √47.88 = 6.92
MEASURES OF RELATIONSHIP
1. The Chi-Squared Test
The statistical tests discussed in the previous sections are used to analyse
differences between mean or to determine relation between two variable
collected from every member of the population. The tests described are
considered when the data collected is from continuous variables like height,
weight, blood pressure age, yield e.t.c alternatively referred to as quantitative
date but there are discrete date which cannot be measured but counter. In such, a
ratio is found out known as the chi-squared (with the symbol x2) to goodness of
fit. There are experiments that collect qualitative date, similarly such research
25
seeks to determine association between two factors such as seedling vigor and
leaf colour or eye colour and hair colour or flower pigmentation and seed
germination and so on. In this case a contingency table is formed and chi-
squared X2 is used to determine whether or not there is association between the
factors. The X2 – test is also used to compare two proportions or percentages. It
may be regarded as a non-parametric test as it does not require detailed
information about the population from which the samples are drawn.
Test of Goodness of Fit
In situation where the basis of a hypothesis, which specifies the frequency of
occurrence of an observation, it is tested, the test of goodness of fit is used. It is
therefore frequently used in genetic studies. It is used to test whether the
outcome of a breeding experiment in line with the predicted mendelian rations.
In testing goodness of fit, the X2 determines how close the observes frequencies
are to those expected on the basis of a hypothesis. It is calculated using the
following expression;
X2 = ∑(𝑂−𝐸)2
𝐸
The computed value is compared with the table value of X2 at n-1 degree of
freedom. Goodness of fit test may involve one – way classification in which the
frequencies fall into only two classes that are mutually exclusive and then is the
simplest test. Or the test may also involve more than two classes.
Example 9.0: Test of goodness of fit with two classes.
In a determination of sex ratio among a sample species of birds, it was
discovered that out of 5000 birds, 378 were male. Does this sample conform to
the one male: one female ratio, that is 1.1?
Solution
Sex Male Female Total
Observed no. of bird
Expected no. of bird
378
250
122
250
500
500
26
O – E
X2 = ∑(O-E)2 /E
128
65.5
-128
65.5
0
121.0
Therefore the calculated value (131.0) exceeded the table value at 500% and 1
d.f. (that is 3.481) and even at 1& on the same do (that is 6.635), the difference
is highly significant. We therefore conclude that this sample does not confirm to
the 1:1 ratio.
Example 10.0: Test of goodness of fit with more than two classes.
Suppose a cross-involving two cowpea varieties (smooth white and rough
brown) show the ratio of 9:3:3:1, if a sample of 550 or observations give the
following frequencies 305 smooth white (sw), 110 smoth brown (sb), 105 rough
white (rw) and 30 rough brawn (br). Test whether this agrees with the given
ratio.
Solution
If the given ratio applies, it follows that the expected frequencies in the four
groups will be.
psw = 9/16 x 550; psb = 3/16 x 550; psw 3/16 x 550; prb = 1/6 x 440
respectively, where p stands for probability. The sum of the expected
frequencies must be the same as that of the observed frequencies the next step is
to construct the table of observed and expected frequencies in the following
way.
Phenotype category 1
sw
9
2
sb
3
3
rw
3
4
rb
1
Total
Observed frequency (O)
Expected frequency (E)
305
309.42
110
103.14
105
103.14
30
34.38
550
500
27
O – E
X2 = ∑(O-E)2
E
-4.42
0.06
6.86
0.46
1.86
0.03
-4.38
0.56
1.11
28
PROJECT REPORTING
Once the research has been carried out and the data is analyzed the researchers
has to write the research report or project report. The project report essentially
consist of the following parts:
a) Preliminary part of the report
b) The main body of the report.
c) Reference section.
Preliminary Part of the Project Report
This section should contain all or some of the following items in the
following order.
i. Cover page
ii. Certification page
iii. Title page
iv. Approval page
v. Abstract
vi. Acknowledgement
vii. Table of content
viii. List of tables
ix. List of figures (if any)
x. List of appendices (if any)
The manner of presentation of the preliminary pages in the project report is as
follows:
1. Cover Page: The cover page is the first page of the report. It contain the title
page of the project, full names of the authors and the month and year of
completion of the report.
29
2. Titled Page: This page covers information on the title of the project the
student’s full name, the degree, diploma or certificate for which the project is
submitted as well as the month and the year of submission of the project.
3. Certification Page: This page shows a statement signed by the supervisor to
the effect that materials recorded in the project report resulted from the
research originally carried out by the student.
4. Approval Page: This page contains a statement signed by the examiner, the
head of Department and the students’ supervisor to the effect that the student
has satisfied the requirements for acceptance of the project for the award of
NCE.
5. Abstract: An abstract is a summary of the research report and should not be
lengthy, it should contain in summary from the objectives of the study,
characteristics of the subject, and the method used, the finding of the study
and the conclusion reached by the researcher.
6. Acknowledgement: In this section, the researcher express his/her gratitude
and indebtedness to individuals institutions or organization who provided
one form of assistance or the other during the execution of the work e.g
financial support, review and /or manuscript, and assistance in conducting
the research and/or prepares the manuscript.
7. Table of Content: Table of contents consists of the list of main topics in the
research report beginning with the abstract and end in with appendices, it
also contains the first page of each of the topics listed. The table of content
enables the reader to locate without difficulty any topic of interest to him in
the research report.
8. List of Figure: All figures (if any) in the project one also listed, showing
titles and indicating the pages where they appear.
Main Body of Project Report
The main body of project report is divided into six major section.
30
1. Introduction
2. Literature review
3. Research Methodology
4. Analysis and presentation of result
5. Discussion, conclusion and recommendation
6. Reference and appendices.
Section 1 to 5 above are ordinary referred to as chapter section six is
separated after the fifth chapter into reference and appendices respectively.
1.0 Chapter One: Introduction
The title of the first chapter of the project report is the introduction it is divided
into the following sub-headings
a. The background of the problem
b. The statement of the problem
c. The purpose of the study/objectives of the study.
d. The importance/significance of the study.
e. The statement of hypothesis
f. Definition of terms.
2.0 Chapter Two: Literature Review
This referred to as review of literature. In this, one tries to summarize the
related literature previously written about the topic she/he is writing on. It gives
a background on what is to be written about in the research. The review of
literature often from a critical analysis of what is written before the researcher
came out with his/her view.
3.0 Chapter Three: Research Methodology
The third chapter is the methodology it compresses the following sub-heading.
31
3.1 Research Design: The researcher explain the general approach adopted in
executing the study. i.e he/she has to specifically explain the type of the
design followed in the study.
3.2 Area of Study: This refers to the geographical location covered by the
study which is usually stated in terms of country, state, education,
political or administrative zone, local government etc.
3.3 Population: The researcher specify the number of persons or items from
whom data related to the study were collected.
3.4 Sampling & Sampling Procedure: Cleary one has to inform the reader
about the sample used, how it was obtained including the sampling
techniques used.
3.5 Instrument for Data Collection: This involves giving detailed description
of the instrument used in collecting the date how the instrument was
developed and the major features of the instrument.
3.6 Method of Data Collection: The researcher reports the statistical
techniques tools employed in analyzing the data.
4.0 Chapter Four: Result Analysis and Presentation
This chapter presents the data and statistical analysis without discussing the
implication of the result. These are usually presented in tables, figures and
chart. It is advisable to present the result according to the research question or
hypothesis to which they relate.
5.0 Chapter Five: Discussion, Conclusion and Recommendation
5.1 Discussion: The researcher tries to discuss the findings and relate to the
findings of previous works.
5.2 Conclusion: Should indicate to major findings from the study.
32
5.3 Recommendation: The researcher should suggest action which could be
taken in the light of findings in ordered to bring improvement in the system.
Reference Section
The last section of a researcher report in followed by the list of references
and then appendices (if any).
Reference consist of all documents, including journals article books,
website and unpublished works that are cited in the project.
Appendix: The researcher includes any information/materials that might
have been bulky to be put in the context of the report. These might
include questionnaire design and tables.
Important Information: In research, the researcher has to use an
objective language and avoid personal pronouns like I, me, ours, or any
other personal references eg. I selected ten student from any sample
(irony) is better put it thin why. Ten students were selected.
General Guidelines for Writing References in Project Report
1. Surnames of authors to be written first before the appreciate initials e.g
Abdullahi, I.I.
2. The date of the work cited should be written in bracket e.g immediately in
front of the last initial of the author e.g Abdullahi I.I (2011).
3. Ensure that the surname of authors are arranged alphabetically in line
with APA style.
4. When references are from journal article the name of the journal should
be under line, the volume of the journal as well as the number of the
pages from the journal should be indicated.
33
When references come from text book, the name of the publishing company of
the book should be stated.