Methods of Animal Experimentation
Methods of Animal Experimentation
ng
COURSE DETAILS:
COURSE DETAILS:
Course Coordinator: Dr. O. Olowofeso, (B. Agric. Tech., M. Sc., Ph.D., RAS)
Email: [email protected]
Office Location: Room 6, COLANIM
Other Lecturers: Prof. M.O. Ozoje
COURSE CONTENT:
COURSE REQUIREMENTS:
This is a compulsory course for all final year students in the College of Animal Science
and Livestock Production (COLANIM). The course must be registered for and passed
before students can graduate in any of the five major departments in COLANIM. To do
well in the course, students must have the basic knowledge of mathematics and
elementary statistics. It is also expected that all students must have a very good scientific
calculator and statistical table. Attendance in class is mandatory and students must take
part in all tests and final examination. The minimum score required as pass mark at the
end of the semester for the course is 40%.
READING LIST:
1
http://www.unaab.edu.ng
LECTURE NOTES
2
http://www.unaab.edu.ng
3
http://www.unaab.edu.ng
These categories of data are cheap. But it is often collected without any definite purpose
objectives or focus. The planning may be quiet haphazard, the design or protocol
collection may not promote objectivity while the instrument of data collection may not be
comprehensive, precise and focused enough to permit credible statistical analysis. The
research design is often inadequate or inappropriate.
1.2.4 SURVEY
Surveys are investigations in which the surveyor obtains information about characteristics,
opinions, attitudes, tendencies, activities or operation of the individual unit of the
population. A population is defined as the totality of the units being surveyed or
investigated. For example, a population of households in a rural economic survey, a
population of farms in a farm survey, a population of persons living in a country, a
population of plots in farm survey, a population of trees in a plantation reserve. There are
two types of survey – namely census and sample survey. A census is a complete
enumeration in which every unit in the population is observed, they usually consume large
amount of time, human and material resources. A sample survey involves observation of
only a selected sample of the eligible population units. Sample surveys are cheaper but the
sample must be relevant, objective and representative to have any meaning.
Non statistical sample are subjective, sample collections are non-random. A most
immediate example is quota sample, which is a common practice in opinion surveys. A
probability sample may be subjected to inferential statistical analysis. But non-statistical
samples would not permit application of probability concepts which elements form the
basis of inferential statistics. Thus such samples can only be described: no generalization
for the target population is valid.
6
http://www.unaab.edu.ng
Answers to these questions constitute the essential elements of experimental design. All
biological, materials vary; even identical twins treated the same way will have different
weaning weights. The variation is caused by many factors and the role of statistics is to:
i. Measure the amount of random variation due to inherent variability of the
materials
ii. Measure non-random, systematic variation ascribable to treatment effects.
iii. Measure the latter in relation to the former to assess the genuineness of treatment
effects. One of the procedures for this kind of investigation is experimentation, that
is comparative experimentation, which can be achieved through a process known
as experimental design and which may involve feeding trails, cultural practices,
pest control measures, perennial crops, livestock and pasture, fertilizers, plant
population/ spacing, crop rotations, intercrossing etc.
Plot and block structures are constructed to ensure minimum plot variability and
maximum precision and efficiency units so dictate. Blocking is done so that units within a
block are as similar as possible e.g. animals of similar weight, age, previous milk yield, or
units of the same “fertilizer status” etc. Blocking or stratification is used routinely in
agronomy trials but this should not be so.
Effective blocking increases precision. Treatment comparisons are made within blocks;
hence variation between blocks is eliminated. Conclusions are valid over the range of
situations represented by different blocks.
b. The median - the middle values when you arrange from the lowest to the highest.
When you have 2 values as the ones in the middle, the 2 values must be averaged
to get the median.
c. The mode - the most frequently occurring values among all the observation is a
sample. Some distributions have more than one mode. A distribution with one
mode is referred to as unimodal, 2 is bimodal, 3 is trimodal etc.
Hypothetical Example 1
= sum of squares
x = 12.6 = 1.8g
7
Mean deviation
x n- x
i = 2.4
7
= 0.34g
2
S
n
X i X
2
1.12 g 2
0.1867 g 2
i 1 n 1 6
S= 0.1867g2 = 0.43g
Sample 2.
= sum of squares
x = 12.6 = 1.8g
7
S2
n
X iX
2
0.82 g 2
0.1367 g 2
i 1 n 1 6
S= 0.1367g2 = 0.37g
2. The mean deviation – The sum of all deviations from the means i.e. (xi - x )
will always equal zero, however, such summation will be useless as a measure of
10
http://www.unaab.edu.ng
dispersion. Summing the absolute values of the deviation from the means results in a
quantity that is an expression of dispersion about the mean. Dividing this quantity by n
yields, a measure known as the means deviation or mean absolute deviation of the sample.
It has the same unit as that of the data.
S2 i
X X 2 ............................................(5)
n 1
If in equation 4, we replace µ by x and N by n, the result is a quantity that is a biased
estimate of σ2. The dividing of the sum of squares by n-1 (called the degree of freedom
abbreviated Df) rather than n, yields an unbiased estimate. It is equation 5 that should be
used to calculate the sample variances. The calculation of S2 can be tedious for large
samples but it can be facilitated by the use of equality.
Sample 1
Xi (g) X2i (g2) xi (g) X2i (g2)
1.2 1.44 1.2 1.44
1.6 2.56 1.6 2.56
1.7 2.89 1.7 2.89
1.8 3.24 1.8 3.24
1.9 3.61 1.9 3.61
2.0 4.00 2.0 4.00
2.4 5.76 2.4 5.76
xi=12.6g x i=23.50g2
2
xi=12.6g x i=23.50g2
2
n=7 n=7
11
http://www.unaab.edu.ng
S2 = SS V = 0.37g
n-1 1.8g
S 0.43g
V 0.24 24%
X 1.8 g
The variance has square units. If measurements are in grams, their variances will be in
grams squared or if measurements are in cubic centimeters, their variance will be in terms
of cubic centimeters squared even though such squared units have no physical
interpretation.
4. The standard deviation is the positive square root of the variance therefore, it has
the same units as the original measurements. Thus for a population.
X i
2
– (x i )2
S= N
N
X i
2
– (x i )2
S= n
n-1
S S S
V orV 100% sin ce is generally a small quantity, it is
X X X
S SD Standard deviation
12
http://www.unaab.edu.ng
Example 1
A plant geneticist may raise 100 progeny from a cross that is hypothesized to result in a 3:
1 phenotypic ratio of 84 yellow: 16 green is observed, although out of this total of 100
plants, the geneticists hypothesis would predict a ratio of 75 yellow: 25 green. The
question to be asked, then is whether the observed frequencies (84 and 16) deviated
significantly from the frequencies expected if the hypothesis were true (75 and 25). The
statistical procedure for attacking the question first involves the concise statement of the
hypothesis to be tested. The hypothesis in this case is that the population which was
sampled has 3:1 ratio of yellow-flowered to green-flowered plants. Statistically, this is
referred to as null hypothesis (abbreviated Ho) because it is a statement of “no difference”
in this instance, we are hypothesizing that the population flower colour ratio is not
different from 3:1. If it is concluded that Ho is false, then an alternative hypothesis
(abbreviated HA) will be assumed to be true. In this case, H A would be that the population
sampled has a flower colour ratio which is not 3 yellow: 1 green. One states a null
hypothesis and an alternative hypothesis for every statistical test performed and all
possible outcomes are accounted for by the two hypothesis.
k
X2 = (fif- fi)
i-1
2
X2 = 2
(fif- fi)
i-1
= (84-75)2 + (16-25)2
75 25
92 + 92
75 25
13
http://www.unaab.edu.ng
= 1.080 + 3.240
= 4.320
0.025 < P < 0.05
Therefore, reject Ho
Example 2
Since calculated X2 is less than X2 .05. (4-1) = 7.815, we do not reject the hypothesis i.e. the
observed results are in agreement with the expected 9:3:3:1 ratio.
Where Ri is the total frequency in row i, Cj is the total frequency in column j and n is the
total sample size. The number of degrees of freedom in a contingency table is
given as
d.f.= (r-1) (c-1)
Where r= number rows and C = number of columns in the table,
For example, expected cell frequencies are as follows
14
http://www.unaab.edu.ng
The formula for the Chi-square test statistic for contingency tables is
x2= ΣiΣj (Observed frequency – Expected frequency)
Expected frequency
It is very important to realize that a true null hypothesis occasionally will be rejected
which of course means that we have committed an error. Moreover, this error will be
committed with a frequency of α. That is, if Ho is in fact a true statement about a statistical
population, it will be concluded (erroneously) to be false 5% of the time. The rejection of
a null hypothesis when it is in fact true is a Type 1 error (also called an error, or an α error
of the first kind). On the other hand, if Ho is in fact false, our test may occasionally not
detect this fact, and we shall have reached an erroneous conclusion by not rejecting Ho.
This error of not rejecting the null hypothesis when it is in fact false is a Type II error
(also called a β error, or an error of the second kind).
The two types of errors in hypothesis testing
If Ho is true If Ho is false
If Ho is rejected Type I error No error
If Ho is not rejected No error Type II error
15
http://www.unaab.edu.ng
x1 = 83.9857 x2 = 63.0143
S1 = 5.291 S2 = 12.729
t=x– c
Sx
Which is compared with t a from a table of the t- distribution
16
http://www.unaab.edu.ng
= S2 p + S2 p
n1-1 n2-1
Where n1 and n2 = the total number of observations used to compute the first and second
means, respectively and
S2p = Pooled sample variance
Since this t is greater than ta, n1 + n2 – 2 or t(0.05,12) = 2.179. we reject Ho and conclude that
the two means are significantly different. In other words, creep fed calves had
significantly higher weaning weights than non-creep fed calves.
3.0 CORRELATIONS
When pairs of observation are available from a bivariable normal distribution, the
researcher may be interested in determining the strength of the relationship between the
two variables concerned. The quantity that describes this is the correlation coefficient. Its
sign indicates the direction of the relationship. Such relationship as between body weight
and egg products, body weight gain and feed consumption, milk yield and butterfat
percent, body weight and egg weight, body weight and height at withers. Shell thickness
and % calcium in plasma, sperm concentration and fertility and so on are of considerable
interest to the Animal Scientist.
r= Σxy – (Σ x) (Σy)/n
√Σx2 – (Σx)2 √Σy2 – (Σy)2
n n
Compute the sample correlation coefficient from the following data on body weight (BW)
and shank length (SL) of 20 six week old broilers.
17
http://www.unaab.edu.ng
1486 1480.36
0.52
3.3491X 34.56
Sr = 1 – r2
n–2
= 1 - (0.52)2 = 0.041
18
To test whether the correlation coefficient was significant or not, we compute the t statistic
t= r = 0.52 = 12.68
Sr 0.041
And compare with t , n-2, here t(0.05,18) = 2.101. We conclude that the correlation was
highly significant P< 0.01.
18
http://www.unaab.edu.ng
DATA TABLE
Treatment 1 2 3 4 5 Total Yi
1 Y11 Y12 Y13 Y14 Y15 Y1
2 Y21 Y2
3 Y3 Y3
4 Y45 Y4
5 Y53 Y5
Total Yj Y4 Y
Y1 = Sum of observation on the first treatment
Y2 = Sum of observation on the second treatment
19
http://www.unaab.edu.ng
ANOVA TABLE
Source of d.f s.s m.s F-ratio
variation
Total a.f.m 11 331-290=41 3.7 -
Treatment 3 4.2 1.4 14<1 = 46
0.304
Residual 8(n-t) (41-4.2)36.8 4.6
20
http://www.unaab.edu.ng
ii. We decide on the level of significance (not given) and find the tabulated F-
ratio from the Table.
iii. We compare the tabulated F- ration with the computed F – ratio.
iv. Conclusion. We accept the null hypothesis if the F-ratio calculated is less
than the F-ratio tabulated. Otherwise, we accept the alternative hypothesis.
In the above example for the one-way classification, we are testing for the
treatment effect and our null hypothesis will be:
Ho: t1 = t2 = t3 = t4 = 0
i.e. there is no significant difference between the treatments or that the
treatment has no effect.
The alternative hypothesis is
HA:µ = µ = i
At (level of significance) = 5%
F0.05 (3,8) = 4.07
F computed = 14
46
Therefore, FC< FT, we accept the hypothesis that there is no treatment effect.
Where Pj = the blocking effect. The basic assumptions underlying the equation are the
same with the one-way classification.
The working formulas for the degree of freedom, sum of squares, mean squares, and F-
ratio are given in the table below:
Correction term C.T. = (ΣYij2)
N
Source of Degree of Sum of squares Mean square F-ratio
variation freedom
n
Total adjusted n - 1 - -
ΣYij2 – CT
for the mean ij
21
http://www.unaab.edu.ng
CT = Correction term
N = total number of observation
B = number of blocks
T = number of treatments.
Numerical Example:
Given the data table used in the one-way classification, the ANOV table is given below.
Source of Degree of Sum of squares Mean square F-ratio
variation freedom
Total adjusted 11 41 3.7
for the mean
Treatment 3 4.2 1.4 14<1= 8.5
16
Block 2 27.25 13.65
Residual 6 9.55 4.6
Test of hypothesis.
For blocks.
H0:p1 = p2 = p3 i.e There is no block effect
HA: p1 = p1: 1 = 1 i.e There is block effect.
22
http://www.unaab.edu.ng
The Treatment by Block interaction (i.e. T X B) is a sort of Residual. From the table,
when dealing with treatment, what we want to test is rso2 y T i.e whether the effect due to
treatment is significant or not (or to find out whether it is 0 or not, thus we find from the
chart or ANOVA Table, under E(MS) one equation which contains σ2ε and sσ2 TP to make
σ2 ε & σ 2T = 1 and the error or equation to use is TXB effect.
This E (MS) of the TXB interaction is used to test the E (MS) of both the treatment
and the block, effect, this is because the E (MS) of TXB interaction contains both σ2ε &
SσTp 2 i.e. the overall error and the joint error respectively which occur in both the
treatment effect which is Sσ 2Tp, we use the Residual error which is σ 2ε.
The E (MS) is a guide to enable one tell what to test what.
Note:
σ 2ε = Overall effect
SσTp 2 = joint effect
rso2τ = Treatment effect alone
St σ 2 = block effect alone
CT = Y…..2
Rts
1 2 3 4 Yi
1 A B C D
2 D A B C
3 C D A B
4 B C D A
Y.j Y..
From the above chart, there must be 1 treatment/row/column i.e. all treatments must be
represented in every row and column in such a way that no treatment occurs twice in any
row or column.
In the above treatments A, B,C,D are observations.
The column and row totals are computed as Yj & Yi respectively.
For the Treatment total Tr T, there will be one for
AT = A1 + A2 + A3 + A4
BT = B1 +B2 +B3+B4
CT = C1 + C2 +C3 + C4
DT = D1 + D2 +D3 +D4
i.e. pick all the As i.e. A1+A2+A3+A4 = ass up for AT. Same for B, C, D.
Once this is done the ANOVA is simple.
Source Degree of Sum of squares Mean E(MS) F-ratio
freedom square
Total n= r x c TSS - -
adjusted for n - 1 Σ Yij2 – CT n-1
the mean
Row r-1 Σ Yi..2 – CT RowSS σ2 ε + cσ2p RowMS
C r-1 RMS
Column c-1 Σ Yi..2 – CT ColSS σ2 ε + σ 2 CMS
j r c-1 RMS
Treatment t -1 Σ Yj2 – CT Tr. SS σ2 ε + σ 2 Tr.MS
j r (t-1) RMS
Error n- r – c- t By substitution Substitution σ2 ε -
+2
* Yt = YtA + Ytb + Y
Summary by spacing
A (2”) = 203 + 283 + 279 + 334 + 250 = 1,349
B (4”) = 257 + 252 + 266 + 280 + 259 = 1,314
C (6”) = 231 + 204 + 280 + 287 + 260 = 1,262
D (8”) = 245 + 271 + 227 + 246 + 202 = 1,191
E (10”) = 182 + 230 + 245 + 195 + 228 = 1,188
Calculations:
C.T = Y.02 = (6,304)2 1,589,617
rc 25
Total = Yij2 -CT = (257) 2 +……….+ (338)2 - CT = 36,571
Row = Yi2 -CT = (1,255) 2 +……….+ (1440)2 - CT = 13,601
C 5
24
http://www.unaab.edu.ng
r
Residual = 36,571 – (13,601 + 6146 + 4156) = 12,668
In carrying out scientific experiments we are very often interested in comparing the effects
of certain treatments on the elements we are investigating. Alternatively, we may concern
ourselves with the determination of some properties of our elements of interest. In
carrying out the investigation, we will discover that certain factors will affect our results.
Some of these factors are controllable while others are not. By designing the experiment in
a particular way we may be able to observe the factors that we can control as against those
we cannot control and thereby be able to make inferences on them. The right design has to
be employed in any experiment if the results are to give valid conclusions. However, what
type of design we employ will depend on the particular experiment and on the
characteristics we are investigating.
To be able to carry out a good experiment the following conditions are important.
(1) Absence of systematic error: this implies that the experimental units which
are designed to receive one treatment should not be different in any
systematic way from those receiving other treatments. If they do, a source
of variation (systematic error) is introduced and this will affect the validity
of inferences made from the result of the experiment.
(2) Precision: The treatment comparison must be as precise as possible. By a
measure of precision of the estimate, e.g. mean X however, precision
refers to a measure of how close to the set of possible sample estimates for
a particular sample design may be expected to come to.
(3) Range of validity: the experiment must allow for a wide range of validity
on the conclusion. This means that as the units for the experiment is carried
out; the wider is the range of validity of the conclusions.
(4) Simplicity: As much as possible, the experimental arrangement should be
simple. If the experiment is not simple, them the procedure might be
difficult to follow especially if the experiment is to be carried out by
unskilled people.
25
http://www.unaab.edu.ng
(5) Also, for a good experiment, the error must be assessable. We must be able
to measure the amount of uncertainty involved in the experiment. This
means that we must have a set of experimental units responding
independently to the same treatment and such units must differ only
randomly from the units of other treatments.
(6) The experiment must not only be simple, precise and its error measurable,
it must also be cheap.
In the completely randomized design, the experimental units are numbered and the
treatments are randomly assigned to them. This means that all the units will have the same
chance of being given or not given a particular treatment. The one way classification is
used in the analysis of variance for this design.
Disadvantages
The mean disadvantage of this design is that it does not take cognizance of other source of
variation. Any variation is assumed to be due to be treatments applied. Where the
experimental units are not homogenous (apart from treatments) this can lead to erroneous
conclusions.
The first step in the design is to group the experimental units into blocks. The main
purpose of grouping is to make sure that all the units in a block are as uniform as possible
so that observed variation between blocks will be due to the treatment effect. After the
blocking, the treatments are then randomly applied to the units of each block. For
example, we might be interested in observing the effects of various feed components
(treatments) on the weight gain of cattle. If we have a herd of cattle consisting of animals
26
http://www.unaab.edu.ng
of difference age group variations in the weight gain by the animals might not be due to
the differences in age. To be able to observe the variation due to age, it then becomes
necessary first to group the animals according to age before applying the various feeds.
The two-way classification is used in the analysis of randomized complete block design.
5.3 Factorial
In factorial analysis we deal with factors.
Let us look at a two-factor factorial.
Let Factor A = Variety (2 varieties)
Factor B = Fertilizer (N) (2 levels of N. fertilizer).
In this case we talk of a 2 X 2 or 22 factorial.
If we call the first level of factor A 0 and its 2nd is 1, we therefore have.
0 0 00
1 01
1 0 10
1 11
A N O VA
Source d.f
Total 3
A 1
B 1
Residual AB 1
Let us assume that we planted something at the green house and the same thing is repeated
on the farm, i.e. there now a 3rd factor which is location, we would then have 3-factor
factorial.
Thus A = Variety (3) 15 15 15
B = Fertilizer NPK (3) 15 20 15
C = Location (2) 15 30 15
This is a 32 by 2 (3x3x2) or 32 x 2factorial. We can make a design for this type of factorial
thus:
A factorial experimental is one in which an equal number of observations is made for all
possible combinations of the levels.
Variety 0 1 2
Fertilizer 0 1 2 0 1 2 0 1 2
Location 0000 010 020 100 110 120 200 210 220
1001 011 021 101 111 121 201 211 221
After this, randomize following the same procedure
27
http://www.unaab.edu.ng
ANOVA
Source Df SS MS
3 3 3
Total 3 x 3 x 2 -1 = 17
ΣΣΣ y2 ijk – CT
i j k
3
Variety 3 -1 = 2
Σ yi2 – CT
i =1
6 j k
Fertilizer 3–1=2 Σ y.j2 – CT
i
6
V&F (3- 1)(3-1)
(2) (2)
=4
0 1 2
0 00 10 20 Y.1
1 01 11 21 Y.2
2 02 12 22 Y.3
Y1… Y2… Y3… Y….
Factorial experiment is one designed to examine the effect of one or more factors, each
factor being applied at two levels at least so that differential effects can be observed. It
investigates all possible treatment combinations which may be formed from the factors
under investigation. The level of a factor denotes that intensity with which it is brought to
bear. It may be measured quantitatively as when fertilizer is applied to plots in a given
wt/unit area or qualitatively as when patients are considered two levels inoculated and not
inoculated.
If we have 2 factors A and B each with 2 levels-0,1 and 0, 1 in effect we shall have 4
treatments -00, 01,10, and 11 as seen on the chart.
2 X 2
A B
0 1
0 00 01
1 10 11
If this supposed experiment is repeated 4 times so that we have 4 blocks –or replicates as
shown in the table below:
Treatments
00 01 10 11
1
Block 2
3
4
28
http://www.unaab.edu.ng
ANOVA
Source d.f SS MS F-ratio
Total 15(16-1)
Blocks 3(4-1) MSn MSB
MSE
Treatments 3(4-1) MSr MST
MSE
Factor A 1 MSA MSA
MSAB
Factor B 1 MSB MSB
MSAB
A and B 1 MSAB MSAB
MSE
Residual 9 (15-3-3) MSE
In calculating the F-ratio, we test the blocks and treatment with the Residual which is
above (9). In calculating the F-ratio for the Factors A&B, we use the MS for AB to test
for factors A,B while we use the overall residual to test for the AB.
Example
N = 2 levels
K = 2 levels
Coded yield of maize
Replicate Treatment
00 01 10 11 Total
1 2 3 3 4 12
2 5 7 4 5 21
3 2 4 5 6 17
4 4 3 2 1 10
Total 13 17 14 16 60
N2 = A
P
0 1
0 00 10 27
13 14
1 01 11 33
17 16
30 30 60
=B
ANOVA
RSS = 122 + 212 + 172 + 102 + 602
4 4 4 4 4
TSS = 132 + 142 + 172 + 162 + 602
4 4 4 4 4
ASS = 302 + 302 + 3600 = 0
4 4 16
29
http://www.unaab.edu.ng
Source Df SS MS F
Total 15 19 - -
Replicate 3 10.5 3.5 3.5/7 Significant
Treatment 3 2.5 0.83 0.83/0.7
A 1 0 0.0 0
B 1 2.2 2.2 2.2/0.3 Significant
AB 1 0.3 0.3 2.2/0.7
Residual 9 6 0.7
0 100 8 treatments
0 1 101
0
0 0 110
1 111
Here there are 23, 3 factors each at 2 levels and thus we have 8 treatments. If we assume
there are 5 replicate each with 8 treatments.
1 2 3 4 5 6 7 8 Y1
Replicates 1
2
3
4
5 40
Y5 Y
30
http://www.unaab.edu.ng
In model 1- we use the Residual to test all eff4ects; in models II and II we use the
expected mean square as a guide for determining appropriate denominator for testing.
The ANOVA table is as follows.
ANOV TABLE
Source d. f SS MS F-ratios
Total 39
Replicate. 4
Treatment 7
A 1
B 1
AB 1
C 1
AC 1
BC 1
ABC 1
Residual 28
A 0 1
B 0 1 0 1
C 1 0 1 0 1 0
ABC 001 10 011 100 101 110 111
000
Exercise:
1. Explain what we mean by Factorial Design
2. Differentiate between Factorial design and completely randomized design.
A B C D E A E D C B
E A B C D B A E D C
D E A B C C B A E D
C D E A B D C B A E
B C D E A E D C B A
We must now randomize such that the restrictions above still hold. The randomization can
be by column or by row. This can be done by changing the column to row. The analysis of
variance for the Latin square Design is the same as done in the past
The Latin square design is like randomized complete block treatment but different
in that no one treatment must appear twice in the same row or column. Thus the Latin
square is a diagonal effect and vertical effects.
31
http://www.unaab.edu.ng
A 3 levels
6 treatments
B 2 levels
For the square design, we shall have 6 rows, 6 columns and 6 treatments.
A O 1 2
B O 1 0 1 0 1
OO O1 10 11 20 21
We assume that factor A requires a large piece of land while factor B does not. We can
divide the whole piece of land into Blocks e.g. the piece of land is divided into 3 blocks.
Each block can be divided up into whole plots and each whole plot is divided again into
sub plots. This processed will happen to each block.
Whole
A C D
Sub Plot a1
a2
a3
a4
If the factor A is the methods of land preparation and each factor A is divided into 4 plots
we have a1,a2 , a3, a4. while factor B is the varieties of crop (maize) to plant and because
there are 2 levels of B or 2 Sub plots per whole we have b1, b2. we randomize factor A in
to each whole plot. i.e. each whole plot is now a1, a2, a3, a4. after this randomization of
factor A among whole plots we now randomize factor (i.e. Varieties) into the sub plots.
Thus this is a 2- stage randomization procedure.
A 4 - a1, a2, a3, a4.
B 2 - b1, b2
This shows that each factor A/ level will contain all the levels of factors B. i.e. a1 contains
b1 & b2. in considering the yield in a b1.. The yield is thus made of
Y ij =
(2) It incorporates the principles of R.C.B. design e.g. if there ae 2 factors A &
B and A has 4 and B 2 levels there will be 8 treatments. In the R.C.B. design the 8
treatments will be randomized completely c within in the blocks, but in the split plot
design, the factors will be split into A and B and one factor will be randomized within the
other factor i.e. B. within A.
ANOVA
Sources D.f
Y Total 23
T Replicates 2
P whole plots 3
S Error a 6
B Sub plots 1
(rp) Interaction 3
Eijk Error (b) 8
8.0 REGRESSION
Regression is a technique for quantifying the relationship among variables, one of which
(the response or dependent variable) is known to be functionally dependent on one or
more other variables called explanatory or independent variable:
33
http://www.unaab.edu.ng
From sample data, numeric values for the statistics can be obtained by solving a set of
normal simultaneous equations in two unknowns. However, easy-to-use formulae are
available for hand computations.
b1 = Σxy – (Σx. Σx)/n
Σx2 – (Σx)2/n
and
bo=y – b1x
If it sis known apriori that will be zero when x is zero, then regression is done through the
origin. The model becomes:
y1 = bx1 + e1
Simple estimated b y:
b = Σxy
Σx2
X Y XY X2 Y2
1 0.06 0.06 1 0.0036
2 0.10 0.02 4 0.01
3 0.19 0.57 9 0.0361
4 0.34 1.36 16 0.1156
5 0.47 2.35 25 0.2209
6 0.63 3.78 36 0.3969
7 0.84 5.88 49 0.7056
8 1.09 8.72 64 1.1881
2 2
X 36 Y 3.72 XY 22.92 X =204 Y =2.6768
b= 22.92 – (36)(3.72)/8
204-(36)2/8
=6.18/42 = 0.147
b0= 0.465 – (0.147)(4.5)= -0.197
Hence, our model becomes
y1 = -0.197 + 0.147x
The line corresponding to this equation is obtained by plotting the (xi,yi) points.
The regression coefficient obtained is interpreted to mean that body weight was increasing
by 0.147kg per week on the average. In other words, the
Total SS = Σy2 – (Σy)2/n
Σy2
34
http://www.unaab.edu.ng
SOURCE D.F. SS MS F
Total 7 0.947
Regression 1 0.909 0.909** 151.5
Residual 6 0.038 0.006
The analysis shows that the regression is highly significant (P< 0.01)
The standard error of the regression coefficient is given as:
S.E (b)= Sy.x/ √Σ(x1 – x)2
Where Sy. x = Residual MS or
One commonly used criterion for determining the adequacy of a fitted regression model is
the R2 or r2 (coefficient of multiple determination) or 1-R2 (coefficient of allenation). R2 is
the proportion of the variability in Y explained by the fitted regression model.
R2 = Regression SS
Total SS, corrected for y
And R, the correction between x and y = +√R2 -. The sign will correspond to the sign of
the value Σxy – (Σx)( Σy)/n the square of which is the numerator of regression SS. in our
example:
R2 = 0.909 = 0.96 or 96%
0.947
and R = 0.98
9.0 SAMPLING
36
http://www.unaab.edu.ng
37