ABE 057 Manual
ABE 057 Manual
0
TABLE OF CONTENTS
1
MODULE – I. Introduction to Design of Experiments
1.1 Four Eras in the History of DOE
Here's a quick timeline:
If we had infinite budgets there probably wouldn't be a big fuss made over designing
experiments. In production and quality control we want to control the error and learn as much as
we can about the process or the underlying theory. From an engineering perspective we're trying
to do use experimentation for the following purposes:
2
The practical steps needed for planning and conducting an experiment include: recognizing
the goal of the experiment, choice of factors, choice of response, choice of the design, analysis
and then drawing conclusions. This pretty much covers the steps involved in the scientific
method.
2. Efficiency – conduct the investigation as efficient as possible, that is, every effort
should be made to save time, money, personal and experimental materials. Fortunately,
most simple design is also efficient both statistically and economically.
3
our experimental data and ensures that the data are free from any systematical
error.
3. Blocking (Local Control) – refers to grouping of the experimental units in
such a way that the units within the group are more homogenous than are units
in different groups.
"All experiments are designed experiments, it is just that some are poorly designed and a some
are well-designed."
- Simplicity
- Degree of precision
- Absence of systematic error
- Range of validity of conclusion
- Calculation of degree of uncertainty
4
a. Replication
b. Randomization
2. Control of Error
- The ability to detect the existing differences among
treatments increases as the size of the experimental error
decreases.
Three Techniques of Controlling Error
a. Blocking
b. Proper Plot Technique
c. Data Analysis
3. Proper Interpretation of Results
1.9 Terminology
Experiment – planned inquiry
Treatment – procedure whose effect will be measured
Factor – class of related treatments
Levels – states of a factor
Variable – measurable characteristic of a plot
Experimental Unit – a unit which a treatment is applied(plot)
Sampling Unit – part of experimental unit that is measured
Experimental Error – variation among experimental units that are treated alike
5
MODULE – II. Single – Factor Experiments
Experiment in which only a single factor varies while all others are keep constant.
2.2 Two (2) Groups of Experimental Design that are applicable to a Single –
Factor Experiments:
1. Complete Block Design – small number of treatment characterized by blocks, each of
which contains at least one complete set of treatments.
Three (3) Complete Block Design
1. Completely Randomized Design(CRD)
2. Randomized Complete Block Design(RCBD)
3. Latin Square Design(LSD)
2. By Drawing Cards
Steps
1. Draw n cards
2. Rank n cards(2 = lowest, A = highest)
3. Assign the t treatments to the n plots
3. By Drawing Lot
6
2.4 Completely Randomized Design (CRD)
- Simplest and least restrictive
- Every plot is equally likely to be assigned to any treatment
- Appropriate for experiments with homogenous units, such as laboratory
experiments
1. Flexibility
- Any number of treatments and any number of replications
- Don’t have to have the same number of replications per treatment
2. Simple Statistical Analysis
3. Missing plots do not complicate the analysis
4. Maximum error degrees of freedom
Steps;
1. Determine n (n = rt), where r = replications, t = treatments
2. Use any of the randomization scheme
3. Assign plot number to each experimental plot
4. Assign the treatments to the experimental plots
Steps
1. Group the data by treatments and calculate the treatment totals (T) and grand total (G)
2. Construct an outline of the ANOVA
7
Outline ANOVA with Corresponding Formula
Treatment t–1 T 2
CF
TrSS
TrMS
ErMS
r t -1
ErSS
Experimental ToSS – TrSS
t(r – 1) t (r-1)
Error
3. Correction Factor(C.F) = G2
n
4. Obtain the tabular F values from Appendix E
f’1 = treatment df (t -1) (numerator df)
f’2 = error df (t(r -1)) (denominator df)
5. Enter all the values computed in step 2 using the formula in the outline ANOVA
6. Compare the computed F values with the Tabular F values
Where:
TrSS = Treatment Sum of Squares r = number of replications
ToSS = Total Sum of Squares t = number of treatments
TrMS = Treatment Mean Square T = treatment total
ErMS = Error Mean Square X = Variables (measurement of the ith plot)
1. If the computed F value is larger than the tabular F value at the 1% level of
significance, the treatment difference is said to be highly significant such as a
result is generally indicated by placing two asterisks on the computed F value
in the ANOVA.
2. If the computed F value is larger than the tabular F value at the 5% level of
significant but smaller than or equal to the F value at the 1% level of
significance, the treatment is said to be significant, indicated by placing one
asterisk on the computed F value in the ANOVA.
3. If the computed F value is smaller than or equal to the tabular F value at the
5% level of significance, the treatment difference is said to be nonsignificant
and indicated by placing ns on the computed F value in the ANOVA.
8
2.4.8 Numerical Examples
Table 1. Plant height result for use of different treatments from a CRD experiment with
5 replications and 5 treatments.
Treatment Treatment
Treatment Plant Height (cm)
Total (T) Mean
A 8.2 8.7 9.4 9.2 9.0 44.5 8.9
B 7.7 8.4 8.6 8.0 8.1 40.8 8.2
C 6.9 5.8 7.2 6.8 7.4 34.1 6.8
D 6.8 7.3 6.3 6.9 7.1 34.4 6.9
Control 5.3 6.3 6.7 7.1 5.6 31.0 6.2
Grand Total
184.8
(G)
Grand
7.4
Mean
Table 2. Growth data recorded for the Plants that survived in a CRD experiments with 4
replications and 5 treatments.
Treatment Treatment
Treatment Plant Height (cm)
Total (T) Mean
A 8.2 8.7 9.2 9.0 35.1 8.78
B 7.7 8.4 8.0 8.1 32.2 8.05
C 6.9 5.8 6.8 7.4 26.9 6.73
D 6.8 6.9 7.1 20.8 6.93
Control 5.3 6.3 7.1 5.6 24.3 6.08
Grand Total
139.3
(G)
Grand Mean 7.31
9
2.4.9 for CRD with Unequal Replication
The same procedures with CRD equal replication, except that of the following;
t
T2
TrSS = CF
i 1 ri
Note:
The cv indicates the degree of precision with which the treatments are compared and is good
index of reliability of the experiment.
- For rice 6 to 8 % for variety trials, 10 to 12 % for fertilizer trials, 13 to 15 % for herbicide and
insecticide trials.
10
2.5 Randomized Complete Block Design
- One of the most widely used experimental designs in Agricultural research.
- Suited for field experiments where the number of treatment is not large.
- Area has predictable productivity gradient.
- Blocks of equal size each contain all the treatments.
1. The selection of the source of variability to be used as the basis for blocking.
Ideal source of variation to use as the basis for blocking is one that is large and
highly predictable.
1. Unidirectional Gradient
Long and narrow blocks
Block is perpendicular to the direction of gradients.
2. Fertility gradient occurs in two direction with one direction much stronger than the
other, ignore the weaker gradient and follow the preceding guideline for the case of
the unidirectional gradient.
3. Fertility gradient occurs in two direction with both gradients equally strong and
perpendicular to each other.
Alternatives:
Square blocks
Long and narrow blocks with their length perpendicular to the direction of one
gradient use Covariance.
Use Latin Square Design with two – way blockings, one for each gradient.
2.5.3 Randomization
Applied separately and independently to each of the blocks.
1. Group the data by treatments and replications and calculate treatment totals (T),
replication totals (R), and grand total (G).
12
2. Outline the analysis of Variance.
Outline ANOVA
Source of Degree of Means Tabular F
Sum of Squares Computed F
Variation Freedom Squares 5% 1%
Replication r–1 R 2
CF
RSS
t r-1
T 2
ErMS
Error (r -1)(t -1) ToSS –RSS - TrSS (r 1)(t 1)
ΣX2 – CF
Total rt - 1
G2
3. Compute C.F. =
rt
4. Compute for the degree of freedom, Sum of squares, Mean Squares and F – value.
Follow the formula in outline ANOVA.
5. Enter all values computed in the ANOVA.
6. Compare the computed F – value with the Tabular F – value.
7. Compute for cv.
Where:
TrSS = Treatment Sum of Squares r = number of replications
ToSS = Total Sum of Squares t = number of treatments
TrMS = Treatment Mean Square T = treatment total
ErMS = Error Mean Square X = Variables (measurement of the ith plot)
RSS = Replication Sum of Squares R = Replication Total
2. Relative Efficiency
13
RE = ( r – 1 ) Eb + r( t – 1 ) Ee
( rt – 1 ) Ee
Where:
Eb = replication mean square
Ee = error mean square
Note:
If the error df is less than 20, the RE value should be multiplied by adjustment factor k
defined as:
k
(r 1)(t 1) 1t (r 1) 3
(r 1)(t 1) 3t (r 1) 1
Table 2.6 Data on Stem and Root Rot of Cucumber under Trichoderma inoculation
with Different Levels of Inoculumns (%).
Replication
Trichoderma Level Mean
Total
(GM/16L H2O) 1 2 3
Table 2.6a Analysis of Variance on Stem and Root Rot of Cucumber under
Trichoderma inoculation with Different Levels of Inoculumns (%).
14
Source of Degree of Sum of Mean of Computed Tabular F
Variation Freedom Squares Squares F 5% 1%
Replication 2 15.64 7.82
Treatment 4 43.78 10.95 0.78ns 3.84 7.01
Error 8 112.59 14.07
Total 14 172.01
ns = not significant
cv = 18 %
Advantage:
– Allows the experimenter to control two sources of variation
Disadvantages:
– Error df is small if there are only a few treatments
– The experiment becomes very large if the number of treatments is large
– The statistical analysis is complicated by missing plots and misassigned
treatments
Suppose you had four feeds you wanted to test on dairy cows. The feeds would be tested
over time during the lactation period
This experiment would require 4 animals (think of these as the rows)
There would be 4 feeding periods at even intervals during the lactation period beginning
early in lactation (these would be the columns)
The treatments would be the four feeds. Each animal receives each treatment one time
only.
2.6.3 Uses in Field Experiments
15
Practically speaking, use only when you have more than four but fewer than ten
treatments
– a minimum of 12 df for error
Steps :
1. Select a LS plan from Appendix K
2. Randomize the row arrangement of the plan
3. Rank from lowest to highest
4. Rank as existing row numbers and sequence as row number of the new plan.
5. Randomize the column arrangement
2.6.5 Analysis
1. Set up a two-way table and compute the row and column means and deviations
2. Compute a table of treatment means and deviations
3. Set up an ANOVA table divided into sources of variation
– Rows
– Columns
– Treatments
– Error
4. Significance tests
– FT tests difference among treatment means
– FR and FC test if row and column groupings are effect
ΣX2 - CF
Total t2-1
Note : SASS = Sum of All other Sum of Squares ( SASS = RSS + CSS + TrSS )
16
CSS = Column Sum of Squares
RSS = Row Sum of Squares
RE
CMS (t 1) ErMS
k
(t 1)(t 2) 1(t 1) 2 3
t ( ErMS ) (t 1)(t 2) 3(t 1) 2 1
To compare with CRD
1. To determine the effect of four different sources of seed inoculums, A, B, C, and D, and a
control, E, on the dry matter yield of irrigated alfalfa in pounds per plot. The plots were furrow
irrigated and there was a line of trees that might form a shading gradient.
A B D C E
33.8 33.7 30.4 32.7 24.4
D E B A C
37.0 28.8 33.5 34.6 33.4
C D A E B
35.8 35.6 36.9 26.7 35.1
E A C B D
33.2 37.1 37.4 38.1 34.1
B C E D A
34.8 39.1 32.7 37.4 36.4
Table 1.0 Dry Matter Yield of irrigated alfalfa from LSD Experiment
17
Row Row Total
Yield, lb/plot
Number
(R)
A B D C E
1 155
33.8 33.7 30.4 32.7 24.4
D E B A C
2 167.3
37 28.8 33.5 34.6 33.4
C D A E B
3 170.1
35.8 35.6 36.9 26.7 35.1
E A C B D
4 179.9
33.2 37.1 37.4 38.1 34.1
B C E D A
5 180.4
34.8 39.1 32.7 37.4 36.4
Column
174.6 174.3 170.9 169.5 163.4
Total(C)
Grand Total
(G) 852.7
Steps:
1. Divide the experimental into replication(r).
2. Divide each replication into k incomplete blocks, each containing k
experimental plots.
3. Select from Appendix L a Basic Balanced Lattice plan corresponding to the
number of treatments to be tested.
4. Randomize the replication arrangement of the selected plan.
5. Randomize the incomplete blocks within each replication.
6. Randomize the treatment arrangement within each incomplete block.
7. Apply the final outcome of the randomization process of step 6 to the field lay
out.
19
2.7.1.c Analysis of Variance
Steps:
1. Calculate the block totals (B) and replication(R ).
2. Calculate the treatment totals (T) and the grand total (G).
3. Calculate Bt for each treatments.
4. Calculate W = kT – (k+1)Bt + G
5. Construct the Outline ANOVA
G2
6. CF 2
(k )(k 1)
7. Compute for the Sum of Squares and Mean Squares as indicated in the
ANOVA
Note:
Computations for Mean Squares are equal to its corresponding Sum of Squares divided by its
degree of freedom except for Treatment Adjusted MS and Effective Error MS.
Block (adj) 2 W 2
k -1 i 1
(k 3 )(k 1)
ToSS – RSS –
Intrablock Error 2
(k – 1)(k -1) TrunadjSS –
BSS
1 2 G
2
TradjMS
T ' 2
2
Treatment(adj) k -1 -------------
(k 1)(k 1)
2
k EfERMS
2
Effective Error (k – 1)(k -1) -------------- IntraMS(1+kµ)
Total 2
k (k+ 1) – 1 x 2 CF
20
MODULE – III. Two – Factor Experiment
2. Data Analysis
21
- Construct tables of means and deviation
- Complete an ANOVA table
- Perform significance tests
- Compute appropriate means and standard errors
- Interpret the analysis and report the results
22
3.5.1a Numerical Example
Table 3.0 Grain Yield of Three Rice Varieties Tested with Five Fertilizer in a RCB Design.
Grain Yield, Treatment
Kg/ha Total
(T)
Fertilizer Rep I Rep II Rep III Rep IV
V1
F1 3.85 2.61 3.14 2.89 12.50
F2 4.79 4.94 4.56 4.61 18.89
F3 4.58 4.45 4.88 3.92 17.84
F4 6.03 5.28 5.91 5.65 22.87
F5 5.84 5.92 5.98 5.52 23.29
V2
F1 2.85 3.79 4.11 3.44 14.19
F2 4.96 5.13 4.15 4.99 19.22
F3 5.93 5.70 5.81 4.31 21.74
F4 5.66 5.36 6.46 5.47 22.96
F5 5.46 5.55 5.79 5.93 22.72
V3
F1 4.19 3.75 3.74 3.43 15.11
F2 5.25 4.58 4.90 4.29 19.01
F3 5.82 4.85 5.68 4.93 21.28
F4 5.89 5.52 6.04 4.76 22.21
F5 5.86 6.26 6.06 5.36 23.55
Rep Total(R) 76.99 73.69 77.20 69.51
Grand Total ( G) 297.39
Table 3.1 The Variety X Fertilizer Table of Totals from Data in Table 3.3
Yield Total
Fertilizer Total (
(AB)
B)
Fertilizer V1 V2 V3
F1 12.50 14.19 15.11 41.80
F2 18.89 19.22 19.01 57.13
F3 17.84 21.74 21.28 60.86
F4 22.87 22.96 22.21 68.04
F5 23.29 22.72 23.55 69.56
Variety Total (A) 95.39 100.83 101.16 297.39
23
Table 3.2 Analysis of Variance of Data in Table 3.3 from a 3 x 5 Factorial Experiment in
RCBD
Source of Degree of Sum of Tabular F
Mean Squares Computed F
Variation Freedom Squares 5% 1%
Replication 3 2.60 0.87 5.74** 2.83 4.29
Treatment 14 44.49 3.18 21.09** 1.94 2.54
Variety(A) (2) 0.96 0.48 3.48* 3.22 5.15
Fertilizer(B) (4) 41.25 10.31 68.26** 2.59 3.80
AXB (8) 2.28 0.29 1.89ns 2.17 2.96
Error 42 6.10 0.15
Total 59 53.19
b** = highly significant, * = significant, ns =nonsignificant
cv = 7.8%
3.6.3 Guidelines for Assignment of a particular factor to either the main plot or subplot:
1. Degree of precision
2. Relative size of the main effects
3. Management Practices
Disadvantages
1. Main plot factor is estimated with less precision so larger differences are required for
significance – may be difficult to obtain adequate df for the main plot factor
2. Statistical analysis is more complex because different standard errors are required for
different comparisons
24
Advantages
1. Permits the efficient use of some factors that require different sizes of plot for their
application
2. Permits the introduction of new treatments into an experiment that is already in
progress
Note : SASS = Sum of All other Sum of Squares ( SASS = RSS + ASS + Er(a)SS + BSS + ABSS )
25
Table A. Grain Yield Data of Four Rice Varieties Grown with 4 levels of Nitrogen in Split –
Plot Design with Three(3) replications
Table A.1 The Replication X Nitrogen Table of Yield Totals Computed from data in Table A
26
Table A.2 The Nitrogen X Variety Table of Yield Totals Computed from data in Table A
Table A.3 Analysis of Variance of data in Table A from 4X4 Factorial Experiment in Split Plot
Design
Source of Degree of Sum of Tabular F
Mean Squares Computed F
Variation Freedom Squares 5% 1%
Replication 2 1.15 0.575
Nitrogen (A) 3 25.00 8.33 59.5** 4.76 9.78
Error(a) 6 0.82 0.14
Variety (B) 3 15.71 5.24 15.41** 3.01 4.72
AXB 9 13.24 1.47 4.32** 2.30 3.25
Error(b) 24 8.05 0.34
Total 47 63.96
b** = highly significant
cv(a) = 7.09%
cv(b) = 10.96%
Advantages
- Permits efficient application of factors that would be difficult to apply to small plots
Disadvantages
- Differential precision in the estimation of interaction and the main effects
27
- Complicated statistical analysis
3.7.3 Randomization
- Randomize the vertical factors and horizontal factors separately and independently.
Note : SASS = Sum of All other Sum of Squares ( SASS = RSS + ASS + EraSS + BSS + ErbSS +
ABSS )
28
3.7.5 Numerical Examples
Table 1.0 Data on Grain Yield of 3 Varieties of Rice grown with Four Nitrogen Rates in a Strip
– Plot Design with Three Replications
Table 1.1 The Replication X Variety Table of Yield Totals Computed from Data in Table 1.0
Table 1.2 The Replication X Nitrogen Table of Yield Totals Computed from Data in Table
1.0
29
Table 1.3 The Variety X Nitrogen Table of Yield Totals Computed from Data in Table 1.0
Yield Total (AB)
Variety
N1 N2 N3 N4
V1 10.3 15.4 22.7 15.0
V2 15.7 20.6 18.5 14.0
V3 16.6 21.2 12.3 16.8
Table 1.4 Analysis of Variance of Data in Table 1.0 from 3 x 4 Factorial Experiment in a Strip –
Plot Design.
Tabular F
Source of Degree of Sum of Mean
Computed F 5% 1%
Variation Freedom Squares Squares
Replication 2 7.55 3.78
Variety (A) 2 1.25 0.63 1.14ns 6.94 18
Error(a) 4 2.20 0.55
Nitrogen (B) 3 15.14 5.04 4.67ns 4.76 9.78
Error(b) 6 6.53 1.08
AXB 6 32.87 5.47 3.12* 3.00 4.82
Error(c) 12 21.02 1.75
Total 35 86.56
b* = significant, ns = nonsignificant
cv(a) = 7.09%
cv(b) = 10.96%
30
MODULE – IV. COMPARISON BETWEEN TREATMENT
MEANS
4.1 PAIR COMPARISON
4.1.2 Two Types :
Steps :
dij = Xi – Xj
LSDδ = tδ ( Sd ˉ )
Where :
Sdˉ = standard error
Tδ = tabular value at appendix C( n = Error df )
31
4.3 STANDARD ERROR (Sdˉ)
A. For CBD
- Only one error term is involved CRD, RCBD and LSD
- Standard error of the mean difference for any pair of treatment means
2S 2
1. Sdˉ = for Equal Replication
r
1 1
2. Sdˉ = S2 for Unequal Replication
r r
i j
2b 1Eb Ea / rb
Sdˉ =
Where:
Ea = Error(a) MS
Eb = Error(b) MS
r = no. or replication
a = no. of mainplots
b = no. of sub-plot
32
C. For Strip – Plot Design
Table B. Comparison between Mean Yields of Treatment D and each of the four Treatments
A,B,C and E using LSD Test Data from a CRD experiments.
Dij(Treatment Difference), b
Treatment Mean Yield, a kg/ha
kg/ha
A 5.2 2.4ns
B 7.8 5.0**
C 4.0 1.2ns
D 2.8 ---
E 6.6 3.8*
a =
Average of five Replication
b**
= significant at 1% level, * = significant at 5% level, ns = not significant
Sdˉ = 1.52, LSD0.05 = 2.086(1.52) = 3.17 kg/ha, LSD0.01 = 2.845(1.52) = 4.32 kg/h
33
34
4.3.2 LSD Test for Split – Plot Design
Table A. 4 Comparison between Mean Yields of Two Variety Means with same Nitrogen Rate
2 Eb
1. Sdˉ = = 0.48
r
2. tδ at n = Eb = 24
t0.01 =2.797
t0.05 =2.064
35
4.4 DUNCAN’S MULTIPLE RANGE TEST
- Experiments that require the evaluation of all possible pairs of treatment means.
A.1 Compare dTrRp to the remaining Treatment Mean all less than dTrRp are
declared significantly different from the treatment mean being compared.
A.2. Compute the difference between the treatment group
dTi.Tj = Ti - Tj
A.3 Compare the dTi.Tj to the Rp value at p = m, if dTi.Tj is less than the
Rp value at p = m,declare not significantly different
B. Continue the process in A until all the treatments are being compared.
5. Present the test results in one of the following ways:
36
DMRT for C.B.D ( Complete Block Design )
Rank
Treatment Mean Yield, Kg / ha
T2 : Dolmix ( 2 kg ) 2,678 1
T3 : DDT + δ – BHC 2,552 2
T4 /: AZODRIN 2,128 3
T1 : Dol – mix ( 1 kg ) 2,127 4
T5 : Dimecron – Boom 1,796 5
T6 : Dimecron ÷ Knap 1,681 6
T7 Control 1,316 7
Control
P rp ( 0.05 )
2 2.94
3 3.09
4 3.18
5 3.24
6 3.30
7 3.33
The ( t – 1 ) Rp Values :
p Rp = ( r p b )( Sdˉ )
√
37
Table 5.2 DMRT for Comparing all possible Pairs of Treatments Means, from a CRD
Experiment Involving Seven Treatments, Using the Line notation ( Data Table 2.1
Treatment
Mean yield, kg/ haa DMRTb
T2 2,678
T3 2,552
T4 2,128
T1 2,127
T5 1,796
T6 1,681
T7 1,316
a = Average of four replication
b = any two means connected by one same vertical line are not significantly different at 5% level
of significance
Table 5.2 DMRT for Comparing all possible Pairs of Treatments Means, from a CRD
Experiment Involving Seven Treatments, Using the Alphabet notation ( Data Table 2.1
Treatment
Mean yield, kg/ haa DMRTb
T1 2,127 bc
T2 2,678 a
T3 2,552 ab
T4 2,128 bc
T5 1,796 c
T6 1,681 cd
T7 1,316 d
a = Average of four Replication
b = any two means having a common letters are not significantly different at the 5%
38
4.5 Tukey HSD Post – hoc test
It is needed after an ANOVA is completed and it should be significant in order to
determine which groups differ from each other.
Where :
Numerical Example:
Valencia oranges were tested for juiciness at 4 different orchards. Eight oranges were
sampled from each orchard, and the total ml of juice per 20 gms of orange was
calculated given below:
39
Source of Degree of Sum of Mean Tabular F
Computed F
Variation Freedom Squares Squares 5% 1%
Treatment
Experimental
Error
Total
40
41
MODULE – V. REGRESSION AND CORRELATION ANALYSIS
5.1 Three Groups of variables that are normally recorded in crop experiments:
3. Responses – Biological and physical features of the experimental units that expected to be
affected by the treatments being tested.
Y X
Where :
intercept of Y axis
= linear regression coefficient or slope of the line
42
5.4.3 Step by step procedures for solving Regression Analysis:
1. Compute for the means, sum of squares and cross – product of x and y.
2. Compute the estimates of the regression parameters and
Note: x and y – deviation from means
a = y – bx
xy
b=
x2
xy 2
x 2
x2
s y2. x
n2
b. Compute tb
b
tb
s y2. x
x 2
Note: Linear Regression is only useful if there is evidence of a linear relationship between x and
y. If the correlation is very low, then the line will fit horribly and will be useless.
43
5.4 SIMPLE LINEAR CORRELATION ANALYSIS
The linear correlation coefficient (denoted r) measures the strength of the linear
relationship between two variables x and y in a sample. It is sometimes called a Pearson
Correlation Coefficient, after a famous statistician named Karl Pearson. The formula for this
statistic is as follows:
∑ ∑ ∑
√[ ∑ ∑ ][ ∑ ∑ ]
Where:
x = the observed data for the independent variable
y = the observed data for the dependent variable
n = is the number of observation or size of the sample
r = the degree of the relationship between x and y
Note: If absolute value of computed r is greater than tabular r values from Appendix H with df =
n – 2 , then, r is significant at a level of significance.
Due to the standardization that takes place in the formula, there are a couple of interesting
properties of r
1. If the values of either variable are converted to a different scale, r will be the same.
2. If the variables x and y are interchanged, r will be the same.
3. The correlation coefficient r will only measure the strength of a linear relationship. It says
nothing about other kinds of relationships.
The sign of the correlation coefficient tells whether the line is sloping upward (positive) or
downward (negative). When r is close to 0, there seems to be no linear pattern at all. Here is a
general guide for how to interpret the value of r :
44
45
6.0 INTRODUCTION TO SPSS
SPSS (Statistical Package for Social Stduies) is a windows based program that can be
used to perform data entry and analysis and to create tables and graphs. It is capable of handling
large amounts of data and can perform all of the analyses covered in the text and much more.
Layout of SPSS
The data editor window has two views that can be selected from the lower left hand side
of the screen. Data view is where you see the data you are using. Variable view is where you can
specify the format of your data when you are creating a file or where you can check the format of
a pre-existing file. The data in the Data Editor is saved in a file with the extension. Sav.
File
Includes all the options typically use in other programs, such as open, save, exit and etc.
Edit
Includes the typical cut, copy and paste commands and allows you to specify various options
for displaying data and output.
View
It allows you to select which tool bars you want to show.
Transform
It includes several options to change the current variables.
Analyze
Includes all the commands to carry out statistical analyses and to calculate descriptive
statistics.
Graphs
Includes commands to create various types of graphs including box, histograms and etc.
Utilities
It allows you to list file information which is a list of all variables, there are labels, values ,
locations in the data file and type.
Adds – on
Programs that can be added to the base SPSS package.
46
Help
It includes link to the SPSS homepage, statistics coach, and syntax guide. Using topics, you
can use the index option to type in any key word and get a list of options.
47
Laboratory Exercises
Part I. Randomize and lay out a CRD experiment with 4 replication and 5 treatments
(T1, T2, T3, T4, and T5) using random numbers in calculator.
48
Part II. Randomize and lay out a RCBD experiment with 6 replication and 4 treatments
using random numbers in calculator.
49
Part III. Provide what is required for the following:
Table 1.0 Germination percentage of abaca seeds under different plant residues with carabao manure as
based compost from CRD experiment with 3 ( r ) replications and 6 ( t ) treatments.
Treatment Replication
T1(ordinary garden soil) 76.67 81.67 85.00
T2(Ordinary Soil+Carabao Manure) 73.33 75.00 95.00
T3(Ordinary Garden Soil T Carabao Manure+Abaca Leaves) 68.33 93.33 85.00
T4(Ordinary Garden Soil+Carabao Manure+Abaca Corms) 86.67 80.00 68.33
T5(Ordinary Garden Soil+Carabao Manure+Abaca Bracts) 73.33 85.00 61.67
T6(Ordinary Garden Soil + Carabao Manure+Coconut Husk) 88.33 60.00 86.67
Required:
1. Filled Anova
2. Make Decision
50
51
Table 3.0 Data on the average weight in grams of filled grains with different treatments
from RCBD Experiment with four replication.
Required:
1. Filled Anova
2. Make Decision
52
53
Table 4.0 Yield of three promising maize hybrids ( A,B, and D ) and a check
variety ( C )from an experiment with LSD.
Yield, Kg/plot
Row Number
Col. 1 Col. 2 Col. 3 Col. 4
1
165 ( B ) 131 ( D ) 152 ( C ) 144 ( A )
2
148 ( D ) 128 ( A ) 176( B ) 139 ( C )
3
168 ( C ) 100 ( B ) 175 ( A ) 128 ( D )
4
157 ( A ) 139 ( C ) 150 ( D ) 76 ( B )
Required:
1. Filled Anova
2. Make Decision
54
55
56
Table 5.0 Data on Different Sizes (Fine, Medium and Coarse)
of Materials in grams as Affected by the Five different Number
of Passes in a Split - Plot Design with three Replication
Size
Weight in Grams
Rep I Rep II Rep III
T1(One pass)
Fine 35 40 32
Medium 55 70 55
Coarse 198 180 200
T3(Three passes)
Fine 86 93 79
Medium 135 130 135
Coarse 60 60 68
T5(Five Passes)
Fine 97 109 101
Medium 135 127 127
Coarse 40 38 48
Control
Fine 9 12 18
Medium 12 18 18
Coarse 270 267 261
Required:
1. Filled Anova
2. Make Decision
57
58
59
60
Table 6.0. Grain Yield Data of four Rice variety Grown with 4 levels of
Nitrogen in a Strip - Plot Design with Four Replication
Variety
Grain Yiled , t/ha
Rep I Rep II Rep III Rep IV
N0(0 kg/ha)
V1 16 16 15 15
V2 15 16 14 16
V3 14 14 14 14
V4 15 15 15 15
N1(60 kg/ha)
V1 16 15 15 15
V2 15 16 14 15
V3 15 15 15 14
V4 16 15 15 16
N2(90 kg/ha)
V1 17 16 16 15
V2 15 16 15 16
V3 14 13 14 14
V4 15 15 15 15
N3(120 kg/ha)
V1 18 17 16 16
V2 15 16 14 16
V3 15 14 15 14
V4 15 15 14 14
Required:
1. Filled Anova
2. Make Decision
61
62
63
Table 7.0. Grain Yield Data of four Rice variety Grown with 4 levels of
Nitrogen in a RCB Design with Four Replication
Required:
1. Filled Anova
2. Make Decision
64
65
66
Table 8.0 Plant heigt (cm) Data Resulting from the use Different treatment
( t,1 t,2 t,3 t,4 and t 5 ( control ) CRD experiment with 5 replications.
Required:
1. LSD test
2. Present the final result and interpret
67
10. From the given below compare the following treatment means using DMRT and interpret the
result. ANOVA Result from RCBD is highly significant and with four (4) replications and error mean
square is 97,774.
Table 9.0 Analysis of Variance on the height gained of plants at 60 to 90 days.
Required:
1.LSD test
2 Present the final result and interpret.
68
Treatment Total
T1 28,508
T2 20,712
T3 20,208
T4 18,512
T5 27,184
T6 16,724
T7 15,264
Note:
Present the Final DMRT Result in Alphabet and Line Notation.
69
70
71
Table 11.0 Mean Yield Of 10 Treatments and the Control Treatment with Unequal Replication
Required :
1. LSD test
72
73
74
Table 13.0 Grain Yield and Nitrogen Rate using the Data in Rice Experiments
Required:
75
76
Part IV. SPSS
1. Look up ANOVA in Help/help topics. What kind of information did you find?
2. Look up “compare groups for significant differences” in Help/Statistics Coach. What did you
learn?
77
78
79
80
81
82
83
References:
Hoshmand
5. http://www.stat.psu.edu/online/courses/stat.503/01_intro_intro_html
6. http://www.books.google.com.ph/books?id
84