Stat Chapter 6
Stat Chapter 6
Unit Objectives
After completing this unit, you will be able to:
• Describe the meaning of regression and correlation
• Demonstrate the procedures for computing descriptive
measures of the strength of linear relationship between
two variables.
• Explain how to find a ‘best fitting’ line relating two
variables.
• Outline the computation of rank correlation which is a
measure of association between two rankings
• Demonstrate the procedure of test statistics for
analyzing analytical data
6.1 Definition of key terms
1 15 15
2 35 30
3 42 30
4 60 50
5 72 48
6 128 100
7 98 93
8 35 33
9 15 14
10 50 50
Calculate the coefficient of correlation and interpret.
Cont…
Solution:
Table 2: Calculation of the necessary summary statistics
10(34770) (550)(463)
= 0.973
[10(41936) (550) ][10(29263) (463) ]
2 2
a y bx
Cont…
Example:- Table 5 shows the number of items produced
(X) and the cost (Y) incurred in producing them (in Birr) at a
certain factory.
a y bx
Cont…
Therefore, the equation of the least squares line is:
ŷ a bx ŷ = 10.86 + 1.21x
•The y-intercept is: a = 10.86. This value tells us that,
even if no item is produced, there will be a fixed cost
of 10.86 Birr (such as insurance cost, maintenance
cost, etc.). The slope is: b = 1.21. This figure
indicates that for a unit increase (decrease) in the
number of items produced, the cost increases
(decreases) by 1.21 Birr.
6.5 Rank correlation
Rank correlation is used to measure the strength of the
linear association between two ranked variables, denoted
6 d 2
by rs and given by rs 1
n(n 1)
2