Lecture 8-Association Between Variables
Lecture 8-Association Between Variables
BETWEEN VARIABLES
Lecture 8-Class Discussion Notes
A&F and IFB Year 1
Kelebogile Kenalemang
Business Mathematics
Botswana Accountancy College
OUTLINE
• Introduction
• Correlation analysis
• Scatter plots
• Correlation Coefficient
• Coefficient of determination, r2
• Regression analysis
OBJECTIVES
• To be able to draw and interpret scatter diagrams
• To be able to calculate the correlation coefficient and coefficient of
determination.
• To understand and be able to use the least squares method to
estimate the regression line
Introduction
• This lecture introduces two techniques;
• Correlation; to measure the association between two variables, and
regression to obtain the relationship between the two variables.
• For example you might suspect that cost of production is dependent
on the quantity produced , or that sales of a product are related to
price.
Correlation Analysis
• The technique of correlation measures the strength of the association
between two variables.
• Correlation analysis simply explores the strength of association
between two variables, two variables are said to be associated if a
change in the value of one variable is accompanied by a change in the
value of another variable.
• For example we may want to determine the strength of association
between the cost of producing an item and its price, the relationship
between advertising and sales revenue, number of deliveries and
time taken to deliver.
Determining and Measuring Correlation
1. Scatter Diagrams
2. Pearson’s product moment correlation coefficient
Scatter plots/Scatter diagrams
• A scatter plot is simply a way of representing a set of bivariate data to
determine correlation between the two variables.
• The notion of dependence arises here, we should always plot on the ,
the variable which is likely to be dependent on, i.e. influenced by or
caused by, the other variable, the independent variable (cause) – this
‘other variable’ is plotted on the .
• The resulting pattern of the plotted points can give us a lot of
information about the association between the variables and the
suitability of the data set before we spend any time on calculations.
Degrees of Association/Correlation
• When we examine the scatter plot, we should consider several issues
before reaching a conclusion about the association between the
variables.
• The first issue is obviously whether there is evidence of a pattern in
the plotted.
1. Perfectly correlated (+,-), all the pairs of values lie on a straight line
to show an exact linear relationship between the two variables.
2. Partly/ Partially correlated
3. Uncorrelated
Pearson’s Product Moment Correlation Coefficient
• You can think of as the slope of the regression line and as the value of the
intercept on the
𝒏 ∑ 𝒙𝒚 − ∑ 𝒙 ∑
example
𝒚
𝒃=
𝒏∑ 𝒙 −( ∑ 𝒙 )
𝟐 𝟐