Capslet: Practical Research Ii
Capslet: Practical Research Ii
G12
PRACTICAL RESEARCH II
QUARTER 2: WEEK 7
Lesson No. 6
CapSLET
Capsulized Self-Learning
Empowerment Toolkit
Schools Division Office of Zamboanga City
Region IX, Zamboanga Peninsula
Zamboanga City
CapSLET
Practical research ii
Subject &
PR II -12 QUARTER 2 WEEK 7 DAY
Grade Level dd/mm/yyyy
CONTENT STATISTICAL TECHNIQUES FOR BIVARIATE ANALYSIS
MOST ESSENTIAL Uses statistical techniques to analyse data – study of
LEARNING
COMPETENCY differences and relationships limited for bivariate analysis
(CS_RS12-lld-g-3)
IMPORTANT: Do not write on this material. Write your answers on the Learner’s Activity
and Assessment Sheets provided separately.
When a Researcher conducts a study that looks at a single variable, that study
involves univariate data. For example, one might study a group of college students to
find out their average SAT scores or one might study a group of diabetic patients to
find their weights. Bivariate data is when you are studying two variables. For
example, if one is studying a group of college students to find out their average SAT
score and their age, you have two pieces of the puzzle to find (SAT score and age).
Or if you want to find out the weights and heights of college students, then one also
has bivariate data. Bivariate data could also be two sets of items that are dependent
on each other. For example:
Bivariate data has many practical uses in real life. For example, it is pretty
useful to be able to predict when a natural event might occur. One tool in the
statistician’s toolbox is bivariate data analysis. Sometimes, something as simple as
plotting on variable against another on a Cartesian plane can give you a clear picture
of what the data is trying to tell you.
Bivariate analysis is not the same as two sample data analysis. With two
sample data analysis (like a two sample z test in Excel), the X and Y are not directly
related. You can also have a different number of data values in each sample; with
bivariate analysis, there is a Y value for each X. Let’s say you had a caloric intake of
3,000 calories per day and a weight of 300lbs. You would write that with the x-
variable followed by the y-variable: (3000,300).
Y-Values
90
80
70
60
50
40
30
20
10
0
0 50 100 150 200 250 300 350 400 450
These give you a visual idea of the pattern that your variables follow.
A simple scatterplot.
2. Regression Analysis
Regression analysis is a catch all term for a wide variety of tools that you can
use to determine how your data points might be related. In the image above, the
points look like they could follow an exponential curve (as opposed to a straight line).
Regression analysis can give you the equation for that curve or line. It can also give
you the correlation coefficient.
3. Correlation Coefficients
This means that for each unit in our sample, two quantitative variables will be
determined. The purpose of collecting two quantitative variables is to determine if
there is a relationship between them.
There are several reasons why it is appropriate to collect and analyze bivariate
data. One such reason is that the dependent or response variable is of greater
interest, but the independent or explanatory variable is easier to measure. Therefore,
if there is a strong relationship between the explanatory and response variable, that
relationship can be used to calculate the response variable using data from the
explanatory variable. For example, a physician would really like to know the degree
to which a patient’s coronary arteries are blocked, but blood pressure is easier data
to obtain. Therefore, since there is a strong relationship between blood pressure and
the degree to which arteries are blocked, then blood pressure can be used as a
predictive tool.
Another reason for collecting and analyzing bivariate data is to establish norms
for a population. As an example, infants are both weighed and measured at birth and
there should be a correlation between their weight and length (height?). A baby that
is substantially underweight compared to babies of the same length would raise
concerns for the doctor.
This is shown in the two graphs below. The graph on the left is a time series
graph of the actual GDP for the US. The graph on the right is a scatter plot that uses
the GDP for the US as the x variable and the GDP for the US one year later (lag 1)
for the y value. The fact that these points are in such a straight line indicates that the
data are not independent. Consequently, this data should not be used in the type of
the analyses that will be discussed in this chapter.
Life can be more difficult for those in poverty and certainly the influence they
can have in the country is far more limited than those who are affluent. Since people
in poverty must channel their energies into survival, they have less time and energy
to put towards things that would benefit humanity as a whole. Therefore, it is in the
interest of all people to find a way to reduce poverty and thereby increase the number
of people who can help the world improve.
There are a lot of possible variables that could contribute to poverty. A partial
list is shown below. Not all of these are quantitative variables, and some can be
difficult to measure, but they can still have an impact on poverty levels
1. Education
2. Parent’s income level
3. Community’s income level
4. Job availability
5. Mental Health
6. Knowledge
7. Motivation and determination
8. Physically disabilities or illness
9. Wealth gap
10. Race/ethnicity/immigration status/gender
11. Percent of population that is employed
correlation is represented with the Greek letter ρρ, while the sample correlation
coefficient is represented with the letter rr. A correlation of 0 indicates no correlation,
whereas a correlation of 1 or -1 indicates a perfect correlation. The question is
whether the underlying population has a significant linear relationship. The evidence
for this comes from the sample. The hypotheses that are typically tested are:
H0:ρ=0H0:ρ=0
H1:ρ≠0H1:ρ≠0
The logic behind this hypothesis test is based on the assumption the null
hypothesis is true which means there is no correlation in the population. An example
is shown in the scatter plot on the left. From this distribution, the probability of getting
the sample data (shown in solid circles in the graph at the right), or more extreme
data (forming a straighter line), is calculated.
t=rn−2−−−−−√1−r2−−−−−√.(7.1)(7.1)t=rn−21−r2.
This can be demonstrated with the example of Gini coefficients and poverty
rates as provided in Chapter 4 and using a level of significance of 0.05. The
correlation is -0.650. The sample size is 7, so there are 5 degrees of freedom. After
substituting into the test statistic,
t=−0.6507−2−−−−√1−(−0.650)2−−−−−−−−−−−√t=−0.6507−21−(−0.650)2
the value of the test statistic is -1.91. Based on the t-table with 5 degrees of freedom,
the two-sided p-value is greater than 0.10 (actual 0.1140). Consequently, there is not
a significant correlation between Gini coefficient and poverty rates.
“Unido, Junto avanza con el EduKalidad Cree, junto junto puede!”
Prepared by: Lesley Ann F. Atilano-Tang, J.D. / Senior High School Teacher - II
Ma. Clara L. Lobregat National High School
7
Another explanatory variable that can be investigated for its correlation with
poverty rates is the employment-population ratio (percent). This is the percent of the
population that is employed at least one hour in the month
SAQ1: How do you use statistical techniques to analyze data?
SAQ2: What are the differences and relationship in terms of bivariate analysis?
Let’s Practice!
Directions: Data are given below. Present these by drawing tables using bivariate
analysis through numerical or categorical pattern. Give the analysis
based on data presented.
x-variable y-variable
(Academic Year) (No. of Student-Enrollees)
2017 4,500
2018 5,100
2019 5,600
2020 7,800
Table 1
No. of Student-Enrollees for MCLL National High School from
Academic Year 2017-2020 using Scatterplot Analysis
(Numerical to Numerical Pattern)
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
y-variable
x-variable
(No. of Student-Enrollees who opted
(Grade Level)
Online Education)
Grade 7 1,800
Grade 8 900
Grade 9 1,200
Grade 10 1,100
Grade 11 700
Grade 12 850
Table 2
No. of Student-Enrollees for Don Pablo Lorenzo Memorial High School – Senior
High Stand Alone who have chosen Online Education using Scatterplot
Analysis (Categorical to Categorical Pattern)
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
Guide Questions:
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
REMEMBER
Key Points
CHAPTER 4
DATA PRESENTATION, DISCUSSION AND INTERPRETATION
This Chapter presents the results of data collection, collation and discussion
of implication and interpretations of the results.
I. Collate the data, tally the results, and get the research questions.
II. After data collection, apply the statistical treatment.
III. Create a Table
IV. Write Analysis
V. Discuss the Results
VI. Interpret the Results
A simple scatterplot.
2. Regression Analysis
Regression analysis is a catch all term for a wide variety of tools that you can
use to determine how your data points might be related. In the image above, the
points look like they could follow an exponential curve (as opposed to a straight line).
Regression analysis can give you the equation for that curve or line. It can also give
you the correlation coefficient.
3. Correlation Coefficients
TRY
Let’s see how much you have learned today!
DIRECTIONS: Refer to your own research. Construct your own Table. Label it with
the x and y-variables. Afterwards, state your bivariate analysis.
Table
_____________________________________________________________
Analysis: ___________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________
Parent/Guardian’s Signature
Above printed Name
REF WEBSITE:
ERE
Stephanie Glen. "Bivariate Analysis Definition & Example" From StatisticsHowTo.com:
NCE
S Elementary Statistics for the rest of us! https://www.statisticshowto.com/bivariate-analysis/
https://www.statisticshowto.com/bivariate-analysis#:~:text=Bivariate%20analysis%20means
%20the%20analysis,the%20variables%20X%20and%20Y.&text=Bivariate%20analysis%20is
%20the%20analysis%20of%20exactly%20two%20variables.
https://stats.libretexts.org/Bookshelves/Introductory_Statistics/Book
%3A_Foundations_in_Statistical_Reasoning_(Kaslik)/07%3A_Analysis_of_Bivariate_Quantitati
ve_Data
This learning resource contains copyright materials. The use of which has not been specifically
authorized by the copyright owner. We are making this learning resource in our efforts to provide
DISCL printed and e-copy learning resources available for the learners in reference to the learning continuity
AIME
R
plan for this division in this time of pandemic. This LR is produced and distributed locally without
profit and will be used for educational purposes only. No malicious infringement is intended by the
writer. Credits and respect to the original creator/owner of the materials found in this learning
resource.
CapSLET
Practical research ii
“Unido, Junto avanza con el EduKalidad Cree, junto junto puede!”
Prepared by: Lesley Ann F. Atilano-Tang, J.D. / Senior High School Teacher - II
Ma. Clara L. Lobregat National High School
13
Subject &
PR II -12 QUARTER 2 WEEK 7 DAY
Grade Level dd/mm/yyyy
CONTENT STATISTICAL TECHNIQUES FOR BIVARIATE ANALYSIS
MOST ESSENTIAL Uses statistical techniques to analyse data – study of
LEARNING
COMPETENCY differences and relationships limited for bivariate analysis
(CS_RS12-lld-g-3)
ANSWER KEY
Let’s Practice!