
Q2-Lesson 5

The document outlines the process of data analysis in research, emphasizing the organization and summarization of collected data to answer research questions. It details three main data analysis strategies: exploratory, descriptive, and inferential, along with various scales of measurement (nominal, ordinal, interval, and ratio). Additionally, it discusses measures of central tendency and dispersion, including mean, median, mode, range, average deviation, and standard deviation.

Uploaded by

Vannie Mondero

LESSON 5: PLANNING DATA ANALYSIS USING STATISTICS
TEACHER: VPM
INTRODUCTION

When the necessary data have already been collected, the next step is to organize the raw data for data analysis.
DATA ANALYSIS

Data analysis in research is a process in which gathered information is summarized in such a manner that it yields answers to the research questions.
DATA ANALYSIS STRATEGIES
01 EXPLORATORY DATA ANALYSIS
02 DESCRIPTIVE DATA ANALYSIS
03 INFERENTIAL DATA ANALYSIS

1. EXPLORATORY DATA ANALYSIS

This is used when it is not clear what to expect from the data. It makes use of numerical and visual representations such as graphs. Since the research interest is new, it is possible to find some inconsistencies.
2. DESCRIPTIVE DATA ANALYSIS

This is used to describe or summarize data in a meaningful way, leading to a simple interpretation of data.

The commonly used descriptive statistics are those that analyze the distribution of data, such as frequency, percentage, measures of central tendency (mean, median, mode), and measures of dispersion (range, average deviation, standard deviation).

3. INFERENTIAL DATA ANALYSIS

This is used to test hypotheses about a set of data to reach conclusions and generalizations beyond merely describing the data.

The commonly used inferential statistics include tests of significance of difference, such as the t-test and ANOVA, and tests of relationship, such as the Product-Moment Coefficient of Correlation (Pearson r), Spearman rho, regression, and the Chi-square test.
QUANTITATIVE ANALYSIS IN EVALUATION

• Nominal Scale
• Ordinal Scale
• Interval Scale
• Ratio Scale

NOMINAL SCALE

• used for labelling variables. A variable measured this way is sometimes called a categorical variable. The numbers assigned to the variables have no quantitative value. Some examples of variables measured on a nominal scale are gender, religious affiliation, and race or ethnic group.
NOMINAL SCALE

• Example: If you want to categorize respondents based on gender, you could use 1 for male and 2 for female.

ORDINAL SCALE
• assigns order to items based on the characteristic being measured. It involves the ranking of individuals, attitudes, and characteristics. Numerical scores such as first, second, third, and so on are assigned, but the numerical value has no meaning except its ability to establish ranking among a set of data.
ORDINAL SCALE

• Example: order in honor roll (first honor, second honor, third honor)

INTERVAL SCALE

• has equal units of measurement, thereby making it possible to interpret the order of the scale scores and the distance between them. However, an interval scale does not have a “true zero”. Values can be added and subtracted, but they cannot be meaningfully multiplied or divided.
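The missing "true zero" can be illustrated with a short sketch. The two Celsius readings below are hypothetical values chosen only for illustration; the point is that the difference between them is meaningful while their ratio is not, because the ratio changes once the same readings are expressed on a scale that does have a true zero.

```python
# Hypothetical Celsius readings, chosen only for illustration.
celsius_a, celsius_b = 10.0, 20.0

# Differences are meaningful on an interval scale.
difference = celsius_b - celsius_a

# Ratios are not: the naive ratio suggests "twice as hot", but the same
# readings on the Kelvin scale (which has a true zero) give another ratio.
naive_ratio = celsius_b / celsius_a
kelvin_a, kelvin_b = celsius_a + 273.15, celsius_b + 273.15
kelvin_ratio = kelvin_b / kelvin_a

print(difference, naive_ratio, round(kelvin_ratio, 3))  # 10.0 2.0 1.035
```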
INTERVAL SCALE

• EXAMPLE:

RATIO SCALE
• considered the highest level of measurement. It has the characteristics of an interval scale, but it also has a true zero point. Because of this property, all statistical operations can be performed on ratio scales. All descriptive and inferential statistics may be applied. Values can be added, subtracted, multiplied, and divided.
RATIO SCALE
• EXAMPLE:

Q&A SESSION
True or False. Write T if the statement is true and F if not.

___ 1. Ordinal scale is a measurement scale that ranks the variables.
___ 2. Data analysis plan refers to describing how the data will be cleaned, transformed, and analyzed.
___ 3. The weakest level of measurement scale is ratio scale.
___ 4. Percentage is categorized as descriptive statistic.
___ 5. Descriptive analysis includes t-test and ANOVA.
___ 6. Exploratory data analysis is used when it is not clear what to expect from the data.
___ 7. Inferential data analysis is used when there is testing of hypothesis.
___ 8. Interval scale is also called categorical variables.
___ 9. Inferential statistics includes the mean, median, and mode.
___ 10. The highest level of measurement scale is the ratio scale.

PART II.

Indicate which level of measurement is being used in the given scenario. Choose the letter of your choice below.
A. Nominal Scale    C. Interval Scale
B. Ordinal Scale    D. Ratio Scale

____ 1. Mr. Abalos lists the top 10 greatest movies of all time.
____ 2. The nurse records the gender of each patient.
____ 3. A veterinary clinic keeps a record of the weight of its patient dogs.
____ 4. The basketball organizer lists down the jersey numbers of all players.
____ 5. A PAGASA officer records the temperature in degrees Celsius for the month of September.
____ 6. A meteorologist lists the temperature in degrees Kelvin for the month of August.
____ 7. The teacher of a class of tenth graders records the letter grade for mathematics for each student.
____ 8. The teller keeps track of the amount in his client’s savings account.
____ 9. The researcher records the family income of his subject for two consecutive months.
____ 10. A car magazine lists the most expensive hotels in the United States.

DATA ANALYSIS STRATEGIES
01 EXPLORATORY DATA ANALYSIS
02 DESCRIPTIVE DATA ANALYSIS
03 INFERENTIAL DATA ANALYSIS

DESCRIPTIVE DATA ANALYSIS

1. Measures of Central Tendency
a. Mean
-Ungrouped Data
-Grouped Data
-Weighted Mean
b. Median
-Ungrouped Data
-Grouped Data
c. Mode

2. Measures of Dispersion
a. Range
b. Average (Mean) Deviation
c. Standard Deviation

MEASURES OF CENTRAL TENDENCY

MEAN
The mean is the average of a set of numbers. It is the most widely used and simplest measure of central tendency. It is frequently used for interval or ratio variables. The mean is calculated by dividing the sum of all observations by the number of observations.
FOR UNGROUPED DATA

1. Find the mean of the measurements 18, 26, 27, 29, 30.

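The ungrouped-data exercise above can be checked with a few lines of Python. This is only a sketch of the arithmetic described on the slide (sum of all observations divided by the number of observations); the variable names are illustrative.

```python
# Measurements taken from the exercise on the slide.
measurements = [18, 26, 27, 29, 30]

# Mean = sum of all observations / number of observations
total = sum(measurements)          # 130
count = len(measurements)          # 5
mean_value = total / count

print(mean_value)  # 26.0
```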
WEIGHTED MEAN

x̄ = Σfx / n

where:
f = frequency
x = numerical value or item in a set of data
n = number of observations in the data set

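A minimal sketch of a weighted mean follows, assuming the usual form in which each value x is multiplied by its frequency f and the total is divided by n, the number of observations. The scores and frequencies are hypothetical and are not the data from the slide's example.

```python
# Hypothetical data (not from the slide): each score x and how often it occurs (f).
scores = [70, 80, 90]
frequencies = [2, 5, 3]

# n = number of observations = sum of the frequencies
n = sum(frequencies)

# Weighted mean = sum of (f * x) / n
sum_fx = sum(f * x for f, x in zip(frequencies, scores))
weighted_mean = sum_fx / n

print(n, sum_fx, weighted_mean)  # 10 810 81.0
```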
WEIGHTED MEAN
Example 1: Find the mean of the heights of 50 senior high school students summarized as follows:

GROUPED DATA

When the observations are grouped into classes, the formula for grouped data is as follows:

GROUPED DATA
Example 2: Solve for the mean of the data below.

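One common way to estimate the mean of grouped data is to let the midpoint of each class stand in for every observation in that class. The sketch below assumes that approach; the class intervals and frequencies are hypothetical and are not the data from the slide's example.

```python
# Hypothetical frequency table (not from the slide): (lower limit, upper limit, frequency).
classes = [(10, 19, 3), (20, 29, 5), (30, 39, 2)]

# Assumption: each class is represented by its midpoint.
n = sum(f for _, _, f in classes)
sum_f_midpoint = sum(f * (lower + upper) / 2 for lower, upper, f in classes)
grouped_mean = sum_f_midpoint / n

print(n, sum_f_midpoint, grouped_mean)  # 10 235.0 23.5
```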
MEDIAN

The median is the middle value of a given set of measurements, provided that the values are arranged in increasing or decreasing order. It is also the most appropriate measure of central tendency for ordinal data.
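For ungrouped data, the median can be sketched as: arrange the values in order, take the middle one, and average the two middle values when the count is even. The helper name `median_ungrouped` is illustrative; the two data sets reuse numbers that appear elsewhere in this lesson.

```python
def median_ungrouped(values):
    """Return the middle value of the sorted data (average of two middles if even)."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

odd_case = median_ungrouped([29, 18, 30, 26, 27])
even_case = median_ungrouped([6, 10, 12, 15, 18, 18, 20, 23, 25, 28])

print(odd_case, even_case)  # 27 18.0
```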
MODE

The mode is the most appropriate measure of central tendency when data are on a nominal scale. It is also a quick approximation of the average, but it is the weakest measure of central tendency.

MODE

EXAMPLE 1
The ages of fifteen (15) persons assembled in a room are as follows: 16, 18, 18, 18, 25, 25, 25, 30, 34, 36 and 38.

MODE

EXAMPLE 2
The number of hours spent by 10 students in an internet cafe was as follows: 2, 2, 2, 3, 3, 4, 4, 4, 5, 5
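Both mode examples can be checked with the standard library's `statistics.multimode`, which returns every value tied for the highest frequency. The sketch uses only the values actually listed on the slides; note that Example 1 lists eleven ages, so the result applies to those eleven values.

```python
from statistics import multimode

# Values as listed in Example 1 (eleven ages appear on the slide).
ages = [16, 18, 18, 18, 25, 25, 25, 30, 34, 36, 38]

# Values as listed in Example 2.
hours = [2, 2, 2, 3, 3, 4, 4, 4, 5, 5]

# multimode returns all most-frequent values, in order of first appearance.
age_modes = multimode(ages)
hour_modes = multimode(hours)

print(age_modes, hour_modes)  # [18, 25] [2, 4]
```

Each list has two values tied for the highest frequency, so both data sets are bimodal.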
MEASURES OF DISPERSION

Dispersion in statistics is a way of describing how spread out a set of data is. It is important for describing the spread of the data, or its variation around a central value. It is also called a measure of variability.
1. RANGE
The range is the difference between the largest and the smallest values in a set of data.

EXAMPLE:
6, 10, 12, 15, 18, 18, 20, 23, 25, 28

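The range of the example data can be computed directly from the definition, as in this short sketch.

```python
# Values from the example on the slide.
values = [6, 10, 12, 15, 18, 18, 20, 23, 25, 28]

# Range = largest value - smallest value
data_range = max(values) - min(values)

print(data_range)  # 22
```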
2. AVERAGE (MEAN) DEVIATION
The mean absolute deviation of a dataset is the average distance between each data point and the mean. It gives us an idea about the variability in a dataset. Absolute values are used so that the sign (positive or negative) of each deviation is disregarded.
2. AVERAGE (MEAN) DEVIATION
The values are 6, 10, 12, 15, 18, 18, 20, 23, 25, 28

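A short sketch of the mean deviation for these ten values, following the definition given earlier (the average of the absolute distances from the mean). The variable names are illustrative.

```python
# Values from the slide.
values = [6, 10, 12, 15, 18, 18, 20, 23, 25, 28]

# Step 1: compute the mean.
mean_value = sum(values) / len(values)

# Step 2: take the absolute distance of each value from the mean.
absolute_deviations = [abs(x - mean_value) for x in values]

# Step 3: average those distances.
mean_deviation = sum(absolute_deviations) / len(values)

print(mean_value, mean_deviation)  # 17.5 5.4
```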
3. STANDARD DEVIATION
The standard deviation (SD) is a measure of spread or variation of data about the mean. The SD is computed as the square root of the average squared distance of the values from the mean. It is used to measure the confidence in statistical conclusions.
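The standard deviation can be sketched with the same ten values used for the range and mean deviation. The slides do not say whether the population form (divide by n) or the sample form (divide by n - 1) is intended, so this sketch shows both using Python's `statistics` module; which one applies is an assumption the reader must settle.

```python
from math import sqrt
from statistics import pstdev, stdev

# Values reused from the dispersion examples in this lesson.
values = [6, 10, 12, 15, 18, 18, 20, 23, 25, 28]

# Manual population SD: square root of the average squared deviation.
mean_value = sum(values) / len(values)
sum_squared_deviations = sum((x - mean_value) ** 2 for x in values)
manual_population_sd = sqrt(sum_squared_deviations / len(values))

# Library equivalents: pstdev divides by n, stdev divides by n - 1.
population_sd = pstdev(values)
sample_sd = stdev(values)

print(round(manual_population_sd, 3), round(population_sd, 3), round(sample_sd, 3))
```

For these values the population SD is roughly 6.55 and the sample SD is roughly 6.90.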
DESCRIPTIVE STATISTICS

MEASURES OF CENTRAL TENDENCY
Central tendency is sometimes called ‘measures of location’, ‘central location’, or just ‘center’. It is a way to describe what’s typical for a set of data. There are three major ways to show central tendency: mean, mode and median.

a. Mean: The mean is the average of a set of numbers. It is the most widely used and simplest measure of central tendency.
b. Median: The median is the middle value of a given set of measurements, provided that the values are arranged in increasing or decreasing order. It is also the most appropriate measure of central tendency for ordinal data.
For Ungrouped Data: The median may be calculated from ungrouped data by doing the following steps:
1. Arrange the items or scores from lowest to highest.
2. Count to the middle value. If there are two middle numbers, average the two.
c. Mode: It is the value that occurs most often in the data set. It is also a quick approximation of the average, but it is the weakest measure of central tendency.

MEASURES OF DISPERSION / VARIATION
Dispersion in statistics is a way of describing how spread out a set of data is. It is important for describing the spread of the data, or its variation around a central value. It is also called a measure of variability.

a. Range: The range is the difference between the largest and the smallest values in a set of data. It describes the overall spread of the data, but it does not tell us how the values are dispersed around the central tendency. A larger range suggests greater variation or dispersion. The range has the disadvantage of being influenced by extreme values called outliers. Another disadvantage is that it is based on only two values; all other values in the set are ignored.
b. Average (Mean) Deviation: The mean absolute deviation of a dataset is the average distance between each data point and the mean. It gives us an idea about the variability in a dataset. Absolute values are used so that the sign (positive or negative) of each deviation is disregarded.
c. Standard Deviation: The standard deviation (SD) is a measure of spread or variation of data about the mean. The SD is computed as the square root of the average squared distance of the values from the mean. It is used to measure the confidence in statistical conclusions.

DATA ANALYSIS STRATEGIES
01 EXPLORATORY DATA ANALYSIS
02 DESCRIPTIVE DATA ANALYSIS
03 INFERENTIAL DATA ANALYSIS

INFERENTIAL DATA ANALYSIS

Inferential statistics refers to statistical measures and techniques that allow us to use samples to make generalizations about the population from which the samples were drawn.
INFERENTIAL DATA ANALYSIS

• Refers to statistical measures and techniques that allow us to use samples to make inferences and generalizations about the population from which the samples were drawn.
• Inferential data analysis is used to determine if there is a relationship between an intervention and an outcome as well as the strength of that relationship.

Do the results really represent the whole population, OR did they occur by chance?
Types of Statistical Analysis of Variable (Scores, Temperature, Height, Weight)

POINTS TO CONSIDER
1. Test of Significance of Difference

T-test: A t-test is a type of inferential statistic used to determine if there is a significant difference between the means of two groups, which may be related in certain features (Kenton, 2019).

Analysis of Variance (ANOVA): This is used when the significance of difference of means of three or more groups is to be determined at one time.
• One-Way ANOVA. A one-way ANOVA has just one independent variable.
• Two-Way ANOVA. Also known as factorial ANOVA. It refers to an ANOVA using two independent variables.

1. Test of Significance of Difference

Analysis of Variance (ANOVA): This is used when the significance of difference of means of three or more groups is to be determined at one time.

• One-Way ANOVA. A one-way ANOVA has just one independent variable.
Example:
1. Differences in IQ can be assessed by country, and the country variable can have 2, 20, or more different categories to compare.

• Two-Way ANOVA. Also known as factorial ANOVA. It refers to an ANOVA using two independent variables.
Example:
1. Examining differences in IQ scores (the dependent variable) by Country (independent variable 1) and Gender (independent variable 2).
2. Females may have higher anxiety scores overall compared to males, but this difference could be greater (or less) in western countries compared to northern countries.

2. Test of Relationship

Spearman Rho: It is used when data available are expressed in ranks (ordinal variables).

Chi-Square Test for Independence: It is used when data are expressed in terms of frequencies or percentages (nominal variables).

Product-Moment Coefficient of Correlation or Pearson r: This is used when data are expressed in terms of scores such as weights and heights or scores in a test (ratio and interval variables).

T-TEST
A t-test is a type of inferential statistic used to determine if there is a significant difference between the means of two groups, which may be related in certain features.

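The slides do not give a formula for the t-test, so the following is only a sketch of one common variant: the independent two-sample t statistic with pooled variance, which assumes the two groups have roughly equal variances. The scores are hypothetical.

```python
from math import sqrt
from statistics import mean, variance

# Hypothetical test scores for two independent groups (not from the slides).
group_a = [78, 82, 85, 88, 90]
group_b = [70, 75, 80, 72, 78]

n_a, n_b = len(group_a), len(group_b)

# Pooled variance combines the two sample variances, weighted by degrees of freedom.
pooled_variance = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / (n_a + n_b - 2)

# t = difference of the means / standard error of that difference
standard_error = sqrt(pooled_variance * (1 / n_a + 1 / n_b))
t_statistic = (mean(group_a) - mean(group_b)) / standard_error
degrees_of_freedom = n_a + n_b - 2

print(round(t_statistic, 3), degrees_of_freedom)
```

To decide whether the difference is significant, the computed t would still have to be compared with a critical value from a t table at the chosen significance level and degrees of freedom.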
ANALYSIS OF VARIANCE (ANOVA)

ANOVA is used when the significance of difference of means of three or more groups is to be determined at one time.

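As a rough sketch of a one-way ANOVA, the F statistic compares the variation between group means with the variation inside the groups. The three groups below are hypothetical, and the calculation follows the standard between-groups and within-groups sums of squares rather than any formula shown on the slides.

```python
# Hypothetical scores for three groups (not from the slides).
groups = [[4, 5, 6], [6, 7, 8], [8, 9, 10]]

all_values = [x for g in groups for x in g]
grand_mean = sum(all_values) / len(all_values)

# Between-groups sum of squares: how far each group mean is from the grand mean.
ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)

# Within-groups sum of squares: how far each value is from its own group mean.
ss_within = sum((x - sum(g) / len(g)) ** 2 for g in groups for x in g)

df_between = len(groups) - 1
df_within = len(all_values) - len(groups)

# F = mean square between / mean square within
f_statistic = (ss_between / df_between) / (ss_within / df_within)

print(f_statistic)  # 12.0
```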
SPEARMAN RANK-ORDER CORRELATION OR SPEARMAN RHO

It is used when data available are expressed in ranks (ordinal variables).

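A minimal sketch of Spearman rho, assuming the familiar shortcut formula rho = 1 - 6Σd² / (n(n² - 1)), which holds when there are no tied ranks. The two sets of ranks are hypothetical.

```python
# Hypothetical ranks given to five items by two judges (no ties).
rank_x = [1, 2, 3, 4, 5]
rank_y = [2, 1, 4, 3, 5]

n = len(rank_x)

# d = difference between the paired ranks; sum the squared differences.
sum_d_squared = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))

# Shortcut formula, valid only when there are no tied ranks.
rho = 1 - (6 * sum_d_squared) / (n * (n ** 2 - 1))

print(sum_d_squared, rho)  # 4 0.8
```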
CHI-SQUARE TEST FOR INDEPENDENCE

It is used when data are expressed in terms of frequencies or percentages (nominal variables).
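The chi-square statistic for independence can be sketched from a table of observed frequencies: each expected count is (row total × column total) / grand total, and the statistic sums (observed - expected)² / expected over all cells. The 2×2 table below is hypothetical.

```python
# Hypothetical observed frequencies (rows: two groups, columns: two responses).
observed = [[30, 20],
            [20, 30]]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
grand_total = sum(row_totals)

chi_square = 0.0
for i, row in enumerate(observed):
    for j, count in enumerate(row):
        # Expected count if the two variables were independent.
        expected = row_totals[i] * col_totals[j] / grand_total
        chi_square += (count - expected) ** 2 / expected

degrees_of_freedom = (len(observed) - 1) * (len(observed[0]) - 1)

print(chi_square, degrees_of_freedom)  # 4.0 1
```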
PRODUCT-MOMENT COEFFICIENT OF CORRELATION OR PEARSON R

This is used when data are expressed in terms of scores such as weights and heights or scores in a test (ratio and interval variables).
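Pearson r can be sketched from its deviation form: the sum of the products of paired deviations divided by the square root of the product of the two sums of squared deviations. The paired scores below are hypothetical.

```python
from math import sqrt

# Hypothetical paired scores (not from the slides).
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

mean_x = sum(x) / len(x)
mean_y = sum(y) / len(y)

# Numerator: sum of products of paired deviations from the means.
numerator = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))

# Denominator: square root of the product of the two sums of squared deviations.
sum_sq_x = sum((a - mean_x) ** 2 for a in x)
sum_sq_y = sum((b - mean_y) ** 2 for b in y)
pearson_r = numerator / sqrt(sum_sq_x * sum_sq_y)

print(round(pearson_r, 4))  # 0.7746
```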
T-TEST TO TEST THE SIGNIFICANCE OF PEARSON R

This is used to determine if the value of the computed Pearson r is significant.

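The slide does not show the formula, but a commonly used one is t = r√(n - 2) / √(1 - r²) with n - 2 degrees of freedom. The sketch below assumes that formula; the helper name `t_for_pearson_r` and the input values are hypothetical.

```python
from math import sqrt

def t_for_pearson_r(r, n):
    """t statistic for testing whether a Pearson r from n pairs differs from zero."""
    return r * sqrt(n - 2) / sqrt(1 - r ** 2)

# Hypothetical inputs: r = 0.8 computed from 10 paired observations.
r, n = 0.8, 10
t_value = t_for_pearson_r(r, n)
degrees_of_freedom = n - 2

print(round(t_value, 3), degrees_of_freedom)  # 3.771 8
```

The computed t is then compared with a critical value from a t table at the chosen significance level.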
THANK YOU
