0% found this document useful (0 votes)
5 views18 pages

Analysis of Quantitative Data

This document provides an overview of quantitative data analysis, focusing on central tendency measures (mean, median, mode) and dispersion measures (range, standard deviation). It explains different levels of measurement, including nominal, ordinal, and interval/ratio data, and discusses how to calculate and interpret these statistical measures. Additionally, it covers graphical representations of data, such as bar charts and histograms, to illustrate summary data and frequencies.

Uploaded by

nuttyjobx
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views18 pages

Analysis of Quantitative Data

This document provides an overview of quantitative data analysis, focusing on central tendency measures (mean, median, mode) and dispersion measures (range, standard deviation). It explains different levels of measurement, including nominal, ordinal, and interval/ratio data, and discusses how to calculate and interpret these statistical measures. Additionally, it covers graphical representations of data, such as bar charts and histograms, to illustrate summary data and frequencies.

Uploaded by

nuttyjobx
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 18

Analysis of

Quantitative data
What will we learn?
• In this section we will learn about Central
tendency measures and Dispersion
measures.

• Central tendency measures: mean, median,


mode

• Dispersion measures: range, standard deviation


Levels of measurement
1. Nominal data: the data that can be categorized. It has no numerical
value. It is descriptive data.
For example, gender(M/F), YES/NO, LIKE/DISLIKE, hair
color(BRWON/BLACK/OTHER) are all nominal variables.

2. Ordinal data : it is a kind of categorical data in a particular set order.


It has no numerical value. It is used for measuring non-numerical
concepts like happiness, depression level, satisfaction.
For example: completely dissatisfied, slightly dissatisfied, neutral,
slightly satisfied, completely satisfied

3. Interval/ratio : it is the numerical variables like temperature,


depression score, height, time
Central tendency
measures
• Central tendency is a descriptive statistic that calculates the
average or the most typical value in a dataset.

• It is also called the central value.

• The average score can be calculated in different ways:


mean, mode, median.
Mean
• The mean of a dataset is calculated by adding up all the values in a
dataset and dividing the sum by the total number of values present in
the dataset.
• The mean is the most sensitive and most powerful measure of central
tendency because it involves all the scores in the dataset.
• The mean can be affected by extreme values.

• Mean= sum of all data values/ number of scores present.

• X = mean
• n= number of scores
Calculate the mean of the following
dataset
3, 6, 8, 11, 15, 2, 18, 19, 15, 13, 15, 4, 4, 15, 7

• Mean= 155/15
Mean = 10.33
Median
• The median is a measure of central tendency which calculated
the middle value of a dataset when the values are placed in
order (ascending or descending order).
• It is a simple calculation of the average score.

• However, it is less sensitive than the mean and is not useful on


datasets that have a small number of values as it may not
represent the typical score.

• Median is not affected by extreme values.


Calculate the median

3, 6, 8, 11, 15, 2, 18, 19, 15, 13, 15, 4, 4, 15, 7

• Ascending order: 2, 3, 4, 4, 6, 7, 8, 11, 13, 15, 15, 15, 15, 18, 19

• Middle value: 2, 3, 4, 4, 6, 7, 8, 11, 13, 15, 15, 15, 15, 18, 19

• Median: 11

3, 5, 6, 7, 8, 9

• Middle value: 3, 5, 6, 7, 8, 9

• Median: mean of middle values= 6+7/2= 6.5


Mode
• The mode is a measure of central tendency which calculates the
most frequent score in a dataset.

• It is the value that occurs the greatest number of times in a


dataset.

• If there are two most frequent scores, it is referred to as bi-


modal and both should be reported.

• If more than 2 modes are seen, the mode becomes a


meaningless measure of central tendency. It is not a useful
measure of central tendency on datasets with frequently
occurring same values.

• Mode is not affected by extreme values.


Calculate the mode for the following dataset

3, 6, 8, 11, 15, 2, 18, 19, 15, 13, 15, 4, 4, 15, 7

Mode= 15

2, 2, 3, 3, 4, 5, 5, 5, 7, 7, 7, 7, 7

Mode= 7
Easy exercise!
1. Is mean used on interval/ nominal/ ordinal data?
INTERVAL

2. Is mode used on interval/nominal/ordinal data?


NOMINAL

3. Is median used on interval/nominal/ordinal data?


ORDINAL
Dispersion measures

• Dispersion is a descriptive statistic that calculates the


spread of scores in a dataset.

• Also known as calculating the variation between the scores.

• They include the standard deviation, range, IQR, variance


Range
• Range is simply the difference between the highest value and the
lowest value in a dataset.

• Range= highest value- lowest value

• The range is affected by extreme values so it may not be a useful


descriptive statistic especially if there are outliers in a data set.

• If the data has extreme values(outliers) it is better to calculate


interquartile range (IQR).

• IQR= upper quartile (Q3) – lower quartile (Q1)


Standard deviation
• A good measure to understand the spread of scores is to calculate
the standard deviation.

• A standard deviation (σ) is a measure of how dispersed the data is in


relation to the mean.

• A low standard deviation indicates that the values tend to be close to


the mean of the set, while a high standard deviation indicates that
the values are spread out over a wider range from the mean.
Graphical representation of
data
• Graphs can be very useful to represent summary data or
frequencies.

• Bar charts and histograms are also useful ways of illustrating raw
data.

• Any bar chart or histogram will have an x-axis and a y-axis.

• X-axis represents the independent variable

• Y-axis always represents the dependent variable/ numerical


variable.
Bar graph
• A bar chart has bars to compare data among categories.
• Taller bars represent higher values and shorter bars represent smaller
values.
• X axis is a categorical variable, y-axis a numerical(continuous data).
Histograms
• A histogram is used to present the distribution of scores by illustrating the
frequency of values in the dataset.

• In a bar chart the bars are separated by a small space, but in histograms the bars
are joined together without any spaces.

• In a histogram the x-axis represents a numerical variable (continuous data) and


the y-axis represents the frequency of values.

You might also like