0% found this document useful (0 votes)
14 views17 pages

S2.Measures of Central Tendency and Variability, Data Visualization

Uploaded by

xrp7xrqc4z
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views17 pages

S2.Measures of Central Tendency and Variability, Data Visualization

Uploaded by

xrp7xrqc4z
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 17

S2: Descriptive Statistics: Measures of

Central Tendency, Variability, and Data


Visualization
By: F. Jabutay
Learning Objectives
• Understand and calculate measures of central tendency (mean,
median, mode).

• Understand and calculate measures of variability (range, variance,


standard deviation).

• Utilize data visualization techniques to interpret and present


statistical data.
Introduction to Descriptive Statistics
• Definition: Descriptive statistics summarize and organize data,
helping us understand datasets at a glance.

• Purpose: To provide insights into the data's typical values and spread
without making conclusions about a larger population.
Measures of Central Tendency
• Definition: These measures indicate the "center" or typical value of a
dataset.
• Key Types:
• Mean: The average value.

• Median: The middle value when data is ordered.

• Mode: The most frequently occurring value.


Mean

• Advantages: Easy to calculate; includes all data points.

• Disadvantages: Sensitive to outliers.

• Example Calculation: [Add a sample dataset for students to see the


mean calculation step-by-step]
Median
• Definition: Middle value in an ordered dataset.

• Steps to Calculate:
• Order the data values.
• Find the middle number (or the average of two middle numbers if the
dataset has an even number of values).

• Advantage: Less affected by outliers.

• Example Calculation: [Add a sample dataset and find the median]


Mode
• Definition: Most frequently occurring value in a dataset.

• Uses: Useful for categorical data to find the most common category.

• Limitations: May not exist or may have multiple values in the case of
bimodal or multimodal data.

• Example Calculation: [Add a dataset with a clear mode]


Comparing Central Tendency Measures
• Summary Table:
• Mean: Useful for quantitative data but sensitive to outliers.
• Median: Provides a robust middle value, especially in skewed distributions.
• Mode: Useful for categorical data and identifying the most common values.

• Choosing the Right Measure: Based on the data type and


distribution.
Measures of Variability (Dispersion)
• Definition: These measures indicate how spread out the data values
are.

• Importance: Helps to understand the range and consistency of the


data.
Range

•Description: The simplest measure of variability, showing the


difference between the highest and lowest values.

•Limitations: Only considers the two extreme values, ignoring


others.
Variance and Standard Deviation
• Variance: Measures the average squared deviation from the mean.

• Standard Deviation: Square root of variance; represents average


distance from the mean.

• Example Calculation: [Include dataset and step-by-step variance and


standard deviation calculation]
Interpreting Standard Deviation
• Low Standard Deviation: Data points are close to the mean (low
variability).

• High Standard Deviation: Data points are spread out (high variability).

• Real-World Example: Comparing product ratings of two products


with the same mean but different standard deviations.
Data Visualization Techniques
• Importance: Visualizing data makes it easier to interpret and
communicate insights.

• Types of Visualizations:
• Bar Charts: For comparing categories.
• Histograms: For showing frequency distribution of numerical data.
• Box Plots: For visualizing data spread and identifying outliers.
• Pie Charts: For representing proportions in categorical data.
Example: Bar Chart
• Description: Ideal for categorical data to compare quantities across
categories.

• Components: Title, axes labels, categories.

• Example Chart:
Example: Histogram
Description: Useful for showing the distribution of continuous data.

Components: Bins, frequency axis.

Example Chart:
Example: Box Plot
Description: Visualizes the minimum, first quartile, median, third
quartile, and maximum.

Uses: Identifying skewness, spread, and outliers.

Example Chart:
Summary
Recap

• Measures of Central Tendency: Mean, Median, Mode.

• Measures of Variability: Range, Variance, Standard Deviation.

• Data Visualization: Importance and types (bar chart, histogram, box


plot).Key Takeaway: These statistical tools and visualizations are
foundational for analyzing and interpreting data in business contexts.

You might also like