3.3.1 Data Summarization
3.3.1 Data Summarization
Numerical Measures
Measures found by using all the data values in the population are called
parameters. Measures obtained by using the data values from samples are
called statistics; hence, the average of the sales from a sample of
representatives is a statistic, and the average of sales obtained from the
entire population is a parameter.
A statistic is a characteristic or measure obtained by using the data
values from a
sample.
A parameter is a characteristic or measure obtained by using all the
data values
from a specific population.
General Rounding Rule In statistics the basic rounding rule is that when
computations are done in the calculation, rounding should not be done
until the final answer is calculated.
Measures of Central Tendency
Mean
Definition: The mean is the arithmetic average of a set of values. It is calculated
by summing all the values and dividing by the number of values.
Usage: The mean is useful for datasets without extreme outliers, as it provides a
good overall estimate of the data. It's widely used in various fields, including
economics (e.g., average income), education (e.g., average test scores), and
research.
Mean
Police Incidents
The number of calls that a local police department responded to for a
sample of 9 months is shown. Find the mean.
Solution:
σ 𝑥 475 + 447 + 440, +761 + 993 + 1052 + 783, +671 + 621
𝑋ത = =
𝑛 9
6243
= ≈ 693.7
9
Hence, the mean number of incidents per month to which the police
responded is 693. 7
Median
Definition: The median is the middle value of a dataset when the values are
arranged in ascending or descending order. If there is an even number of
observations, the median is the average of the two middle numbers.
Usage: The median is particularly useful for skewed distributions or datasets with
outliers, as it is less affected by extreme values. For example, in income data, a
few very high incomes can skew the mean, while the median provides a better
indication of the "typical" income.
Median
The median is the halfway point in a data set. Before you can find
this point, the data must be arranged in ascending or increasing
order. When the data set is ordered, it is called a data array.
Finding the Median
Step 1 Arrange the data values in ascending order.
Step 2 Determine the number of values in the data set.
Step 3 a. If n is odd, select the middle data value as the median.
=𝒙
𝒙 𝒏+𝟏
𝟐
b. If n is even, find the mean of the two middle values. That is, add
them and divide the sum by 2.
𝒙 𝒏 +𝒙 𝒏
𝟐 𝟐 +𝟏
=
𝒙
𝟐
Example:
Police Officers Killed
The number of police officers killed in the line of duty over the last 11 years is
shown. Find the median.
177, 153, 122, 141, 189, 155, 162, 165, 149, 157, 240