0% found this document useful (0 votes)

62 views160 pages

Describing The Data Using Numerical Measures

The document discusses numerical measures used to describe data, including measures of central tendency (mean, median, mode) and dispersion (range, variance, standard deviation). It explains how to calculate each measure and what each represents. For example, the mean is the average value and can be pulled upward by extreme values, while the median is not affected by outliers. The document also covers concepts like skewed vs. symmetric distributions and how the mean and median differ in skewed data. The overall aim is to describe how these numerical measures can be combined with graphs to fully analyze and understand quantitative data behavior.

Uploaded by

Jolina Pardillo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

62 views160 pages

Describing The Data Using Numerical Measures

Uploaded by

Jolina Pardillo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 160

Describing the Data

using Numerical Measures

IS 106 - FUNDAMENTALS OF COMPUTING
Objectives

1. Compute the mean, median, mode, and weighted mean for a set of data and
understand what these values represent.
2. Compute the range, interquartile range, variance, standard deviation and know
what these values means.
3. Compute a z score and the coefficient of variation and understand how they
are applied in decision making situations.
4. Understand and apply some basic statistical and mathematical formulas in
analyzing numeric data behavior.

IS 106 – QUANTITATIVE METHODS

Introduction

o Graphs and charts ---

 provide effective tools for transforming data into information – this is just
the starting point!
 do not reveal all the information contained in a dataset.

o To complete our descriptive toolkit ---

 quantify the center of data and its spread – with numerical measures

IS 106 – QUANTITATIVE METHODS

Why learn?

o Suppose you are an ad manager

 Your task is to create an ad claiming that your company’s tire last longer

 You could:

• sample your tires and from your competition

• and graph each data in a histogram

IS 106 – QUANTITATIVE METHODS

Why learn?

o You should:
 Compute the summary mileage measures for the various tire brands

o To effectively describe data:

 combine graphical tools with numerical measures

IS 106 – QUANTITATIVE METHODS

Measures of Center and
Location

IS 106 – QUANTITATIVE METHODS

Measures of Center and Location

o Histograms are an effective way of converting quantitative data to

information
 It can show us the center of data and degree of spread

o Numerical measures help fully describe a quantitative data

 It can give us numerical measure of center and spread

IS 106 – QUANTITATIVE METHODS

Recall !

o“ What do we call numerical measures derived from a sample? ”

IS 106 – QUANTITATIVE METHODS

Parameters and Statistics

o Parameter
 A measure computed from the entire population

 As long as the population does not change, the value of the

parameter will not change

IS 106 – QUANTITATIVE METHODS

Parameters and Statistics

o Statistics
 A measure computed from a sample that has been selected from a
population
 The value of the statistic will depend on which sample is selected

IS 106 – QUANTITATIVE METHODS

Mean

o A numerical measure of the center of a set of quantitative measures

 computed by dividing the sum of the values by the number of values in
the data.

 Population Mean, Sample Mean

IS 106 – QUANTITATIVE METHODS

Population Mean

o The average for all values in the population computed by dividing

the sum of all values by the population size.

IS 106 – QUANTITATIVE METHODS

Population Mean

IS 106 – QUANTITATIVE METHODS

How to?

1. Collect the data for the variable of interest for all items in the
population. The data must be quantitative.

2. Sum all the values in the population.

3. Divide the sum by the number of values in the population.

IS 106 – QUANTITATIVE METHODS

Population Mean - Example

FOSTER CITY HOTEL. The manager of a small hotel in Foster

City, California, was asked by the corporate vice president to analyze
the Sunday night registration information for the past eight weeks.
Specifically, the goal is to identify the average number of rooms
rented on Sunday.

IS 106 – QUANTITATIVE METHODS

Population Mean - Example

IS 106 – QUANTITATIVE METHODS

Population Mean - Example

1. Collect data for the quantitative variable of interest.

IS 106 – QUANTITATIVE METHODS

Population Mean - Example

2. Add the data values.

= 121
IS 106 – QUANTITATIVE METHODS
Population Mean - Example

3. Divide the sum by the number of values in the population

Parameter Measure

Therefore, the average number of rooms

rented on Sundays for the past eight weeks is
15.125.

IS 106 – QUANTITATIVE METHODS

Sample Mean

o The average for all values in the sample computed by dividing the
sum of all sample values by the sample size.

IS 106 – QUANTITATIVE METHODS

How to?

1. Collect the sample data

2. Add the values in the sample.

3. Divide the sum by the sample size.

IS 106 – QUANTITATIVE METHODS

Sample Mean - Example

oTask: Compute the mean starting salary.

IS 106 – QUANTITATIVE METHODS

Sample Mean - Example

1. Collect the sample data

IS 106 – QUANTITATIVE METHODS

Sample Mean - Example

2. Add the values in the sample

IS 106 – QUANTITATIVE METHODS

Sample Mean - Example

3. Divide the sum by the sample size.

o Therefor, the average starting salary for the sample of seven

managers place is $170,571.43

IS 106 – QUANTITATIVE METHODS

The Impact of Extreme Values on the Mean

o Recall: The mean is the balance point for the data.

o But! It has a potential disadvantage

o The Disadvantage: It can be affected by extreme values.

 An extreme value on the high end can pull the mean upward from the center.

IS 106 – QUANTITATIVE METHODS

Impact of Extreme Values - Example

o What if our previous example ( Management Salaries) had been slightly

changed.

o We will change the salary from $316,000 to $1,000,000.

IS 106 – QUANTITATIVE METHODS

Impact of Extreme Values - Example

1. Collect the sample data

Extreme Value

IS 106 – QUANTITATIVE METHODS

Impact of Extreme Values - Example

2. Add the values.

IS 106 – QUANTITATIVE METHODS

Impact of Extreme Values - Example

3. Divide the sum buy the number of values in the sample.

IS 106 – QUANTITATIVE METHODS

Impact of Extreme Values - Example

Conclusion:
o With only one value in the sample changed, the mean is now substantially higher than before.

o Because the mean is affected by extreme values, it may be a misleading measure of data’s center.

IS 106 – QUANTITATIVE METHODS

Median

o Another measure of center of data

o A center value that divides a data array into two halves

 denotes the population median
 denotes the sample median

Data that have been arranged in numerical order.

IS 106 – QUANTITATIVE METHODS

Median

IS 106 – QUANTITATIVE METHODS

How to?

1. Collect the sample data

2. Sort the data from smallest to largest, forming a data array.

3. Calculate the median index.

4. Find the median.

IS 106 – QUANTITATIVE METHODS

Median - Example

IS 106 – QUANTITATIVE METHODS

Median - Example

1. Collect the sample data

IS 106 – QUANTITATIVE METHODS

Median - Example

2. Sort the data from smallest to largest.

IS 106 – QUANTITATIVE METHODS

Median - Example

3. Calculate the median index.

1 1
𝑖= 𝑛
2
𝑖= (7)
2 𝑖=3.5
𝑖= 4

IS 106 – QUANTITATIVE METHODS

Median - Example

4. Find the median.

IS 106 – QUANTITATIVE METHODS

Median – Example with an Extreme Value

IS 106 – QUANTITATIVE METHODS

Skewed and Symmetric Distributions

o Data in a population or sample can be either symmetric or skewed

 depending on the distribution of data around the center.

IS 106 – QUANTITATIVE METHODS

Skewed and Symmetric Distributions

o Symmetric Data
 Datasets whose values are evenly spread around the center.

 Mean and Median are equal.

IS 106 – QUANTITATIVE METHODS

Skewed and Symmetric Distributions
o Skewed Data
 Datasets that are not symmetric

 The mean will be larger or smaller than the median.

 Right Skewed

• The mean is larger than the median

 Left Skewed

• The mean is smaller than the median

IS 106 – QUANTITATIVE METHODS

Skewed and Symmetric Distributions

IS 106 – QUANTITATIVE METHODS

Skewed and Symmetric Distributions

oManagement Salaries.

Mean
> Median

Right Skewed
IS 106 – QUANTITATIVE METHODS
Mode

o Another measure of central location

o A value in a data set that occurs most frequently

o A data set may have more than one mode if multiple values tie for
the most frequently occurring values.

o If no value occurs more frequently than any other, there is no mode.

IS 106 – QUANTITATIVE METHODS

How to?

1. Collect the sample data.

2. Organize the data into a frequency distribution.

3. Determine the value(s) that occur(occurs) most frequently.

IS 106 – QUANTITATIVE METHODS

Mode - Example

IS 106 – QUANTITATIVE METHODS

Mode - Example

1. Collect the sample data.

IS 106 – QUANTITATIVE METHODS

Mode - Example

2. Organize the data into a frequency distribution

IS 106 – QUANTITATIVE METHODS

Mode - Example

3. Determine the value(s) that occur(occurs) most frequently.

IS 106 – QUANTITATIVE METHODS

Mode – Common Mistake

o Stating the mode as being the frequency of the most frequently

occurring value.

o In our example,
 The modes are 2 and 4, the occurred 6 times.

IS 106 – QUANTITATIVE METHODS

Other Measures of
Center and Location
WEIGHTED MEAN

PERCENTILES

Q U A RT I L E S

IS 106 – QUANTITATIVE METHODS

Weighted Mean

o The mean of data values that have been weighted according to their
relative importance.

IS 106 – QUANTITATIVE METHODS

Weighted Mean

o Consider this:

o The above formula is what we have used to compute for the mean of a sample.

o In this case, each x value is given equal weight in the computation

o However, there will be times, when some other values are weighted more than
the other.

IS 106 – QUANTITATIVE METHODS

Weighted Mean

IS 106 – QUANTITATIVE METHODS

How to?

1. Collect the desired data and determine the weight to be assigned to each data
value.

2. Multiply each weight by the data value and sum these.

3. Sum the weights for all values.

4. Compute the weighted mean.

IS 106 – QUANTITATIVE METHODS

Weighted Mean - Example

oOne of the most common usage of weighted mean measure is in computing your
General Point Average (GPA).

IS 106 – QUANTITATIVE METHODS

Weighted Mean - Example

1. Collect the desired data and determine the weight to be assigned to each data
value.

1.00 1.00 1.00 1.00 1.00 1.00 1.25

3 3 3 3 3 3 3
IS 106 – QUANTITATIVE METHODS
Weighted Mean - Example

2. Multiply each weight by the data value and sum these.

= (1.00 * 3) + (1.00 * 3) + (1.00 * 3) + (1.00 * 3) + (1.00 * 3) + (1.00 *

3) + (1.25 * 3 )

= 3.0 + 3.0 + 3.0 + 3.0 + 3.0 + 3.0 + 3.75

= 21.75

IS 106 – QUANTITATIVE METHODS

Weighted Mean - Example

3. Sum the weights for all values.

= 3.0 + 3.0 + 3.0 + 3.0 + 3.0 + 3.0 + 3.00

= 21

IS 106 – QUANTITATIVE METHODS

Weighted Mean - Example

4. Computed the weighted mean.

𝛴 𝑤 𝑖 𝑥𝑖 21.75
𝜇𝑤 = = = 1.0357=1.04
𝛴 𝑤𝑖 21

Your GPA is 1.04.

IS 106 – QUANTITATIVE METHODS

Percentiles

o Used to describe the location of the data in terms other than center of data.

o Definition:
 the pth percentile in a data array is a value that divides the data set into two parts.

 the lower segment contains at least p% and the upper segment contains at least 100
– p%

IS 106 – QUANTITATIVE METHODS

Percentiles

o Suppose:
 You are enrolling in a university and took an entrance exam. You then received the result
saying that your score is at the 90th percentile. What does it mean?

• It means that you scored as high or higher than 90% of the other students.

 What if you scored at 50th percentile?

IS 106 – QUANTITATIVE METHODS

Percentile Location Index

o Allows us to point the exact value associated with the percentile.

IS 106 – QUANTITATIVE METHODS

Percentiles – How to?

1. Sort the data in order from the lowest to the highest value.

2. Determine the percentile location index.

3. Locate the percentile value.

a) If i is not an integer, round the value up to the next highest integer. The pth
percentile is located at the rounded index position.

b) If i is an integer, the pth percentile is the average of the values at location index
positions i and i + 1

IS 106 – QUANTITATIVE METHODS

Percentiles – Example?

IS 106 – QUANTITATIVE METHODS

Percentiles – Example

1. Sort the data in order from the lowest to the highest value.

IS 106 – QUANTITATIVE METHODS

Percentiles – Example

2. Determine the percentile location index.

𝑃 80
ⅈ= ( 𝑛 )= ( 30 ) =24
100 100

IS 106 – QUANTITATIVE METHODS

Percentiles – Example

3. Locate the percentile value.

o Because i = 24 is an integer value, the 80th percentile is found by averaging the values in

the 24th and 25th positions.

o Therefor, the distance on the 80th percentile that will be subject to surcharge is :

20.5 + 21 = 20.75

IS 106 – QUANTITATIVE METHODS

Quartiles

o Quartiles in a data array are those values that divide the data set into four equal-
sized groups.

o The median corresponds to the second quartile.

o Can be approximated manually using the same method as for percentiles.

IS 106 – QUANTITATIVE METHODS

Quartiles

o First quartile (25th percentile)

 The value at or below in which there is at least 25% (a quarter) of the data

 and at or above which there is at least 75% of the data.

o Second Quartile (50th percentile)

 There is 50% at or below the data and 50% at or above the data

IS 106 – QUANTITATIVE METHODS

Quartiles

o Third Quartile (75th percentile)

 The value at or below which there is at least 75% of the data

 and at or above which there is at least 25% of the data

o Fourth Quartile
 The rest

IS 106 – QUANTITATIVE METHODS

Quartiles – Example?

o Let’s say we’re only interested in the 3rd quartile of this data.

IS 106 – QUANTITATIVE METHODS

Quartiles – Example

𝑃 75
ⅈ= ( 𝑛=
) ( 30 ) =22.5=23
100 100

o Therefor, the third quartile is the 23rd value from low end of the sorted data.

IS 106 – QUANTITATIVE METHODS

Box and Whisker Plots

o A descriptive tool
 that incorporates the median and the quartiles to graphically display data.

o Used to identify outliers that are usually small or large data values that lie
mostly by themselves.

IS 106 – QUANTITATIVE METHODS

Box and Whisker Plots

o
o Definition:

 A graph that is composed of two parts: a box and whiskers

• The box has a width that ranges from the first quartile () to the third quartile ()

• A vertical line through the box is placed at the median.

• Limits are then located at a value that is 1.5 times the difference between and below and above

• The whiskers extend to the left to the lowest value within the limits and to the right to the

highest value within the limits.

IS 106 – QUANTITATIVE METHODS

How to?

o
1. Sort the data values from low to high.

2. Calculate the 25th percentile (1st quartile), the 50th percentile(median), and 75th percentile(3rd
quartile).

3. Create a graph with the data values on the horizontal axis. Draw a box so the ends
correspond to and .

4. Draw a vertical line through the box at the median. Half the data values in the box will be on
either side of the median.

IS 106 – QUANTITATIVE METHODS

How to?

o
5. Use the interquartile range () to compute for upper and lower limits.
 Lower Limit =

 Upper Limit =

 Outliers = any values outside these limits are referred to as outliers.

6. Draw the whiskers using dashed lines from each end of the box to the lowest and highest
value within the limits.