Mathematics As A Tool (Descriptive Statistics) (Midterm Period) Overview: This Module Tackles Mathematics As Applied To Different Areas Such As Data
Mathematics As A Tool (Descriptive Statistics) (Midterm Period) Overview: This Module Tackles Mathematics As Applied To Different Areas Such As Data
Mathematics As A Tool (Descriptive Statistics) (Midterm Period) Overview: This Module Tackles Mathematics As Applied To Different Areas Such As Data
I. Overview: This module tackles mathematics as applied to different areas such as data
organization, analysis, or interpretation to make these data into a meaningful
information. Descriptive measures in statistics are the one in focus.
II. Learning Objectives: At the end of this module the students are expected to
3. Mode - The Mode of a list of numbers is the number occurs most frequently.
Find the mode of
a. 18, 15, 21, 16, 15, 14, 15, 21
b. 2, 5, 8, 9, 11, 4, 7, 23
c. 12, 24, 12, 71, 48, 93, 71
Weighted Mean – The value called the weighted mean is often used when some data values are
more important than the others.
The table shows Drillon’s fall semester course grades. Use the weighted mean
formula to find his GPA for the fall semester.
Course Course Course
Grade Unit
English B=3 4 12
History A=4 3 12
Chemistry D=1 3 3
Algebra C=2 4 8
Observed Frequency
Event Number of
Try thisof
Number homes with x
bedroom, x bedroom
2 5 10
3 25 75
4 10 40
5 5 25
MEASURES OF DISPERSION:
MEASURES THE SPREAD OF
DATA
1. Range- The range of a set of data values is the difference between the greatest data value
and the least data value.
2. Standard Deviation- A measure of dispersion that is less sensitive to extreme values is the
standard deviation. The standard deviation of a set of numerical data makes use of the
amount by which each individual data value deviates from the mean.
a.
b.
3. Variance a. b.
Procedure for computing Standard Deviation
1. Determine the mean of n numbers
2. For each number, calculate the deviation (difference) between
the number and the mean of the numbers.
3. Calculate the square of each deviation and find the sum of these
squared deviation.
4. If the data is a population, then divide the sum by n. If the data
is a sample, then divide the sum by n-1.
5. Find the square root of the quotient in step 4.
A student has the following quiz scores: 5, 8, 16, 17, 18, 20.
Find the standard deviation for this population of quiz scores.
X X-
2 -6 36
4 -4 16
77 -1
-1 11
12
12 44 16
16
15 7 49
15 7 49
118
118
A consumer group has tested a sample of 8 size-D batteries from each of 3 companies. The
results of the lists are shown in the following table. According to these tests, which
company produces batteries for which the values representing hours of constant use have
the smallest standard deviation?
EverSoBright 6.2, 6.4, 7.1, 5.9, 8.3, 5.3, 7.5, 9.3 1.328h
Dependable 6.8, 6.2, 7.2, 5.9, 7.0, 7.4, 7.3, 8.2 0.719h
Beacon 6.1, 6.6, 7.3, 5.7, 7.1, 7.6, 7.1, 8.5 0.877h
The batteries from Dependable company have the smallest standard deviation.
According to these results, the Dependable company produces the most
consistent batteries with regard to life expectancy under constant use.
MEASURES OF RELATIVE POSITION
1.
Z-Score – The z-score for a given data value x is the number of standard deviations that x
is above or below the mean of the data. The following formulas show how to calculate the z-
score for the value x in a population and in a sample.
Raul has taken two tests in his chemistry class. He scored 72 on the first test, for which the
mean of all scores was 65 and the standard deviation was 8. He received a 60 on a second
test, for which the mean of all scores was 45 and the standard deviation was 12. In
comparison to the other students, did Raul do better on the first test or the second test?
Raul scored 0.875 standard deviation above the mean on the first test and 1.25 standard
deviation above the mean on the second test. The z-scores indicate that, in comparison to
his classmates, Raul scored better on the second test than he did on the first test.
NORMAL CURVE
68.26%
95.44%
2. pth Percentile- A value x is called the pth percentile of a data set provides of the data values
are less than x.
In a recent year, the median annual salary for a physical therapist was $74,480. If the 90 th
percentile for the annual salary of a PT was $105,900, find the percent of physical therapists
whose annual salary was
a. More than $74,480. ans. 50% of the PT earned more than $74,480 per year
b. Less than $105,900. ans. 90%of all PT made less than $105,900.
c. Between $74,480 and $105,900. ans. 90%-50% = 40% of the PT earned $74,480 and
105,900.
Percentile for a Given Data Value
Given a set of data and a data value x,
Stems Leaves
4 3
5
6 1 5
7 2 3 4 5 5 8
8 4 5 6 6 6
9 6 8
Legend: 8/6 represents 86
b. The percent of data in all the classes with a boundary of 5s and an upper boundary of
20s is the sum of the percent. Thus the percent of subscribers who required at least 5s
but less than 20s to download the file is 15.2%. The probability that a subscriber
chosen at random will require at least 5s but less than 20s to download the file is 0.152.
Properties of a Normal Distribution
A survey of 1000 U.S. gas stations that the price charged for a gallon of regular gas could be
closely approximated by a normal distribution with a mean of $3.10 and a standard deviation
of $0.18. How many of the station charge
a. between $2.74 and $3.46 for a gallon of regular gas?
b. less than $3.28 for a gallon of regular gas?
c. more than $3.46 for a gallon of regular gas?
Solution
a. 950
b. 840
c. 25
The Standard Normal Distribution – The standard normal distribution is the normal distribution
In the standard normal distributions, the area of the distribution from z=a to z= b
represents
a. the percentage of z-values that lie in the interval from a to b.
b. the probability that z lies in the interval from a to b.
A soda machine dispenses soda into 12-ounce cups. Tests show that the actual amount of soda
dispensed is normally distributed, with a mean of 11.5 ounce and a standard deviation of 0.2 oz.
1. What percent of cups will receive less than 11.25 oz of soda?
2. What percent of cups will receive between 11.2 oz and 11.55 oz of soda?
3. If a cup is filled at random, what is the probability that the machine will overflow the
cup?
LINEAR REGRESSION AND
CORRELATION
Linear Regression
Research – wish to know whether two variables
are related. If the variables are determined to be
related, a scientist may then wish to find an equation
that can be used to model the relationship.
Data involving two variables are called bivariate
data.
For instance, a geologist might want to know whether there is a relationship
between the duration of an eruption of a geyser and the time between
eruptions. A first step in this determination is to collect some data. The table
below gives bivariate data showing the time between two eruptions and the
duration of the second eruption for 5 eruptions of the geyser
Duration of eruption 89 79 83 82 81
(in seconds), y
LINEAR REGRESSION AND
CORRELATION
The Least-Square Regression Line
The least-squares regression line for a
set of bivariate data is the line that
minimizes the sum of the squares of the
vertical deviations from each data point to
the line.
LINEAR REGRESSION AND
CORRELATION
The Least-Square Regression Line
Formula
ˆ ax b, where
y
nxy x y
a
n x x
2 2
and b y ax
Time between Duration of
eruptions (in eruption
seconds), x (in seconds),
y XY X2
points.
Extrapolation – The process of using an
equation to determine a
point to the right or left of
a given data points.
Linear Correlation Coefficient – To determine the strength of a