Topic 5 Statictics and Introduction
Topic 5 Statictics and Introduction
knowledge of foreign
language of algebra; it may prove
of use at any time under
Any circumstance”…
Bowley.
Why study statistics?
1. Data are everywhere
2. Statistical techniques are used to make
many decisions that affect our lives
3. No matter what your career, you will make
professional decisions that involve data. An
understanding of statistical methods will
help you make these decisions efectively
- Refers to a set of pertinent activities
such as collection, presentation,
analysis and interpretation of
quantitative data
- A field of study which deals with
mathematical characterization of a
group or groups of items.
Collection of Data
- Refers to the process of gathering
numerical information. Methods o
gathering pertinent information
include interview, questionnaire,
experiments, observation, and
documentary analysis.
Presentation of Data
- Once the data are gathered
presentation of data in appropriate
tables and graphs is next. Such tables
refer to frequency distributions
which may either be one-
dimensional or two-dimensional.
Graphical presentation includes bar
graphs, frequency polygon, pie graph
and many others.
Analysis of Data
- Refers to the activity of describing the
properties or behaviour of the data or
the possible correlation of different
quantities or variables. Such
description can be obtained after
summarizing the data into
measurements like the averages.
Interpretation
- Has to be made based on
the preliminary activities
and other statistical
methods. Such methods
involve testing the
significance of the results.
Characteristics
of Statistics
a) Aggregate of facts/data
b) Numerically expressed
c) Affected by different factors
d) Collected or estimated
e) Reasonable standard of accuracy
f) Predetermined purpose
g) Comparable
h) Systematic collection.
TYPES OF
STATISTICS
Descriptive Statistics
𝑁
𝑛= 2
1 + 𝑁𝑒
Where: N is the population size
e is the margin of error
n is the sample size
Example:
A researcher is conducting an
investigation regarding the factors
affecting the efficiency of the 185
faculty members of a certain college. If
he wanted to have a margin of error of
5%, then how many of the faculty
members should be taken as
respondents?
Types of
Variables
Quantitative Variables
Examples:
Income, gross sales, prices of homes,
numbers of cars owned, stock prices, and
accidents.
DISCRETE VARIABLES
A variable whose values are countable.
It can assume only certain values with
no intermediate values.
CONTINOUS VARIABLES
A variable that can assume any
numerical value over a certain interval
or intervals.
QUALITATIVE OR CATEGORICAL VARIABLES
x : 3, 2, 8, 6, 2, 0
y : 5, 0, 8, 3, 2, 5
Evaluate the following expressions
1. σ 𝑥
2. σ 𝑦
3. σ 𝑥 2
4. σ 𝑥 2
5. σ(𝑥𝑦)
6. σ 𝑥 σ 𝑦
Tips:
1. Summation of x is simply the sum of
all the values represented by the
variable x.
2. The value of summation of y is the
sum of all values of y.
3. The expression σ 𝑥 2 implies that the
values represented by the variable x
should be squared first before being
added.
σ 2
4. The expression 𝑥 implies that
the quantities represented by x should
be added first before the sum is
squared.
5. The expression σ(𝑥𝑦)implies that
each value of x should b first
multiplied to its corresponding value
of y. Then, the products shall be
added.
6. The expression σ 𝑥 σ 𝑦 refers to the product of
the summation of x and summation of y. Thus, to
get the product we should get first the sum of the
values represented by x and the sum of the values
represented by y.
4
3
(30 − 𝑘) (2𝑘 − 5)
𝑘=1
𝑘=1
7 5
(3𝑘) (4𝑘 + 2)
𝑘=1 𝑘=1
Generally, data collected from
different sources are usually
unorganized and in a form unsuitable
for immediate interpretation. In any
statistical investigation, once
pertinent data are already gathered,
the next step is to present such data
in organized form using appropriate
tables and graphs.
When the mass of data is so large, it
can hardly give information that can
be of help in making fast decisions.
Management of modern business
demands fast and accurate decisions.
Otherwise, they can be ruined by
competition from others who know
how to summarize large mass of data
and how to interpret them.
One such summary table is called
frequency distribution. The frequency
distribution is an arrangement of the
data which shows the frequency of
different values or groups of values
of a variable.
Ages (in years) of 100 Residents of Panhulan, Agoncillo,
Batangas
14 27 27 23 29 21 20 12 22 17
23 24 18 20 27 16 12 22 19 19
15 20 29 25 24 20 20 17 18 18
12 22 23 17 23 26 16 21 21 20
17 18 26 18 28 27 18 22 19 16
14 16 19 20 20 18 25 19 26 15
28 13 18 17 14 27 24 20 18 25
17 20 23 18 18 24 19 19 14 18
21 21 25 24 14 25 20 17 17 17
15 12 26 23 17 20 24 25 18 15
What is the lowest age?
𝑓
%𝑓 = 𝑋 100
𝑛
Where: %f = the relative frequency
for each class interval
f = the frequency of each class
n = the sample size
Determine the relative
frequency of the ages of
100 residents of
Panhulan, Agoncillo,
Batangas
The cumulative frequency distribution can
also be derived from the frequency
distribution. This distribution can be
obtained by simply adding the class
frequencies. Unlike the relative frequency
distribution where the frequencies are
converted as percents of the sample size,
this type of distribution tries to determine
partial sums from the data classified in terms
of classes.
There are two types of
cumulative frequency
distribution. These are as
follows:
Less than cumulative
frequency distribution