Data Management 1
Data Management 1
Management
1
Math031 - Mathematics in the Modern
”The quiet statisticians have changed our
world; not by discovering new facts or
technical developments, but by changing the
ways that we reason, experiment, and form
opinions” - Ian Hacking
2
Math031 - Mathematics in the Modern
STATISTIC
S
It is an art of learning from the data. It is a
branch of knowledge which deals with
collection, presentation, analysis and
interpretation of data that are subject for
variability.
3
Math031 - Mathematics in the Modern
Application of Statistics
4
Math031 - Mathematics in the Modern
Role of Statistics in Scientific
Process
Science is a method of asking questions
and finding answers.
1. Researchers form theories.
2. Generate testable hypothesis.
3. Design studies / Collect data.
- Observation, measurement
Where does Statistics fit in?
Science method of collecting data
Statistics tools for analyzing data
5
Math031 - Mathematics in the Modern
Two Aims of Statistics
1.Descriptive
2.Inferential
6
Math031 - Mathematics in the Modern
Descriptive Statistics
It pertains to the methods dealing with the
collection, organization and analysis of a set of
data without making conclusions, predictions
or inferences about a larger set.
Examp
le
Suppose we want to describe the test scores in a
specific class of 30 students. We record all of the test
scores and calculate the summary statistics and
produce graphs.
7
Math031 - Mathematics in the Modern
Inferential Statistics
This pertains to the methods dealing with making
inferences, estimation or prediction about a larger set
of data (population) using the information gathered
from a subset of this larger set (sample).
Examp
le
a) You randomly select a sample of 11th graders in your school
and collect data on their SAT scores and other
characteristics.
b) You can use inferential statistics to make estimates and test
hypotheses about the whole population of 11th graders in the
school based on your sample data.
8
Math031 - Mathematics in the Modern
Descriptive vs. Inferential
Statistics
9
Math031 - Mathematics in the Modern
Basic Statistical Terms
Universe or physical population is the set
of all individuals or entities under consideration
or study.
10
Math031 - Mathematics in the Modern
Variable is a characteristic or attribute of persons or
objects which assumes different values or label. This is
a thing that we measure, control or manipulate in a
research varying from unit to unit.
Example
Suppose we collect a dataset by measuring the
resting heart rate, score of an IQ test, favorite
color of every student in a class. The variables in
this dataset would be resting heart rate, score of
an IQ test, favorite color.
11
Math031 - Mathematics in the Modern
Types of data:
1. Qualitative Data (categorical) occur
when we assign objects into labelled
groups or categories.
Example
when we group people according to:
Marital Status, Socio-Economic Status,
Religious Sector, zip code, and
military rank
12
Math031 - Mathematics in the Modern
Types of data:
1. Quantitative Data(either Discrete or
Continuous) occur when we count or
measure.
a) Discrete – can be counted using
integral values
Example
a) count the number of students in a
classroom
b) count the number of computers in
the computer laboratory
13
Math031 - Mathematics in the Modern
World
Types of data:
1. Quantitative Data(either Discrete or
Continuous) occur when we count or
measure.
b) Continuous – can be measured on an
infinite scale
Example
a) measure the weight and height of a
respondent
b) Measure the temperature of covid
patients
14
Math031 - Mathematics in the Modern
World
Dependent and Independent Variables:
1. Dependent Variable – the variable
whose value is being predicted
2. Independent Variable – the predictor
Example
To predict the amount of sunlight on a growth of
a certain plant
Independent: amount of sunlight
Dependent: growth of a certain plant
To evaluate the effect of using computer to the
performance of the students
Independent: using computer
Dependent: performance of students
15
Math031 - Mathematics in the Modern
World
Classification of data according to their dependence on time:
Example
the gross annual income for each of 1000
randomly chosen households in Cebu City
for the year 2000.
16
Math031 - Mathematics in the Modern
World
Classification of data according to their dependence on time:
Example
The financial sector is a typical example of time-
series data usage: be it stocks, cryptocurrencies
or other financial assets, time-series data allows
you to see how prices changed over time and
helps you spot trends.
17
Math031 - Mathematics in the Modern
World
Classification of data according to their dependence on time:
Example
For example, because longitudinal data
measures how long events last, it can be used to
see if the same group of individuals remain
unemployed during a recession, or whether
different individuals are moving in and out of
unemployment.
18
Math031 - Mathematics in the Modern
World
Primary and Secondary Data
Primary Data refer to information which are
gathered directly from the original source or
which are based on direct or first-hand
experience (e.g. surveys, interviews,
observations, registration, autobiographies,
diaries, etc.).
19
Math031 - Mathematics in the Modern
Statistical Population is a collection of all
cases in which the researcher is interested in a
statistical study.
20
Math031 - Mathematics in the Modern
Some of the statistical measures and symbols
are presented in the table.
21
Math031 - Mathematics in the Modern
Levels (Scales) of Measurement
Nominal Scale does not indicate an amount,
rather, it is used for identification, as a name.
Example
gender, race, color, and savings account number.
Example
temperature, score in an exam, and IQ.
Example
socioeconomic status of families, Class Standing (A to
D), and Teacher’s Evaluation (Excellent to Poor).
Ratio scale reflects the true amount of the variable
that is present because the scores measure an actual
amount, there is an equal unit of measurement, and 0
truly means that zero amount of the variable is present.
Example
time or space, height, weight, width, area, and income.