Fundamentals of Statistics
Fundamentals of Statistics
Statistics
What is Statistics?
The science of conducting studies to collect, organize,
summarize, analyze, and draw conclusions from data
is called statistics. It is used in almost all fields of
human endeavor such as sports, education, health,
research, and among others. Statistical analysis are
used to manipulate, summarize, and investigate data
for a useful decision – making.
Example:
number of Covid-19cases
poverty rate
number of drop outs in a year
2
What is Statistics?
Sir Ronald Aylmer Fisher (February 17,
1890 – July 29, 1962), British
statistician and geneticist who
pioneered the application of
statistical procedures to the design of
scientific experiments. He is consider
as the Father of Modern Statistics.
3
Population vs. Sample
4
Parameter vs. Statistic
Example:
5
Variable, Observation, Data
6
Classification of Variables
Continuous
Quantitative
Variable Discrete
Qualitative
7
Classification of Variables
Qualitative - measures a characteristic that can be
classified into one of a group of categories and
cannot be measured on a natural numerical scale.
Examples:
Sex (Male, Female)
Employment status of parents (Employed,
Unemployed)
Reason(s) for not attending school
8
Classification of Variables
Quantitative - measures a
characteristicthat
is recorded on a naturally
occurring numerical scale Discrete -
can only take a finite or countable
number of values
Continuous - takes infinitely many
values at any
point on agiven interval
9
Level of Measurement
Examples:
Sex (Male, Female)
Employment status of parents(Employed,
Unemployed)
Reason(s) for not attending school
10
Level of Measurement
Ordinal - qualitative variables with an inherent
ordering.
Examples:
Highest educational attainment of parents
(elementary graduate, high school graduate,
college graduate,etc.)
Conduciveness of household for studying (1 –
lowest to 10 –highest)
11
Level of Measurement
Examples:
Daily maximum temperature (in Celsius)
Intelligent quotient score
12
1. One can measure time during the day using a 12-hour clock, this is a good example of interval
data. Time in a 12-hour format is a rotational measure that keeps restarting from zero at set
periodicity. These numbers are on an interval scale as the distance between them is measurable and
comparable. For example, the difference between 5 minutes and 10 minutes is the same as 15
minutes and 20 minutes in a 12-hour clock.
2. The temperature measured in Fahrenheit and Celsius but not in Kelvin. If you measure temperature
in Fahrenheit and Celsius then it will be considered interval data as 0 is arbitrary. But in Kelvin, 0 is
absolute. There can’t be a temperature below zero degrees in Kelvin.
3. When you calculate intelligence score in an IQ test. There is no zero point for IQ. According to
psychological studies, a person cannot have zero intelligence, therefore in this example, zero is
arbitrary. IQ is numeric data expressed in intervals using a fixed measurement scale.
4. Test scores of examination like SAT. Scores in SAT test are in the range of 200-800. The numbers from
0 to 200 are not used when they scale the raw score (number of questions answered correctly) to the
section score. The reference point is not an absolute zero, thus, it qualifies to become interval data.
5. Age is also a variable that can be measured on an interval scale. For example if A is 15 years old
and B is 20 years old, it not only clear than B is older than A, but B is elder to A by 5 years.
Level of Measurement
Examples:
Household size
Household monthly net income (in pesos)
Distance from house to school (in km)
14
Broad Categories of Statistics
Examples:
Construction of graphs, charts and tables
Frequency count
Percentage
Measures of central tendency (mean, median, mode)
Measures of variability (Variance, standard deviation,
etc.)
Measures ofposition 15
Broad Categories of Statistics
16
Examples:
Parametric
Z-tests
T-tests
F-tests (such as ANOVA-Analysis of Variance)
Pearson correlation (Pearsonr)
Non-parametric
Wilcoxon Rank sum Test
Mann-WhitneyU Test
Spearman Correlation
Kruskal Wallis Test
Friedman Test
Data Collection and Sampling Techniques
17
Sampling
It is the process of selecting a number of
individuals for a study in such a way that they
represent the larger group from which they were
selected.
The process which involves taking a part of the
population, making observations on this
representative groups, and then generalizing the
findings to the bigger population.
Data Collection and Sampling Techniques
19
Why sample?
Makes possible the study of large,
heterogeneous population
Saves time, money, and effort
More effective
Faster and cheaper
More accurate
Gives more
Comprehensive information
Data Collection and Sampling Techniques
23
▫ Slovin’s Formula
𝑁
▫ 𝑛=
1+𝑁𝑒 2
▫
▫ where:
▫ 𝑛 is the samplesize
▫ 𝑁 is the population size
▫ 𝑒is the margin of error (e.g. 0.01, 0.05, 0.1, etc)
Data Collection and Sampling Techniques
24
Example:
Assuming a certain is to be conducted to a
certain community with 6,518 residents.
Determine the number of respondents of the
study with 5% level of significance using
Slovin’s formula.
Data Collection and Sampling Techniques
27
𝑁
𝑛=
1 + 𝑁𝑒 2
6,518
𝑛=
1 + 6,518 0.05 2
6,518
𝑛=
1 + 6,518 0.0025
Data Collection and Sampling Techniques
28
6,518
𝑛=
1 +16.295
6,518
𝑛=
17.295
𝑛 = 376.87 ≈ 377
This implies that using Slovin’s formula, the
given’s sample size is 377(respondents).
Data Collection and Sampling Techniques
29
Sampling Techniques
Sampling techniques are methods of
identifying who will be the respondents of
the study (sample). For instance, in the
previous example, how to identify the 377
respondents? Here comes the sampling
techniques.
Data Collection and Sampling Techniques
32
Data Collection and Sampling Techniques
34
Data Collection and Sampling Techniques
35
Data Collection and Sampling Techniques
36
Data Collection and Sampling Techniques
37
Data Collection and Sampling Techniques
38
Data Collection and Sampling Techniques
39
Data Collection and Sampling Techniques
40
Data Collection and Sampling Techniques
41
Data Collection and Sampling Techniques
43
Data Collection and Sampling Techniques
44
Data Collection and Sampling Techniques
a. Convenience or AccidentalSampling
Technique
The use of most convenient way of determining
the samples.
45
Data Collection and Sampling Techniques
48
Data Collection and Sampling Techniques
49
Data Collection and Sampling Techniques
51
References
• Institute of Statistics - UPLB (2021). Software-Assisted Statistical Analysis: Basic
Statistical Methods using R. [PowerPoint Slides]
• Mascariñas, A. (2022). QRM 6 [Selecting a Sample from the Population] [PowerPoint
Slides]
• Peck, R. (2020). Introduction to Statistics and Data Analysis. Cengage Learning, Inc.
• Johnson, R.A. (2019). Statistics: Principles and Methods. Wiley.
• Jabilles, A.B. (2018). Statistics for Thesis Writers. Books Atbp. Publishing Corp.
• Murphy, C. (2018). Statistics and Sampling Theory. Willford Press.
• Blumann, A.G. (2017). Elementary Statistics: A Step-by-Step Approach. McGraw-Hill
Education.
• Navidi, W.C. & Monk, B.J. (2019). Elementary Statistics. McGraw-Hill Education.
• Triola, M.F. (2018). Elementary Statistics. Pearson.
52
THANK YOU!