CHAPTER 1: Introduction To Statistical Concepts: Objectives
CHAPTER 1: Introduction To Statistical Concepts: Objectives
Objectives:
1. Enumerate the importance and limitations of statistics.
.
2. Know the difference between descriptive and inferential statistics.
3. Distinguish between qualitative and quantitative variables.
DEFINITION OF STATISTICS
Statistics plays a major role in many aspects of our lives. It is used in sports, for example, to help a
general manager decide which player might be the best fit for a team. It is used in politics to help candidates
understand how the public feels about various policies. And statistics is used in medicine to help determine
the effectiveness of new drugs. Used appropriately, statistics can enhance our understanding of the world
around us. Used inappropriately, it can lend support to inaccurate beliefs. Understanding statistical methods
will provide you with the ability to analyze and critique studies and the opportunity to become an informed
consumer of information. Understanding statistical methods will also enable you to distinguish solid analysis
from bogus “facts.”
Many people say that statistics is numbers. After all, we are bombarded by numbers that supposedly
represent how we feel and who we are. Certainly, statistics has a lot to do with numbers, but this definition is
only partially correct. Statistics is also about where the numbers come from (that is, how they were obtained)
and how closely the numbers
reflect reality.
Statistics is the science of collecting, organizing, summarizing, and analyzing information to draw
conclusions or answer questions. In addition, statistics is about providing a measure of confidence in any
conclusions.
Let’s break this definition into four parts. The first part states that statistics involves the collection of
information. The second refers to the organization and summarization of information. The third states that
the information is analyzed to draw conclusions or answer specific questions. The fourth part states that
results should be reported using some measure that represents how convinced we are that our conclusions
reflect reality.
Page 1
MODULE STATISTICAL ANALYSIS WITH SOFTWARE APPLICATION – CAE11
• Statistics is important because it enables people to make decisions based on empirical evidence.
• Statistics provides us with tools needed to convert massive data into pertinent information that can be used
in decision making.
• Statistics can provide us information that we can use to make sensible decisions.
Field of Statistics
A. Mathematical Statistics- The study and development of statistical theory and methods in the abstract.
B. Applied Statistics- The application of statistical methods to solve real problems involving randomly
generated data and the development of new statistical methodology motivated by real problems. Example
branches of Applied S t a t i s t i c s : psychometric, econometrics, and biostatistics.
Limitation of Statistics
1. Statistics is not suitable to the study of qualitative phenomenon.
2. Statistics does not study individuals.
3. Statistical laws are not exact.
4. Statistics table may be misused.
5. Statistics is only, one of the methods of studying a problem.
Definitions:
• Universe is the set of all entities under study.
• A Population is the total or entire group of individuals or observations from which information is desired by
a researcher. Apart from persons, a population may consist of mosquitoes, villages, institution, etc.
• An individual is a person or object that is a member of the population being studied.
• A statistic is a numerical summary of a sample.
Page 2
MODULE STATISTICAL ANALYSIS WITH SOFTWARE APPLICATION – CAE11
In the PHP100 study presented, the population is all the students at the school. Each student is an individual.
The sample is the 50 students selected to participate in the study. Suppose 39 of the 50 students stated that
they would return the money to the owner. We could present this result by saying that the percent of
students in the survey who would return the money to the owner is 78%. This is an example of a descriptive
statistic because it describes the results of the sample without making any general conclusions about the
population. So 78% is a statistic because it is a numerical summary based on a sample. Descriptive statistics
make it easier to get an overview of what the data are telling us.
If we extend the results of our sample to the population, we are performing inferential statistics. The
generalization contains uncertainty because a sample cannot tell us everything about a population.
Therefore, inferential statistics includes a level of confidence in the results. So rather than saying that 78%
of all students would return the money, we might say that we are 95% confident that between 74% and 82%
of all students would return the money. Notice how this inferential statement includes a level of confidence
Page 3
MODULE STATISTICAL ANALYSIS WITH SOFTWARE APPLICATION – CAE11
(measure of reliability) in our results. It also includes a range of values to account for the variability in our
results. One goal of inferential statistics is to use statistics to estimate parameters.
PROCESS OF STATISTICS
1. Identify the research objective.
A researcher must determine the question(s) he or she wants answered. The question(s) must clearly identify
the population that is to be studied. Identify the research objective.
2. Collect the information needed to answer the questions.
Conducting research on an entire population is often difficult and expensive, so we typically look at a sample.
This step is vital to the statistical process, because if the data are not collected correctly, the conclusions
drawn are meaningless. Do not overlook the importance of appropriate data collection.
Example:
A research objective is presented. For each research objective, identify the population and sample in the
study.
1. The Philippine Mental Health Associations contacts 1,028 teenagers who are 13 to 17 years of age and
live in Antipolo City and asked whether or not they had been prescribed medications for any mental disorders,
such as depression or anxiety.
1. A farmer wanted to learn about the weight of his soybean crop. He randomly sampled 100 plants and
weighted the soybeans on each plant.
Population: Entire soybean crop
Sample: 100 selected soybean crop
3. Organize and summarize the information.
Descriptive statistics allow the researcher to obtain an overview of the data and can help determine the type
of statistical methods the researcher should use.
4. Draw conclusion from the information.
Page 4
MODULE STATISTICAL ANALYSIS WITH SOFTWARE APPLICATION – CAE11
In this step the information collected from the sample is generalized to the population. Inferential statistics
uses methods that takes results obtained from a sample, extends them to the population, and measures the
reliability of the result.
Take Note!
If the entire population is studied, then inferential statistics is not necessary, because descriptive statistics
will provide all the information that we need regarding the population.
Example:
For the following statements, decide whether it belongs to the field of descriptive statistics or inferential
statistics.
1. A badminton player wants to know his average score for the past 10 games.
(Descriptive Statistics)
2. A car manufacturer wishes to estimate the average lifetime of batteries by testing a sample of 50 batteries.
(Inferential Statistics)
3. Janine wants to determine the variability of her six exam s cores in Algebra.
(Descriptive Statistics)
4. A shipping company wishes to estimate the number of passengers traveling via their ships next year using
their data on the number of passengers in the past three years. (Inferential Statistics)
5. A politician wants to determine the total number of votes his rival obtained in the past election based on
his copies of the tally sheet of electoral returns.
(Descriptive Statistics)
Page 5
MODULE STATISTICAL ANALYSIS WITH SOFTWARE APPLICATION – CAE11
allow us to determine the weights of all tomatoes. However, the weights of the tomatoes vary. One goal of
research is to learn the causes of the variability so that we can learn to grow plants that yield the best
tomatoes.
It is helpful to divide variables into different types, as different statistical methods are applicable to each. The
main division is into qualitative (or categorical) or quantitative (or numerical variables).
Variables can be classified into two groups:
1. Qualitative variables (Categorical) is variable that yields categorical responses. It is a word or a code that
represents a class or category.
2. Quantitative variables (Numeric) takes on numerical values representing an amount or quantity.
Example:
Determine whether the following variables are qualitative or quantitative.
1. Haircolor (Qualitative)
2. Temperature (Quantitative)
3. Stages of breast cancer (Qualitative)
4. Number of hamburger sold (Quantitative)
5. Number of children (Quantitative)
6. Zip code (Qualitative)
7. Place of birth (Qualitative)
8. Degree of pain (Qualitative)
Page 6
MODULE STATISTICAL ANALYSIS WITH SOFTWARE APPLICATION – CAE11
3. The distance of a 2005 Toyota Prius can travel in city conditions with a full tank of gas. (Continuous)
4. Number of words correctly spelled. (Discrete)
5. Time of a runner to finish one lap. (Continuous)
It is important to know which type of scale is represented by your data since different statistics are appropriate
for different scales of measurement. A characteristic may be measured using nominal, ordinal, interval and
ration scales.
1. Nominal Level - They are sometimes called categorical scales or categorical data. Such a scale classifies
persons or objects into two or more categories. Whatever the basis for classification, a person can only be in
one category, and members of a given category have a common set of characteristics.
Example:
- Method of payment (cash, check, debit card, credit card)
- Type of school (public vs. private)
- Eye Color (Blue, Green, Brown)
2. Ordinal Level - This involves data that may be arranged in some order, but differences between data values
either cannot be determined or meaningless. An ordinal scale not only classifies subjects but also ranks them
Page 7
MODULE STATISTICAL ANALYSIS WITH SOFTWARE APPLICATION – CAE11
in terms of the degree to which they possess a characteristic of interest. In other words, an ordinal scale puts
the subjects in order from highest to lowest, from most to least. Although ordinal scales indicate that some
subjects are higher, or lower than others, they do not indicate how much higher or how much better.
Example:
- Food Preferences
- Stage of Disease
- Social Economic Class (First, Middle, Lower)
- Severity of Pain
Page 8
MODULE STATISTICAL ANALYSIS WITH SOFTWARE APPLICATION – CAE11
Both interval and ratio data involve measurement. Most data analysis techniques that apply to ratio data also
apply to interval data. Therefore, in most practical aspects, these types of data (interval and ratio) are grouped
under metric data. In some other instances, these type of data are also known as numerical discrete and
numerical continuous.
Example:
Categorize each of the following as nominal, ordinal, interval or ratio measurement.
1. Ranking of college athletic teams.(Ordinal)
2. Employee number. (Nominal)
3. Number of vehicles registered. (Ratio)
4. Brands of soft drinks. (Nominal)
5. Number of car passers along C5 on a given day. (Ratio)
6. Zip code (Nominal)
7. Degree of pain (Ordinal)
For more knowledge about this lesson, please check the link provided;
https://www.youtube.com/watch?v=sxQaBpKfDRk
REFERENCES
https://www.investopedia.com/terms/s/statistics.asp
Page 9