Introduction To Statistics
Introduction To Statistics
Definition of Statistics
Types of variables
Types of Statistics
4
*What is statistics?
*Statistics is the science of data.
*It involves:
Collection
Organization
Summarization
Presentation
Analyses
Interpretation of Data.
5
*Data & Variables
Data are measurements/observations/facts of a single variable
or a set of variables which when collected, organized and
evaluated become information.
8
*Types of Variables
Categorical (qualitative) vs. Quantitative
Categorical variables are those for which the values indicate
different groupings.
• They are classified in categories or groups.
• The groups are distinguished base on quality (attribute or
characteristics of someone or something).
• Mathematical operations not applicable. E.g political party
affiliation, gender, race etc.
11
*Levels of measurement
Nominal Interval
Quantitative
Categorical
Rankable
Consist of names, labels, categories
Can determine numerical
Not rankable
difference
Cannot determine numerical
difference Ratios are meaningless
Can only describe the counts for No true zero
each category E.g. Temperature (Celsius and
E.g. gender, religion, colour Fahrenheit scales), Years
Ratio
Ordinal Quantitative
Rankable
Categorical Can determine numerical
Rankable difference
Cannot determine numerical Ratios are meaningful
difference
True zero
E.g. education level, type of
degree, course grades (A, B, C, D, E.g. income, speed, prices of
F) etc. college textbooks etc.
12
*
13
*
• A researcher wonders if the age at which people have their
first child is related to their level of education. To investigate
this possibility, he divides people who have had children into
four education brackets: high school or less, some college,
college degree, and advanced degree. He then records the
age at which each person had his or her first child. Determine
the scale of measurement used for each of the variables listed
below:
a) Education bracket: Nominal Ordinal Interval/ratio
b) Age at birth of first child: Nominal Ordinal
Interval/ratio
14
*
* A researcher used government records to classify each
family into one of seven different income categories
ranging from “below the poverty line” to “more than $1
million a year.” The researcher used police records to
determine the number of times each family was
burglarized. Determine the scale of measurement used
for each of the following variables:
a) Income category: Nominal Ordinal
Interval/ratio
b) Number of times burglarized: Nominal Ordinal
Interval/ratio
15
*
Determine which of the variables are discrete and which
are continuous.
• Number of times burglarized
• Gender of the participants
• Age at birth of first child
16
*Population vs. Sample
Population: The collection of all measurements of interest to the researcher.
It is the group of all units or subjects under investigation. Quantities
computed based on a population are called parameters.
Random Sample: It is a sample collected such that all sampling units have an
equal chance of being selected (no sampling units is favoured over another).
17
*
Determine whether the given value is a statistic or a
parameter.
* A sample of households is selected and the average
(mean) number of people per household is 2.58 (based
on data from U.S. Census Bureau)
* Currently 42% of the governors of the 50 United States
are Democrats.
* A polling organization asked a representative sample of
50- to 55-year-olds living in the United States to
determine how much they had saved for retirement. The
average amount saved was $125,000.
18
*Types of Statistics
Descriptive Statistics:
Refers to a collection of techniques used to organize, summarize and
characterize data.
E.g. the use of tables, graphs and statistics such as the mean and the
mode.
Inferential Statistics:
Refers to a collection of techniques used to interpret and make inferences
about a population based on sample information.
20
21