Chapter 1 Correct
Chapter 1 Correct
Introduction to Statistics
Definition
• The term statistics have two definitions;
• In its plural sense, it is equivalent to numerical facts,
figures or measurements.
• But all numerical figures/data are not statistics.
A numerical data to be statistical data
The data should be numerically expressed.
The data must be comparable.
The data should be collected in systematic manner.
The data should be collected for a pre-determined
purpose.
1
Definition …
Statistics in its singular sense:
The branch of applied research that deals with the
development and application of methods for
collecting, organizing, presenting, analysing and
interpreting of numerical data.
Generally [Statistics]
• Is a science that helps us make better decisions in
business and economics as well as in other fields.
• Teaches us how to summarize, analyze, and draw
meaningful inferences from data that then lead to
improve decisions.
• Maximizing information by reducing error
2
Classifications of statistics
Descriptive Statistics:
Is concerned with summary calculations,
graphs, charts and tables.
A statistical method that is concerned with the
collection, organization, summarization, and
analysis of data from a sample of population.
Helps to describe a given set of data without
going beyond that data themselves.
3
……….Continued
Inferential Statistics:
is a method used to generalize from a sample to
a population(helps to make inference /conclusion
about a population based on the selected
sample).
It consists of
Predict and forecast values of population
parameters
Test hypotheses about values of population
parameters
Make decisions
4
Stages in Statistical Investigation
5
……….Continued
C. Presentation of the data: The process of re-
organization, classification, compilation, and
summarization of data to present it in a meaningful
form
overall view of what the data actually looks like.
facilitate further statistical analysis.
Can be done in the form of tables and graphs or diagrams.
D. Analysis of data: The process of extracting
relevant information from the summarized data(like
mean,median,mode,range,variance….)
To dig out useful information for decision
making
6
……….Continued
E. Inference(Interpretation) of data:
Concerned with drawing conclusions from
the data collected and analyzed; and giving
meaning to analysis results.
A difficult task and requires a high degree of skill
and experience.
7
Definition of Some Basic terms
A population consists of the set of all measurements/elements
under study for which the investigator is interested.
12
……….Continued
• Furnishes a technique of comparison
• Estimating unknown population
characteristics.
• Testing and formulating of hypothesis.
• Studying the relationship between two or
more variable.
• Forecasting future events.
13
Limitations of statistics
As a science statistics has its own limitations. The
following are some of the limitations:
• Deals with only quantitative information(It does not
study qualitative characteristics directly).
• Deals with only aggregate of facts and not with
individual data items.
• Statistical data are only approximately and not
mathematical correct.
• Statistics can be easily misused and therefore
should be used be experts.
14
Types of variables(Data)
On the basis of information
contained by the data
Qualitative Variables :- Are non
numeric variables and can't be measured.
Examples gender, religious affiliation,
Color, Nationality, marital status of
patients and state of birth.
15
……….Continued
Quantitative Variables are numerical variables and
can be measured.
Examples balance in checking account, number of
children in family, Temperatures, Salaries, Number
of points scored on a 100 point exam, Number of
students in a class; number of cars in a parking lot etc
16
……….Continued
Note that quantitative variables are either discrete (which
can assume only certain values, and there are usually "gaps"
between the values, such as the number of bedrooms in your
house)
17
Discrete Variables
• are variables which assume a finite or countable number of
possible values.
• are usually obtained by counting.
• is characterized by gaps or interruptions in the values
that it can assume. These gaps or interruptions indicate the
absence of values between particular values that the
variable can assume.
Example:
• The number of daily admissions to a general hospital, and
• The number of first year statistics students
• The number of decayed, missing or filled teeth per child in
an elementary school.
Continuous Variables
• are variables which assume an infinite number
of possible values between any two specific
values.
• are usually obtained by measurement.
• does not possess the gaps or interruptions
characteristic of a discrete variable.
Example:
• Weight, age, length, temperature, weight,
speed, salary and mark of students
Types of Data…
On the basis of their source :
1. Primary data
Data generated for the first time primarily/originally for the
study in question.
2. Secondary data
Data obtained from other pre-existing/ priorly collected
sources.
21
Nominal Scales
Only "naming" and classifying observations is possible.
When numbers are assigned to categories, it is only for
coding purposes and it does not provide a sense of size.
Level of measurement which classifies data into mutually
exclusive, all inclusive categories in which no order or ranking
can be imposed on the data.
22
……….Continued
Examples:
– Political party preference (Republican,
Democrat, or Other,)
– Sex (Male or Female.)
– Marital status(married, single, widow, divorce)
– Country code
– Regional differentiation of Ethiopia.
– Eye color (e.g. brown, blue)
– religion (Muslim, Christian),
– place of residence (urban, rural) etc
23
Ordinal Scales
• Level of measurement which classifies data
into categories that can be ranked.
We can talk of greater than or less than and it conveys
meaning to the value but;
Impossible to express the real difference between
measurements in numerical terms.
• Ordering is the sole property of ordinal
scale.
• Used for grouping and ordering
+, -, *, / are impossible
The magnitude b/n the values is not clearly known 24
……….Continued
• Arithmetic operations are not applicable but relational
operations are applicable. Examples:
– Letter grades (A, B, C, D, F).
– Rating scales (Excellent, Very good, Good, Fair,
poor).
– Military status/ranks.
– Economic status (poor,medium,higher) or Socio-
economic status (very low, low, medium, high,
very high)
– severity(mild, moderate, sever)
– blood pressure (very low, low, high, very high etc.
25
Interval Scales
• Level of measurement which classifies data that can be ranked
and differences are meaningful. (The magnitude b/n the values
is clearly known)
27
……….Continued
Examples:
– Weight
– Height
– Number of students / items
– Age
– Salary
– Volume
– Length
28
Exercise-1
The following are list of different attributes/ variables or
data. Classify the variables/data in to different
measurement scales.
1. Your checking account number as a name for your account.
2. Your score on statistics test as a measure of your knowledge of
statistics.
3. A response to the statement "Abortion is a woman's right"
where "Strongly Disagree" = 1, "Disagree" = 2, "No Opinion"
= 3, "Agree" = 4, and "Strongly Agree" = 5, as a measure of
attitude toward abortion.
4. Times for swimmers to complete a 50-meter race
5. Months of the year as September, October…
6. Economic status of a family when classified as low, middle
and upper classes.
7. Blood type of individuals as A, B, AB and O.
8. Regions of Ethiopia as region 1, region 2, region 3…
30
LEVELS OF MEASUREMENT
• A level of measurement is the precision by which a
variable is measured.
• The four levels of measurement, from lowest to highest,
are:
– Nominal
– Ordinal
– Interval
– Ratio
• QUIZ: (Identify the usual level of measurement for each
of the following)
1. year in school 2. IQ scores
3. life expectancy 4. fatigue
5. cynicism 6. grade point average
7. hair color 8. type of neighborhood
9. temperature 10. climate