Unit 1,2 Introduction N Summarization
Unit 1,2 Introduction N Summarization
Unit 1
Data Introduction
and Summarization
WHAT IS STATISTICS ?
Statistics is the science of
learning from the data
It is concerned with
data collection
data analysis
data interpretation
TYPES OF STATISTICS
Descriptive Statistics
It deals with collecting, summarizing and simplifying
data, which are otherwise quite unwieldy and
voluminous.
When the population interest is small, we will be able
to directly describe the important aspects of the
population measurements.
Inferential Statistics
It is the science of using a sample to make
generalizations about the important aspects of a
population.
A descriptive value for a population is called a
SAMPLE
Usually populations are so large that a
researcher cannot examine the entire group.
Therefore, a sample is selected to represent
the population in a research study. The goal
is to use the results obtained from the
sample to help answer questions about the
population.
A sample is a subset o the elements of a
population.
DATA CLASSIFICATION AND PRESENTATION
Male Females
Female Female
Male Male Unemploy
Unemploy Employed ed
Employed ed
QUANTITATIVE CLASSIFICATION
157 2 2500-3000 18
158 12 3000-3500 12
159 12
Total 100
TABULAR AND GRAPHICAL METHODS
Scatter Diagrams
SUMMARIZING QUALITATIVE DATA
Frequency Distribution
Relative Frequency
Bar Graph
Pie Chart
FREQUENCY DISTRIBUTION
A frequency distribution is a tabular summary of
data showing the frequency (or number) of items
in each of several non-overlapping classes.
Rating Frequency
Poor 2
Below Average 3
Average 5
Above Average 9
Excellent 1
Total 20
RELATIVE FREQUENCY DISTRIBUTION
The relative frequency of a class is the fraction
or proportion of the total number of data items
belonging to the class.
Relative Percent
Rating Frequency Frequency
Poor .10 10
Below Average .15 15
Average .25 25
Above Average .45 45
Excellent .05 5
Total 1.00 100
BAR GRAPH
9
8
7
Frequency
6
5
4
3
2
1
Rating
Poor Below AverageAbove Excellent
Average Average
PIE CHART
The pie chart is a commonly used
graphical device for presenting relative
frequency distributions for qualitative
data.
First draw a circle, then use the relative
Quality Ratings
EXAMPLE: MARADA INN
Insights Gained from the Preceding Pie Chart
85 7
93 6 7 8
STEM-AND-LEAF DISPLAY
Leaf Units
A single digit is used to define each leaf.
In the preceding example, the leaf unit was 1.
Leaf units may be 100, 10, 1, 0.1, and so on.
Where the leaf unit is not shown, it is assumed to
equal 1.
EXAMPLE: LEAF UNIT = 0.1
If we have data with values such as
8.6 11.7 9.4 9.1 10.2 11.0 8.8
a stem-and-leaf display of these data will be
5 2 7
6 2 2 2 2 5 6 7 8 8 8 9 9 9
7 1 1 2 2 3 4 4 5 5 5 6 7 8 9
9 9
8 0 0 2 3 5 8 9
9 1 3 7 7 7 8 9
10 1 4 5 5 9
SCATTER DIAGRAM
x = Number of y = Number of
Interceptions Points Scored
1 14
3 24
2 18
1 17
3 27
EXAMPLE: PANTHERS FOOTBALL
TEAM
Scatter Diagram
30
25
20
15
10
5
0 x
0 1 2 3
Number of Interceptions
EXAMPLE: PANTHERS FOOTBALL TEAM
x
SCATTER DIAGRAM
A Negative Relationship
y
x
SCATTER DIAGRAM
No Apparent Relationship
y
x
TABULAR AND GRAPHICAL PROCEDURES
Data
Qualitative
Qualitative Data
Data Quantitative Data