PDF Document
PDF Document
69
48
Example
• From the chart:
- What is this chart called?
- How many observations are in study?
- What are the max and min values?
- Around what values do observations tend
to cluster?
Example
• In one class, there are 20 students that came from
various countries to study business major. The 20
students that study business are from:
Egypt Japan Korea India Egypt
Japan India Egypt China Korea
China China Australia China China
Korea China Egypt Egypt Japan
Construct dot diagram (Pareto)
Stem-and-Leaf Displays
• A statistical technique for displaying a set of data.
• Each numerical value is divided into two parts: the
leading digits become the stem and the trailing
digits become the leaf.
• Ex: Stock prices on twelve consecutive days for a
major publicly traded company
Double-Stem Display
• A modification display from Stem-and-Leaf
• Basically, Display divide into two parts;
First stem position to hold leaves 0, 1, 2, 3
and 4.
Second stem position to hold leaves 5, 6, 7,
8, and 9.
Example
• The first row of a stem-and-leaf chart
appears as follows: 62 1 3 3 7 9. Assume
whole number values.
a. What is the “possible range” of the values
in this row?
b. How many data values are in this row?
c. List the actual values in this row of data.
Quartiles
- Divide a set of observations into 4 equal parts
Lp = (n+1)
Example
• 20, 15, 12, 11, 18, 11, 19, 15, 18
Find the Q1, Q2 and Q3!
Example of Quartiles
Using the twelve stock prices, we can find the
median, 25th, and 75th percentiles as follows:
Quartile 3
Median
Quartile 1
Percentile
th
12 96 75 percentile
Q4 11 92 Price at 9.75 observation = 88 + .75(91-88)
10 91 = 90.25
9 88
Q3 8 86
50th percentile: Median
7 85
Price at 6.50 observation = 84 + .5(85-84)
6 84
= 84.50
Q2 5 83
4 82 th
3 79 25 percentile
Q1 2 78 Price at 3.25 observation = 79 + .25(82-79)
1 69 = 79.75
Example
• Find 10th, 38th, 50th, and 76th percentile from
the data below:
20 35 42 35 26 37 47 38 31 40
33 41 35 24 36 46 37 30 39 50
41 21 22 43 44 27 26 49 48 31
Frequency Distributions
• A grouping of data into mutually exclusive
categories showing the number of observations in
each class.
• Constructing a frequency distribution involves:
- Determining the question to be addressed
- Collecting raw data
- Organizing data (Frequency Distributions)
- Presenting data (Graph)
- Drawing Conclusion
Organizing Data (Frequency
Distributions)
• Arrange data from the smallest to the largest
• Determine “Range” from data
Range = Largest data – Smallest data
• Determine how many class (k);
A. Trial and Error
B. [Sturgess Formula] k = 1 + 3.3 log n ; k need to be rounded, k =
class(s), n = data
C. 2k > n; n = data
• Determine the length of class interval (i)
i = R/k
• Determine the first lower class limit
• Write down frequency in table with tally according with the data
Frequency and Categorical
Distributions
Categorical Distribution
Frequency Distribution
Some Rules in Choosing the
Classes
• Hardly ever use fewer than 6 or more than
15 classes
• Make sure each item (measurement or
observation) goes into only one class
• Use classes covering equal ranges (or
intervals) of values
Parts of Frequency Distributions
• Class
• Class limits; lower class limits and upper
class limits
• Class boundary or real class limits
• Class midpoint or class marks
• Class interval
• Class Frequency
Two Ways to Modify Frequency
Distributions to Suit Particular Needs
• Percentage • Cumulative
Distribution Distribution
(Frequency and
Percentage)
By dividing each class
frequency by the total
To convert it into a “less than”,
number of items grouped
or “more than”
and then multiplying by
100%
Example
• Below is the data of 50 employee's salary in
a year (in million Rupiah)
80 18 69 51 71 92 35 28 60 45
63 59 64 98 47 49 48 64 58 74
85 56 72 38 89 55 28 67 84 78
37 73 65 66 86 96 57 76 57 19
54 76 49 53 83 55 83 47 64 39
Construct a frequency distribution!
Graphical Presentation
• Histogram is a graph in which the class midpoints
or limits are marked on the horizontal axis and the
class frequencies on the vertical axis. The class
frequencies are represented by the heights
of the bars and the bars are drawn adjacent
to each other.
midpoint
Graphical Presentation
• Frequency Polygon consists of line segments
connecting the points formed by the class
midpoint and the class frequency.
Graphical Presentation
• Cumulative Frequency Distribution is used to
determine how many or what proportion of the
data values are below or above a certain value.
Other Useful Charts
• Bar Chart can be used to depict any of the levels
of measurement
• Pie Chart is useful for displaying a relative
frequency distribution. A circle is divided
proportionally to the relative frequency and
portions of the circle are allocated for the different
groups.
Bar Chart and Pie Chart