Frequency Distribution: Postgraduate Corner

Download as pdf or txt
Download as pdf or txt
You are on page 1of 3

[Downloaded free from http://www.jpharmacol.com on Wednesday, September 28, 2016, IP: 177.148.138.

166]

Postgraduate Corner

Frequency distribution

Manikandan S
Assistant Editor, JPP

INTRODUCTION are very few, then the shape of the distribution itself cannot
be determined. Generally, 6–14 intervals are adequate.[2]
The next step after the completion of data collection is to
organize the data into a meaningful form so that a trend, if The width of the class can be determined by dividing the range
any, emerging out of the data can be seen easily. One of the of observations by the number of classes. The following are
common methods for organizing data is to construct frequency some guidelines regarding class widths:[1]
distribution. Frequency distribution is an organized tabulation/ • It is advisable to have equal class widths. Unequal class
graphical representation of the number of individuals in widths should be used only when large gaps exist in data.
each category on the scale of measurement.[1] It allows the • The class intervals should be mutually exclusive and
researcher to have a glance at the entire data conveniently. nonoverlapping.
It shows whether the observations are high or low and also • Open-ended classes at the lower and upper side (e.g., <10,
whether they are concentrated in one area or spread out >100) should be avoided.
across the entire scale. Thus, frequency distribution presents
a picture of how the individual observations are distributed in The frequency distribution table of the resting pulse rate
the measurement scale. in healthy individuals is given in Table 1. It also gives the
cumulative and relative frequency that helps to interpret the
data more easily.
DISPLAYING FREQUENCY DISTRIBUTIONS
Frequency distribution graphs
Frequency tables A frequency distribution graph is a diagrammatic illustration
A frequency (distribution) table shows the different of the information in the frequency table.
measurement categories and the number of observations in
each category. Before constructing a frequency table, one Histogram
should have an idea about the range (minimum and maximum A histogram is a graphical representation of the variable
values). The range is divided into arbitrary intervals called
“class interval.” If the class intervals are too many, then Table 1: Frequency distribution of the resting
there will be no reduction in the bulkiness of data and minor pulse rate in healthy volunteers (N = 63)
deviations also become noticeable. On the other hand, if they Pulse/min Frequency Cumulative Relative
frequency cumulative
frequency (%)
Access this article online 60–64 2 2 3.17
Quick Response Code:
65–69 7 9 14.29
Website:
70–74 11 20 31.75
www.jpharmacol.com
75–79 15 35 55.56
80–84 10 45 71.43
DOI:
85–89 9 54 85.71
10.4103/0976-500X.77120
90–94 6 60 95.24
95–99 3 63 100

Address for correspondence:


Manikandan S., Department of Pharmacology, Indira Gandhi Medical College and Research Institute, Kadirkamam, Puducherry, India.
E-mail: [email protected]

54 Journal of Pharmacology and Pharmacotherapeutics | January-March 2011 | Vol 2 | Issue 1


[Downloaded free from http://www.jpharmacol.com on Wednesday, September 28, 2016, IP: 177.148.138.166]

Manikandan: Frequency distribution

of interest in the X axis and the number of observations


(frequency) in the Y axis. Percentages can be used if the
objective is to compare two histograms having different
number of subjects. A histogram is used to depict the frequency
when data are measured on an interval or a ratio scale. Figure  1
depicts a histogram constructed for the data given in Table 1.

A bar diagram and a histogram may look the same but there
are three important differences between them:[3,4]

In a histogram, there is no gap between the bars as the variable


is continuous. A bar diagram will have space between the bars.

All the bars need not be of equal width in a histogram (depends Figure 1: Histogram of the resting pulse rate in healthy volunteers
on the class interval), whereas they are equal in a bar diagram. (N = 63)

The area of each bar corresponds to the frequency in a


histogram whereas in a bar diagram, it is the height [Figure 1].

Frequency polygon
A frequency polygon is constructed by connecting all
midpoints of the top of the bars in a histogram by a straight
line without displaying the bars. A frequency polygon aids in
the easy comparison of two frequency distributions. When
the total frequency is large and the class intervals are narrow,
the frequency polygon becomes a smooth curve known as the
frequency curve. A frequency polygon illustrating the data in
Table 1 is shown in Figure 2.

Box and whisker plot


Figure 2: Frequency polygon of the resting pulse rate in healthy
This graph, first described by Tukey in 1977, can also be volunteers (N = 63)
used to illustrate the distribution of data. There is a vertical
or horizontal rectangle (box), the ends of which correspond
to the upper and lower quartiles (75th and 25th percentile, Outlier
respectively). Hence the middle 50% of observations are Inner Fence
represented by the box. The length of the box indicates the
variability of the data. The line inside the box denotes the
Upper quarle
median (sometimes marked as a plus sign). The position of
the median indicates whether the data are skewed or not.
If the median is closer to the upper quartile, then they are Median
negatively skewed and if it is near the lower quartile, then
positively skewed.
Lower quarle
The lines outside the box on either side are known as whiskers Whisker
[Figure 3]. These whiskers are 1.5 times the length of the
box, i.e., the interquartile range (IQR). The end of whiskers is Figure 3: Schematic diagram of a “box and whisker plot”
called the inner fence and any value outside it is an outlier. If
the distribution is symmetrical, then the whiskers are of equal
length. If the data are sparse on one side, the corresponding side CHARACTERISTICS OF FREQUENCY
whisker will be short. The outer fence (usually not marked) DISTRIBUTION
is at a distance of three times the IQR on either side of the
box. The reason behind having the inner and outer fence at There are four important characteristics of frequency
1.5 and 3 times the IQR, respectively, is the fact that 95% of distribution.[6] They are as follows:
observations fall within 1.5 times the IQR, and it is 99% for • Measures of central tendency and location (mean, median,
3 times the IQR.[5] mode)

Journal of Pharmacology and Pharmacotherapeutics | January-March 2011 | Vol 2 | Issue 1 55


[Downloaded free from http://www.jpharmacol.com on Wednesday, September 28, 2016, IP: 177.148.138.166]

Manikandan: Frequency distribution

• Measures of dispersion (range, variance, standard 2. Dawson B, Trapp RG. Basic and clinical biostatistics. 4th ed. New York:
deviation) McGraw Hill; 2004.
3. Sundaram KR, Dwivedi SN, Sreenivas V. Medical statistics principles and
• The extent of symmetry/asymmetry (skewness) methods. 1st ed. New Delhi: B.I Publications Pvt Ltd; 2010.
• The flatness or peakedness (kurtosis). 4. Swinscow TDV, Campbell MJ. Statistics at square one. 10th ed (Indian).
New Delhi: Viva Books Private Limited; 2003.
5. Norman GR, Streiner DL. Biostatistics the bare essentials. 2nd ed. Hamilton:
These will be dealt with in detail in the next issue. B.C. Decker Inc; 2000.
6. Sundar Rao PS, Richard J. Introduction to biostatistics and research
methods. 4th ed. New Delhi: Prentice Hall of India Pvt Ltd; 2006.
REFERENCES
1. Gravetter FJ, Wallnau LB. Statistics for the behavioral sciences. 5th ed.
Source of Support: Nil, Conflict of Interest: None declared
Belmont: Wadsworth – Thomson Learning; 2000.

56 Journal of Pharmacology and Pharmacotherapeutics | January-March 2011 | Vol 2 | Issue 1

You might also like