Freq Distribution
Freq Distribution
Data are a set of facts, and provide a partial picture of reality. They must
be summarized, organized, and analyzed to usefully derive information
from them.
Advantages:
Disadvantages:
Whereas tables can be used for presenting all the information, graphs
simplify complex information by using images and emphasizing data
patterns or trends, and are useful for summarizing, explaining, or
exploring quantitative data.
While graphs are effective for presenting large amounts of data, they
can be used in place of tables to present small sets of data.
A graph or chart maybe a bar graph, line graph, pie chart, pictograph
or a statistical map.
B. Frequency Distribution
Frequency
It is the number of times that value appears in a data set.
Frequency Distribution
1. Set the values of data, which are called scores, in the column starting
from the lowest value to the highest or vice versa;
2. Create the second column with the frequency of each data occurrence.
This column is known as the tally of the scores;
3. Create the third column, where the relative frequency of each score
will be inserted. The relative frequency can be obtained as follows:
f
fr =
N
Where:
f = the frequency of each score (from the second column)
N = the total number of scores.
6. The next column is called “cumulative proportion” and the values of its
column are obtained as a ratio of cumulative frequency for each score
and the total number of scores.
We begin with example of raw data on the electricity cost during the
month of July 1999 for a random sample of 50 two-bedroom apartments in a
large city.
2. Determine the range or gap between the highest and the lowest
scores.
k = 1+3.322 log 50
k = 1+5.64
k = 6.64 or 7
Range
C=
k
131
C=
7
C=18.7∨19
5. Pick the starting value that is less than or equal to the smallest value.
Try to make it multiple of the group size if you can.
Since the width of each class interval for Utility Charges has been set
as 19, the boundaries of the various class groupings must be
established so as to include the entire range of observations.
Whenever possible, these boundaries should be chosen to facilitate the
reading and interpreting of data. Thus, the first class interval ranges
from 82 to 100, the second from 101 to 119 and so on, until they have
been tallied into seven classes, each having an interval width of 19,
without overlapping. By establishing these boundaries, all 50
observations can be tallied into each class.
8. Determine the class mid-point (also known as the class mark). Class
mid-point is the point mid-way the boundaries of each class interval
and is representative of the data within the class interval.
82+100
o Class Mid-point = = 91
2
Utility Charges
f Relative Frequency Percentage
(Class)
82-100 4 4/50 = 0.08 0.08 x 100 = 8
101-119 7 0.14 14
120-138 8 0.16 16
139-157 13 0.26 26
158-176 9 0.18 18
177-195 5 0.10 10
196-214 4 0.08 8
Total 50 1.00 100
C. Graphical Representation
Graphical Representation is a way of analyzing numerical data. It exhibits
the relation between data, ideas, information and concepts in a diagram. It
is easy to understand and it is one of the most important learning strategies.
It always depends on the type of information in a particular domain. There
are different types of graphical representation. Some of them are as follows
Line Graphs – Linear graphs are used to display the continuous data
and it is useful for predicting the future events over time.
Line Plot – It shows the frequency of data on a given number line. ‘ x
‘ is placed above a number line each time when that data occurs again.
Frequency Table – The table shows the number of pieces of data that
falls within the given interval.
Circle Graph – Also known as pie chart that shows the relationships
of the parts of the whole. The circle is considered with 100% and the
categories occupied is represented with that specific percentage like
15%, 56% , etc.
Stem and Leaf Plot – In stem and leaf plot , the data are organized
from least value to the greatest value. The digits of the least place
values from the leaves and the next place value digit forms the stems.
Box and Whisker Plot – The plot diagram summarizes the data by
dividing into four parts. Box and whisker shows the range (spread) and
the middle ( median) of the data.
Graphic presentation has a number of advantages, some of which
are enumerated below:
1. Graphs are visual aids which give a bird’s eye view of a given set of
numerical data. They present the data in simple, readily
comprehensible form.
3. They are more catching and as such are extensively used to present
statistical figures and facts in most of the exhibitions, trade or
industrial fairs, public functions, statistical reports, etc. Graphs have
universal applicability.
7. Graphic form of data is also very useful device to suggest the direction
of investigations. Investigations cannot be conducted without any
regard to the desired aim and the graphic form helps in fulfilling that
desired aim by suggesting the direction of investigations.