Basic Statistics Terms Defined
Basic Statistics Terms Defined
ASSIGNMENT NO. 2
Harvey M. Pedrosa
ABM 11-25
Decide about the number of classes. Too many classes or too few classes might
not reveal the basic shape of the data set, also it will be difficult to interpret such
frequency distribution. The maximum number of classes may be determined by
formula: {\displaystyle \mathrm {NumberofClasses} =C=1+3.3\mathrm {log}
Calculate the range of the data (Range = Max – Min) by finding minimum and
maximum data value. Range will be used to determine the class interval or class
width.
Generally the class interval or class width is the same for all classes. The classes
all taken together must cover at least the distance from the lowest value
(minimum) in the data set up to the highest (maximum) value. Also note that
equal class intervals are preferred in frequency distribution, while unequal class
interval may be necessary in certain situations to avoid a large number of empty,
or almost empty classes.
Decide the individual class limits and select a suitable starting point of the first
class which is arbitrary, it may be less than or equal to the minimum value.
Usually it is started before the minimum value in such a way that the midpoint
(the average of lower and upper class limits of the first class) is properly placed.
Take an observation and mark a vertical bar (|) for a class it belongs. A running
tally is kept till the last observation.
4.1.Class Limit
4.2.Class Boundaries
4.3.Class Mark
4.4.Class Width
The difference between the upper and lower boundaries of any class. The class
width is also the difference between the lower limits of two consecutive classes or
the upper limits of two consecutive classes. It is not the difference between the
upper and lower limits of the same class.
4.5.Range
The range of a set of data is the difference between the largest and smallest
values.
4.6.Slovin’s Formula
-is used to calculate the sample size (n) given the population size (N) and a
margin of error (e).
- it's a random sampling technique formula to estimate sampling size
-It is computed as n = N / (1+Ne2).
whereas:
n = no. of samples
N = total population
e = error margin / margin of error
5.1.Histograms
A graph which displays the data by using vertical bars of various heights to
represent frequencies. The horizontal axis can be either the class boundaries, the
class marks, or the class limits.
5.2.Frequency Polygon
A line graph. The frequency is placed along the vertical axis and the class
midpoints are placed along the horizontal axis. These points are connected with
lines.
5.3.Ogive
5.4.Pie Chart
Graphical depiction of data as slices of a pie. The frequency determines the size of
the slice. The number of degrees in any slice is the relative frequency times 360
degrees.
5.5.Pareto Chart
A bar graph for qualitative data with the bars arranged according to frequency.
A data plot which uses part of the data value as the stem and the rest of the data
value (the leaf) to form groups or classes. This is very useful for sorting data
quickly.