Statistics and Probability - CSE
Statistics and Probability - CSE
STATISTICS
Dr B. Vasu
Department of Mathematics
MNNIT, ALLAHABAD
Prayagraj-211004
THE STATISTICAL METHODS
• Statistical methods are devices by which complex and numerical data are so
systematically treated as to present a comprehensible and intelligible view of
them. In other words, the statistical method is a technique used to obtain,
analyze and present numerical data.
• Variable: A quantity which can vary from one individual to another is called a variable. It
is also called a variate.
Examples: Wages, barometer readings, rainfall records, heights, and weights are the
common examples of variables.
• Quantities which can take any numerical value within a certain range are called Continuous
variables. Examples: the height of a child at various ages is a continuous variable since, as
the child grows from 120 cm to 150 cm, his height assumes all possible values within the
limit.
• Quantities which are incapable of taking all possible values are called Discontinuous or
Discrete variables.
Examples: the number of rooms in a house can take only the integral values such as 2, 3, 4
etc.
• Frequency Distributions: The scores of 50 students in mathematics are arranged below
according to their roll numbers, the maximum scores being 100. 19, 70, 75, 15, 0, 23, 59, 56, 27,
89, 91, 22, 21, 22, 50, 89, 56, 73, 56, 89, 75, 65, 85, 22, 3, 12, 41, 87, 82, 72, 50, 22, 87, 50, 89,
28, 89, 50, 40, 36, 40, 30, 28, 87, 81, 90, 22, 15, 30, 35.
• The data given in the crude form (or raw form) is called ungrouped data.
• If the data is arranged in ascending or descending order of magnitude, it is said to be arranged in
an array.
• Let us now arrange it in the intervals 0–10, 10–20, 20–30, 30–40, 40–50, 50–60, 60–70, 70–80,
80–90, 90–100. This is arranged by a method called the tally method.
• In this we consider every observation and put it in the suitable class by drawing a vertical line.
After every 4 vertical lines, we cross it for the 5th entry and then a little space is left and the next
vertical line is drawn.
This type of
representation is called
a grouped frequency
distribution or simply a
frequency distribution.
• The groups are called the classes and the boundary ends 0, 10, 20, ...... etc. are called class limits.
• In the class limits 10-20, 10 is the lower limit and 20 is the upper limit. The difference between the
upper and lower limits of a class is called its magnitude or class-interval.
• The number of observations falling within a particular class is called its frequency or class frequency.
The frequency of the class 80-90 is 11.
• The variate value which lies mid-way between the upper and lower limits is called mid-value or mid-
point of that class. The mid-points of these are respectively 5, 15, 25, 35, ......
• The cumulative frequency corresponding to a class is the total of all the frequencies up to and
including that class. Thus the cumulative frequency of the class 10-20 is 2 + 4, i.e., 6 the cumulative
frequency of the class 20-30 is 6 + 10, i.e., 16, and so on.
• While preparing the frequency distribution the following points must be remembered:
➢ The class-intervals should be of equal width as far as possible .A comparison of different
distributions is facilitated if the class interval is used for all. The class-interval should be an integer
as far as possible.
➢ The number of classes should never be fewer than 6 and not more than 30. With a smaller
number of classes, the accuracy may be lost, and with a larger number of classes, the computations
become tedious.
➢ The observation corresponding to the common point of two classes should always be put in the
higher class. For example, a number corresponding to the value 30 is to be put up in the class 30-40
and not in 20-30.
GRAPHICAL REPRESENTATION OF A FREQUENCY DISTRIBUTION
FREQUENCY CURVE
❖ If through the vertices of a frequency polygon a smooth freehand curve is drawn,
we get the Frequency Curve. This is done usually when the class-intervals are of
small widths.
CUMULATIVE FREQUENCY CURVE OR THE OGIVE
❖If from a cumulative frequency table, the upper limits of the class taken as x co-ordinates and the
cumulative frequencies as the y co-ordinates and the points are plotted, then these points when joined
by a freehand smooth curve give the Cumulative Frequency Curve or the Ogive.
TYPES OF FREQUENCY CURVES
1. Symmetrical curve or Bell shaped curve
❖ If a curve can be folded symmetrically along a vertical
line, it is called a symmetrical curve.
❖ In this type the class frequencies decrease to zero
symmetrically on either side of a central maximum, i.e., the
observations equidistant from the central maximum have
the same frequency.
4. U-shaped curve.
In this curve, the maximum frequency is at the ends of the range
and a maximum towards the centre.
• 5. A Bimodal curve has two maxima.