Ch 2 data processing
Definition
Central tendency is a statistical measure that represents an entire distribution or data set by a single value. It aims to provide an accurate description of all the data in the distribution.
CENTRAL TENDENCY
The arithmetic mean is the number obtained by dividing the sum of the elements of a set by the number of values in the set. So you can use the layman's term "average", or be a little fancier and say "arithmetic mean". Take your pick; they both mean the same.
The arithmetic mean may be either:
A. Simple Arithmetic Mean
B. Weighted Arithmetic Mean
# Arithmetic Mean Formula (Ungrouped Data)
If a data set consists of the values b1, b2, b3, ..., bn, then the arithmetic mean $\bar{B}$ is defined as:
$$\bar{B} = \frac{b_1 + b_2 + b_3 + \dots + b_n}{n}$$
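The definition above can be sketched in a few lines of Python. This is only an illustration: the function name `arithmetic_mean` and the sample values are assumptions, not part of the notes.

```python
def arithmetic_mean(values):
    """Return the sum of the values divided by the number of values."""
    if not values:
        raise ValueError("the arithmetic mean of an empty data set is undefined")
    return sum(values) / len(values)


scores = [4, 8, 15, 16, 23, 42]      # hypothetical data set, n = 6
print(arithmetic_mean(scores))       # 108 / 6 = 18.0
```

Python's standard library offers `statistics.mean` for the same calculation; the hand-written version is shown only because it mirrors the formula term by term.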
# The sum of the squared deviations of the items from the arithmetic mean (A.M.) is a minimum; it is less than the sum of the squared deviations of the items from any other value.
# If each item in the series is replaced by the mean, the sum of these replacements equals the sum of the original items.
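Both properties can be checked numerically. The sketch below uses a small hypothetical data set; the helper name `sum_squared_deviations` is an assumption made for illustration.

```python
def sum_squared_deviations(values, centre):
    """Sum of the squared deviations of the values from a chosen centre."""
    return sum((x - centre) ** 2 for x in values)


data = [2, 4, 4, 5, 10]                    # hypothetical data set
am = sum(data) / len(data)                 # 25 / 5 = 5.0

# Property 1: the sum of squared deviations is smallest around the mean.
print(sum_squared_deviations(data, am))    # 36.0
print(sum_squared_deviations(data, 4))     # 41
print(sum_squared_deviations(data, 6))     # 41

# Property 2: replacing every item by the mean leaves the total unchanged.
replaced = [am] * len(data)
print(sum(replaced), sum(data))            # 25.0 25
```

Trying a few centres is a demonstration, not a proof, but any other value you test will give a larger sum than the mean does.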
4. It is a calculated value and is not based on position in the series.
1. It is affected by extreme items, such as very small and very large values.
3. In some cases, the A.M. does not correspond to any actual item. For example, the average number of patients admitted to a hospital may be 10.7 per day, a figure that no single day can show.
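A short sketch of these two points, using invented daily admission counts chosen so that the mean comes out at 10.7 as in the example above (the data and names are hypothetical):

```python
def arithmetic_mean(values):
    """Sum of the values divided by their count."""
    return sum(values) / len(values)


# Hypothetical admissions over ten days; the total is 107.
admissions = [9, 10, 11, 10, 12, 11, 10, 12, 11, 11]
typical = arithmetic_mean(admissions)
print(typical, typical in admissions)      # 10.7 False

# One extreme day pulls the mean well away from the usual level.
with_extreme = admissions + [80]
print(arithmetic_mean(with_extreme))       # 187 / 11 = 17.0
```

The mean of 10.7 is not a value any day actually recorded, and a single extreme day moves it to 17.0 even though every other day lies between 9 and 12.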
The median is a positional measure: it is the value at the middle position of a data set. It divides the data into two parts, one containing all values greater than or equal to the median and the other containing all values less than or equal to it. In simple words, the median is the middle value when a data set is arranged in order of magnitude. The value of the median remains unchanged if the largest value increases, because it is defined by the position of the values rather than their size.
To find the median, arrange the values in order from lowest to highest. If the list contains an odd number of values, the median is the middle number, with the same number of values below and above it. If the list contains an even number of values, take the middle pair, add them together, and divide by two to find the median.
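The procedure just described can be sketched as follows. The function name `median` and the sample lists are assumptions for illustration.

```python
def median(values):
    """Middle value of the sorted data; mean of the middle pair when the count is even."""
    ordered = sorted(values)               # arrange from lowest to highest
    n = len(ordered)
    if n == 0:
        raise ValueError("the median of an empty data set is undefined")
    mid = n // 2
    if n % 2 == 1:                         # odd count: single middle value
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2   # even count: average the middle pair


print(median([7, 1, 5, 3, 9]))     # sorted: 1 3 5 7 9    -> 5
print(median([7, 1, 5, 3]))        # sorted: 1 3 5 7      -> 4.0
print(median([7, 1, 5, 3, 900]))   # largest value raised -> still 5
```

The last call shows why the median does not move when the largest value grows: only the position of the values matters.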
Characteristics of MEDIAN:
1. "The median is that value of the variable which divides the group into two equal parts, one part comprising all values greater and the other values less than the median." (L.R. Connor)
2. Median is the middle value of the series when items are arranged either in ascending or descending order.
3. It divides the series into two equal parts. One part comprises all values greater than the median and the other part comprises all values smaller than the median.
PROPERTIES OF MEDIAN
1. The sum of deviations of items from median, ignoring the signs, is minimum.
2. Median is a positional average and hence it is not influenced by the extreme values.
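The first property can be checked on a small example. The data set and the helper name `sum_absolute_deviations` below are hypothetical; `statistics.median` is from Python's standard library.

```python
from statistics import median


def sum_absolute_deviations(values, centre):
    """Sum of deviations from a chosen centre, ignoring the signs."""
    return sum(abs(x - centre) for x in values)


data = [1, 3, 5, 7, 900]                     # hypothetical data with one extreme value
med = median(data)                           # 5
avg = sum(data) / len(data)                  # 916 / 5 = 183.2

print(sum_absolute_deviations(data, med))    # 4 + 2 + 0 + 2 + 895 = 903
print(sum_absolute_deviations(data, 4))      # 904
print(sum_absolute_deviations(data, 6))      # 904
print(sum_absolute_deviations(data, avg) > sum_absolute_deviations(data, med))   # True
```

The extreme value 900 drags the mean up to 183.2 but leaves the median at 5, which illustrates the second property as well.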
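Median formula
Formula 1 (Ungrouped Data)
$$\text{Median} = \text{value of the } \left(\frac{N+1}{2}\right)^{\text{th}} \text{ item}$$
N = total number of observations
Formula 2 (Grouped Data)
$$\text{Median} = L + \frac{\frac{N}{2} - CF}{f} \times i$$
L = lower boundary of the median class
N = total sum of frequencies
f = frequency of the median class
CF = cumulative frequency up to the median class (that is, of the classes before it)
i = class interval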
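As a worked sketch of the grouped-data formula, Median = L + ((N/2 - CF) / f) x i, the code below finds the median class and applies the formula. The function name `grouped_median`, the tuple layout, and the frequency table are all assumptions made for illustration; the classes are taken to be sorted and contiguous.

```python
def grouped_median(classes):
    """classes: sorted list of (lower_boundary, upper_boundary, frequency)."""
    total = sum(freq for _, _, freq in classes)        # N
    half = total / 2                                   # N / 2
    cumulative = 0                                     # CF of the classes before the current one
    for lower, upper, freq in classes:
        if freq > 0 and cumulative + freq >= half:     # this is the median class
            width = upper - lower                      # i, the class interval
            return lower + (half - cumulative) / freq * width
        cumulative += freq
    raise ValueError("the frequency table contains no observations")


# Hypothetical frequency table: N = 40, so N/2 = 20.
table = [(0, 10, 5), (10, 20, 8), (20, 30, 12), (30, 40, 10), (40, 50, 5)]
# Cumulative frequencies are 5, 13, 25, 35, 40, so the median class is 20-30
# with L = 20, CF = 13, f = 12, i = 10.
print(round(grouped_median(table), 2))                 # 20 + (7 / 12) * 10 = 25.83
```

Note that CF here counts only the classes before the median class, which is the usual reading of "up to the median class" in this formula.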
MERITS OF MEDIAN:
It is not affected by extreme values, i.e. the largest and smallest values, because it is a positional average and does not depend on magnitude.
(3) RIGIDLY DEFINED
Median is the best measure of central tendency when we deal with qualitative data, where
ranking is preferred instead of measurement or counting.
It can be calculated even if the values of the extremes are not known, but the number of items must be known.
Its value can be determined or represented graphically with the help of ogive curves, which is not possible for the arithmetic mean.
DEMERITS OF MEDIAN:
Since the median is a positional average, the data must first be arranged in ascending or descending order of magnitude, which is time-consuming for a large number of observations.
# It is a positional average and doesn't consider the magnitude of the items.
It does not depend on all the observations, so it cannot be considered a good representative of them.
When there is wide variation in the data, the median may not represent the data well.
It is affected by fluctuations of sampling, and this effect is greater than for the arithmetic mean.
It is a positional average, so further algebraic treatment is not possible. For example, we cannot compute the combined median of two groups of data from their separate medians.
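The last point can be illustrated by contrast with the mean. In the sketch below (all data and the helper name `combined_mean` are hypothetical), the combined mean follows from the group sizes and group means alone, while two pairs of groups with identical sizes and medians give different combined medians.

```python
from statistics import mean, median


def combined_mean(n1, mean1, n2, mean2):
    """Mean of two groups taken together, from their sizes and means only."""
    return (n1 * mean1 + n2 * mean2) / (n1 + n2)


a = [10, 20, 30]
b1 = [40, 50, 60]
b2 = [25, 50, 60]      # same size and same median (50) as b1

# The mean supports this algebraic treatment.
print(combined_mean(len(a), mean(a), len(b1), mean(b1)), mean(a + b1))   # 35.0 35

# The median does not: the same summary figures lead to different answers.
print(median(a), median(b1), median(b2))   # 20 50 50
print(median(a + b1))                      # 35.0
print(median(a + b2))                      # 27.5
```

Because the combined median depends on the individual values and not just on the two group medians, it has to be recomputed from the pooled data.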
Measures of Central Tendency-Mode
The mode is the value that appears most frequently in the data set. On a histogram or a bar chart, the highest bar marks the mode. A data set can have more than one mode if several values share the highest frequency; if no value repeats, the data set has no mode.
Typically, the mode is used with ordinal, categorical, and discrete data. It is also the only measure of central tendency that can be used with categorical data, for instance the most-liked ice-cream flavour. Categorical data has no central value, because its groups cannot be ordered. Ordinal and discrete data, however, can have a mode whose value is not at the centre. In simple words, the mode represents the most frequently occurring value.
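A minimal sketch of finding the mode, covering numeric data, categorical data, several modes, and the case where no value repeats. The function name `modes` and the sample data are assumptions; `collections.Counter` is from Python's standard library.

```python
from collections import Counter


def modes(values):
    """Return every value with the highest frequency, or [] when no value repeats."""
    counts = Counter(values)
    if not counts:
        return []
    top = max(counts.values())
    if top == 1:                           # no value repeats, so there is no mode
        return []
    return [value for value, count in counts.items() if count == top]


print(modes([2, 3, 3, 5, 7, 3, 5]))                            # [3]
print(modes(["vanilla", "chocolate", "vanilla", "mango"]))     # ['vanilla']
print(modes([1, 1, 2, 2, 3]))                                  # [1, 2]
print(modes([1, 2, 3]))                                        # []
```

The ice-cream example works without any ordering of the flavours, which is why the mode is usable with categorical data where the mean and median are not.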
Characteristics of MODE
1. Mode is that value which occurs most frequently in a distribution.
3. It is that value of the variable which has the highest frequency.
Mode formula.
# Everyone understands the concept of majority. The mode is based on this concept, so it is easy to understand.
(2) REPRESENTATIVE VALUE
# We can find the mode even in the case of an open-ended frequency distribution.
# We only need the point of maximum concentration of frequencies; it is not necessary to know all the values.
# For example, in surveys it is used to measure the tastes and preferences of people for a particular brand of a commodity.
DEMERITS OF MODE:
#The value of mode is not based on each and every item of the series as it considers only the
highest concentration of frequencies.
(2) SOMETIMES IT IS INDETERMINATE OR ILL DEFINED
# There are two methods of determining the mode, the Inspection Method and the Grouping Method. The two methods may not give the same value of the mode, so it is not rigidly defined.
Grouping of data is desirable for correct computation, but it is a complex process and involves many calculations.
Since it is not based on all the observations and not rigidly defined, it is not suitable for further
algebraic treatment.