Presentation On Data Analysis: Submitted by
Presentation On Data Analysis: Submitted by
ANALYSIS
SUBMITTED BY :
ISHITA (30818210004) KARAN (3081821008)
In short, analysed data reveals insights that tell you where you need to focus
your efforts.
Measures of
Measures of Measures of
Central
Dispersion Asymmetry
Tendency
Measures of Other
Relationship Measures
MEASURES OF CENTRAL TENDENCY
■ Measures of central tendency are also usually
called as the averages.
■ They give us an idea about the concentration
of the values in the central part of the
distribution.
■ The following are the six measures of central
tendency that are in common use: (i)
Arithmetic mean, (ii) Median, (iii) Mode, (iv)
Geometric mean (v) Harmonic mean and (vi)
weighted mean
MEAN
Mean (Average) Mean locate the center of distribution.
Also known as arithmetic mean.
The mean is simply the sum of the values divided by the total
number of items in the set.
MERITS :-
It is easy to understand and easy to calculate
It is based upon all the observations
It is familiar to common man and rigidly defined
TYPES OF MEAN
HARMONIC MEAN - is a type of average that is calculated by
dividing the number of values in a data series by the sum of the
reciprocals of each value in the data series.
GEOMETRIC MEAN – It is defined as the nth root of the product of n
numbers. It is noted that the geometric mean is different from the
arithmetic mean. Because, in arithmetic mean, we add the data values
and then divide it by the total number of values. But in geometric mean,
we add the given data values and then take the root. For example: for a
given set of two numbers such as 3 and 1, the geometric mean is equal
to √(3+1) = √4 = 2.
WEIGHTED MEAN - The weighted mean is a type of mean that is
calculated by multiplying the weight (or probability) associated with a
particular event or outcome with its associated quantitative outcome and
then summing all the products together.
MODE
A mode is defined as the value that has a
higher frequency in a given set of values. It is
the value that appears the most number of
times.
Odd Number of Observations - If the total number of observation given is odd,
then the formula to calculate the median is:
Median = {(n+1)/2}thterm
Even Number of Observations - If the total number of observation is even, then
the median formula is:
Median = [(n/2)th term + {(n/2)+1}th]/2
Where n is the number of observations
MEASURES OF DISPERSION
Dispersion refers to the variations of the
items among themselves / around an
average.
Greater the variation amongst different
items of a series, the more will be the
dispersion.
As per Bowley, “Dispersion is a measure of
the variation of the items”.
METHODS OF MEASURING
DISPERSION
Range
Interquartile Range & Quartile Deviation
Mean Deviation
Standard Deviation
Coefficient of Variation
RANGE
It is the simplest measures of dispersion.
It is defined as the difference between the
largest and smallest values in the series.
R= L–S
Where R = Range
L = Largest Value
S = Smallest Value
Interquartile Range & Quartile
Deviation
Positively Skewed
Negatively Skewed
Measure of Relationship
Correlation and coefficient is commonly used to
measure the relationship. It is mostly used for
prediction. Higher the degree of correlation,
greater the accuracy with which one can predict
a score. Karl Pearson’s coefficient of correlation
is the frequently used measure in case of
statistics of variables, whereas Yule’s coefficient
of association is used in case of statistics of
attributes.
Mainly three types of correlation research have been identified:
1. Positive correlation: A positive relationship between two variables is when an
increase in one variable leads to a rise in the other variable. A decrease in one
variable will see a reduction in the other variable. For example, the amount of
money a person has might positively correlate with the number of cars the person
owns.
2. Negative correlation: A negative correlation is quite literally the opposite of a
positive relationship. If there is an increase in one variable, the second variable
will show a decrease and vice versa. For example, being educated might
negatively correlate with the crime rate when an increase in one variable leads to
a decrease in another and vice versa. If the level of education in a country is
improved, it can lower crime rates. Please note that this doesn't mean that lack of
education leads to crimes. It only means that a lack of education and crime is
believed to have a common reason - poverty.
3. No correlation: In this third type, there is no correlation between the two
variables. A change in one variable may not necessarily see a difference in the
other variable. For example, being a millionaire and happiness is not correlated.
An increase in money doesn't lead to happiness.
Other Measures
Index number and analysis of time series are some of the
other tools of Data Analysis.
ANALYSIS OF INDEX NUMBERS
Index numbers are indicators which reflect the relative
changes in the level of a certain phenomenon in any given
period called the current period with respect to its values
in some other period called the base period selected
primarily for this comparison.