Chapter 7
Chapter 7
Introduction
The goal of any research is to provide information out of raw data.
The raw data after collection has to be processed and analyzed in
line with outline (plan) laid down for purpose at time of
developing research plan.
Response on measurement instruments (words, check mark etc.)
conveys little information as such.
The compiled data must be classified, processed, analyzed and
interpreted carefully before their complete meanings and
implications can be understood.
Cont’d
Generally stages in data processing and analysis can be
summarized in chart as follow :
Editing
Coding processing
Data Analysis
Descriptive
And X M Z
Where X is mean, M is median and Z is mode
Cont’d
Skew-ness is, thus measurement of asymmetry and shows
manner in which items are clustered around average.
In symmetric (normal distribution) items show perfect balance
on either side of mode, but in skewed distribution balance is
skewed one side or distorted.
The amount by which balance exceeds on one side measures
skew-ness.
Cont’d
Knowledge about shape of distribution is crucial to use statistical
measure in research analysis, Since most method make specific
assumption about nature of distribution.
Data transformation: It is process of changing original form of
data to form that is more suitable to perform data analysis that
will achieve research objective.
The researcher often modifies value of scalar data or even create
new variable
Index numbers: Most of the time, financial information (price,
value of output, interest rate, and exchange rate) will be
adjusted for possible price changes by using index numbers (like
CPI, PPI).
Cont’d
An index number is a number, which is used to measure level of
given phenomenon at some standard date.
i. Index numbers measures only relative changes.
ii. Different indices serve different purpose
iii. Commodity index serves as measure of changes in phenomenon
on that commodity only
iv. Some index numbers are used to measure cost of living (CPI)
v. In economic sphere they are often termed as economic barometer
Cont’d
Scores of observation are recalibrated so that they may be
related to certain base period or base number.
Most commonly used index number to reduce influence of price
change on our observation is CPI
Researcher also uses index numbers to make comparison
between observations.
When series (data) are expressed in same units, we can use,
averages for purpose of comparison.
But two or more series are expressed in different units;
statistical average cannot be used to compare them.
By converting numbers in to index number we can make
comparison between two or more series.
Inferential Analysis
Most researcher wishes to go beyond simple tabulation of
frequency distribution and calculation of averages and / or
dispersion.
They frequently conduct and seek to determine relationship
between variables and test statistical significance.
When population is consisting of more than one variable it is
possible to measure relationship between them.
If we have data on two variables we said to have bivariate
variable, if data is more than two variables then population is
known as multivariate population.
If for every measure of variable, X, we have corresponding value
of variable, Y, resulting pairs of value are called bivariate
population
Cont’d
In case of bivariate or multivariate population, we often wish to know
relationship between two or more variables from data obtained.
E.g., we may like to know, “Whether number of hours students devote
for study is somehow related to their family income, to age, to sex, or
to similar other factors.
There are several methods of determining relationship between
variables.
Two questions should be answered to determine relationship
between variables.
1. Is there exist association or correlation between two or more
variables? If yes, then up to what degree?
This will be answered by use of correlation technique. Correlation
technique can be different
Cont’d
In case of bivariate population correlation can be found using:-
i. Cross tabulation
ii. Karl Pearson’s coefficient of correlation: It is simple correlation
and commonly used
iii. Charles Spearman’s coefficient of correlation
In case of multivariate population correlation can be studied
through:
i. Coefficient of multiple correlation
ii. Coefficient of partial correlation
Cont’d
2. Is there any cause and effect (causal relationship) between two
variables or between one variable on one side and two or more variables
on other side?
This question can be answered by use of regression analysis.
In regression analysis researcher tries to estimate or predict
average value of one variable on basis of value of other variable.
For instance a researcher estimates average value score on statistics
knowing a student’s score on mathematics examination.
There are different techniques of regression.
In case of bivariate population cause and effect relationship can
be studied through simple regression.
In case of multivariate population: Causal relationship can be
studied through multiple regression analysis.
Cont’d
Time series Analysis; Successive observations of given
phenomenon over period of time are analyzed through time series
analysis.
It measures relationship between variables and time (trend)
Time series will measure seasonal (seasonal fluctuation), cyclical
irregular fluctuation, and Trend.
The analysis of time series is done to understand dynamic
condition of achieving short term and long-term goal of
business firm for forecasting purpose
The past trend can be used to evaluate success or failure of
management or any other policy.
Based on past trend future patterns can be predicted and policy
may accordingly be formulated.
R 7
P T E
H A
F C
D O
EN