Research Methodology - Introduction
Research Methodology - Introduction
Review of Literature
Formulate Design the
Define research Questionnaire
problem Objectives (Determine the Scale)
Find the Research Gap
Stratified Quota
Cluster Snowball
Probability Sampling
• Simple Random - Each population element has a known
and equal chance of selection.
Simple Random Sample: High cost Only minimal advance Requires sampling frame to work
Each population element has an Most likely used knowledge of population from; Does not use knowledge of
equal chance of being selected needed; easy to analyse population; larger errors for same
into the sample. data and compute error sample size than with stratified
sampling.
Systematic or fixed interval: Moderate cost Simple to draw sample; If sampling interval is related to a
Researcher uses natural ordering Moderately used easy to check periodic ordering of the
or order of sampling frame, population, may introduce
selects an arbitrary staring point, increased variability.
then selects items at a
preselected intervals.
Cluster sampling: Low cost If clusters geographically Larger error for comparable size
Population is divided into Frequently used defined, yields lowest field than other probability samples.
internally heterogeneous cost; requires listing of all
subgroups. Some are randomly clusters but of individuals
selected for further study. only within clusters
Stratified Random sample: High cost Assures representation of Requires accurate information on
Divides population into mutually Moderately used all groups in sample; proportion in each stratum; If
exclusive subpopulations or strata Reduces variability for stratified lists are not already
and uses simple random on each same sample size available, they can be costly to
stratum. prepare.
COMPARISON OF NON – PROBABILITY SAMPLE
Description Cost and Degree Advantages Disadvantages
of use
Coding Estimation of
Uni- parameter Testing hypotheses
dimensional values
analysis
Classification
Point estimate Parametric tests
Bivariate
Analysis
15
SPSS 16.0 DATA EDITOR WINDOW – DATA VIEW
SPSS 16.0 DATA EDITOR WINDOW – VARIABLE VIEW
CHARACTERISTICS OF MEASUREMENT SCALES
18
a parametric statistical test is one that makes assumptions about the parameters (defining properties) of the population distribution(s)
from which one's data are drawn, assume the underlying source population(s) to be normally distributed ,
while a non-parametric test is one that makes no such assumptions.
OVERVIEW OF UNIVARIATE DATA ANALYSIS PROCEDURES
Univariate Procedures
Interval Nominal
What is the
scale level of
the variable?
Ordinal
1. Descriptive
a) Central A) Mean A) Median A) Mode
tendency B) Standard B) Interquartile B) Relative and
absolute frequency
deviation range by category
b) Dispersion
20
BIVARIATE DATA ANALYSIS PROCEDURES
Bivariate Procedures
2. Inferential
* t test on regression
coefficient
Mann-Whitney
* z test on the difference U test Chi-square
between means Kalmogorov - test
* t test on the difference
between means Smirnov test
21
MULTI VARIATE METHODS
INTERDEPENDENCE DEPENDENCE
METHODS METHODS
23
Analysis and Interpretation
SAS
FREQUENCY TABLE
The Chi-Square Test
Chi-square which is available in cross tab is used to test
whether there is a significant association between
two attributes.
• Chi-Square Formula :
Expected score:
HYPOTHESIS:
ASSUMPTION:
• The variables must be normally distributed
• Samples are independent.
• Variances of populations are equal.
• The sample is a simple random sample (SRS).
HYPOTHESIS:
• Null hypothesis H0- There is no significant difference between the
variables.
• Alternate hypothesis H1-There is significant difference between the
variables.
ONE - WAY ANOVA TEST
PROCEDURE:
Based on the result generated by SPSS, the significant value is 0.000 and it is
lower than 0.05 so reject null hypothesis . Hence there is a significance
difference in the saving of respondents with respect to their income level.
Post Hoc: Duncan
Total monthly saving
Duncan
Subset for alpha = 0.05
Total Income N 1 2 3 4
upto 20000 124 5310.48
20001-30000 219 7064.87
30001-40000 103 1.00E4
40001-50000 54 1.12E4
50001 and above 52 1.35E4
Sig. 1.000 1.000 .133 1.000
Means for groups in homogeneous subsets are displayed.
In the mean plot when the saving level is low when income earned by
them is lower while the saving level is high when the income earned
by them is higher.
CORRELATION ANALYSIS
ASSUMPTION:
• The variables must be normally distributed.
• The variables must be linear.
HYPOTHESIS:
• Null hypothesis Ho: there is no significant relationship between the
variables.
• Alternate hypothesis H1: there is a significant relationship between the
variables.
PROCEDURE
⚫ It is a parametric method
ASSUMPTION:
• The variables must be linear.
• The variables must be normally distributed.
HYPOTHESIS:
Null hypothesis Ho: There is no significant relationship between the variables.
ASSUMPTION:
◼ The variables must be linear (Metric) – Ratio or
Interval scale.
◼ The variables must be normally distributed.
53
FACTOR ANALYSIS
◼ Factor analysis is a technique that is used
to reduce a large number of variables into
fewer numbers of factors.
◼ Factor analysis extracts maximum common
variance from all variables and puts them
into a common score.
◼ Interval data are assumed for factor
analysis.
54
Cluster Analysis
56