Unit 4 - Data Preparation
Unit 4 - Data Preparation
• It is the activity that ensures the accuracy of the data and their
conversation from RAW form to reduced and classified forms that
are more appropriate for analysis.
10
EDITING
• The process of checking and adjusting responses in the
completed questionnaires for omissions, legibility, and
consistency and readying them for coding and storage.
• Detects errors and omissions, correct them when possible, and
certifies that minimum data quality standards are achieved.
• It ensures
▫ Completeness
▫ Accuracy and
▫ Uniformity
11
12
TYPES OF EDITING
1. FIELD EDITING
1. Premilinary editing will be done on the same day by a field
supervisor
2. Contact to the respondents immediately in the field if in case of
any query
2. IN HOUSE-CENTRAL EDITING
1. Editing will be done by central office staff
2. Respondents can be contacted later for clarification
13
CODING
• Data coding involves assigning a number to the participants
responses so, they can be entered in database
• In coding, categories are the partitions of a data set of a given
variable.
• E.g) If the variable is gender, the categories are male and
female
14
con
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
1.MULTIPLE REGRESSION
• In multiple regression we form a linear composite of
explanatory variables in such way that it has maximum
correlation with a criterion variable.
36
1.MULTIPLE REGRESSION
• One can predict the level of the dependent
phenomenon through multiple regression
analysis model, given the levels of independent
variables.
37
1.MULTIPLE REGRESSION
• Sometimes the researcher may use step-wise regression
techniques to have a better idea of the independent
contribution of each explanatory variable.
38
39
40
41
FACTOR ANALYSIS
42
FACTOR ANALYSIS
• Factor analysis is the generic name given to a class of
techniques whose purpose is data reduction and
summarization.
43
44
Basic assumptions
• Scaled data (1-5), (1-7)
• Variables and respondents composition 1:4
(preferably 1:10)
• Sample must be homogenous
• Interdependence technique. No distinction
between dependent and independent variable.
45
MULTIDIMENSIONAL SCALING
(MDS)
• Multidimensional scaling is a class of procedures for
representing perceptions and preferences of respondents
spatially by means of a visual display.
46
MULTIDIMENSIONAL SCALING
(MDS)
• Mostly it is used in the field of marketing
▫ To identify the image of the product.
▫ To position of the product in the customers mind.
▫ Market segmentation
▫ To identify the gap
▫ New product development.
Dimensions
• Price
• Safety
• Design
• Comfort
• mileage
47
48
49
CONJOINT ANALYSIS
• Technique developed in early 70s
• It measures how buyers valued components of
a product/service bundle
• Customer choose the product on the
combination of many attributes
50
CONJOINT ANALYSIS
• It is a tool/technique highly utilized by marketers
• Marketer identifies the possible combination of
several attributes and then which combination of
attribute has highest value in customer’s point of
view
• E.g coco cola in 3 different pack. How a cusotmer
prefer his choice. Customer choose the product on the
combination of many attributes
▫ Price and quality
▫ Quality and design
▫ Taste and price
▫ Volume and price
51
52
cluster analysis
• cluster analysis identifies homogeneous
subgroups of study objects or participants
and then studies the data by these subgroups.
• Definition
53
Cluster analysis-Example