2 Data Preparation

The document discusses various steps involved in preparing marketing data for analysis, including identifying missing values and outliers, transforming variables, weighting data for representativeness, creating dummy variables, and standardizing scales. It also covers selecting an appropriate data analysis strategy based on the known characteristics of the data, the properties of different statistical techniques, and the researcher's background and philosophy.

Uploaded by

Raiyan

Marketing Analytics

Dr. Md. Kashedul Wahab Tuhin, PhD

Associate Professor
Department of Marketing
Jahangirnagar University
Savar, Dhaka

Chapter Outline
1. Data Preparation
   1. Missing value identification
   2. Outlier detection
   3. Variable re-specification
   4. Transpose

Restaurant preference

ID PREFEREN QUALITY QUANTITY VALUE SERVICE INCOME OVERALL RINCOME

1 2 2 3 1 3 6 9.00 4.00
2 6 5 6 5 7 2 23.00 1.00
3 4 4 3 4 5 3 16.00 2.00
4 1 2 1 1 2 5 6.00 4.00
5 7 6 6 5 4 1 21.00 1.00
6 5 4 4 5 4 3 17.00 2.00
7 2 2 3 2 3 5 10.00 4.00
8 3 3 4 2 3 4 12.00 3.00
9 7 6 7 6 5 2 24.00 1.00
10 2 3 2 2 2 5 9.00 4.00
11 2 3 2 1 3 6 9.00 4.00
12 6 6 6 6 7 2 25.00 1.00
13 4 4 3 3 4 3 14.00 2.00
14 1 1 3 1 2 4 7.00 3.00
15 7 7 5 5 4 2 21.00 1.00
16 5 5 4 5 5 3 19.00 2.00
17 2 3 1 2 3 4 9.00 3.00
18 4 4 3 3 3 3 13.00 2.00
19 7 5 5 7 5 5 22.00 4.00
20 3 2 2 3 3 3 10.00 2.00

A Codebook
Column No.  Variable No.  Variable name  Question no.  Coding instruction

1           1             ID                           1 to 20 as coded
2           2             Preference     1             Input the number circled: 1 = weak preference, 7 = strong preference
3           3             Quality        2             Input the number circled: 1 = poor, 7 = excellent
4           4             Quantity       3             Input the number circled: 1 = poor, 7 = excellent
5           5             Value          4             Input the number circled: 1 = poor, 7 = excellent
6           6             Service        5             Input the number circled: 1 = poor, 7 = excellent
                                                       Input the number circled: 1 = less than 20000

Coding Questionnaires
• The respondent code and the record number appear on each record in the data.
• The first record contains the additional codes: project code, interviewer code, date and time codes, and validation code.
• It is a good practice to insert blanks between parts.

Data Transcription
Fig. 14.4
Raw Data -> transcription via CATI/CAPI, keypunching via CRT terminal, mark sense forms, optical scanning, or computerized sensory analysis -> verification (correct keypunching errors) -> storage in computer memory, on disks, or on magnetic tapes -> Transcribed Data
Data Cleaning

Consistency Checks
Data cleaning includes consistency checks and treatment of missing responses.

Consistency checks identify data that are out of range, logically inconsistent, or have extreme values.
• Computer packages like SPSS, SAS, EXCEL and MINITAB can be programmed to identify out-of-range values for each variable and print out the respondent code, variable code, variable name, record number, column number, and out-of-range value.
• Extreme values should be closely examined.
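As a rough illustration of such an out-of-range check, here is a minimal Python sketch. The valid ranges (1 to 7 for the rating items, 1 to 6 for INCOME), the sample records, and the helper name `find_out_of_range` are assumptions made for this example; they are not taken from any of the packages named above.

```python
# Assumed valid ranges per variable (illustrative, not from the slides).
VALID_RANGES = {"PREFERENCE": (1, 7), "QUALITY": (1, 7), "INCOME": (1, 6)}


def find_out_of_range(records, valid_ranges):
    """Return (respondent ID, variable name, value) for each out-of-range value."""
    problems = []
    for rec in records:
        for var, (low, high) in valid_ranges.items():
            value = rec.get(var)
            # Missing values (None) are skipped here; they are handled separately.
            if value is not None and not (low <= value <= high):
                problems.append((rec["ID"], var, value))
    return problems


# Made-up records; respondents 2 and 3 each contain one invalid code.
records = [
    {"ID": 1, "PREFERENCE": 2, "QUALITY": 2, "INCOME": 6},
    {"ID": 2, "PREFERENCE": 9, "QUALITY": 5, "INCOME": 2},
    {"ID": 3, "PREFERENCE": 4, "QUALITY": 0, "INCOME": 3},
]

out_of_range = find_out_of_range(records, VALID_RANGES)
for respondent_id, var, value in out_of_range:
    print(f"respondent {respondent_id}: {var} = {value}")
```

Flagged values are only listed, not corrected; whether to fix, recode, or drop them remains the researcher's decision.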
Data Cleaning

Treatment of Missing Responses

• Missing responses represent values of a variable that are unknown, either because respondents provided ambiguous answers or because their answers were not properly recorded.
• Substitute a Neutral Value – A neutral value, typically the mean response to the variable, is substituted for the missing responses.
• Substitute an Imputed Response – The respondents' pattern of responses to other questions is used to impute or calculate a suitable response to the missing questions.
• In casewise deletion, cases, or respondents, with any missing responses are discarded from the analysis.
• In pairwise deletion, instead of discarding all cases with any missing values, the researcher uses only the cases or respondents with complete responses for each calculation.
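The treatments listed above can be compared on a small example. The following is a minimal sketch in plain Python, assuming made-up responses in which `None` marks a missing answer; the function names are hypothetical and chosen only for illustration.

```python
from statistics import mean

# Made-up responses; None marks a missing response.
responses = [
    {"ID": 1, "QUALITY": 2, "VALUE": 1},
    {"ID": 2, "QUALITY": None, "VALUE": 5},
    {"ID": 3, "QUALITY": 7, "VALUE": None},
    {"ID": 4, "QUALITY": 6, "VALUE": 3},
]
variables = ["QUALITY", "VALUE"]


def substitute_mean(rows, var):
    """Neutral-value substitution: replace missing values with the observed mean."""
    neutral = mean(r[var] for r in rows if r[var] is not None)
    return [neutral if r[var] is None else r[var] for r in rows]


def casewise_delete(rows, variables):
    """Casewise deletion: keep only respondents with no missing responses."""
    return [r for r in rows if all(r[v] is not None for v in variables)]


def pairwise_mean(rows, var):
    """Pairwise deletion for one calculation: use every case observed on var."""
    return mean(r[var] for r in rows if r[var] is not None)


quality_filled = substitute_mean(responses, "QUALITY")
complete_cases = casewise_delete(responses, variables)
quality_mean_casewise = mean(r["QUALITY"] for r in complete_cases)
quality_mean_pairwise = pairwise_mean(responses, "QUALITY")
```

In this toy data the casewise mean of QUALITY uses only respondents 1 and 4, while the pairwise mean also uses respondent 3, so the two treatments give different results from the same data.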
Statistically Adjusting the Data

Weighting

• In weighting, each case or respondent in the database is assigned a weight to reflect its importance relative to other cases or respondents.
• Weighting is most widely used to make the sample data more representative of a target population on specific characteristics.
• Yet another use of weighting is to adjust the sample so that greater importance is attached to respondents with certain characteristics.

Statistically Adjusting the Data

Use of Weighting for Representativeness

Years of Education     Sample Percentage   Population Percentage   Weight

Elementary School
  0 to 7 years                2.49                  4.23            1.70
  8 years                     1.26                  2.19            1.74

High School
  1 to 3 years                6.39                  8.65            1.35
  4 years                    25.39                 29.24            1.15

College
  1 to 3 years               22.33                 29.42            1.32
  4 years                    15.02                 12.01            0.80
  5 to 6 years               14.94                  7.36            0.49
  7 years or more            12.18                  6.90            0.57

Totals                      100.00                100.00
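The weights in the table are consistent with dividing each population percentage by the corresponding sample percentage. The slides do not state this formula explicitly, so the Python sketch below treats it as an assumption and simply checks it against the tabulated figures.

```python
# (education group, sample percentage, population percentage) from the table.
education = [
    ("Elementary School, 0 to 7 years", 2.49, 4.23),
    ("Elementary School, 8 years", 1.26, 2.19),
    ("High School, 1 to 3 years", 6.39, 8.65),
    ("High School, 4 years", 25.39, 29.24),
    ("College, 1 to 3 years", 22.33, 29.42),
    ("College, 4 years", 15.02, 12.01),
    ("College, 5 to 6 years", 14.94, 7.36),
    ("College, 7 years or more", 12.18, 6.90),
]

# Assumed rule: weight = population percentage / sample percentage.
weights = {
    group: round(population_pct / sample_pct, 2)
    for group, sample_pct, population_pct in education
}

for group, weight in weights.items():
    print(f"{group}: {weight:.2f}")
```

Groups that are under-represented in the sample (for example, elementary school) receive weights above 1, and over-represented groups (for example, college, 5 to 6 years) receive weights below 1.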

Statistically Adjusting the Data

Variable Respecification
• Variable respecification involves the transformation of data to create new variables or modify existing variables.
• For example, the researcher may create new variables that are composites of several other variables.
• Dummy variables are used for respecifying categorical variables. The general rule is that to respecify a categorical variable with K categories, K - 1 dummy variables are needed.
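The restaurant preference data shown earlier appear to contain two respecified variables. In every row, OVERALL equals the sum of QUALITY, QUANTITY, VALUE, and SERVICE, and RINCOME looks like a four-category recode of INCOME. Both rules are inferred from the data values rather than stated in the slides, so the Python sketch below should be read as an assumption that happens to reproduce the table.

```python
# Four respondents (IDs 1, 2, 3, 8) copied from the restaurant preference data.
rows = [
    {"ID": 1, "QUALITY": 2, "QUANTITY": 3, "VALUE": 1, "SERVICE": 3, "INCOME": 6},
    {"ID": 2, "QUALITY": 5, "QUANTITY": 6, "VALUE": 5, "SERVICE": 7, "INCOME": 2},
    {"ID": 3, "QUALITY": 4, "QUANTITY": 3, "VALUE": 4, "SERVICE": 5, "INCOME": 3},
    {"ID": 8, "QUALITY": 3, "QUANTITY": 4, "VALUE": 2, "SERVICE": 3, "INCOME": 4},
]

COMPONENTS = ["QUALITY", "QUANTITY", "VALUE", "SERVICE"]

# Inferred recode: six income codes collapsed into four categories.
INCOME_RECODE = {1: 1, 2: 1, 3: 2, 4: 3, 5: 4, 6: 4}

for row in rows:
    # Composite variable: sum of the four rating items.
    row["OVERALL"] = sum(row[name] for name in COMPONENTS)
    # Modified variable: recoded income.
    row["RINCOME"] = INCOME_RECODE[row["INCOME"]]

overall = [row["OVERALL"] for row in rows]
rincome = [row["RINCOME"] for row in rows]
print(overall, rincome)
```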
Statistically Adjusting the Data

Variable Respecification
Table 14.2

Product Usage     Original          Dummy Variable Code
Category          Variable Code     X1    X2    X3

Nonusers          1                 1     0     0
Light users       2                 0     1     0
Medium users      3                 0     0     1
Heavy users       4                 0     0     0

Note that X1 = 1 for nonusers and 0 for all others. Likewise, X2 = 1 for light users and 0 for all others, and X3 = 1 for medium users and 0 for all others. In analyzing the data, X1, X2, and X3 are used to represent all user/nonuser groups.
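The coding in Table 14.2 can be reproduced with a short Python sketch. The function name `dummy_code` is hypothetical; the only rule it applies is the one from the table, where the last category (heavy users) serves as the reference group and is coded all zeros.

```python
CATEGORIES = ["Nonusers", "Light users", "Medium users", "Heavy users"]


def dummy_code(category, categories=CATEGORIES):
    """Code one of K categories as K - 1 dummies; the last category is the reference."""
    if category not in categories:
        raise ValueError(f"unknown category: {category!r}")
    # One dummy per non-reference category: 1 if it matches, else 0.
    return [1 if category == name else 0 for name in categories[:-1]]


for name in CATEGORIES:
    x1, x2, x3 = dummy_code(name)
    print(f"{name:<13} X1={x1} X2={x2} X3={x3}")
```

Only K - 1 = 3 dummies are needed because the fourth group is identified by X1 = X2 = X3 = 0.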
Statistically Adjusting the Data

Scale Transformation and Standardization

Scale transformation involves a manipulation of scale values to ensure comparability with other scales or otherwise make the data suitable for analysis.

A more common transformation procedure is standardization. Standardized scores, $Z_i$, may be obtained as:

$Z_i = (X_i - \bar{X}) / s_X$

where $\bar{X}$ is the mean of X and $s_X$ is its standard deviation.
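A minimal Python sketch of standardization follows, applied to the first five OVERALL scores from the restaurant preference data. It assumes that the standard deviation in the formula is the sample standard deviation (n - 1 in the denominator), which the slides do not specify.

```python
from statistics import mean, stdev


def standardize(values):
    """Return z-scores: (x - mean) / sample standard deviation."""
    x_bar = mean(values)
    s_x = stdev(values)  # sample standard deviation (n - 1 denominator), an assumption
    return [(x - x_bar) / s_x for x in values]


# OVERALL scores of respondents 1 to 5 in the restaurant preference data.
overall = [9, 23, 16, 6, 21]
z_scores = standardize(overall)

for x, z in zip(overall, z_scores):
    print(f"X = {x:>2}  Z = {z:+.3f}")
```

After standardization the scores have mean 0 and standard deviation 1, which is what makes variables measured on different scales comparable.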

Selecting a Data Analysis Strategy

Fig. 14.5
Earlier Steps (1, 2, & 3) of the Marketing Research Process -> Known Characteristics of the Data -> Properties of Statistical Techniques -> Background and Philosophy of the Researcher -> Data Analysis Strategy


A Classification of Univariate Techniques

Fig. 14.6 Univariate Techniques

Metric Data
  One Sample
    * t test
    * Z test
  Two or More Samples
    Independent
      * Two-Group t test
      * Z test
      * One-Way ANOVA
    Related
      * Paired t test

Non-numeric Data
  One Sample
    * Frequency
    * Chi-Square
    * K-S
    * Runs
    * Binomial
  Two or More Samples
    Independent
      * Chi-Square
      * Mann-Whitney
      * Median
      * K-S
      * K-W ANOVA
    Related
      * Sign
      * Wilcoxon
      * McNemar
      * Chi-Square

A Classification of Multivariate Techniques

Fig. 14.7 Multivariate Techniques

Dependence Technique
  One Dependent Variable
    * Cross-Tabulation
    * Analysis of Variance and Covariance
    * Multiple Regression
    * Conjoint Analysis
  More Than One Dependent Variable
    * Multivariate Analysis of Variance and Covariance
    * Canonical Correlation
    * Multiple Discriminant Analysis

Interdependence Technique
  Variable Interdependence
    * Factor Analysis
  Interobject Similarity
    * Cluster Analysis
    * Multidimensional Scaling

Nielsen’s Internet Survey: Does it Carry Any Weight?

The Nielsen Media Research Company, a longtime player in television-related marketing research, has come under fire from the various TV networks for its surveying techniques. Additionally, in another potentially large new revenue business, Internet surveying, Nielsen is encountering serious questions concerning the validity of its survey results. Because of the tremendous impact of electronic commerce on the business world, advertisers need to know how many people are doing business on the Internet in order to decide whether it would be lucrative to place their ads online.
Nielsen performed a survey for CommerceNet, a group of companies that includes Sun Microsystems and American Express, to help determine the number of total users on the Internet.
Nielsen’s research stated that 37 million people over the age of 16 have access to the Internet and 24 million have used the Net in the last three months. Statisticians believe the numbers are flawed because of the weighting used to help match the sample to the population. Weighting must be used to prevent research from being skewed toward one demographic segment.
The Nielsen survey was weighted for gender but not for education, which may have skewed the sample toward educated adults. Nielsen then proceeded to weight the survey by age and income after it had already weighted it for gender. Statisticians also feel that this is incorrect because weighting must occur simultaneously, not in separate calculations. Nielsen does not believe the concerns about its sample are legitimate and feels that it has not erred in weighting the survey. However, because most third parties have not endorsed Nielsen’s methods, the validity of its research remains to be established.
SPSS Windows
• Using the Base module, out-of-range values can be selected using the SELECT IF command. These cases, with the identifying information (subject ID, record number, variable name, and variable value) can then be printed using the LIST or PRINT commands. The Print command will save active cases to an external file. If a formatted list is required, the SUMMARIZE command can be used.
• SPSS Data Entry can facilitate data preparation. You can verify respondents have answered completely by setting rules. These rules can be used on existing datasets to validate and check the data, whether or not the questionnaire used to collect the data was constructed in Data Entry. Data Entry allows you to control and check the entry of data through three types of rules: validation, checking, and skip and fill rules.
• While the missing values can be treated within the context of the Base module, SPSS Missing Values Analysis can assist in diagnosing missing values and replacing missing values with estimates.
• TextSmart by SPSS can help in the coding and analysis of open-ended responses.
