0% found this document useful (0 votes)
6 views

Editing & Coding of Data

The document discusses editing and coding data. It covers understanding when a response is an error and should be edited, how data is represented in files, and coding structured responses using dummy variables. It also discusses technological advances that have simplified the coding process.
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views

Editing & Coding of Data

The document discusses editing and coding data. It covers understanding when a response is an error and should be edited, how data is represented in files, and coding structured responses using dummy variables. It also discusses technological advances that have simplified the coding process.
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 14

Editing & Coding Data

Chapter 19
LEARNING
OUTCOMES

1. Know when a response is really an error and


should be edited
2. Understand the way data are represented in
a data file
3. Understand the coding of structured
responses including a dummy variable
approach
4. Appreciate the ways that technological
advances have simplified the coding process

19-2
©2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
EXHIBIT 19.1 Overview of the Stages of Data
Analysis

19–3
©2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Stages of Data Analysis
• Raw Data
• The unedited responses from a respondent exactly as indicated by that
respondent.
• Nonrespondent Error
• Error that the respondent is not responsible for creating, such as when the
interviewer marks a response incorrectly or due to typing error of the
researcher

19–4
• Missing Responses
Questions that the respondent has advertently/inadvertently missed filling.

• Data Integrity
• The notion that the data file actually contains the information that the
researcher is trying to obtain to adequately address research questions.
Editing
• Editing
• The process of checking the completeness, consistency, and legibility of data and making the
data ready for coding and transfer to storage.
• In-House Editing
• A rigorous editing job performed by a centralized office staff.

19–6
Editing
• Checking for Consistency
• Respondents match defined population
• Check for consistency within the data collection framework
• Taking Action When Response is Obviously in Error
• Change/correct responses only when there are multiple pieces of evidence
for doing so.
• Editing Technology
• Computer routines can check for consistency automatically.

19–7
• How long have you lived at the current address? 30 years

• How old are you? 28 years


Editing for Completeness
• What about missing data?
• Row Deletion
• The entire record for a respondent that has left a response missing is excluded from use
in statistical analysis.

• Item Nonresponse
• The technical term for an unanswered question on an otherwise complete
questionnaire resulting in missing data.
• Missing Value
• Plug Value – Replace missing value with Mean or Median

19–9
Facilitating the Coding Process
• Editing And Tabulating “Don’t Know” Answers
• Legitimate don’t know (no opinion)
• Reluctant don’t know (refusal to answer)
• Confused don’t know (does not understand)

19–10
Coding Qualitative Responses/Nominal &
Ordinal Scale
• Coding
• The process of assigning a numerical score or other character symbol to previously edited
data.
• Codes
• Rules for interpreting, classifying, and recording data in the coding process.
• The actual numerical or other character symbols assigned to raw data.
• Dummy Coding
• Numeric “1” or “0” coding where each number represents an alternate response such as
“female” or “male.”
• Numeric coding of 1,2,3 etc. for Occupational Categories

19–11
Data File Terminology
• Field/Variable
• A collection of characters that represents a single type of data—usually a
variable.

• Record
• A collection of related fields that represents the responses from one sampling
unit.

Note – Once data is collected, make a copy of the file and start editing &
Coding. Original File should always remain intact.

19–12
Variable/Field Field Value

1. Do you use ready to eat X1 Yes = 1


foods? No =2
2. Rank the Brands in order of X2_ITC 1 = Highest Pref
preference (1 for Highest
preference, 3 for Lowest X2_MTR 2
Preference)
ITC X2_Sun 3 = Lowest Pref
MTR
Sunfeast

3. Ready to eat foods are X3_con 1= Strongly Disagree


convenient to use
5= Strongly Agree

4.Ready to eat foods are X4_unh 1 = Strongly Agree


unhealthy because of
preservatives (R) 5 = Strongly Disagree
5. Occupational Category X5 Student = 1
Home Maker = 2
Service = 3
Business = 4
Data Processing Steps
• Before Data Analysis

• In case of Multi-item Construct – remove responses where Standard


Deviation is less than 0.07/0.03

• In order to remove unengaged responses

You might also like