0% found this document useful (0 votes)
25 views

Data Processing and Coding

The document discusses various methods of data processing including data editing, coding, classification, tabulation, and exploratory data analysis. It describes editing to detect errors and ensure accuracy, coding variables numerically, classifying data into categories, and using tables and visualizations to summarize and explore patterns in the data. Key steps involve identifying errors, assigning numeric codes to variables, grouping data, and calculating descriptive statistics.

Uploaded by

tanya.p23
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
25 views

Data Processing and Coding

The document discusses various methods of data processing including data editing, coding, classification, tabulation, and exploratory data analysis. It describes editing to detect errors and ensure accuracy, coding variables numerically, classifying data into categories, and using tables and visualizations to summarize and explore patterns in the data. Key steps involve identifying errors, assigning numeric codes to variables, grouping data, and calculating descriptive statistics.

Uploaded by

tanya.p23
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 17

Data Processing

Preparing data for analysis, extracting information and gaining


insight
• Data Editing
• Data Coding
• Data Classification
• Data Tabulation and summarization
• Exploratory Data Analysis
Editing
• Editing detects errors and omissions, corrects them when possible, and certifies the accuracy and
reliability of data.

• Review the filled questionnaire to get maximum accuracy and unambiguity to ensure maximum
reliability of data, in terms of

- LEGIBILITY OF ENTRIES
- COMPLETENESS OF ENTRIES
- CONSISTENCY OF ENTRIES
- ACCURACY OF ENTRIES
- It is done as early as possible
Editing
• We look for irregularities in the raw data obtained
 Typographical error
 Data entry error
 Illegible entries
 Values that impossible or undefined
 Missing responses
 outliers
a. Field Editing: Check and verify collection of data and recording of data
• Preliminary editing is done by the field investigator or supervisor. May be at the end of each day.. review
the questionnaires for inconsistencies , non response, illegible response and correct them at the earliest.
• b. Central Editing
 Second level of editing at the researcher level when questionnaires are received from the field.
 Backtracking : Going back to respondent ( if traceable)
 Allocating missing values ( limited case)_
 Plug Values : average value, neutral value, pattern of other questions is used to extrapolate and
calculate the appropriate response to the missing values
 Discarding the unsatisfactory response.
Classification and Tabulation:

• Classification of data on the basis of attributes:


• Socio classification can be identified on the basis education and occupation
• Score on a particular variable is computed by various combination of the
original data obtained.
Product rating on the basis of :Appearance, durability and writing
performance ( each measured on 1 to 7)
• Collapsing the number of categories
• Classification using class interval.
Exploratory data analysis
• Identify trend, patterns, variation
Summary table of frequencies or percentages of each category
• Marginal table, bivariate tables / contingency tables, pivot tables,
multidimensional tables.
• Data Visualisation: Graphical method: Bar chart, Pie chart, line
graph,
• histogram, frequency curve, scatter plot, box plot etc.
Descriptive measure

• Location of data : Mean, Mode, Median


• Variation : Range, variance, standard deviation,
coefficient of variation, quartiles, inter quartile
range., Z score.
Q. No. Variable Name Coding of answer Variable for Q
1 Buy ready to eat food Yes =1 X1
products No =0
22 Gender Male=1 X22
Female =2
Coding
23 Age < 20 =1 X23
21-25 =2
26-36 =3
36-45 =4
 45 =5

24 Marital status Single=1 X24


Married =2
Divorced/widow =3

25 No of children Exact value X25


26 Monthly Household < 20k=0 X26
income 20k - 35K =1
35k- < 50k=2
Coding

Q. No. Variable Name Coding of answer Variable for Q


1 Buy ready to eat food Yes =1 X1
products No =0
22 Gender Male=1 X22
Female =2
23 Age < 20 =1 X23
21-25 =2
26-36 =3
36-45 =4
 45 =5

24 Marital status Single=1 X24


Married =2
Divorced/widow =3
25 No. of children Exact value X25
Coding
25 No. of children Exact value X25
26
Monthly Household < 20k=0 X26
income 20k - 35K =1
35k- < 50k=2
>= 50k =4
27
Education level X27
Under graduation =1
Graduation=2
Post graduation & higher =3

28
Occupation Student =1 X28
Businessman=2
Professional=3
Service=4, Housewife=5
Others=6
Ranking Questions

Variable Rank Variable name

1 Product A Exact value X10a

2 Product B X10b

3 Product C X10c

4 Product D X10d

5 Product E X10e
• Checklist/Multiple Response

15 Which of these newspaper do you read ?


a The times of India ------------------
b The Hindustan Times -----------------
c The Indian Express -------------------------
d Business standard -----------------------------
e Mint --------------------------
Check List

Name Code Variable

The Times of Yes=1, No=0 X15a


India
The Hindustan X15b
times
The Indian X15c
Express
Business X15 d
Standard
Mint X15e
Scaled Questions
S. Statement Strongly Disagree( Undecided( Agree( Strongly Variable
No. Disagree( 2) 3) 4) agree(5) name
1)

a In my organisation sex X9a


discrimination is non
existent

b Work environment is X9b


negatively charged
c Team work is well X9c
appreciated
d Has sound work X9d
procedures in place.
e Employee’s grievances X9e
are often ignored
f Provides equal X9e
opportunities for
growth
8-15
Total score = 3 +5 +4 +4 + 5 +4 + 4 +5 +4 +4 = 42 out of 50.

Likert Scale : to measure image of the company


No statement Strongly Disagree Nether agree nor Agree Strongly
. disagree disagree agree

1 Company makes quality X


products
2 Is leader in technology X
3 Doesn’t care about general X
public
4 Company leads in R&D to X
improve products

5 Not a good paymaster X


6 Products go through X
stringent quality test

7 Not done anything to curb X


pollution
8 Does not care about X
community service

9 Company stocks are good X


to buy
10 Does not have good labour X
relations
8-16

Item Analysis for selecting items


1. Find correlation of each item with the total scorer.
 Determine total scale for all items
 Examine the correlation between responses to each item and total score
 Eliminate items that have low correlation.
 Select items having high correction.

2. Divide respondents into two groups


 25% respondents with most favorable total score
 25% respondents with least favorable total score
 Compare the average score of each item in these 2 groups and compare using t- test.
8-17

Selection of statements
Resp item item 3 -- --- -- -- 59 60 Total
No. 1 2
score

1 3 4 2 220
2 4 5 1 250
3 5 3 5 270
4 4 5 3 260

100

You might also like