Chapter 7

Data Processing and Analysis

Uploaded by

Haile Girma

Chapter 7: Data Processing and Analysis

Introduction
 The goal of any research is to extract information from raw data.
 After collection, the raw data have to be processed and analyzed in
line with the outline (plan) laid down for that purpose when the
research plan was developed.
 Responses on measurement instruments (words, check marks, etc.)
convey little information as such.
 The compiled data must be classified, processed, analyzed and
interpreted carefully before their complete meanings and
implications can be understood.
Cont’d
 Generally, the stages in data processing and analysis can be
summarized as follows:
Data processing:
i. Editing
ii. Coding
iii. Classification and tabulation (data entry)
Data analysis:
i. Descriptive
ii. Inferential statistics
Analysis may be univariate, bivariate or multivariate.


Cont’d
 Two stages are involved: data processing and data analysis.
 Some authors do not distinguish between processing
and analysis.
 However, we look at the two terms separately and briefly here.
7.1. Data processing
 It implies editing, coding, classification and tabulation of
collected data so that they are amenable to analysis.
Editing: The process of examining the collected raw data to detect
errors and omissions (extreme values) and to correct them when
possible.
 It involves careful scrutiny of completed questionnaires or
schedules.
Cont’d
 It is done to assure that data are:-
i. Accurate
ii. Consistent with other data gathered
iii. Uniformly entered
iv. As complete as possible
v. Well organized to facilitate coding and tabulation
 Editing can be either field editing or central editing
Cont’d
Field editing: Consists of the investigator reviewing the reporting forms
to complete what was written in abbreviated and/or illegible form at the
time of recording the respondents’ responses.
 This sort of editing should be done as soon as possible after the
interview or observation.
Central editing: Takes place at the research office.
 Its objective is to correct errors such as entry in wrong place,
entry recorded in month
Coding: Refers to the process of assigning numerals or other symbols
to answers so that responses can be put into a limited number of
categories or classes.
 Such classes should be appropriate to the research problem under
consideration.
Cont’d
 There must be a class for every data item (the classes must be exhaustive).
 They must be mutually exclusive (a specific answer can be placed in one
and only one cell in a given category set).
 Coding is necessary for efficient analysis; through it, several replies
may be reduced to a small number of classes that contain the critical
information required for analysis.
E.g., a closed-ended question:
1 [ ] Yes
2 [ ] No
Or
Less than 200 [ ] 001
201- 699 [ ] 002
1500 and more [ ] 006
 Coding is needed when the researcher uses a computer to analyze the data;
otherwise it can be avoided.
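The coding step can be illustrated with a short Python sketch. The codebook below reuses the Yes/No codes from the example; the function name and the sample answers are hypothetical, introduced only for illustration.

```python
# Hypothetical codebook: maps each answer category to its numeric code.
YES_NO_CODES = {"Yes": 1, "No": 2}


def code_responses(responses, codebook):
    """Replace each raw answer with its code.

    Raises ValueError for an answer that has no class, because every
    data item must fall into exactly one category.
    """
    coded = []
    for answer in responses:
        if answer not in codebook:
            raise ValueError(f"no code defined for answer: {answer!r}")
        coded.append(codebook[answer])
    return coded


raw_answers = ["Yes", "No", "Yes", "Yes"]  # made-up responses
coded_answers = code_responses(raw_answers, YES_NO_CODES)
print(coded_answers)  # [1, 2, 1, 1]
```

Using a dictionary keeps the classes mutually exclusive by construction: each answer maps to one code only.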
Cont’d
 Classification: Most research studies result in a large volume of raw
data, which must be reduced into homogeneous groups.
 Data classification is the process of arranging data in
groups or classes on the basis of common characteristics.
 Data having common characteristics are placed in one class, and in this
way the entire data set is divided into a number of groups or classes.
 Classification according to attributes: Data are classified on the
basis of common characteristics, which can be either descriptive
(such as literacy, sex, honesty, etc.) or numerical (such as
weight, age, height, income, expenditure, etc.).
Cont’d
 Descriptive characteristics refer to qualitative phenomena,
which cannot be measured quantitatively; only their presence or
absence in an individual item can be noticed.
 Data obtained this way on the basis of certain attributes are known as
statistics of attributes, and their classification is said to be
classification according to attributes.
 Classification according to class interval: Unlike descriptive
characteristics, numerical characteristics refer to quantitative
phenomena, which can be measured in some statistical
unit.
 Data relating to income, production, age, weight, etc. come under
this category.
 Such data are known as statistics of variables and are classified on the
basis of class intervals.
Cont’d
 For example, individuals whose incomes are within 1001-1500 Birr
can form one group, those whose incomes are within 500-1000 Birr
form another group, and so on.
 In this way the entire data set may be divided into a number of groups or
classes, usually called class intervals.
 Each class interval thus has an upper as well as a lower limit, which are
known as class limits.
 The difference between the two class limits is the class magnitude.
 The number of items that fall in a given class is known as the frequency
of that class.
 All the classes, taken together with their respective frequencies and
put in the form of a table, are described as a grouped frequency
distribution or simply a frequency distribution.
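A grouped frequency distribution of this kind can be sketched in Python. The class limits echo the Birr ranges used in the example; the third interval, the income figures and the function names are assumptions made for illustration.

```python
from collections import Counter

# Hypothetical class intervals in Birr: (lower limit, upper limit), both inclusive.
CLASS_INTERVALS = [(500, 1000), (1001, 1500), (1501, 2000)]


def classify(value, intervals):
    """Return the one class interval whose limits contain the value."""
    for lower, upper in intervals:
        if lower <= value <= upper:
            return (lower, upper)
    raise ValueError(f"value {value} falls outside every class interval")


def frequency_distribution(values, intervals):
    """Count how many items fall in each class (the class frequency)."""
    counts = Counter(classify(v, intervals) for v in values)
    return {interval: counts.get(interval, 0) for interval in intervals}


incomes = [650, 1200, 980, 1450, 1700, 1001, 500]  # made-up incomes
distribution = frequency_distribution(incomes, CLASS_INTERVALS)
for (lower, upper), frequency in distribution.items():
    print(f"{lower}-{upper} Birr: {frequency}")
```

Raising an error for an unclassifiable value is a design choice here: it exposes gaps between class limits instead of silently dropping items.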
Cont’d
 Classification according to class intervals usually raises the
following questions:
1. How many classes should there be?
2. What should their class size (magnitude) be?
 The answer is left to the skill and experience of the researcher.
 However, the objective should be to display the data in such a way as to
make them meaningful to the analyst.
 Concerning class size, the classes should preferably be of equal size.
 Multiples of 2, 5 and 10 are generally preferred when determining
class size.
Cont’d
 Some statisticians adopt the following formula (Sturges' rule):

$$I = \frac{R}{1 + 3.3 \log_{10} N}$$

where I = class size,
R = range (i.e., the difference between the values of the largest and
smallest items among the items to be grouped), and
N = number of items to be grouped.
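As a rough sketch, and assuming the formula meant here is Sturges' rule, I = R / (1 + 3.3 log10 N), the class size can be computed as follows. The function name and the sample figures are hypothetical.

```python
import math


def sturges_class_size(value_range, n_items):
    """Class size I = R / (1 + 3.3 * log10(N)), assuming Sturges' rule."""
    if n_items < 1:
        raise ValueError("at least one item is needed")
    return value_range / (1 + 3.3 * math.log10(n_items))


# Made-up case: 100 items whose largest and smallest values differ by 200.
class_size = sturges_class_size(200, 100)
print(round(class_size, 1))  # about 26.3
```

In practice the result would then be rounded to a convenient multiple of 2, 5 or 10, for example 25 in this case.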
Some problems in processing
 Don’t know (DK) responses: During data processing, the researcher
often comes across some responses that are difficult to handle.
 Don’t know (DK) is one example of such responses.
 When the DK response group is small, it is of little significance.
 But when it is relatively big, it becomes a matter of major concern.
 How are DK responses to be dealt with by the researcher?
 Prevention is best!
 The best way is to design better types of questions.
 Good rapport (understanding) between interviewers and respondents will
minimize DK responses.
Cont’d
 But what about DK responses that have already taken place?
 One way to tackle this issue is to estimate the allocation of DK
answers from other data in the questionnaire.
 The other way is to keep DK responses as a separate reply
category if the DK response happens to be legitimate; otherwise we
should let the reader make his own decision.
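The two options can be contrasted in a small Python sketch. Note the simplifying assumption: DK answers are spread in proportion to the known answers to the same question, whereas the text suggests estimating the allocation from other data in the questionnaire. All names and counts are hypothetical.

```python
def allocate_dk_proportionally(counts, dk_label="DK"):
    """Spread DK answers over the other categories in proportion to their size.

    This is only one crude way to estimate the allocation; it assumes DK
    respondents would have answered like everyone else.
    """
    dk_count = counts.get(dk_label, 0)
    known = {k: v for k, v in counts.items() if k != dk_label}
    total_known = sum(known.values())
    if total_known == 0:
        raise ValueError("no substantive answers to base the allocation on")
    return {k: v + dk_count * v / total_known for k, v in known.items()}


# Option 1: keep DK as a separate reply category (made-up counts).
separate = {"Yes": 60, "No": 20, "DK": 20}

# Option 2: allocate the 20 DK answers across Yes and No.
allocated = allocate_dk_proportionally(separate)
print(allocated)  # {'Yes': 75.0, 'No': 25.0}
```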
7.2. Data Analysis
 Data analysis is the further transformation of processed data to
look for patterns and relations among data groups.
 By analysis we mean the computation of certain indices or
measures along with searching for patterns of relationship that
exist among data groups.
 Analysis, particularly in the case of survey or experimental data,
involves estimating the values of unknown parameters of the
population and testing hypotheses for drawing inferences.
 Analysis can be categorized as :-
i. Descriptive Analysis
ii. Inferential (Statistical) Analysis
7.2.1. Descriptive analysis:
 Descriptive analysis is largely the study of the distribution of one
variable.
 For most projects, analysis begins with some form of descriptive
analysis to reduce the data into a summary format.
 Descriptive analysis refers to the transformation of raw data into a
form that will make them easy to understand and interpret.
 Describing responses or observations is typically the first form of
analysis.
 The calculation of averages, frequency distributions and
percentage distributions is the most common form of summarizing
data.
Cont’d
 The most common forms of describing processed data are:
i. Tabulation
ii. Percentage
iii. Measurements of central tendency
iv. Measurements of dispersion
v. Measurement of asymmetry
vi. Data transformation and index number
Cont’d
Tabulation: Refers to the orderly arrangement of data in a table or other
summary format.
 It presents responses or observations on a question-by-question or
item-by-item basis and provides the most basic form of information.
 It tells the researcher how frequently each response occurs.
 This starting point of analysis requires counting the responses or
observations in each category, e.g., in frequency tables.
Need for tabulation:
 It conserves space and reduces explanatory and descriptive
statements to a minimum.
 It facilitates the process of comparison.
 It facilitates the summation of items and the detection of errors and omissions.
 It provides a basis for various statistical computations.
Cont’d
Percentage: Whether data are tabulated by computer or by hand, it is
useful to have percentages and cumulative percentages.
 A table containing percentages and a frequency distribution is easier to
interpret.
 Percentages are useful for comparing trends over time or among
categories.
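A frequency table with percentages and cumulative percentages might be built as in the following Python sketch. The response categories and the function name are hypothetical.

```python
from collections import Counter


def frequency_table(responses):
    """Return rows of (category, frequency, percent, cumulative percent)."""
    counts = Counter(responses)
    total = len(responses)
    rows = []
    cumulative = 0.0
    for category in sorted(counts):
        percent = 100 * counts[category] / total
        cumulative += percent
        rows.append((category, counts[category], percent, cumulative))
    return rows


# Made-up answers to a single question.
responses = ["Agree", "Disagree", "Agree", "Neutral",
             "Agree", "Disagree", "Agree", "Neutral"]
table = frequency_table(responses)
for category, frequency, percent, cumulative in table:
    print(f"{category:<10}{frequency:>4}{percent:>8.1f}{cumulative:>8.1f}")
```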
Cont’d
Measures of central tendency: Describing the central tendency of a
distribution with the mean, median or mode is another basic form of
descriptive analysis.
 These measures are most useful when the purpose is to identify typical
values of a variable or the most common characteristics of a group.
 A measure of central tendency is also known as a statistical average.
Mean, median and mode are the most popular averages.
 The mean (arithmetic mean) is the most common measure of central tendency.
 The mode is less commonly used, but it suits studies such as estimating
the most popular shoe size.
 The median is commonly used in estimating the average of qualitative
phenomena, such as intelligence.
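The three averages can be computed with Python's standard `statistics` module. The income figures below are made up; they include one large value to show how the mean is pulled away from the median and mode.

```python
import statistics

# Hypothetical monthly incomes in Birr, with one unusually large value.
incomes = [800, 900, 1000, 1000, 1200, 1500, 4100]

mean_income = statistics.mean(incomes)      # arithmetic mean
median_income = statistics.median(incomes)  # middle value when sorted
mode_income = statistics.mode(incomes)      # most frequent value

print(mean_income, median_income, mode_income)  # 1500 1000 1000
```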
Cont’d
Measurement of dispersion: A measurement of how the values of items
are scattered around the average.
 An average alone fails to give any idea about the dispersion of the values of
an item or variable around it.
 After identifying the typical value of a variable, the researcher can measure
how the values are scattered around the mean.
 It is a measurement of how far the values of a variable are from the average
value.
 It measures the variation in the values of an item.
 Important measures of dispersion are:
1. Range: Measures the difference between the maximum and minimum
values of the observed variable.
Cont’d
2. Mean deviation: The average of the absolute deviations of the observations
from the mean value.
3. Variance: The mean of the squared deviations from the mean.
 It measures sample variability.
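The three measures can be sketched in Python as below. The original formulas are not reproduced in the text, so the definitions used here are the standard ones and should be read as assumptions: mean deviation as the average absolute deviation from the mean, and variance shown with both the n and the n - 1 divisor. The data are made up.

```python
import statistics


def mean_deviation(values):
    """Average absolute deviation of the observations from their mean."""
    mean = statistics.mean(values)
    return sum(abs(v - mean) for v in values) / len(values)


observations = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data, mean = 5

value_range = max(observations) - min(observations)
md = mean_deviation(observations)
population_variance = statistics.pvariance(observations)  # divides by n
sample_variance = statistics.variance(observations)       # divides by n - 1

print(value_range, md, population_variance)  # 7 1.5 4
```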
Cont’d
 Measurement of asymmetry (skewness):
 When the distribution of items happens to be perfectly symmetrical,
we have a normal curve, and the related distribution is a normal
distribution.
 Such a curve is perfectly bell shaped, in which case
Mean = Median = Mode
 Under this condition skewness is altogether absent.
 If the curve is distorted (whether on the right or the left side), we have an
asymmetric distribution, which indicates that there is skewness.
Cont’d
 If the curve is skewed to the right, we call it positive skewness.
[Figure: positively skewed data]

 Z is the mean, M is the median and X is the mode.

 In such a case Z > M > X
Cont’d
 But when the curve is skewed toward the left, we call it negative
skewness.
[Figure: negatively skewed data]

 In such a case X < M < Z,
where X is the mean, M is the median and Z is the mode.
Cont’d
 Skewness is thus a measurement of asymmetry and shows the
manner in which items are clustered around the average.
 In a symmetric (normal) distribution the items show perfect balance
on either side of the mode, but in a skewed distribution the balance is
shifted to one side or distorted.
 The amount by which the balance exceeds on one side measures the
skewness.
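One way to put a number on skewness is Pearson's second coefficient, 3 × (mean − median) / standard deviation. This measure is not named in the text and is used here only as an illustrative assumption; the data sets are made up.

```python
import statistics


def pearson_skewness(values):
    """Pearson's second skewness coefficient: 3 * (mean - median) / std dev.

    Positive when the mean lies above the median (right skew),
    negative when it lies below (left skew), zero when they coincide.
    """
    spread = statistics.pstdev(values)
    if spread == 0:
        raise ValueError("all values are equal; skewness is undefined")
    return 3 * (statistics.mean(values) - statistics.median(values)) / spread


right_skewed = [800, 900, 1000, 1000, 1200, 1500, 4100]  # long right tail
symmetric = [1, 2, 3, 4, 5]
left_skewed = [-v for v in right_skewed]                 # mirror image

print(pearson_skewness(right_skewed) > 0)  # True
print(pearson_skewness(symmetric) == 0)    # True
print(pearson_skewness(left_skewed) < 0)   # True
```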
Cont’d
 Knowledge about the shape of the distribution is crucial to the use of statistical
measures in research analysis, since most methods make specific
assumptions about the nature of the distribution.
 Data transformation: The process of changing the original form of
data to a form that is more suitable for the data analysis that
will achieve the research objective.
 The researcher often modifies the values of scalar data or even creates
new variables.
 Index numbers: Most of the time, financial information (price,
value of output, interest rate, and exchange rate) will be
adjusted for possible price changes by using index numbers (like
CPI, PPI).
Cont’d
 An index number is a number used to measure the level of a
given phenomenon relative to its level at some standard (base) date.
i. Index numbers measure only relative changes.
ii. Different indices serve different purposes.
iii. A commodity index serves as a measure of changes in the phenomenon
for that commodity only.
iv. Some index numbers are used to measure the cost of living (CPI).
v. In the economic sphere they are often termed economic barometers.
Cont’d
 Scores or observations are recalibrated so that they may be
related to a certain base period or base number.
 The index number most commonly used to reduce the influence of price
changes on our observations is the CPI.
 Researchers also use index numbers to make comparisons
between observations.
 When series (data) are expressed in the same units, we can use
averages for the purpose of comparison.
 But when two or more series are expressed in different units,
statistical averages cannot be used to compare them.
 By converting the numbers into index numbers we can make
comparisons between two or more series.
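Both uses of index numbers can be sketched in Python: recalibrating a series to a base period, and adjusting nominal values for price changes with a price index. The figures and function names are hypothetical, and the index is assumed to equal 100 in the base period.

```python
def to_index(series, base_position=0):
    """Express each value relative to the base-period value (base = 100)."""
    base = series[base_position]
    if base == 0:
        raise ValueError("the base-period value must be non-zero")
    return [100 * value / base for value in series]


def deflate(nominal_values, price_index):
    """Convert nominal values to base-period prices using a price index."""
    if len(nominal_values) != len(price_index):
        raise ValueError("both series must cover the same periods")
    return [100 * n / p for n, p in zip(nominal_values, price_index)]


prices = [50, 55, 60]            # made-up prices over three periods
price_relatives = to_index(prices)
print(price_relatives)           # [100.0, 110.0, 120.0]

nominal_income = [1000, 1210]    # made-up nominal incomes
cpi = [100, 110]                 # made-up CPI values
real_income = deflate(nominal_income, cpi)
print(real_income)               # [1000.0, 1100.0]
```

Because an index is unit-free, two series measured in different units can be compared once both are converted with `to_index`.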
7.2.2. Inferential analysis:
 Most researchers wish to go beyond simple tabulation of
frequency distributions and calculation of averages and/or
dispersion.
 They frequently seek to determine relationships
between variables and to test statistical significance.
 When a population consists of more than one variable, it is
possible to measure the relationship between them.
 If we have data on two variables, we are said to have a bivariate
population; if the data cover more than two variables, the population is
known as a multivariate population.
 If for every measure of a variable X we have a corresponding value
of a variable Y, the resulting pairs of values are called a bivariate
population.
Cont’d
 In the case of a bivariate or multivariate population, we often wish to know the
relationship between two or more variables from the data obtained.
 E.g., we may like to know whether the number of hours students devote
to study is somehow related to their family income, to age, to sex, or
to similar other factors.
 There are several methods of determining the relationship between
variables.
 Two questions should be answered to determine the relationship
between variables.
1. Does there exist an association or correlation between the two or more
variables? If yes, to what degree?
 This is answered by the use of correlation techniques, of which there are
several.
Cont’d
 In the case of a bivariate population, correlation can be found using:
i. Cross tabulation
ii. Karl Pearson’s coefficient of correlation: the simple correlation
coefficient, which is commonly used
iii. Charles Spearman’s coefficient of correlation
 In case of multivariate population correlation can be studied
through:
i. Coefficient of multiple correlation
ii. Coefficient of partial correlation
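The two bivariate coefficients can be sketched in plain Python. Spearman's coefficient is computed here as Pearson's coefficient applied to ranks, with tied values given their average rank; the function names and the data are hypothetical.

```python
import math


def pearson_r(x, y):
    """Karl Pearson's coefficient of correlation between two paired series."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two paired series with at least two values")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return sxy / math.sqrt(sxx * syy)


def average_ranks(values):
    """Rank values from 1 upward; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Charles Spearman's coefficient: Pearson's r computed on the ranks."""
    return pearson_r(average_ranks(x), average_ranks(y))


study_hours = [1, 2, 3, 4, 5]    # made-up data
test_scores = [2, 4, 5, 4, 5]
r = pearson_r(study_hours, test_scores)
rho = spearman_rho([1, 2, 3, 4, 5], [1, 4, 9, 16, 25])  # monotonic, not linear
print(round(r, 3), rho)
```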
Cont’d
2. Is there any cause-and-effect (causal) relationship between two
variables, or between one variable on one side and two or more variables
on the other side?
 This question can be answered by the use of regression analysis.
 In regression analysis the researcher tries to estimate or predict the
average value of one variable on the basis of the value of another variable.
 For instance, a researcher estimates the average score on statistics
knowing a student’s score on a mathematics examination.
 There are different techniques of regression.
 In the case of a bivariate population, the cause-and-effect relationship can
be studied through simple regression.
 In the case of a multivariate population, the causal relationship can be
studied through multiple regression analysis.
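Simple regression can be sketched with an ordinary least-squares fit in plain Python, following the mathematics and statistics scores example. The scores and function names are made up. The fitted line describes the association in the data; whether it reflects cause and effect depends on the research design.

```python
def fit_simple_regression(x, y):
    """Least-squares fit of y = intercept + slope * x."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two paired series with at least two values")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    if sxx == 0:
        raise ValueError("x must vary to estimate a slope")
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def predict(intercept, slope, x_value):
    """Estimated average value of y for a given x."""
    return intercept + slope * x_value


math_scores = [50, 60, 70, 80]    # hypothetical mathematics scores
stats_scores = [55, 65, 75, 85]   # hypothetical statistics scores
intercept, slope = fit_simple_regression(math_scores, stats_scores)
predicted = predict(intercept, slope, 90)
print(intercept, slope, predicted)  # 5.0 1.0 95.0
```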
Cont’d
 Time series analysis: Successive observations of a given
phenomenon over a period of time are analyzed through time series
analysis.
 It measures the relationship between variables and time (trend).
 Time series analysis measures seasonal, cyclical and
irregular fluctuations, and the trend.
 The analysis of time series is done to understand the dynamic
conditions for achieving the short-term and long-term goals of a
business firm, for forecasting purposes.
 The past trend can be used to evaluate the success or failure of
management or any other policy.
 Based on the past trend, future patterns can be predicted and policy
may be formulated accordingly.
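A moving average is one simple way to bring out the trend in a time series by smoothing short-term fluctuations. It is used here as an illustrative technique, not one named in the text; the sales figures and the function name are hypothetical.

```python
def moving_average(series, window):
    """Average of each run of `window` consecutive observations."""
    if window < 1 or window > len(series):
        raise ValueError("window must be between 1 and the series length")
    return [
        sum(series[i:i + window]) / window
        for i in range(len(series) - window + 1)
    ]


monthly_sales = [10, 12, 14, 13, 15, 17]  # made-up observations over time
trend = moving_average(monthly_sales, 3)
print(trend)  # [12.0, 13.0, 14.0, 15.0]
```

The steadily rising averages suggest an upward trend even though the raw series dips in the fourth period.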
End of Chapter 7
