0% found this document useful (0 votes)

22 views30 pages

Chapter 6 Processing and Analysis of Data

The document discusses processing and analysis of data in research. It covers topics like measures of central tendency, dispersion, skewness, and relationship. Measures of central tendency discussed include mode, median, and mean. Measures of dispersion covered are range, variance, and standard deviation. The document provides details on calculating and applying these statistical measures to data.

Uploaded by

solomon tadesse

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views30 pages

Chapter 6 Processing and Analysis of Data

Uploaded by

solomon tadesse

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 30

School of Electrical

Engineering and Computing

Department of Electronics

and Communication

Engineering

Engineering Research and

Development Methodology
By

BY
4/25/2021
Demissie Jobir Gelmecha (PhD.)
1

Engineering Research and Development Methodology

Chapter 6: Processing and Analysis of Dat
6.1 Elements/Types of Analysis
6.2 Statistics in Research
6.3 Measures of Central Tendency
6.4 Measures of Dispersion
6.5 Measures of Asymmetry (Skewness)
6.6 Measures of Relationship
6.7 Simple Regression Analysis
6
4/25/2021 2

Nonlinear Chiral Fiber

6.1 Processing Operations
 Before you can interpret your data, you must first organize
and summarize them.
 How you organize your data depends on your research design.
1. Editing: is a process of examining the collected raw data
(specially in surveys) to detect errors and omissions and to
correct these when possible
2. Coding: is the process of assigning numerals or other symbols
to answers so that responses can be put into a limited number
of categories or classes.
3. Classification: Most research studies result in a large volume
of raw data which must be reduced into homogeneous groups
if we are to get meaningful relationships.
4. Tabulation: When a mass of data has been assembled, it
becomes necessary for the researcher to arrange the same in
some kind of concise and logical order.
6.2 Statistics in Research

In many research situations, it is convenient

to summarize your data by applying

descriptive statistics.

 Two categories of descriptive statistics:

I. Measures of center Tendency and

II.Measures of Dispersion.
6.3 Measures of Center Tendency
 It gives you a single score that represents the

general magnitude of scores in a distribution.

 This score characterizes your distribution by

indicating a score value that falls at or near the

middle of the distribution.

 The most common measures of center are the mode,

the median, and the mean (also called the

arithmetic average).

 Each measure of center has strengths and

The Mode
 The mode is simply the most frequent score in a distribution.

 To obtain the mode, count the number of scores falling into

each response category.

 The response category with the highest frequency is the mode.

 The mode of the distribution 1, 2, 4, 6, 4, 3, 4 is 4, because 4 is

the most frequent score.

 No mode exists for a distribution in which all the scores are

different.

 Some distributions, called bimodal distributions, have two

modes.
 Although the mode is simple to calculate, it is limited

because it does not take into account the values of scores

outside of the most frequent score.

 The only information yielded by the mode is the most

frequent score.

 Consequently, two distributions may have similar modes

and yet look very different.

 Looking only at the mode, you might conclude that the two

distributions are similar.

 Obviously, this conclusion is incorrect.

The Median
 The median is the middle score in a distribution.

 To calculate the median, follow these steps:

1. Order the scores in your distribution from lowest to

highest (or highest to lowest, it does not matter).

2. Count down through the distribution and find the score

in the middle of the distribution.

 median of the following distribution: 7, 5, 2, 9, 4, 8, 1 is 5.

 The ordered distribution is 1, 2, 4, 5, 7, 8, 9, and 5 is the

middle score.
 The median takes more information into account
than the mode.
 However, it is still a rather insensitive measure of
center because it does not take into account the
magnitudes of the scores above and below the
median.
 As with the mode, two distributions can have the
same median and yet be very different in character.
 For this reason, the median is used primarily when
the mean is not a good choice.
The Mean
 The mean is the most sensitive measure of center because

it takes into account all scores in a distribution when it is

calculated.

 The computational formula for the mean is the sum of the

scores (ΣX) divided by the number of scores in the

distribution (n).

 The major advantage of the mean is that, unlike the mode

and the median, its value is directly affected by the

magnitude of each score in the distribution.

 Assume that distribution A contains the scores 4, 6, 3, 8, 9,

2, 3, and distribution B contains the scores 4, 6, 3, 8, 9, 2, 43.

Although the two distributions differ by only a single

score (3 versus 43), they differ greatly in their means (5

versus 10.7, respectively).

 The mean of 5 appears to be more representative of the

first distribution than the mean of 10.7 is of the second.

 The median is a better measure of center for the second

distribution. The medians of the two distributions are 4

and 6, respectively.
Choosing a Measure of Center
 One of the first things you should do when summarizing your
data is to generate a frequency distribution of the scores.

 If your scores are normally distributed (or at least nearly

normally distributed), then the mean, median, and mode will fall
at the same point in the middle of the distribution,

 When your scores are normally distributed, use the mean as your
measure of center because it is based on the most information.
 As your distribution deviates from normality, the
mean becomes a less representative measure of
center.

 The two graphs show the relationship between the

three measures of center with a positively skewed
distribution and a negatively skewed distribution.
 In a negatively skewed distribution, the mean underestimates the center.

 Conversely, in a positively skewed distribution, the mean overestimates

the center.

 Because the median is much less affected by skew, it provides a more

representative picture of the distribution’s center than does the mean

and should be preferred whenever your distribution is strongly skewed.

 Neither the mean nor the median will accurately represent the center if

your distribution is bimodal.

 With a bimodal distribution, both measures of center underrepresent one

large cluster of scores and over represent the other.

6.4 Measures of Dispersion
 If you look again at some of the sample distributions described
thus far, you will notice that the scores in the distributions differ
from each other.

 A measure of spread provides information that helps you to

interpret your data.

 Two sets of scores may have highly similar means yet very
different distributions, as the following example illustrates.

 The distributions of the two players’ averages are as follows:

• Player 1: .260, .397, .200, .195

• Player 2: .263, .267, .259, .263

 Each player has a .263 batting average over 4 years

 Which of these two players would you prefer to have
on your team? Most likely, you would pick player 2
because he is more “consistent” than player 1.

 This simple example illustrates an important point

about descriptive statistics.

 When you are evaluating your data, you should take

into account both the center and the spread of the
scores.

 Measures of spread: the range, the variance, and the

standard deviation.
The Range
 is the simplest and least informative measure of spread.

 To calculate the range, you simply subtract the lowest

score from the highest score.

 In the baseball example, the range for player 1 is .202, and

the range for player 2 is .008.

 Compare the following two distributions of scores: 1, 2, 3, 4,

5, 6 and 1, 2, 3, 4, 5, 31. The range for the first distribution is

5, and the range for the second is 30.

 The two ranges are highly discrepant despite the fact that
The Variance
 The variance is the average squared deviation

from the mean.

 The defining formula is

The Standard Deviation

 Although the variance is frequently used as a

measure of spread in certain statistical

calculations, it does have the disadvantage of

being expressed in units different from those of

the summarized data.

 However, the variance can be easily converted

into a measure of spread (s) expressed in the same

unit of measurement as the original scores: To

4.5 Measures of Skewness
 Skewness means lack of symmetry.
 In skewed distribution, the mean and the median are
pulled away from the mode.
 Mean, median and mode are not equal.

 A skewed distribution is an asymmetrical distribution.

 It has a long tail on one side and short tail on the other

side.
 Test of skewness

 To test whether a distribution is skewed or not, the

following are to be noticed. A distribution is skewed if

1. mean, median and mode are not equal.

 Shape can be described by degree of asymmetry (i.e.,

skewness).

◦ mean > median positive or right-skewness

◦ mean = median symmetric or zero-skewness

◦ mean < median negative or left-skewness

 Positive skewness can arise when the mean is increased by

some unusually high values.

 Negative skewness can arise when the mean is decreased by

some unusually low values.

4.6 Measures of Relationship
 In some cases, you may want to evaluate the direction and degree of
relationship (correlation) between the scores in two distributions.
 For this purpose, you must use a measure of association.
 The most widely used measure of association is the Pearson product-
moment correlation coefficient, or Pearson r.
 The Pearson correlation coefficient provides an index of the direction
and magnitude of the relationship between two sets of scores.
 The value of Pearson r can range from +1 through 0 to −1. The sign of the
coefficient tells you the direction of the relationship.
 A positive correlation indicates a direct relationship.
 A negative correlation indicates an inverse relationship.
Cont.
4.7 Simple Regression Analysis
 Simple Regression analysis is a quantitative research
method which is used when the study involves modelling
and analyzing variables, where the relationship includes a
dependent variable and independent variables.
 In simple terms, regression analysis is a quantitative
method used to test the nature of relationships between a
dependent variable and one or more independent
variables.
 The basic form of regression models includes unknown
parameters (β), independent variables (X), and the
dependent variable (Y).
 Regression model, basically, specifies the relation of

dependent variable (Y) to a function combination of

independent variables (X) and unknown parameters (β)

 Y ≈ f (X, β)

 Regression equation can be used to predict the values of

‘y’, if the value of ‘x’ is given, and both ‘y’ and ‘x’ are the

two sets of measures of a sample size of ‘n’. The formulae

for regression equation would be

Simple Regression Example
The following data are diastolic blood
pressure (DBP) measurements taken at
different times after an intervention for n =
5 persons. For each person, the data
available include the time of the
measurement and the DBP level. Of interest
is the relationship between these two
variables.

19 -
26
Time DPB
Patie x 2 y y2 xy
nt x
1 0 0 72 5,184 0
2 5 25 66 4,356 330
3 10 100 70 4,900 700
4 15 225 64 4,096 960
5 20 4,356 1,320
Sum 50 750 338 22,892 3,310
Mean 10 67.6 19 -
27
19 -
28
19 -
29
Thank You

THE CHAMELEON MIRROR by MICHAEL BROWN V1.0
100% (2)
THE CHAMELEON MIRROR by MICHAEL BROWN V1.0
165 pages
All Subjects - Yearly and Termly Scheme of Learning For Primary - Part 1
100% (7)
All Subjects - Yearly and Termly Scheme of Learning For Primary - Part 1
59 pages
Now 1&refreqid Excelsior:&seq 4#pa Ge - Scan - Tab - Contents
No ratings yet
Now 1&refreqid Excelsior:&seq 4#pa Ge - Scan - Tab - Contents
5 pages
Ain Shams Engineering Journal: Laila M. Khodeir, Alaa El Ghandour
100% (1)
Ain Shams Engineering Journal: Laila M. Khodeir, Alaa El Ghandour
9 pages
Measures of Central Tendency and Variability
No ratings yet
Measures of Central Tendency and Variability
72 pages
Introduction To Statistics Lecture 7
No ratings yet
Introduction To Statistics Lecture 7
32 pages
AOL 1 Chapter Chapter 7 Part 1
No ratings yet
AOL 1 Chapter Chapter 7 Part 1
10 pages
Module 8-Students'
No ratings yet
Module 8-Students'
11 pages
Interpreting Test Score: Online Workshop 8602 Aiou
100% (1)
Interpreting Test Score: Online Workshop 8602 Aiou
39 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
36 pages
f592b059 1643454320549
No ratings yet
f592b059 1643454320549
39 pages
Chapter 3: Central Tendency
No ratings yet
Chapter 3: Central Tendency
30 pages
Session 1 ISM May 2024
No ratings yet
Session 1 ISM May 2024
59 pages
Module 3 Descriptive Statistics Final
100% (1)
Module 3 Descriptive Statistics Final
15 pages
ISM Session 1-8+webinar1,2 Merged
No ratings yet
ISM Session 1-8+webinar1,2 Merged
718 pages
3.central Tendency
No ratings yet
3.central Tendency
18 pages
Unit-Ii
No ratings yet
Unit-Ii
174 pages
Week4 - Probability and Cenral Tendency
No ratings yet
Week4 - Probability and Cenral Tendency
59 pages
Interpretation of Assessment Results: Graphical Presentation & Quantitative Analysis
No ratings yet
Interpretation of Assessment Results: Graphical Presentation & Quantitative Analysis
33 pages
Week 3 - Review Topic - Measures of Central Tendency and Dispersion - NEUVLE
No ratings yet
Week 3 - Review Topic - Measures of Central Tendency and Dispersion - NEUVLE
13 pages
ISM - Session 1 - May 2025
No ratings yet
ISM - Session 1 - May 2025
54 pages
Statistics in Assessment of Learning
No ratings yet
Statistics in Assessment of Learning
11 pages
Measures of Central Tendency
0% (1)
Measures of Central Tendency
13 pages
Measures of Central Tendecy
No ratings yet
Measures of Central Tendecy
5 pages
Lecture 2
No ratings yet
Lecture 2
93 pages
Lecture 5 Descriptive Statistics
No ratings yet
Lecture 5 Descriptive Statistics
47 pages
Data Analysis Techniques
No ratings yet
Data Analysis Techniques
12 pages
Bioepi Lesson 6. Descriptive Statistics
No ratings yet
Bioepi Lesson 6. Descriptive Statistics
38 pages
Chapter 7
No ratings yet
Chapter 7
59 pages
PSY123 Lecture 10-1
No ratings yet
PSY123 Lecture 10-1
31 pages
Chapter 3: Central Tendency
No ratings yet
Chapter 3: Central Tendency
26 pages
Chapter 3 DESCRIPTIVE STATISTICS FOR EDA
No ratings yet
Chapter 3 DESCRIPTIVE STATISTICS FOR EDA
51 pages
Measures of Central Tendency Position and Dispersion 1.Pptx 20241015 145631 0000
No ratings yet
Measures of Central Tendency Position and Dispersion 1.Pptx 20241015 145631 0000
44 pages
EDUC 75 Module 7revised Measures of Central Tendency.
No ratings yet
EDUC 75 Module 7revised Measures of Central Tendency.
14 pages
Considerations For Choosing A Measure of Central Tendency
No ratings yet
Considerations For Choosing A Measure of Central Tendency
6 pages
Measures of Central Tendency Position
No ratings yet
Measures of Central Tendency Position
12 pages
Module 3: Measures of Central Tendency
No ratings yet
Module 3: Measures of Central Tendency
2 pages
Module Assessment1 C7.
No ratings yet
Module Assessment1 C7.
15 pages
Slides For IT SKill
No ratings yet
Slides For IT SKill
63 pages
Measure of Central Tendency Variability or Dispersion Group 6
No ratings yet
Measure of Central Tendency Variability or Dispersion Group 6
8 pages
Measure of Central Tendency Dispersion A
No ratings yet
Measure of Central Tendency Dispersion A
8 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
4 pages
Module 3 Descriptive Statistics
No ratings yet
Module 3 Descriptive Statistics
38 pages
Unit 5 Descriptive Statistics Measures of Central Tendency
No ratings yet
Unit 5 Descriptive Statistics Measures of Central Tendency
6 pages
Data Analysis
No ratings yet
Data Analysis
40 pages
Intro To Statistics - Descriptive Statistics and NPC - 20250225 - 171911 - 0000
No ratings yet
Intro To Statistics - Descriptive Statistics and NPC - 20250225 - 171911 - 0000
23 pages
Module 10 Introduction To Data and Statistics
No ratings yet
Module 10 Introduction To Data and Statistics
63 pages
Topic 3 - Data Presentation, Summarization, Measure of Central Tendency&Spread.
No ratings yet
Topic 3 - Data Presentation, Summarization, Measure of Central Tendency&Spread.
48 pages
MMW Finals Reviewer
No ratings yet
MMW Finals Reviewer
9 pages
Week 13 Central Tendency For Ungrouped Data
No ratings yet
Week 13 Central Tendency For Ungrouped Data
27 pages
Engineering Statistics: Measures of Central Tendency
No ratings yet
Engineering Statistics: Measures of Central Tendency
10 pages
Lesson 5:: Measures of Central Tendency
No ratings yet
Lesson 5:: Measures of Central Tendency
4 pages
Descriptive Statistic
No ratings yet
Descriptive Statistic
37 pages
Central Tendency Variation Skewness Individual Performance Relationships
No ratings yet
Central Tendency Variation Skewness Individual Performance Relationships
9 pages
Drawing Conclusions From Statistical Data: Measures of Central Tendency
No ratings yet
Drawing Conclusions From Statistical Data: Measures of Central Tendency
22 pages
Unit 4 & 5 8614
No ratings yet
Unit 4 & 5 8614
58 pages
Presentation 4
No ratings yet
Presentation 4
29 pages
Lecture 02 16092024 023410pm 1 11022025 095913am
No ratings yet
Lecture 02 16092024 023410pm 1 11022025 095913am
16 pages
Mean Median Mode
No ratings yet
Mean Median Mode
56 pages
g5 Assessment in Learning 1
No ratings yet
g5 Assessment in Learning 1
26 pages
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
Overview Of Bayesian Approach To Statistical Methods: Software
From Everand
Overview Of Bayesian Approach To Statistical Methods: Software
Vinaitheerthan Renganathan
No ratings yet
Quantitative Method-Breviary - SPSS: A problem-oriented reference for market researchers
From Everand
Quantitative Method-Breviary - SPSS: A problem-oriented reference for market researchers
Jens K. Perret
No ratings yet
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet
L4-RESEARCH - DESIGN - & - Data - Collection ECEG - 4341
No ratings yet
L4-RESEARCH - DESIGN - & - Data - Collection ECEG - 4341
17 pages
Overview of Monetary Systems
No ratings yet
Overview of Monetary Systems
27 pages
Blockchain and Cryptocurrency - Chapter 1
No ratings yet
Blockchain and Cryptocurrency - Chapter 1
32 pages
ch1 2 3 AI
No ratings yet
ch1 2 3 AI
186 pages
Knowledge Representation 2
No ratings yet
Knowledge Representation 2
42 pages
Lecture 211
No ratings yet
Lecture 211
92 pages
Workbook Dump User Guide
No ratings yet
Workbook Dump User Guide
2 pages
Scatha Press Kit
No ratings yet
Scatha Press Kit
10 pages
14 Water Image
No ratings yet
14 Water Image
2 pages
Annex 1 Child Mapping Tool Deped Order 3 2018
No ratings yet
Annex 1 Child Mapping Tool Deped Order 3 2018
16 pages
Practical 3 - Center of Gravity
No ratings yet
Practical 3 - Center of Gravity
1 page
M2019HRM008 - PLO - Assignment1 &2 Both - Akhil
No ratings yet
M2019HRM008 - PLO - Assignment1 &2 Both - Akhil
4 pages
Hinge Loss For SVM
No ratings yet
Hinge Loss For SVM
9 pages
Student's T Distribution
No ratings yet
Student's T Distribution
6 pages
Geometric Dimensioning and Tolerancing White Paper 2016
100% (1)
Geometric Dimensioning and Tolerancing White Paper 2016
17 pages
Evaluation of The Quality of Concrete by Rebound Hammer Method
No ratings yet
Evaluation of The Quality of Concrete by Rebound Hammer Method
8 pages
Guidelines For Writing A Practicum Report DEEE - 1
No ratings yet
Guidelines For Writing A Practicum Report DEEE - 1
6 pages
Study of Means End Value Chain Model
100% (1)
Study of Means End Value Chain Model
19 pages
Elx DD Nic 5.00.31.01-6 Windows 32-64
No ratings yet
Elx DD Nic 5.00.31.01-6 Windows 32-64
4 pages
Section 270 Soil-Cement Base 270-1 Description. 270-2 Materials
No ratings yet
Section 270 Soil-Cement Base 270-1 Description. 270-2 Materials
7 pages
BS EN 1503-4 2002 Valves Materials For Bodies Bonnets
No ratings yet
BS EN 1503-4 2002 Valves Materials For Bodies Bonnets
10 pages
Notice: National Vaccine Injury Compensation Program: Petitions Received List
No ratings yet
Notice: National Vaccine Injury Compensation Program: Petitions Received List
3 pages
BDSP Poc Guide
No ratings yet
BDSP Poc Guide
31 pages
Contemporary Topics Unit 12 Vocab
No ratings yet
Contemporary Topics Unit 12 Vocab
2 pages
Erasmus+ Faculty Coordinators en
No ratings yet
Erasmus+ Faculty Coordinators en
4 pages
Sample of Study Programmes MC-SC
No ratings yet
Sample of Study Programmes MC-SC
2 pages
VLSI CAD Laboratory: Research Areas
No ratings yet
VLSI CAD Laboratory: Research Areas
3 pages
Arithmetic
No ratings yet
Arithmetic
15 pages
Creative Writing Is Any Form of Writing Which Is Written With The Creativity of Mind: Fiction
No ratings yet
Creative Writing Is Any Form of Writing Which Is Written With The Creativity of Mind: Fiction
17 pages
QDA and PCA On Milk
No ratings yet
QDA and PCA On Milk
9 pages
Primordial Soup Theory
No ratings yet
Primordial Soup Theory
3 pages
Research Format
No ratings yet
Research Format
29 pages