Chapter Four
Data Preparation and Analysis
Data analysis and interpretation
• Think about analysis EARLY
• Start with a plan
• Code, enter, clean
• Analyze
• Interpret
• Reflect
– What did we learn?
– What conclusions can we draw?
– What are our recommendations?
– What are the limitations of our analysis?
Coding and quantifying
Age (in years):
1 = 1-5 years
2 = 6-10 years
3 = 11-18 years
4 = 19-25 years
5 = >25 years

Educational level:
1 = Below grade 12
2 = Diploma holder
3 = Degree holder
4 = Masters & above

Sex:
1 = Male
2 = Female

Region of country:
1 = West Ethiopia
2 = East Ethiopia
3 = South Ethiopia
4 = North Ethiopia
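As an illustration, such a coding scheme can be applied in software before analysis. A minimal Python sketch, assuming hypothetical raw survey responses (the column names and values below are illustrative):

```python
import pandas as pd

# Hypothetical raw survey responses (illustrative only)
df = pd.DataFrame({
    "sex": ["Male", "Female", "Female", "Male"],
    "region": ["West Ethiopia", "North Ethiopia",
               "South Ethiopia", "East Ethiopia"],
})

# Map each category to its numeric code, following the scheme above
sex_codes = {"Male": 1, "Female": 2}
region_codes = {"West Ethiopia": 1, "East Ethiopia": 2,
                "South Ethiopia": 3, "North Ethiopia": 4}

df["sex_code"] = df["sex"].map(sex_codes)
df["region_code"] = df["region"].map(region_codes)
print(df)
```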
Three types of analysis
• Univariate analysis
– the examination of the distribution of cases on only one
variable at a time (e.g., college graduation)
– Purpose: description
• Bivariate analysis
– the examination of two variables simultaneously (e.g., the
relation between gender and college graduation)
– Purpose: determining the empirical relationship between
the two variables
• Multivariate analysis
– the examination of more than two variables simultaneously
(e.g., the relationship between gender, race, and college
graduation)
– Purpose: determining the empirical relationship among the variables
Univariate Analysis
• Univariate Analysis – The analysis of a single variable, for
purposes of description (examples: frequency distribution,
averages, and measures of dispersion).
It explores each variable in a data set separately.
Frequencies can tell you whether many study participants share a
characteristic of interest (age, gender, etc.)
Graphs and tables can be helpful
Example: Gender >> the number of men/women in a
sample/population
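A minimal sketch of a frequency distribution in Python, assuming a hypothetical sample (the values are illustrative):

```python
import pandas as pd

# Hypothetical sample of a categorical variable
sex = pd.Series(["Male", "Female", "Female", "Male", "Female"])

# Absolute and relative frequencies
print(sex.value_counts())                # counts per category
print(sex.value_counts(normalize=True))  # proportions per category
```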
Univariate Data Analysis (Measures of
Central Tendency)
• Measures of central tendency summarize a distribution with a
single typical or central value
• Commonly used statistics with univariate analysis of
continuous variables:
Mean – an average computed by summing the values of several
observations and dividing by the number of observations.
Mode – an average representing the most frequently observed
value or attribute.
Median – an average representing the value of the “middle”
case in a rank-ordered set of observations.
Range of values – from the minimum value to the maximum value
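These statistics can be computed directly; a minimal Python sketch with hypothetical observations:

```python
import statistics

ages = [23, 25, 25, 31, 40, 25, 37]  # hypothetical observations

print(statistics.mean(ages))    # sum of values / number of observations
print(statistics.median(ages))  # middle value of the rank-ordered data
print(statistics.mode(ages))    # most frequently observed value
print(min(ages), "-", max(ages))  # range of values: minimum to maximum
```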
Measures of Dispersion
• Measures of dispersion reflect the spread of the
distribution
– Range is the difference between largest & smallest scores;
high – low
– Variance is the average of the squared differences
between each observation and the mean
– Standard deviation is the square root of variance
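A minimal Python sketch of these three measures, with hypothetical scores; note that the definition of variance above (the average squared deviation) corresponds to the population variance, so pvariance/pstdev are used:

```python
import statistics

scores = [4, 8, 6, 5, 3, 10]  # hypothetical observations

high_low = max(scores) - min(scores)  # range: largest minus smallest
var = statistics.pvariance(scores)    # average squared deviation from the mean
sd = statistics.pstdev(scores)        # square root of the variance

print(high_low, var, sd)
```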
Distributions
Frequency Distributions : A description of the number of
times the various attributes of a variable are observed in
a sample.
Dispersion – The distribution of values around
some central value, such as an average.
Standard Deviation – A measure of dispersion
around the mean, calculated so that approximately
68 percent of the cases will lie within plus or minus
one standard deviation from the mean, 95 percent
within two, and 99.7 percent within three standard
deviations.
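The 68/95/99.7 pattern can be checked empirically; a minimal sketch using simulated normal data (the parameters are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(loc=50, scale=10, size=100_000)  # hypothetical normal sample

m, s = x.mean(), x.std()
for k in (1, 2, 3):
    share = np.mean(np.abs(x - m) <= k * s)
    print(f"within {k} SD: {share:.3f}")  # approx. 0.683, 0.954, 0.997
```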
Bivariate Analysis
• Bivariate Analysis – The analysis of two
variables simultaneously, for the purpose of
determining the empirical relationship
between them.
Bivariate analysis allows us to:
• Look at associations/relationships between two
variables.
• Look at measures of the strength of the relationship
between two variables.
• Test hypotheses about relationships between two
nominal or ordinal level variables.
Cross-tabulation
We use cross-tabulation when:
• We want to look at relationships between two
or three variables.
• We want a descriptive statistical measure to
tell us whether differences among groups are
large enough to indicate some sort of
relationship among variables.
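A minimal sketch of a cross-tabulation with a chi-square test of independence, assuming hypothetical data on gender and college graduation:

```python
import pandas as pd
from scipy.stats import chi2_contingency

# Hypothetical data: two nominal variables
df = pd.DataFrame({
    "gender":    ["M", "F", "F", "M", "F", "M", "F", "M"],
    "graduated": ["yes", "yes", "no", "no", "yes", "yes", "no", "yes"],
})

# Cross-tabulation of the two variables
table = pd.crosstab(df["gender"], df["graduated"])
print(table)

# Chi-square test: are the group differences large enough to
# indicate a relationship between the variables?
chi2, p, dof, expected = chi2_contingency(table)
print(chi2, p)
```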
Multivariate Analysis
• Multivariate Analysis :The analysis of the
simultaneous relationships among several
variables.
Regression Analysis
Multiple Linear Regression
• Multiple Regression is a statistical method for estimating the
relationship between a dependent variable and two or more
independent (or predictor) variables.
• MLR is a method for studying the relationship between a
dependent variable and two or more independent variables.
• Purposes:
– Prediction
– Explanation
– Theory building
Linear Regression and Correlation
• The relationship between the mean of the response
variable and the level of the explanatory variable is
assumed to be approximately linear (a straight line)
• Model: Y = b0 + b1x + e, where e is random error
– b0: mean response when x = 0 (the y-intercept)
– b1: change in mean response when x increases by 1 unit (the slope)
– b0 + b1x: mean response when the explanatory variable takes on the
value x
– b0 and b1 are unknown population parameters (like μ)
• b1 > 0: positive association
• b1 < 0: negative association
• b1 = 0: no association
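A minimal sketch of fitting this model to hypothetical paired data:

```python
from scipy.stats import linregress

# Hypothetical paired observations
x = [1, 2, 3, 4, 5, 6]
y = [2.1, 3.9, 6.2, 8.1, 9.8, 12.2]

fit = linregress(x, y)
print(fit.intercept)  # b0: estimated mean response when x = 0
print(fit.slope)      # b1: estimated change in mean response per unit of x
print(fit.rvalue)     # correlation; its sign matches the sign of b1
```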
Design Requirements
One dependent variable (criterion)
Two or more independent variables
(predictor or explanatory variables).
Sample size: >= 50 (at least 10 times as
many cases as independent variables)
MLR Model: Basic Assumptions
• Independence: The data of any particular subject are
independent of the data of all other subjects
• Normality: in the population, the data on the dependent
variable are normally distributed for each of the possible
combinations of the level of the X variables; each of the
variables is normally distributed
• Homoscedasticity: In the population, the variances of the
dependent variable for each of the possible combinations of the
levels of the X variables are equal.
• Linearity: In the population, the relation between the
dependent variable and the independent variable is linear
when all the other independent variables are held constant.
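Two of these assumptions can be probed from the residuals of a fitted model. A minimal sketch with simulated data, using a Shapiro-Wilk test for normality and a Breusch-Pagan test for homoscedasticity (one common choice among several diagnostics):

```python
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.diagnostic import het_breuschpagan
from scipy.stats import shapiro

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 2))  # hypothetical predictors
y = 1.0 + 2.0 * X[:, 0] - 0.5 * X[:, 1] + rng.normal(size=100)

X = sm.add_constant(X)
model = sm.OLS(y, X).fit()

# Normality of residuals (Shapiro-Wilk): small p suggests non-normality
print(shapiro(model.resid))

# Homoscedasticity (Breusch-Pagan): small p suggests unequal variances
lm_stat, lm_p, f_stat, f_p = het_breuschpagan(model.resid, X)
print(lm_p)
```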
Simple vs. Multiple Regression
Simple regression:
• One dependent variable Y predicted from one
independent variable X
• One regression coefficient
• r2: proportion of variation in dependent variable Y
predictable from X

Multiple regression:
• One dependent variable Y predicted from a set of
independent variables (X1, X2 ... Xk)
• One regression coefficient for each independent variable
• R2: proportion of variation in dependent variable Y
predictable from the set of independent variables (X’s)
MLR Equation
Y = a + b1X1 + b2X2 ... + bnXn
• Y = the dependent variable, or the variable to be predicted.
• a = the constant (the Y-intercept of raw-score equations),
representing the value of Y when all Xs = 0.
• b = the b weights, or partial regression coefficients; each b
shows the relative contribution of its independent variable
to the dependent variable when controlling for the effects
of the other predictors.
• X = the independent or predictor variables.
MLR Output
• The following notions are essential for the
understanding of MLR output: R2, adjusted R2,
constant, b coefficient, beta, F-test, t-test
• For MLR “R2” (the coefficient of multiple
determination) is used rather than “r” (Pearson’s
correlation coefficient) to assess the strength of this
more complex relationship (as compared to a
bivariate correlation)
Adjusted R square and b coefficient
• The adjusted R2 adjusts for the inflation in R2 caused by the number of
variables in the equation. As the sample size increases above 20 cases per
variable, adjustment is less needed (and vice versa).
• When comparing the R2 of an original set of variables to the R2 after
additional variables have been included, the researcher is able to identify
the unique variation explained by the additional set of variables.
• b coefficient measures the amount of increase or decrease in the
dependent variable for a one-unit difference in the independent variable,
controlling for the other independent variable(s) in the equation.
Various Significance Tests
• Testing R2
– Test R2 through an F test
– Test of competing models (difference between R2)
through an F test of difference of R2s
• Testing b
– Test of each partial regression coefficient (b) by t-tests
– Comparison of partial regression coefficients with each
other: t-test of the difference between standardized
partial regression coefficients (β)
F and t tests
• The F-test is used as a general indicator of the
probability that any of the predictor variables
contribute to the variance in the dependent variable
within the population.
• The null hypothesis is that the predictors’ weights are
all effectively equal to zero, i.e., that none of the
predictors contributes to the variance in the dependent
variable in the population
• t-tests are used to test the significance of each
predictor in the equation.
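All of these quantities can be read off a fitted multiple regression. A minimal sketch with simulated data using statsmodels:

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
X = rng.normal(size=(80, 2))  # two hypothetical predictors
y = 3.0 + 1.5 * X[:, 0] + 0.8 * X[:, 1] + rng.normal(size=80)

X = sm.add_constant(X)
model = sm.OLS(y, X).fit()

print(model.rsquared)      # R2: variation in Y explained by the predictors
print(model.rsquared_adj)  # adjusted R2: corrects for number of predictors
print(model.params)        # constant (a) and b coefficients
print(model.fvalue, model.f_pvalue)  # F-test: do any predictors contribute?
print(model.tvalues, model.pvalues)  # t-tests: significance of each predictor
```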
Running MLR in SPSS: 1) Analyze, 2) Regression, 3) Linear.
[SPSS screenshots: the regression dialog and the output tables.
In the output, interpret the coefficients, the R square, and the
ANOVA (F-test) result.]
Repeated Measures ANOVA
• Between Subjects Design
– ANOVA in which each participant takes part in
only one of the treatment groups (for example,
one of three groups).
• Within Subjects or Repeated Measures
Design
– Participants receive a single treatment and the
outcome of the treatment is measured at
different time points, for example 3 (before
treatment, immediately after, and 6 months
after treatment)
RM ANOVA Vs. Paired T test
• Repeated measures ANOVA is an extension of the paired t-test.
• Like t-tests, repeated measures ANOVA gives us the statistical
tools to determine whether or not change has occurred over
time.
• Repeated measures ANOVA compares the average score at
multiple time periods for a single group of subjects.
• t-tests compare average scores at two different time periods
for a single group of subjects.
• Solving a repeated measures ANOVA requires combining the
data from the multiple time periods into a single time factor for
analysis.
RM ANOVA: Understanding the terms & analysis
interpretation
• The first step in solving repeated measures ANOVA is to
combine the data from the multiple time periods into a single
time factor for analysis.
• The different time periods are analogous to the categories of
the independent variable in a one-way analysis of variance.
• The time factor is then tested to see if the mean for the
dependent variable is different for some categories of the time
factor.
• If the time factor is statistically significant in the ANOVA test,
then Bonferroni pairwise comparisons are computed to
identify specific differences between time periods.
RM ANOVA: Understanding the terms & analysis
interpretation
• When the dependent variable is measured at three time
periods, there are three paired comparisons:
Example:
• time 1 versus time 2 (profitability before vs. immediately after
the promotion)
• time 2 versus time 3 (immediately after the promotion vs. the
follow-up measure)
• time 1 versus time 3 (long-term effect: before the promotion vs.
the follow-up post-promotion measure)
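A minimal sketch of this analysis, assuming hypothetical long-format data in which each subject's outcome is recorded at each of the three time points (all names and values are illustrative):

```python
import pandas as pd
from statsmodels.stats.anova import AnovaRM

# Hypothetical long-format data: one row per subject per time point
df = pd.DataFrame({
    "subject": [1, 1, 1, 2, 2, 2, 3, 3, 3, 4, 4, 4],
    "time":    ["t1", "t2", "t3"] * 4,
    "profit":  [10, 14, 12, 9, 13, 11, 11, 16, 13, 8, 12, 10],
})

# Repeated measures ANOVA: does mean profit differ across time points?
res = AnovaRM(data=df, depvar="profit", subject="subject",
              within=["time"]).fit()
print(res)
# If the time factor is significant, follow up with Bonferroni-adjusted
# pairwise comparisons (e.g., paired t-tests between time points).
```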
Statistical Assumptions of RM ANOVA
• Independence
• Normality
• Homogeneity of within-treatment variances: In one-way ANOVA,
we expect the variances to be equal & the samples are not related
to one another (so no covariance or correlation)
• Sphericity: all variances and covariances are equal to each other
RM ANOVA is ideal for testing hypotheses on treatment
effectiveness when ethical constraints restrict the use of
control subjects.
Correlation Coefficient
• Measures the strength of the linear association
between two variables
• Takes on the same sign as the slope estimate from
the linear regression
• Not affected by linear transformations of y or x
• Does not distinguish between dependent and
independent variable (e.g. height and weight)
• Population parameter: ρ (rho)
• Pearson’s correlation coefficient:
r = Sxy / √(Sxx · Syy), with −1 ≤ r ≤ 1
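A minimal sketch computing Pearson's r for hypothetical paired observations:

```python
from scipy.stats import pearsonr

# Hypothetical paired observations (e.g., height and weight)
x = [160, 165, 170, 175, 180, 185]
y = [55, 58, 63, 70, 72, 80]

r, p = pearsonr(x, y)
print(r, p)  # r lies between -1 and 1; its sign matches the slope
```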
Reading assignment
How to interpret results:
• R2
• β (beta coefficient)
• Significance level (p-value)
• t-test
• F-test
• Mean
• Median
• Standard deviation