BRM-Lecture 4-2023
Dr Teidorlang Lyngdoh
Associate Professor-Marketing
Session- 4
Today’s Outline
• In class exercise
• Correlations & Regression
What is a Correlation?
• It is a way of measuring the extent to which two variables are
related.
• It measures the pattern of responses across variables.
• The correlation coefficient computed from the sample data measures the strength and direction of a relationship between two variables.
• Sample correlation coefficient is denoted by r.
• Population correlation, often denoted as ρ
• Correlation does NOT necessarily imply causation
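A rough illustration (outside the SPSS workflow used in class): a minimal Python sketch of computing the sample correlation coefficient r; the data below are invented.

# Compute Pearson's r for two invented variables.
import numpy as np
from scipy import stats

hours_revised = np.array([2, 5, 8, 11, 15, 18, 22, 25])
exam_score = np.array([40, 45, 55, 52, 60, 68, 75, 80])

# pearsonr returns the sample correlation r and a p-value for H0: rho = 0
r, p_value = stats.pearsonr(hours_revised, exam_score)
print(f"r = {r:.3f}, p = {p_value:.4f}")  # r near +1 means a strong positive relationship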
Correlation & Causation
• Causation means a cause-and-effect relation.
• Correlation denotes interdependency among variables: to correlate two phenomena, it is essential that they are related, but the relationship need not be one of cause and effect.
• If two variables vary such that changes in one (the cause) are accompanied by changes in the other (the effect), with all other factors that could move the 'effect' held constant, then the two variables are said to have a cause-and-effect relationship.
• In other words, causation always implies correlation, but correlation does not always imply causation.
Very Small Relationship
[Scatter plot: Appreciation of Dimmu Borgir (y-axis) vs. Age (x-axis), showing a very small relationship]
Positive Relationship
[Scatter plot against Age (x-axis), showing a positive relationship]
Range of Values for the Correlation Coefficient
Variance vs Covariance
• Variance measures the spread or dispersion of a single random variable. It quantifies how much individual data points in a dataset deviate from the mean or expected value.
• Variance is useful for understanding the "spread" or variability in a dataset and is often used to calculate the standard deviation, which is the square root of the variance.
• Covariance measures the degree to which two random variables change together. It quantifies the relationship between two variables.
• If the variables tend to increase or decrease together, the covariance is positive; if one increases while the other decreases, the covariance is negative; if they're unrelated, the covariance is close to zero.
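A minimal Python sketch contrasting the two measures on invented numbers.

import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.0, 5.0, 4.0, 6.0])

var_x = np.var(x, ddof=1)             # sample variance: spread of x around its mean
cov_xy = np.cov(x, y, ddof=1)[0, 1]   # sample covariance: how x and y move together

print(var_x)    # always non-negative
print(cov_xy)   # positive here, because x and y tend to increase together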
Pearson’s R
• Covariance does not really tell us much about the strength of
association (Solution: standardise this measure)
• Correlation is a standardized measure of the linear relationship
between two variables, derived from covariance. It always falls within
the range of -1 (perfect negative correlation) to 1 (perfect positive
correlation).
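A small Python sketch of that standardisation, with invented data: dividing the covariance by the product of the two standard deviations gives r.

import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.0, 5.0, 4.0, 6.0])

cov_xy = np.cov(x, y, ddof=1)[0, 1]
r = cov_xy / (np.std(x, ddof=1) * np.std(y, ddof=1))  # standardised covariance

print(r)                        # always between -1 and +1
print(np.corrcoef(x, y)[0, 1])  # same value from NumPy's built-in correlation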
Correlation: Example
• Anxiety and Exam Performance
• Participants:
• 103 students
• Measures
• Time spent revising (hours)
• Exam performance (%)
• Exam Anxiety (the EAQ, score out of 100)
• Gender
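For self-study outside SPSS, a hypothetical Python sketch of inspecting such correlations; the column names mirror the lecture's measures, but the values are invented stand-ins, not the class dataset.

import pandas as pd

data = pd.DataFrame({
    "revise": [10, 20, 5, 35, 15, 40],    # hours spent revising (invented)
    "exam": [45, 60, 30, 80, 50, 85],     # exam performance, % (invented)
    "anxiety": [90, 70, 95, 40, 80, 35],  # EAQ score out of 100 (invented)
})

print(data.corr(method="pearson"))  # pairwise Pearson correlation matrix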
Correlations in SPSS
Correlations Output
Correlations in SPSS
• Check out the correlations between the different variables in the
Benetton data set
Multiple Regression
Multiple Regression
• Independent Variables (IVs): Career Limitations & Experience
• Dependent Variable (DV): Days until employed
Forecasting is like trying to drive a car
blindfolded and following directions given
by a person who is looking out the back
window.
Examples
• Insurance companies heavily rely on regression analysis to estimate
the credit standing of policyholders and a possible number of claims
in a given time period
• A retail store manager may believe that extending shopping hours will greatly increase sales. Regression analysis, however, may indicate that the increase in revenue might not be sufficient to support the rise in operating expenses due to longer working hours (such as additional employee labor charges)
• Analysis of data from point-of-sale systems and purchase accounts may highlight market patterns, such as increased demand on certain days of the week or at certain times of the year
Regression
• Simple regression: Y = b0 + b1X1
• Multiple regression: Y = b0 + b1X1 + b2X2 + … + bnXn
Multiple Regression as an Equation
• With multiple regression the relationship is described using a
variation of the equation of a straight line.
Y = b0 + b1X1 + b2X2 + … + bnXn + εi
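A minimal Python sketch of fitting such an equation; the variable names echo the lecture's example (career limitations, experience, days until employed), but the data are invented for illustration.

import pandas as pd
import statsmodels.formula.api as smf

df = pd.DataFrame({
    "days_unemployed": [30, 55, 20, 70, 40, 65, 25, 50],
    "career_limitations": [2, 5, 1, 7, 3, 6, 2, 4],
    "experience_years": [8, 3, 10, 1, 6, 2, 9, 4],
})

# Fit Y = b0 + b1*X1 + b2*X2 + error by ordinary least squares
model = smf.ols("days_unemployed ~ career_limitations + experience_years",
                data=df).fit()
print(model.params)     # b0 (intercept), b1, b2
print(model.summary())  # coefficients, p-values, R-squared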
Methods of Regression
• Hierarchical:
• Experimenter decides the order in which variables are entered
into the model.
• Forced Entry:
• All predictors are entered simultaneously.
• Stepwise:
• Predictors are selected using their semi-partial correlation with
the outcome.
Beta Values
• b0 is the intercept.
• If the p-value is less than 0.05 (your significance level), the result is "statistically significant": there is strong evidence to reject the null hypothesis and support the alternative hypothesis. In simple terms, something interesting is indeed happening.
• 1) Independent variables: Career Limitations (CL) & Experience.
• 2) Significance levels of the IVs: the p-values for CL and Experience are less than .05, so both IVs make statistically significant contributions.
• 3) Unstandardized coefficients of the IVs:
• CL = 2.658: as the CL index increases by 1 unit, we see a 2.658-unit change in the DV (days until employed).
• Experience = -4.044: as experience increases by 1 year, the number of days until employed (DV) decreases by about 4; more experience means fewer days of unemployment.
• Standardized (beta) coefficients: for every 1 standard deviation increase in CL, the DV increases by 2.33 standard deviations; as experience increases by 1 standard deviation, the DV decreases by 4.36 standard deviations.
• 95% confidence interval: there is a 95% chance that the actual value of the unstandardized coefficient lies between .671 and 4.644.
(A Python sketch of extracting these quantities follows below.)
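A Python sketch of pulling out the quantities annotated above (unstandardised b coefficients, p-values, 95% confidence intervals, and standardised betas); the data and names are invented, not the class output.

import pandas as pd
import statsmodels.formula.api as smf

df = pd.DataFrame({
    "days_unemployed": [30, 55, 20, 70, 40, 65, 25, 50],
    "career_limitations": [2, 5, 1, 7, 3, 6, 2, 4],
    "experience_years": [8, 3, 10, 1, 6, 2, 9, 4],
})
formula = "days_unemployed ~ career_limitations + experience_years"

model = smf.ols(formula, data=df).fit()
print(model.params)          # unstandardised b: unit change in the DV per unit of the IV
print(model.pvalues)         # significance of each predictor
print(model.conf_int(0.05))  # 95% confidence interval for each b

z = (df - df.mean()) / df.std()               # z-score every variable
print(smf.ols(formula, data=z).fit().params)  # standardised betas: SD change in the DV per SD of the IV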
Generalization
• When we run regression, we hope to be able to
generalize the sample model to the entire population.
• To do this, several assumptions must be met.
• Violating these assumptions stops us generalizing
conclusions to our target population.
Straightforward Assumptions
• Variable Type:
• Outcome must be continuous
• Predictors can be continuous or dichotomous.
• Non-Zero Variance:
• Predictors must not have zero variance.
• Linearity:
• The relationship we model is, in reality, linear.
• Independence:
• All values of the outcome should come from a different person.
The More Tricky Assumptions
• No Multicollinearity:
• Predictors must not be highly correlated.
• Homoscedasticity:
• For each value of the predictors the variance of the error term should be
constant.
• Independent Errors:
• For any pair of observations, the error terms should be uncorrelated.
• Normally-distributed Errors
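One common check of the multicollinearity assumption is the variance inflation factor (VIF); a sketch with invented predictors follows (a VIF well above roughly 10 is often read as problematic).

import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

predictors = pd.DataFrame({
    "career_limitations": [2, 5, 1, 7, 3, 6, 2, 4],
    "experience_years": [8, 3, 10, 1, 6, 2, 9, 4],
})
X = sm.add_constant(predictors)  # add the intercept column

for i, name in enumerate(X.columns):
    # VIF for the constant term is usually ignored
    print(name, variance_inflation_factor(X.values, i))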
More Explanations of Regression - Self Learning
Hierarchical Regression
• Known predictors (based on past research) are entered into the regression model first.
• New predictors are then entered in a separate step/block.
• The experimenter makes the decisions.
• You can see the unique predictive influence of a new variable on the outcome because known predictors are held constant in the model (see the sketch after this comparison).
Forced Entry Regression
• All variables are entered into the model simultaneously.
• The results obtained depend on the variables entered into the model.
• It is important, therefore, to have good theoretical reasons for including a particular variable.
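A sketch of hierarchical (blockwise) entry with invented data: the known predictor is fitted first, the new predictor is added in a second block, and the change in R-squared is read as its unique contribution.

import pandas as pd
import statsmodels.formula.api as smf

df = pd.DataFrame({
    "days_unemployed": [30, 55, 20, 70, 40, 65, 25, 50],
    "career_limitations": [2, 5, 1, 7, 3, 6, 2, 4],
    "experience_years": [8, 3, 10, 1, 6, 2, 9, 4],
})

block1 = smf.ols("days_unemployed ~ experience_years", data=df).fit()
block2 = smf.ols("days_unemployed ~ experience_years + career_limitations",
                 data=df).fit()

print(block1.rsquared, block2.rsquared)
print(block2.rsquared - block1.rsquared)  # unique variance explained by the added predictor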
Stepwise Regression I
• The Durbin-Watson test reports a test statistic with a value from 0 to 4, where:
• 2 means no autocorrelation;
• 0 to <2 indicates positive autocorrelation (common in time-series data);
• >2 to 4 indicates negative autocorrelation (less common in time-series data).
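A small sketch of the Durbin-Watson check on a fitted model's residuals (invented data); a value near 2 suggests the errors are uncorrelated.

import pandas as pd
import statsmodels.formula.api as smf
from statsmodels.stats.stattools import durbin_watson

df = pd.DataFrame({
    "days_unemployed": [30, 55, 20, 70, 40, 65, 25, 50],
    "career_limitations": [2, 5, 1, 7, 3, 6, 2, 4],
    "experience_years": [8, 3, 10, 1, 6, 2, 9, 4],
})
model = smf.ols("days_unemployed ~ career_limitations + experience_years",
                data=df).fit()

print(durbin_watson(model.resid))  # ~2 means little autocorrelation in the errors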
Checking Assumptions about Errors
• Homoscedasticity/Independence of Errors:
• Plot ZRESID against ZPRED.
• Normality of Errors:
• Normal probability plot.
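A sketch of rough Python equivalents of these two checks (invented data; this only loosely mirrors SPSS's ZRESID/ZPRED output).

import pandas as pd
import matplotlib.pyplot as plt
import scipy.stats as stats
import statsmodels.formula.api as smf

df = pd.DataFrame({
    "days_unemployed": [30, 55, 20, 70, 40, 65, 25, 50],
    "career_limitations": [2, 5, 1, 7, 3, 6, 2, 4],
    "experience_years": [8, 3, 10, 1, 6, 2, 9, 4],
})
model = smf.ols("days_unemployed ~ career_limitations + experience_years",
                data=df).fit()

# Standardise predicted values and residuals, then plot them against each other;
# a shapeless cloud around zero is consistent with homoscedasticity.
zpred = (model.fittedvalues - model.fittedvalues.mean()) / model.fittedvalues.std()
zresid = (model.resid - model.resid.mean()) / model.resid.std()
plt.scatter(zpred, zresid)
plt.xlabel("ZPRED")
plt.ylabel("ZRESID")
plt.show()

# Normal probability plot of the residuals; points should follow the diagonal.
stats.probplot(model.resid, dist="norm", plot=plt)
plt.show()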
Regression Plots
Homoscedasticity: ZRESID vs. ZPRED
Normality of Errors: Histograms and P-P Plots