BRM - Topic 8
BRM - Topic 8
Correlation
Factor analysis
&
Regression
GROUP 3
Table of Content
I - Technique Overview
1. Correlation
2. Factor Analysis
3. Regression
II - Technique Practices
Technique
Overview
Correlation
Analysis
Correlation?
Example
r = 0.9361 - Respondents’
duration of residence in the city
is strongly associated with their
attitude toward the city.
Example
The simple correlations between the variables are:
Partial correlation
controlling for the effect of importance attached to weather has little effect
on the association between attitude toward the city and duration of residence
Partial correlations have an order associated with them. The order
indicates how many variables are being adjusted or controlled.
The correlation between Y and X when the linear effects of the other
independent variables have been removed from X but not from Y.
Let’s calculate the part correlation between attitude toward the city and the
duration of residence, when the linear effects of the importance attached to
weather have been removed from the duration of residence.
Example
Non Metric Correlation
If the non metric variables are ordinal and numeric, Spearman’s rho, ρs, and
Kendall’s tau, τ, are two measures of non metric correlation that can be used to
examine the correlation between them.
Both these measures use rankings rather than the absolute values of the variables
In the absence of ties, Spearman's ρs yields a closer
approximation to the Pearson product moment correlation
coefficient, ρ, than Kendall's τ. In these cases, the absolute
magnitude of τ tends to be smaller than Pearson's.
- Strength of Relationship
- Direction of Relationship
- Patterns and Trends
Answers to
Product Moment Correlation
- How strongly are sales related to advertising expenditures?
- Is there an association between market share and size of the sales force?
- Are consumers’ perceptions of quality related to their perceptions of prices?
Partial Correlation
- How strongly are sales related to advertising expenditures when the effect of price
is controlled?
- Is there an association between market share and size of the sales force after
adjusting for the effect of sales promotion?
- Are consumers’ perceptions of quality related to their perceptions of prices when
the effect of brand image is controlled?
Factor
Analysis
Definition
Factor analysis is a general name denoting a class of procedures
primarily used for data reduction and summarization.
Objectives
To identify a new, smaller, set of uncorrelated
variables to replace the original set of
Factor analysis is used in the following correlated variables in subsequent multivariate
circumstances: analysis (regression or discriminant analysis).
Market Segmentation
Identifying the underlying variables on which to group the customers
New car buyers might be grouped based on the relative emphasis they place on economy,
convenience, performance, comfort, and luxury ---> 5 segments: economy seekers, convenience
seekers, performance seekers, comfort seekers, and luxury seekers.
Product Research
Determine the brand attributes that influence consumer choice
Toothpaste brands might be evaluated in terms of protection against cavities, whiteness of
teeth, taste, fresh breath, and price.
Advertising Studies
Understand the media consumption habits of the target market
The users of frozen foods may be heavy viewers of cable TV, see a lot of movies, and listen to
country music.
Benefits
Spotting trends
Example:
A retail company conducts factor analysis on
customer purchase data.
They discover that customers who buy certain
products also tend to purchase related
accessories.
This insight allows the company to create
targeted product bundles and improve cross-
selling strategies
Benefits
Pinpoint the number of factors in a data set
Example:
Studying employees’ performance in the
company. By using factor analysis, they identify
key factors such as working habits,
attendance, and overtimes involvement.
This helps managers focus on these critical
aspects to improve overall performance
outcomes.
Benefits
Streamlines segmenting data
Example:
An e-commerce business planning a customer
segmentation study uses factor analysis to
streamline data inputs.
They find that customer preferences for fast
delivery, product variety, and pricing structure
are interconnected ---> the creation of distinct
customer segments for targeted marketing
campaigns and personalized services.
Business Questions
Market Product Employee
Segmentation Development Satisfaction
What attributes are most
What are the key factors What factors contribute most to
important to our customers
influencing customer preferences employee satisfaction in the
when considering a new
in our market? workplace?
product?
Kaiser-Meyer-Olkin (KMO):
Measure of sampling adequacy.
Small values of the KMO statistic indicate that the correlations between pairs of
variables cannot be explained by other variables and that factor analysis may not
be appropriate.
A KMO value greater than 0.5 is generally desirable.
Construct the Correlation Matrix
Determine the Method of Factor Analysis
Once it has been determined that factor analysis is suitable for analyzing the data,
an appropriate method must be selected
Orthogonal Rotation:
Axes are maintained at right angles.
Commonly used method: Varimax.
Minimizes variables with high loadings on a factor, enhancing interpretability.
Results in uncorrelated factors.
Oblique Rotation:
Axes are not at right angles, allowing for correlated factors.
Useful when factors in the population are likely to be strongly correlated.
Can simplify the factor pattern matrix by permitting correlations among factors.
Rotate Factors
where
The use of
them.
Quantitative Data
regression Regression is appropriate for quantitative variables on an
interval or ratio scale. For instance, predicting salary based on
analysis
experience, education, and age exemplifies a fitting
application of regression analysis.
Multivariate Relationships
If you have more than one independent variable and you want
to understand how they collectively influence the dependent
variable, multiple regression analysis can be employed.
Bivariate
Regression
Bivariate regression is a procedure for deriving a mathematical
relationship, in the form of an equation, between a single metric
dependent or criterion variable and a single metric independent
or predictor variable. The analysis is similar in many ways to
determining the simple correlation between two variables.
2
It is good practice to use adjusted R rather than
2 2
R because adjusted R only increases when a
new independent variable contributes
significantly to explain total variations in the
dependent variable.
Collinearity and
Multicollinearity
When the independent variables X1, X2, . . . , Xm are related to
each other instead of being independent, we have a condition
known as multicollinearity. If only two predictors are correlated,
we have collinearity.
Multicollinearity may be detected through “variance inflation
factors (VIF)”
Significance Testing
The purpose of significance testing in regressio analysis is to
determine how an independant variable affects the depent
variable (positively, negatively or no effects at all)
Degree of freedom:
The greater the number of observations n, the more distinct the sampling distribution of the
estimated parameter β is closer to the standard distribution and the easier it is to reject the
hypothesis, with k is the number of variables explained in the model.
Two-sided Test
Hypotheses:
Significance:
<0.05 is significant
2. The EMP is potentially influenced by both SAL and MAS. If this is the case, to
which extent the variation of EMP is uniquely due to SAL? And, uniquely due to MAS?
The extent that EMP is EMP is significantly
uniquely due to SAL & MAS influenced by SAL & MAS
Variables with a higher absolute Beta coefficient will have stronger unique
influence on EMP
3. Whether increasing salary is a good solution for improving employees’ performance? To which extent
employees’ salary predicts their performance?
R Square = .387 indicates that 38.7% of EMP can be explained by SAL
Sig. = .000 < .05 ---> can predict EMP based on SAL
Sig. = .045 < .05 ---> can predict EMP based on SUP
sig. = 0.000 <0.05 ---> the research model can help predict EMP
The most important factor is training (TRA ) (0.269)
The least important factor is heavy workload (HWL) (0.021)
Even though HWL and SUP have Sig. > 0.05 , we don’t remove them from the model and re-run the analysis
Factor
Analysis
Practice
1. Theoretically, how many attributes the managers should pay attention to?
Theoretically, the manager should pay
attention to all the attributes:
A1: Acceleration
A2: Engine size
A3: Sporty shape
A4: Modern design
A5: Airbag
A6: Safety rating
A7: Stability control
A8: Max speed
A9: Colour
A10: ABS brake
A11: Entertainment systems
A12: Number of cylinders
A13: Panoramic sunroof
A14: Number of seats
A15: Voice recognition
2. Practically, what can be the disadvantage if the managers pay attention to those
attributes?
Component 4: Design
Component 3: Utilities
A3: Sporty shape
A11: Entertainment systems
A4: Modern design
A14: Number of seats
A9: Colour
A15: Voice recognition
A13: Panoramic sunroof
Business Research Methods
Thank
you very
much!
GROUP 3