Regression Analysis Final Project
Regression Analysis Final Project
UNIVERSITY OF LAHORE
Lahore Business School
Submitted To:
Dr. Ayesha Iftikhar
Submitted By:
Subject:
Data Analytics
CASE PROBLEM: SELECTING A POINT- AND - SHOOT DIGITAL CAMERA
Consumer Reports tested 166 different point-and-shoot digital cameras. Based upon factors such as the number
of megapixels, weight (oz), image quality, and ease of use, they developed an overall score for each camera
tested. The overall score ranges from 0 to 100, with higher scores indicating better overall test results. Selecting
a camera with many options can be a difficult process, and price is certainly a key issue for most consumers. By
spending more, will a consumer really get a superior camera? And, do cameras that have more megapixels, a
factor often considered to be a good measure of picture quality, cost more than cameras with fewer megapixels?
Table 14.15 shows the brand, average retail price ($), number of megapixels, weight (oz), and the overall score
for 13 Canon and 15 Nikon subcompact cameras tested by Consumer Reports (Consumer Reports website).
Managerial Report
Note: Use MS EXCEL for analysis and MS Word for report Writing. No marks will be rewarded if only
excel file will be submitted. It is a group project. No marks will be awarded for plagiarized or similar
content.
Project must contain proper Title page having name of all group members, subject, teacher’s name, date
of Submission and LBS LOGO. Which is attached herewith.
A report must be comprehensive and should have proper interpretation of each part.
Place Table no and Figure No with each table and plots.
1. Develop numerical summaries of the data.
Weight (oz)
Mean 5.821428571
Standard Error 0.185831261
Median 6
Mode 5
Standard 0.983326607
Deviation
Sample Variance 0.966931217
Kurtosis -1.190294996
Skewness -0.119748909
Range 3
Minimum 4
Maximum 7
Sum 163
Count 28
Score
Mean 56.35714286
Standard Error 1.26534422
Median 56.5
Mode 66
Standard 6.695572256
Deviation
Sample Variance 44.83068783
Kurtosis -0.616238347
Skewness -0.429488071
Range 24
Minimum 42
Maximum 66
Sum 1578
Count 28
2. Using overall score as the dependent variable, develop three scatter diagrams, one using price as the
independent variable, one using the number of megapixels as the independent variable, and one using
weight as the independent variable. Which of the three independent variables appears to be the best
predictor of overall score?
price as the independent variable
Price
80
70
60
50
Score
40
30
20
10
0
0 50 100 150 200 250 300 350 400 450
Price
60
50
40
Score
30
20
10
0
0 2 4 6 8 10 12 14 16 18
Megapixels
Weight
70
60
50
40
Score
30
20
10
0
0 1 2 3 4 5 6 7 8
Weight
The Scatter Diagram between score as independent variable and price as independent variable. This is the
best predictor of the overall score because dots are closely to each other and relation is strong positive.
3. Choose independent variable of your own choice. Using simple linear regression, develop an estimated
regression equation that could be used to predict the overall score of the camera. Interpret your findings
in detail.
We choose price independent variable,
Estimated Regression Equation:
̂ = 𝒃𝟎 + 𝒃𝟏𝑿
𝒀
̂ = 𝟒𝟔. 𝟔𝟔 + 𝟎. 𝟎𝟓𝟓𝑿
𝒀
Findings,
Regression Statistics
Multiple R 0.683211844
R Square 0.466778424
Adjusted R Square 0.446269901
Standard Error 4.982379069
Observations 28
Multiple R shows the coefficient of correlation which is 0.683. This is more closer to one than zero which
shows this relation is strong positive.
R Square is the Square root of R and it is coefficient of determination. It shows there is 46.66% change in
score due to price.
Sample Size is 28.
P-Value is 0.0000615 which is less than 0.05. This mean that this model is significant good.
Intercept 𝑏0 is 46.66.
Slope 𝑏1 which is 0.0552
Standard Error which is 4.98 shows the percentage variance along with the regression line.
Assumptions:
Normal Probability Plot is linearity and normality.
Residual Plot is showing equal Variance.
Line fit plot is showing independence of error.
4. Repeat Step 3 and develop an estimated regression equation by Using Multiple linear regression that
could be used to predict the overall score of the camera. Interpret your findings in details.
We choose Price, Megapixels and Weight independent variable,
Estimated Regression Equation:
̂ = 𝒃𝟎 + 𝒃𝟏𝑿𝟏 + 𝒃𝟐𝑿𝟐 + 𝒃𝟑𝑿𝟑
𝒀
̂ = 𝟓𝟎. 𝟏𝟒 + 𝟎. 𝟎𝟓𝟓𝑿𝟏 − 𝟎. 𝟑𝟓𝑿𝟐 + 𝟎. 𝟏𝟕𝟗𝑿𝟑
𝒀
Findings,
Regression Statistics
Multiple R 0.691437086
R Square 0.478085244
Adjusted R Square 0.4128459
Standard Error 5.130547941
Observations 28
Multiple R shows the coefficient of correlation which is 0.69. This is more closer to one than
zero which shows this relation is strong positive.
R Square is the Square root of R and it is coefficient of determination. It shows there is
47.80% change in score due to price.
Sample Size is 28.
P-Value is 0.001184 which is less than 0.05. This mean that this model is significant
good fit.
Intercept 𝑏0 is 50.14.
Slope 𝑏1 which is 0.0556
Slope 𝑏2 which is -0.35
Slope 𝑏3 which is 0.17
Standard Error which is 5.13 shows the percentage variance along with the regression line.
Assumptions:
Normal Probability Plot is linearity and
normality. Residual Plot is showing equal
Variance.
Line fit plot is showing independence of error.