P (Y 1) e 1+ E: Business Analytics - Assignment
P (Y 1) e 1+ E: Business Analytics - Assignment
______________________________________________________________________________
I. ‘FNB Bank’ is looking to increase the number of customers who use a ‘Direct payroll’ deposit.
The Management is considering a new marketing campaign that will require each branch
manager to call up customers who do not have this deposit. Because of the time and cost
involved, the management wants to focus their efforts only on those customers who have the
highest probability of signing up for the Direct payroll deposit.
The management believes that the ‘average monthly balance’ in a customer’s savings account
may be a useful predictor of whether or not the customer will sign up for the Direct Payroll
deposit.
To investigate the relation between these two variables, the bank built a model using the
average monthly balance (in hundreds of $) and whether the customer signed up for the
payroll deposit (coded 1 if they signed and 0 if not).
A portion of the output is as follows:
Parameter Coeff. SE Coef. P
Constant -2.6335 0.7985 0.001
Monthly Balance 0.22018 0.09002 0.0001
(a) Write the Equation that this model represents.
(c) Suppose the bank wants to contact customers who have a 0.5 or higher probability of
opening a Direct payroll deposit, what is the average monthly balance required to
achieve this probability?
II. Bendrix company manufactures various types of parts for automobiles. The manager of the
factory wants to get a better understanding of overhead costs. Over the past 36 months, the
manager has tracked total overhead costs. To help explain these, he has also collected data
on two variables that relate to the work done at the factory, these are Machine hours (no.
of machine hours in a month) and Production Runs. (number of runs during a month). Refer
to the file ‘Overhead costs.xls”.
a. Build a model to help the manager predict Overhead costs.
c. What would the predicted overhead cost be for 60 production runs and 1500
Machine hours.
Y= 3996.67 + 43.53(1500) + 883.61(60)
Y= 122,308.3
III. Box Office success of Bollywood movies was analysed using several variables. A model was
developed to predict success (1) or failure (0) of a movie using the movie Budget as an
independent variable. A sample of the output is shown below:
Parameter Parameter Estimates P value
Budget -0.016 0.001
Constant 1.621 0.002
(a) Are higher budget movies more likely to fail at the box office? Explain.
P(Y=1) = e1.621 -0.016(Budget)
1+ e1.621-0.016(Budget)
From the equation above, it can be noted that the co-efficient of budget is -0.016. So,
the higher the budget the less is likelihood of success of the movie.
Budget 50 100 200 350
Z 0.821 0.021 -1.579 -3.979
Odds 2.272771 1.021222 0.206181 0.018704
Probability 0.694449 0.50525 0.170937 0.018361
Yes, The Higher budget movies are more likely to fail. As, according to the table above it
can be seen that there exists a negative relationship between budget and success and of
the movie.
(b) Calculate the budget for which box office success and failure are equally likely.
Here, P (Success = 1) = 0.5
Odds = 1(i.e. No. of Fav outcomes/No. of Un-fav. Outcomes)
Budget = (1.621- Ln(Odds))/0.016 = 101.325
IV. The ‘Restaurant Customer Satisfaction’ Survey was conducted during the period 2016-17.
The data is available in the excel file ‘Restaurant Ratings’. The variable ‘Type’ indicates if the
restaurant is ‘’Italian” or “Chinese”. ‘Price’ indicates the average amount paid per person for
dinner. ‘Score’ reflect the customer’s overall satisfaction, with higher values indicating
greater satisfaction.
a. Develop a model to show how ‘overall customer satisfaction’(Score) is related to
‘average price of the meal’ and ‘Type of restaurant’. Is the ‘Type of restaurant’ a
significant factor in overall customer satisfaction? Paste your output and answer the
question.
H0 : There is no Relationship between Type of Restaurant , Average Price of the
meal and, Overall Customer Satisfaction.
H1 : There is Relationship between Type of Restaurant , Average Price of the meal
and, Overall Customer Satisfaction.
Here, P < Alpha, Reject H0.
Yes, The type of restaurant will be a significant factor in the Overall Customer
Satisfaction as 0.017 i.e. the p-value of the restaurant types is less than alpha i.e.
0.05
b. Use the model to predict the satisfaction score of a Chinese restaurant that has an
average meal price of $20.
Satisfaction Score = 67.40 + 0.573(Avg. Meal Price) + 3.038 (Italian Restaurant)
Satisfaction Score of Chinese Restaurant with avg. meal price of Rupees 20 = 78.86
c. How much would the predicted score have changed for an Italian restaurant?
Satisfaction Score = 67.40 + 0.573(Avg. Meal Price) + 3.038 (Italian Restaurant)
Satisfaction Score of Italian Restaurant with avg. meal price of Rupees 20 = 81.898
V. A bank is interested in predicting which customers will respond to its direct marketing
campaign to open a Fixed Deposit with the bank. The response variable Y = 1 implies that a
customer will open an FD after the campaign. The Bank built a model using ‘Job’ as a
predictor variable. The kind of jobs was categorized into 5 levels ; Blue Collar, Management,
Self Employed, Unemployed and Other. The model output is shown as follows:
Variables Parameter Estimates Significance
Blue Collar -0.627 0.0001
Self Employed -0.285 0.002
Management 0.060 0.003
Unemployed -0.264 0.0001
Constant -1.916 0.001
Predicted 0 Predicted 1
Actual 0 3871 129
Actual 1 452 69
(BONUS) If you were responsible for this direct marketing campaign to open FDs in the bank,
which part of the classification matrix would be critical to you