0% found this document useful (0 votes)

68 views4 pages

Extra Practice #3 Making A Movie

extra homework

Uploaded by

Matthew Khachigian

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

68 views4 pages

Extra Practice #3 Making A Movie

extra homework

Uploaded by

Matthew Khachigian

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

BUAD 425

Extra Practice #3 Making a Movie

Use JMP13, Excel and the data to answer the following questions. Where appropriate,
provide clear, succinct justifications of your rational.

Background

In 2016, the film industry in the US was over $11.5 Billion dollars.

Since talented directors and lead actors often consistently create blockbuster movies,
the name of the director and cast are generally good predictive variables for a film’s
success. However, a number of other variables may also be important to consider
including the film budget, country of origin, and plot (is the movie a sequel?).

In the past several years, social media has emerged as an important method of
marketing and advertising. In this assignment, we attempt to predict whether a movie
will be a “financial success” by considering a number of factors pertaining to the social
media presence, the duration of the movie, and the overall popularity of the cast and
director etc. We say a movie is a “financial success” if the movie yields a 30% profit or
higher.

The dataset movies.jmp consists of a subset of 2,823 movies produced from 1936 to
2010. A detailed description of the variables and their meanings are available at the
end of this document.

Instructions
Download the dataset “movies.jmp” from BB. The “.jmp” version has already been split
into a training set and a test set (the first 1000 movies are considered testing data).

You should use JMP, Excel or both to answer the following questions. Be sure to detail
your work and approach. Where appropriate, justify your responses quantitatively. You
may work in teams, but everyone must write-up their own solutions (and perform their
own computations) separately.

Logistic Regression (Linear Classification):

1. (2 pts) Build a logistic regression model to predict whether a movie will be a
financial success using all the variables EXCEPT:

 Movie_title
 Title_year
 Content_rating
 Country
 Director_name
NOTICE: This case and its solutions are COPYRIGHTED. They may not be copied, sold, published, disseminated, shared, or other wise
communicated to third parties whether in person, online or otherwise and whether or not for a profit or nonprofit purpose (2016).
1
BUAD 425

 Actor_main
 Testing_Data

What is the R^2 of this model?

2. (2 pts) Use stepwise and the “Go” option to build another logistic regression
model to predict whether a movie will be a financial success. Allow JMP to use
the same variables as in Q1. What variables does JMP ultimately pick? What is
the R^2 of this model?

3. (2 pts) Compare your answers in Q1, and Q2. Which do you think is a better
model for the business? Justify your response using the JMP output.

4. (6 pts) In the stepwise model you created in Q2, what is the coefficient (or
weight) of “director_facebook_likes”? Comparatively, what is the coefficient (or
weight) of the “actor_main_facebook_likes”?

Do these coefficients make sense? Justify your response. What conclusions

might you draw for the business?

5. (4 pts) Using the stepwise model from Q2 and a probability threshold of .57,
what is the resulting confusion matrix? (Remember to only use the testing data).

6. (4 pts) What is the accuracy of the classifier in Q5 (on the testing data)?

7. (6 pts) As a conservative estimate, let’s assume that we invest $1M for every
movie we predict is going to be a financial success, and on average, we earn a
profit of $4M every time we are correct (i.e. when we correctly invest in movies
which are financial successes).

In such a scenario, each time we incorrectly predict a movie is going to be a

financial success, we lose $1M and each time we predict a movie is not going to
be a success, and it is, we lose the opportunity to make $4M.

According to the testing data in our model, how much money would we have
lost given the scenario above? Can we adjust our model to lower this number?
Briefly explain.

NOTICE: This case and its solutions are COPYRIGHTED. They may not be copied, sold, published, disseminated, shared, or other wise
communicated to third parties whether in person, online or otherwise and whether or not for a profit or nonprofit purpose (2016).
2
BUAD 425

Decision Tree Questions:

8. (4 pts) Use the “Go” option and fit a decision tree model with the same variables
as in Q1. How many times did JMP split the data? Comment on why JMP
stopped at this number?

9. (4 pts) What is the R Square value of your model on the training data? On the
testing data?

10. (4 pts) A retired movie producer comments that social media has changed the
movie production world, and at this stage, the main actor having high facebook
likes is the most important indication of whether a movie will be successful or
not.

Based on the decision tree model in Q8, do you agree? If not, which variable is
the most important indication?

11. (4 pts) Describe the characteristics of a movie which is most likely NOT to be a
financial success (in other words, describe the characteristics of a movie which
would lead us to most confidently assume that the movie will not be a success)

12. (4pts) What is the confusion matrix for this model? Use a probability threshold
of .65.

13. (4 pts) What is the overall accuracy of this model? Is this model more accurate
than the Logistic Regression model from the previous parts?

Based on the assumptions stated in Q7, which model would you suggest the
company use? State clearly any assumptions you are making and justify your
response quantitatively.

Appendix:

Variable key for “movies.jmp”

 movietitle = the title of the movie
NOTICE: This case and its solutions are COPYRIGHTED. They may not be copied, sold, published, disseminated, shared, or other wise
communicated to third parties whether in person, online or otherwise and whether or not for a profit or nonprofit purpose (2016).
3
BUAD 425

 title_year = the year the movie was released

 duration = the length of the movie in minutes
 content_rating = the rating of the content of the movie
 country = the country in which the movie was produced
 director_name = the name of the director
 actor_main = the name of the main actor in the movie
 director_facebook_likes = the number of facebook likes the director received
during the promotion phase of the movie
 actor_main_facebook_likes = the number of facebook likes the main actor
received during the promotion phase of the movie
 cast_total_likes = the number of facebook likes the entire cast received during
the promotion phase of the movie
 movie_facebook_likes = the number of facebook likes the movie overall
received during the promotion phase of the movie
 num_voted_users = the number of users who voted in various social media
campaigns including facebook
 facenumber_in_poster = the number of people in the promotional posters for
the movie
 imdb_score = the IMDB score for the movie (movies are scored before being
released)
 Financial_Success = an indication of whether the movie was financially
successful (1 = financially successful or 0 = not financially successful)
 Testing = an indication of whether the record is training or testing data (1 =
testing data or 0 = training data)

Netflix Prize: All Together Now: A Perspective On The
No ratings yet
Netflix Prize: All Together Now: A Perspective On The
1 page
Win/Loss Analysis: How to Capture and Keep the Business You Want
From Everand
Win/Loss Analysis: How to Capture and Keep the Business You Want
Ellen Naylor
No ratings yet
Surveying Presentation
No ratings yet
Surveying Presentation
23 pages
Report
No ratings yet
Report
26 pages
Predicting Movie Success Based On Imdb Data
No ratings yet
Predicting Movie Success Based On Imdb Data
5 pages
Linear Regression and Modeling Data
No ratings yet
Linear Regression and Modeling Data
3 pages
Bheem Final
No ratings yet
Bheem Final
65 pages
The Real Value of Training: Measuring and Analyzing Business Outcomes and the Quality of ROI
From Everand
The Real Value of Training: Measuring and Analyzing Business Outcomes and the Quality of ROI
Ron Stone
No ratings yet
Review 2
No ratings yet
Review 2
21 pages
Producing an Independent Film
From Everand
Producing an Independent Film
James F Simpson
No ratings yet
Final Review
No ratings yet
Final Review
24 pages
Succession Planning Simulation
From Everand
Succession Planning Simulation
William Rothwell
No ratings yet
SDM - Task B - Group 1G - Movies
No ratings yet
SDM - Task B - Group 1G - Movies
11 pages
Gprof Second Edition
From Everand
Gprof Second Edition
Gerardus Blokdyk
No ratings yet
A Fool With a Tool
From Everand
A Fool With a Tool
Blair Goulet
No ratings yet
Movie Success Prediction Using Data Mining: Functional Requirements
No ratings yet
Movie Success Prediction Using Data Mining: Functional Requirements
3 pages
Google Fiber A Complete Guide
From Everand
Google Fiber A Complete Guide
Gerardus Blokdyk
No ratings yet
Professional Scrum Master II Practice Questions and Exam Tests PSM II Exam Guidebook And Updated Questions
From Everand
Professional Scrum Master II Practice Questions and Exam Tests PSM II Exam Guidebook And Updated Questions
Idea Link
No ratings yet
JProfiler Third Edition
From Everand
JProfiler Third Edition
Gerardus Blokdyk
No ratings yet
Project 5
No ratings yet
Project 5
13 pages
Succession Management the “How To” Puzzle—Solved!: A Practical Guide to Talent Management
From Everand
Succession Management the “How To” Puzzle—Solved!: A Practical Guide to Talent Management
Mark Caruso
No ratings yet
Quantitative Methods II Mid-Term Examination: Instructions
100% (1)
Quantitative Methods II Mid-Term Examination: Instructions
17 pages
Group Project Description
No ratings yet
Group Project Description
6 pages
Sample Resume
No ratings yet
Sample Resume
3 pages
Pilot plant A Complete Guide
From Everand
Pilot plant A Complete Guide
Gerardus Blokdyk
No ratings yet
Data Analytics Group 7
No ratings yet
Data Analytics Group 7
7 pages
IBM RPG A Complete Guide
From Everand
IBM RPG A Complete Guide
Gerardus Blokdyk
No ratings yet
Student Details
No ratings yet
Student Details
10 pages
Strategic Continuous Process Improvement
From Everand
Strategic Continuous Process Improvement
Gerhard J. Plenert
No ratings yet
YouTube My Business
From Everand
YouTube My Business
Laura Maya
No ratings yet
Game design Complete Self-Assessment Guide
From Everand
Game design Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Movie Succ Pred
No ratings yet
Movie Succ Pred
4 pages
Data Science for Decision Makers: Enhance your leadership skills with data science and AI expertise
From Everand
Data Science for Decision Makers: Enhance your leadership skills with data science and AI expertise
Jon Howells
No ratings yet
Prediks I Movie
No ratings yet
Prediks I Movie
25 pages
Evidence Guided: Creating High Impact Products in the Face of Uncertainty
From Everand
Evidence Guided: Creating High Impact Products in the Face of Uncertainty
Itamar Gilad
No ratings yet
IP multicast Third Edition
From Everand
IP multicast Third Edition
Gerardus Blokdyk
No ratings yet
Google Video Complete Self-Assessment Guide
From Everand
Google Video Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Project 5
No ratings yet
Project 5
13 pages
Practice Final Part 1
No ratings yet
Practice Final Part 1
9 pages
Microsoft Azure Machine Learning
From Everand
Microsoft Azure Machine Learning
Sumit Mund
4.5/5 (3)
Movie Box Office Success Prediction Using Machine Learning
No ratings yet
Movie Box Office Success Prediction Using Machine Learning
4 pages
JProbe A Complete Guide
From Everand
JProbe A Complete Guide
Gerardus Blokdyk
No ratings yet
AAS DSExam
No ratings yet
AAS DSExam
5 pages
P&C Data Platforms Standard Requirements
From Everand
P&C Data Platforms Standard Requirements
Gerardus Blokdyk
No ratings yet
Google Voice Complete Self-Assessment Guide
From Everand
Google Voice Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Data fusion A Clear and Concise Reference
From Everand
Data fusion A Clear and Concise Reference
Gerardus Blokdyk
No ratings yet
Movie Success Prediction Using Machine Learning Algorithms and Their Comparison
No ratings yet
Movie Success Prediction Using Machine Learning Algorithms and Their Comparison
6 pages
b1 PDF
No ratings yet
b1 PDF
6 pages
Imdb Questions
No ratings yet
Imdb Questions
4 pages
Generic data model Complete Self-Assessment Guide
From Everand
Generic data model Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Movie Success Prediction Using Data Mining PHP: Objective
No ratings yet
Movie Success Prediction Using Data Mining PHP: Objective
2 pages
IBM System z9 Second Edition
From Everand
IBM System z9 Second Edition
Gerardus Blokdyk
No ratings yet
IMDB Movie Analysis
No ratings yet
IMDB Movie Analysis
2 pages
Account Based Analytics Final Spring 2025
No ratings yet
Account Based Analytics Final Spring 2025
2 pages
Google Play A Complete Guide
From Everand
Google Play A Complete Guide
Gerardus Blokdyk
No ratings yet
Google Lens A Clear and Concise Reference
From Everand
Google Lens A Clear and Concise Reference
Gerardus Blokdyk
No ratings yet
Quantitative Strategies for Achieving Alpha: The Standard and Poor's Approach to Testing Your Investment Choices
From Everand
Quantitative Strategies for Achieving Alpha: The Standard and Poor's Approach to Testing Your Investment Choices
Richard Tortoriello
4/5 (1)
Backup (software) Complete Self-Assessment Guide
From Everand
Backup (software) Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Public Cloud Storage Standard Requirements
From Everand
Public Cloud Storage Standard Requirements
Gerardus Blokdyk
No ratings yet
SEO Split Testing: Split Testing In SEO For Data Driven Success
From Everand
SEO Split Testing: Split Testing In SEO For Data Driven Success
Dr. Michael C. Melvin
No ratings yet
Gamify at Work: How to Tap Into the Potential Within Your Organization
From Everand
Gamify at Work: How to Tap Into the Potential Within Your Organization
Jason Kelly
No ratings yet
Barbosa Et Al. (2022) - Spatial Correlates of COVID-19 First Wave Across Continental Portugal
No ratings yet
Barbosa Et Al. (2022) - Spatial Correlates of COVID-19 First Wave Across Continental Portugal
13 pages
Amsterdam + Berlin Schedule & Curriculum Edorer Business Analytics & Data Science Bootcamp
No ratings yet
Amsterdam + Berlin Schedule & Curriculum Edorer Business Analytics & Data Science Bootcamp
14 pages
Final Exam - Proba - 2022 - Sem01 - v1
No ratings yet
Final Exam - Proba - 2022 - Sem01 - v1
3 pages
Stat 1 Midterm Exam
No ratings yet
Stat 1 Midterm Exam
2 pages
Unit-5 BRM
No ratings yet
Unit-5 BRM
10 pages
Stats Poster Project 1
No ratings yet
Stats Poster Project 1
3 pages
Unit 4 - Classification and Prediction
No ratings yet
Unit 4 - Classification and Prediction
72 pages
Annotated Stata Output - Logistic Regression
100% (1)
Annotated Stata Output - Logistic Regression
3 pages
6.03.P Spread of Data
No ratings yet
6.03.P Spread of Data
6 pages
Estimating Population Variance
No ratings yet
Estimating Population Variance
26 pages
A Robust and Regularized Extreme Learning Machine
No ratings yet
A Robust and Regularized Extreme Learning Machine
7 pages
(Fix) Multilevel Modelling and Cluster Analysis (Group 10)
No ratings yet
(Fix) Multilevel Modelling and Cluster Analysis (Group 10)
13 pages
Setting Up Your WEKA Experiments With Feature Sets
No ratings yet
Setting Up Your WEKA Experiments With Feature Sets
3 pages
Start: Anderson-Darling: A Goodness of Fit Test For Small Samples Assumptions
No ratings yet
Start: Anderson-Darling: A Goodness of Fit Test For Small Samples Assumptions
6 pages
Hypothesis Tests For The Means of Two Populations
No ratings yet
Hypothesis Tests For The Means of Two Populations
21 pages
Demand Forecasting
0% (1)
Demand Forecasting
56 pages
Statistics Mcqs - Estimation Part 1: Examrace
100% (1)
Statistics Mcqs - Estimation Part 1: Examrace
7 pages
Risk and Return
No ratings yet
Risk and Return
3 pages
DA Unit 1 Updated
No ratings yet
DA Unit 1 Updated
19 pages
Spss Fisher Exact
No ratings yet
Spss Fisher Exact
24 pages
Estad Istica II Chapter 5. Regression Analysis (Second Part)
No ratings yet
Estad Istica II Chapter 5. Regression Analysis (Second Part)
39 pages
Project STA108
No ratings yet
Project STA108
25 pages
II Puc Statistics Old Question Papers Upto 2017
No ratings yet
II Puc Statistics Old Question Papers Upto 2017
14 pages
Assignment Report - Data Mining
No ratings yet
Assignment Report - Data Mining
24 pages
Quiz - Data Science and Big Data Analytics (1) (Autosaved)
No ratings yet
Quiz - Data Science and Big Data Analytics (1) (Autosaved)
43 pages
ACC 205 Exam Question2
No ratings yet
ACC 205 Exam Question2
4 pages
Econometrics 3: Massimiliano Marcellino
No ratings yet
Econometrics 3: Massimiliano Marcellino
4 pages
DESCRIPTIVE ANALYTICS PPT - Updated
No ratings yet
DESCRIPTIVE ANALYTICS PPT - Updated
127 pages
Weekly Usage Hrs Annual Maintenance Expense (1000s)
No ratings yet
Weekly Usage Hrs Annual Maintenance Expense (1000s)
5 pages

Extra Practice #3 Making A Movie

Uploaded by

Extra Practice #3 Making A Movie

Uploaded by

BUAD 425

Extra Practice #3 Making a Movie

Logistic Regression (Linear Classification):

What is the R^2 of this model?

Do these coefficients make sense? Justify your response. What conclusions

In such a scenario, each time we incorrectly predict a movie is going to be a

Decision Tree Questions:

Variable key for “movies.jmp”

 title_year = the year the movie was released

You might also like