0% found this document useful (0 votes)
45 views

Assignment for Data Analysis

Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
45 views

Assignment for Data Analysis

Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 7

DATA ANALYSIS AND TECHNIQUES

(DAT)
ASSIGNMENT

SUBMITTED TO – Dr. Deepti Aggarwal


SUBMITTED BY – Pragya Kasana
Roll No. – 2024/PHDUSME/05

UNIVERSITY SCHOOL OF MANAGEMENT AND


ENTREPRENEURSHIP (USME)
DELHI TECHNOLOGICAL UNIVERSITY (DTU)
Topic: Case study on All-Time Movie Box Office Data
1. Create a scatter chart to examine the relationship between the year released and the
inflation-adjusted U.S. box office receipts. Include a trendline for this scatter chart.
What does the scatter chart indicate about inflation-adjusted U.S. box office receipts
over time for these top 50 movies?

Solution:

U.S. Box Office Receipts (Inflation Adjusted Millions $)


$1,800
$1,600
$1,400
U.S box office receipt

$1,200
$1,000
$800
$600
$400
$200
$0
1930 1940 1950 1960 1970 1980 1990 2000 2010 2020
Year Released

Interpretation: a. The scatter plot likely to indicate that, as the years progress, the
inflation-adjusted box office earnings of these top movies do not increase or perhaps
even decrease slightly. Despite the general expectation that inflation-adjusted values
should rise over time due to inflation, the box office receipts may not be keeping
pace, suggesting that more recent movies are earning less when adjusted for inflation
compared to earlier films.
b. The trendline in the scatter plot shows a downward trend or a flattening trend,
meaning that inflation-adjusted box office receipts are expected to continue to
decrease or remain steady for movies released in future years. This indicates that,
even though movie ticket prices and inflation increase over time, the top movies are
not necessarily making more in inflation-adjusted dollars. The downward slope
implies that, relative to the past, future movies may not achieve the same inflation-
adjusted box office success.
2. Create a scatter chart to examine the relationship between the budget and the non-
inflation-adjusted world box office receipts. (note: You may have to adjust the data in
Excel to ignore the missing budget data values to create your scatter chart. You can do
this by first sorting the data using Budget and then creating a scatter chart using only
the movies that include data for Budget.) What does this scatter chart indicate about
the relationship between the movie’s budget and the world box office receipts?

Solution:

World Box Office Receipts (Non-Inflation Adjusted Mil-


lions $)
$3,000
Non Inflation Adjusted world box office

$2,500

$2,000

$1,500

$1,000

$500

$0
$0 $50 $100 $150 $200 $250
Budget

Interpretation: The scatter chart shows a clear relationship between a movie's budget and
its world box office receipts. In general, as the budget of a movie increases, the box office
earnings tend to increase significantly. This suggests that higher-budget films often bring in
more revenue globally.
However, there are some exceptions. Some lower-budget films have performed
exceptionally well at the box office, which means that even though these movies had
smaller budgets, they still earned impressive box office numbers. These outliers can
be found among the top movies.
Despite these exceptions, the overall trend indicated by the chart is that the bigger the
budget, the larger the box office earnings, particularly for the most expensive films.
This shows a strong positive relationship between a movie's budget and its global
box office success.
3. Create a frequency distribution and histogram for inflation-adjusted U.S. box office
receipts. Use bin sizes of $100 million. Interpret the results. Do any data points
appear to be outliers in this distribution?
Solution:
Bin Size Frequency Distribution

400-500 3
500-600 19
600-700 9
700-800 5
800-900 6
900-1000 1
1000-1100 3
1100-1200 2
1200-1300 0
1300-1400 0
1400-1500 1
1500-1600 0
1600-1700 1
1700-1800 0

Interpretation: The histogram indicates that most movies have inflation-adjusted


U.S. box office receipts between $500 million and $900 million, with the highest
concentration in the $500-$600 million range. As the box office receipts increase, the
frequency of movies in each higher range decreases. Only a few movies surpass $1
billion in receipts, and those earning over $1.2 billion can be considered potential
outliers, standing out due to their exceptional box office success compared to the
majority of films in the dataset.
4. Create a PivotTable for these data. Use the PivotTable to generate a crosstabulation
for movie genre and rating. Determine which combinations of genre and rating are
most represented in the top 50 movie data. Now filter the data to consider only
movies released in 1980 or later. What combinations of genre and rating are most
represented for movies after 1980? What does this indicate about how the preferences
of moviegoers may have changed over time?

Solution:

Count of Year
Released Column Labels
PG- Grand
Row Labels G PG 13 R UR Total
Action 1 2 3 6
Animated 8 1 9
Comedy 2 2 4
Drama 4 5 3 1 2 15
Horror 1 1
Musical 2 2
SciFi/Fantasy 6 6 12
Western 1 1
Grand Total 15 17 12 4 2 50

Interpretation: Based on the PivotTable, animated movies are the most represented
in the "General Audience" (G-rated) category among the top 50 movies, with a total
of 8 films. This suggests that animated films, which are typically designed to appeal
to a wide range of audiences, especially families and children, have consistently
performed well and have remained popular over time.

Year (Multiple
Released Items)

Count of Column
Rating Labels
P PG- Grand
Row Labels G G 13 R Total
Action 1 3 4
Animated 1 1 2
Comedy 1 1 2
Drama 2 2
SciFi/Fantasy 5 6 11
Grand Total 1 8 11 1 21

Interpretation: After filtering the data to include only movies released in 1980 or
later, the most represented combination of genre and rating in the top 50 movies is
Sci-Fi/Fantasy movies with a PG-13 rating, totaling 6 movies. This indicates a
significant shift in audience preferences compared to earlier years.
The rise of PG-13 rated Sci-Fi/Fantasy films suggests that, after 1980, moviegoers
became more interested in films that feature imaginative, futuristic, or speculative
themes, often combined with action and adventure elements. The PG-13 rating, which
was introduced in 1984, allows for content that appeals to both teenagers and adults
without being as restricted as R-rated films. This broadens the potential audience,
capturing the interest of younger viewers while still allowing for more mature themes
and action compared to G or PG-rated films.
5. Use the PivotTable to display the average inflation-adjusted U.S. box office receipts
for each genre–rating pair for all movies in the dataset. Interpret the results.
Solution:
Average of U.S. Box Office Receipts (Inflation Adjusted Millions $) Column Labels
Row Labels G PG PG-13 R UR Grand Total
Action 506.6 856.35 664.1 701.9333333
Animated 696.625 557.4 681.1555556
Comedy 545 521.65 533.325
Drama 1016.175 612.28 895.7666667 622.8 519.85 765.06
Horror 808.7 808.7
Musical 915.25 915.25
SciFi/Fantasy 884.1166667 550.4666667 717.2916667
Western 553 553
Grand Total 798.32 722.3058824 665.2 618.7 519.85 715.018

Interpretation: The Pivot Table analysis shows that certain genre-rating


combinations, like "Drama" with "G" or "Animated" with "G", are consistently
associated with strong box office performance. Overall, the rating "G" seems to be the
most successful, indicating a balance of content suitable for a wide audience.
These insights can be valuable for studios in making informed decisions about film
production, marketing, and distribution strategies. By understanding the preferences
of audiences and the factors that contribute to box office success, studios can increase
their chances of producing profitable films.

You might also like