0% found this document useful (0 votes)
2 views

IMDB Movie Analysis

Uploaded by

Sayan Santra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views

IMDB Movie Analysis

Uploaded by

Sayan Santra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 6

IMDB Movie Analysis

Project Description:
The IMDB Movie Analysis project typically involves exploring and analysing data from
the Internet Movie Database (IMDb) to gain insights into various aspects of films and the
film industry. This can include analysing trends in genres, box office performance, user
ratings, and reviews, as well as examining the relationships between different variables,
such as budget and revenue. The project may utilize data visualization tools and statistical
methods to present findings, helping to understand patterns in movie popularity, critical
reception, and industry dynamics. It can be a valuable resource for filmmakers, marketers,
and movie enthusiasts alike.

Approach:
 Identify the key question to be answered.
 Handle missing values, duplicates and unnecessary data.
 Convert data types as needed
 Visualize data distributions and relationships.
 Conduct statistical tests to determine significant relationships
 Summarize key insights and trends discovered during the analysis.
 Provide actionable recommendations based on the analysis.

Tech-Stack Used:
“Microsoft Excel 2016” was used to perform analysis and also used to
prepare report.

Cleaning the Data:


This is one of the most important step to do before moving farther with the analysis.
Following steps are:

 Dropping multiple Unnecessary columns.


 Remove blank cells and null values.
 Remove duplicate values.
A. Movie Genre Analysis:
Count of
Genre Genre
Action 976
Adventure 380
Animation 47
Biography 209
Comedy 1041
Crime 262
Documentar
y 43
Drama 704
Family 3
Fantasy 37
Horror 173
Musical 2
Mystery 23
Romance 2
Sci-Fi 10
Thriller 3
Western 4
(blanks)
Grand Total 3919

Average 230.5
Median 43
Mode 3
Maximum 1041
Minimum 2
120073.
Variance 8
Standard
Deviation 346.5

Insights:
 Analyze trends such as: which genres have the highest mean IMDB
scores.
 A genre with a low standard deviation indicates consistent ratings.
 Common ratings can highlight what audiences frequently
appreciate in specific genres.
B. Movie Duration Analysis:

109.872
Average 5
Median 106
Standard 22.4408
Deviation 6

imdb_score
10
9
8
7
6
5
4
3
2
1
0
0 50 100 150 200 250 300 350

Insights:
 Calculating median, average, standard deviation for movie
durations.
 Creating a scatter plot to visualize the distribution of movie
durations.
 Determine the most common duration range.
 Also determine if most movies fall within a specific duration
range.
C. Language Analysis:

Standard
Language Mean Median Deviation
Aboriginal 6.95 6.6 1.047211453
Arabic 7.3 6.6 1.047211453
Aramaic 7.1 6.6 1.047211453
Bosnian 4.3 6.6 1.047211453
Cantonese 7.12 6.6 1.047211453
Czech 7.4 6.6 1.047211453
Danish 7.9 6.6 1.047211453
Dari 7.5 6.6 1.047211453
7.566666
Dutch 7 6.6 1.047211453
Dzongkha 7.5 6.6 1.047211453
6.432988
English 3 6.6 1.047211453
Filipino 6.7 6.6 1.047211453
French 7.2 6.6 1.047211453
German 7.65 6.6 1.047211453
Greek 7.3 6.6 1.047211453
Hebrew 7.8 6.6 1.047211453
7.191666
Hindi 7 6.6 1.047211453
Hungarian 7.1 6.6 1.047211453
Icelandic 6.9 6.6 1.047211453
Indonesian 7.9 6.6 1.047211453
7.185714
Italian 3 6.6 1.047211453
Japanese 7.625 6.6 1.047211453
Kazakh 6 6.6 1.047211453
Korean 7.7 6.6 1.047211453
Mandarin 6.95625 6.6 1.047211453
Maya 7.8 6.6 1.047211453
Mongolian 7.3 6.6 1.047211453
None 8.5 6.6 1.047211453
Norwegian 7.15 6.6 1.047211453
Persian 7.575 6.6 1.047211453
Polish 7.4 6.6 1.047211453
7.666666
Portuguese 7 6.6 1.047211453
Romanian 7.9 6.6 1.047211453
7.466666
Russian 7 6.6 1.047211453
7.088888
Spanish 9 6.6 1.047211453
Swedish 7.15 6.6 1.047211453
Telugu 8.4 6.6 1.047211453
6.633333
Thai 3 6.6 1.047211453
Vietnames
e 7.4 6.6 1.047211453
Zulu 7.3 6.6 1.047211453

Insights:
 Determine which languages are most common in dataset. For
example English dominate but other languages like Spanish,
French, Thai might also have significant representation.
 For each language, calculate mean, median, standard deviation.
 Examine the standard deviation helps to see if certain languages
have more consistent ratings or if there are huge fluctuations.

D. Director Analysis:

Average of
Directors imdb_score
Alfred Hitchcock 8.5
Asghar Farhadi 8.4
Catherine Owens 8.4
Charles Chaplin 8.6
Christopher Nolan 8.425
Damien Chazelle 8.5
Majid Majidi 8.5
Rakeysh Omprakash
Mehra 8.4
Richard Marquand 8.4
Ron Fricke 8.5
S.S. Rajamouli 8.4
Sergio Leone 8.433333333
Tony Kaye 8.6
Grand Total 8.45

Highest Score 8.6


Percent rank 0.916
Percentile 8.5992
Insights:
 Summarize the characteristics of the top directors based on average IMDB
scores.
 Highlight any surprising trends or commonalities among them
 These directors work has influences the film industry and they help 91% to
make the movie a successful one.

E. Budget Analysis:

Correlation 0.0987473

Hight Profit 52350584


Margin 7 Avatar

Insights:
 Analyze the correlation results to determine if higher budgets
consistently lea to higher gross earnings or if there are outliers.
 Identify trends among the top profit-margin movies.
 Highlight specific movies that dignify high profitability, analyzing
factors that contributed to their success.

You might also like