Google Play Store Data Analysis
The Google Play Store dataset is a publicly available dataset that contains information about
Android apps, including their category, rating, reviews, size, installs, price, and more.
The copy analyzed in this notebook has 10,841 rows and 13 columns. It can be used to answer
a variety of questions about the Android app market.
The Google Play Store data can be analyzed using a variety of tools and techniques, such as:
1. Data visualization: This can be used to create charts and graphs that illustrate the data
in a clear and concise way.
2. Statistical analysis: This can be used to identify patterns and trends in the data.
3. Machine learning: This can be used to build models that predict future behavior.
In [1]: # NumPy (Numerical Python) is used for scientific computing.
import numpy as np
In [2]: # importing the libraries which are needed for the visualization.
import matplotlib.pyplot as plt
%matplotlib inline
import seaborn as sns
In [4]: #Versions.
print("pandas",pd.__version__)
print("numpy",np.__version__)
print("seaborn",sns.__version__)
localhost:8888/nbconvert/html/DA_Dashboard_Analysis/Google_playstore_analysis.ipynb?download=false 1/46
6/30/23, 12:02 PM Google_playstore_analysis
pandas 1.4.4
numpy 1.21.5
seaborn 0.11.2
Out[6]:
                                                 App        Category  Rating Reviews  Size     Installs  Type Price Content Rating  ...
0     Photo Editor & Candy Camera & Grid & ScrapBook  ART_AND_DESIGN     4.1     159   19M      10,000+  Free     0       Everyone  ...
1                                Coloring book moana  ART_AND_DESIGN     3.9     967   14M     500,000+  Free     0       Everyone  ...
2  U Launcher Lite – FREE Live Cool Themes, Hide ...  ART_AND_DESIGN     4.7   87510  8.7M   5,000,000+  Free     0       Everyone  ...
3                              Sketch - Draw & Paint  ART_AND_DESIGN     4.5  215644   25M  50,000,000+  Free     0           Teen  ...
4              Pixel Draw - Number Art Coloring Book  ART_AND_DESIGN     4.3     967  2.8M     100,000+  Free     0       Everyone  ...
Out[7]:
                                                 App   Category  Rating Reviews Size     Installs  Type Price  ...
10840  iHoroscope - 2018 Daily Horoscope & Astrology  LIFESTYLE     4.5  398307  19M  10,000,000+  Free     0  ...
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 10841 entries, 0 to 10840
Data columns (total 13 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 App 10841 non-null object
1 Category 10841 non-null object
2 Rating 9367 non-null float64
3 Reviews 10841 non-null object
4 Size 10841 non-null object
5 Installs 10841 non-null object
6 Type 10840 non-null object
7 Price 10841 non-null object
8 Content Rating 10840 non-null object
9 Genres 10841 non-null object
10 Last Updated 10841 non-null object
11 Current Ver 10833 non-null object
12 Android Ver 10838 non-null object
dtypes: float64(1), object(12)
memory usage: 1.1+ MB
10841
13
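The cells that load and inspect the data are not shown in this export. The following is a minimal sketch of that step, not the notebook's own code. The file name `googleplaystore.csv` and the variable `df_demo` are assumptions, and a tiny inline CSV stands in for the real file so the snippet runs on its own.

```python
import io

import pandas as pd

# Tiny inline stand-in for the real CSV (the real file name is an assumption).
csv_text = """App,Category,Rating,Reviews,Size,Installs,Type,Price
Sketch - Draw & Paint,ART_AND_DESIGN,4.5,215644,25M,"50,000,000+",Free,0
Coloring book moana,ART_AND_DESIGN,3.9,967,14M,"500,000+",Free,0
"""

# With the real file this would be something like pd.read_csv("googleplaystore.csv").
df_demo = pd.read_csv(io.StringIO(csv_text))

# shape is a (rows, columns) tuple; the two bare numbers above correspond to it.
n_rows, n_cols = df_demo.shape
print(n_rows, n_cols)
```

On the full dataset the same two values are 10841 and 13, as shown in the `info()` output above.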
In [10]: # describe() gives a quick overview of the distribution of the values in the numeric columns.
# This helps in understanding the data and spotting potential problems.
df.describe()
Out[10]: Rating
count 9367.000000
mean 4.193338
std 0.537431
min 1.000000
25% 4.000000
50% 4.300000
75% 4.500000
max 19.000000
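The maximum of 19.0 stands out, because Play Store star ratings run from 1 to 5. A small sketch of how such values could be flagged is shown here; the sample `ratings` series is made up for illustration and is not the notebook's data.

```python
import pandas as pd

# Made-up sample: one value (19.0) lies outside the valid 1-5 range.
ratings = pd.Series([4.1, 3.9, 19.0, None, 5.0], name="Rating")

# Comparisons with NaN are False, so missing ratings are not flagged here.
out_of_range = ratings[(ratings < 1) | (ratings > 5)]
print(out_of_range)
```

In this notebook the single out-of-range rating belongs to row 10472, which is examined further below.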
Out[12]:
App 0
Category 0
Rating 1474
Reviews 0
Size 0
Installs 0
Type 1
Price 0
Content Rating 1
Genres 0
Last Updated 0
Current Ver 8
Android Ver 3
dtype: int64
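The counts above are the kind of result `isnull().sum()` returns (the input cell is not shown in this export). As a hedged sketch, the same information can be paired with a percentage per column; the frame `demo` and the name `summary` are hypothetical.

```python
import numpy as np
import pandas as pd

# Made-up frame with a few missing values.
demo = pd.DataFrame({
    "App": ["a", "b", "c", "d"],
    "Rating": [4.1, np.nan, 3.9, np.nan],
    "Type": ["Free", "Paid", None, "Free"],
})

missing_count = demo.isnull().sum()        # number of missing values per column
missing_pct = demo.isnull().mean() * 100   # share of missing values per column, in percent

summary = pd.DataFrame({"missing": missing_count, "percent": missing_pct})
print(summary)
```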
Out[14]:
['App',
'Category',
'Rating',
'Reviews',
'Size',
'Installs',
'Type',
'Price',
'Content Rating',
'Genres',
'Last Updated',
'Current Ver',
'Android Ver']
In [15]: # Checking the value counts of all the columns in the dataset.
for i in df.columns.to_list():
    print("*****************",i,"*****************")
    print(df[i].value_counts())
3.5 163
3.4 128
3.3 102
4.9 87
3.0 83
3.1 69
3.2 64
2.9 45
2.8 42
2.7 25
2.6 25
2.5 21
2.3 20
2.4 19
1.0 16
2.2 14
1.9 13
2.0 12
1.7 8
1.8 8
2.1 8
1.6 4
1.4 3
1.5 3
1.2 1
19.0 1
Name: Rating, dtype: int64
***************** Reviews *****************
0 596
1 272
2 214
3 175
4 137
...
342912 1
4272 1
5517 1
4057 1
398307 1
Name: Reviews, Length: 6002, dtype: int64
***************** Size *****************
Varies with device 1695
11M 198
12M 196
14M 194
13M 191
...
429k 1
200k 1
460k 1
728k 1
619k 1
Name: Size, Length: 462, dtype: int64
***************** Installs *****************
1,000,000+ 1579
10,000,000+ 1252
100,000+ 1169
10,000+ 1054
1,000+ 907
5,000,000+ 752
100+ 719
500,000+ 539
50,000+ 479
5,000+ 477
100,000,000+ 409
10+ 386
500+ 330
50,000,000+ 289
50+ 205
5+ 82
500,000,000+ 72
1+ 67
1,000,000,000+ 58
0+ 14
0 1
Free 1
Name: Installs, dtype: int64
***************** Type *****************
Free 10039
Paid 800
0 1
Name: Type, dtype: int64
***************** Price *****************
0 10040
$0.99 148
$2.99 129
$1.99 73
$4.99 72
...
$1.75 1
$14.00 1
$4.85 1
$46.99 1
$1.04 1
Name: Price, Length: 93, dtype: int64
***************** Content Rating *****************
Everyone 8714
Teen 1208
Mature 17+ 499
Everyone 10+ 414
Adults only 18+ 3
Unrated 2
Name: Content Rating, dtype: int64
***************** Genres *****************
Tools 842
Entertainment 623
Education 549
Medical 463
Business 460
...
Arcade;Pretend Play 1
Card;Brain Games 1
Lifestyle;Pretend Play 1
Comics;Creativity 1
Strategy;Creativity 1
Name: Genres, Length: 120, dtype: int64
***************** Last Updated *****************
August 3, 2018 326
August 2, 2018 304
July 31, 2018 294
August 1, 2018 285
July 30, 2018 211
...
March 20, 2014 1
April 7, 2015 1
September 22, 2014 1
October 3, 2015 1
March 23, 2014 1
In [16]: # Checking which columns have the object dtype
df.dtypes == "object"
Out[16]:
App True
Category True
Rating False
Reviews True
Size True
Installs True
Type True
Price True
Content Rating True
Genres True
Last Updated True
Current Ver True
Android Ver True
dtype: bool
App
Category
Reviews
Size
Installs
Type
Price
Content Rating
Genres
Last Updated
Current Ver
Android Ver
Rating
In [19]: df.head(2)
Out[19]:
                                              App        Category  Rating Reviews Size  Installs  Type Price Content Rating  ...
0  Photo Editor & Candy Camera & Grid & ScrapBook  ART_AND_DESIGN     4.1     159  19M   10,000+  Free     0       Everyone  ...
1                             Coloring book moana  ART_AND_DESIGN     3.9     967  14M  500,000+  Free     0       Everyone  ...
In [20]: df["Reviews"].dtype
Out[20]: dtype('O')
In [21]: df.Reviews.str
Out[21]: <pandas.core.strings.accessor.StringMethods at 0x15d5dd90130>
In [22]: df["Reviews"].str
Out[22]: <pandas.core.strings.accessor.StringMethods at 0x15d5dd90130>
In [23]: df.Reviews.str.isnumeric().sum()
Out[23]: 10840
In [24]: df.Reviews.shape
Out[24]: (10841,)
In [25]: df[~df.Reviews.str.isnumeric()]
Out[25]:
                                           App Category  Rating Reviews    Size Installs Type     Price Content Rating  ...
10472  Life Made WI-Fi Touchscreen Photo Frame      1.9    19.0    3.0M  1,000+     Free    0  Everyone            NaN  ...
In [26]: print(df.Reviews.str)
In [29]: df_copy.shape
Out[29]: (10840, 13)
In [30]: df_copy["Reviews"].dtype
Out[30]: dtype('O')
In [32]: df_copy["Reviews"].dtype
Out[32]: dtype('int32')
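The cells that produced `df_copy` and converted `Reviews` to an integer type are not shown in this export. One plausible way to get from the object column to an integer column is sketched below; the sample frame `demo` is made up, and this is an assumption about the approach rather than the notebook's actual code.

```python
import pandas as pd

# Made-up sample: the middle row has a non-numeric Reviews value, like row 10472.
demo = pd.DataFrame({"App": ["A", "B", "C"], "Reviews": ["159", "3.0M", "967"]})

# Keep only rows whose Reviews string is purely numeric, working on a copy.
is_numeric = demo["Reviews"].str.isnumeric()
demo_copy = demo[is_numeric].copy()

# With the bad row gone, the column can be cast to an integer dtype.
demo_copy["Reviews"] = demo_copy["Reviews"].astype(int)
print(demo_copy)
```

The exact integer width depends on the platform, which is consistent with `int32` appearing in the output above.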
In [33]: df_copy.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 10840 entries, 0 to 10840
Data columns (total 13 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 App 10840 non-null object
1 Category 10840 non-null object
2 Rating 9366 non-null float64
3 Reviews 10840 non-null int32
4 Size 10840 non-null object
5 Installs 10840 non-null object
6 Type 10839 non-null object
7 Price 10840 non-null object
8 Content Rating 10840 non-null object
9 Genres 10840 non-null object
10 Last Updated 10840 non-null object
11 Current Ver 10832 non-null object
12 Android Ver 10838 non-null object
dtypes: float64(1), int32(1), object(11)
memory usage: 1.1+ MB
In [34]: df_copy.head(2)
Out[34]:
                                              App        Category  Rating  Reviews Size  Installs  Type Price Content Rating  ...
0  Photo Editor & Candy Camera & Grid & ScrapBook  ART_AND_DESIGN     4.1      159  19M   10,000+  Free     0       Everyone  ...
1                             Coloring book moana  ART_AND_DESIGN     3.9      967  14M  500,000+  Free     0       Everyone  ...
In [36]: np.nan
Out[36]: nan
In [38]: df_copy["Size"].dtype
Out[38]: dtype('O')
In [43]: df_copy["Size"][2]*1000
Out[43]: 8700.0
In [45]: df_copy["Size"].head()
Out[45]:
0 19000.0
1 14000.0
2 8700.0
3 25000.0
4 2800.0
Name: Size, dtype: float64
In [47]: df_copy["Size"].head()
Out[47]:
0 19.0
1 14.0
2 8.7
3 25.0
4 2.8
Name: Size, dtype: float64
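The intermediate cells that cleaned `Size` are missing from this export. The outputs suggest the end result is a size in megabytes, with "Varies with device" turned into NaN. A minimal sketch of one way to do that follows; the helper `size_to_mb` is hypothetical and the direct conversion is an assumption, since the notebook appears to go through kilobytes first.

```python
import numpy as np
import pandas as pd


def size_to_mb(size):
    """Convert strings such as '19M' or '784k' to megabytes; anything else becomes NaN."""
    if isinstance(size, str):
        if size.endswith("M"):
            return float(size[:-1])
        if size.endswith("k"):
            return float(size[:-1]) / 1000
    return np.nan


sizes = pd.Series(["19M", "8.7M", "784k", "Varies with device"], name="Size")
size_mb = sizes.map(size_to_mb)
print(size_mb)
```

Treating "Varies with device" as missing matches the later `info()` output, where `Size` has 9145 non-null values out of 10840.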
In [48]: df_copy.head()
Out[48]:
                                                 App        Category  Rating  Reviews  Size     Installs  Type Price Content Rating  ...
0     Photo Editor & Candy Camera & Grid & ScrapBook  ART_AND_DESIGN     4.1      159  19.0      10,000+  Free     0       Everyone  ...
1                                Coloring book moana  ART_AND_DESIGN     3.9      967  14.0     500,000+  Free     0       Everyone  ...
2  U Launcher Lite – FREE Live Cool Themes, Hide ...  ART_AND_DESIGN     4.7    87510   8.7   5,000,000+  Free     0       Everyone  ...
3                              Sketch - Draw & Paint  ART_AND_DESIGN     4.5   215644  25.0  50,000,000+  Free     0           Teen  ...
4              Pixel Draw - Number Art Coloring Book  ART_AND_DESIGN     4.3      967   2.8     100,000+  Free     0       Everyone  ...
In [56]: df_copy.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 10840 entries, 0 to 10840
Data columns (total 13 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 App 10840 non-null object
1 Category 10840 non-null object
2 Rating 9366 non-null float64
3 Reviews 10840 non-null int32
4 Size 9145 non-null float64
5 Installs 10840 non-null float64
6 Type 10839 non-null object
7 Price 10840 non-null float64
8 Content Rating 10840 non-null object
9 Genres 10840 non-null object
10 Last Updated 10840 non-null object
11 Current Ver 10832 non-null object
12 Android Ver 10838 non-null object
dtypes: float64(4), int32(1), object(8)
memory usage: 1.4+ MB
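`Installs` and `Price` now show as `float64`, but the cells that converted them are not in this export. A hedged sketch of a typical conversion is given below, assuming the only non-numeric characters are `,`, `+` and `$` as seen in the value counts earlier; the frame `demo` is made up.

```python
import pandas as pd

demo = pd.DataFrame({
    "Installs": ["10,000+", "5,000,000+", "0"],
    "Price": ["0", "$4.99", "$0.99"],
})

# Strip the thousands separators and the trailing '+' before casting to float.
demo["Installs"] = (
    demo["Installs"]
    .str.replace(",", "", regex=False)
    .str.replace("+", "", regex=False)
    .astype(float)
)

# Strip the currency symbol from Price before casting to float.
demo["Price"] = demo["Price"].str.replace("$", "", regex=False).astype(float)
print(demo)
```

Passing `regex=False` matters here because `+` and `$` are special characters in regular expressions.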
In [58]: df_copy.head()
Out[58]:
                                                 App        Category  Rating  Reviews  Size    Installs  Type  Price Content Rating  ...
0     Photo Editor & Candy Camera & Grid & ScrapBook  ART_AND_DESIGN     4.1      159  19.0     10000.0  Free    0.0       Everyone  ...
1                                Coloring book moana  ART_AND_DESIGN     3.9      967  14.0    500000.0  Free    0.0       Everyone  ...
2  U Launcher Lite – FREE Live Cool Themes, Hide ...  ART_AND_DESIGN     4.7    87510   8.7   5000000.0  Free    0.0       Everyone  ...
3                              Sketch - Draw & Paint  ART_AND_DESIGN     4.5   215644  25.0  50000000.0  Free    0.0           Teen  ...
4              Pixel Draw - Number Art Coloring Book  ART_AND_DESIGN     4.3      967   2.8    100000.0  Free    0.0       Everyone  ...
Out[59]:
0 January 7, 2018
1 January 15, 2018
2 August 1, 2018
3 June 8, 2018
4 June 20, 2018
...
10836 July 25, 2017
10837 July 6, 2018
10838 January 20, 2017
10839 January 19, 2015
10840 July 25, 2018
Name: Last Updated, Length: 10840, dtype: object
Out[60]: dtype('O')
In [62]: df_copy.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 10840 entries, 0 to 10840
Data columns (total 13 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 App 10840 non-null object
1 Category 10840 non-null object
2 Rating 9366 non-null float64
3 Reviews 10840 non-null int32
4 Size 9145 non-null float64
5 Installs 10840 non-null float64
6 Type 10839 non-null object
7 Price 10840 non-null float64
8 Content Rating 10840 non-null object
9 Genres 10840 non-null object
10 Last Updated 10840 non-null datetime64[ns]
11 Current Ver 10832 non-null object
12 Android Ver 10838 non-null object
dtypes: datetime64[ns](1), float64(4), int32(1), object(7)
memory usage: 1.4+ MB
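`Last Updated` has changed from `object` to `datetime64[ns]`; the conversion cell itself is not shown. A minimal sketch with `pd.to_datetime` follows, assuming all dates use the "Month day, year" layout seen above and that month names are parsed in an English locale.

```python
import pandas as pd

dates = pd.Series(["January 7, 2018", "August 1, 2018"], name="Last Updated")

# An explicit format avoids per-value guessing; %B is the full month name.
parsed = pd.to_datetime(dates, format="%B %d, %Y")
print(parsed)
```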
In [66]: df_copy.head()
Out[66]:
                                                 App        Category  Rating  Reviews  Size    Installs  Type  Price Content Rating  ...
0     Photo Editor & Candy Camera & Grid & ScrapBook  ART_AND_DESIGN     4.1      159  19.0     10000.0  Free    0.0       Everyone  ...
1                                Coloring book moana  ART_AND_DESIGN     3.9      967  14.0    500000.0  Free    0.0       Everyone  ...
2  U Launcher Lite – FREE Live Cool Themes, Hide ...  ART_AND_DESIGN     4.7    87510   8.7   5000000.0  Free    0.0       Everyone  ...
3                              Sketch - Draw & Paint  ART_AND_DESIGN     4.5   215644  25.0  50000000.0  Free    0.0           Teen  ...
4              Pixel Draw - Number Art Coloring Book  ART_AND_DESIGN     4.3      967   2.8    100000.0  Free    0.0       Everyone  ...
In [67]: df_copy.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 10840 entries, 0 to 10840
Data columns (total 16 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 App 10840 non-null object
1 Category 10840 non-null object
2 Rating 9366 non-null float64
3 Reviews 10840 non-null int32
4 Size 9145 non-null float64
5 Installs 10840 non-null float64
6 Type 10839 non-null object
7 Price 10840 non-null float64
8 Content Rating 10840 non-null object
9 Genres 10840 non-null object
10 Last Updated 10840 non-null datetime64[ns]
11 Current Ver 10832 non-null object
12 Android Ver 10838 non-null object
13 Day 10840 non-null int64
14 Month 10840 non-null int64
15 Year 10840 non-null int64
dtypes: datetime64[ns](1), float64(4), int32(1), int64(3), object(7)
memory usage: 1.6+ MB
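Three new integer columns, `Day`, `Month` and `Year`, appear here. Assuming they were derived from `Last Updated`, the `.dt` accessor gives a short way to produce them; the frame `demo` below is a made-up stand-in.

```python
import pandas as pd

demo = pd.DataFrame({"Last Updated": pd.to_datetime(["2018-01-07", "2018-08-01"])})

# The .dt accessor exposes the date components of a datetime column.
demo["Day"] = demo["Last Updated"].dt.day
demo["Month"] = demo["Last Updated"].dt.month
demo["Year"] = demo["Last Updated"].dt.year
print(demo)
```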
In [70]: df_copy.head(2)
Photo
Editor &
Candy
0 0 ART_AND_DESIGN 4.1 159 19.0 10000.0 Free 0.0 Everyone
Camera &
Grid &
ScrapBook
Coloring
1 1 book ART_AND_DESIGN 3.9 967 14.0 500000.0 Free 0.0 Everyone
moana
In [72]: df_copy.sample(2)
Out[72]: Content
App Category Rating Reviews Size Installs Type Price Genres
Rating U
Code on the 2
9416 TOOLS 3.9 146 0.784 10000.0 Free 0.0 Everyone Tools
egg
EC -
Encumbrance
2
9230 Search - BUSINESS 3.4 45 5.500 10000.0 Free 0.0 Everyone Business
telangana
state
In [73]: df_copy.head(2)
Out[73]:
                                              App        Category  Rating  Reviews  Size  Installs  Type  Price Content Rating  ...
0  Photo Editor & Candy Camera & Grid & ScrapBook  ART_AND_DESIGN     4.1      159  19.0   10000.0  Free    0.0       Everyone  ...
1                             Coloring book moana  ART_AND_DESIGN     3.9      967  14.0  500000.0  Free    0.0       Everyone  ...
In [74]: df_copy.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 10840 entries, 0 to 10839
Data columns (total 16 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 App 10840 non-null object
1 Category 10840 non-null object
2 Rating 9366 non-null float64
3 Reviews 10840 non-null int64
4 Size 9145 non-null float64
5 Installs 10840 non-null float64
6 Type 10839 non-null object
7 Price 10840 non-null float64
8 Content Rating 10840 non-null object
9 Genres 10840 non-null object
10 Last Updated 10840 non-null object
11 Current Ver 10832 non-null object
12 Android Ver 10838 non-null object
13 Day 10840 non-null int64
14 Month 10840 non-null int64
15 Year 10840 non-null int64
dtypes: float64(4), int64(4), object(8)
memory usage: 1.3+ MB
In [75]: df_copy.shape
Out[75]: (10840, 16)
In [76]: df_copy.isnull().sum()
Out[76]:
App 0
Category 0
Rating 1474
Reviews 0
Size 1695
Installs 0
Type 1
Price 0
Content Rating 0
Genres 0
Last Updated 0
Current Ver 8
Android Ver 2
Day 0
Month 0
Year 0
dtype: int64
In [77]: df_copy.duplicated()
Out[77]:
0 False
1 False
2 False
3 False
4 False
...
10835 False
10836 False
10837 False
10838 False
10839 False
Length: 10840, dtype: bool
Out[78]: Content
App Category Rating Reviews Size Installs Type Price
Rating
Quick PDF
229 Scanner + BUSINESS 4.2 80805 NaN 5000000.0 Free 0.0 Everyone
OCR FREE
236 Box BUSINESS 4.2 159872 NaN 10000000.0 Free 0.0 Everyone
Google My
239 BUSINESS 4.4 70991 NaN 5000000.0 Free 0.0 Everyone
Business
ZOOM
256 Cloud BUSINESS 4.4 31614 37.0 10000000.0 Free 0.0 Everyone
Meetings
join.me -
261 Simple BUSINESS 4.0 6989 NaN 1000000.0 Free 0.0 Everyone
Meetings
... ... ... ... ... ... ... ... ... ...
Wunderlist:
8643 To-Do List PRODUCTIVITY 4.6 404610 NaN 10000000.0 Free 0.0 Everyone Pro
& Tasks
TickTick: To
Do List with
8654 PRODUCTIVITY 4.6 25370 NaN 1000000.0 Free 0.0 Everyone Pro
Reminder,
Day Planner
ColorNote
8658 Notepad PRODUCTIVITY 4.6 2401017 NaN 100000000.0 Free 0.0 Everyone Pro
Notes
Airway Ex -
Intubate.
10049 MEDICAL 4.3 123 86.0 10000.0 Free 0.0 Everyone
Anesthetize.
Train.
In [80]: df_copy.shape
Out[80]: (10357, 16)
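The row count drops from 10840 to 10357 here, which is consistent with duplicated rows being removed, although the cell that does so is not shown. A minimal sketch of the usual pattern on a made-up frame:

```python
import pandas as pd

demo = pd.DataFrame({
    "App": ["Box", "Box", "Gmail"],
    "Reviews": [159872, 159872, 4604324],
})

# duplicated() marks every repeat of an earlier, fully identical row.
dup_mask = demo.duplicated()

# drop_duplicates() keeps the first occurrence of each identical row.
deduped = demo.drop_duplicates()
print(dup_mask.sum(), deduped.shape)
```

Only rows that match in every column are dropped. Rows for the same app that differ in any column stay, which fits apps such as ROBLOX still appearing several times in the `value_counts()` output further down.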
In [82]: cat
Out[82]:
['App',
'Category',
'Type',
'Content Rating',
'Genres',
'Last Updated',
'Current Ver',
'Android Ver']
In [84]: num
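The lists `cat` and `num` hold the categorical and numerical column names; how they were built is not shown in this export. One hedged way to split columns by dtype is `select_dtypes`; the names `num_cols` and `cat_cols` and the sample frame are hypothetical.

```python
import pandas as pd

demo = pd.DataFrame({
    "App": ["a", "b"],
    "Type": ["Free", "Paid"],
    "Rating": [4.1, 3.9],
    "Reviews": [159, 967],
})

# Numeric columns (ints and floats) versus everything else.
num_cols = demo.select_dtypes(include="number").columns.tolist()
cat_cols = demo.select_dtypes(exclude="number").columns.tolist()
print(num_cols, cat_cols)
```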
Data Proportion
In [85]: df_copy["App"].value_counts()
Out[85]:
ROBLOX 9
8 Ball Pool 7
Bubble Shooter 6
Helix Jump 6
Zombie Catchers 6
..
Popsicle Launcher for Android P 9.0 launcher 1
PixelLab - Text on pictures 1
P Launcher for Android™ 9.0 1
Pacify (Android P theme) - Theme for Xperia™ 1
iHoroscope - 2018 Daily Horoscope & Astrology 1
Name: App, Length: 9659, dtype: int64
Out[86]:
ROBLOX 0.000869
8 Ball Pool 0.000676
Bubble Shooter 0.000579
Helix Jump 0.000579
Zombie Catchers 0.000579
...
Popsicle Launcher for Android P 9.0 launcher 0.000097
PixelLab - Text on pictures 0.000097
P Launcher for Android™ 9.0 0.000097
Pacify (Android P theme) - Theme for Xperia™ 0.000097
iHoroscope - 2018 Daily Horoscope & Astrology 0.000097
Name: App, Length: 9659, dtype: float64
Out[87]:
FAMILY 1943
GAME 1121
TOOLS 843
BUSINESS 427
MEDICAL 408
PRODUCTIVITY 407
PERSONALIZATION 388
LIFESTYLE 373
COMMUNICATION 366
FINANCE 360
SPORTS 351
PHOTOGRAPHY 322
HEALTH_AND_FITNESS 306
SOCIAL 280
NEWS_AND_MAGAZINES 264
TRAVEL_AND_LOCAL 237
BOOKS_AND_REFERENCE 230
SHOPPING 224
DATING 196
VIDEO_PLAYERS 175
MAPS_AND_NAVIGATION 137
EDUCATION 130
FOOD_AND_DRINK 124
ENTERTAINMENT 111
AUTO_AND_VEHICLES 85
LIBRARIES_AND_DEMO 85
WEATHER 82
HOUSE_AND_HOME 80
ART_AND_DESIGN 65
EVENTS 64
PARENTING 60
COMICS 60
BEAUTY 53
Name: Category, dtype: int64
In [88]: df_copy["Category"].value_counts().sum()
Out[88]: 10357
Out[89]:
FAMILY 18.760259
GAME 10.823598
TOOLS 8.139423
BUSINESS 4.122815
MEDICAL 3.939365
PRODUCTIVITY 3.929709
PERSONALIZATION 3.746259
LIFESTYLE 3.601429
COMMUNICATION 3.533842
FINANCE 3.475910
SPORTS 3.389012
PHOTOGRAPHY 3.109008
HEALTH_AND_FITNESS 2.954524
SOCIAL 2.703486
NEWS_AND_MAGAZINES 2.549001
TRAVEL_AND_LOCAL 2.288307
BOOKS_AND_REFERENCE 2.220720
SHOPPING 2.162788
DATING 1.892440
VIDEO_PLAYERS 1.689678
MAPS_AND_NAVIGATION 1.322777
EDUCATION 1.255190
FOOD_AND_DRINK 1.197258
ENTERTAINMENT 1.071739
AUTO_AND_VEHICLES 0.820701
LIBRARIES_AND_DEMO 0.820701
WEATHER 0.791735
HOUSE_AND_HOME 0.772424
ART_AND_DESIGN 0.627595
EVENTS 0.617940
PARENTING 0.579318
COMICS 0.579318
BEAUTY 0.511731
Name: Category, dtype: float64
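The percentages above match each category count divided by the 10357 rows, for example 1943 / 10357 ≈ 18.76% for FAMILY. The input cell is not shown, so the following is a sketch of one way to get such shares with `value_counts(normalize=True)` on made-up data.

```python
import pandas as pd

categories = pd.Series(["FAMILY", "FAMILY", "GAME", "TOOLS"], name="Category")

counts = categories.value_counts()                       # absolute counts
percent = categories.value_counts(normalize=True) * 100  # share of rows, in percent
print(percent)
```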
Out[91]: <AxesSubplot:xlabel='Rating', ylabel='Density'>
In [93]: plt.figure(figsize=(15,15))
plt.suptitle("Univariate Analysis of Numerical Features", fontsize = 20)
In [94]: num_df.isnull().sum()
Out[94]:
Rating 1465
Reviews 0
Size 1526
Installs 0
Price 0
Day 0
Month 0
Year 0
dtype: int64
In [96]: cat_df["Type"].value_counts()
Out[96]:
Free 9591
Paid 765
Name: Type, dtype: int64
Out[97]: <AxesSubplot:xlabel='Type', ylabel='count'>
In [98]: cat_df.columns
In [99]: df_copy.sample(10)
Out[99]: Conten
App Category Rating Reviews Size Installs Type Price
Ratin
German
9973 Vocabulary FAMILY 3.3 1218 1.0 100000.0 Free 0.00 Everyon
Trainer
C
3988 FAMILY 4.3 22248 1.8 1000000.0 Free 0.00 Everyon
Programming
Matu
4316 Anna.K Tarot FAMILY 4.8 17 23.0 100.0 Paid 3.99
17
CM FILE
3513 MANAGER PRODUCTIVITY 4.3 144879 NaN 10000000.0 Free 0.00 Everyon
HD
The
7800 ClubHouse HEALTH_AND_FITNESS NaN 5 8.7 100.0 Free 0.00 Everyon
CR
Five Nights at
5648 GAME 4.7 27856 50.0 100000.0 Paid 2.99 Tee
Freddy's 3
Grubhub:
1193 Food FOOD_AND_DRINK 4.5 155944 35.0 5000000.0 Free 0.00 Everyon
Delivery
Jewels Crush-
2017 Match 3 FAMILY 4.4 14774 19.0 1000000.0 Free 0.00 Everyon
Puzzle
Dr. Panda
8800 FAMILY 4.3 3725 67.0 100000.0 Paid 2.99 Everyon
Restaurant 2
CP Smart
7694 PERSONALIZATION NaN 1 3.9 10.0 Free 0.00 Everyon
Check List
In [101]: cat_df["Category"].value_counts()
Out[101]:
FAMILY 1943
GAME 1121
TOOLS 843
BUSINESS 427
MEDICAL 408
PRODUCTIVITY 407
PERSONALIZATION 388
LIFESTYLE 373
COMMUNICATION 366
FINANCE 360
SPORTS 351
PHOTOGRAPHY 322
HEALTH_AND_FITNESS 306
SOCIAL 280
NEWS_AND_MAGAZINES 264
TRAVEL_AND_LOCAL 237
BOOKS_AND_REFERENCE 230
SHOPPING 224
DATING 196
VIDEO_PLAYERS 175
MAPS_AND_NAVIGATION 137
EDUCATION 130
FOOD_AND_DRINK 124
ENTERTAINMENT 111
AUTO_AND_VEHICLES 85
LIBRARIES_AND_DEMO 85
WEATHER 82
HOUSE_AND_HOME 80
ART_AND_DESIGN 65
EVENTS 64
PARENTING 60
COMICS 60
BEAUTY 53
Name: Category, dtype: int64
In [102]: cat_df["Category"].value_counts().plot.pie(figsize=(10,10))
Out[102]: <AxesSubplot:ylabel='Category'>
In [103]: cat_df["Category"].value_counts().head()
Out[103]:
FAMILY 1943
GAME 1121
TOOLS 843
BUSINESS 427
MEDICAL 408
Name: Category, dtype: int64
In [105]: category
Out[105]: Category
FAMILY 1943
GAME 1121
TOOLS 843
BUSINESS 427
MEDICAL 408
PRODUCTIVITY 407
PERSONALIZATION 388
LIFESTYLE 373
COMMUNICATION 366
FINANCE 360
Out[106]: <AxesSubplot:>
Out[107]: <AxesSubplot:ylabel='Frequency'>
In [109]: category.head()
Out[109]: Count
FAMILY 1943
GAME 1121
TOOLS 843
BUSINESS 427
MEDICAL 408
In [110]: plt.figure(figsize=(20,15))
plt.xticks(rotation=45)
plt.title("Top 10 categories")
sns.barplot(x = category.index[:10],y = "Count",data=category[:10])
Out[111]: 1000000000.0
In [112]: # Apps with the maximum number of installs
df_copy[df_copy["Installs"] == df_copy["Installs"].max()]
Out[112]: Co
App Category Rating Reviews Size Installs Type Price
R
Google
152 BOOKS_AND_REFERENCE 3.9 1433233 NaN 1.000000e+09 Free 0.0
Play Books
Messenger
– Text and
335 Video COMMUNICATION 4.0 56642847 NaN 1.000000e+09 Free 0.0 Eve
Chat for
Free
WhatsApp
336 COMMUNICATION 4.4 69119316 NaN 1.000000e+09 Free 0.0 Eve
Messenger
Google
Chrome:
338 COMMUNICATION 4.3 9642995 NaN 1.000000e+09 Free 0.0 Eve
Fast &
Secure
340 Gmail COMMUNICATION 4.3 4604324 NaN 1.000000e+09 Free 0.0 Eve
341 Hangouts COMMUNICATION 4.0 3419249 NaN 1.000000e+09 Free 0.0 Eve
Messenger
– Text and
382 Video COMMUNICATION 4.0 56646578 NaN 1.000000e+09 Free 0.0 Eve
Chat for
Free
386 Hangouts COMMUNICATION 4.0 3419433 NaN 1.000000e+09 Free 0.0 Eve
Skype -
391 free IM & COMMUNICATION 4.1 10484169 NaN 1.000000e+09 Free 0.0 Eve
video calls
Google
Chrome:
411 COMMUNICATION 4.3 9643041 NaN 1.000000e+09 Free 0.0 Eve
Fast &
Secure
451 Gmail COMMUNICATION 4.3 4604483 NaN 1.000000e+09 Free 0.0 Eve
464 Hangouts COMMUNICATION 4.0 3419513 NaN 1.000000e+09 Free 0.0 Eve
Google
865 Play ENTERTAINMENT 4.3 7165362 NaN 1.000000e+09 Free 0.0
Games
Subway Eve
1654 GAME 4.5 27722264 76.0 1.000000e+09 Free 0.0
Surfers
Subway Eve
1700 GAME 4.5 27723193 76.0 1.000000e+09 Free 0.0
Surfers
Subway Eve
1750 GAME 4.5 27724094 76.0 1.000000e+09 Free 0.0
Surfers
Subway Eve
1872 GAME 4.5 27725352 76.0 1.000000e+09 Free 0.0
Surfers
Google
2808 PHOTOGRAPHY 4.5 10858556 NaN 1.000000e+09 Free 0.0 Eve
Photos
Google
2853 PHOTOGRAPHY 4.5 10858538 NaN 1.000000e+09 Free 0.0 Eve
Photos
Google
2884 PHOTOGRAPHY 4.5 10859051 NaN 1.000000e+09 Free 0.0 Eve
Photos
Maps -
3117 Navigate TRAVEL_AND_LOCAL 4.3 9235155 NaN 1.000000e+09 Free 0.0 Eve
& Explore
Google
3127 Street TRAVEL_AND_LOCAL 4.2 2129689 NaN 1.000000e+09 Free 0.0 Eve
View
Maps -
3223 Navigate TRAVEL_AND_LOCAL 4.3 9235373 NaN 1.000000e+09 Free 0.0 Eve
& Explore
Google
3232 Street TRAVEL_AND_LOCAL 4.2 2129707 NaN 1.000000e+09 Free 0.0 Eve
View
3234 Google TOOLS 4.4 8033493 NaN 1.000000e+09 Free 0.0 Eve
Google
3454 PRODUCTIVITY 4.4 2731171 NaN 1.000000e+09 Free 0.0 Eve
Drive
Google
3523 PRODUCTIVITY 4.4 2731211 NaN 1.000000e+09 Free 0.0 Eve
Drive
Google
Play
3687 VIDEO_PLAYERS 3.7 906384 NaN 1.000000e+09 Free 0.0
Movies &
TV
Google
3736 NEWS_AND_MAGAZINES 3.9 877635 13.0 1.000000e+09 Free 0.0
News
Google
3816 NEWS_AND_MAGAZINES 3.9 877643 13.0 1.000000e+09 Free 0.0
News
Subway Eve
3896 GAME 4.5 27711703 76.0 1.000000e+09 Free 0.0
Surfers
WhatsApp
3904 COMMUNICATION 4.4 69109672 NaN 1.000000e+09 Free 0.0 Eve
Messenger
Google
Chrome:
3996 COMMUNICATION 4.3 9642112 NaN 1.000000e+09 Free 0.0 Eve
Fast &
Secure
Maps -
4098 Navigate TRAVEL_AND_LOCAL 4.3 9231613 NaN 1.000000e+09 Free 0.0 Eve
& Explore
4150 Google TOOLS 4.4 8021623 NaN 1.000000e+09 Free 0.0 Eve
4153 Hangouts COMMUNICATION 4.0 3419464 NaN 1.000000e+09 Free 0.0 Eve
Google
4170 PRODUCTIVITY 4.4 2728941 NaN 1.000000e+09 Free 0.0 Eve
Drive
Google
5395 PHOTOGRAPHY 4.5 10847682 NaN 1.000000e+09 Free 0.0 Eve
Photos
Google
5856 Play FAMILY 4.3 7168735 NaN 1.000000e+09 Free 0.0
Games
Google
9844 NEWS_AND_MAGAZINES 3.9 878065 13.0 1.000000e+09 Free 0.0
News
In [113]: df_copy.groupby(["Category"])["Installs"]
Out[114]:
Category
GAME 3.154402e+10
COMMUNICATION 2.415228e+10
SOCIAL 1.251387e+10
PRODUCTIVITY 1.246309e+10
TOOLS 1.145277e+10
FAMILY 1.004169e+10
PHOTOGRAPHY 9.721248e+09
TRAVEL_AND_LOCAL 6.361887e+09
VIDEO_PLAYERS 6.222003e+09
NEWS_AND_MAGAZINES 5.393218e+09
SHOPPING 2.573349e+09
ENTERTAINMENT 2.455660e+09
PERSONALIZATION 2.074495e+09
BOOKS_AND_REFERENCE 1.916470e+09
SPORTS 1.528574e+09
HEALTH_AND_FITNESS 1.361023e+09
BUSINESS 8.636649e+08
FINANCE 7.703487e+08
MAPS_AND_NAVIGATION 7.242819e+08
LIFESTYLE 5.348235e+08
EDUCATION 5.339520e+08
WEATHER 4.261005e+08
FOOD_AND_DRINK 2.578988e+08
DATING 2.065361e+08
HOUSE_AND_HOME 1.252125e+08
ART_AND_DESIGN 1.243381e+08
LIBRARIES_AND_DEMO 6.299591e+07
COMICS 5.608615e+07
AUTO_AND_VEHICLES 5.313021e+07
MEDICAL 4.220418e+07
PARENTING 3.152111e+07
BEAUTY 2.719705e+07
EVENTS 1.597316e+07
Name: Installs, dtype: float64
Out[115]:
App
Subway Surfers 5.000000e+09
Google Photos 4.000000e+09
Hangouts 4.000000e+09
Google News 3.000000e+09
Google Chrome: Fast & Secure 3.000000e+09
...
Command & Conquer: Rivals 0.000000e+00
Test Application DT 02 0.000000e+00
AP Series Solution Pro 0.000000e+00
I'm Rich/Eu sou Rico/أنا غني/我很有錢 0.000000e+00
Ak Parti Yardım Toplama 0.000000e+00
Name: Installs, Length: 9659, dtype: float64
In [116]: df_copy.groupby(["Category"])["Installs"].sum().nlargest()
Out[116]:
Category
GAME 3.154402e+10
COMMUNICATION 2.415228e+10
SOCIAL 1.251387e+10
PRODUCTIVITY 1.246309e+10
TOOLS 1.145277e+10
Name: Installs, dtype: float64
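The per-category and per-app totals above follow the split-apply-combine pattern: group, sum, then rank. A small self-contained sketch on made-up data; `installs_by_category` and `top2` are illustrative names.

```python
import pandas as pd

demo = pd.DataFrame({
    "Category": ["GAME", "GAME", "TOOLS", "SOCIAL"],
    "Installs": [1000.0, 500.0, 100.0, 700.0],
})

# Total installs per category, largest first.
installs_by_category = (
    demo.groupby("Category")["Installs"].sum().sort_values(ascending=False)
)

# nlargest(n) returns the n biggest totals; with no argument it defaults to 5.
top2 = installs_by_category.nlargest(2)
print(top2)
```

Apps that still appear on several rows are counted once per row in these sums. That is why Subway Surfers shows 5.0e+09 installs in the per-app totals although each of its rows lists 1.0e+09.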
In [117]: df_copy.groupby(["Category"])["Installs"].sum().nlargest(5).plot.pie()
Out[117]: <AxesSubplot:ylabel='Installs'>
In [118]: df_copy.groupby(["Category"])["Installs"].sum().nlargest(5).plot.bar()
Out[118]: <AxesSubplot:xlabel='Category'>
In [119]: df_copy.groupby(["Category"])["Installs"].sum().nsmallest(5).plot.pie()
Out[119]: <AxesSubplot:ylabel='Installs'>
Out[120]:
                                                 App        Category  Rating  Reviews  Size    Installs  Type  Price Content Rating  ...
0     Photo Editor & Candy Camera & Grid & ScrapBook  ART_AND_DESIGN     4.1      159  19.0     10000.0  Free    0.0       Everyone  ...
1                                Coloring book moana  ART_AND_DESIGN     3.9      967  14.0    500000.0  Free    0.0       Everyone  ...
2  U Launcher Lite – FREE Live Cool Themes, Hide ...  ART_AND_DESIGN     4.7    87510   8.7   5000000.0  Free    0.0       Everyone  ...
3                              Sketch - Draw & Paint  ART_AND_DESIGN     4.5   215644  25.0  50000000.0  Free    0.0           Teen  ...
4              Pixel Draw - Number Art Coloring Book  ART_AND_DESIGN     4.3      967   2.8    100000.0  Free    0.0       Everyone  ...
In [121]: df_copy["Rating"].unique()
Out[121]:
array([4.1, 3.9, 4.7, 4.5, 4.3, 4.4, 3.8, 4.2, 4.6, 3.2, 4. , nan, 4.8,
       4.9, 3.6, 3.7, 3.3, 3.4, 3.5, 3.1, 5. , 2.6, 3. , 1.9, 2.5, 2.8,
       2.7, 1. , 2.9, 2.3, 2.2, 1.7, 2. , 1.8, 2.4, 1.6, 2.1, 1.4, 1.5,
       1.2])
In [122]: df_copy["Rating"] == 5
Out[122]:
0 False
1 False
2 False
3 False
4 False
...
10835 False
10836 True
10837 False
10838 False
10839 False
Name: Rating, Length: 10357, dtype: bool
In [123]: df_copy[df_copy["Rating"] == 5]
Out[123]: Content
App Category Rating Reviews Size Installs Type Price Genres
Rating Upd
Hojiboy
Tojiboyev 201
329 COMICS 5.0 15 37.0 1000.0 Free 0.0 Everyone Comics
Life
Hacks
American
Girls Mature 201
612 DATING 5.0 5 4.4 1000.0 Free 0.0 Dating
Mobile 17+
Numbers
Spine-
The 201
633 DATING 5.0 5 9.3 500.0 Free 0.0 Teen Dating
dating
app
Girls Live
Talk -
Free Text Mature 201
636 DATING 5.0 6 5.0 100.0 Free 0.0 Dating
and 17+
Video
Chat
... ... ... ... ... ... ... ... ... ... ...
Mad
201
10720 Dash Fo' GAME 5.0 14 16.0 100.0 Free 0.0 Everyone Arcade
Cash
GKPB FP
201
10741 Online LIFESTYLE 5.0 32 7.9 1000.0 Free 0.0 Everyone Lifestyle
Church
Monster 201
10775 GAME 5.0 1 24.0 10.0 Free 0.0 Everyone Racing
Ride Pro
Fr. Mike
Schmitz 201
10836 FAMILY 5.0 4 3.6 100.0 Free 0.0 Everyone Education
Audio
Teachings
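The last two cells use boolean masking: the comparison builds a True/False series, and indexing the frame with it keeps only the True rows. A minimal sketch of the same pattern on made-up data, where `is_five` and `five_star` are illustrative names:

```python
import numpy as np
import pandas as pd

demo = pd.DataFrame({
    "App": ["A", "B", "C", "D"],
    "Rating": [5.0, 4.3, np.nan, 5.0],
    "Reviews": [4, 215644, 0, 15],
})

# Element-wise comparison; NaN compares as False, so unrated apps are excluded.
is_five = demo["Rating"] == 5

# Indexing with the mask keeps only the rows where it is True.
five_star = demo[is_five]
print(five_star)
```

In the output above, the apps rated 5.0 that are visible all have very few reviews (between 1 and 32), so a perfect rating here says little about popularity.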