DMV - 4 - Jupyter Notebook
DMV - 4 - Jupyter Notebook
In [6]: data.head()
Out[6]:
City Date PM2.5 PM10 NO NO2 NOx NH3 CO SO2 O3 Benzene
2015-
0 Ahmedabad NaN NaN 0.92 18.22 17.15 NaN 0.92 27.64 133.36 0.00
01-01
2015-
1 Ahmedabad NaN NaN 0.97 15.69 16.46 NaN 0.97 24.55 34.06 3.68
01-02
2015-
2 Ahmedabad NaN NaN 17.40 19.30 29.70 NaN 17.40 29.07 30.70 6.80
01-03
2015-
3 Ahmedabad NaN NaN 1.70 18.48 17.97 NaN 1.70 18.59 36.08 4.43
01-04
2015-
4 Ahmedabad NaN NaN 22.10 21.42 37.76 NaN 22.10 39.33 39.31 7.01
01-05
In [7]: data.columns
Out[7]: Index(['City', 'Date', 'PM2.5', 'PM10', 'NO', 'NO2', 'NOx', 'NH3', 'CO',
'SO2',
'O3', 'Benzene', 'Toluene', 'Xylene', 'AQI', 'AQI_Bucket'],
dtype='object')
In [8]: data.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 29531 entries, 0 to 29530
Data columns (total 16 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 City 29531 non-null object
1 Date 29531 non-null object
2 PM2.5 24933 non-null float64
3 PM10 18391 non-null float64
4 NO 25949 non-null float64
5 NO2 25946 non-null float64
6 NOx 25346 non-null float64
7 NH3 19203 non-null float64
8 CO 27472 non-null float64
9 SO2 25677 non-null float64
10 O3 25509 non-null float64
11 Benzene 23908 non-null float64
12 Toluene 21490 non-null float64
13 Xylene 11422 non-null float64
14 AQI 24850 non-null float64
15 AQI_Bucket 24850 non-null object
dtypes: float64(13), object(3)
memory usage: 3.6+ MB
localhost:8888/notebooks/BE_PRACTICALS/DMV_4.ipynb 1/8
10/6/24, 7:51 PM DMV_4 - Jupyter Notebook
In [9]: data.describe()
Out[9]:
PM2.5 PM10 NO NO2 NOx NH3
In [10]: data.isnull().sum()
Out[10]: City 0
Date 0
PM2.5 4598
PM10 11140
NO 3582
NO2 3585
NOx 4185
NH3 10328
CO 2059
SO2 3854
O3 4022
Benzene 5623
Toluene 8041
Xylene 18109
AQI 4681
AQI_Bucket 4681
dtype: int64
localhost:8888/notebooks/BE_PRACTICALS/DMV_4.ipynb 2/8
10/6/24, 7:51 PM DMV_4 - Jupyter Notebook
localhost:8888/notebooks/BE_PRACTICALS/DMV_4.ipynb 3/8
10/6/24, 7:51 PM DMV_4 - Jupyter Notebook
--------------------------------------------------------------------------
-
TypeError Traceback (most recent call las
t)
Cell In[22], line 14
12 data['Xylene'].fillna(data['Xylene'].mean(), inplace=True)
13 data['AQI'].fillna(data['AQI'].mean(), inplace=True)
---> 14 data['AQI_Bucket'].fillna(data['AQI_Bucket'].mean(), inplace=True)
localhost:8888/notebooks/BE_PRACTICALS/DMV_4.ipynb 5/8
10/6/24, 7:51 PM DMV_4 - Jupyter Notebook
localhost:8888/notebooks/BE_PRACTICALS/DMV_4.ipynb 6/8
10/6/24, 7:51 PM DMV_4 - Jupyter Notebook
localhost:8888/notebooks/BE_PRACTICALS/DMV_4.ipynb 7/8
10/6/24, 7:51 PM DMV_4 - Jupyter Notebook
In [ ]:
In [ ]:
localhost:8888/notebooks/BE_PRACTICALS/DMV_4.ipynb 8/8