The document outlines a data analysis process using a housing dataset, including data loading, cleaning, and preparation for modeling. It employs libraries like pandas, numpy, and sklearn to handle data, perform transformations, and prepare for linear regression analysis. The analysis includes handling missing values, feature engineering, and visualizing correlations among features.


ex2TP1

November 5, 2024

[59]: import pandas as pd


import numpy as np
from matplotlib import pyplot as plt
%matplotlib inline
import matplotlib
matplotlib.rcParams["figure.figsize"]=(20,10)
import seaborn as sns

[60]: import sklearn


from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.preprocessing import StandardScaler

[61]: df1=pd.read_csv("housing.csv")
df1.head()

[61]: longitude latitude housing_median_age total_rooms total_bedrooms \


0 -122.23 37.88 41.0 880.0 129.0
1 -122.22 37.86 21.0 7099.0 1106.0
2 -122.24 37.85 52.0 1467.0 190.0
3 -122.25 37.85 52.0 1274.0 235.0
4 -122.25 37.85 52.0 1627.0 280.0

population households median_income median_house_value ocean_proximity


0 322.0 126.0 8.3252 452600.0 NEAR BAY
1 2401.0 1138.0 8.3014 358500.0 NEAR BAY
2 496.0 177.0 7.2574 352100.0 NEAR BAY
3 558.0 219.0 5.6431 341300.0 NEAR BAY
4 565.0 259.0 3.8462 342200.0 NEAR BAY

[62]: df1.shape

[62]: (20640, 10)

[63]: df1.isnull().sum()

[63]: longitude 0
latitude 0
housing_median_age 0
total_rooms 0
total_bedrooms 207
population 0
households 0
median_income 0
median_house_value 0
ocean_proximity 0
dtype: int64

[64]: df2=df1.dropna()
df2.isnull().sum()

[64]: longitude 0
latitude 0
housing_median_age 0
total_rooms 0
total_bedrooms 0
population 0
households 0
median_income 0
median_house_value 0
ocean_proximity 0
dtype: int64
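
Dropping the 207 incomplete rows is the simplest choice; an alternative, not used in this notebook, is to impute the missing total_bedrooms values. A minimal sketch with scikit-learn's SimpleImputer (median strategy assumed):

from sklearn.impute import SimpleImputer

# Alternative to dropna(): fill missing total_bedrooms with the column median
imputer = SimpleImputer(strategy="median")
df_imputed = df1.copy()
df_imputed[["total_bedrooms"]] = imputer.fit_transform(df_imputed[["total_bedrooms"]])
df_imputed.isnull().sum()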

[65]: from sklearn.model_selection import train_test_split

x=df2.drop(['median_house_value'],axis=1)
y=df2['median_house_value']

[66]: x

[66]: longitude latitude housing_median_age total_rooms total_bedrooms \


0 -122.23 37.88 41.0 880.0 129.0
1 -122.22 37.86 21.0 7099.0 1106.0
2 -122.24 37.85 52.0 1467.0 190.0
3 -122.25 37.85 52.0 1274.0 235.0
4 -122.25 37.85 52.0 1627.0 280.0
… … … … … …
20635 -121.09 39.48 25.0 1665.0 374.0
20636 -121.21 39.49 18.0 697.0 150.0
20637 -121.22 39.43 17.0 2254.0 485.0
20638 -121.32 39.43 18.0 1860.0 409.0
20639 -121.24 39.37 16.0 2785.0 616.0

population households median_income ocean_proximity
0 322.0 126.0 8.3252 NEAR BAY
1 2401.0 1138.0 8.3014 NEAR BAY
2 496.0 177.0 7.2574 NEAR BAY
3 558.0 219.0 5.6431 NEAR BAY
4 565.0 259.0 3.8462 NEAR BAY
… … … … …
20635 845.0 330.0 1.5603 INLAND
20636 356.0 114.0 2.5568 INLAND
20637 1007.0 433.0 1.7000 INLAND
20638 741.0 349.0 1.8672 INLAND
20639 1387.0 530.0 2.3886 INLAND

[20433 rows x 9 columns]

[67]: y

[67]: 0 452600.0
1 358500.0
2 352100.0
3 341300.0
4 342200.0

20635 78100.0
20636 77100.0
20637 92300.0
20638 84700.0
20639 89400.0
Name: median_house_value, Length: 20433, dtype: float64

[68]: x_train,x_test,y_train,y_test = train_test_split(x,y,test_size=0.20)
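
Without a fixed seed the split changes on every run, so the numbers below are not exactly reproducible. A hedged variant (the seed value 42 is an arbitrary example):

# Reproducible 80/20 split; random_state is illustrative, any fixed integer works
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.20, random_state=42)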

[69]: train_data=x_train.join(y_train)
#train_data
#train_data['ocean_proximity'].unique()

[70]: train_data.hist()

[70]: array([[<Axes: title={'center': 'longitude'}>,


<Axes: title={'center': 'latitude'}>,
<Axes: title={'center': 'housing_median_age'}>],
[<Axes: title={'center': 'total_rooms'}>,
<Axes: title={'center': 'total_bedrooms'}>,
<Axes: title={'center': 'population'}>],
[<Axes: title={'center': 'households'}>,
<Axes: title={'center': 'median_income'}>,
<Axes: title={'center': 'median_house_value'}>]], dtype=object)

[71]: train_data.corr(numeric_only=True)
plt.figure()
sns.heatmap(train_data.corr(numeric_only=True), annot=True, cmap="YlGnBu")
plt.show()

[72]: train_data['total_rooms']=np.log(train_data['total_rooms']+1)
train_data['total_bedrooms']=np.log(train_data['total_bedrooms']+1)
train_data['population']=np.log(train_data['population']+1)
train_data['households']=np.log(train_data['households']+1)
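
The same transformation can be written with np.log1p, which computes log(1 + x) directly and is slightly more accurate near zero. An equivalent sketch (an alternative to the cell above, not meant to be run in addition to it):

# log1p(x) == log(x + 1); apply to the same four skewed count columns
for col in ['total_rooms', 'total_bedrooms', 'population', 'households']:
    train_data[col] = np.log1p(train_data[col])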

[73]: train_data.hist()

[73]: array([[<Axes: title={'center': 'longitude'}>,


<Axes: title={'center': 'latitude'}>,
<Axes: title={'center': 'housing_median_age'}>],
[<Axes: title={'center': 'total_rooms'}>,
<Axes: title={'center': 'total_bedrooms'}>,
<Axes: title={'center': 'population'}>],
[<Axes: title={'center': 'households'}>,
<Axes: title={'center': 'median_income'}>,
<Axes: title={'center': 'median_house_value'}>]], dtype=object)

[74]: train_data.ocean_proximity.value_counts()

[74]: ocean_proximity
<1H OCEAN 7217
INLAND 5170
NEAR OCEAN 2118
NEAR BAY 1837
ISLAND 4
Name: count, dtype: int64

[75]: #pd.get_dummies(train_data.ocean_proximity).astype(int)
#train_data.join(pd.get_dummies(train_data.ocean_proximity).astype(int)).drop(['ocean_proximity'],axis=1)

dummies = pd.get_dummies(train_data['ocean_proximity']).astype(int)
train_data = train_data.join(dummies).drop(['ocean_proximity'], axis=1)
train_data

[75]: longitude latitude housing_median_age total_rooms total_bedrooms \


8424 -118.36 33.93 27.0 8.399760 7.116394
19741 -122.57 39.90 15.0 8.262043 6.698268
19155 -122.71 38.37 16.0 7.764721 5.846439
7751 -118.15 33.92 28.0 6.946014 5.533389
17400 -120.44 34.93 15.0 6.767343 5.501258
… … … … … …
17208 -119.72 34.43 36.0 7.053586 5.736572
19855 -119.45 36.35 22.0 7.509335 5.811141
19380 -120.85 37.77 52.0 6.079933 4.406719
17254 -119.71 34.42 39.0 7.067320 5.777652
8760 -118.44 33.81 33.0 8.292799 6.898715

population households median_income median_house_value <1H OCEAN \


8424 8.114025 7.015712 3.1656 204500.0 1
19741 7.437206 6.442540 2.4555 55600.0 0
19155 6.922644 5.855072 5.6018 253000.0 1
7751 6.816736 5.505332 2.5875 161200.0 1
17400 7.033506 5.537334 2.0995 87500.0 1
… … … … … …
17208 6.257668 5.720312 2.6014 320600.0 1
19855 6.981935 5.645447 2.3365 69600.0 0
19380 5.288267 4.234107 1.8625 85400.0 0
17254 6.408529 5.758902 2.1600 259100.0 1
8760 7.407318 6.837333 5.0106 500001.0 0

INLAND ISLAND NEAR BAY NEAR OCEAN


8424 0 0 0 0
19741 1 0 0 0
19155 0 0 0 0
7751 0 0 0 0
17400 0 0 0 0
… … … … …
17208 0 0 0 0
19855 1 0 0 0
19380 1 0 0 0
17254 0 0 0 0
8760 0 0 0 1

[16346 rows x 14 columns]
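
Because pd.get_dummies is applied separately to the train and test splits later on, a category missing from one split would produce mismatched columns (the tiny ISLAND class is the obvious risk here). A more robust sketch, assuming scikit-learn's OneHotEncoder fitted on the training data only:

from sklearn.preprocessing import OneHotEncoder

# Fit the encoder on the training categories, then reuse it for the test set
# (sparse_output requires scikit-learn >= 1.2; older versions use sparse=False)
encoder = OneHotEncoder(handle_unknown='ignore', sparse_output=False)
train_ohe = encoder.fit_transform(x_train[['ocean_proximity']])
test_ohe = encoder.transform(x_test[['ocean_proximity']])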

[76]: plt.figure()
sns.heatmap(train_data.corr(),annot=True,cmap="YlGnBu")
plt.show()

[77]: plt.figure()
sns.scatterplot(x='longitude', y='latitude', data=train_data, hue="median_house_value", palette="coolwarm")

[77]: <Axes: xlabel='longitude', ylabel='latitude'>

[78]: train_data['bedrooms_ratio'] = train_data['total_bedrooms'] / train_data['total_rooms']
train_data['household_rooms'] = train_data['total_rooms'] / train_data['households']

[79]: plt.figure()
sns.heatmap(train_data.corr(),annot=True,cmap="YlGnBu")
plt.show()

[80]: x_train=train_data.drop(['median_house_value'],axis=1)
y_train=train_data['median_house_value']

# data normalization
scaler = StandardScaler()
x_train_scaled = scaler.fit_transform(x_train)

model=LinearRegression()
model.fit(x_train_scaled, y_train)

[80]: LinearRegression()
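
The scaling and the regression can also be combined into a single estimator with a scikit-learn Pipeline, which guarantees the scaler is fitted on the training data only. A sketch, not part of the original notebook:

from sklearn.pipeline import make_pipeline

# StandardScaler + LinearRegression as one estimator
pipe = make_pipeline(StandardScaler(), LinearRegression())
pipe.fit(x_train, y_train)
pipe.score(x_train, y_train)  # R² on the training data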

[81]: test_data=x_test.join(y_test)
test_data['total_rooms']=np.log(test_data['total_rooms']+1)
test_data['total_bedrooms']=np.log(test_data['total_bedrooms']+1)
test_data['population']=np.log(test_data['population']+1)
test_data['households']=np.log(test_data['households']+1)

test_data = test_data.join(pd.get_dummies(test_data['ocean_proximity']).astype(int)).drop(['ocean_proximity'], axis=1)

test_data['bedrooms_ratio'] = test_data['total_bedrooms'] / test_data['total_rooms']
test_data['household_rooms'] = test_data['total_rooms'] / test_data['households']

#pd.get_dummies(test_data['ocean_proximity'])

[82]: test_data

[82]: longitude latitude housing_median_age total_rooms total_bedrooms \


10713 -117.84 33.66 5.0 6.501290 5.147494
20468 -118.71 34.27 26.0 6.898715 5.411646
17148 -122.20 37.43 38.0 8.196161 6.270988
7645 -118.27 33.81 10.0 7.540090 6.349139
17440 -120.44 34.66 22.0 8.080856 6.309918
… … … … … …
14905 -117.06 32.60 24.0 6.993015 5.594711
15752 -122.44 37.77 52.0 8.153637 6.694562
8273 -118.16 33.77 29.0 8.032360 6.668228
1746 -122.35 37.96 34.0 7.264730 5.817111
15696 -122.45 37.79 52.0 7.458763 6.180017

population households median_income median_house_value <1H OCEAN \


10713 5.953243 5.147494 4.5833 230400.0 1
20468 6.579251 5.451038 3.1630 179400.0 1
17148 7.208600 6.278521 7.3681 500001.0 0
7645 7.478735 6.317165 3.9286 114000.0 1
17440 7.461640 6.366470 4.5417 142400.0 0
… … … … … …
14905 6.999422 5.509388 2.4191 107300.0 0
15752 7.325808 6.656727 3.6186 500001.0 0
8273 7.286876 6.602588 2.8750 232500.0 0
1746 7.149132 5.768321 2.5461 93900.0 0
15696 6.595781 6.063785 1.4804 425000.0 0

INLAND ISLAND NEAR BAY NEAR OCEAN bedrooms_ratio household_rooms


10713 0 0 0 0 0.791765 1.263001
20468 0 0 0 0 0.784443 1.265578
17148 0 0 0 1 0.765113 1.305429
7645 0 0 0 0 0.842051 1.193588
17440 0 0 0 1 0.780848 1.269284
… … … … … … …
14905 0 0 0 1 0.800043 1.269291

15752 0 0 1 0 0.821052 1.224872
8273 0 0 0 1 0.830170 1.216547
1746 0 0 1 0 0.800733 1.259419
15696 0 0 1 0 0.828558 1.230051

[4087 rows x 16 columns]

[136]: x_test, y_test = test_data.drop(['median_house_value'],axis=1), test_data['median_house_value']

x_test_scaled = scaler.transform(x_test)
#y_test_scaled = scaler.transform(y_test.values.reshape(-1, 1))
'''
y_scaler = StandardScaler()
y_train_scaled = y_scaler.fit_transform(y_train.values.reshape(-1, 1))
y_test_scaled = y_scaler.transform(y_test.values.reshape(-1, 1))
'''

[84]: x_test

[84]: longitude latitude housing_median_age total_rooms total_bedrooms \


10713 -117.84 33.66 5.0 6.501290 5.147494
20468 -118.71 34.27 26.0 6.898715 5.411646
17148 -122.20 37.43 38.0 8.196161 6.270988
7645 -118.27 33.81 10.0 7.540090 6.349139
17440 -120.44 34.66 22.0 8.080856 6.309918
… … … … … …
14905 -117.06 32.60 24.0 6.993015 5.594711
15752 -122.44 37.77 52.0 8.153637 6.694562
8273 -118.16 33.77 29.0 8.032360 6.668228
1746 -122.35 37.96 34.0 7.264730 5.817111
15696 -122.45 37.79 52.0 7.458763 6.180017

population households median_income <1H OCEAN INLAND ISLAND \


10713 5.953243 5.147494 4.5833 1 0 0
20468 6.579251 5.451038 3.1630 1 0 0
17148 7.208600 6.278521 7.3681 0 0 0
7645 7.478735 6.317165 3.9286 1 0 0
17440 7.461640 6.366470 4.5417 0 0 0
… … … … … … …
14905 6.999422 5.509388 2.4191 0 0 0
15752 7.325808 6.656727 3.6186 0 0 0
8273 7.286876 6.602588 2.8750 0 0 0
1746 7.149132 5.768321 2.5461 0 0 0
15696 6.595781 6.063785 1.4804 0 0 0

NEAR BAY NEAR OCEAN bedrooms_ratio household_rooms


10713 0 0 0.791765 1.263001

20468 0 0 0.784443 1.265578
17148 0 1 0.765113 1.305429
7645 0 0 0.842051 1.193588
17440 0 1 0.780848 1.269284
… … … … …
14905 0 1 0.800043 1.269291
15752 1 0 0.821052 1.224872
8273 0 1 0.830170 1.216547
1746 1 0 0.800733 1.259419
15696 1 0 0.828558 1.230051

[4087 rows x 15 columns]

[138]: model.score(x_train_scaled, y_train)

---------------------------------------------------------------------------
AttributeError Traceback (most recent call last)
Cell In[138], line 1
----> 1 model.score(x_train_scaled, y_train)

AttributeError: 'Sequential' object has no attribute 'score'

[86]: y_pred = model.predict(x_test_scaled)

[87]: mse = mean_squared_error(y_test, y_pred)


mse

[87]: 57926172223.61929
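
r2_score was imported earlier but never used. A short sketch that reports the RMSE (in the same unit as median_house_value, so easier to read than the raw MSE) and the R² of the linear model:

# RMSE in dollars and coefficient of determination on the test set
rmse = np.sqrt(mse)
r2 = r2_score(y_test, y_pred)
print(f"RMSE: {rmse:.0f}  R²: {r2:.3f}")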

[88]: from sklearn.ensemble import RandomForestRegressor


forest=RandomForestRegressor()
forest.fit(x_train,y_train)

[88]: RandomForestRegressor()

[89]: forest.score(x_test,y_test)

[89]: 0.8146615773444194
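
To see which variables drive the random forest, one could inspect its feature_importances_ attribute; a minimal sketch:

# Rank the training features by their importance in the fitted forest
importances = pd.Series(forest.feature_importances_, index=x_train.columns)
print(importances.sort_values(ascending=False))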

[91]: import keras


from keras.models import Sequential
from keras.layers import Dense

[92]: # define the model
model = Sequential([Dense(1, input_dim=x_train.shape[1], activation='linear')])
model.compile(optimizer='adam', loss='mean_squared_error')
model.summary()

C:\Users\king\anaconda3\Lib\site-packages\keras\src\layers\core\dense.py:87:
UserWarning: Do not pass an `input_shape`/`input_dim` argument to a layer. When
using Sequential models, prefer using an `Input(shape)` object as the first
layer in the model instead.
super().__init__(activity_regularizer=activity_regularizer, **kwargs)
Model: "sequential"

Layer (type)                     Output Shape                  Param #
dense (Dense)                    (None, 1)                     16

Total params: 16 (64.00 B)

Trainable params: 16 (64.00 B)

Non-trainable params: 0 (0.00 B)
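
The UserWarning above can be avoided by declaring the input shape with an explicit Input layer, as Keras recommends. An equivalent sketch of the same single-neuron linear model:

from keras.layers import Input

# Same model, but with an explicit Input layer instead of input_dim (silences the warning)
model = Sequential([Input(shape=(x_train.shape[1],)),
                    Dense(1, activation='linear')])
model.compile(optimizer='adam', loss='mean_squared_error')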

[93]: train_5 = model.fit(x_train, y_train, validation_data=(x_test, y_test), epochs=5)

# Training for 10 epochs
train_10 = model.fit(x_train, y_train, validation_data=(x_test, y_test), epochs=10)

# Training for 15 epochs
train_15 = model.fit(x_train, y_train, validation_data=(x_test, y_test), epochs=15)

Epoch 1/5
511/511 ━━━━━━━━━━━━━━━━━━━━ 3s 4ms/step - loss: 55905394688.0000 - val_loss: 57862823936.0000
Epoch 2/5
511/511 ━━━━━━━━━━━━━━━━━━━━ 3s 6ms/step - loss: 55015342080.0000 - val_loss: 57816014848.0000
Epoch 3/5
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 4ms/step - loss: 55397969920.0000 - val_loss: 57769230336.0000
Epoch 4/5
511/511 ━━━━━━━━━━━━━━━━━━━━ 3s 5ms/step - loss: 55470444544.0000 - val_loss: 57722441728.0000
Epoch 5/5
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 55485304832.0000 - val_loss: 57675702272.0000
Epoch 1/10
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 56076288000.0000 - val_loss: 57629020160.0000
Epoch 2/10
511/511 ━━━━━━━━━━━━━━━━━━━━ 3s 6ms/step - loss: 55215300608.0000 - val_loss: 57582419968.0000
Epoch 3/10
511/511 ━━━━━━━━━━━━━━━━━━━━ 1s 2ms/step - loss: 55182422016.0000 - val_loss: 57535737856.0000
Epoch 4/10
511/511 ━━━━━━━━━━━━━━━━━━━━ 1s 3ms/step - loss: 54860214272.0000 - val_loss: 57489137664.0000
Epoch 5/10
511/511 ━━━━━━━━━━━━━━━━━━━━ 1s 3ms/step - loss: 55305433088.0000 - val_loss: 57442570240.0000
Epoch 6/10
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 54938718208.0000 - val_loss: 57396002816.0000
Epoch 7/10
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 55876726784.0000 - val_loss: 57349505024.0000
Epoch 8/10
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 55152832512.0000 - val_loss: 57302986752.0000
Epoch 9/10
511/511 ━━━━━━━━━━━━━━━━━━━━ 1s 3ms/step - loss: 54960181248.0000 - val_loss: 57256488960.0000
Epoch 10/10
511/511 ━━━━━━━━━━━━━━━━━━━━ 3s 6ms/step - loss: 56382586880.0000 - val_loss: 57210064896.0000
Epoch 1/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 55094513664.0000 - val_loss: 57163624448.0000
Epoch 2/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 55323426816.0000 - val_loss: 57117188096.0000
Epoch 3/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 3s 5ms/step - loss: 55270674432.0000 - val_loss: 57070817280.0000
Epoch 4/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 1s 3ms/step - loss: 53296803840.0000 - val_loss: 57024409600.0000
Epoch 5/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 54269583360.0000 - val_loss: 56978063360.0000
Epoch 6/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 54459072512.0000 - val_loss: 56931782656.0000
Epoch 7/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 54719565824.0000 - val_loss: 56885473280.0000
Epoch 8/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 4ms/step - loss: 53834076160.0000 - val_loss: 56839192576.0000
Epoch 9/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 4ms/step - loss: 54277263360.0000 - val_loss: 56793022464.0000
Epoch 10/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 3s 5ms/step - loss: 54326079488.0000 - val_loss: 56746799104.0000
Epoch 11/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 4ms/step - loss: 55075880960.0000 - val_loss: 56700678144.0000
Epoch 12/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 54168629248.0000 - val_loss: 56654499840.0000
Epoch 13/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 4ms/step - loss: 55217995776.0000 - val_loss: 56608378880.0000
Epoch 14/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 54757396480.0000 - val_loss: 56562302976.0000
Epoch 15/15
511/511 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 54337257472.0000 - val_loss: 56516194304.0000

[94]: def plot_performance(train, epochs):
    plt.plot(train.history['loss'], label='train loss')
    plt.plot(train.history['val_loss'], label='validation loss')
    plt.title(f'Loss over {epochs} epochs')
    plt.xlabel('Epochs')
    plt.ylabel('Mean Squared Error')
    plt.legend()
    plt.show()

[95]: plot_performance(train_5, 5)
plot_performance(train_10, 10)
plot_performance(train_15, 15)

[Figures: training and validation loss curves for the 5-, 10-, and 15-epoch runs]
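
The loss curves above are nearly flat: the network is trained on the raw features and a target in the hundreds of thousands, so Adam's default step size barely moves the single neuron's weights. A hedged sketch that reuses the scaled features and the commented-out y_scaler idea from cell [136] (assumes a freshly compiled model):

# Standardize the target as well, then retrain on the standardized features
y_scaler = StandardScaler()
y_train_s = y_scaler.fit_transform(y_train.values.reshape(-1, 1))
y_test_s = y_scaler.transform(y_test.values.reshape(-1, 1))

model = Sequential([Dense(1, input_dim=x_train.shape[1], activation='linear')])
model.compile(optimizer='adam', loss='mean_squared_error')
train_scaled = model.fit(x_train_scaled, y_train_s, validation_data=(x_test_scaled, y_test_s), epochs=15)
plot_performance(train_scaled, 15)
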
[96]: model.summary()

Model: "sequential"

Layer (type)                     Output Shape                  Param #
dense (Dense)                    (None, 1)                     16

Total params: 50 (204.00 B)

Trainable params: 16 (64.00 B)

Non-trainable params: 0 (0.00 B)

Optimizer params: 34 (140.00 B)

[ ]:
