0% found this document useful (0 votes)
54 views

Example Project California Data Anaylsis Jupyter Notebook

Uploaded by

bazett2142006
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
54 views

Example Project California Data Anaylsis Jupyter Notebook

Uploaded by

bazett2142006
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 28

20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

Import libraries
In [1]: import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.preprocessing import LabelEncoder
import warnings
warnings.filterwarnings('ignore')

Read data
In [2]: file_path = '/california-housing-prices/housing.csv'
data = pd.read_csv(file_path)

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 1/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [3]: data.sample(20)

Out[3]: longitude latitude housing_median_age total_rooms total_bedrooms population households median_income median_house_

4000 -118.63 34.18 33.0 5252.0 760.0 2041.0 730.0 6.7977 389

18854 -122.45 41.28 15.0 2740.0 503.0 1188.0 445.0 3.4519 128

2332 -119.69 36.83 7.0 2075.0 353.0 1040.0 362.0 3.9943 100

11959 -117.44 33.90 23.0 4487.0 754.0 2609.0 778.0 4.2788 148

8882 -118.51 34.04 40.0 1382.0 167.0 483.0 178.0 11.7045 500

638 -122.15 37.72 31.0 1616.0 372.0 739.0 379.0 2.9097 210

15852 -122.43 37.74 52.0 2637.0 539.0 1159.0 497.0 3.8846 333

14529 -117.14 32.92 15.0 3242.0 595.0 1936.0 593.0 4.9706 184

16984 -122.30 37.56 35.0 1873.0 351.0 945.0 333.0 5.5184 274

3655 -118.43 34.22 36.0 1372.0 295.0 774.0 306.0 3.6618 187

9403 -122.53 37.88 25.0 4921.0 866.0 1913.0 834.0 6.8742 413

12112 -117.32 34.01 23.0 3021.0 527.0 1580.0 533.0 4.4063 129

20475 -118.75 34.26 24.0 2234.0 373.0 1325.0 383.0 5.4604 193

3872 -118.54 34.21 32.0 2593.0 566.0 1596.0 547.0 3.9886 199

20231 -119.27 34.26 23.0 3578.0 753.0 1455.0 649.0 4.1898 359

3670 -118.40 34.23 36.0 1643.0 349.0 1414.0 337.0 4.1181 172

3232 -119.69 36.25 35.0 2011.0 349.0 970.0 300.0 2.3950 94

321 -122.19 37.76 26.0 1293.0 297.0 984.0 303.0 1.9479 85

7302 -118.19 33.99 40.0 1547.0 434.0 1930.0 427.0 3.3869 157

6505 -118.06 34.08 34.0 1197.0 260.0 942.0 245.0 3.4202 189

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 2/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [4]: data.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 20640 entries, 0 to 20639
Data columns (total 10 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 longitude 20640 non-null float64
1 latitude 20640 non-null float64
2 housing_median_age 20640 non-null float64
3 total_rooms 20640 non-null float64
4 total_bedrooms 20433 non-null float64
5 population 20640 non-null float64
6 households 20640 non-null float64
7 median_income 20640 non-null float64
8 median_house_value 20640 non-null float64
9 ocean_proximity 20640 non-null object
dtypes: float64(9), object(1)
memory usage: 1.6+ MB

In [5]: data.describe().round(2)

Out[5]: longitude latitude housing_median_age total_rooms total_bedrooms population households median_income median_house

count 20640.00 20640.00 20640.00 20640.00 20433.00 20640.00 20640.00 20640.00 20

mean -119.57 35.63 28.64 2635.76 537.87 1425.48 499.54 3.87 206

std 2.00 2.14 12.59 2181.62 421.39 1132.46 382.33 1.90 115

min -124.35 32.54 1.00 2.00 1.00 3.00 1.00 0.50 14

25% -121.80 33.93 18.00 1447.75 296.00 787.00 280.00 2.56 119

50% -118.49 34.26 29.00 2127.00 435.00 1166.00 409.00 3.53 179

75% -118.01 37.71 37.00 3148.00 647.00 1725.00 605.00 4.74 264

max -114.31 41.95 52.00 39320.00 6445.00 35682.00 6082.00 15.00 500

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 3/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

Data cleaning

check duplicated data


In [6]: data.duplicated().sum()

Out[6]: 0

Check missing value


In [7]: data.isna().sum()

Out[7]: longitude 0
latitude 0
housing_median_age 0
total_rooms 0
total_bedrooms 207
population 0
households 0
median_income 0
median_house_value 0
ocean_proximity 0
dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 4/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

fill missing value


In [8]: data.total_bedrooms.value_counts().iloc[:50]

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 5/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

Out[8]: total_bedrooms
280.0 55
331.0 51
345.0 50
343.0 49
393.0 49
328.0 48
348.0 48
394.0 48
272.0 47
309.0 47
314.0 46
322.0 46
399.0 46
295.0 46
317.0 46
313.0 45
290.0 45
346.0 45
340.0 45
287.0 45
388.0 45
284.0 45
291.0 45
294.0 44
269.0 44
390.0 44
312.0 44
460.0 44
300.0 44
361.0 44
365.0 44
398.0 43
335.0 43
416.0 43
254.0 43
289.0 43
369.0 43
373.0 43
428.0 43
292.0 42
339.0 42
315.0 42
localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 6/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook
458.0 42
360.0 42
308.0 42
282.0 42
358.0 41
325.0 41
410.0 41
347.0 41
Name: count, dtype: int64

In [9]: data.total_bedrooms.median()

Out[9]: 435.0

In [10]: data.total_bedrooms.fillna(data.total_bedrooms.median(), inplace=True)

In [11]: data.isna().sum()

Out[11]: longitude 0
latitude 0
housing_median_age 0
total_rooms 0
total_bedrooms 0
population 0
households 0
median_income 0
median_house_value 0
ocean_proximity 0
dtype: int64

Preprocessing data

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 7/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [12]: data.ocean_proximity.value_counts()

Out[12]: ocean_proximity
<1H OCEAN 9136
INLAND 6551
NEAR OCEAN 2658
NEAR BAY 2290
ISLAND 5
Name: count, dtype: int64

In [13]: sns.histplot(data.ocean_proximity)

Out[13]: <Axes: xlabel='ocean_proximity', ylabel='Count'>

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 8/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [14]: data

Out[14]: longitude latitude housing_median_age total_rooms total_bedrooms population households median_income median_house_

0 -122.23 37.88 41.0 880.0 129.0 322.0 126.0 8.3252 452

1 -122.22 37.86 21.0 7099.0 1106.0 2401.0 1138.0 8.3014 358

2 -122.24 37.85 52.0 1467.0 190.0 496.0 177.0 7.2574 352

3 -122.25 37.85 52.0 1274.0 235.0 558.0 219.0 5.6431 341

4 -122.25 37.85 52.0 1627.0 280.0 565.0 259.0 3.8462 342

... ... ... ... ... ... ... ... ...

20635 -121.09 39.48 25.0 1665.0 374.0 845.0 330.0 1.5603 78

20636 -121.21 39.49 18.0 697.0 150.0 356.0 114.0 2.5568 77

20637 -121.22 39.43 17.0 2254.0 485.0 1007.0 433.0 1.7000 92

20638 -121.32 39.43 18.0 1860.0 409.0 741.0 349.0 1.8672 84

20639 -121.24 39.37 16.0 2785.0 616.0 1387.0 530.0 2.3886 89

20640 rows × 10 columns

In [15]: data.ocean_proximity.replace({'<1H OCEAN':0, 'INLAND':1, 'NEAR OCEAN':2,'NEAR BAY':3, 'ISLAND':4},inplace = T

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 9/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [16]: data

Out[16]: longitude latitude housing_median_age total_rooms total_bedrooms population households median_income median_house_

0 -122.23 37.88 41.0 880.0 129.0 322.0 126.0 8.3252 452

1 -122.22 37.86 21.0 7099.0 1106.0 2401.0 1138.0 8.3014 358

2 -122.24 37.85 52.0 1467.0 190.0 496.0 177.0 7.2574 352

3 -122.25 37.85 52.0 1274.0 235.0 558.0 219.0 5.6431 341

4 -122.25 37.85 52.0 1627.0 280.0 565.0 259.0 3.8462 342

... ... ... ... ... ... ... ... ...

20635 -121.09 39.48 25.0 1665.0 374.0 845.0 330.0 1.5603 78

20636 -121.21 39.49 18.0 697.0 150.0 356.0 114.0 2.5568 77

20637 -121.22 39.43 17.0 2254.0 485.0 1007.0 433.0 1.7000 92

20638 -121.32 39.43 18.0 1860.0 409.0 741.0 349.0 1.8672 84

20639 -121.24 39.37 16.0 2785.0 616.0 1387.0 530.0 2.3886 89

20640 rows × 10 columns

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 10/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

relation between total_rooms , total_bedrooms and median_house_value


In [17]: data[['total_rooms','total_bedrooms']].value_counts()

Out[17]: total_rooms total_bedrooms


1440.0 267.0 3
40.0 10.0 3
1387.0 236.0 3
1864.0 331.0 3
102.0 17.0 3
..
1678.0 606.0 1
514.0 1
386.0 1
369.0 1
39320.0 6210.0 1
Name: count, Length: 20460, dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 11/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [18]: data[['total_rooms','median_house_value']].groupby(data['total_bedrooms']).value_counts().sort_values(ascendi

Out[18]: total_bedrooms total_rooms median_house_value


6.0 24.0 67500.0 2
3.0 18.0 275000.0 2
1.0 8.0 500001.0 1
556.0 1790.0 181300.0 1
2315.0 147900.0 1
2188.0 136800.0 1
1926.0 123200.0 1
1871.0 164400.0 1
1853.0 248100.0 1
1852.0 152500.0 1
1802.0 146900.0 1
1672.0 129200.0 1
2447.0 85500.0 1
1270.0 170800.0 1
555.0 3742.0 285400.0 1
3420.0 173800.0 1
3306.0 319900.0 1
3225.0 173300.0 1
3039.0 178600.0 1
3029.0 169200.0 1
Name: count, dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 12/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [19]: data[['total_rooms','median_house_value']].value_counts().sort_values(ascending=False)[0:20]

Out[19]: total_rooms median_house_value


2170.0 500001.0 4
2089.0 500001.0 3
1721.0 500001.0 3
2492.0 500001.0 3
2718.0 500001.0 3
2398.0 500001.0 3
2665.0 500001.0 3
3095.0 500001.0 3
60.0 67500.0 3
2959.0 500001.0 2
1590.0 500001.0 2
2501.0 500001.0 2
2142.0 325000.0 2
2629.0 500001.0 2
1777.0 500001.0 2
1366.0 150000.0 2
24.0 67500.0 2
2075.0 500001.0 2
2040.0 500001.0 2
1151.0 113600.0 2
Name: count, dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 13/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [20]: data[['total_rooms','median_house_value']].value_counts().sort_values(ascending = False)[0:15]

Out[20]: total_rooms median_house_value


2170.0 500001.0 4
2089.0 500001.0 3
1721.0 500001.0 3
2492.0 500001.0 3
2718.0 500001.0 3
2398.0 500001.0 3
2665.0 500001.0 3
3095.0 500001.0 3
60.0 67500.0 3
2959.0 500001.0 2
1590.0 500001.0 2
2501.0 500001.0 2
2142.0 325000.0 2
2629.0 500001.0 2
1777.0 500001.0 2
Name: count, dtype: int64

Relation between total_rooms and housing_median_age


In [21]: data[['total_rooms','total_bedrooms','housing_median_age']].value_counts().sort_values(ascending = False)

Out[21]: total_rooms total_bedrooms housing_median_age


2264.0 439.0 52.0 2
24.0 6.0 52.0 2
48.0 8.0 26.0 2
2205.0 453.0 33.0 2
1511.0 274.0 35.0 2
..
32627.0 6445.0 11.0 1
37937.0 5471.0 4.0 1
27870.0 5027.0 5.0 1
18767.0 3032.0 4.0 1
39320.0 6210.0 3.0 1
Name: count, Length: 20633, dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 14/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [22]: dt = data[['total_rooms','housing_median_age']].sort_values(ascending = True,by = 'total_rooms')


localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 15/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [23]: plt.figure(figsize=(10,7))
plt.scatter(x = dt['total_rooms'],y = dt['housing_median_age'])
plt.xlabel('total_rooms')
plt.ylabel('housing_median_age')
plt.show()

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 16/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

Conclusion

total_rooms number less than in old houses

Relation longitude , latitude and median_house_value, ocean_proximity


In [24]: data[['longitude','median_house_value','ocean_proximity']].value_counts().sort_values(ascending = False)

Out[24]: longitude median_house_value ocean_proximity


-118.40 500001.0 0 29
-118.43 500001.0 0 23
-118.41 500001.0 0 23
-122.44 500001.0 3 20
-118.42 500001.0 0 17
..
-117.08 137500.0 0 1
137900.0 0 1
139800.0 2 1
140300.0 2 1
-114.31 66900.0 1 1
Name: count, Length: 19462, dtype: int64

In [25]: data[['latitude','median_house_value','ocean_proximity']].value_counts().sort_values(ascending = False)

Out[25]: latitude median_house_value ocean_proximity


34.06 500001.0 0 45
37.79 500001.0 3 33
34.07 500001.0 0 30
37.80 500001.0 3 28
34.05 500001.0 0 27
..
40.62 107000.0 2 1
40.65 118300.0 1 1
40.66 75800.0 1 1
92200.0 1 1
41.95 122400.0 2 1
Name: count, Length: 19375, dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 17/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [26]: data[['longitude','latitude','median_house_value','ocean_proximity']].value_counts().sort_values(ascending =

Out[26]: longitude latitude median_house_value ocean_proximity


-122.44 37.80 500001.0 3 9
37.79 500001.0 3 8
-122.43 37.79 500001.0 3 7
-122.42 37.80 500001.0 3 7
-122.41 37.80 500001.0 3 6
..
-116.41 33.74
168800.0 1 1
177800.0 1 1
-116.40 33.78 259200.0 1 1
34.09 97800.0 1 1
-114.31 34.19 66900.0 1 1
Name: count, Length: 20315, dtype: int64

In [27]: data[['median_house_value']].groupby(data['ocean_proximity']).value_counts().sort_values(ascending=False)[0:2

Out[27]: ocean_proximity median_house_value


0 500001.0 532
2 500001.0 212
3 500001.0 194
0 162500.0 55
1 112500.0 52
0 187500.0 48
1 137500.0 47
87500.0 47
0 137500.0 45
225000.0 43
350000.0 43
1 67500.0 34
162500.0 33
0 175000.0 33
1 100000.0 30
0 275000.0 30
150000.0 28
112500.0 27
1 500001.0 27
0 200000.0 25
Name: count, dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 18/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [28]: data[['median_house_value','housing_median_age']].groupby(data['ocean_proximity']).value_counts().sort_values

Out[28]: ocean_proximity median_house_value housing_median_age


3 500001.0 52.0 97
0 500001.0 52.0 58
36.0 21
35.0 21
34.0 19
37.0 18
31.0 17
27.0 16
2 500001.0 52.0 15
0 500001.0 25.0 15
33.0 15
45.0 15
2 500001.0 34.0 15
0 500001.0 28.0 14
42.0 14
24.0 14
22.0 14
29.0 13
32.0 13
41.0 13
Name: count, dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 19/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

Relation between median_house_value and housing_median_age


In [29]: data['median_house_value'].value_counts().sort_values(ascending = False).iloc[0:20]

Out[29]: median_house_value
500001.0 965
137500.0 122
162500.0 117
112500.0 103
187500.0 93
225000.0 92
350000.0 79
87500.0 78
275000.0 65
150000.0 64
175000.0 63
100000.0 62
125000.0 56
67500.0 55
250000.0 47
200000.0 46
118800.0 39
450000.0 37
156300.0 35
212500.0 33
Name: count, dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 20/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [30]: data['housing_median_age'].groupby(data['median_house_value']).value_counts().iloc[0:20]

Out[30]: median_house_value housing_median_age


14999.0 16.0 1
19.0 1
36.0 1
52.0 1
17500.0 39.0 1
22500.0 52.0 2
8.0 1
33.0 1
25000.0 21.0 1
26600.0 34.0 1
26900.0 46.0 1
27500.0 17.0 1
28300.0 29.0 1
30000.0 23.0 1
45.0 1
32500.0 15.0 1
20.0 1
24.0 1
52.0 1
32900.0 14.0 1
Name: count, dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 21/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

population data
In [31]: data['population'].value_counts()

Out[31]: population
891.0 25
761.0 24
1227.0 24
1052.0 24
850.0 24
..
2141.0 1
5546.0 1
3186.0 1
3590.0 1
6912.0 1
Name: count, Length: 3888, dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 22/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [32]: plt.figure(figsize=(15,10))
plt.hist(data['population'],bins=150)
plt.xlabel('population')
plt.xlim(0,17000)
plt.ylim(0,6000)
plt.show()

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 23/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [33]: data[['population','households']].groupby(data['median_house_value']).value_counts().sort_values(ascending =

Out[33]: median_house_value population households


500001.0 39.0 14.0 2
410.0 171.0 2
225000.0 14.0 7.0 2
14999.0 18.0 8.0 1
230200.0 2037.0 727.0 1
..
141400.0 1767.0 628.0 1
1534.0 625.0 1
590.0 205.0 1
141300.0 2248.0 766.0 1
500001.0 7431.0 4930.0 1
Name: count, Length: 20637, dtype: int64

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 24/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [34]: plt.figure(figsize=(10,5))
sns.heatmap(data.corr(), annot=True,linewidths=2)

Out[34]: <Axes: >

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 25/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

relation between median_income and median_house_value


In [35]: data[['median_house_value','median_income']].sort_values(ascending = True,by ='median_income' )

Out[35]: median_house_value median_income

4861 500001.0 0.4999

7125 162500.0 0.4999

6688 500001.0 0.4999

19800 56700.0 0.4999

6343 112500.0 0.4999

... ... ...

4605 500001.0 15.0001

4606 500001.0 15.0001

4626 500001.0 15.0001

8848 500001.0 15.0001

17166 500001.0 15.0001

20640 rows × 2 columns

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 26/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [36]: data['median_house_value'].groupby(data['median_income']).value_counts().sort_values(ascending = False)

Out[36]: median_income median_house_value


15.0001 500001.0 46
2.6250 137500.0 3
187500.0 3
2.3750 175000.0 3
7.5000 500001.0 3
..
2.8913 60900.0 1
2.8910 178100.0 1
2.8906 204900.0 1
158300.0 1
15.0001 400000.0 1
Name: count, Length: 20532, dtype: int64

In [37]: data.columns

Out[37]: Index(['longitude', 'latitude', 'housing_median_age', 'total_rooms',


'total_bedrooms', 'population', 'households', 'median_income',
'median_house_value', 'ocean_proximity'],
dtype='object')

ML model to de

In [38]: from sklearn.model_selection import train_test_split


from sklearn.linear_model import LinearRegression
from sklearn.ensemble import RandomForestRegressor
from sklearn.ensemble import GradientBoostingRegressor

In [39]: X_data= data.drop(['median_house_value'], axis=1)


y_label = data['median_house_value']

In [40]: x_train, x_test, y_train, y_test = train_test_split(X_data, y_label, test_size=0.2, random_state=42)

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 27/28
20/10/2024, 14:07 california-data-anaylsis - Jupyter Notebook

In [41]: def Regression_model(model):


model.fit(x_train, y_train)
train_daTa_accuracy = model.score(x_train, y_train) #train daTa
test_daTa_accuracy = model.score(x_test, y_test) #test daTa
print('model name : ',model)
print('accuracy of train data = ',train_daTa_accuracy)
print('accuracy of test data = ',test_daTa_accuracy)

In [42]: # LinearRegression model


model = LinearRegression()
Regression_model(model)

model name : LinearRegression()


accuracy of train data = 0.640334144695327
accuracy of test data = 0.6133745270468198

In [43]: # RandomForestRegressor model


model = RandomForestRegressor()
Regression_model(model)

model name : RandomForestRegressor()


accuracy of train data = 0.9744241139262757
accuracy of test data = 0.8104205723540762

In [44]: # GradientBoostingRegressor model


model = GradientBoostingRegressor()
Regression_model(model)

model name : GradientBoostingRegressor()


accuracy of train data = 0.7896150670979764
accuracy of test data = 0.7604261904408444

localhost:8888/notebooks/Downloads/california-data-anaylsis.ipynb 28/28

You might also like