Multiple_Linear_Regression - Colaboratory
Multiple_Linear_Regression - Colaboratory
0.Data Preprocessing
1 import numpy as np
2 import matplotlib.pyplot as plt
3 import pandas as pd
Mounted at /content/drive
https://colab.research.google.com/drive/1PiocMZjWMtlFWkXfvvweEmJ_WHLIXNj4#scrollTo=yoInQ7qS6iiB&printMode=true 1/5
1/24/23, 7:52 PM Multiple_Linear_Regression - Colaboratory
10 101913.08 110594.11 229160.95 Florida 146121.95
1 dataset.info()
43 15505.73 127382.30 35534.17 New York 69758.98
44
<class 22177.74 154806.14
'pandas.core.frame.DataFrame'> 28334.72 California 65200.33
RangeIndex: 50 entries, 0 to 49
45 1000.23 124153.04 1903.93 New York 64926.08
Data columns (total 5 columns):
#
46 Column
1315.46 Non-Null Count
115816.21 Dtype
297114.46 Florida 49490.75
--- ------ -------------- -----
0
47 R&D Spend
0.00 50 non-null
135426.92 float64
0.00 California 42559.73
1 Administration 50 non-null float64
2
48 Marketing
542.05 Spend 51743.15
50 non-null float64
0.00 New York 35673.41
3 State 50 non-null object
49
4 Profit0.00 116983.80
50 non-null 45173.06
float64 California 14681.40
dtypes: float64(4), object(1)
memory usage: 2.1+ KB
1 X = dataset.drop('Profit', axis=1)
2 X
https://colab.research.google.com/drive/1PiocMZjWMtlFWkXfvvweEmJ_WHLIXNj4#scrollTo=yoInQ7qS6iiB&printMode=true 2/5
1/24/23, 7:52 PM Multiple_Linear_Regression - Colaboratory
10 101913.08 110594.11 229160.95 Florida
https://colab.research.google.com/drive/1PiocMZjWMtlFWkXfvvweEmJ_WHLIXNj4#scrollTo=yoInQ7qS6iiB&printMode=true 3/5
1/24/23, 7:52 PM Multiple_Linear_Regression - Colaboratory
1 y = dataset['Profit']
2 y
0 192261.83
1 191792.06
2 191050.39
3 182901.99
4 166187.94
5 156991.12
6 156122.51
7 155752.60
8 152211.77
9 149759.96
10 146121.95
11 144259.40
12 141585.52
13 134307.35
14 132602.65
15 129917.04
16 126992.93
17 125370.37
18 124266.90
19 122776.86
20 118474.03
21 111313.02
22 110352.25
23 108733.99
24 108552.04
25 107404.34
26 105733.54
27 105008.31
28 103282.38
29 101004.64
30 99937.59
31 97483.56
32 97427.84
33 96778.92
34 96712.80
35 96479.51
36 90708.19
37 89949.14
38 81229.06
39 81005.76
40 78239.91
41 77798.83
42 71498.49
43 69758.98
44 65200.33
45 64926.08
46 49490.75
47 42559.73
48 35673.41
49 14681.40
Name: Profit, dtype: float64
1 pd.DataFrame(transformed_X).head()
https://colab.research.google.com/drive/1PiocMZjWMtlFWkXfvvweEmJ_WHLIXNj4#scrollTo=yoInQ7qS6iiB&printMode=true 4/5
1/24/23, 7:52 PM Multiple_Linear_Regression - Colaboratory
0 1 2 3 4 5
LinearRegression()
1.1 Score
1 regressor.score(X_test,y_test)
0.9840064291741644
1 y_pred = regressor.predict(X_test)
1 pd.DataFrame(d)
y_pred y_test
32 98884.371543 97427.84
33 100047.235184 96778.92
47 47766.247901 42559.73
9 154976.558305 149759.96
37 91129.087779 89949.14
8 151755.926389 152211.77
23 112436.195860 108733.99
24 113375.898676 108552.04
17 130706.106786 125370.37
1 189141.730655 191792.06
39 85217.422839 81005.76
22 116952.737156 110352.25
46 60343.602070 49490.75
https://colab.research.google.com/drive/1PiocMZjWMtlFWkXfvvweEmJ_WHLIXNj4#scrollTo=yoInQ7qS6iiB&printMode=true 5/5