
Multiple Regression

March 14, 2024

1 Multiple regression
• What if more than one variable influences the thing you are interested in?
• Example: predicting the price of a car based on its various attributes.
• If there are also multiple dependent variables - multiple things being predicted - that is multivariate regression (a minimal sketch follows below).
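Multiple regression (many predictors, one target) is what the rest of this notebook demonstrates; the multivariate case can be handled in much the same way. A minimal sketch with made-up numbers, assuming scikit-learn's LinearRegression (not part of the original notebook):

# Illustrative sketch only - the feature values and the second target
# ("resale value") are invented for the example.
import numpy as np
from sklearn.linear_model import LinearRegression

# Two predictors per car: mileage and number of cylinders
X = np.array([[20000, 4], [40000, 4], [15000, 6], [60000, 8]])
# Two targets predicted at once: price and resale value
Y = np.array([[21000, 15000], [18000, 12000], [26000, 19000], [30000, 21000]])

model = LinearRegression().fit(X, Y)
print(model.coef_)               # one row of coefficients per target
print(model.predict([[30000, 6]]))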

1.0.1 Still uses least squares


• The only difference is that you now get a separate coefficient for each factor.
• These coefficients indicate how important each factor really is, provided the data are normalized.
• You can drop variables that have no influence.
• You can still measure the fit with r-squared.
• You need to assume that the different factors are not dependent on each other (a quick check is sketched after this list).
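That last assumption can be checked before fitting. A minimal sketch (not part of the original notebook) using variance inflation factors from statsmodels on the same cars.xls data used below:

import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

df = pd.read_excel('cars.xls')
X = sm.add_constant(df[['Mileage', 'Cylinder', 'Doors']])

# A VIF near 1 means a factor is nearly independent of the others;
# values above roughly 5-10 suggest problematic collinearity.
for i, name in enumerate(X.columns):
    if name != 'const':
        print(name, variance_inflation_factor(X.values, i))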

1.1 Practice
[2]: import pandas as pd

df = pd.read_excel('cars.xls')

[3]: %matplotlib inline

import numpy as np

# Bucket Mileage into 10,000-mile bins and plot the mean Price per bin
df1 = df[['Mileage','Price']]
bins = np.arange(0,50000,10000)
groups = df1.groupby(pd.cut(df1['Mileage'],bins)).mean()
print(groups.head())
groups['Price'].plot.line()

Mileage Price
Mileage
(0, 10000] 5588.629630 24096.714451
(10000, 20000] 15898.496183 21955.979607
(20000, 30000] 24114.407104 20278.606252
(30000, 40000] 33610.338710 19463.670267
/tmp/ipykernel_12254/679127490.py:5: FutureWarning: The default of
observed=False is deprecated and will be changed to True in a future version of
pandas. Pass observed=False to retain current behavior or observed=True to adopt
the future default and silence this warning.
  groups = df1.groupby(pd.cut(df1['Mileage'],bins)).mean()

[3]: <Axes: xlabel='Mileage'>

[4]: import statsmodels.api as sm
from sklearn.preprocessing import StandardScaler

# Standardize the features so their coefficients are comparable in magnitude
scale = StandardScaler()

X = df[['Mileage', 'Cylinder', 'Doors']]
y = df['Price']

X[['Mileage', 'Cylinder', 'Doors']] = scale.fit_transform(X[['Mileage', 'Cylinder', 'Doors']].values)

# Add the intercept column and fit an ordinary least squares model
X = sm.add_constant(X)
print(X)

est = sm.OLS(y, X).fit()
print(est.summary())

const Mileage Cylinder Doors
0 1.0 -1.417485 0.52741 0.556279
1 1.0 -1.305902 0.52741 0.556279
2 1.0 -0.810128 0.52741 0.556279
3 1.0 -0.426058 0.52741 0.556279
4 1.0 0.000008 0.52741 0.556279
.. … … … …
799 1.0 -0.439853 0.52741 0.556279
800 1.0 -0.089966 0.52741 0.556279
801 1.0 0.079605 0.52741 0.556279
802 1.0 0.750446 0.52741 0.556279
803 1.0 1.932565 0.52741 0.556279

[804 rows x 4 columns]


OLS Regression Results
==============================================================================
Dep. Variable: Price R-squared: 0.360
Model: OLS Adj. R-squared: 0.358
Method: Least Squares F-statistic: 150.0
Date: Thu, 14 Mar 2024 Prob (F-statistic): 3.95e-77
Time: 16:21:34 Log-Likelihood: -8356.7
No. Observations: 804 AIC: 1.672e+04
Df Residuals: 800 BIC: 1.674e+04
Df Model: 3
Covariance Type: nonrobust
==============================================================================
coef std err t P>|t| [0.025 0.975]
------------------------------------------------------------------------------
const 2.134e+04 279.405 76.388 0.000 2.08e+04 2.19e+04
Mileage -1272.3412 279.567 -4.551 0.000 -1821.112 -723.571
Cylinder 5587.4472 279.527 19.989 0.000 5038.754 6136.140
Doors -1404.5513 279.446 -5.026 0.000 -1953.085 -856.018
==============================================================================
Omnibus: 157.913 Durbin-Watson: 0.069
Prob(Omnibus): 0.000 Jarque-Bera (JB): 257.529
Skew: 1.278 Prob(JB): 1.20e-56
Kurtosis: 4.074 Cond. No. 1.03
==============================================================================

Notes:
[1] Standard Errors assume that the covariance matrix of the errors is correctly
specified.
/tmp/ipykernel_12254/1575598944.py:8: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  X[['Mileage', 'Cylinder', 'Doors']] = scale.fit_transform(X[['Mileage', 'Cylinder', 'Doors']].values)
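The warning is harmless here, but a small adjustment (not in the original notebook) avoids it by scaling an explicit copy of the slice instead of a view of df:

# Operate on an explicit copy so the scaled values are written into a new
# DataFrame rather than into a view of df.
X = df[['Mileage', 'Cylinder', 'Doors']].copy()
X[['Mileage', 'Cylinder', 'Doors']] = scale.fit_transform(X.values)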

[5]: y.groupby(df.Doors).mean()

[5]: Doors
2 23807.135520
4 20580.670749
Name: Price, dtype: float64

[11]: # Predict the price of a car with 45,000 miles, 8 cylinders and 4 doors:
# scale the raw features, prepend a 1 for the intercept, then predict
scaled = scale.transform([[45000, 8, 4]])
scaled = np.insert(scaled[0], 0, 1)
print(scaled)
predicted = est.predict(scaled)
print(predicted)

[1. 3.07256589 1.96971667 0.55627894]
[27658.15707316]
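The same prediction can also be made while keeping the column labels, which makes the intercept and feature order explicit. A sketch assuming the scale and est objects fitted above:

# Not part of the original notebook: build the new observation as a DataFrame
# so its columns line up with the design matrix used to fit the model.
new_car = pd.DataFrame([[45000, 8, 4]], columns=['Mileage', 'Cylinder', 'Doors'])
new_car[['Mileage', 'Cylinder', 'Doors']] = scale.transform(new_car.values)
new_car = sm.add_constant(new_car, has_constant='add')  # force the intercept column
print(est.predict(new_car))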
