Python_Codes_Regression - Jupyter Notebook

Simple Regression
In [1]: import pandas as pd
        import numpy as np
        import matplotlib.pyplot as plt
        import warnings
        warnings.filterwarnings('ignore')
        import statsmodels.formula.api as smf
        import statsmodels.api as sm

In [2]: data = pd.DataFrame({'RDE': [2,3,5,4,11,5], 'AP': [20,25,34,30,40,31]})
        data.plot('RDE', 'AP', kind='scatter')
        plt.title("Annual Profit against R&D Expenditure")
        plt.xlabel("R&D Expenditure (Millions)")
        plt.ylabel("Annual Profit (Millions)")
Out[2]: Text(0, 0.5, 'Annual Profit (Millions)')


In [3]: df = pd.DataFrame({'RDE': [2,3,5,4,11,5], 'AP': [20,25,34,30,40,31]})
        df.plot('RDE', 'AP', kind='scatter')
        lm = smf.ols("AP ~ RDE", data=df).fit()
        xmin = df.RDE.min()
        xmax = df.RDE.max()
        X = np.linspace(xmin, xmax, 100)
        # params[0] is the intercept (w₀)
        # params[1] is the slope (w₁)
        Y = lm.params[0] + lm.params[1] * X
        plt.plot(X, Y, color="darkgreen")
        plt.xlabel("R&D Expenditure (Millions)")
        plt.ylabel("Annual Profit (Millions)")
Out[3]: Text(0, 0.5, 'Annual Profit (Millions)')
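The fitted line can equally be drawn by letting the model evaluate its own predictions, which avoids indexing params by position. A minimal sketch, assuming the df, lm, xmin and xmax defined above:

    # Equivalent way to draw the fitted line: evaluate lm.predict on a grid
    X_line = pd.DataFrame({'RDE': np.linspace(xmin, xmax, 100)})
    plt.plot(X_line['RDE'], lm.predict(X_line), color="darkgreen")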

In [4]: df = pd.DataFrame({'RDE': [2,3,5,4,11,5,10,8], 'AP': [20,25,34,30,40,31,39,37]})
        # create and fit the linear model
        lm = smf.ols(formula='AP ~ RDE', data=df).fit()
        print(lm.params)

Intercept 20.157895
RDE 1.973684
dtype: float64
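These coefficients can be checked against the closed-form least-squares formulas. A minimal sketch, using the same df as in In [4]:

    # slope = Sxy / Sxx, intercept = mean(y) - slope * mean(x)
    x, y = df['RDE'], df['AP']
    slope = ((x - x.mean()) * (y - y.mean())).sum() / ((x - x.mean()) ** 2).sum()
    intercept = y.mean() - slope * x.mean()
    print(intercept, slope)   # 20.157895, 1.973684, matching lm.params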

In [5]: # use the fitted model for prediction
        lm.predict({'RDE': 10})
        # Expected Annual Profit (Millions) for R&D Expenditure of 10 (Millions)

Out[5]: 0 39.894737
dtype: float64
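The same number falls out of the fitted equation ŷ = w₀ + w₁·x by hand; a quick check against lm.predict:

    # 20.157895 + 1.973684 * 10 = 39.894737
    y_hat = lm.params['Intercept'] + lm.params['RDE'] * 10
    print(y_hat)   # 39.894737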


In [6]: df_rd = pd.read_excel("R&D_Profit.xlsx")
        df_rd

Out[6]:
   R&D Expenditure (Millions)  Annual Profit (Millions)
0                           2                        20
1                           3                        25
2                           5                        34
3                           4                        30
4                          11                        40
5                           5                        31

In [7]: X = df_rd['R&D Expenditure (Millions)']
        y = df_rd['Annual Profit (Millions)']
        # Add a constant to the X variable for the intercept term
        X = sm.add_constant(X)
        # Fit the model
        model = sm.OLS(y, X).fit()
        # Print model summary
        print(model.summary())

                             OLS Regression Results
==============================================================================
Dep. Variable:     Annual Profit (Millions)   R-squared:                 0.826
Model:                                  OLS   Adj. R-squared:            0.783
Method:                       Least Squares   F-statistic:               19.05
Date:                      Wed, 23 Oct 2024   Prob (F-statistic):       0.0120
Time:                              18:18:33   Log-Likelihood:          -14.351
No. Observations:                         6   AIC:                       32.70
Df Residuals:                             4   BIC:                       32.29
Df Model:                                 1
Covariance Type:                  nonrobust
==============================================================================
                                coef   std err         t     P>|t|    [0.025    0.975]
--------------------------------------------------------------------------------------
const                        20.0000     2.646     7.559     0.002    12.654    27.346
R&D Expenditure (Millions)    2.0000     0.458     4.364     0.012     0.728     3.272
==============================================================================
Omnibus:                  nan   Durbin-Watson:             1.500
Prob(Omnibus):            nan   Jarque-Bera (JB):          0.327
Skew:                  -0.000   Prob(JB):                  0.849
Kurtosis:               1.857   Cond. No.                   11.8
==============================================================================

Notes:
[1] Standard Errors assume that the covariance matrix of the errors is correctly specified.


C:\Users\91941\anaconda3\lib\site-packages\statsmodels\stats\stattools.py:74: ValueWarning: omni_normtest is not valid with less than 8 observations; 6 samples were given.
  warn("omni_normtest is not valid with less than 8 observations; %i "

In [8]: lm = smf.ols(formula='AP ~ RDE', data=df).fit()
        lm.summary()

Out[8]:
                        OLS Regression Results
==============================================================================
Dep. Variable:                 AP   R-squared:                  0.871
Model:                        OLS   Adj. R-squared:             0.849
Method:             Least Squares   F-statistic:                40.42
Date:            Wed, 23 Oct 2024   Prob (F-statistic):      0.000710
Time:                    18:48:59   Log-Likelihood:           -18.166
No. Observations:               8   AIC:                        40.33
Df Residuals:                   6   BIC:                        40.49
Df Model:                       1
Covariance Type:        nonrobust
==============================================================================
               coef   std err        t    P>|t|   [0.025   0.975]
------------------------------------------------------------------
Intercept   20.1579     2.094    9.626    0.000   15.034   25.282
RDE          1.9737     0.310    6.358    0.001    1.214    2.733
==============================================================================
Omnibus:          0.039   Durbin-Watson:      1.564
Prob(Omnibus):    0.980   Jarque-Bera (JB):   0.151
Skew:            -0.053   Prob(JB):           0.927
Kurtosis:         2.336   Cond. No.            15.0
==============================================================================

Notes:
[1] Standard Errors assume that the covariance matrix of the errors is correctly specified.
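Rather than reading values off the printed summary, each statistic is available as an attribute of the fitted results object. A small sketch, using the lm fitted in In [8]:

    print(lm.rsquared)     # 0.871
    print(lm.params)       # Intercept 20.1579, RDE 1.9737
    print(lm.pvalues)      # p-values for each coefficient
    print(lm.conf_int())   # 95% confidence intervals, as in the summary table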

In [9]: data = pd.read_excel("Store_Data.xlsx")
        data.head()

Out[9]:
   Bars  Price  Promotion
0  4141     59        200
1  3842     59        200
2  3056     59        200
3  3519     59        200
4  4226     59        400


In [10]: data.describe()

Out[10]:
              Bars      Price   Promotion
count    34.000000  34.000000   34.000000
mean   3098.676471  77.823529  388.235294
std    1256.422018  16.286210  162.862102
min     675.000000  59.000000  200.000000
25%    2125.250000  59.000000  200.000000
50%    3430.500000  79.000000  400.000000
75%    3968.750000  99.000000  600.000000
max    5120.000000  99.000000  600.000000

In [11]: lm = smf.ols(formula='Bars ~ Price + Promotion', data=data).fit()
         print(lm.params)

Intercept 5837.520759
Price -53.217336
Promotion 3.613058
dtype: float64
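For a multiple regression, these coefficients solve the normal equations w = (XᵀX)⁻¹Xᵀy. A minimal sketch that recovers them from the design matrix statsmodels stores on the fitted model:

    # lm.model.exog holds the design matrix [1, Price, Promotion];
    # lm.model.endog holds the response (Bars)
    w, *_ = np.linalg.lstsq(lm.model.exog, lm.model.endog, rcond=None)
    print(w)   # [5837.52, -53.22, 3.61], matching lm.params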


In [12]: lm.summary()

Out[12]:
                        OLS Regression Results
==============================================================================
Dep. Variable:               Bars   R-squared:                  0.758
Model:                        OLS   Adj. R-squared:             0.742
Method:             Least Squares   F-statistic:                48.48
Date:            Wed, 23 Oct 2024   Prob (F-statistic):      2.86e-10
Time:                    18:52:31   Log-Likelihood:           -266.26
No. Observations:              34   AIC:                        538.5
Df Residuals:                  31   BIC:                        543.1
Df Model:                       2
Covariance Type:        nonrobust
==============================================================================
                 coef   std err        t    P>|t|     [0.025     0.975]
------------------------------------------------------------------------
Intercept   5837.5208   628.150    9.293    0.000   4556.400   7118.642
Price        -53.2173     6.852   -7.766    0.000    -67.193    -39.242
Promotion      3.6131     0.685    5.273    0.000      2.216      5.011
==============================================================================
Omnibus:          1.418   Durbin-Watson:      2.282
Prob(Omnibus):    0.492   Jarque-Bera (JB):   0.486
Skew:            -0.034   Prob(JB):           0.784
Kurtosis:         3.582   Cond. No.        2.45e+03
==============================================================================

Notes:
[1] Standard Errors assume that the covariance matrix of the errors is correctly specified.
[2] The condition number is large, 2.45e+03. This might indicate that there are strong multicollinearity or other numerical problems.

In [13]: # Predicted average/mean sales for a price of 79 cents and promotional expenditure of 400
         lm.predict({'Price': 79, 'Promotion': 400})

Out[13]: 0 3078.574405
dtype: float64
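This is just the fitted plane evaluated at the given inputs; a quick hand check against lm.predict:

    # 5837.520759 - 53.217336 * 79 + 3.613058 * 400 = 3078.574...
    b = lm.params
    print(b['Intercept'] + b['Price'] * 79 + b['Promotion'] * 400)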
