0% found this document useful (0 votes)

14 views3 pages

Electrical Machine Learning Tool

Machine learning data for Electrical engine

Uploaded by

Martins Richmond

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views3 pages

Electrical Machine Learning Tool

Machine learning data for Electrical engine

Uploaded by

Martins Richmond

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

In [20]:

# Importing necessary libraries

import pandas as pd # Used for data manipulation and handling
import numpy as np # Useful for numerical operations
from sklearn.model_selection import train_test_split # For splitting the data
from sklearn.linear_model import LinearRegression # The ML model we will use
from sklearn.metrics import mean_squared_error, r2_score # For evaluating the

In [2]:

# Load the dataset

df = pd.read_csv('Electricity_Consumption_Dataset.csv')

In [4]:
df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 5000 entries, 0 to 4999
Data columns (total 6 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Date 5000 non-null object
1 Hour 5000 non-null int64
2 Number_of_Appliances 4700 non-null float64
3 Usage_Duration 4700 non-null float64
4 Peak_Usage 4800 non-null float64
5 Electricity_Consumption 5000 non-null float64
dtypes: float64(4), int64(1), object(1)
memory usage: 234.5+ KB

In [5]:
df.head()

Out [5]:
Date Hour Number_of_Appliances Usage_Duration Peak_Usage Electricity_Consumption
2023-
0 0 5.0 1.118288 0.0 4.935174
01-01
2023-
1 1 4.0 1.737984 0.0 7.495992
01-01
2023-
2 2 4.0 3.350142 0.0 11.460053
01-01
2023-
3 3 5.0 4.893616 0.0 28.588596
01-01
2023-
4 4 5.0 1.030203 0.0 4.929359
01-01

In [6]:
df.describe()

Out [6]:
Hour Number_of_Appliances Usage_Duration Peak_Usage Electricity_Consumption
count 5000.000000 4700.000000 4700.000000 4800.000000 5000.000000
mean 11.487200 5.020426 2.750078 0.125000 15.082294
std 6.925332 2.220455 1.295222 0.330753 11.412605
min 0.000000 0.000000 0.500025 0.000000 0.000000
25% 5.000000 3.000000 1.613859 0.000000 6.768928
50% 11.000000 5.000000 2.751787 0.000000 12.560582
75% 17.000000 6.000000 3.854879 0.000000 20.362971
max 23.000000 15.000000 4.999052 1.000000 121.078379
In [7]:

# Data Preprocessing
# -------------------
# Convert 'Date' to datetime type for any time series analysis necessity
df['Date'] = pd.to_datetime(df['Date'])

In [11]:

# Handling missing values by filling them with the median of the column
for column in ['Number_of_Appliances', 'Usage_Duration', 'Peak_Usage']:
if df[column].isnull().any():
df[column].fillna(df[column].median(), inplace=True)

df.head()

Out [11]:
Date Hour Number_of_Appliances Usage_Duration Peak_Usage Electricity_Consumption
2023-
0 0 5.0 1.118288 0.0 4.935174
01-01
2023-
1 1 4.0 1.737984 0.0 7.495992
01-01
2023-
2 2 4.0 3.350142 0.0 11.460053
01-01
2023-
3 3 5.0 4.893616 0.0 28.588596
01-01
2023-
4 4 5.0 1.030203 0.0 4.929359
01-01

In [12]:

# Feature Engineering (if needed)

# -------------------------------
# For example, creating new features that might help improve model performance
# Here, we can think of extracting day of the week or month from the date if r
df['DayOfWeek'] = df['Date'].dt.dayofweek

In [13]:

# Modeling
# --------
# Define features and target variable
X = df[['Hour', 'Number_of_Appliances', 'Usage_Duration', 'Peak_Usage', 'DayOf
y = df['Electricity_Consumption']

In [14]:

# Split the data into train and test sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, rando

In [15]:

# Initialize the Linear Regression model

model = LinearRegression()

In [16]:

# Train the model

model.fit(X_train, y_train)
Out [16]: ▾ LinearRegression

LinearRegression()

In [17]:

# Predict on the test set

y_pred = model.predict(X_test)

In [21]:

# Evaluation
# ----------
# Calculate the Mean Squared Error and the R^2 score to evaluate the model
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)

In [22]:

print(f'Mean Squared Error (MSE): {mse}')

print(f'R-squared Score: {r2}')

# The MSE provides a measure of how well the model predictions approximate the
# The R-squared score is a statistical measure of how close the data are to th

Mean Squared Error (MSE): 34.27105705807505

R-squared Score: 0.7447340948875754

Data Cleaning - Cheatsheet
100% (2)
Data Cleaning - Cheatsheet
8 pages
Industrial Training Report
100% (5)
Industrial Training Report
42 pages
Ojukwu Chika Project - 0
No ratings yet
Ojukwu Chika Project - 0
101 pages
Asuquo IT Report M
100% (1)
Asuquo IT Report M
42 pages
Electrical Engineering Siwes Report
0% (1)
Electrical Engineering Siwes Report
2 pages
Energy Consumption Time Series Forcasting 1681824033
No ratings yet
Energy Consumption Time Series Forcasting 1681824033
14 pages
Pandas Roadmap
No ratings yet
Pandas Roadmap
6 pages
fixed random trắc nghiệm tự luận
100% (1)
fixed random trắc nghiệm tự luận
12 pages
Part A Assignment 6
No ratings yet
Part A Assignment 6
28 pages
Supermarket Sales Analysis Project
No ratings yet
Supermarket Sales Analysis Project
8 pages
EDA Cheat Sheet
No ratings yet
EDA Cheat Sheet
7 pages
Jkuat Mba Thesis
100% (3)
Jkuat Mba Thesis
5 pages
R Markdown File Mid
No ratings yet
R Markdown File Mid
13 pages
Regression Analysis and Modelling - Amar Sahay
No ratings yet
Regression Analysis and Modelling - Amar Sahay
93 pages
Interactive Data Analysis With Jupyter Cheatsheet 1731972443
No ratings yet
Interactive Data Analysis With Jupyter Cheatsheet 1731972443
10 pages
Life Asset
No ratings yet
Life Asset
78 pages
Annuities
No ratings yet
Annuities
4 pages
Time Series Forecasting
No ratings yet
Time Series Forecasting
7 pages
Python For Machine Learning
No ratings yet
Python For Machine Learning
66 pages
27 Jupyter Notebook
No ratings yet
27 Jupyter Notebook
42 pages
L6 and 7-Data Preprocessing-Coding
No ratings yet
L6 and 7-Data Preprocessing-Coding
34 pages
Tutorial - Time Series Analysis With Pandas - Dataquest
No ratings yet
Tutorial - Time Series Analysis With Pandas - Dataquest
32 pages
Group-3 Report
No ratings yet
Group-3 Report
38 pages
Time Series Visualization From Raw Data To Insights
No ratings yet
Time Series Visualization From Raw Data To Insights
34 pages
Pandas Fuction Notes
No ratings yet
Pandas Fuction Notes
3 pages
Pandas Module (Part-I)
No ratings yet
Pandas Module (Part-I)
36 pages
Task2 Eda Cleaning
No ratings yet
Task2 Eda Cleaning
33 pages
CP1 Study Guide 2025
No ratings yet
CP1 Study Guide 2025
22 pages
Topic 1v5
No ratings yet
Topic 1v5
34 pages
Econometrics For Management Assignment
No ratings yet
Econometrics For Management Assignment
3 pages
Individual Household Electric Power Consumption
No ratings yet
Individual Household Electric Power Consumption
29 pages
bài tập
No ratings yet
bài tập
4 pages
Linear Regression and SVR
No ratings yet
Linear Regression and SVR
25 pages
DAP Writeups - Merged
No ratings yet
DAP Writeups - Merged
33 pages
Actuary India April 2015
No ratings yet
Actuary India April 2015
28 pages
Machine Exercise 3
No ratings yet
Machine Exercise 3
22 pages
Solar Power Generation Forecasting in Europe A Time Series Analysis
No ratings yet
Solar Power Generation Forecasting in Europe A Time Series Analysis
19 pages
Load Prediction With 20 Models
No ratings yet
Load Prediction With 20 Models
19 pages
Stochastic Modelling
No ratings yet
Stochastic Modelling
18 pages
Disaggregation Using Nilmtk PDF
No ratings yet
Disaggregation Using Nilmtk PDF
12 pages
Project Intern - Jupyter Notebook
No ratings yet
Project Intern - Jupyter Notebook
16 pages
Data Exploration Preparation
No ratings yet
Data Exploration Preparation
12 pages
Python Scripts For Machine Learning
No ratings yet
Python Scripts For Machine Learning
13 pages
Co Digit Ooo
No ratings yet
Co Digit Ooo
15 pages
Kedar Maheshwari
No ratings yet
Kedar Maheshwari
17 pages
Extensive Reading 05
No ratings yet
Extensive Reading 05
15 pages
Iai Brochure College 1
No ratings yet
Iai Brochure College 1
8 pages
Sample Report
No ratings yet
Sample Report
17 pages
Solar Data
No ratings yet
Solar Data
15 pages
Pandas 1
No ratings yet
Pandas 1
13 pages
s3950476 TimeSeriesAnalysis Assignment 3
No ratings yet
s3950476 TimeSeriesAnalysis Assignment 3
13 pages
Evaluation of Electrical Power
No ratings yet
Evaluation of Electrical Power
16 pages
Reading 4
No ratings yet
Reading 4
15 pages
Lab Exercise 2-CS0017
No ratings yet
Lab Exercise 2-CS0017
17 pages
Pandas Data Manipulation Extended CheatSheet 1731972219
No ratings yet
Pandas Data Manipulation Extended CheatSheet 1731972219
9 pages
WorkingWithData - Ipynb - Colaboratory
No ratings yet
WorkingWithData - Ipynb - Colaboratory
13 pages
Optimal Sizing and Placement of Capacitor Banks in Distribution Networks Using A Genetic Algorithm
No ratings yet
Optimal Sizing and Placement of Capacitor Banks in Distribution Networks Using A Genetic Algorithm
18 pages
Manufacturing Machine Learning Tool Mechanical
No ratings yet
Manufacturing Machine Learning Tool Mechanical
13 pages
Topic 5 Prediction With Many Regressors and Big Data (Part 1)
No ratings yet
Topic 5 Prediction With Many Regressors and Big Data (Part 1)
13 pages
Hasil Regresi
No ratings yet
Hasil Regresi
13 pages
4.8 Slides - Example Melanoma Mortality (Count)
No ratings yet
4.8 Slides - Example Melanoma Mortality (Count)
12 pages
1 Demand
No ratings yet
1 Demand
13 pages
Assignment 2
No ratings yet
Assignment 2
9 pages
Python2 Master
No ratings yet
Python2 Master
12 pages
Hammilton IFRS17
No ratings yet
Hammilton IFRS17
12 pages
Pandas Syntax Revision For ML
No ratings yet
Pandas Syntax Revision For ML
10 pages
Sunbase Data Assignment
No ratings yet
Sunbase Data Assignment
11 pages
Individual Household Electric Power Consumption Forecasting Using Machine Learning Algorithms
No ratings yet
Individual Household Electric Power Consumption Forecasting Using Machine Learning Algorithms
4 pages
Load Dataset: Import As
No ratings yet
Load Dataset: Import As
8 pages
Sample II
No ratings yet
Sample II
8 pages
Important Pandas Operations 1697910759
No ratings yet
Important Pandas Operations 1697910759
6 pages
Simulation of Capacitor Bank For Improvement of Voltage Profile at Distribution Center (Review)
No ratings yet
Simulation of Capacitor Bank For Improvement of Voltage Profile at Distribution Center (Review)
5 pages
Data Wrangling
No ratings yet
Data Wrangling
6 pages
Lecture Notes 1
No ratings yet
Lecture Notes 1
6 pages
Olah Data Eviews
No ratings yet
Olah Data Eviews
8 pages
Econometrics Summary
No ratings yet
Econometrics Summary
5 pages
Efficient Incremental Smart Grid Data Analytics: David Xi Cheng Wojciech Golab Paul A. S. Ward
No ratings yet
Efficient Incremental Smart Grid Data Analytics: David Xi Cheng Wojciech Golab Paul A. S. Ward
8 pages
6 +ARTIKEL+Nur+Rahma
No ratings yet
6 +ARTIKEL+Nur+Rahma
9 pages
Assignment 3
No ratings yet
Assignment 3
8 pages
Logistic Regression
No ratings yet
Logistic Regression
4 pages
Algorithm Current Situation
No ratings yet
Algorithm Current Situation
7 pages
Actuarial Analysis Dependent Lives: Ragnar
No ratings yet
Actuarial Analysis Dependent Lives: Ragnar
7 pages
Electric Power Consumption Forecasting
No ratings yet
Electric Power Consumption Forecasting
5 pages
Panel SM
No ratings yet
Panel SM
7 pages
General Insurance Reserving Actuarial Best Estimates and Proxy Methods
No ratings yet
General Insurance Reserving Actuarial Best Estimates and Proxy Methods
7 pages
Index
No ratings yet
Index
4 pages
Dedication: Table of Contents
No ratings yet
Dedication: Table of Contents
6 pages
Task 2 Exploratory Data Analysis
No ratings yet
Task 2 Exploratory Data Analysis
5 pages
Assignment
No ratings yet
Assignment
4 pages
Google Cluster Data Preprocessing - Updated
No ratings yet
Google Cluster Data Preprocessing - Updated
4 pages
Practical No. 09.ipynb - Colab
No ratings yet
Practical No. 09.ipynb - Colab
4 pages
PID987658
No ratings yet
PID987658
4 pages
Mortality Schedule of 1961, 1976, 1996 and 2016: Age Group
No ratings yet
Mortality Schedule of 1961, 1976, 1996 and 2016: Age Group
4 pages
Question Bank - PA
No ratings yet
Question Bank - PA
3 pages
Code
No ratings yet
Code
2 pages
Ele Pro
No ratings yet
Ele Pro
1 page
UCI Machine Learning Repository - Individual Household Electric Power Consumption Data Set
No ratings yet
UCI Machine Learning Repository - Individual Household Electric Power Consumption Data Set
1 page
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
TensorFlow深度学习项目实战: Chinese Edition
From Everand
TensorFlow深度学习项目实战: Chinese Edition
Posts & Telecom Press
No ratings yet

Electrical Machine Learning Tool

Uploaded by

Electrical Machine Learning Tool

Uploaded by

In [20]:

# Importing necessary libraries

# Load the dataset

# Feature Engineering (if needed)

# Split the data into train and test sets

# Initialize the Linear Regression model

# Train the model

# Predict on the test set

print(f'Mean Squared Error (MSE): {mse}')

Mean Squared Error (MSE): 34.27105705807505

You might also like