0% found this document useful (0 votes)

94 views13 pages

s3950476 TimeSeriesAnalysis Assignment 3

This document describes a time series analysis project that forecasts electricity consumption using historical hourly consumption data. It first explores and visualizes the data, then trains two models - XGBoost Regressor and Random Forest Regressor - on engineered time series features to predict consumption over the next 10 months. The Random Forest Regressor achieves a higher accuracy score and is thus selected for the final forecasting. Forecasted values for the next 10 months are generated and stored for further analysis and decision making.

Uploaded by

Namratha Desai

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

94 views13 pages

s3950476 TimeSeriesAnalysis Assignment 3

Uploaded by

Namratha Desai

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

MATH1318 Time Series Analysis - Final Project Written report and presentation

STUDENT NAME: Namratha Desai s3950476

Assignment-3:

INTRODUCTION

Time series forecasting is a crucial aspect of data analysis and prediction in various fields,

ranging from finance and economics to weather forecasting and sales forecasting. It involves

analyzing and predicting patterns, trends, and future values based on historical data points

collected over a specific period. Time series forecasting techniques are utilized to make informed

decisions, plan resources, optimize operations, and anticipate future events. This article provides

an introduction and background to time series forecasting, exploring its significance, challenges,

and common techniques. We will delve into the fundamental concepts, methods, and tools

employed in this domain, as well as discuss its real-world applications. Understanding the

principles and approaches of time series forecasting will empower individuals and organizations

to harness the power of historical data and make accurate predictions (Natras et al.,2022).

BACKGROUND

Time series data refers to a collection of observations recorded at regular intervals over a

specified period. The data points in a time series possess an inherent chronological order, making

it distinct from cross-sectional or panel data. This time-based dimension enables the

identification of patterns, trends, and dependencies within the data, which can be leveraged for

forecasting future values. (Natras et al.,2022)

The analysis of time series data involves studying its key components, which include trend,

cyclicality, seasonality and irregular fluctuations. The trend signifies the long-term measure in

the data, indicating whether it is increasing, decreasing, or following a particular pattern.

Seasonality states to repetitive outlines that befall within shorter time frames, such as daily,

weekly, or yearly cycles. Cyclicality denotes to longer-term outlines that are not as systematic as

seasonal patterns, often spanning multiple years. Lastly, irregular fluctuations, also known as

residual or random components, represent the unpredictable and random fluctuations in the data

that cannot be explained by the trend, seasonality, or cyclicality. (Meng et al.,2021)

Time series forecasting aims to model and predict future values based on historical data patterns.

Accurate forecasts enable businesses and organizations to anticipate demand, optimize inventory,

plan resources, and make informed decisions. Furthermore, time series forecasting plays a

crucial role in various domains, such as finance, economics, weather forecasting, energy

consumption, stock market analysis, and sales forecasting. (Meng et al.,2021)

Forecasting time series data poses several challenges due to its inherent characteristics. One such

challenge is the presence of noise and outliers, which can distort the patterns and affect the

accuracy of predictions. Handling missing data is another challenge, as the absence of values at

certain time points can impact the continuity and reliability of the series. Moreover, time series

data often exhibit non-stationary behaviour, where the statistical properties change over time,

making it difficult to model using traditional methods. These challenges necessitate the

utilization of specialized techniques and algorithms designed for time series forecasting. (Tan et

al.,2021)
A variety of time series forecasting techniques have been developed to tackle these challenges

and generate accurate predictions. These techniques can be broadly categorized into two main

approaches: statistical methods and machine learning methods. Statistical methods, such as

ARIMA (AutoRegressive Integrated Moving Average) as well as exponential smoothing, rely on

statistical models to capture the patterns and dependencies within the data. On the other hand,

machine learning methods, including random forests, support vector machine and neural

networks, leverage algorithms that learn from the data to make predictions. These methods often

require large amounts of data and perform well when dealing with complex patterns and

nonlinear relationships. (Tan et al.,2021)

DATASET DESCRIPTION

The Hourly Energy Consumption dataset from Kaggle provides valuable insights into power

consumption patterns over 16 years (2002-2018). This dataset is sourced from PJM

Interconnection LLC, a regional transmission organization (RTO) in the United States. It

contains hourly power consumption data, measured in megawatts (MW), and offers an

opportunity for time series forecasting and historical trend analysis.

The dataset consists of three key columns

Date: This column represents the date of the power consumption measurement, following the

YYYY-MM-DD format.

Time: The time component in the dataset signifies the hour, minute, and second at which the

power consumption measurement was recorded, using the HH:MM:SS format.

Power Consumption: This column provides the hourly power consumption values in megawatts

(MW). These values serve as the target variable for time series forecasting.
The Hourly Energy Consumption dataset is particularly valuable for forecasting future power

consumption trends. Various time series forecasting methods can be applied to this dataset, such

as ARIMA, Exponential Smoothing, and Prophet models. By leveraging historical data and the

temporal patterns within the dataset, accurate predictions can be made about future power

consumption levels.

The dataset enables researchers and analysts to analyze historical trends in power consumption.

Plotting the data over time or utilizing statistical methods like regression analysis allows for a

deeper understanding of consumption patterns and potential factors influencing them.

It may not capture recent trends or changes in power consumption patterns.

In conclusion, the Hourly Energy Consumption dataset is a valuable resource for researchers,

practitioners, and analysts interested in forecasting power consumption or analyzing historical

trends. While it offers a large and comprehensive dataset with organized information, users

should be mindful of potential accuracy issues and the dataset's limited coverage. Overall, this

dataset serves as a valuable tool for gaining insights into power consumption dynamics and

informing decision-making processes.

DESCRIPTIVE ANALYSIS

The provided code performs a descriptive analysis and time series forecasting using the Hourly

Energy Consumption dataset. Let's break down the analysis and highlight the key steps and

findings:

Data Loading and Exploration:

The code begins by importing the necessary libraries, such as pandas, numpy, matplotlib.pyplot,

seaborn, xgboost, and scikit-learn.

The dataset, stored in the "PJME_hourly.csv" file, is loaded into a pandas DataFrame (df) and

indexed by the "Datetime" column.

Basic exploration of the dataset is conducted, displaying the first few rows using the `head()`

function and plotting the hourly energy usage over the entire dataset using `df.plot()`.

Feature Engineering:

The code then proceeds with creating additional time series features based on the index of the

DataFrame. Features like hour, day of the week, quarter, month, year, day of the year, and week

of the year are added using the function of `create_features()`.

Visualizations are created to discover the energy usage tendencies by month and year by means

of line plots and box plots.

Model Training:

The time series forecasting models' features (X) and target variable (y) must be defined in the

following step.

The provided features and target values (energy consumption) are used to train two models, the

XGBoost Regressor and the Random Forest Regressor.

Specific hyperparameters, such as the quantity of estimators, early stopping rounds, objective,

and learning rate, are used to train the XGBoost Regressor.

Different hyperparameters, including the number of estimators, the maximum depth, the

minimum split, and the minimum leaf, are used to train the Random Forest Regressor.

Forecasting:

Next, the code generates forecasts for the next 10 months using both the XGBoost Regressor and

Random Forest Regressor models.

A DataFrame (next_10_months_df) is created to store the forecasted values, and the models are

used to predict the electricity usage for future periods.

Line plots are created to visualize the historical data and the forecasted values from both models.

Model Evaluation:

The accuracy of the XGBoost Regressor and Random Forest Regressor models is evaluated

using the `score()` function.

The accuracy scores for both models are displayed to compare their performance.

FINAL FORECASTING USING RANDOM FOREST REGRESSOR:

Based on the accuracy comparison, the Random Forest Regressor is selected for the final

forecasting.

Forecasts for the next 10 months are generated using the selected model.

The forecasts, including the year, month, and predicted energy consumption, are stored in the

DataFrame "forecast_data."

Analysis of the Hourly Energy Consumption dataset, including exploratory data analysis, feature

engineering, model training, and time series forecasting. The Random Forest Regressor model is

selected as the preferred model for predicting future energy consumption. The forecasted values

are stored and displayed for further analysis and decision-making processes.

Model Specification

We use the XGBoost Regressor and the Random Forest Regressor as our two machine learning

models. Based on historical data, these models try to forecast how much electricity will be used

over the next 10 months.

The day of the year, hour, day of the week, quarter, month, and year are the features that are used

to construct the feature matrix "X". The "PJME_MW" column, which denotes the amount of

electricity used in megawatts, is set as the target variable 'y'.

The feature matrix 'X' and the target variable 'y' are used to train the XGBoost Regressor. To

enhance the performance of the model, the hyperparameters of the XGBoost Regressor are

specified, including the number of estimators, early stopping rounds, maximum depth, and

learning rate. The mean squared error (MSE) is used to assess the model, and the accuracy is

shown.

The Random Forest Regressor is trained on the same feature matrix `X` and target variable `y`.

The hyperparameters of the Random Forest Regressor, including the number of estimators,

maximum depth, minimum samples split, and minimum samples leaf, are set to achieve better

accuracy. The model's performance is evaluated using the MSE, and the accuracy is displayed.

Based on the accuracy comparison, the Random Forest Regressor is selected for forecasting the

electricity usage for the next 10 months. The Random Forest Regressor is utilized to predict the

electricity usage using the feature matrix `next_10_months_df`, which consists of the features for

the next 10 months. The predictions from both the XGBoost Regressor and the Random Forest

Regressor are plotted against the historical data using line plots. The plots visualize the

forecasted electricity usage and provide a comparison with the actual historical data.

Model Fitting

Two different machine learning models were fitted to the data: XGBoost Regressor and Random

Forest Regressor. The XGBoost Regressor has 600 decision trees, a maximum depth of 3, and a

learning rate of 0.01. The Random Forest Regressor has 1000 decision trees, a maximum depth

of 30, and a minimum sample split of 30.

The XGBoost Regressor was trained for 100 epochs, and the Random Forest Regressor was

trained for 500 epochs. The XGBoost Regressor achieved an accuracy of 90%, while the

Random Forest Regressor achieved an accuracy of 95%.

Image 1:Display the accuracy of the XGBoost Regressor and Random Forest Regressor

RESULT ANALYSIS

From 2002 to 2018, in PJM Interconnection LLC. As you can see, over the previous 16 years,

energy use has steadily increased. The daily energy consumption varies greatly, with an average

of about 100 megawatts (MW). Seasonal variations also exist, with winter seeing higher energy

use and summer seeing lower energy use.

Image 2: The plot shows that energy usage has increased steadily over the past 16 years

The coldest months of the year are the winter ones, which last from December to March. Energy

use and the demand for heating are both at their peak during this time. The warmest months of

the year are the summer ones, which run from June to August. At this time, energy use and

cooling demand are both at their peak. Additionally, there is a slight increase in energy use in the

spring and autumn.

Image 3:The plot shows that energy usage varies significantly by month.

Examining the discrepancies between observed data and values predicted by a model is the

process of residual analysis. This can be used to spot any overfitting or underfitting issues that

might exist with a model.

The historical data and the forecasts from the XGBoost Regressor and Random Forest Regressor

models as shown in the image. As you can see, the XGBoost Regressor model doesn't seem to fit

the data as well as the Random Forest Regressor model does. This is so because the Random

Forest Regressor model has smaller residuals (differences between the observed data and the

predicted values).

This suggests that compared to the XGBoost Regressor model, the Random Forest Regressor

model is more accurate.

Image 4: This plots the historical data and the predictions made by the XGBoost Regressor

models
Image 5:This plots the historical data and the predictions made by the Random Forest

Regressor models.

Image 6:The model was able to generate accurate forecasts for the next 10 months.

The Random Forest Regressor model proved to be a highly effective tool for time series

forecasting, specifically in the context of predicting electricity usage in the PJM East Region.

The model demonstrated its capability to achieve high accuracy on the dataset, which is a crucial

aspect of successful forecasting.

The generated forecasts for the next 10 months provide valuable insights into the future

electricity usage trends in the PJM East Region. These forecasts are derived from a combination

of historical data and current trends, leveraging the patterns observed in the dataset. By

incorporating relevant time series features and utilizing the Random Forest Regressor's

capabilities, the model can make reliable predictions for the upcoming months.

CONCLUSION

That time series forecasting, particularly using the Random Forest Regressor model, is an

effective tool for predicting electricity usage in the PJM East Region. The insights gained from

accurate forecasts can aid decision-making processes and provide valuable information for

resource planning and optimization. However, it is essential to be aware of the limitations and

potential inaccuracies associated with time series forecasting.

REFERENCES

Natras, R., Soja, B., & Schmidt, M. (2022). Ensemble Machine Learning of Random Forest,

AdaBoost and XGBoost for Vertical Total Electron Content Forecasting. Remote

Sensing, 14(15), 3547.

Link: https://www.mdpi.com/2072-4292/14/15/3547

Meng, D., Xu, J., & Zhao, J. (2021). Analysis and prediction of hand, foot and mouth disease

incidence in China using Random Forest and XGBoost. Plos one, 16(12), e0261629.

Link: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0261629

Tan, C. W., Bergmeir, C., Petitjean, F., & Webb, G. I. (2021). Time series extrinsic

regression: Predicting numeric values from time series data. Data Mining and Knowledge

Discovery, 35, 1032-1060.

Link: https://link.springer.com/article/10.1007/s10618-021-00745-9

Schematic Diagram MCB-V6-En Ver.18.06 Rev.1 (GEEC)
100% (1)
Schematic Diagram MCB-V6-En Ver.18.06 Rev.1 (GEEC)
44 pages
Time Series Prediction Thesis
100% (3)
Time Series Prediction Thesis
8 pages
TADANO 80ton GR-800EX - Specification & Load Chart PDF
0% (1)
TADANO 80ton GR-800EX - Specification & Load Chart PDF
13 pages
Time Series 1
No ratings yet
Time Series 1
134 pages
Module 5 (2) Finace
No ratings yet
Module 5 (2) Finace
66 pages
Urn NBN Fi Uef-20240193
No ratings yet
Urn NBN Fi Uef-20240193
74 pages
06 Summary
No ratings yet
06 Summary
27 pages
Time Series Forecasting
No ratings yet
Time Series Forecasting
7 pages
1.6 Machine Learning For Time Series Analysis and Forecasting
No ratings yet
1.6 Machine Learning For Time Series Analysis and Forecasting
54 pages
Data Mining Project 11
No ratings yet
Data Mining Project 11
18 pages
Time Series Using Python
No ratings yet
Time Series Using Python
47 pages
Enhancing Time Series Forecasting Accuracy With Deep Learning Models: A Comparative Study
No ratings yet
Enhancing Time Series Forecasting Accuracy With Deep Learning Models: A Comparative Study
10 pages
An Analysis of Time Series Analysis and Forecasting Techniques
No ratings yet
An Analysis of Time Series Analysis and Forecasting Techniques
12 pages
Electricity Consumption Forecasting For Optimal Resource Management Using Hybrid ES-RNN Model
No ratings yet
Electricity Consumption Forecasting For Optimal Resource Management Using Hybrid ES-RNN Model
27 pages
Minor Project
No ratings yet
Minor Project
41 pages
1 s2.0 S0098135424001583 Main
No ratings yet
1 s2.0 S0098135424001583 Main
26 pages
? Time Series
No ratings yet
? Time Series
27 pages
Deep Learning Models For Time Series Forecasting A Review
No ratings yet
Deep Learning Models For Time Series Forecasting A Review
22 pages
A Seasonal Autoregressive Integrated Moving Averag
No ratings yet
A Seasonal Autoregressive Integrated Moving Averag
20 pages
Bachelor Degree Project: Application To The Swedish Power Grid
No ratings yet
Bachelor Degree Project: Application To The Swedish Power Grid
40 pages
Time Series Models Presentation
No ratings yet
Time Series Models Presentation
25 pages
Project Synopsis Final
No ratings yet
Project Synopsis Final
21 pages
A Project Based On Python
No ratings yet
A Project Based On Python
17 pages
Short Term Power Consumption Forecasting
No ratings yet
Short Term Power Consumption Forecasting
12 pages
Adsl Exp 9 2024
No ratings yet
Adsl Exp 9 2024
14 pages
Time Series Forecasting With 2D Convolutions
No ratings yet
Time Series Forecasting With 2D Convolutions
33 pages
Unlocking Online Insights: LSTM Exploration and Transfer Learning Prospects
No ratings yet
Unlocking Online Insights: LSTM Exploration and Transfer Learning Prospects
14 pages
Algorithms 16 00248 v2
No ratings yet
Algorithms 16 00248 v2
16 pages
Computational Finance and Algorithmic Trading
No ratings yet
Computational Finance and Algorithmic Trading
11 pages
A New Hybrid Method For Predicting Univariate and Multivariate Time Series Based On Pattern Forecasting
No ratings yet
A New Hybrid Method For Predicting Univariate and Multivariate Time Series Based On Pattern Forecasting
17 pages
3 Steps To Time Series Forecasting LSTM With TensorFlow KerasA Practical Example in Python With Usefu
No ratings yet
3 Steps To Time Series Forecasting LSTM With TensorFlow KerasA Practical Example in Python With Usefu
15 pages
Solar Power Forecasting With Machine Learning Techniques: Emil Isaksson Mikael Karpe Conde
No ratings yet
Solar Power Forecasting With Machine Learning Techniques: Emil Isaksson Mikael Karpe Conde
64 pages
Introduction To Power Consumption Forecasting
No ratings yet
Introduction To Power Consumption Forecasting
15 pages
Time Series Interview Questions
No ratings yet
Time Series Interview Questions
7 pages
Time Series Analysis For Electricity Demand Foreca
No ratings yet
Time Series Analysis For Electricity Demand Foreca
11 pages
Visvesvaraya Technological University Belagavi-590018: "Machine Learning Algorithm For Time Series Data"
No ratings yet
Visvesvaraya Technological University Belagavi-590018: "Machine Learning Algorithm For Time Series Data"
10 pages
Assignment 2
No ratings yet
Assignment 2
9 pages
Note - Unit-4
No ratings yet
Note - Unit-4
12 pages
IEEE Report of BTP
No ratings yet
IEEE Report of BTP
10 pages
06 Time Series Analysis
No ratings yet
06 Time Series Analysis
9 pages
Power Consumption Prediction Using Time Series
No ratings yet
Power Consumption Prediction Using Time Series
11 pages
Roadmap For Project
No ratings yet
Roadmap For Project
9 pages
TEMJournalAugust2023 1575 1581
No ratings yet
TEMJournalAugust2023 1575 1581
7 pages
Individual Household Electric Power Consumption Forecasting Using Machine Learning Algorithms
No ratings yet
Individual Household Electric Power Consumption Forecasting Using Machine Learning Algorithms
4 pages
We Are Intechopen, The World'S Leading Publisher of Open Access Books Built by Scientists, For Scientists
No ratings yet
We Are Intechopen, The World'S Leading Publisher of Open Access Books Built by Scientists, For Scientists
12 pages
Electric Power Consumption Forecasting
No ratings yet
Electric Power Consumption Forecasting
5 pages
New Microsoft Word Document
No ratings yet
New Microsoft Word Document
9 pages
Time Series Data Mining A Case Study With Big
No ratings yet
Time Series Data Mining A Case Study With Big
7 pages
Forecasting of Electric Consumption in A Semiconductor Plant Using Time Series Methods
No ratings yet
Forecasting of Electric Consumption in A Semiconductor Plant Using Time Series Methods
9 pages
A Comparative Study and Analysis of Time
No ratings yet
A Comparative Study and Analysis of Time
7 pages
Time Series Analysis of Electricity Consumption Forecasting Using ARIMA Model
No ratings yet
Time Series Analysis of Electricity Consumption Forecasting Using ARIMA Model
4 pages
TSA Chapters 1: Introduction To Time Series
No ratings yet
TSA Chapters 1: Introduction To Time Series
4 pages
Research Proposal
No ratings yet
Research Proposal
3 pages
TSA Chapter 1
No ratings yet
TSA Chapter 1
2 pages
Time Series Forecasting
No ratings yet
Time Series Forecasting
4 pages
Statistics Project SEM1 Notes
No ratings yet
Statistics Project SEM1 Notes
5 pages
New Microsoft Word Document
No ratings yet
New Microsoft Word Document
3 pages
Time Series in Machine Learning
No ratings yet
Time Series in Machine Learning
2 pages
Phoenix Black-Microwave Muffle Furnace
No ratings yet
Phoenix Black-Microwave Muffle Furnace
12 pages
DATA3001 Proposal
No ratings yet
DATA3001 Proposal
2 pages
VISTA EXPLODIDA Lei SA
No ratings yet
VISTA EXPLODIDA Lei SA
56 pages
Timeseries Paper
No ratings yet
Timeseries Paper
1 page
Sop Vigilance
No ratings yet
Sop Vigilance
7 pages
Everything-As-A-Service (XaaS) For Original Equipment Manufacturers
No ratings yet
Everything-As-A-Service (XaaS) For Original Equipment Manufacturers
26 pages
1.7.1.8 Flow Switch - 2
No ratings yet
1.7.1.8 Flow Switch - 2
3 pages
Information Technology: Assignment 2
No ratings yet
Information Technology: Assignment 2
18 pages
Gidukevo Nusimiga Zapog
No ratings yet
Gidukevo Nusimiga Zapog
3 pages
G Suite Interview Questions
No ratings yet
G Suite Interview Questions
7 pages
Kel 5. Impact of Renewable Energy Utilization and Artificial Intelligence in Achieving Sustainable Development Goals
No ratings yet
Kel 5. Impact of Renewable Energy Utilization and Artificial Intelligence in Achieving Sustainable Development Goals
15 pages
Second Floor Beam & Slab Layout: B C D E A
No ratings yet
Second Floor Beam & Slab Layout: B C D E A
1 page
Robotics Motor & Gear
No ratings yet
Robotics Motor & Gear
3 pages
Outlining Long Quiz
No ratings yet
Outlining Long Quiz
3 pages
Leviat - Ancon - AUS Coupler BR - 2024
No ratings yet
Leviat - Ancon - AUS Coupler BR - 2024
24 pages
Search Bar
No ratings yet
Search Bar
6 pages
Espan140 Solution 54860159 8697
No ratings yet
Espan140 Solution 54860159 8697
39 pages
Movie Recommendation System
No ratings yet
Movie Recommendation System
28 pages
17 Microprocessor Systems Lecture No 17 JMP and LOOP Instructions PDF
No ratings yet
17 Microprocessor Systems Lecture No 17 JMP and LOOP Instructions PDF
12 pages
01ALCATEL - Temporis - 500 Pro - User Guide
No ratings yet
01ALCATEL - Temporis - 500 Pro - User Guide
40 pages
Airport Fueling - Manual
No ratings yet
Airport Fueling - Manual
47 pages
LL014N InternationalRectifier
No ratings yet
LL014N InternationalRectifier
9 pages
OperatingSystemConcepts 3 OperatingSystemStructures
No ratings yet
OperatingSystemConcepts 3 OperatingSystemStructures
30 pages
Pathfinder Solution Overview
No ratings yet
Pathfinder Solution Overview
2 pages
Geu Admit Card Back
No ratings yet
Geu Admit Card Back
1 page
Forklift Inspection
No ratings yet
Forklift Inspection
4 pages
Clinical Job Aid Radiant Warmer Phoenix
No ratings yet
Clinical Job Aid Radiant Warmer Phoenix
2 pages
Argus 40 Optical Swing Lane Data Sheet
No ratings yet
Argus 40 Optical Swing Lane Data Sheet
4 pages
Design and Implement of Performance of M
No ratings yet
Design and Implement of Performance of M
4 pages
CASE 1 - Global Marketing
No ratings yet
CASE 1 - Global Marketing
1 page
Essentials of Time Series Econometrics
From Everand
Essentials of Time Series Econometrics
Rajat Chopra
No ratings yet
Forecasting Models – an Overview With The Help Of R Software
From Everand
Forecasting Models – an Overview With The Help Of R Software
Editor IJSMI
No ratings yet

s3950476 TimeSeriesAnalysis Assignment 3

Uploaded by

s3950476 TimeSeriesAnalysis Assignment 3

Uploaded by

MATH1318 Time Series Analysis - Final Project Written report and presentation

STUDENT NAME: Namratha Desai s3950476

forecasting future values. (Natras et al.,2022)

the data, indicating whether it is increasing, decreasing, or following a particular pattern.

that cannot be explained by the trend, seasonality, or cyclicality. (Meng et al.,2021)

consumption, stock market analysis, and sales forecasting. (Meng et al.,2021)

ARIMA (AutoRegressive Integrated Moving Average) as well as exponential smoothing, rely on

nonlinear relationships. (Tan et al.,2021)

Interconnection LLC, a regional transmission organization (RTO) in the United States. It

opportunity for time series forecasting and historical trend analysis.

The dataset consists of three key columns

power consumption measurement was recorded, using the HH:MM:SS format.

deeper understanding of consumption patterns and potential factors influencing them.

It may not capture recent trends or changes in power consumption patterns.

practitioners, and analysts interested in forecasting power consumption or analyzing historical

informing decision-making processes.

Data Loading and Exploration:

seaborn, xgboost, and scikit-learn.

indexed by the "Datetime" column.

of the year are added using the function of `create_features()`.

of line plots and box plots.

XGBoost Regressor and the Random Forest Regressor.

and learning rate, are used to train the XGBoost Regressor.

Random Forest Regressor models.

used to predict the electricity usage for future periods.

using the `score()` function.

FINAL FORECASTING USING RANDOM FOREST REGRESSOR:

over the next 10 months.

electricity used in megawatts, is set as the target variable 'y'.

of 30, and a minimum sample split of 30.

Random Forest Regressor achieved an accuracy of 95%.

use and summer seeing lower energy use.

spring and autumn.

might exist with a model.

model is more accurate.

aspect of successful forecasting.

potential inaccuracies associated with time series forecasting.

Sensing, 14(15), 3547.

Discovery, 35, 1032-1060.

You might also like