Exercise 5
Data Preprocessing: Normalizing the series with MinMaxScaler puts all values on a common scale, which the model needs in order to train effectively on the time series data.
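A minimal sketch of this step, assuming a univariate series held in a NumPy array (the variable names are illustrative):

    import numpy as np
    from sklearn.preprocessing import MinMaxScaler

    # Illustrative stand-in for the real series used in the exercise.
    series = np.arange(100, dtype=np.float32).reshape(-1, 1)

    scaler = MinMaxScaler(feature_range=(0, 1))
    scaled = scaler.fit_transform(series)  # all values now lie in [0, 1]

    # After prediction, outputs can be mapped back to the original scale:
    # original = scaler.inverse_transform(predictions)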
Model Architecture: Adding a linear layer (nn.Linear) on top of the LSTM output lets the model map the learned hidden features to the target, which can improve the prediction.
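A sketch of this kind of architecture, with illustrative layer sizes (the exercise's exact dimensions may differ):

    import torch.nn as nn

    class LSTMRegressor(nn.Module):
        def __init__(self, input_size=1, hidden_size=64, num_layers=1):
            super().__init__()
            self.lstm = nn.LSTM(input_size, hidden_size, num_layers, batch_first=True)
            self.fc = nn.Linear(hidden_size, 1)  # maps LSTM features to the target

        def forward(self, x):
            out, _ = self.lstm(x)          # out: (batch, seq_len, hidden_size)
            return self.fc(out[:, -1, :])  # predict from the last time step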
Training: The training loop shows a steady decline in loss, with clear drops at each 25-epoch checkpoint, indicating that the model is learning effectively.
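A minimal training loop of the kind described, assuming the LSTMRegressor sketched above and training tensors X_train and y_train (both names are assumptions):

    import torch
    import torch.nn as nn

    # X_train and y_train are assumed tensors of shape (batch, seq_len, 1) and (batch, 1).
    model = LSTMRegressor()
    criterion = nn.MSELoss()
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

    for epoch in range(200):
        optimizer.zero_grad()
        loss = criterion(model(X_train), y_train)
        loss.backward()
        optimizer.step()
        if (epoch + 1) % 25 == 0:  # log at each 25-epoch checkpoint
            print(f"Epoch {epoch + 1}, loss {loss.item():.4f}")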
Prediction: The resulting Mean Squared Error (MSE) of 0.2308 is adequate. The plot of predictions against the actual data shows that the model follows the trends and patterns in the series.
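The test MSE can be computed along these lines, assuming held-out tensors X_test and y_test and the fitted scaler from the preprocessing sketch (all assumptions):

    import torch

    # X_test, y_test, and scaler are assumed from the earlier sketches.
    model.eval()
    with torch.no_grad():
        preds = model(X_test)

    mse = torch.mean((preds - y_test) ** 2).item()
    print(f"Test MSE: {mse:.4f}")

    # For plotting against the actual data, map back to the original scale:
    # preds_orig = scaler.inverse_transform(preds.numpy())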
Exercise 2:
1) Multivariate Data Representation and LSTM Prediction: The historical Bitcoin market data comprises the following attributes:
High Price: The highest price reached by Bitcoin during a trading day.
Low Price: The lowest price reached by Bitcoin during a trading day.
The task of the LSTM network is then to analyse these features and predict Bitcoin's closing price for the next 50 days based on the last 100 days, as sketched below.
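One way to frame this as supervised windows, using the 100-day input and 50-day horizon described above (the function and variable names are illustrative, and data is assumed to be a scaled 2-D array of shape (n_days, n_features) with the close price in the last column):

    import numpy as np

    def make_windows(data, lookback=100, horizon=50, close_col=-1):
        """Slice a multivariate series into (input window, future closes) pairs."""
        X, y = [], []
        for i in range(len(data) - lookback - horizon + 1):
            X.append(data[i : i + lookback])  # all features over the last 100 days
            y.append(data[i + lookback : i + lookback + horizon, close_col])  # next 50 closes
        return np.array(X), np.array(y)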
2) Internal Correlation: Yes, there are strong correlations among the features in the dataset. For example, the opening and closing prices are generally close to each other, and the transaction volume depends on price movements. These interdependencies are important for what the LSTM learns and for its future price predictions; they can be checked directly with a correlation matrix, as in the sketch below.
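A sketch of that check (the DataFrame here is a synthetic stand-in for the real Bitcoin data, so the printed correlations are illustrative only):

    import numpy as np
    import pandas as pd

    # Synthetic stand-in for the real Bitcoin DataFrame loaded in the exercise.
    rng = np.random.default_rng(0)
    close = 30000 + rng.normal(0, 300, 365).cumsum()
    df = pd.DataFrame({
        "Open": close + rng.normal(0, 100, 365),
        "High": close + np.abs(rng.normal(0, 200, 365)),
        "Low": close - np.abs(rng.normal(0, 200, 365)),
        "Close": close,
        "Volume": rng.uniform(1e3, 1e4, 365),
    })

    # Open/High/Low correlate strongly with Close; Volume far less so here.
    print(df.corr()["Close"].sort_values(ascending=False))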
Data Normalization: Normalize all input features to a common scale so that training converges more reliably.
Model Architecture: Increase the number of LSTM layers or units to learn more complex patterns; for stacked LSTMs, a greater number of layers has been reported to improve performance (see the sketch after this list).
Hyperparameter Tuning: Try learning rates, batch sizes, and sequence lengths different from the above setting, and search for the optimal configuration.
Bidirectional LSTM: Use a bidirectional LSTM to capture patterns from both past and future context within each input window, as sketched below.
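A sketch combining two of these suggestions, a stacked and bidirectional LSTM (the feature count and forecast horizon are assumptions matching the setup described in part 1):

    import torch.nn as nn

    class BiLSTMForecaster(nn.Module):
        def __init__(self, n_features=5, hidden_size=64, horizon=50):
            super().__init__()
            # num_layers=2 stacks two LSTMs; bidirectional=True reads each window both ways.
            self.lstm = nn.LSTM(n_features, hidden_size, num_layers=2,
                                batch_first=True, bidirectional=True)
            self.fc = nn.Linear(2 * hidden_size, horizon)  # one output per forecast day

        def forward(self, x):
            out, _ = self.lstm(x)
            return self.fc(out[:, -1, :])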
Exercise 3:
This part handles data preprocessing: the data is imported and normalized with MinMaxScaler, reformatted into a supervised learning problem with a look-back period of 1, and split into training and test sets. Three models are then defined: an RNN model using a basic RNN layer, a GRU model with a GRU layer, and an LSTM model containing an LSTM layer.
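The look-back reshaping is commonly written as a helper like the following (a sketch, assuming the scaled series is a 2-D NumPy array with the values in column 0):

    import numpy as np

    def create_dataset(dataset, look_back=1):
        """Turn a scaled series into (X, y) pairs: X holds look_back past values, y the next value."""
        X, y = [], []
        for i in range(len(dataset) - look_back):
            X.append(dataset[i : i + look_back, 0])
            y.append(dataset[i + look_back, 0])
        return np.array(X), np.array(y)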
Training/Evaluation: Each model is trained with Mean Squared Error (MSE) loss using the Adam optimizer. After training, the models are evaluated on the test set: predictions are inverse-transformed back to the original scale and the Root Mean Square Error (RMSE) of the test predictions is computed.
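The RMSE step can be sketched as follows, assuming scaled prediction and target arrays preds and y_test of shape (n_samples, 1) and the fitted MinMaxScaler (all assumptions):

    import numpy as np

    # preds and y_test are assumed scaled arrays of shape (n_samples, 1);
    # scaler is the fitted MinMaxScaler from preprocessing.
    preds_orig = scaler.inverse_transform(preds)
    y_orig = scaler.inverse_transform(y_test)

    rmse = np.sqrt(np.mean((preds_orig - y_orig) ** 2))
    print(f"Test RMSE: {rmse:.4f}")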
Finally, the true vs. predicted values are plotted for each model, along with the total error summed over each test sequence.
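A plotting sketch of that comparison, assuming original-scale targets y_orig and a dict model_preds mapping model names to their predictions (both names are assumptions):

    import matplotlib.pyplot as plt

    # model_preds is assumed, e.g. {"RNN": rnn_preds, "GRU": gru_preds, "LSTM": lstm_preds}.
    plt.plot(y_orig, label="Actual")
    for name, p in model_preds.items():
        plt.plot(p, label=name)
    plt.xlabel("Test sequence index")
    plt.ylabel("Value")
    plt.legend()
    plt.show()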
Exercise 4a:
Intercept: -37.552166316589286
R-squared: 0.596623451962375
This dataset contains a total of 20,640 samples with no missing values, as shown by the zeros in the per-column missing-value counts. The summary statistics indicate that the dataset comprises numerical features such as 'MedInc' (median income) and 'HouseAge' (median house age), along with the target 'MedHouseVal' (median house value), among others.
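These checks follow the usual scikit-learn/pandas pattern; a sketch, assuming the exercise loads the California housing data this way:

    from sklearn.datasets import fetch_california_housing

    data = fetch_california_housing(as_frame=True)
    df = data.frame  # 20,640 rows: eight features plus the target 'MedHouseVal'

    print(df.isnull().sum())  # all zeros: no missing values
    print(df.describe())      # per-column summary statistics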
All feature coefficients are displayed, with the intercept estimated at -37.55. The feature coefficients are:
'MedInc': 0.4418
'HouseAge': 0.0097
'AveRooms': -0.1199
'AveBedrms': 0.7847
'Population': -0.0000003395
'AveOccup': -0.0033
'Latitude': -0.4237
'Longitude': -0.4393
The Mean Squared Error (MSE) is approximately 0.5385 and the R-squared is approximately 0.5966, meaning the model explains about 59.66% of the variance in the target variable 'MedHouseVal'.
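The coefficients and metrics above come from a standard ordinary-least-squares fit; a sketch using the df from the loading sketch above (the 80/20 split and random seed are assumptions, so the printed numbers will only roughly match the values quoted):

    from sklearn.linear_model import LinearRegression
    from sklearn.metrics import mean_squared_error, r2_score
    from sklearn.model_selection import train_test_split

    X = df.drop(columns="MedHouseVal")
    y = df["MedHouseVal"]
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

    reg = LinearRegression().fit(X_train, y_train)
    print("Intercept:", reg.intercept_)
    print(dict(zip(X.columns, reg.coef_)))

    y_pred = reg.predict(X_test)
    print("MSE:", mean_squared_error(y_test, y_pred))
    print("R^2:", r2_score(y_test, y_pred))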
The scatter plot shows how the actual values of 'MedHouseVal' relate to the predicted values; a perfectly fitted model would have its points clustered tightly along the line of perfect prediction.
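That plot can be reproduced along these lines, using the y_test and y_pred from the regression sketch above:

    import matplotlib.pyplot as plt

    plt.scatter(y_test, y_pred, alpha=0.3)
    lims = [y_test.min(), y_test.max()]
    plt.plot(lims, lims, "r--")  # line of perfect prediction
    plt.xlabel("Actual MedHouseVal")
    plt.ylabel("Predicted MedHouseVal")
    plt.show()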
Exercise 4b: