Stock Prediction Report
Abstract

A good approach to predict the future price of a stock, based on its past price and other factors, was investigated. In this paper we try to predict whether the price of a stock will go up or down on the next day, based on a classification model that we built using LSTMs from the Keras library.

Artificial Neural Networks (ANNs) have long been used to predict stock price index movement. An ANN has its own way to learn patterns: it emulates the functioning of our brain by creating a network of neurons. This study focuses on a modification of the ANN, the Long Short-Term Memory (LSTM) neural network, which is an improvement over plain RNNs. Here we use LSTMs (Hochreiter & Schmidhuber, 1997) in a classification approach, in order to predict whether the stock's price will be up or down on the next day.
We first passed our data through the features dimension of the LSTM cell, then through the timestep dimension. Another model we built made the LSTM network stateful, with memory carried between the batches of data we passed, so that it learns from every sequence instead of every epoch. Afterwards, we applied a dropout of 0.2 to the model that gave the best results. That was done in order to avoid overfitting of our algorithm to the training set, trying to make it generalize better to unseen data. All these models were built in order to fully capture the nature of our problem, trying to see which algorithm would fit best as a solution.
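To make the stateful variant more concrete, the following is a minimal Keras sketch of an LSTM that keeps its state across batches and applies a dropout of 0.2. The batch size, window length, number of features, layer width and the single linear output are illustrative placeholders, not the exact configuration of our experiments.

```python
from keras.models import Sequential
from keras.layers import LSTM, Dropout, Dense

# Placeholder dimensions: for a stateful LSTM the batch size must be fixed.
batch_size, timesteps, n_features = 32, 60, 5

model = Sequential()
# stateful=True carries the cell state across consecutive batches,
# so the network learns from every sequence instead of every epoch.
model.add(LSTM(50,
               batch_input_shape=(batch_size, timesteps, n_features),
               stateful=True))
# Dropout of 0.2 to reduce overfitting to the training set.
model.add(Dropout(0.2))
model.add(Dense(1))  # placeholder output head
model.compile(loss='mse', optimizer='adam')

# Because the state persists across batches, it has to be cleared by hand
# at the points where the sequences genuinely restart, e.g. once per epoch:
# model.reset_states()
```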
The way we evaluated our models was through a custom metric, named gain. While classic metrics, such as mean squared error and mean absolute error, can be used to evaluate how close our prediction is to the actual price of the stock, the problem itself suggests that we should check whether the model is gaining money or not. With that in mind, if we predict that the stock will move in the same direction as the actual move, we win the absolute value of the difference between the closing price and the opening price. In the same way, we lose money if we predict the opposite direction to the market's.
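A possible implementation of the gain metric, written here as a plain NumPy function rather than the exact code we used, is sketched below; open_prices, close_prices and predicted_direction are hypothetical array names.

```python
import numpy as np

def gain(open_prices, close_prices, predicted_direction):
    """Total money won or lost when betting on the predicted direction.

    predicted_direction holds 1 (price will go up) or 0 (price will go down).
    Each day we win |close - open| if the prediction matches the actual
    move, and lose the same amount otherwise.
    """
    actual_direction = (close_prices > open_prices).astype(int)
    daily_amount = np.abs(close_prices - open_prices)
    # +1 where the prediction was right, -1 where it was wrong.
    sign = np.where(predicted_direction == actual_direction, 1.0, -1.0)
    return np.sum(sign * daily_amount)
```

For instance, gain(np.array([10.0, 11.0]), np.array([11.0, 10.5]), np.array([1, 1])) evaluates to 1.0 - 0.5 = 0.5: the first prediction wins one unit, the second loses half a unit.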
Afterwards, we built stacked LSTM models, trying to take advantage of the memory the LSTM cells provide to a network. The architecture is composed of two layers, one with 50 neurons followed by one with 30. We followed the same principle as before while building our models.
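A minimal Keras sketch of this stacked architecture, with the 50-unit layer feeding the 30-unit layer, might look as follows; the input shape and the output head are placeholders rather than the exact values used.

```python
from keras.models import Sequential
from keras.layers import LSTM, Dense

timesteps, n_features = 60, 5  # placeholder window length and feature count

model = Sequential()
# The first LSTM layer must return the full sequence so that the
# second LSTM can consume it timestep by timestep.
model.add(LSTM(50, return_sequences=True, input_shape=(timesteps, n_features)))
model.add(LSTM(30))
model.add(Dense(1))  # placeholder output head
model.compile(loss='mse', optimizer='adam')
```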
We also applied Principal Component Analysis (PCA) to our features (Jolliffe & Cadima, 2016). PCA is a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components. This transformation is defined in such a way that the first principal component has the largest possible variance (that is, it accounts for as much of the variability in the data as possible), and each succeeding component in turn has the highest variance possible under the constraint that it is orthogonal to the preceding components. The resulting vectors (each being a linear combination of the variables and containing n observations) form an uncorrelated orthogonal basis set. From this analysis we ended up with 5 variables, which capture most of the information in our data, and we followed the same procedure as before to build the new models.
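As an illustration, reducing a feature matrix to 5 principal components could be done with scikit-learn as sketched below; the report does not name the tooling used for this step, so the library choice and the randomly generated feature matrix are assumptions.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Hypothetical feature matrix: 1000 trading days x 12 correlated indicators.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 12))

# Standardise first so that no single feature dominates the variance.
X_std = StandardScaler().fit_transform(X)

# Keep the 5 components that explain the most variance.
pca = PCA(n_components=5)
X_reduced = pca.fit_transform(X_std)

print(X_reduced.shape)                # (1000, 5)
print(pca.explained_variance_ratio_)  # variance captured by each component
```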
However, in the results of this process it can be noticed that, even though it was not possible to predict the stock price itself, the predictions correctly followed the direction in which the price was moving. For this reason, we transformed the whole problem into a new one. Instead of predicting the price directly, we now try to predict the direction in which the price is heading, making it a classification problem. We transform our data to binary labels: 0 if the price fell that day, 1 if the price went up. That way, the information our models need to process is much simpler and therefore much easier to learn.
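A small sketch of that relabelling step, assuming a pandas DataFrame with hypothetical 'Open' and 'Close' columns, could look like this.

```python
import pandas as pd

# Hypothetical daily prices.
df = pd.DataFrame({
    'Open':  [10.0, 11.0, 10.5, 10.8],
    'Close': [11.0, 10.5, 10.8, 10.8],
})

# 1 if the price went up that day, 0 if it fell (or stayed flat).
df['Direction'] = (df['Close'] > df['Open']).astype(int)
print(df)
```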
References
Dey, Shubharthi, Kumar, Yash, Saha, Snehanshu, and Basak, Suryoday. Forecasting to classification: Predicting the direction of stock market price using xtreme gradient boosting, 10 2016.
Hochreiter, Sepp and Schmidhuber, Jürgen. Long short-term memory. Neural Computation, 9(8):1735–1780, 1997.
Jolliffe, Ian T. and Cadima, Jorge. Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 04 2016.
Bao, Wei, Yue, Jun, and Rao, Yulei. A deep learning framework for financial time series using stacked autoencoders and long-short term memory. PLOS ONE, 12(7):1–24, 07 2017.