
A Survey of Deep Learning Techniques Applied to Trading

Published on July 31, 2016 by Greg Harris

http://gregharris.info/a-survey-of-deep-learning-techniques-applied-to-trading/

Deep learning has been getting a lot of attention lately with
breakthroughs in image classification and speech recognition.
However, its application to finance doesn’t yet seem to be
commonplace. This survey covers what I’ve found so far that is
relevant to systematic trading. Please tell me if you know of some
research I’ve missed.

Acronyms:

DBN = Deep Belief Network

LSTM = Long Short-Term Memory

MLP = Multi-layer Perceptron

RBM = Restricted Boltzmann Machine

ReLU = Rectified Linear Units

CNN = Convolutional Neural Network

Limit Order Book Modeling

Sirignano (2016) predicts changes in limit order books. He has
developed a “spatial neural network” that takes advantage of local
spatial structure and is more interpretable and computationally
efficient than a standard neural network for this
purpose. He models the joint distribution of the best bid and ask at
the time of the next state change. Also, he models the joint
distribution of the best bid and ask prices upon the change in either of
them.

Architecture – Each neural network has 4 layers. The standard
neural network has 250 neurons per hidden layer, and the spatial
neural network has 50. He uses the tanh activation function on the
hidden layer neurons.

Training – He trained and tested on order books from 489 stocks
from 2014 to 2015 (a separate model for each stock). He uses Level III
limit order book data from the NASDAQ with event times having
nanosecond decimal precision. Training involved 50TB of data and
used a cluster with 50 GPUs. He includes 200 features: the price and
size of the limit order book across the first 50 non-zero bid and ask
levels. He uses dropout to prevent overfitting. He uses batch
normalization between each hidden layer to prevent internal
covariate shift. Training is done with the RMSProp algorithm.
RMSProp is similar to stochastic gradient descent with momentum
but it normalizes the gradient by a running average of the past
gradients. He uses an adaptive learning rate where the learning rate
is decreased by a constant factor whenever the training error
increases over a training epoch. He uses early stopping imposed via a
validation set to reduce overfitting. He also includes an l^2 penalty
when training in order to reduce overfitting.
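
As a rough illustration, here is a minimal sketch of the standard (non-spatial) network as described, written against the Keras API. The layer sizes, tanh activations, dropout, batch normalization, RMSProp, early stopping, and L2 penalty come from the description above; the dropout rate, regularization strength, learning-rate schedule details, and the simplified two-class output head are my own assumptions.

```python
# Sketch of the standard 4-layer network described above (not Sirignano's code).
# Assumed: dropout rate, L2 strength, LR-reduction settings, and the output head.
from tensorflow import keras
from tensorflow.keras import layers, regularizers

n_features = 200  # price and size at the first 50 non-zero bid and ask levels

model = keras.Sequential([layers.Input(shape=(n_features,))])
for _ in range(4):  # hidden layers of 250 tanh neurons each
    model.add(layers.Dense(250, activation="tanh",
                           kernel_regularizer=regularizers.l2(1e-4)))
    model.add(layers.BatchNormalization())  # against internal covariate shift
    model.add(layers.Dropout(0.2))          # dropout to prevent overfitting
model.add(layers.Dense(2, activation="softmax"))  # simplified two-class output head

model.compile(optimizer=keras.optimizers.RMSprop(learning_rate=1e-3),
              loss="categorical_crossentropy")

callbacks = [
    keras.callbacks.EarlyStopping(monitor="val_loss", patience=5),
    # stand-in for "cut the learning rate when the training error worsens"
    keras.callbacks.ReduceLROnPlateau(monitor="loss", factor=0.5, patience=1),
]
# model.fit(X_train, y_train, validation_data=(X_val, y_val), callbacks=callbacks)
```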

Results – He shows that limit order books exhibit some degree of
local spatial structure. He predicts the order book 1 second ahead
and also at the time of the next bid/ask change. The spatial neural
network outperforms the standard neural network and logistic
regression with non-linear features. Both neural networks have 10%
lower error than logistic regression.

Price-based Classification Models

Dixon et al. (2016) use a deep neural network to predict the sign of
the price change over the next 5 minutes for 43 commodity and forex
futures.

Architecture – Their input layer has 9,896 neurons for input features
made up of lagged price differences and co-movements between
contracts. There are 5 learned fully-connected layers. The first of the
four hidden layers contains 1,000 neurons, and each subsequent
layer tapers by 100 neurons. The output layer has 135 neurons (3 for
each class {-1, 0, 1} times 43 contracts).
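
A minimal Keras-style sketch of this tapering architecture, assuming ReLU hidden activations and a separate 3-way softmax per contract over the 135 output units (the text only specifies the layer sizes):

```python
# Sketch of the tapering network (layer sizes from the text; ReLU activations
# and the per-contract softmax grouping are assumptions).
from tensorflow import keras
from tensorflow.keras import layers

n_inputs, n_contracts, n_classes = 9896, 43, 3

x_in = keras.Input(shape=(n_inputs,))
x = x_in
for width in (1000, 900, 800, 700):  # four hidden layers tapering by 100 neurons
    x = layers.Dense(width, activation="relu")(x)
x = layers.Dense(n_contracts * n_classes)(x)   # 135 output neurons
x = layers.Reshape((n_contracts, n_classes))(x)
out = layers.Softmax(axis=-1)(x)               # {-1, 0, 1} probabilities per contract

model = keras.Model(x_in, out)
model.compile(optimizer="sgd", loss="categorical_crossentropy")
```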

Training – They used the standard back-propagation with stochastic
gradient descent. They speed up training by using mini-batching
(computing the gradient on several training examples at once rather
than individual examples). Rather than an nVidia GPU, they used an
Intel Xeon Phi co-processor.

Results – They report 42% accuracy, overall, for three-class
classification. They do some walk-forward training instead of a
traditional backtest. Their boxplot shows some generally positive
Sharpe ratios from the mini-backtests for each contract. They did not
include transaction costs or crossing the bid-ask spread. All their
predictions and features were based on the mid-price at the end of
each 5-minute time period.
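
Walk-forward evaluation of this kind can be sketched as repeatedly re-fitting on a rolling training window and scoring the next block of data; the window lengths and re-fit settings below are placeholders, not values from the paper.

```python
# Generic walk-forward loop (illustrative only; window sizes are placeholders).
def walk_forward(X, y, make_model, train_len=25_000, test_len=5_000):
    """Re-fit on each rolling training window, then score the following block."""
    scores = []
    start = 0
    while start + train_len + test_len <= len(X):
        tr = slice(start, start + train_len)
        te = slice(start + train_len, start + train_len + test_len)
        model = make_model()                       # fresh, compiled model per window
        model.fit(X[tr], y[tr], epochs=5, verbose=0)
        scores.append(model.evaluate(X[te], y[te], verbose=0))
        start += test_len                          # slide the window forward
    return scores
```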

Takeuchi and Lee (2013) look to enhance the momentum effect by
predicting which stocks will have higher or lower monthly returns
than the median.

Architecture – They use an auto-encoder composed of stacked
RBMs to extract features from stock prices which they then pass to a
feed-forward neural network classifier. Each RBM consists of one
layer of visible units and one layer of hidden units connected by
symmetric links. The first layer has 33 units for input features from
one stock at a time. For every month t, the features include the 12
monthly returns for month t-2 through t-13 and the 20 daily returns
approximately corresponding to month t. They normalize each of the
return features by calculating the z-score relative to the cross-section
of all stocks for each month or day. The number of hidden units in the
final layer of the encoder is sharply reduced, forcing dimensionality
reduction. The output layer has 2 units, corresponding to whether the
stock ended up above or below the median return for the month. The
layer sizes of the full network are 33-40-4-50-2.
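
The cross-sectional z-scoring of the return features can be sketched in a few lines of pandas; the DataFrame layout and column names are assumptions.

```python
# Cross-sectional z-score of return features (assumed layout: one row per
# stock-month, with return columns named ret_*).
import pandas as pd

def zscore_cross_section(df: pd.DataFrame, period_col: str = "month") -> pd.DataFrame:
    """Normalize each feature relative to all stocks observed in the same period."""
    feats = [c for c in df.columns if c.startswith("ret_")]
    grouped = df.groupby(period_col)[feats]
    return (df[feats] - grouped.transform("mean")) / grouped.transform("std")
```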

Training – During pre-training, they split the dataset into smaller,
non-overlapping mini-batches. Afterwards, they un-roll the RBMs to
form an encoder-decoder, which is fine-tuned using back-propagation.
They consider all stocks trading on the NYSE, AMEX, or NASDAQ with
a price greater than $5. They train on data from 1965 to 1989
(848,000 stock-month samples) and test on data from 1990 to 2009
(924,300 stock-month samples). Some of the training data is held out as a
validation set for choosing the number of layers and the number of units per layer.

Results – Their overall accuracy is around 53%. When they consider
the difference between the top decile and the bottom decile
predictions, they get 3.35% per month, or 45.93% annualized return.

Batres-Estrada (2015) predicts which S&P 500 stocks will have
above-median returns for each given day, and his work appears to be
heavily influenced by Takeuchi and Lee (2013).

Architecture – He uses a 3-layer DBN coupled to an MLP. He uses
400 neurons in each hidden layer, and he uses a sigmoid activation
function. The output layer is a softmax layer with two output neurons
for binary classification (above median or below). The DBN is
composed of stacked RBMs, each trained sequentially.

Training – He first pre-trains the DBN module, then fine-tunes the
entire DBN-MLP using back-propagation. The input includes 33
features: monthly log-returns for months t-2 to t-13, 20 daily
log-returns for each stock at month t, and an indicator variable for the
January effect. The features are normalized using the Z-score for each
time period. He uses S&P 500 constituent data from 1985 to 2006
with a 70-15-15 split for training-validation-test. He uses the
validation data to choose the number of layers, the number of
neurons, and the regularization parameters. He uses early-stopping
to prevent over-fitting.

Results – His model has 53% accuracy, which outperforms
regularized logistic regression and a few MLP baselines.

Sharang and Rao (2015) use a DBN trained on technical indicators to
trade a portfolio of US Treasury note futures.

Architecture – They use a DBN consisting of 2 stacked RBMs. The
first RBM is Gaussian-Bernoulli (15 nodes), and the second RBM is
Bernoulli (20 nodes). The DBN produces latent features which they try
feeding into three different classifiers: regularized logistic regression,
support vector machines, and a neural network with 2 hidden layers.
The classifiers predict +1 if the portfolio goes up over the next 5 days, and -1 otherwise.

Training – They train the DBN using a contrastive divergence
algorithm. They calculate signals based on open, high, low, close,
open interest, and volume data, beginning in 1985, with some points
removed during the 2008 financial crisis. They use 20 features: the
“daily trend” calculated over different time frames, and then
normalized. All parameters are chosen using a validation dataset.
When training the neural net classifier, they mention using a
momentum parameter during mini-batch gradient descent training to
shrink the coefficients by half during every update.

Results – The portfolio is constructed using PCA to be neutral to the
first principal component. The portfolio is an artificial spread of
instruments, so actually trading it is done with a spread between the
ZF and ZN contracts. All input prices are mid-prices, meaning the
bid-ask spread is ignored. The results look profitable, with all three
classification models performing 5-10% more accurately than a
random predictor.
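
One way to read “neutral to the first principal component” is to pick portfolio weights orthogonal to the first principal component of the instruments' returns; the scikit-learn sketch below illustrates that idea and is not the authors' exact construction.

```python
# Illustrative first-PC-neutral weights for a small futures basket
# (not the authors' exact construction).
import numpy as np
from sklearn.decomposition import PCA

def pc1_neutral_weights(returns: np.ndarray) -> np.ndarray:
    """returns: (n_days, n_instruments) matrix of daily returns."""
    pc1 = PCA(n_components=1).fit(returns).components_[0]  # first principal component
    w = np.ones(returns.shape[1])
    w -= (w @ pc1) / (pc1 @ pc1) * pc1   # project out the PC1 direction
    return w / np.abs(w).sum()           # normalize gross exposure to 1
```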

Zhu et al. (2014) make trade decisions using oscillation box theory
based on DBNs. Oscillation box theory says that a stock price will
oscillate within a certain range in a period of time. If the price moves
outside the range, then it enters into a new box. The authors try to
predict the boundaries of the box. Their trading strategy is to buy the
stock when it breaks through the top boundary or sell it when it
breaks through the bottom boundary.
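
The trading rule itself is easy to state in code; the upper and lower boundaries below stand in for the DBN's predicted box edges.

```python
# Oscillation-box rule as described above; thresholds and position sizing are
# not specified, so the function only emits the direction of the breakout.
def box_signal(price: float, upper: float, lower: float) -> int:
    """+1 (buy) above the predicted box, -1 (sell) below it, 0 inside it."""
    if price > upper:
        return 1
    if price < lower:
        return -1
    return 0
```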

Architecture – They use a DBN made up of stacked RBMs and a final
back-propagation layer.

Training – They used block Gibbs sampling to greedily train each
layer from lowest to highest in an unsupervised way. They then train
the back-propagation layer in a supervised way, which fine-tunes the
whole model. They chose 400 stocks out of the S&P 500 for testing,
and the test set covers 400 days from 2004 to 2005. They use open,
high, low, close prices as well as technical analysis indicators, for a
total of 14 model inputs. Some indicators are given more influence in
the prediction through the use of “gray relation analysis” or “gray
correlation degree.”

Results – In their trading strategy, they charge 0.5% transaction
costs per trade and add a couple of parameters for stop-loss and
“transaction rate.” I don’t fully understand the result tables, but they
seem to be reporting significant profits.

Text-based Classification Models

Rönnqvist and Sarlin (2016) predict bank distress using news articles.
Specifically, they create a classifier to judge whether a given
sentence indicates distress or tranquility.

Architecture – They use two neural networks in this paper. The first
is for semantic pre-training to reduce dimensionality. For this, they
run a sliding window over text, taking a sequence of 5 words and
learning to predict the next word. They use a feed-forward topology
where a projection layer in the middle provides the semantic vectors
once the connection weights have been learned. They also include
the sentence ID as an input to the model, to provide context and
inform the prediction of the next word. They use binary Huffman
coding to map sentence IDs and words to activation patterns in the
input layer, which organizes the words roughly by frequency. They
say feed-forward topologies with fixed context sizes are more
efficient than recurrent neural networks for modeling text sequences.
The second neural network is for classification. Instead of a million
inputs (one for each word), they use 600 inputs from the learned
semantic model. The first layer has 600 nodes, the middle layer has
50 rectified linear hidden nodes, and the output layer has 2 nodes
(distress/tranquil).
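
The classification network is small enough to sketch directly in Keras; the 600-50-2 shape comes from the text, while the optimizer and loss are assumptions.

```python
# Sketch of the distress/tranquil classifier (shape from the text; optimizer
# and loss are assumptions).
from tensorflow import keras
from tensorflow.keras import layers

clf = keras.Sequential([
    layers.Input(shape=(600,)),             # semantic vector from the pre-training step
    layers.Dense(50, activation="relu"),    # 50 rectified linear hidden nodes
    layers.Dense(2, activation="softmax"),  # distress vs. tranquil
])
clf.compile(optimizer="adam", loss="categorical_crossentropy")
```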

Training – They train it with 243 distress events over 101 banks
observed during the financial crisis of 2007-2009. They use 716k
sentences mentioning the banks, taken from 6.6m Reuters news
articles published during and after the crisis.

Results – They evaluate their classification model using a custom
“Usefulness” measure. The evaluation is done using cross-validation,
leaving N banks out in each fold. They aggregate the distress counts
into various timeseries but don’t go so far as to consider creating a
trading strategy.

Fehrer and Feuerriegel (2015) train a model to predict German stock
returns based on headlines.

Architecture – They use a recursive autoencoder with an additional
softmax layer in each autoencoder for estimating probabilities. They
perform three-class prediction {-1, 0, 1} for the following day’s return
of the stock associated with the headline.

Training – They initialize the weights with Gaussian noise, and then
update through back-propagation. They use an English ad-hoc news
announcement dataset (8,359 headlines) for the German market
covering 2004 to 2011.

Results – Their recursive autoencoder has 56% accuracy, which is
an improvement over a more traditional random forest modeling
approach with 53% accuracy. They do not develop a trading strategy.
They have made a Java implementation of their code publicly
available.

Ding et al. (2015) use structured information extracted from
headlines to predict daily S&P 500 moves. Headlines are processed
with Open IE to obtain structured event representations (actor, action,
object, time). A neural tensor network learns the semantic
compositionality over event arguments by combining them
multiplicatively instead of only implicitly, as with standard neural
networks.

Architecture – They combine short-term and long-term effects of
events, using a CNN to perform semantic composition over the input
event sequence. They use a max pooling layer on top of the
convolutional layer, which makes the network retain only the most
useful features produced by the convolutional layer. They have
separate convolutional layers for long-term events and mid-term
events. Both of these layers, along with an input layer for short-term
events, feed into a hidden layer which then feeds into two output
nodes.

Training – They extracted 10 million events from Reuters and
Bloomberg news. For training, they corrupt events by replacing one
event argument with a random argument. During training, they
assume that the actual event should be given a higher score than the
corrupted event. When it isn’t, model parameters get updated.
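
This corruption-based training signal amounts to a max-margin ranking loss; a minimal version, assuming a margin of 1, looks like this:

```python
# Margin-based ranking loss matching the description: the model is penalized only
# when the corrupted event is not scored at least `margin` below the real event.
def ranking_loss(score_real: float, score_corrupt: float, margin: float = 1.0) -> float:
    return max(0.0, margin - score_real + score_corrupt)
```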

Results – They find that structured events are better features than
words for stock market prediction. Their approach outperforms
baseline methods by 6%. They make predictions for the S&P 500
index and 15 individual stocks, and a table appears to show that they
can predict the S&P 500 with 65% accuracy.

Volatility Prediction

Xiong et al. (2015) predict the daily volatility of the S&P 500, as
estimated from open, high, low, close prices.

Architecture – They use a single LSTM hidden layer consisting of
one LSTM block. For inputs they use daily S&P 500 returns and
volatilities. They also include 25 domestic Google trends, covering
sectors and major areas of the economy.

Training – They used the “Adam” method with 32 samples per batch
and mean absolute percent error (MAPE) as the objective loss
function. They set the maximum lag of the LSTM to include 10
successive observations.
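
A minimal Keras sketch of this setup, treating the single LSTM “block” as a one-unit LSTM layer (the unit count and output activation are assumptions):

```python
# Sketch of the volatility model (10 lags and 27 inputs from the text; the
# one-unit LSTM and softplus output are assumptions).
from tensorflow import keras
from tensorflow.keras import layers

n_steps, n_inputs = 10, 27  # 10 lags; return + volatility + 25 Google trends

model = keras.Sequential([
    layers.Input(shape=(n_steps, n_inputs)),
    layers.LSTM(1),                          # the single LSTM block
    layers.Dense(1, activation="softplus"),  # predicted volatility (non-negative)
])
model.compile(optimizer="adam", loss="mean_absolute_percentage_error")
# model.fit(X, y, batch_size=32, ...)        # 32 samples per batch, as in the text
```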

Results – They show their LSTM method outperforms GARCH, Ridge,
and LASSO techniques.

Portfolio Optimization

Heaton et al. (2016) attempt to create a portfolio that outperforms
the biotech index IBB. They have the goal of tracking the index with
few stocks and low validation error. They also try to beat the index by
being anti-correlated during periods of large drawdowns. They don’t
model the covariance matrix directly; rather, it is learned implicitly through
the deep architecture fitting procedure, which allows for nonlinearities.

Architecture – They use auto-encoding with regularization and
ReLUs. Their auto-encoder has one hidden layer with 5 neurons.

Training – They use weekly return data for the component stocks of
IBB from 2012 to 2016. They auto-encode all stocks in the index and
evaluate the difference between each stock and its auto-encoded
version. They keep the 10 most “communal” stocks that are most
similar to the auto-encoded version. They also keep a varying number
of other stocks, where the number is chosen with cross-validation.
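
The “communal” stock selection can be sketched as: autoencode the weekly-return matrix, then rank stocks by how closely the reconstruction matches the original. The 5-neuron hidden layer is from the text; the loss, training length, and error metric below are assumptions.

```python
# Sketch of picking the most "communal" stocks with a small autoencoder
# (5-neuron hidden layer from the text; everything else is assumed).
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

def most_communal(returns, tickers, keep=10):
    """returns: (n_weeks, n_stocks) matrix of weekly returns for the index members."""
    n_stocks = returns.shape[1]
    ae = keras.Sequential([
        layers.Input(shape=(n_stocks,)),
        layers.Dense(5, activation="relu"),   # single 5-neuron hidden layer
        layers.Dense(n_stocks),
    ])
    ae.compile(optimizer="adam", loss="mse")
    ae.fit(returns, returns, epochs=200, verbose=0)
    err = np.abs(returns - ae.predict(returns, verbose=0)).mean(axis=0)
    order = np.argsort(err)                   # smallest error = most communal
    return [tickers[i] for i in order[:keep]]
```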

Results – They show the tracking error as a function of the number of
stocks included in the portfolio, but don’t seem to compare against
traditional methods. They also replace index drawdowns with positive
returns and find portfolios that track this modified index.

Related Links

Stock Market Forecasting using deep learning? (reddit.com)

deep learning in finance (reddit.com)

Leveraging Google DeepMind software and Deep Learning to play the stock market (reddit.com)

They might be Robots, Using data for investment management and trading (youtube.com)

Introducing Binatix: A Deep Learning Trading Firm That’s Already Profitable (recode.net)

Will AI-Powered Hedge Funds Outsmart the Market? (technologyreview.com)

Adaptive deep learning empowers traders (automatedtrader.net)

Deep Learning in Finance (slideshare.net)

Prediction of Exchange Rate Using Deep Neural Network (slideshare.net)

References

Batres-Estrada, B. (2015). Deep learning for multivariate financial time series. abstract

Ding, X., Zhang, Y., Liu, T., & Duan, J. (2015, June). Deep learning for event-driven stock prediction. In Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI) (pp. 2327-2333). abstract

Dixon, M. F., Klabjan, D., & Bang, J. H. (2016). Classification-based Financial Markets Prediction using Deep Neural Networks. Available at SSRN 2756331. abstract

Fehrer, R., & Feuerriegel, S. (2015). Improving Decision Analytics with Deep Learning: The Case of Financial Disclosures. arXiv preprint arXiv:1508.01993. abstract

Heaton, J. B., Polson, N. G., & Witte, J. H. (2016). Deep Portfolio Theory. arXiv preprint arXiv:1605.07230. abstract

Rönnqvist, S., & Sarlin, P. (2016). Bank distress in the news: Describing events through deep learning. arXiv preprint arXiv:1603.05670. abstract

Sharang, A., & Rao, C. (2015). Using machine learning for medium frequency derivative portfolio trading. arXiv preprint arXiv:1512.06228. abstract

Sirignano, J. A. (2016). Deep Learning for Limit Order Books. arXiv preprint arXiv:1601.01987. abstract

Takeuchi, L., & Lee, Y. (2013). Applying Deep Learning to Enhance Momentum Trading Strategies in Stocks. abstract

Xiong, R., Nichols, E. P., & Shen, Y. (2015). Deep Learning Stock Volatility with Google Domestic Trends. arXiv preprint arXiv:1512.04916. abstract

Zhu, C., Yin, J., & Li, Q. (2014). A stock decision support system based on DBNs. Journal of Computational Information Systems, 10(2), 883-893. abstract
