Supervised Autoencoder MLP

The document proposes using machine learning techniques like supervised autoencoders and noise augmentation to improve financial time series forecasting and trading strategy performance. It evaluates these techniques on S&P500, EUR/USD, and BTC/USD data. Key research questions are whether noise augmentation and a novel triple barrier labeling technique can improve performance as measured by information ratio. Feature engineering techniques like fractional differentiation of time series are also considered. The aim is to determine whether these machine learning approaches can outperform traditional methods for algorithmic trading.

Supervised Autoencoder MLP for Financial Time Series Forecasting

Bartosz Bieganowski, Robert Ślepaczuk

University of Warsaw, Faculty of Economic Sciences


Department of Quantitative Finance and Machine Learning, Quantitative Finance Research Group

March 4, 2024

Table of Contents

1 Problem Statement & Data

2 Feature Engineering

3 Novelty 1 - Triple Barrier Labeling & Optimal Metric

4 Novelty 2 - Supervised Autoencoder & Noise Augmentation

5 Approach Comparison & Results

6 Sensitivity Analysis

7 Conclusion

Aim

The aim of this study is to verify whether the following machine-learning-related techniques
can improve trading strategy performance:
Noise augmentation - originally developed for Computer Vision problems, where it has been
observed that adding noise to the input data helps generalization on image classification tasks.
Supervised Autoencoder - originally developed for Natural Language Processing problems;
we test whether the SAE-MLP architecture can be applied in algorithmic trading strategies.
Triple Barrier Labeling - although already present in the literature, we expand on this
labeling method by developing an optimization metric that better reflects the strategy return.

Research Questions

RQ1: Does noise augmentation used with the SAE-MLP architecture improve
strategy performance as expressed by the Information Ratio?

RQ2: Does triple barrier labeling with the correct optimization metric improve
strategy performance as expressed by the Information Ratio?

RQ3: Does hyperparameter tuning improve strategy performance with
SAE-MLP architectures as expressed by the Information Ratio?

Literature Review

Efficient Market Hypothesis (EMH) suggests stock prices reflect all available information,
making them unpredictable.
Studies by Fama (1970) and Malkiel (2005) support EMH, while Barberis and Thaler
(2002) suggest market inefficiencies.
Machine learning (ML) techniques like LSTM outperform traditional methods in stock
price prediction (Kryńska and Ślepaczuk, 2022).
LSTM models show promise in forecasting, but challenges remain in handling
non-stationary data and parameter sensitivity.
Hybrid models combining LSTM and GRU demonstrate improved performance in
forecasting financial assets (Baranochnikov and Ślepaczuk, 2022).
ML models, particularly deep learning, excel in predicting Bitcoin prices, indicating their
relevance in cryptocurrency trading (Michanków et al., 2022).

Data

S&P500 - Low volatility compared to individual stocks, correlated with economic growth
indicators, right-skewed return distribution.

EUR/USD - Moderate volatility, driven by the monetary policy of the EU and the Fed and by
indicators from both regions; returns close to normal with leptokurtosis.

BTC/USD - High volatility, driven by speculation and technological developments. Low
correlation with traditional financial assets. Returns skewed and highly leptokurtic.

Training timeframe: 2010-01-01 - 2019-12-31

Testing timeframe: 2020-01-01 - 2022-04-30

Toolset

Hardware: GeForce RTX 2080 SUPER, Intel Core i7-9700K, Patriot 32GB RAM

Software: Python 3.10, Tensorflow, Pandas, Matplotlib, Scikit-learn

Computation Time: on average 4 minutes per hyperparameter combination

Features - ICSA, Oil, Gas

Features - Corn, Gold, Copper

Features - Presumed Impact on Economy

Feature | Presumed Increase Impact | Presumed Decrease Impact
ICSA    | Negative: indicates rising unemployment, potential economic slowdown | Positive: suggests decreasing unemployment, potential economic growth
Oil     | Mixed: benefits oil exporters, increases costs for importers and consumers | Mixed: lowers costs for importers and consumers, but may harm oil-exporting economies
Gas     | Negative: increases energy costs, affects consumer spending and production costs | Positive: decreases energy costs, boosts consumer spending and lowers production costs
Corn    | Negative: raises food and feed prices, impacts food industry and inflation | Positive: lowers food and feed prices, beneficial for food industry and inflation control
Gold    | Mixed: often seen as a safe haven, an increase may indicate economic uncertainty | Mixed: a decrease may reflect investor confidence, but could impact gold-producing economies
Copper  | Positive: suggests industrial growth and demand, often a positive economic indicator | Negative: may indicate reduced industrial activity and economic slowdown

Feature Engineering Question

What do we do with our features before we put them into the machine learning model?

Should we differentiate the time series (d = 1, losing the memory aspect)?

Should we input them as-is (d = 0, even though the data is not stationary)?

Fractionally differentiated features

We can apply ARFIMA (Granger and Joyeux, 1980) assumptions to machine learning features. We
consider the backshift operator $B$ applied to a time series of a feature $\{X_t\}$ such that
$B^k X_t = X_{t-k}$.

It follows that the difference between the current and the previous feature value can be expressed as
$(1 - B)X_t$. For example, $(1 - B)^2 = 1 - 2B + B^2$, where $B^2 X_t = X_{t-2}$, so that
$(1 - B)^2 X_t = X_t - 2X_{t-1} + X_{t-2}$.

For any positive integer $n$, it also holds that:

$$(x + y)^n = \sum_{k=0}^{n} \binom{n}{k} x^k y^{n-k} = \sum_{k=0}^{n} \binom{n}{k} x^{n-k} y^k \tag{1}$$

Fractionally differentiated features

On the other hand, for any real number $d$:

$$(1 + x)^d = \sum_{k=0}^{\infty} \binom{d}{k} x^k \tag{2}$$

is the binomial series. In a model where $d$ is allowed to be a real number, the binomial series
can be expanded into a series of weights which can be applied to feature values:

$$\omega = \left\{ 1,\ -d,\ \frac{d(d-1)}{2!},\ -\frac{d(d-1)(d-2)}{3!},\ \ldots,\ (-1)^k \frac{1}{k!} \prod_{i=0}^{k-1} (d - i) \right\} \tag{3}$$
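As an illustration, here is a minimal Python sketch of how the weights in Eq. (3) can be generated recursively and applied to a feature series; the truncation threshold and the function names are our own choices for the sketch, not the paper's implementation.

import numpy as np
import pandas as pd

def fracdiff_weights(d, threshold=1e-4, max_terms=1000):
    """Weights from Eq. (3): w_0 = 1, w_k = -w_{k-1} * (d - k + 1) / k.
    Truncated once |w_k| falls below `threshold` (illustrative choice)."""
    w = [1.0]
    for k in range(1, max_terms):
        w_k = -w[-1] * (d - k + 1) / k
        if abs(w_k) < threshold:
            break
        w.append(w_k)
    return np.array(w)

def fracdiff(series: pd.Series, d: float, threshold=1e-4) -> pd.Series:
    """Apply the fractional-differencing weights to a feature series."""
    w = fracdiff_weights(d, threshold)
    width = len(w)
    values = series.to_numpy(dtype=float)
    out = np.full(len(values), np.nan)
    for t in range(width - 1, len(values)):
        # newest observation gets weight w_0, older ones w_1, w_2, ...
        window = values[t - width + 1 : t + 1][::-1]
        out[t] = np.dot(w, window)
    return pd.Series(out, index=series.index)

The recursion w_k = -w_{k-1}(d - k + 1)/k reproduces the terms of Eq. (3) without computing factorials explicitly.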

Optimal Differentiation Order

Optimal Differentiation Order

Algorithm 1 Fractional Feature Differentiation in Walk-Forward Validation


1: Set a range of possible values for d (e.g., from 0 to 1)
2: Set significance level for ADF test (e.g., 1%)
3: Initialize a dictionary associating each feature with its optimal d.
4: for each segment pair (train, test) do
5: for each feature do
6: Apply fractional differencing to train segment of feature at discrete intervals
7: Calculate ADF test statistic and p-value for each d for a feature
8: Choose the lowest d such that p-value < significance level.
9: Save feature name and associated optimal d to dictionary
10: Apply optimal d differencing to both train and test set of the feature
11: end for
12: Train the model on train segment, evaluate on test set
13: end for
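A sketch of the per-feature step of Algorithm 1 for a single train/test pair, assuming the fracdiff helper from the previous sketch and the ADF test from statsmodels; the grid of d values and the variable names are illustrative.

import numpy as np
from statsmodels.tsa.stattools import adfuller

def optimal_d(train_feature, d_grid=np.arange(0.0, 1.05, 0.05), alpha=0.01):
    """Return the lowest d on the grid whose fractionally differenced
    series passes the ADF stationarity test at significance `alpha`."""
    for d in d_grid:
        diffed = fracdiff(train_feature, d).dropna()
        p_value = adfuller(diffed, autolag="AIC")[1]
        if p_value < alpha:
            return d
    return d_grid[-1]  # fall back to the largest d if none passes

# inside the walk-forward loop (one train/test segment pair):
# optimal_ds = {}
# for name in features.columns:
#     d = optimal_d(train[name])
#     optimal_ds[name] = d
#     train[name] = fracdiff(train[name], d)
#     test[name] = fracdiff(test[name], d)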

Triple Barrier Labeling

Whenever we try to express a trading problem as a machine learning problem, we have to think
long and hard about what we really want our model to predict (Y).

Regression on the price in x time? (non-stationary, ignores the path, uninformative error metrics)

Regression on the return over x time? (maybe stationary, ignores the path, no directional
sensitivity unless using custom loss metrics like MADL)

Classification on movement direction? (better, but still ignores the path, high noise-to-signal ratio)

Triple Barrier Labeling

Path-dependent classification, which is effectively an ML interpretation of the concepts of
stop-loss, take-profit, and timed exit.

Figure: Exemplary labels in triple barrier labeling

Triple Barrier Labeling


1,
 if max(St , ..., St+n ) ≥ St · (1 + λ)
Pt = −1, if min(St , ..., St+n ) ≤ St · (1 − λ) (4)

0, otherwise

λ - window size in (%)

(Idea: λ was a constant for this study, but it might work well to base it on an estimate of future volatility)
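A sketch of Eq. (4) applied to a price series with a fixed λ and an n-bar vertical barrier. Eq. (4) only checks whether each horizontal barrier is reached; the sketch additionally resolves the rare case where both barriers are hit inside the window by taking whichever is touched first. Function and parameter names are our own.

import numpy as np
import pandas as pd

def triple_barrier_labels(prices: pd.Series, lam: float, n: int) -> pd.Series:
    """Label each bar per Eq. (4): +1 if the upper barrier S_t*(1+lam) is
    reached within the next n bars, -1 if the lower barrier S_t*(1-lam)
    is reached, 0 if neither (timed exit). The last n bars stay unlabeled (0)."""
    labels = np.zeros(len(prices), dtype=int)
    values = prices.to_numpy(dtype=float)
    for t in range(len(values) - n):
        window = values[t : t + n + 1]
        upper_hits = np.where(window >= values[t] * (1 + lam))[0]
        lower_hits = np.where(window <= values[t] * (1 - lam))[0]
        # whichever barrier is touched first determines the label
        first_up = upper_hits[0] if upper_hits.size else np.inf
        first_down = lower_hits[0] if lower_hits.size else np.inf
        if first_up < first_down:
            labels[t] = 1
        elif first_down < first_up:
            labels[t] = -1
    return pd.Series(labels, index=prices.index)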

Payoff Table

Table 2. Return on a trade given classification result.

Pred \ True |  1  |    0    |  -1
 1          |  λ  | (−λ, λ) |  −λ
 0          |  0  |    0    |   0
-1          | −λ  | (−λ, λ) |   λ

Source: Own Elaboration

Derived Optimization Metric

We define the directly correct count as the number of times the model entered the correct position,
which resulted in a return of λ. We can similarly define the directly incorrect count as the number
of times the model entered an incorrect position:

$$DCC = \left| \{ (Y_{pred}, Y_{true}) \in S \mid Y_{pred} \neq 0 \text{ and } Y_{pred} = Y_{true} \} \right| \tag{5}$$

$$DIC = \left| \{ (Y_{pred}, Y_{true}) \in S \mid Y_{pred} \neq 0 \text{ and } Y_{pred} \neq Y_{true} \} \right| \tag{6}$$

where |S| denotes the cardinality of the set S.

Derived Optimization Metric

Basic optimization metric Φ:

$$\Phi = \prod_{1}^{DCC} (1 + \lambda) \cdot \prod_{1}^{DIC} (1 - \lambda) = (1 + \lambda)^{DCC} \cdot (1 - \lambda)^{DIC} \tag{7}$$

Optimization metric with δ dictating error preference strength:

$$\Phi_\delta = (1 + \lambda)^{DCC} \cdot (1 - \lambda)^{DIC} \cdot \left( 1 - \frac{\lambda}{\delta} \right)^{TEC} \tag{8}$$

where δ > λ and TEC is the timed-exit count. In our study, we set δ arbitrarily to 20, indicating that
twenty timed exits are considered as undesirable as one directly incorrect classification.

(Note: Accurate prediction of zeros could also be taken advantage of with an option butterfly)
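A sketch of the optimization metric in Eqs. (5)-(8). Eq. (6) counts every non-matching trade as DIC, so splitting timed exits (true label 0) out into TEC is our reading of Eq. (8); treat the bookkeeping below as an assumption rather than the paper's exact implementation.

import numpy as np

def phi_delta(y_pred, y_true, lam: float, delta: float = 20.0) -> float:
    """Optimization metric from Eq. (8).
    DCC: position taken and correct barrier hit  -> factor (1 + lam)
    DIC: position taken, opposite barrier hit    -> factor (1 - lam)
    TEC: position taken, timed exit (true == 0)  -> factor (1 - lam / delta)
    (Assumed split of the 'incorrect' cases; delta > lam is required.)"""
    y_pred = np.asarray(y_pred)
    y_true = np.asarray(y_true)
    traded = y_pred != 0
    dcc = np.sum(traded & (y_pred == y_true))
    dic = np.sum(traded & (y_true == -y_pred))
    tec = np.sum(traded & (y_true == 0))
    return (1 + lam) ** dcc * (1 - lam) ** dic * (1 - lam / delta) ** tec

# e.g. phi_delta(pred_labels, true_labels, lam=0.005, delta=20.0)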

Data Augmentation in CV

Can we apply the concept to financial time series?

Supervised Autoencoder

Supervised Autoencoder

Enhanced Feature Representation: Supervised autoencoders can learn more relevant and discriminative
features for the task at hand because they are trained to not only reconstruct the input data but also to
optimize for an additional task-specific loss (like classification or regression).

Regularization Effect: Incorporating the reconstruction objective alongside the task-specific objective
(like classification accuracy) can act as a form of regularization. This helps in preventing overfitting to the
training data by ensuring that the learned representations maintain information about the input data,
leading to more generalized models.

Efficiency in Data Use: By leveraging unlabeled data for the reconstruction part and labeled data for the
task-specific part, supervised autoencoders can make efficient use of datasets where obtaining labeled data
is expensive or time-consuming. This can be particularly beneficial in semi-supervised learning scenarios,
where the model can learn general features from a large pool of unlabeled data and fine-tune the
representations for the task with a smaller set of labeled examples.
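A minimal TensorFlow/Keras sketch of an SAE-MLP of the kind discussed above, combining Gaussian noise augmentation, a reconstruction head, and a supervised classification head; the layer sizes, noise level, and loss weights are illustrative placeholders rather than the paper's tuned hyperparameters.

from tensorflow.keras import layers, Model

def build_sae_mlp(n_features: int, n_classes: int = 3,
                  bottleneck: int = 16, noise_std: float = 0.05) -> Model:
    inputs = layers.Input(shape=(n_features,))
    noisy = layers.GaussianNoise(noise_std)(inputs)   # noise augmentation, active at train time only

    # encoder -> bottleneck
    x = layers.Dense(64, activation="relu")(noisy)
    encoded = layers.Dense(bottleneck, activation="relu")(x)

    # decoder head: reconstruct the (clean) inputs
    x = layers.Dense(64, activation="relu")(encoded)
    reconstruction = layers.Dense(n_features, name="reconstruction")(x)

    # supervised MLP head on the encoded representation
    x = layers.Dense(32, activation="relu")(encoded)
    label = layers.Dense(n_classes, activation="softmax", name="label")(x)

    model = Model(inputs, [reconstruction, label])
    model.compile(
        optimizer="adam",
        loss={"reconstruction": "mse", "label": "sparse_categorical_crossentropy"},
        loss_weights={"reconstruction": 1.0, "label": 1.0},
    )
    return model

# model.fit(X_train, {"reconstruction": X_train, "label": y_train}, ...)
# with triple-barrier labels {-1, 0, 1} mapped to {0, 1, 2} for the classifier head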

Approaches Comparison

Drawdown-adjusted information ratio

As our main metric we use the drawdown-adjusted information ratio, originally proposed by
Kość et al. (2019), which is a modification of the Information Ratio measure. It also takes into
account the sign of the portfolio's rate of return and the maximum drawdown:

$$IR^{**} = \frac{ARC^2 \cdot \text{sign}(ARC)}{ASD \cdot MDD} \tag{9}$$

ARC - Annualized Return Compounded
ASD - Annualized Standard Deviation
MDD - Maximum Drawdown
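For concreteness, a sketch of Eq. (9) computed from a series of per-bar strategy returns; the annualization factor depends on the bar length and the market's trading hours and is therefore left as an input here.

import numpy as np

def ir_star_star(returns, periods_per_year: int) -> float:
    """Drawdown-adjusted information ratio, Eq. (9)."""
    returns = np.asarray(returns, dtype=float)
    equity = np.cumprod(1.0 + returns)
    years = len(returns) / periods_per_year

    arc = equity[-1] ** (1.0 / years) - 1.0                      # Annualized Return Compounded
    asd = np.std(returns, ddof=1) * np.sqrt(periods_per_year)    # Annualized Standard Deviation
    mdd = np.max(1.0 - equity / np.maximum.accumulate(equity))   # Maximum Drawdown

    if asd == 0.0 or mdd == 0.0:
        return 0.0
    return arc ** 2 * np.sign(arc) / (asd * mdd)

# periods_per_year depends on the bar length, e.g. roughly 26 fifteen-minute
# bars per 6.5-hour equity session, more for FX and crypto.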

Results - Eq. Weight Portfolio of Strategies - IR**

Sensitivity Analysis - Triple Barrier Labelling
Y - window height λ; X - window length (minutes)

Sensitivity Analysis - Supervised Autoencoder
Y - Gaussian noise rate; X - bottleneck size (% of feature count)

Sensitivity Analysis - Supervised Autoencoder

Y - encoder hidden layer count; X - decoder hidden layer count

Research Question Findings

RQ1: Impact of Data Augmentation and Denoising
Data augmentation (Gaussian noise) and denoising (autoencoders) significantly improve
strategy performance.
Approach 3 excels over Approaches 1 & 2 in Information Ratio for all bar lengths.
Optimal noise level and autoencoder size are critical; relationship is non-linear, requiring
careful calibration.
RQ2: Efficacy of Triple Barrier Labelling
Triple barrier labelling surpasses simple direction classification, enhancing market noise
handling and optimization metrics.
Approach 4 outperforms others in 15 and 30-minute bars but falls short in high-frequency
(5-minute bars) trading scenarios.
RQ3: Role of Hyperparameter Tuning
Crucial for superior investment strategy performance; optimal results with specific noise
levels and autoencoder sizes.

Further elaboration ideas

Dynamic lambda - setting lambda (TBL window size) to be a dynamic estimate of future
volatility.

Dynamic window length - setting the window length dynamically based on an estimate of market activity.

Zero-classifications - Accurate predictions of the price staying the same can be taken advantage
of with options (theta decay).

Other architectures - More elaborate models than MLP can be stacked on top of the SAE (Random
Forest, AdaBoost, CatBoost).

Feature engineering - more elaborate feature engineering to see how the SAE reacts to a greater
number of features.

Conclusion

Thank you!

Q&A

