0% found this document useful (0 votes)

176 views29 pages

MS&E 448 Final Presentation High Frequency Algorithmic Trading

The students developed high-frequency trading strategies using machine learning models on order book data. They improved their models by changing to second-by-second data and different prediction labels. Testing on historical stock data showed promising results, with cumulative profits increasing over time. Further optimization of model hyperparameters, features, and risk management were identified as areas for continued improvement.

Uploaded by

akion xc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

176 views29 pages

MS&E 448 Final Presentation High Frequency Algorithmic Trading

Uploaded by

akion xc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 29

MS&E 448 Final Presentation

High Frequency Algorithmic Trading

Francis Choi George Preudhomme Nopphon Siranart

Roger Song Daniel Wright

Stanford University

June 6, 2017

High-Frequency Trading MS&E448 June 6, 2017 1 / 29

Overview

Review our strategy and progress from the midterm

Changes in Data Processing
Changes to Models
Strategy and Simulations
Results
Evaluation and Next Steps

High-Frequency Trading MS&E448 June 6, 2017 2 / 29

Recall from the Midterm

Goal: Next-minute price movement prediction based on order book

dynamics
Data: Minute-by-Minute consolidated book for S&P 500 ETF (IVV)
Model: Random Forest three-way classifier
Labels: Mid-price changes and spread-crossing
Trading Strategy: Accumulating positions and closing them out at
the end of the day
Results: Still not generated profit

High-Frequency Trading MS&E448 June 6, 2017 3 / 29

After the Midterm

Data Processing
Changing the data from minute by minute to second by second
Change from three-way classification to binary classification (no
longer using spread crossing label)
Train and test on a rolling window basis - 2 weeks training period
prior to each day

High-Frequency Trading MS&E448 June 6, 2017 4 / 29

Data (Example)

High-Frequency Trading MS&E448 June 6, 2017 5 / 29

After the Midterm

New Labels
AREA
Time-weighed PnL over the next period (area under the price
movement curve)

VWAP
Volume-weighted average price (VWAP) based on inner bid and ask.
Whether it goes up or down in the window.

High-Frequency Trading MS&E448 June 6, 2017 6 / 29

After the Midterm

Adding new features

Bid-Ask Volume Imbalance Quantity indicating the number of
shares at the bid minus the number of shares at the ask in the current
order book.
VWAP A variation on mid-price where the average of the bid and ask
prices is weighted according to their inverse volume.
Second Order Derivatives Expand feature universe to encompass
multiple time periods.

High-Frequency Trading MS&E448 June 6, 2017 7 / 29

Model

Logistic Regression
Outputs probability (how confident we are) on each trade
Advantages over random forest: it trains much faster, the coefficients
have an interpretation

High-Frequency Trading MS&E448 June 6, 2017 8 / 29

Model
Random Forest
Again, outputs probability (how confident we are) on each trade
One key advantage over logistic regression - doesn’t assume any
functional form and slightly higher accuracy

High-Frequency Trading MS&E448 June 6, 2017 9 / 29

Strategy

Train the model on a rolling backwards window.

At each second, use the model to arrive at a prediction with a
probability estimate.
If the probability estimate is above the threshold, make the predicted
trade with the size weighted accordingly
Close out the trade at the end of the trading window.

High-Frequency Trading MS&E448 June 6, 2017 10 / 29

Thesys Simulator
Here is what we think it looks like

High-Frequency Trading MS&E448 June 6, 2017 11 / 29

Thesys Simulator
Here is what it actually looks like

High-Frequency Trading MS&E448 June 6, 2017 12 / 29

Thesys Simulator

Very frustrating and very slow

We decided to just pull the data
from Thesys and do the
simulations manually.

High-Frequency Trading MS&E448 June 6, 2017 13 / 29

Results

We choose 10 stocks and ETFs to test our trading strategies, chosen

based on liquidity
These include XLF, CSCO, EEM, IVV, IWM, QQQ, UVXY, VXX,
XLE, SPY
Training Period - 2 weeks from 01/05/2015 - 01/16/2015
Test Period - 2 weeks from 01/19/2015 - 01/30/2015
We use PnL per trade as a performance metric

High-Frequency Trading MS&E448 June 6, 2017 14 / 29

Tuning Parameters

Figure: Heat map of accuracy for different decay and window length parameters
(Left) XLE (Right) XLF

High-Frequency Trading MS&E448 June 6, 2017 15 / 29

Accuracy of Model: Logistic Regression

Figure: Prediction accuracy vs prediction threshold for the logistic regression

model

High-Frequency Trading MS&E448 June 6, 2017 16 / 29

Accuracy of Model: Random Forest

Figure: Prediction accuracy vs prediction threshold for the random forest model.

High-Frequency Trading MS&E448 June 6, 2017 17 / 29

Accuracy of Model: Difference
Overall, Random Forest has slightly better accuracy across threshold
values.

Figure: Prediction accuracy RF - LR vs prediction threshold.

High-Frequency Trading MS&E448 June 6, 2017 18 / 29

Cumulative PnL (XLF)
PnL stably increasing throughout the day - High Sharpe Ratio !!

Figure: Cumulative PnL within a day

High-Frequency Trading MS&E448 June 6, 2017 19 / 29

Trading PnL (XLF)
Logistic Regression with VWAP label performs best in this case

Figure: PnL per Trade vs prediction threshold for each algorithm and label

High-Frequency Trading MS&E448 June 6, 2017 20 / 29

Trading PnL (XLF)
Tuning hyperparameters improves the model significantly

Figure: PnL per Trade vs prediction threshold for different hyperparameters

High-Frequency Trading MS&E448 June 6, 2017 21 / 29

Trading PnL (MSFT)
Random Forest with AREA label performs best for MSFT

Figure: PnL per Trade vs prediction threshold for each algorithm and label

High-Frequency Trading MS&E448 June 6, 2017 22 / 29

Trading PnL (MSFT)
A combination of non-optimal hyperparameters, models and labels
performs poorly.

Figure: PnL per Trade vs prediction threshold for different hyperparameters

High-Frequency Trading MS&E448 June 6, 2017 23 / 29

Multiple Stocks
Random Forest with AREA labels. Window = 15, decay = 0.8

Figure: PnL per Trade vs prediction threshold for different stocks

High-Frequency Trading MS&E448 June 6, 2017 24 / 29

Multiple Stocks
Logistic Regression with AREA labels. Window = 15, decay = 0.8

Figure: PnL per Trade vs prediction threshold for different stocks

High-Frequency Trading MS&E448 June 6, 2017 25 / 29

Evaluating Our Strategy

Strengths:
High accuracy rates: model is doing a good job
High PnL per trade with small variance especially when training on a
longer period of time
The model can be generalized to multiple stocks/ETFs
Perform well even in tumultuous historical periods and on
hypothetical scenarios
Limitations:
Have to tune hyperparameters for each stock
High prediction accuracy does not always mean profit: label isn’t
exactly a prediction of PnL
Interpretability of the model

High-Frequency Trading MS&E448 June 6, 2017 26 / 29

Future Work and Areas for Improvement

Within 10 weeks, we can’t make the perfect trading strategy: there is

still a lot we could improve.
Some ideas for further work:
Training on a longer period of time
More sophisticated features: right now we only use the order book
data, could try including external features (such as an index like the
VIX, or data on correlated securities, etc.)
Converting to a strategy that trades at bid and ask (rather than
midprice)
Modifying strategy to handle scaled-up trade quantities
Risk Management

High-Frequency Trading MS&E448 June 6, 2017 27 / 29

Conclusion

Idea: use machine learning techniques on the order book to make

price movement predictions. Trade on these predictions to make $$$
Models: Random forest, logistic regression
Data: Second-by-second orderbook data from Thesys
Calibrated trading frequency, prediction label, hyperparameters of
models
Performed simulations on historical data
Promising results that can be built upon

High-Frequency Trading MS&E448 June 6, 2017 28 / 29

Conclusion

The End
Questions?

High-Frequency Trading MS&E448 June 6, 2017 29 / 29

Modelling Survival Data in Medical Research PDF
100% (6)
Modelling Survival Data in Medical Research PDF
538 pages
The Microstructure of Financial Markets: Frank de Jong
No ratings yet
The Microstructure of Financial Markets: Frank de Jong
2 pages
Convex Functions and Their Applications: Constantin P. Niculescu Lars-Erik Persson
No ratings yet
Convex Functions and Their Applications: Constantin P. Niculescu Lars-Erik Persson
430 pages
Equity Analytics - Modern Portfolio Theory-Jonathan Kinlay
100% (1)
Equity Analytics - Modern Portfolio Theory-Jonathan Kinlay
7 pages
MATH4512 2022spring HW1Solution
No ratings yet
MATH4512 2022spring HW1Solution
11 pages
ET Slides FIN566 201801205
No ratings yet
ET Slides FIN566 201801205
58 pages
Stock Market Prediction Using MLP and Random Forest
No ratings yet
Stock Market Prediction Using MLP and Random Forest
18 pages
M7 Assignment - 72
No ratings yet
M7 Assignment - 72
18 pages
Statistical Arbitrage in High Frequency Trading Based On Limit Order Book Dynamics
No ratings yet
Statistical Arbitrage in High Frequency Trading Based On Limit Order Book Dynamics
26 pages
Protein Assay Using The Bradford Method
100% (3)
Protein Assay Using The Bradford Method
2 pages
Microstructure Tutorial PDF
No ratings yet
Microstructure Tutorial PDF
38 pages
Rubisov Anton 201511 MAS Thesis
No ratings yet
Rubisov Anton 201511 MAS Thesis
94 pages
Market Microstructure: Pamantasan NG Lungsod NG Pasig College of Business and Accountancy
No ratings yet
Market Microstructure: Pamantasan NG Lungsod NG Pasig College of Business and Accountancy
10 pages
Greek Letters of Finance
No ratings yet
Greek Letters of Finance
40 pages
Special: Part II of II
No ratings yet
Special: Part II of II
16 pages
5-15 Genesis System 2012-10-07 Rev003 - 2
No ratings yet
5-15 Genesis System 2012-10-07 Rev003 - 2
11 pages
Econ 122 Lecture 3 Derivatives
No ratings yet
Econ 122 Lecture 3 Derivatives
19 pages
Wayne A. Thorp - Testing Trading Success
No ratings yet
Wayne A. Thorp - Testing Trading Success
5 pages
UNIT 5 - Technical Analysis in Investment
No ratings yet
UNIT 5 - Technical Analysis in Investment
22 pages
LM03 Fixed-Income Issuance and Trading IFT Notes
No ratings yet
LM03 Fixed-Income Issuance and Trading IFT Notes
9 pages
CH3 Derivatives PP
No ratings yet
CH3 Derivatives PP
61 pages
ATMASphere Sept 2014 PDF
No ratings yet
ATMASphere Sept 2014 PDF
25 pages
Almgren, Li - Closed-Form Solutions For Option Hedging With Market Impact
No ratings yet
Almgren, Li - Closed-Form Solutions For Option Hedging With Market Impact
30 pages
Ichimoku Kinko Hyo Strategies
No ratings yet
Ichimoku Kinko Hyo Strategies
3 pages
Micro Structure Tutorial
No ratings yet
Micro Structure Tutorial
38 pages
Financial Market Risks & Management
No ratings yet
Financial Market Risks & Management
73 pages
Two Way ANOVA
100% (1)
Two Way ANOVA
83 pages
MS&E448: Statistical Arbitrage: Group 5: Carolyn Soo, Zhengyi Lian, Jiayu Lou, Hang Yang
No ratings yet
MS&E448: Statistical Arbitrage: Group 5: Carolyn Soo, Zhengyi Lian, Jiayu Lou, Hang Yang
31 pages
EP Chan Course Offerings
No ratings yet
EP Chan Course Offerings
18 pages
Page 0008
No ratings yet
Page 0008
1 page
Market Microstructure: Information-Based Models
No ratings yet
Market Microstructure: Information-Based Models
8 pages
Time Series Forecasting With Feed-Forward Neural Networks
100% (1)
Time Series Forecasting With Feed-Forward Neural Networks
40 pages
TD Sequential
No ratings yet
TD Sequential
66 pages
Ahmed Rebai, PHD in Nuclear Physics
No ratings yet
Ahmed Rebai, PHD in Nuclear Physics
34 pages
MarketMaking Models - Summary
No ratings yet
MarketMaking Models - Summary
35 pages
A Primer On The MACD
No ratings yet
A Primer On The MACD
5 pages
The Augmented Bollinger Bands
No ratings yet
The Augmented Bollinger Bands
23 pages
Advanced Pairs Course Oct 2008
No ratings yet
Advanced Pairs Course Oct 2008
1 page
Optimal Trend Following Trading Rules
No ratings yet
Optimal Trend Following Trading Rules
25 pages
Hedging Mean-Reverting Commodities
No ratings yet
Hedging Mean-Reverting Commodities
8 pages
Predatory Trading II
No ratings yet
Predatory Trading II
47 pages
Resume 3
No ratings yet
Resume 3
2 pages
Order Deal Positions
No ratings yet
Order Deal Positions
21 pages
Market Timing Strat To Avoid Bear Markets
No ratings yet
Market Timing Strat To Avoid Bear Markets
25 pages
Moving Average
No ratings yet
Moving Average
36 pages
Zahar Udin PHD Thesis
No ratings yet
Zahar Udin PHD Thesis
201 pages
Deviation Scaled Moving Average
No ratings yet
Deviation Scaled Moving Average
4 pages
Options Obv Feb2013
100% (1)
Options Obv Feb2013
7 pages
Greeks: Type Delta Value Profits When..
No ratings yet
Greeks: Type Delta Value Profits When..
6 pages
Event Study On Stock Splits
No ratings yet
Event Study On Stock Splits
18 pages
Optimal Tracking Filters Ehlers
No ratings yet
Optimal Tracking Filters Ehlers
5 pages
Indicator #1: Trend-Following Indicators: Moving Average Simple Moving Average
No ratings yet
Indicator #1: Trend-Following Indicators: Moving Average Simple Moving Average
1 page
Contrarian and Momentum Strategies
No ratings yet
Contrarian and Momentum Strategies
7 pages
91 3 STOIKOV Microstructure Talk
No ratings yet
91 3 STOIKOV Microstructure Talk
60 pages
TG - Momentum, Acceleration
No ratings yet
TG - Momentum, Acceleration
25 pages
Backtest Overfitting Demonstration Tool
No ratings yet
Backtest Overfitting Demonstration Tool
9 pages
Orangeroshan'S SRDC Method
No ratings yet
Orangeroshan'S SRDC Method
14 pages
Getting Started With Edgerater: Chris White Founder and Ceo, Edgerater LLC December 2013
No ratings yet
Getting Started With Edgerater: Chris White Founder and Ceo, Edgerater LLC December 2013
26 pages
Luigi Piva
No ratings yet
Luigi Piva
2 pages
MSA3 Users Guide
No ratings yet
MSA3 Users Guide
125 pages
A Tale of Two Traders
No ratings yet
A Tale of Two Traders
4 pages
MS&E 448 Final Presentation High Frequency Algorithmic Trading
No ratings yet
MS&E 448 Final Presentation High Frequency Algorithmic Trading
29 pages
Research On Optimizing Real-Time Data Processing in High-Frequency Trading Algorithms Using Machine Learning
No ratings yet
Research On Optimizing Real-Time Data Processing in High-Frequency Trading Algorithms Using Machine Learning
4 pages
Analyzing Qualitative and Quantitative Data
100% (1)
Analyzing Qualitative and Quantitative Data
8 pages
BMS458 Assignment 1 Ariesya Adnan AS2442B
No ratings yet
BMS458 Assignment 1 Ariesya Adnan AS2442B
8 pages
Barrier Option Pricing: Degree Project in Mathematics, First Level
No ratings yet
Barrier Option Pricing: Degree Project in Mathematics, First Level
38 pages
Muhammad Palize Qazi - 24027 - Assignment 2
No ratings yet
Muhammad Palize Qazi - 24027 - Assignment 2
5 pages
MGMT3101 International Business Strategy S22014
No ratings yet
MGMT3101 International Business Strategy S22014
24 pages
Application of Derivative WA
No ratings yet
Application of Derivative WA
15 pages
Exhibit 15.5 - Simple Moving Average
No ratings yet
Exhibit 15.5 - Simple Moving Average
23 pages
Hydro Matlab
No ratings yet
Hydro Matlab
6 pages
Chapter 07.00G Physical Problem For Integration Mechanical Engineering
No ratings yet
Chapter 07.00G Physical Problem For Integration Mechanical Engineering
4 pages
Esfeqa Guideline For Method Key Selection: Hemogram
No ratings yet
Esfeqa Guideline For Method Key Selection: Hemogram
2 pages
Mat 111 - Single Variable Caluclus: Syllabus and Tutorial Problems
No ratings yet
Mat 111 - Single Variable Caluclus: Syllabus and Tutorial Problems
12 pages
Periodic Table of The Finite Elements
No ratings yet
Periodic Table of The Finite Elements
1 page
One-Way Analysis of Variance F-Tests Using Effect Size
No ratings yet
One-Way Analysis of Variance F-Tests Using Effect Size
8 pages
Public Version
No ratings yet
Public Version
214 pages
Differential Calculus
No ratings yet
Differential Calculus
4 pages
Math 7 Learning Competencies
No ratings yet
Math 7 Learning Competencies
2 pages
Function Approximation, Interpolation, and Curve Fitting PDF
100% (1)
Function Approximation, Interpolation, and Curve Fitting PDF
53 pages
Coelho, G. L. H., Et Al (2018) - de Jong Gierveld Loneliness Scale - Short Version Validation For The Brazilian Context. Paidéia (Ribeirão Preto), 28, E2805 PDF
No ratings yet
Coelho, G. L. H., Et Al (2018) - de Jong Gierveld Loneliness Scale - Short Version Validation For The Brazilian Context. Paidéia (Ribeirão Preto), 28, E2805 PDF
9 pages
Lesson 3-Roots and Optimization
No ratings yet
Lesson 3-Roots and Optimization
30 pages
Supreme IQ Option
No ratings yet
Supreme IQ Option
2 pages
Week 6 - Fourier Transform: (Textbook: Ch. 5)
No ratings yet
Week 6 - Fourier Transform: (Textbook: Ch. 5)
18 pages
CPT MATHS 1.limits and Continuity
No ratings yet
CPT MATHS 1.limits and Continuity
5 pages
18-Application of Derivative-01 Theory
No ratings yet
18-Application of Derivative-01 Theory
18 pages
HPLC Calibration
100% (1)
HPLC Calibration
5 pages
Differential Calculus Problem Set
No ratings yet
Differential Calculus Problem Set
2 pages
AnalysisI Sheet5
No ratings yet
AnalysisI Sheet5
2 pages
Analog Circuit Master Cheat
No ratings yet
Analog Circuit Master Cheat
7 pages

MS&E 448 Final Presentation High Frequency Algorithmic Trading

Uploaded by

MS&E 448 Final Presentation High Frequency Algorithmic Trading

Uploaded by

MS&E 448 Final Presentation

High Frequency Algorithmic Trading

Francis Choi George Preudhomme Nopphon Siranart

High-Frequency Trading MS&E448 June 6, 2017 1 / 29

Review our strategy and progress from the midterm

High-Frequency Trading MS&E448 June 6, 2017 2 / 29

Goal: Next-minute price movement prediction based on order book

High-Frequency Trading MS&E448 June 6, 2017 3 / 29

High-Frequency Trading MS&E448 June 6, 2017 4 / 29

High-Frequency Trading MS&E448 June 6, 2017 5 / 29

High-Frequency Trading MS&E448 June 6, 2017 6 / 29

Adding new features

High-Frequency Trading MS&E448 June 6, 2017 7 / 29

High-Frequency Trading MS&E448 June 6, 2017 8 / 29

High-Frequency Trading MS&E448 June 6, 2017 9 / 29

Train the model on a rolling backwards window.

High-Frequency Trading MS&E448 June 6, 2017 10 / 29

High-Frequency Trading MS&E448 June 6, 2017 11 / 29

High-Frequency Trading MS&E448 June 6, 2017 12 / 29

Very frustrating and very slow

High-Frequency Trading MS&E448 June 6, 2017 13 / 29

We choose 10 stocks and ETFs to test our trading strategies, chosen

High-Frequency Trading MS&E448 June 6, 2017 14 / 29

High-Frequency Trading MS&E448 June 6, 2017 15 / 29

Figure: Prediction accuracy vs prediction threshold for the logistic regression

High-Frequency Trading MS&E448 June 6, 2017 16 / 29

High-Frequency Trading MS&E448 June 6, 2017 17 / 29

Figure: Prediction accuracy RF - LR vs prediction threshold.

High-Frequency Trading MS&E448 June 6, 2017 18 / 29

Figure: Cumulative PnL within a day

High-Frequency Trading MS&E448 June 6, 2017 19 / 29

High-Frequency Trading MS&E448 June 6, 2017 20 / 29

Figure: PnL per Trade vs prediction threshold for different hyperparameters

High-Frequency Trading MS&E448 June 6, 2017 21 / 29

High-Frequency Trading MS&E448 June 6, 2017 22 / 29

Figure: PnL per Trade vs prediction threshold for different hyperparameters

High-Frequency Trading MS&E448 June 6, 2017 23 / 29

Figure: PnL per Trade vs prediction threshold for different stocks

High-Frequency Trading MS&E448 June 6, 2017 24 / 29

Figure: PnL per Trade vs prediction threshold for different stocks

High-Frequency Trading MS&E448 June 6, 2017 25 / 29

High-Frequency Trading MS&E448 June 6, 2017 26 / 29

Within 10 weeks, we can’t make the perfect trading strategy: there is

High-Frequency Trading MS&E448 June 6, 2017 27 / 29

Idea: use machine learning techniques on the order book to make

High-Frequency Trading MS&E448 June 6, 2017 28 / 29

High-Frequency Trading MS&E448 June 6, 2017 29 / 29

You might also like