
A Low-Latency FPGA-based

Infrastructure for HFT Systems

Andrew Boutros, Brett Grady and Mustafa Abbas


ECE1373 Course Project
Recap
High-frequency trading (HFT) is the rapid, automated exchange of financial instruments using networked computers.
Our application:
▪ Receives financial data in compressed form (FAST) over the network
▪ Decodes the incoming packet (market update)
▪ Updates the model and possibly makes a trading decision (bid/ask)
▪ Encodes the outgoing data (market order)
▪ Sends the bid/ask over the network with a latency estimate
Project Goals
Design an FPGA-based infrastructure for
HFT that:
▪ Abstracts the details of networking &
financial encoding/decoding.
▪ Handles order book keeping
(pre-processing) to ease development of
HFT algorithms on FPGAs.
▪ Is extensible and easy to interface with,
but still has very low round-trip latency
(<1 µs including the network stack).
System Overview
Our system has five modules:
▪ UDP Network Stack - Xilinx's UDP network stack.
▪ Network Switch/Timestamper - Timestamps incoming and outgoing packets to track latency.
▪ FAST Encoder/Decoder - Performs FAST protocol conversion on incoming and outgoing packets.
▪ Order Book - Sorts all current valid bid and ask orders by price and passes the top values to the application layer.
▪ Application Layer - Simple demo "client" hardware that uses the order book data to execute trades.
Network Switch & Timestamper
● Automatically tags incoming packets with a timestamp.
● This timestamp is passed through the downstream system unchanged.
● If an incoming message triggers an order, that message's timestamp is transmitted back to the timestamper alongside the outgoing order.
● The timestamper then computes the latency and appends it to the outgoing packet.
● Also provides multiplexing on the transmit side for monitoring (not used in the final project iteration).
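As a rough illustration of the latency bookkeeping described above (a minimal sketch with assumed names and widths, not the project's actual interface): sample a free-running cycle counter on ingress, carry the value with the message, and append the difference on egress.

    #include <cstdint>

    // Assumed free-running cycle counter maintained by the switch fabric.
    static uint64_t cycle_counter = 0;

    struct TaggedPacket {
        uint64_t rx_timestamp;    // tagged on ingress, carried downstream unchanged
        uint64_t latency_cycles;  // appended on egress
        // ... payload fields omitted ...
    };

    // Ingress: tag the incoming market update with the current cycle count.
    void tag_ingress(TaggedPacket &p) { p.rx_timestamp = cycle_counter; }

    // Egress: if this order was triggered by a tagged update, compute the
    // round-trip latency in cycles and append it to the outgoing packet.
    void tag_egress(TaggedPacket &p) { p.latency_cycles = cycle_counter - p.rx_timestamp; }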
FAST Protocol - Recap
● Financial Information Exchange protocol (FIX) adapted for streaming:
○ Variable-length messages encoded in bytes
○ Decoding depends on a template
○ Stop bits determine the end of each field
[Example encoded message: 11011000 10000001 01100000 10011000 10011011 = PMap | TID | Field 2 (two bytes) | Field 3]
● Used to:
○ Decode UDP packets containing financial data coming in from the network layer
○ Encode packets coming from the Application Layer
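To make the stop-bit rule concrete, here is a minimal decode sketch (illustrative only, not the project's decoder): each byte carries 7 data bits, and a set MSB marks the last byte of a field.

    #include <cstdint>
    #include <cstddef>

    // Decode one stop-bit-terminated unsigned field starting at buf[*pos].
    // Each byte contributes its low 7 bits; MSB = 1 marks the field's last byte.
    uint64_t fast_decode_uint(const uint8_t *buf, size_t *pos) {
        uint64_t value = 0;
        uint8_t byte;
        do {
            byte = buf[(*pos)++];
            value = (value << 7) | (byte & 0x7F);
        } while ((byte & 0x80) == 0);   // keep reading until the stop bit is set
        return value;
    }

Applied to the example above, the bytes 01100000 10011000 form a single two-byte field, while each of the other bytes ends a field on its own.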
FAST Decoder
● An input UDP packet arrives in 64-bit chunks.
● Packets are buffered into bytes before processing.
● Dataflow:
1) The stop-bit detector inspects the MSB of each byte
2) Multiple field decoders run in parallel
3) The decoded message is sent to the Order Book
● Latency of 9 cycles at a 5 ns clock period, as estimated by Vivado HLS.
[Block diagram: UDP packet data → stop-bit detector → parallel Decode Price / Decode Size / Decode ID / Decode Type → combine into a market order; the timestamp and UDP metadata pass through unchanged]
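A simplified, serial HLS-style sketch of the decoder interface (field order and widths are assumptions; the actual design buffers 64-bit words into bytes and runs the field decoders in parallel to reach the 9-cycle latency):

    #include "hls_stream.h"
    #include <cstdint>

    struct MarketOrder { uint32_t price, size, id; uint8_t type; };

    // Read one stop-bit-terminated field from the incoming byte stream
    // (7 data bits per byte; MSB = 1 ends the field).
    static uint64_t read_field(hls::stream<uint8_t> &bytes) {
        uint64_t v = 0;
        uint8_t b;
        do {
            b = bytes.read();
            v = (v << 7) | (b & 0x7F);
        } while ((b & 0x80) == 0);
        return v;
    }

    // Assumed message layout with a single template: price, size, ID, type.
    void fast_decoder(hls::stream<uint8_t> &bytes_in,
                      hls::stream<MarketOrder> &order_out) {
        MarketOrder o;
        o.price = read_field(bytes_in);
        o.size  = read_field(bytes_in);
        o.id    = read_field(bytes_in);
        o.type  = read_field(bytes_in);
        order_out.write(o);
    }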
FAST Encoder
● An input market order is received from the Custom App module.
● Dataflow:
1) Multiple field encoders run in parallel
2) The encoded message is sent to the Network layer in 64-bit chunks
● Latency of 0 cycles at a 5 ns clock period, as estimated by Vivado HLS.
[Block diagram: market order → parallel Encode Price / Encode Size / Encode ID / Encode Type → combine into UDP packets; the timestamp and UDP metadata pass through unchanged]
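And the reverse direction, a minimal stop-bit encode sketch (illustrative, not the project's encoder): emit the value as 7-bit groups, most significant first, and set the MSB of the final byte.

    #include <cstdint>
    #include <cstddef>

    // Encode an unsigned value into stop-bit bytes (7 data bits per byte).
    // Returns the number of bytes written to out (out must hold up to 10 bytes).
    size_t fast_encode_uint(uint64_t value, uint8_t *out) {
        uint8_t tmp[10];
        size_t n = 0;
        do {                                  // collect 7-bit groups, least significant first
            tmp[n++] = value & 0x7F;
            value >>= 7;
        } while (value != 0);
        for (size_t i = 0; i < n; i++)        // reverse into most-significant-first order
            out[i] = tmp[n - 1 - i];
        out[n - 1] |= 0x80;                   // stop bit on the last byte of the field
        return n;
    }

Encoding the value from the earlier two-byte example reproduces the bytes 01100000 10011000.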
Order Book Keeping - Recap
▪ It is an essential pre-processing stage for almost all financial algorithms.
▪ Keeps track of bids (offers to buy) and asks (offers to sell) in order of their price.
▪ This translates into building a hardware priority queue with some special features:
– Low-latency insert and delete.
– Modify top entry.
– Remove a specific entry.
– Remove multiple entries.

Example Bid Order Book:
Order ID | Time     | Size | Bid Price
101      | 12:02:36 | 4    | 27.4
104      | 12:03:18 | 2    | 27.4
102      | 12:03:07 | 6    | 27.2

Example Ask Order Book:
Order ID | Time     | Size | Price
105      | 12:03:25 | 4    | 27.6
103      | 12:03:25 | 2    | 27.7
Order Book Keeping - Implementation
[Figure: example binary max-heap: root 26 at index 1, children 23 and 22 at indices 2 and 3, and 21 at index 4; heap array (1-indexed): 26 23 22 21; holes list: empty]
Order Book Keeping - Implementation
▪ For any node at index j:
– Left child has index 2j.
– Right child has index 2j+1.

[Figure: same example heap; heap array: 26 23 22 21]
Order Book Keeping - Implementation
▪ For any node at index j:
– Left child has index 2j.
– Right child has index 2j+1.
▪ For any node at index j and level i (root at level 0):
– The path from the root to this node is the binary representation of the least significant i bits of j - 2^i.

[Figure: same example heap; heap array: 26 23 22 21]
Order Book Keeping - Implementation
▪ For any node at index j:
– Left child has index 2j.
– Right child has index 2j+1.
▪ For any node at index j and level i (root at level 0):
– The path from the root to this node is the binary representation of the least significant i bits of j - 2^i.
– For example, node 5 is at level 2: 5 - 2^2 = 1 → 01 in binary (using i = 2 bits).

[Figure: same example heap; heap array: 26 23 22 21]
Order Book Keeping - Implementation
▪ For any node at index j:
– Left child has index 2j.
– Right child has index 2j+1.
▪ For any node at index j and level i (root at level 0):
– The path from the root to this node is the binary representation of the least significant i bits of j - 2^i.
– For example, node 5 is at level 2: 5 - 2^2 = 1 → 01 in binary (using i = 2 bits).
– Reading the path bits from left to right, 0 means "go left" and 1 means "go right", so 01 = left, then right.

[Figure: same example heap; heap array: 26 23 22 21]
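The index arithmetic above reduces to a few shifts. A small sketch (helper names are ours, illustrative only):

    #include <cstdint>

    // Children of the node at 1-indexed position j.
    static inline uint32_t left_child(uint32_t j)  { return 2 * j; }
    static inline uint32_t right_child(uint32_t j) { return 2 * j + 1; }

    // Level of node j, with the root at level 0.
    static inline uint32_t level_of(uint32_t j) {
        uint32_t level = 0;
        while (j >>= 1) level++;
        return level;
    }

    // Root-to-node path: the least significant `level` bits of j - 2^level,
    // read from MSB to LSB, where a 0 bit means "go left" and 1 means "go right".
    static inline uint32_t path_of(uint32_t j, uint32_t level) {
        return j - (1u << level);   // e.g. node 5, level 2: 5 - 4 = 1 = binary 01 -> left, right
    }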
Order Book Keeping - Insertion
Using those two simple yet very helpful observations, we can build a very efficient priority queue (PQ).
Example: insert the value 24 into the heap below.
1. Get the index of the next empty node → 5

[Figure: heap array before insertion: 26 23 22 21; holes: empty]
Order Book Keeping - Insertion
Using those two simple yet very helpful observations, we can build a very efficient PQ.
Example: insert 24.
1. Get the index of the next empty node → 5
2. Get the path → 5 - 4 = 01 (left, then right)

[Figure: heap array: 26 23 22 21; holes: empty]
Order Book Keeping - Insertion
Using those two simple yet very helpful observations, we can build a very efficient PQ.
Example: insert 24.
1. Get the index of the next empty node → 5
2. Get the path → 5 - 4 = 01 (left, then right)
3. Compare to index 1 → 26 > 24 (no swap)

[Figure: heap array: 26 23 22 21; 24 carried down from the root]
Order Book Keeping - Insertion
Using those two simple yet very helpful observations, we can build a very efficient PQ.
Example: insert 24.
1. Get the index of the next empty node → 5
2. Get the path → 5 - 4 = 01 (left, then right)
3. Compare to index 1 → 26 > 24 (no swap)
4. Move to the left child → index 2 × 1 = 2

[Figure: heap array: 26 23 22 21; 24 carried toward index 2]
Order Book Keeping - Insertion
Using those two simple yet very helpful observations, we can build a very efficient PQ.
Example: insert 24.
1. Get the index of the next empty node → 5
2. Get the path → 5 - 4 = 01 (left, then right)
3. Compare to index 1 → 26 > 24 (no swap)
4. Move to the left child → index 2 × 1 = 2
5. Compare to index 2 → 23 < 24 (swap)

[Figure: 24 written at index 2; heap array: 26 24 22 21; 23 is now carried down]
Order Book Keeping - Insertion
Using those two simple yet very helpful observations, we can build a very efficient PQ.
Example: insert 24.
1. Get the index of the next empty node → 5
2. Get the path → 5 - 4 = 01 (left, then right)
3. Compare to index 1 → 26 > 24 (no swap)
4. Move to the left child → index 2 × 1 = 2
5. Compare to index 2 → 23 < 24 (swap)
6. Move to the right child → index (2 × 2) + 1 = 5

[Figure: heap array: 26 24 22 21; 23 carried toward index 5]
Order Book Keeping - Insertion
Using those two simple yet very helpful observations, we can build a very efficient PQ.
Example: insert 24.
1. Get the index of the next empty node → 5
2. Get the path → 5 - 4 = 01 (left, then right)
3. Compare to index 1 → 26 > 24 (no swap)
4. Move to the left child → index 2 × 1 = 2
5. Compare to index 2 → 23 < 24 (swap)
6. Move to the right child → index (2 × 2) + 1 = 5
7. Destination reached → 23 is written at index 5

[Figure: final heap array: 26 24 22 21 23; holes: empty]
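A behavioral C++ sketch of the insertion walk-through above (prices only, flat 1-indexed array, hole reuse omitted; the real HLS design splits the array per level and also carries size, ID and timestamp):

    #include <cstdint>

    static const uint32_t HEAP_CAPACITY = 4096;
    static uint32_t heap[HEAP_CAPACITY + 1];     // 1-indexed; index 0 unused
    static uint32_t heap_size = 0;

    // Insert a new price by walking from the root toward the next empty node,
    // carrying whichever of (incoming value, resident value) is smaller downward.
    void heap_insert(uint32_t value) {
        uint32_t dest  = ++heap_size;            // index of the next empty node
        uint32_t level = 0;
        for (uint32_t t = dest; t > 1; t >>= 1) level++;
        uint32_t path  = dest - (1u << level);   // 0 bit = go left, 1 bit = go right

        uint32_t j = 1;                          // start at the root
        for (uint32_t i = 0; i < level; i++) {
            if (heap[j] < value) {               // resident is smaller: swap, carry it down
                uint32_t tmp = heap[j];
                heap[j] = value;
                value   = tmp;
            }
            uint32_t go_right = (path >> (level - 1 - i)) & 1;
            j = 2 * j + go_right;
        }
        heap[j] = value;                         // destination reached
    }

Tracing the example: inserting 24 into 26 23 22 21 gives the path 01, no swap at index 1, a swap at index 2, and 23 lands at index 5, producing 26 24 22 21 23.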
Order Book Keeping - Specs
▪ Capacity of 4096 bids and 4096 asks.
▪ Streaming output after the first comparison.
▪ II = 1 for all insertion and deletion loops.
▪ Ability to modify the top order's size if only a portion of the order is sold.
▪ Ability to delete any arbitrary node, supported by adding another heap for the "to-be-removed" orders.

[Figure: example heap state: indices 1-4: 24 23 22 21, index 5: empty, index 6: 19; holes list: 5]
HLS Shortcoming
▪ When optimizing the order book for II = 1, we needed to partition the heap array such that each level of the tree is in a separate partition.
▪ Array partitioning with varying partition sizes is not doable in HLS.

[Figure: the example heap (24 | 23 22 | 21 19) with each tree level as its own desired partition]
HLS Shortcoming
▪ When optimizing the order book for II = 1, we needed to partition the heap array such that each level of the tree is in a separate partition.
▪ Array partitioning with varying partition sizes is not doable in HLS.
▪ We had to implement the heap array as a 2D array with dimensions (number of levels) × (size of the tree base).
▪ This wastes a lot of memory resources, which could be avoided with more complex coding.

[Figure: the 2D layout: each tree level (24 | 23 22 | 21 19) stored in its own full-width row of the heap array]
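A sketch of the workaround (sizes are assumptions for a roughly 4K-entry book): each tree level becomes a row of a 2D array, and the first dimension is completely partitioned so every level gets its own memory, at the cost of every row being as wide as the tree base.

    #include "ap_int.h"

    static const int LEVELS    = 12;                  // assumed tree depth for a ~4K-entry book
    static const int BASE_SIZE = 1 << (LEVELS - 1);   // width of the deepest level

    typedef ap_uint<32> entry_t;

    // Return the current best price. The 2D layout wastes memory (every level is
    // BASE_SIZE wide), but partitioning dim 1 completely puts each level in its
    // own memory so an II=1 insert/delete pipeline can touch one level per stage.
    entry_t order_book_top() {
        static entry_t heap[LEVELS][BASE_SIZE];
    #pragma HLS ARRAY_PARTITION variable=heap complete dim=1
        return heap[0][0];                            // level 0, offset 0 = root node
    }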
Testing and Verification
▪ We used an incremental approach to integrate the system:
Iter 1. Network stack & timestamping
Iter 2. FAST Encoder/Decoder with the network
Iter 3. Entire system, including the Order Book
▪ This allowed more robust in-hardware verification of components such as the FAST encoder.
▪ The incremental approach also made it possible to build a latency breakdown of the individual blocks once synthesized.
Testing and Verification
Two different monitoring mechanisms were
used in the hardware:
1 | MicroBlaze Monitoring
▪ The Order Book exposes an AXI-Lite interface with the top bid/ask, identical to the last streaming output.
▪ The top bid/ask were reported periodically (roughly every second) through the JTAG debug interface.
▪ This allows observation of the Order Book state, which is normally "hidden" behind the trading algorithm.
Testing and Verification
2 | Network-Based Testing
▪ The system is tested as a black box; we can only observe the outgoing packets it transmits.
▪ Latency data is appended onto the outgoing order packets.
▪ Server-side software:
– Generates test data, encodes it as FAST orders and sends them over the Ethernet interface
– Receives and decodes orders from the hardware
– Computes and displays an equivalent Order Book state given the test data.
FPGA Platform
▪ Xilinx Kintex UltraScale FPGA on the Alpha Data 8K5

– 10 Gigabit Ethernet

– Can reuse the same UDP/IP subsystem used for the Shell project

– Readily available and already configured through the Savi server

▪ Vivado HLS for high-level synthesis of the cores

▪ Vivado IP Integrator for connecting the HLS cores together


Timing Results
Module Under Test      | Frequency | Round-Trip Latency | Round-Trip Runtime
Full System            | 156 MHz   | 42 cycles          | 269.2 ns
Full System + Ethernet | 156 MHz   | 85 cycles          | 525 ns

▪ The final system ran at 156 MHz, limited by the Ethernet port.
▪ The measured latency of the system is 42 cycles (269.2 ns).
▪ The network adds roughly 85 ns for transmit and 170 ns for receive.
▪ The total runtime is 525 ns on average over 10,000 random test packets.
Timing breakdown
Module             | Latency   | Cumulative Latency | Runtime
Network Transmit   | 13 cycles | -                  | 85 ns
Network Receive    | 27 cycles | -                  | 170 ns
Network Switch     | 12 cycles | 12 cycles          | 76.9 ns
FAST Encode/Decode | 18 cycles | 30 cycles          | 115.4 ns
Order Book         | 12 cycles | 42 cycles          | 76.9 ns

▪ The reported network switch latency shows that the streaming interface adds a significant amount of latency.
▪ Adding the Order Book added only 12 cycles of latency, a minimal amount.
Area Results
Resource | Utilization | Available | Utilization %
LUT      | 49,638      | 663,360   | 7.48
LUTRAM   | 2,148       | 293,760   | 0.73
FF       | 32,718      | 1,326,720 | 2.47
BRAM     | 474         | 2,160     | 21.92
DSP      | 3           | 5,520     | 0.05

▪ Total utilization is minimal compared to the size of the FPGA.
▪ The largest utilization is BRAM:
– A trade-off for keeping the order book latency low.
Other HFT Work on the FPGA
System              | FPGA Platform          | Network Transmit | Network Receive | Decode/Encode | Rest of System | Total Runtime
Our solution        | Kintex UltraScale      | 85 ns            | 170 ns          | 192.3 ns      | 76.9 ns        | 525 ns
Leber et al. [1]    | Virtex-4 FX100         | -                | -               | -             | -              | 2.6 µs
Lockwood et al. [2] | NetFPGA-10G (Virtex-5) | 400 ns           | 400 ns          | 200 ns        | Not included   | 1 µs

▪ Leber et al. [1] decode multiple FAST streams in parallel and send the decoded data to a software processor, at a total latency of 2.6 µs.
▪ Lockwood et al. [2] use the FIX financial exchange protocol for encoding and decoding and report 200 ns.
– We measured our system with only the FAST encoder/decoder and achieved average runtimes of 192.3 ns.
HLS ROCKS
▪ Streaming interfaces used HLS library calls:
– Easy to integrate, but gave little control over the latency of communication between HLS IP cores.
▪ Regarding the FAST protocol, each exchange has its own message template:
– In C, the encoding/decoding functions are simple to order in a way that matches the message template.
▪ Experts in financial trading algorithms can easily tinker with and build onto our design with a basic understanding of hardware and no need for RTL expertise.
HLS ROCKS
▪ Design space exploration and optimization:
– HLS directives allow easier and faster tuning for performance.
– In HDL, all tuning is essentially manual code changes.
▪ Testing:
– Getting RTL to work correctly requires lots of low-level debugging.
– An HLS design can be tested in C to verify basic functionality before worrying about hardware concerns.
▪ Empirical observation:
– Before this course used HLS, many projects were not finished until after the summer.
– Our cohort finished all projects by early June; in our case, by mid-May.
– We believe this indicates that HLS is roughly 2x more productive.
Database Organization
▪ Git/Bitbucket: cloud source code control.
▪ Directory structure (hft is the base project folder):
– src: source code folder, with hls holding the Vivado HLS projects.
• Each member worked on a project in the hls folder: Network switch, FAST Protocol, Order book.
– ip: individual, partial and fully built IP cores, used for IP integration.
– integration/build: Vivado IP Integrator projects.
– scripts: network scripts; Python files used to send information through the network to the FPGA.
References

[1] C. Leber, B. Geib and H. Litz, "High Frequency Trading Acceleration Using
FPGAs," 2011 21st International Conference on Field Programmable Logic and
Applications, Chania, 2011, pp. 317-322.
[2] J. W. Lockwood, A. Gupte, N. Mehta, M. Blott, T. English and K. Vissers, "A
Low-Latency Library in FPGA Hardware for High-Frequency Trading (HFT),"
2012 IEEE 20th Annual Symposium on High-Performance Interconnects, Santa
Clara, CA, 2012, pp. 9-16.
Thank You!
Order Book Keeping - Deletion
Using those two simple yet very helpful observations, we can build a very efficient PQ.
Example: delete the top entry from the heap below.
1. Return the top node → 26.

[Figure: heap array before deletion: 26 24 22 21 23 19; holes: empty]
Order Book Keeping - Deletion
Using those two simple yet very helpful observations, we can build a very efficient PQ.
1. Return the top node → 26.
2. Pick its larger child (24) to replace it.

[Figure: heap array: 24 24 22 21 23 19 (24 promoted to the root)]
Order Book Keeping - Deletion
Using those two simple yet very helpful observations, we can build a very efficient PQ.
1. Return the top node → 26.
2. Pick its larger child (24) to replace it.
3. Go to the picked child and repeat → 23 is promoted to index 2.

[Figure: heap array: 24 23 22 21 23 19]
Order Book Keeping - Deletion
Using those two simple yet very helpful observations, we can build a very efficient PQ.
1. Return the top node → 26.
2. Pick its larger child (24) to replace it.
3. Go to the picked child and repeat.
4. On reaching a leaf node → add its index to the holes list.

[Figure: heap array: 24 23 22 21 (index 5 empty) 19; holes: 5]
Order Book Keeping - Deletion
Using those two simple yet very helpful observations, we can build a very efficient PQ.
1. Return the top node → 26.
2. Pick its larger child (24) to replace it.
3. Go to the picked child and repeat.
4. On reaching a leaf node → add its index to the holes list.
If an insertion occurs, the holes are filled first, before the next empty node, to maintain the heap structure.

[Figure: final heap array: 24 23 22 21 (index 5 empty) 19; holes: 5]