Furiosa Introduction Confidential
Confidential (c) 2024 FuriosaAI Inc.
$1 Trillion Long-Term Opportunity with Fast-Evolving AI
Chart: $30B today (Year 2023, marked "Now!") growing to $400B by Year 2027
Early stage of the dynamic and fast-changing AI landscape
Source: AMD & Nvidia announcements
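The two market figures on this slide imply a compound annual growth rate, which the short sketch below works out. It assumes the $30B figure refers to 2023 and the $400B figure to 2027, as the slide labels suggest; the function name `cagr` is illustrative and not from the source.

```python
def cagr(start_value: float, end_value: float, years: int) -> float:
    """Compound annual growth rate between two values `years` apart."""
    if start_value <= 0 or years <= 0:
        raise ValueError("start_value and years must be positive")
    return (end_value / start_value) ** (1 / years) - 1


# Assumption: $30B is the 2023 figure and $400B the 2027 figure.
START_YEAR, END_YEAR = 2023, 2027
implied_rate = cagr(30e9, 400e9, END_YEAR - START_YEAR)

if __name__ == "__main__":
    print(f"Implied CAGR {START_YEAR}-{END_YEAR}: {implied_rate:.1%}")
```

Under these assumptions the market would have to grow by roughly 91% per year, close to doubling annually over the four years.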
Key to winning AI chip architecture
“This makes TCP a great arch, not only for one use case
but reasonably future-proof to host new and upcoming
DNN architectures (based on current trends observed).”
– ISCA Reviewer
Context: The only other NPU architectures accepted by ISCA are Google's TPU and Groq
Energy efficiency
LLaMa 7B
Furiosa SW stack
Compilation flow: Python -> (Dynamo tracing) -> fx.GraphModule -> (Furiosa compiler) -> LLTC IR -> (codegen) -> RNGD ISA
Other components: LLM engine, quantizer, calibrator, runtime, device runtime
Memory           | LPDDR4X 16GB   | HBM3 48GB        | HBM3 96GB         | LPDDR5X 64GB     | HBM3E 288GB
Memory bandwidth | 66 GB/s        | 1.5 TB/s         | 3.0 TB/s          | 256 GB/s         | 8.0 TB/s
Power            | 60 W           | 150 W            | 350 W             | 60 W             | 600 W
Peak compute     | 64 TOPS (INT8) | 512 TFLOPS (FP8) | 1024 TFLOPS (FP8) | 230 TFLOPS (FP8) | 2 PFLOPS (FP8)
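Since the deck emphasizes energy efficiency, the spec figures above can be turned into per-watt ratios. The sketch below is a minimal illustration using only the numbers listed. The extracted text does not name the five products, so the labels "column 1" through "column 5" are placeholders, and the `Spec` class and `efficiency_table` helper are hypothetical names. Note that the first column is quoted in INT8 TOPS and the others in FP8 TFLOPS, so its ratio is not directly comparable with the rest. These are peak ratings, not measured results.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Spec:
    label: str            # placeholder; products are unnamed in the source
    peak_tera_ops: float  # peak compute in tera-operations per second
    precision: str        # number format the peak figure is quoted in
    bandwidth_gbs: float  # memory bandwidth in GB/s
    power_w: float        # power in watts

    @property
    def tera_ops_per_watt(self) -> float:
        return self.peak_tera_ops / self.power_w

    @property
    def gbs_per_watt(self) -> float:
        return self.bandwidth_gbs / self.power_w


# Values copied from the spec rows; TB/s and PFLOPS converted to GB/s and tera-ops.
SPECS = [
    Spec("column 1", 64, "INT8", 66, 60),
    Spec("column 2", 512, "FP8", 1500, 150),
    Spec("column 3", 1024, "FP8", 3000, 350),
    Spec("column 4", 230, "FP8", 256, 60),
    Spec("column 5", 2000, "FP8", 8000, 600),
]


def efficiency_table(specs):
    """Return (label, precision, tera-ops per watt, GB/s per watt) per spec."""
    return [
        (s.label, s.precision, s.tera_ops_per_watt, s.gbs_per_watt)
        for s in specs
    ]


if __name__ == "__main__":
    for label, precision, ops_w, bw_w in efficiency_table(SPECS):
        print(f"{label}: {ops_w:.2f} tera-ops/W ({precision}), {bw_w:.2f} GB/s per W")
```

Peak ratios like these say nothing about achieved utilization on a real model such as LLaMa 7B; they only normalize the headline numbers by power.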
Accepted by ISCA 2024
Co-authored with UW Madison
Presented at PyTorch Conference 2023
Presented at DVCON 2024