Forecasting at Uber: A Brief Survey: Andrea Pasqua
Forecasting at Uber: A Brief Survey: Andrea Pasqua
Forecasting at Uber: A Brief Survey: Andrea Pasqua
Andrea Pasqua
D T R
Supply Transaction Demand
Use Cases for Forecasting at Uber
● Time series are ubiquitous at Uber
○ Markets: descriptors for supply, demand, transactions for multi-sided
markets
R
P R
D
T
...
...
R
Use Cases for Forecasting at Uber
● Time series are ubiquitous at Uber
○ Markets: descriptors for supply, demand, transactions for multi-sided
markets
C E
T
R
Use Cases for Forecasting at Uber
● Time series are ubiquitous at Uber
○ Markets
○ Internal resources: compute, storage and data resources
Use Cases for Forecasting at Uber
● Time series are ubiquitous at Uber
○ Markets
○ Internal resources
○ Technical time series: states of micro-services
The Scale of our Domain
● Powerful combinatorics
○ By city, by neighbourhood, by market, by product, by app version
○ By micro-service, by trace, by product, by cohort
● Hybrid Approaches
○ Best in class
Generalized Linear Model (GLM)
● A linking function to constrain the nonlinear structure
Forecast (red) vs. Actual (black) for a large cluster Forecast (red) vs. Actual (black) for a small cluster
GLM: The Importance of Featurization
● This model learns across time series in a given city
● Some of the features convey past observed values of each series …
● … but geotemporal information is also a strong predictor
○ Neighborhood
○ Hour of day, day of week, etc.
GLM: The Importance of Featurization
● From one-hot encoding for geotemporal features
0 0 1 0 0 0 0 1 1 0 0 0 0 1 0 0 0 0
● To “fingerprinting”
○ i.e. robust aggregations of observed values by geotemporal units
○ Reduces the complexity of the model, while preserving all the relevant
information
Aggregation
Window
GLM: Aggregate Performance
● Incremental improvement over traditional methods
○ wMAPE: weighted Mean Absolute Percent Error is a common error
metric
wMAPE - ↓ 12.5%
Quantile Random Forest (QRF)
● Decision Trees
○ Greedy optimizer for Purity/Variance
Quantile Random Forest (QRF)
Mean
Quantile Random Forest (QRF)
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Long Short-Term Memory (LSTM)
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Long Short-Term Memory (LSTM)
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Long Short-Term Memory (LSTM)
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
NN: Typical Architectures
● A hybrid solution
● A Generalized Additive Model (GAM) for the baseline forecast
○ Six months out
○ Detrending with exponential smoothing
○ Capturing seasonality with semi-parametric splines
● Quantile Random Forest to learn multiplicative event lifts across time
series
An Event Lift Model
GAM +
Error Metric Region HW GAM
QRF
● The computational graph will vary from time series to time series
○ This mode is supported by several frameworks: Dynet, PyTorch and
more recently TensorFlow
Learn more about Forecasting at Uber
M4 Forecasting Competition: Introducing a New Hybrid ES-RNN Engineering Uncertainty Estimation in Neural Networks for Time
Model Series Prediction at Uber
Omphalos, Uber’s Parallel and Language-Extensible Time Series Forecasting at Uber: An Introduction
Backtesting Tool
Thank you!
Proprietary and confidential © 2018 Uber Technologies, Inc. All rights reserved. No part of this document may be reproduced or utilized in any
form or by any means, electronic or mechanical, including photocopying, recording, or by any information storage or retrieval systems, without
permission in writing from Uber. This document is intended only for the use of the individual or entity to whom it is addressed and contains
information that is privileged, confidential or otherwise exempt from disclosure under applicable law. All recipients of this document are notified
that the information contained herein includes proprietary and confidential information of Uber, and recipient may not make use of, disseminate,
or in any way disclose this document or any of the enclosed information to any person other than employees of addressee to the extent
necessary for consultations with authorized personnel of Uber.