0% found this document useful (0 votes)

239 views14 pages

Streamlit for ML Engineers: A Guide

The document introduces Streamlit, an app framework for machine learning engineers. Some key points: - Streamlit allows ML engineers to easily create interactive web apps from Python scripts without needing advanced coding skills or a dedicated tools team. Apps can include visualizations, live data, and neural network inference. - Streamlit apps are Python scripts that rerun from top to bottom on each user interaction. Caching allows reusing data and computations across runs for improved performance. - A demo app shows semantic visual search across a large image dataset and live neural network inference, implemented in just 300 lines of Python code with Streamlit. - Streamlit provides benefits like version control integration, live coding, and efficient computation

Uploaded by

Kashaf Bakali

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

239 views14 pages

Streamlit for ML Engineers: A Guide

Uploaded by

Kashaf Bakali

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Turn Python Scripts into Beautiful ML

Tools
Introducing Streamlit, an app framework built for ML engineers

Adrien Treuille
Oct 1 · 7 min read

Coding a semantic search engine with real-time neural-net inference in 300 lines of Python.

In my experience, every nontrivial machine learning project is eventually stitched

together with bug-ridden and unmaintainable internal tools. These tools — often a
patchwork of Jupyter Notebooks and Flask apps — are difficult to deploy, require
reasoning about client-server architecture, and don’t integrate well with machine
learning constructs like Tensorflow GPU sessions.

I saw this first at Carnegie Mellon, then at Berkeley, Google X, and finally while
building autonomous robots at Zoox. These tools were often born as little Jupyter
notebooks: the sensor calibration tool, the simulation comparison app, the LIDAR
alignment app, the scenario replay tool, and so on.

As a tool grew in importance, project managers stepped in. Processes sprouted.

Requirements flowered. These solo projects gestated into scripts, and matured into
gangly maintenance nightmares.

The machine learning engineers’ ad-hoc app building ow.

When a tool became crucial, we called in the tools team. They wrote fluent Vue and
React. They blinged their laptops with stickers about declarative frameworks. They had
a design process:
The tools team’s clean-slate app building ow.

Which was awesome. But these tools all needed new features, like weekly. And the
tools team was supporting ten other projects. They would say, “we’ll update your tool
again in two months.”

So we were back to building our own tools, deploying Flask apps, writing HTML, CSS,
and JavaScript, and trying to version control everything from notebooks to stylesheets.
So my old Google X friend, Thiago Teixeira, and I began thinking about the following
question: What if we could make building tools as easy as writing Python scripts?

We wanted machine learning engineers to be able to create beautiful apps without

needing a tools team. These internal tools should arise as a natural byproduct of the ML
workflow. Writing such tools should feel like training a neural net or performing an ad-
hoc analysis in Jupyter! At the same time, we wanted to preserve all of the flexibility of
a powerful app framework. We wanted to create beautiful, performant tools that
engineers could show off. Basically, we wanted this:

The Streamlit app building ow.

With an amazing beta community including engineers from Uber, Twitter, Stitch Fix,
and Dropbox, we worked for a year to create Streamlit, a completely free and open
source app framework for ML engineers. With each prototype, the core principles of
Streamlit became simpler and purer. They are:

#1: Embrace Python scripting. Streamlit apps are really just scripts that run from top
to bottom. There’s no hidden state. You can factor your code with function calls. If you
know how to write Python scripts, you can write Streamlit apps. For example, this is
how you write to the screen:

import streamlit as st

st.write('Hello, world!')

Nice to meet you.

#2: Treat widgets as variables. There are no callbacks in Streamlit! Every interaction
simply reruns the script from top to bottom. This approach leads to really clean code:

import streamlit as st

x = st.slider('x')
st.write(x, 'squared is', x * x)
An interactive Streamlit app in three lines of code.

#3: Reuse data and computation. What if you download lots of data or perform
complex computation? The key is to safely reuse information across runs. Streamlit
introduces a cache primitive that behaves like a persistent, immutable-by-default, data
store that lets Streamlit apps safely and effortlessly reuse information. For example,
this code downloads data only once from the Udacity Self-driving car project,
yielding a simple, fast app:

1 import streamlit as st
2 import pandas as pd
3
4 # Reuse this data across runs!
5 read_and_cache_csv = st.cache(pd.read_csv)
6
7 BUCKET = "https://streamlit-self-driving.s3-us-west-2.amazonaws.com/"
8 data = read_and_cache_csv(BUCKET + "labels.csv.gz", nrows=1000)
9 desired_label = st.selectbox('Filter to:', ['car', 'truck'])
10 st.write(data[data.label == desired_label])

cache_example.py hosted with ❤ by GitHub view raw

Using st.cache to persist data across Streamlit runs. To run this code, please follow these instructions.

The output of running the st.cache example above.

In short, Streamlit works like this:

1. The entire script is run from scratch for each user interaction.

2. Streamlit assigns each variable an up-to-date value given widget states.

3. Caching allows Streamlit to skip redundant data fetches and computation.

Or in pictures:

User events trigger Streamlit to rerun the script from scratch. Only the cache persists across runs.

If this sounds intriguing, you can try it right now! Just run:

$ pip install --upgrade streamlit

$ streamlit hello

You can now view your Streamlit app in your browser.

Local URL: http://localhost:8501

Network URL: http://10.0.1.29:8501

This will automatically pop open a web browser pointing to your local Streamlit app. If
not, just click the link.
To see more examples like this fractal animation, run streamlit hello from the command line.

. . .

Ok. Are you back from playing with fractals? Those can be mesmerizing.

The simplicity of these ideas does not prevent you from creating incredibly rich and
useful apps with Streamlit. During my time at Zoox and Google X, I watched as self-
driving car projects ballooned into gigabytes of visual data, which needed to be
searched and understood, including running models on images to compare
performance. Every self-driving car project I’ve seen eventually has had entire teams
working on this tooling.

Building such a tool in Streamlit is easy. This Streamlit demo lets you perform semantic
search across the entire Udacity self-driving car photo dataset, visualize human-
annotated ground truth labels, and run a complete neural net (YOLO) in real time
from within the app [1].
This 300-line Streamlit demo combines semantic visual search with interactive neural net inference.

The whole app is a completely self-contained, 300-line Python script, most of which is
machine learning code. In fact, there are only 23 Streamlit calls in the whole app. You
can run it yourself right now!

$ pip install --upgrade streamlit opencv-python

$ streamlit run
https://raw.githubusercontent.com/streamlit/demo-self-
driving/master/app.py

. . .

As we worked with machine learning teams on their own projects, we came to realize
that these simple ideas yield a number of important benefits:

Streamlit apps are pure Python files. So you can use your favorite editor and
debugger with Streamlit.
My favorite layout for writing Streamlit apps has VSCode on the left and Chrome on the right.

Pure Python scripts work seamlessly with Git and other source control software,
including commits, pull requests, issues, and comments. Because Streamlit’s
underlying language is pure Python, you get all the benefits of these amazing
collaboration tools for free 🎉.

Because Streamlit apps are just Python scripts, you can easily version control them with Git.
Streamlit provides an immediate-mode live coding environment. Just click Always
rerun when Streamlit detects a source file change.

Click “Always rerun” to enable live coding.

Caching simplifies setting up computation pipelines. Amazingly, chaining cached

functions automatically creates efficient computation pipelines! Consider this code
adapted from our Udacity demo:

1 import streamlit as st
2 import pandas as pd
3
4 @st.cache
5 def load_metadata():
6 DATA_URL = "https://streamlit-self-driving.s3-us-west-2.amazonaws.com/labels.csv.gz"
7 return pd.read_csv(DATA_URL, nrows=1000)
8
9 @st.cache
10 def create_summary(metadata, summary_type):
11 one_hot_encoded = pd.get_dummies(metadata[["frame", "label"]], columns=["label"])
12 return getattr(one_hot_encoded.groupby(["frame"]), summary_type)()
13
14 # Piping one st.cache function into another forms a computation DAG.
15 summary_type = st.selectbox("Type of summary:", ["sum", "any"])
16 metadata = load_metadata()
17 summary = create_summary(metadata, summary_type)
18 st.write('## Metadata', metadata, '## Summary', summary)
caching DAG example py hosted with ❤ by GitHub view raw
A simple computation pipeline in Streamlit. To run this code, please follow these instructions.

Basically, the pipeline is load_metadata → create_summary. Every time the script is run
Streamlit only recomputes whatever subset of the pipeline is required to get the
right answer. Cool!

To make apps performant, Streamlit only recomputes whatever is necessary to update the UI.

Streamlit is built for GPUs. Streamlit allows direct access to machine-level primitives
like TensorFlow and PyTorch and complements these libraries. For example in this
demo, Streamlit’s cache stores the entire NVIDIA celebrity face GAN [2]. This approach
enables nearly instantaneous inference as the user updates sliders.
This Streamlit app demonstrates NVIDIA celebrity face GAN [2] model using Shaobo Guan’s TL-GAN [3].

Streamlit is a free and open-source library rather than a proprietary web app. You
can serve Streamlit apps on-prem without contacting us. You can even run Streamlit
locally on a laptop without an Internet connection! Furthermore, existing projects can
adopt Streamlit incrementally.

Several ways incrementally adopt Streamlit. (Icons courtesy of fullvector / Freepik.)

. . .

This just scratches the surface of what you can do with Streamlit. One of the most
exciting aspects of Streamlit is how these primitives can be easily composed into
complex apps that look like scripts. There’s a lot more we could say about how our
architecture works and the features we have planned, but we’ll save that for future
posts.
Block diagram of Streamlit’s components. More coming soon!

We’re excited to finally share Streamlit with the community today and see what you all
build with it. We hope that you’ll find it easy and delightful to turn your Python scripts
into beautiful ML apps.

. . .

Thanks to Amanda Kelly, Thiago Teixeira, TC Ricks, Seth Weidman, Regan Carey, Beverly
Treuille, Geneviève Wachtell, and Barney Pell for their helpful input on this article.

References:

[1] J. Redmon and A. Farhadi, YOLOv3: An Incremental Improvement (2018), arXiv.

[2] T. Karras, T. Aila, S. Laine, and J. Lehtinen, Progressive Growing of GANs for
Improved Quality, Stability, and Variation (2018), ICLR.

[3] S. Guan, Controlled image synthesis and editing using a novel TL-GAN model (2018),
Insight Data Science Blog.

Thanks to TC Ricks, Amanda Kelly, and Amanda Kelly.

Machine Learning Data Science Deep Learning Autonomous Vehicles Python

About Help Legal

KNN in Python for Data Scientists
No ratings yet
KNN in Python for Data Scientists
7 pages
Data Engineering Standards Guide
No ratings yet
Data Engineering Standards Guide
3 pages
Data Structures and Algorithms With Python LetsUpgrade
No ratings yet
Data Structures and Algorithms With Python LetsUpgrade
11 pages
Deep Learning Most Important Ideas PDF
No ratings yet
Deep Learning Most Important Ideas PDF
16 pages
Swarm MultiAgents Financial Analyst Framework
No ratings yet
Swarm MultiAgents Financial Analyst Framework
9 pages
Super VIP Cheatsheet - Deep Learning
No ratings yet
Super VIP Cheatsheet - Deep Learning
47 pages
Deep Learning Lecture 0 Introduction Alexander Tkachenko
No ratings yet
Deep Learning Lecture 0 Introduction Alexander Tkachenko
31 pages
A Mindtree Publication I Volume 4 I 2018
No ratings yet
A Mindtree Publication I Volume 4 I 2018
44 pages
Cassandra Design Patterns Overview
No ratings yet
Cassandra Design Patterns Overview
32 pages
C# Tutorial - The Fundamentals You Need To Master C# - Edureka
No ratings yet
C# Tutorial - The Fundamentals You Need To Master C# - Edureka
55 pages
Automate Strategy Finding With LLM in Quant Invest
No ratings yet
Automate Strategy Finding With LLM in Quant Invest
13 pages
Guide To Interviewing Developers
No ratings yet
Guide To Interviewing Developers
14 pages
Graph Technology Buyers Guide EN A4
No ratings yet
Graph Technology Buyers Guide EN A4
34 pages
Event-Driven James Webb Space Telescope Operations Using On-Board JavaScripts
No ratings yet
Event-Driven James Webb Space Telescope Operations Using On-Board JavaScripts
10 pages
Software Engineering
No ratings yet
Software Engineering
2 pages
Fine-Tuning LLMs with PEFT & LoRa Techniques
No ratings yet
Fine-Tuning LLMs with PEFT & LoRa Techniques
25 pages
Embulk: Open-Source Data Loader Guide
No ratings yet
Embulk: Open-Source Data Loader Guide
36 pages
GitHub Trainings
No ratings yet
GitHub Trainings
5 pages
Data Science Career Boost Course
No ratings yet
Data Science Career Boost Course
7 pages
Esources: Python Python Modules SQL
No ratings yet
Esources: Python Python Modules SQL
5 pages
Scaler Data Science Program Overview
No ratings yet
Scaler Data Science Program Overview
18 pages
Healthcare Chatbot with Decision Tree
No ratings yet
Healthcare Chatbot with Decision Tree
3 pages
Title - Improving AI To Reach AGI
No ratings yet
Title - Improving AI To Reach AGI
2 pages
DevOps Engineer with 25+ Years Experience
No ratings yet
DevOps Engineer with 25+ Years Experience
1 page
EasySkill Brochure Final
No ratings yet
EasySkill Brochure Final
36 pages
Graph Rag Blank
No ratings yet
Graph Rag Blank
52 pages
Architecture Design and Principles
No ratings yet
Architecture Design and Principles
18 pages
Top Companies by Industry and Use Cases
No ratings yet
Top Companies by Industry and Use Cases
41 pages
Gradient Boosting for Load Forecasting
No ratings yet
Gradient Boosting for Load Forecasting
19 pages
Deeplearning - Ai Deeplearning - Ai
No ratings yet
Deeplearning - Ai Deeplearning - Ai
45 pages
.Net Core Best Practices Guide
No ratings yet
.Net Core Best Practices Guide
55 pages
Graph Databases for MDM Pros
100% (1)
Graph Databases for MDM Pros
14 pages
Scaling Laws For Neural Language Models
No ratings yet
Scaling Laws For Neural Language Models
30 pages
Prompt Engineering
No ratings yet
Prompt Engineering
1 page
GenAI Pinnacle
No ratings yet
GenAI Pinnacle
16 pages
Python Data Structures
No ratings yet
Python Data Structures
178 pages
Hands-On Deep Learning For Images With T PDF
No ratings yet
Hands-On Deep Learning For Images With T PDF
3 pages
AI Course Syllabus and Details
100% (1)
AI Course Syllabus and Details
18 pages
AI Automation Interview Booklet - Beginner Level
No ratings yet
AI Automation Interview Booklet - Beginner Level
10 pages
Evaluate RAG - Phoenix
No ratings yet
Evaluate RAG - Phoenix
25 pages
Statista, The AI Advantage Powering Business Competitiveness
No ratings yet
Statista, The AI Advantage Powering Business Competitiveness
25 pages
Generative Ai Leader Study Guide English
No ratings yet
Generative Ai Leader Study Guide English
11 pages
Int404 (Syllabus)
No ratings yet
Int404 (Syllabus)
1 page
WEEK - 5 SOLID Principles
No ratings yet
WEEK - 5 SOLID Principles
23 pages
AION Strategic Business Framework
No ratings yet
AION Strategic Business Framework
10 pages
Deploy Your Resume To Azure - Cheat Sheet
No ratings yet
Deploy Your Resume To Azure - Cheat Sheet
8 pages
A Tour of TensorFlow
No ratings yet
A Tour of TensorFlow
17 pages
Intel GenAI Hackathon
No ratings yet
Intel GenAI Hackathon
10 pages
AI's Impact on Cloud Growth Trends
No ratings yet
AI's Impact on Cloud Growth Trends
43 pages
Copilot Prompt
0% (1)
Copilot Prompt
2 pages
Brittany King Data Scientist Resume
No ratings yet
Brittany King Data Scientist Resume
1 page
PEFT Methods for Language Models
No ratings yet
PEFT Methods for Language Models
20 pages
Building genAI Products in Business
No ratings yet
Building genAI Products in Business
8 pages
Outline: How To Install Development Setup
No ratings yet
Outline: How To Install Development Setup
8 pages
Artificial Intelligence Definition, Ethics and Standards
33% (3)
Artificial Intelligence Definition, Ethics and Standards
12 pages
Face Recognition Using Facenet
No ratings yet
Face Recognition Using Facenet
46 pages
Azure Cognitive Services Openai PDF
No ratings yet
Azure Cognitive Services Openai PDF
246 pages
Purdue-Simplilearn AI & ML Program
No ratings yet
Purdue-Simplilearn AI & ML Program
35 pages
Streamlit: Simplify ML App Development
100% (1)
Streamlit: Simplify ML App Development
14 pages
Build A Database Website Using Streamlit Library: Most Accidents Zone Area Detection Website
No ratings yet
Build A Database Website Using Streamlit Library: Most Accidents Zone Area Detection Website
6 pages
Intelligent Waste Separator PDF
No ratings yet
Intelligent Waste Separator PDF
15 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning
7 pages
Backpropagation Algorithm Explained
No ratings yet
Backpropagation Algorithm Explained
11 pages
Introduction to Machine Learning Guide
No ratings yet
Introduction to Machine Learning Guide
6 pages
Psychosocial Treatments - 1st Edition Latest Edition Download
100% (17)
Psychosocial Treatments - 1st Edition Latest Edition Download
15 pages
AS3979-2006 - Hydrotherapy Pools
No ratings yet
AS3979-2006 - Hydrotherapy Pools
25 pages
Accessories - Caponord Aprilia
No ratings yet
Accessories - Caponord Aprilia
12 pages
E Nihss
No ratings yet
E Nihss
5 pages
Eclipse Phase Rulebook Errata
No ratings yet
Eclipse Phase Rulebook Errata
5 pages
Artificial Neural Networks Lecture Notes: Stephen Lucci, PHD
No ratings yet
Artificial Neural Networks Lecture Notes: Stephen Lucci, PHD
19 pages
Accounting For Managers
No ratings yet
Accounting For Managers
11 pages
SPW Notes Unit I Wave Optics
No ratings yet
SPW Notes Unit I Wave Optics
58 pages
Grade4 101304 5 9204
No ratings yet
Grade4 101304 5 9204
4 pages
08.03 RCA Apollo Presentation
No ratings yet
08.03 RCA Apollo Presentation
95 pages
Gas Transmission Registration
No ratings yet
Gas Transmission Registration
15 pages
Case Study-2 - Online Retail Data Pre-Processing
No ratings yet
Case Study-2 - Online Retail Data Pre-Processing
2 pages
ONGC Junior Project Fellow Recruitment
No ratings yet
ONGC Junior Project Fellow Recruitment
3 pages
Panorama Andino ISDA
No ratings yet
Panorama Andino ISDA
10 pages
BMN B maXX 2024 en Web
No ratings yet
BMN B maXX 2024 en Web
40 pages
CNS Unit 2
No ratings yet
CNS Unit 2
38 pages
Pauli Restau
No ratings yet
Pauli Restau
4 pages
Regression Performance Metrics Overview
No ratings yet
Regression Performance Metrics Overview
6 pages
Chapter - 4 Performance Management System
No ratings yet
Chapter - 4 Performance Management System
59 pages
Ba 3094
No ratings yet
Ba 3094
29 pages
Dresser Water Pipeline Repair Brochure
No ratings yet
Dresser Water Pipeline Repair Brochure
20 pages
God of Vengeance
No ratings yet
God of Vengeance
19 pages
UiTM Jengka Academic Affairs Overview
No ratings yet
UiTM Jengka Academic Affairs Overview
20 pages
Moisture Methods Plastic EN PDF
No ratings yet
Moisture Methods Plastic EN PDF
24 pages
The Fourth Phase of Water Explained
No ratings yet
The Fourth Phase of Water Explained
1 page
Ce-311 Sludge Processing
No ratings yet
Ce-311 Sludge Processing
21 pages
Theoretical and Conceptual Frameworks
83% (6)
Theoretical and Conceptual Frameworks
133 pages
Care Plan Pre-School
No ratings yet
Care Plan Pre-School
2 pages
MSDS Buffer Solution PH 4.0
No ratings yet
MSDS Buffer Solution PH 4.0
5 pages
Profanity - Wikipedia
No ratings yet
Profanity - Wikipedia
23 pages

Streamlit for ML Engineers: A Guide

Uploaded by

Streamlit for ML Engineers: A Guide

Uploaded by

Turn Python Scripts into Beautiful ML

In my experience, every nontrivial machine learning project is eventually stitched

As a tool grew in importance, project managers stepped in. Processes sprouted.

The machine learning engineers’ ad-hoc app building ow.

We wanted machine learning engineers to be able to create beautiful apps without

The Streamlit app building ow.

Nice to meet you.

cache_example.py hosted with ❤ by GitHub view raw

The output of running the st.cache example above.

In short, Streamlit works like this:

2. Streamlit assigns each variable an up-to-date value given widget states.

3. Caching allows Streamlit to skip redundant data fetches and computation.

$ pip install --upgrade streamlit

You can now view your Streamlit app in your browser.

Local URL: http://localhost:8501

$ pip install --upgrade streamlit opencv-python

Click “Always rerun” to enable live coding.

Caching simplifies setting up computation pipelines. Amazingly, chaining cached

Several ways incrementally adopt Streamlit. (Icons courtesy of fullvector / Freepik.)

[1] J. Redmon and A. Farhadi, YOLOv3: An Incremental Improvement (2018), arXiv.

Thanks to TC Ricks, Amanda Kelly, and Amanda Kelly.

Machine Learning Data Science Deep Learning Autonomous Vehicles Python

About Help Legal

You might also like