
MLflow Workshop Part 2

This document provides an overview of MLflow, focusing on its components such as Tracking, Projects, and Models, which facilitate the machine learning lifecycle. It emphasizes the importance of reproducibility in machine learning and outlines how MLflow helps package and manage data science code and models. Additionally, it includes examples and resources for further learning about MLflow functionalities and usage.


Platform for the Complete Machine Learning Lifecycle

Jules S. Damji
@2twitme

San Francisco | May 13, 2020: Part 2 of 3 Series


Outline – Introduction to MLflow: Understanding MLflow Projects and Models (Part 2)

§ Review & Recap of Part 1: MLflow Tracking
▪ https://youtu.be/x3cxvsUFVZA
§ MLflow Components
▪ MLflow Projects & Models
▪ Concepts and Motivations
▪ MLflow on Databricks Community Edition (DCE)
▪ Explore the MLflow UI
▪ Tutorials
§ Q&A

https://dbricks.co/mlflow-part-2
https://github.com/dmatrix/mlflow-workshop-project-expamle-1

Machine Learning Development is Complex
Traditional Software vs. Machine Learning

Traditional Software:
§ Goal: Meet a functional specification
§ Quality depends only on code
§ Typically pick one software stack with fewer libraries and tools

Machine Learning:
§ Goal: Optimize a metric (e.g., accuracy); constantly experiment to improve it
§ Quality depends on input data and tuning parameters
§ Compare and combine many libraries and models
Machine Learning Lifecycle

[Diagram: Raw Data → Data Prep → Training (model tuning, λ/θ) → Deploy, with model exchange and governance spanning the cycle, and "Scale" required at every stage]
MLflow Components

§ Tracking: Record and query experiments: code, data, config, and results
§ Projects: Package data science code in a format that enables reproducible runs on any platform
§ Models: Deploy machine learning models in diverse serving environments
§ Model Registry (new): Store, annotate, and manage models in a central repository

mlflow.org | github.com/mlflow/mlflow | twitter.com/MLflow | databricks.com
Model Development with MLflow is Simple!

data = load_text(file)
ngrams = extract_ngrams(data, N=n)
model = train_model(ngrams, learning_rate=lr)
score = compute_accuracy(model)

with mlflow.start_run() as run:
    mlflow.log_param("data_file", file)
    mlflow.log_param("n", n)
    mlflow.log_param("learn_rate", lr)
    mlflow.log_metric("score", score)
    mlflow.sklearn.log_model(model, "model")

$ mlflow ui

Track parameters, metrics, output files, and code version; search using the UI or API.
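Conceptually, a tracked run is just a record of parameters, metrics, and artifacts keyed by a run ID. The following pure-Python sketch (no MLflow dependency; all names are hypothetical stand-ins, not MLflow's implementation) mimics what `start_run`, `log_param`, and `log_metric` accumulate:

```python
import uuid
from contextlib import contextmanager

# Hypothetical in-memory stand-in for an MLflow tracking store.
RUNS = {}

@contextmanager
def start_run():
    run_id = uuid.uuid4().hex
    run = {"run_id": run_id, "params": {}, "metrics": {}}
    RUNS[run_id] = run
    yield run

def log_param(run, key, value):
    run["params"][key] = str(value)    # MLflow stores params as strings

def log_metric(run, key, value):
    run["metrics"][key] = float(value)  # metrics are numeric

with start_run() as run:
    log_param(run, "n", 2)
    log_param(run, "learn_rate", 0.01)
    log_metric(run, "score", 0.93)

print(run["params"])  # {'n': '2', 'learn_rate': '0.01'}
```

The real tracking server persists the same shape of data to a backend store, which is what the UI queries.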
MLflow Tracking

[Diagram: Notebooks, local apps, and cloud/Spark jobs log to a Tracking Server through the Python, Java, R, or REST API; the server stores parameters, metrics, and metadata in a backend store and artifacts/models in an artifact store, all browsable through the UI or API]

$ export MLFLOW_TRACKING_URI=<URI>
mlflow.set_tracking_uri(URI)
MLflow Projects: Motivation

Challenge: ML results are difficult to reproduce across a diverse set of tools and a diverse set of environments.

Projects: Package data science code in a format that enables reproducible runs on any platform.
MLflow Projects

[Diagram: A Project Spec bundles code, config, dependencies, and data, and supports both local and remote execution]
1. Example MLflow Project File

my_project/
├── MLproject
├── conda.yaml
├── main.py
└── model.py

# MLproject
conda_env: conda.yaml
entry_points:
  main:
    parameters:
      training_data: path
      lambda: {type: float, default: 0.1}
    command: python main.py {training_data} {lambda}

$ mlflow run git://<my_project>.git -P lambda=0.2
mlflow.run("git://<my_project>", parameters={..})
...
mlflow run . -e main -P lambda=0.2
2. Example conda.yaml

my_project/
├── MLproject
├── conda.yaml
├── main.py
└── model.py

# conda.yaml
name: mlflow-env
channels:
  - defaults
dependencies:
  - python=3.7.3
  - scikit-learn=0.20.3
  - pip:
    - mlflow
    - cloudpickle==0.8.0
MLflow Projects

Packaging format for reproducible ML runs
• Any code folder or GitHub repository
• MLproject file with project configuration

Defines dependencies for reproducibility
• Conda (+ R, Docker, …) dependencies can be specified in MLproject
• Reproducible in (almost) any environment

Execution API for running projects
§ CLI / Python / R / Java
§ Supports local and remote execution; the project URI is a directory path or Git URL containing an MLproject file
▪ mlflow run --help (CLI)
▪ mlflow run https://github.com/dmatrix/jsd-mlflow-examples.git#keras/imdbclassifier (CLI)
▪ mlflow.run(<project_uri>, parameters={}) or mlflow.projects.run(<project_uri>, parameters={}) (API)
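Under the hood, an entry-point command such as `python main.py {training_data} {lambda}` is a template resolved against the declared parameters, their defaults, and any `-P` overrides. A minimal illustrative sketch of that resolution (hypothetical helper, not MLflow's source):

```python
# Entry-point definition, mirroring the MLproject example above.
entry_point = {
    "parameters": {
        "training_data": {"type": "path"},              # required (no default)
        "lambda": {"type": "float", "default": 0.1},    # optional
    },
    "command": "python main.py {training_data} {lambda}",
}

def resolve_command(entry_point, user_params):
    """Fill the command template from -P overrides, then defaults."""
    values = {}
    for name, spec in entry_point["parameters"].items():
        if name in user_params:
            values[name] = user_params[name]
        elif "default" in spec:
            values[name] = spec["default"]
        else:
            raise ValueError(f"missing required parameter: {name}")
    return entry_point["command"].format(**values)

cmd = resolve_command(entry_point, {"training_data": "data.csv", "lambda": 0.2})
print(cmd)  # python main.py data.csv 0.2
```

This is why `mlflow run git://<my_project>.git -P lambda=0.2` works without specifying `training_data`-style defaults on the command line when the MLproject file provides them.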
Anatomy of MLflow Project Execution

1. $ mlflow run https://github.com/mlflow-project-example-1
2. Fetch the GitHub project into a /var/folders/xxx directory
3. Create a conda env (mlflow-<run_id>) and activate it
4. Install packages and dependencies from conda.yaml
5. In the activated conda environment, execute your entry point:
   python train.py args, …, args
How to Build an MLflow Project

1. Create an MLproject file; populate it with entry points, parameter types, and default parameters
2. Create a conda.yaml file; populate it with dependencies (or copy from your MLflow UI: artifacts -> Model -> conda.yaml)
3. Create a GitHub repository; populate or upload MLproject, conda.yaml, data, src files, etc.
4. Test it:
   • mlflow run git://URI -P arg.. -P args
   • mlflow.run(URI, parameters={})
   Then share it …
MLflow Project: Create a Multi-Step Workflow

https://github.com/mlflow/mlflow/tree/master/examples/multistep_workflow
MLflow Models: Motivation

[Diagram: Without a standard model format, supporting every ML framework in every serving tool requires an N×M combination of inference code for batch & stream scoring]
MLflow Models

[Diagram: A standard model format with multiple "flavors" sits between ML frameworks and serving tools, so inference code and batch & stream scoring target one format instead of N×M combinations]
Example MLflow Model

mlflow.tensorflow.log_model(...)

my_model/
├── MLmodel
└── estimator/
    ├── saved_model.pb
    └── variables/
        ...

# MLmodel
run_id: 769915006efd4c4bbd662461
time_created: 2018-06-28T12:34
flavors:
  tensorflow:                         # usable by tools that understand the TensorFlow model format
    saved_model_dir: estimator
    signature_def_key: predict
  python_function:                    # usable by any tool that can run Python (Docker, Spark, etc.!)
    loader_module: mlflow.tensorflow
Model Flavor Example: Keras

mlflow.keras.log_model(…)   # train a model, then log it

Flavor 1: pyfunc
model = mlflow.pyfunc.load_model(…)
model.predict(pandas_input_dataframe)

Flavor 2: Keras
model = mlflow.keras.load_model(…)
model.predict(keras.Input(…))
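The flavor mechanism is essentially a dispatch table: the MLmodel file maps each flavor name to loader configuration, and a tool loads the model through whichever flavor it supports. A minimal pure-Python sketch of the idea (hypothetical names, toy "model", no MLflow dependency):

```python
# Illustrative sketch of flavor dispatch (not MLflow's implementation).
mlmodel = {
    "flavors": {
        "keras": {"loader": "keras_loader"},
        "python_function": {"loader": "pyfunc_loader"},
    }
}

def pyfunc_loader(path):
    # Generic flavor: return a predict(rows) callable.
    # The "model" here is a toy that sums each input row.
    return lambda rows: [sum(r) for r in rows]

LOADERS = {"pyfunc_loader": pyfunc_loader}

def load_model(mlmodel, flavor, path="my_model/"):
    cfg = mlmodel["flavors"][flavor]
    return LOADERS[cfg["loader"]](path)

predict = load_model(mlmodel, "python_function")
print(predict([[1, 2], [3, 4]]))  # [3, 7]
```

A serving tool that only knows `python_function` and a Keras-aware tool can thus load the very same saved model, each through its own flavor entry.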
Model Flavors Example

model = mlflow.pyfunc.load_model(model_uri)

model.predict(pandas_input_dataframe)
MLflow Models

Packaging format for ML models
• Any directory with an MLmodel file

Defines dependencies for reproducibility
• Conda environment can be specified in the MLmodel configuration

Model creation and loading utilities
• mlflow.<model_flavor>.save_model(…) or log_model(…)
• mlflow.<model_flavor>.load_model(…)

Deployment APIs
• CLI / Python / R / Java
• mlflow models [OPTIONS] COMMAND [ARGS]...
• mlflow models serve [OPTIONS] [ARGS] …
• mlflow models predict [OPTIONS] [ARGS] ...
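A model started with `mlflow models serve` is scored over HTTP by POSTing JSON to its scoring endpoint. The exact JSON schema varies with the MLflow version; the sketch below builds one commonly documented shape (the `dataframe_split` wrapper), with hypothetical column names and URL:

```python
import json

# Build a scoring request body for a served model's /invocations endpoint.
# Column names, values, and the URL below are illustrative assumptions.
payload = {
    "dataframe_split": {
        "columns": ["sepal_len", "sepal_wid"],
        "data": [[5.1, 3.5], [6.2, 2.9]],
    }
}
body = json.dumps(payload)

# A client would POST this with Content-Type: application/json, e.g.:
#   curl -X POST http://127.0.0.1:5000/invocations \
#        -H "Content-Type: application/json" -d "$body"
decoded = json.loads(body)
print(decoded["dataframe_split"]["data"][0])  # [5.1, 3.5]
```

Because the request targets the generic `python_function` flavor, the same payload shape works regardless of which framework trained the underlying model.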
MLflow Projects & Models Tutorials

Tutorials: https://github.com/dmatrix/mlflow-workshop-part-2

MLflow Project Keras Example:
https://github.com/dmatrix/mlflow-workshop-project-expamle-1
Learning More About MLflow

§ pip install mlflow to get started
§ Find docs & examples at mlflow.org
§ Peruse code on the MLflow GitHub
§ Join the Slack channel
§ More MLflow tutorials
Thank you! 😊
Q&A
[email protected]
@2twitme
https://www.linkedin.com/in/dmatrix/
