
Machine Learning Operations - MLOps

Getting from Good to Great

Michal Maciejewski, PhD

Acknowledgements: Dejan Golubovic, Ricardo Rocha, Christoph Obermair, Marek Grzenkowicz


Alice: input X → ML Model → Y = f(X)

Bob: Y = f(X)

Let’s share our model with users, aka let’s put it into production! 2
What Has to Go Right?

What is needed for an ML model to perform well in production? 3


What Can Go Wrong?

Concept and data drift are among the main challenges of production ML systems!

4
MLOps is about maintaining the trained model's performance* in production.
The performance may degrade due to factors outside of our control,
so we ought to monitor it and, if needed, roll out a new model to users.

*model performance = accuracy, latency, jitter, etc. 5


ML Model = Data + Code
  + Algorithm
  + Weights
  + Hyperparameters

MLOps = ML Model + Software
  + Scripts
  + Libraries
  + Infrastructure
  + DevOps

6
MLOps = ML Model + Software
[Figure, after D. Sculley et al.: your ML code (built on an ML framework) is only a small box at the centre of a real-world ML system, surrounded by configuration, data collection, data verification, feature extraction, machine resource management, analysis tools, process management tools, ML model serving infrastructure, and monitoring.]
Good news: most of these components come as ready-to-use frameworks

D. Sculley et al., Hidden Technical Debt in Machine Learning Systems, NIPS 2015 7
MLOps Pipeline

Data Engineering Modelling Deployment Monitoring

MLOps is a multi-stage, iterative process. 8


Data Engineering
Reproducibility
Traceability
Data-driven ML

Data Engineering Modelling Deployment Monitoring 9


[Figure: an ML model as a function, f(input) = output]
10
Exploratory Data Analysis

For structured data:
- schema: required tables, columns, and datatypes

For unstructured data:
- resolution, image extension
- frequency, duration, audio codec

Initial exploration allows identifying requirements for input data in production. 11
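As a minimal sketch of such a schema check for structured data (the column names and dtypes below are hypothetical), pandas can verify that production inputs match the requirements identified during exploration:

```python
import pandas as pd

# Hypothetical schema derived from exploratory data analysis:
# expected columns and their dtypes for production input data.
EXPECTED_SCHEMA = {
    "magnet_id": "int64",
    "voltage": "float64",
    "timestamp": "datetime64[ns]",
}

def check_schema(df: pd.DataFrame) -> list:
    """Return a list of schema violations; an empty list means the data conforms."""
    problems = []
    for column, dtype in EXPECTED_SCHEMA.items():
        if column not in df.columns:
            problems.append(f"missing column: {column}")
        elif str(df[column].dtype) != dtype:
            problems.append(f"{column}: expected {dtype}, got {df[column].dtype}")
    return problems
```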


Data Processing Pipeline

Data Ingestion: load from file, load from db
Data Validation: schema check, audio/video file check
Data Cleaning: filling NaNs, filtering, normalization, standardization
Feature Engineering: feature selection, feature crossover

We need to reproduce some of those steps (e.g. subtracting the mean) in production! 12
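One way to make a step such as mean subtraction reproducible in production is to fit the transformation once on the training data and persist it next to the model; a minimal sketch with scikit-learn and joblib (X_train and X_prod are placeholders):

```python
import joblib
from sklearn.preprocessing import StandardScaler

# Training time: fit the normalization on the training data only,
# then persist it so the exact same mean/std is used in production.
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)   # X_train: training feature matrix
joblib.dump(scaler, "scaler.joblib")

# Production time: load the fitted scaler and only transform incoming data,
# never fit it again on production inputs.
scaler = joblib.load("scaler.joblib")
X_prod_scaled = scaler.transform(X_prod)         # X_prod: production feature matrix
```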


Reproducibility
[Figure: a dataset passing through Excel spreadsheets, various scripts, and notebooks on its way to a curated dataset; such ad-hoc processing is hard to reproduce.]

https://sites.google.com/princeton.edu/rep-workshop/ 13
Keeping Track of Data Processing

• Version Input Data – DVC framework
• Version Processing Script – GitLab
• Version Computing Environment – Docker

Data Provenance – where does the data come from?

Data Lineage – how is the data manipulated? 14
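As one possible sketch of data versioning with DVC (the repository URL, file path, and tag below are placeholders), a pinned revision of the input data can be read through DVC's Python API:

```python
import dvc.api

# Read a specific, versioned revision of the input data.
with dvc.api.open(
    "data/signals.csv",                              # path tracked by DVC in the repo
    repo="https://gitlab.example.com/team/project.git",
    rev="v1.0",                                      # Git tag/commit pinning the dataset version
) as f:
    raw = f.read()
```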
Notebook Good Practices
• Linear flow of execution
• Keep the amount of code small
• Extract reusable code into a package
• Use pre-commit hooks to clean notebooks before committing to a repository
• Set parameters at the top so that the notebook can be treated as a function (papermill and scrapbook packages; see the sketch below)

It is OK to do quick & dirty exploratory model development.

Once we start communicating the model outside, we need to clean it up! 15
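A minimal sketch of the parameterized-notebook idea with papermill (notebook names and parameters are illustrative; the notebook needs a cell tagged "parameters" whose defaults get overridden):

```python
import papermill as pm

# Execute the notebook like a function: the values below override the
# defaults defined in the notebook's "parameters" cell.
pm.execute_notebook(
    "train_model.ipynb",
    "out/train_model_lr_0.01.ipynb",
    parameters={"learning_rate": 0.01, "epochs": 20},
)
```

The scrapbook package can then be used inside the notebook to record results that the calling code reads back.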
From Model-driven to Data-driven ML

                   | Model-driven ML    | Data-driven ML
Fixed component    | Dataset            | Model architecture
Variable component | Model architecture | Dataset
Objective          | High accuracy      | Fairness, low bias
Explainability     | Limited            | Possible
https://datacentricai.org
16
https://spectrum.ieee.org/andrew-ng-data-centric-ai
Modelling
Training challenges
Rare events
Analyzing results

Data Engineering Modelling Deployment Monitoring 17


Selecting Data for Training

Dataset split: Training (80%) | Validation (20%)

Train on the training split, validate on the validation split, and feed the validation results back into hyperparameter tuning.

With this approach, the model eventually sees the entire dataset. 18
Selecting Data for Training

Dataset split: Training (75%) | Validation (15%) | Test (10%)

Train on the training split, use the validation split for hyperparameter tuning, and keep the test split for a final check.

Splitting the dataset in three allows a final check with unseen data. 19
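A minimal sketch of such a three-way split with scikit-learn (X and y are placeholders for features and labels); the test split is carved out first, then the remaining 90% is divided so that the final proportions are 75/15/10:

```python
from sklearn.model_selection import train_test_split

# 10% test split first, then 15/90 of the remainder for validation.
X_rest, X_test, y_rest, y_test = train_test_split(X, y, test_size=0.10, random_state=42)
X_train, X_val, y_train, y_val = train_test_split(
    X_rest, y_rest, test_size=0.15 / 0.90, random_state=42
)
```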
Balancing Datasets
Consider a binary classification problem with a dataset composed of 200 entries.
There are 160 negative examples (no failure) and 40 positive ones (failure).

Expected (stratified) split: Training 75% (120 negative + 30 positive) | Validation 15% (24 + 6) | Test 10% (16 + 4)

Random split: Training 75% (131 + 19) | Validation 15% (19 + 11) | Test 10% (10 + 10)

For continuous values it is important to preserve the statistical distribution across splits.

Although for big datasets this is not an issue, it is still low-hanging fruit. 20
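A sketch of how a stratified split preserves the 160/40 class balance of the example above (scikit-learn's stratify argument does the bookkeeping):

```python
import numpy as np
from sklearn.model_selection import train_test_split

# 200 entries: 160 negative (0) and 40 positive (1), as in the example above.
y = np.array([0] * 160 + [1] * 40)
X = np.arange(200).reshape(-1, 1)          # dummy features for illustration

# stratify preserves the 80/20 class ratio in every split.
X_rest, X_test, y_rest, y_test = train_test_split(
    X, y, test_size=0.10, stratify=y, random_state=0
)
X_train, X_val, y_train, y_val = train_test_split(
    X_rest, y_rest, test_size=0.15 / 0.90, stratify=y_rest, random_state=0
)

print(np.bincount(y_train), np.bincount(y_val), np.bincount(y_test))
# roughly: [120  30] [24  6] [16  4]
```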
Rare Events

There were 3130 healthy signals (Y=False) and 112 faulty ones (Y=True)
C. Obermair, Extension of Signal Monitoring Applications with Machine Learning, Master Thesis, TU Graz
M. Brice, LHC tunnel Pictures during LS2, https://cds.cern.ch/images/CERN-PHOTO-201904-108-15 21
Rare Events

A naive model that always predicts "no failure" is guaranteed to achieve 97% average dataset accuracy?! 22


Rare Events

Confusion matrix of the naive model (it never predicts Y = True):

                 | Ground truth Y = True   | Ground truth Y = False
Model Y = True   | 0 (true positives)      | 0 (false positives)
Model Y = False  | 112 (false negatives)   | 3130 (true negatives)

Since TP = FP = 0, the average accuracy reduces to
Avg accuracy = TN / (TN + FN) = 3130 / (3130 + 112) ≈ 97%

Precision = TP / (TP + FP) = 0 / 0 (undefined)
Recall = TP / (TP + FN) = 0 / (0 + 112) = 0
F1 score = 2 / (1/Precision + 1/Recall)

It is a valuable conversation to decide whether precision or recall (or both) is more important. 23
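The same numbers can be reproduced with scikit-learn (a sketch; the naive model is simulated by predicting False for every signal):

```python
import numpy as np
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

# Ground truth: 112 faulty (True) and 3130 healthy (False) signals.
y_true = np.array([True] * 112 + [False] * 3130)
# Naive model: always predict "no failure".
y_pred = np.zeros_like(y_true)

print(accuracy_score(y_true, y_pred))                     # ~0.965
print(precision_score(y_true, y_pred, zero_division=0))   # 0.0 (no positive predictions)
print(recall_score(y_true, y_pred, zero_division=0))      # 0.0
print(f1_score(y_true, y_pred, zero_division=0))          # 0.0
```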


Data Augmentation

- New examples obtained by shifting the region left and right
- New examples obtained by rotating/shifting/hiding

JH. Kim et al. Hybrid Integration of Solid-State Quantum Emitters on a Silicon Photonic Chip, Nano Letters 2017 24
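For signals, a minimal augmentation sketch in NumPy that creates new examples by shifting left/right (note that np.roll wraps values around the edges, which is acceptable here only for illustration):

```python
import numpy as np

def shift_signal(signal: np.ndarray, max_shift: int, n_copies: int, seed: int = 0) -> np.ndarray:
    """Create new training examples by shifting a 1-D signal by random offsets."""
    rng = np.random.default_rng(seed)
    shifts = rng.integers(-max_shift, max_shift + 1, size=n_copies)
    return np.stack([np.roll(signal, s) for s in shifts])

# Example: 5 shifted copies of a synthetic signal, each moved by at most 10 samples.
signal = np.sin(np.linspace(0, 4 * np.pi, 200))
augmented = shift_signal(signal, max_shift=10, n_copies=5)
```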
What else can we do?
When one of the values of Y is rare in the population, considerable
resources in data collection can be saved by randomly selecting within
categories of Y. […]
The strategy is to select on Y by collecting observations (randomly or all
those available) for which Y = 1 (the "cases") and a random selection of
observations for which Y = 0 (the "controls").

We can also collect more data of a particular class (if at all possible).
G. King and L. Zeng, “Logistic Regression in Rare Events Data,” Political Analysis, p. 28, 2001.
https://en.wikipedia.org/wiki/Cross-validation_(statistics) 25
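A sketch of the case-control idea in pandas (the dataframe and its "failure" label column are hypothetical): keep all rare positive cases and a random sample of negative controls. Note that this changes the class priors, and King and Zeng describe how to correct for that in logistic regression.

```python
import pandas as pd

def case_control_sample(df: pd.DataFrame, label: str = "failure",
                        controls_per_case: int = 3, seed: int = 0) -> pd.DataFrame:
    """Keep all rare positive cases and a random subset of negative controls."""
    cases = df[df[label] == 1]
    controls = df[df[label] == 0].sample(n=controls_per_case * len(cases), random_state=seed)
    return pd.concat([cases, controls]).sample(frac=1, random_state=seed)   # shuffle rows
```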
Training Tracking
1. Pen & Paper
2. Spreadsheet
3. Dedicated framework
- Weights and Biases
- Neptune.ai
- TensorBoard
- …

26
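As an illustration of dedicated experiment tracking (a sketch using Weights and Biases; the project name and the train_one_epoch helper are placeholders):

```python
import wandb

run = wandb.init(project="mlops-demo", config={"learning_rate": 0.01, "epochs": 20})
for epoch in range(run.config["epochs"]):
    # train_one_epoch is a placeholder for the actual training step.
    train_loss, val_accuracy = train_one_epoch()
    wandb.log({"epoch": epoch, "train_loss": train_loss, "val_accuracy": val_accuracy})
run.finish()
```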
Error Analysis

[Table: error analysis of individual examples. Each signal (Magnet 1, Magnet 2, Magnet 3) is tagged with the error categories it exhibits: noise, gap in signal, bias, wrong sampling.]

Such analysis may reveal issues with labelling or rare classes in the data.
For unstructured data, a cockpit (dashboard) could help in the analysis.
It is also useful for monitoring certain classes of inputs. 27
Deployment
Degrees of automation
Modes of deployment
Reproducible environments

Data Engineering Modelling Deployment Monitoring 29


Degrees of Automation
Human inspection → Shadow mode → Human in the loop → Full automation

Starting from Shadow mode we can collect more training data in production!
C. Obermair, Extension of Signal Monitoring Applications with Machine Learning, Master Thesis, TU Graz 30
Modes of Deployment

[Figure: a router splits traffic, sending 100-X% of requests to the old model version and X% to the new version.]

- In Canary deployment there is a gradual switch between versions


- In Blue/green deployment there is an on/off switch between versions

https://hbr.org/2017/09/the-surprising-power-of-online-experiments
https://en.wikipedia.org/wiki/Blue-winged_parrot 31
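In practice the traffic split is handled by the serving layer (e.g. KServe), but the canary idea itself fits in a few lines; a sketch with illustrative names, routing X% of requests to the new version:

```python
import random

CANARY_FRACTION = 0.05   # X% of traffic goes to the new model version

def route(request, old_model, new_model):
    """Send roughly CANARY_FRACTION of requests to the new version, the rest to the old one."""
    model = new_model if random.random() < CANARY_FRACTION else old_model
    return model.predict(request)
```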
Reproducible Environments

[Figure: two request/response serving setups. Left, Docker containers: an HTTP server exposing a REST API, the data pipeline, the ML model, and the computing environment (OS, Python, packages). Right, serverless compute: KServe with a config file and a pool of models running on shared computing infrastructure.]


We will play with those during the exercise sessions! 32
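A minimal sketch of the left-hand setup, an HTTP server with a REST API in front of a model (FastAPI is used here as one option; model.joblib and the feature layout are placeholders):

```python
import joblib
import numpy as np
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()
model = joblib.load("model.joblib")        # placeholder for the trained model artifact

class PredictRequest(BaseModel):
    features: list[float]                  # flat list of numeric features

@app.post("/predict")
def predict(request: PredictRequest):
    x = np.array(request.features, dtype=float).reshape(1, -1)
    return {"prediction": model.predict(x).tolist()}
```

Run locally with, for example, uvicorn app:app and send POST requests to /predict.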
Monitoring
Useful metrics
Relevant frameworks

Data Engineering Modelling Deployment Monitoring 33


Relevant Metrics
• Model metrics
• Distribution of input features – data/concept drift
• Missing/malformed values in the input
• Average output accuracy/classification distribution – concept drift

• Infrastructure metrics
• Logging errors
• Memory and CPU utilization
• Latency and jitter

For each of the relevant metrics one should define warning/error thresholds. 35
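A sketch of a drift check for one input feature (the 0.01 threshold is illustrative): compare the production distribution against the training distribution with a two-sample Kolmogorov-Smirnov test.

```python
from scipy.stats import ks_2samp

def drifted(train_values, production_values, alpha: float = 0.01) -> bool:
    """Flag data drift when the two samples are unlikely to come from the same distribution."""
    statistic, p_value = ks_2samp(train_values, production_values)
    return p_value < alpha
```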
Monitoring Matters

C. Obermair, Extension of Signal Monitoring Applications with Machine Learning, Master Thesis, TU Graz 36
Data Engineering Modelling Deployment Monitoring

37
MLOps Pipeline with Tensorflow
Pipeline represented as a DAG (directed acyclic graph)
Data Engineering

Modelling

Deployment

https://www.tensorflow.org/tfx/guide 38
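A rough sketch of such a DAG in TFX, loosely following the public TFX tutorials (paths, names, and the trainer module are placeholders, and the exact API may differ between TFX versions):

```python
from tfx import v1 as tfx

# Two-step DAG: data ingestion followed by training.
example_gen = tfx.components.CsvExampleGen(input_base="data/")
trainer = tfx.components.Trainer(
    module_file="trainer_module.py",                  # user-provided training code
    examples=example_gen.outputs["examples"],
    train_args=tfx.proto.TrainArgs(num_steps=100),
    eval_args=tfx.proto.EvalArgs(num_steps=10),
)

pipeline = tfx.dsl.Pipeline(
    pipeline_name="mlops_demo",
    pipeline_root="pipeline_root/",
    components=[example_gen, trainer],
)
tfx.orchestration.LocalDagRunner().run(pipeline)
```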
MLOps Pipeline with Kubeflow

Data Engineering

Modelling

https://ml.cern.ch
Deployment
https://www.kubeflow.org/docs/started/ 39
Conclusion
                | Development ML       | Production ML
Objective       | High-accuracy model  | Efficiency of the overall system
Dataset         | Fixed                | Evolving
Code quality    | Secondary importance | Critical
Model training  | Optimal tuning       | Fast turn-arounds
Reproducibility | Secondary importance | Critical
Traceability    | Secondary importance | Critical

I do hope the presented MLOps concepts will allow your models to transition
from Good to Great. 40
Resources

Machine Learning Engineering for Production (MLOps) Specialization

41
