0% found this document useful (0 votes)

30 views32 pages

Machine Learning (ML) and ML Engineering: CSE 473 24wi

Uploaded by

Nikolaos Tsinganos

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views32 pages

Machine Learning (ML) and ML Engineering: CSE 473 24wi

Uploaded by

Nikolaos Tsinganos

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

Machine Learning (ML) and

ML Engineering

CSE 473 24wi

Rob Minneker
Administrivia

Due date changes:

PM3: Due Feb 13th. Late deadline Feb 14th. Working to get autograder ﬁxed.

PM 4: Due Feb 26th

RL: Due Feb 19th (out now!)

Uncertainty: Due Feb 23rd

ML: Cancelled

DL/NLP unchanged: Due March 1/8 respectively

ML as a subset of AI

We are getting “narrow” in this course

Honing in on data-driven decision making

New programming paradigm

- Data writes the program not the

programmer
Make a cat/dog image classiﬁer
Make a cat/dog image classiﬁer
ML Models learn from data

Remember how Q-learning updated it’s estimates (from estimates i.e.

bootstrapping)?

- This is a form of machine learning. We use data to ﬁnd the patterns/rules and
then the model executes the logic
- This is in opposition to a programmer explicitly writing the logic

Notice how in the second example, we don’t have any rules. The model does all the
work.
Machine learning systems

The model is only a small part of the overall

system!

ML engineers cover everything from data

collection to deployment

CSE 446 will cover mostly the ML algos

aspect

Credit: Chip Huyen

Machine learning systems in production

ML development is an iterative process

with a lot of back-and-forth

Everything in production is a trade-off

- Memory, time, space, budget, latency,
accuracy, precision, recall, safety,
reliability, maintainability,
extensibility, etc.

Credit: Chip Huyen

Some ML fundamentals

Garbage in → Garbage out

- The quality of your data matters A LOT

Not all problems should be solved with ML!

- When you have a hammer everything looks like a nail

Models are not static

- They must change over time like the users and their data do
ML System Design Example

Prompt: Design the Twitter recommendation engine

Questions to consider:

- What data do you need? How will you collect it? How much do you need?
- What kind of model(s) will you use? How will you train/evaluate it?
- Will it be online or ofﬂine learning? Where will the model live?
- Will there be personalization? If so, how?
- Is there NSFW content that needs to be ﬁltered out?
- Will you process individual tweets or batches? How and why?
- What will the end-user experience look like? How will ML enhance this?
ML System Design: Your turn!

Design the YouTube video recommendation engine

Work together with folks around you!

We’ll come back together to discuss.

ML Engineering Day 2

CSE 473 24wi

Rob Minneker
Agenda

ML (eng) fundamentals continued

ML System Design continued

Types of ML

https://resources.experfy.com/ai-ml/coding-deep-learn
ing-for-beginners-types-of-machine-learning/
Applications: NLP, Computer Vision, Robotics, Comp.
Bio., Interactive learning, Convex optimization, etc.

Adjacent ﬁelds: Databases, Data Viz, Compilers,

Types of ML Distributed Systems, etc.

ML, DL, ML for Big

Data ML, DL, NLP

RL https://resources.experfy.com/ai-ml/coding-deep-learn
ing-for-beginners-types-of-machine-learning/
Example ML Workﬂow (greatly simpliﬁed)

1. Experiment in Jupyter Notebooks / Google Colab

a. Find model candidates
2. Scale up training jobs as needed
a. Specialized libraries (Microsoft deepspeed, etc.) + specialized compute
GPU/TPU, etc.
3. Export models for production inference
a. Export model to production format: e.g. coreml (mobile), onnx (server) etc.
4. Build production pipeline around the model
a. Optimize business metrics
5. Repeat 1-4 as data changes, needs change, tech changes, etc.
Keeping up with the times

ML Engineering is a ﬁeld that changes every single day

Reading research papers, (engineering) blogs, OSS repos, documentation, etc.

Learn to be uncomfortable

- Many different mathematical notations, many different libraries, frameworks,

etc. constantly using new tools or developing them
ML System Design: Your turn!

Design the TikTok video recommendation engine

Work together with folks around you!

We’ll come back together to discuss.

The real TikTok engine
ML System Design extra resources

Textbook:

Designing Machine Learning Systems: An Iterative Process for Production-Ready

Applications by Chip Huyen

Online resources:

ByteByteGo: Machine Learning System Design Interview

Company engineering blogs: e.g. Uber Engineering blog

ML Engineering Day 3

CSE 473 24wi

Rob Minneker
Content adapted from Chip Huyen
https://huyenchip.com/2022/02/07/data-distribution-shifts-and-monitoring.html
Administrivia

Uncertainty HW released (due next Friday 2/23)

- Checkout Berkeley lectures on Markov Models and HMMs. Links are in course
calendar and posted on Ed.
Agenda

ML (eng) fundamentals continued

- Data distribution shifts

ML system failures

“ML engineering is more engineering than ML”

Most of the failures of an ML system are due to distributed system failures

- Pure software engineering issues, nothing to do with the model itself. E.g.
dependency failure, deployment failure, hardware failure, downtime/crash, etc.

Some of the time the model/data is at fault

Data distribution shifts

Big issue in supervised learning (most common models deployed in production)

- ML models trained from labeled datasets

- Inputs (covariates): X
- Outputs (labels): Y

Three major types of shift:

1. Covariate shift
2. Label shift
3. Concept drift
Data distribution shifts

In supervised learning, the training data can be viewed as samples from the joint
distribution: P(X, Y)
Remember, we can decompose the joint distribution two ways:
P(X, Y) = P(Y|X)*P(X) [eqn. 1]
P(X, Y) = P(X|Y)*P(Y) [eqn. 2]
Covariate shift: P(X) changes, but P(Y|X) remains the same. [eqn. 1]
Label shift: P(Y) changes, but P(X|Y) remains the same. [eqn. 2]
Concept drift: P(Y|X) changes, but P(X) remains the same. [eqn. 1]
Covariate shift

P(X, Y) = P(Y|X)*P(X)

Covariate shift: P(X) changes, but P(Y|X) remains the same.

I.e. the distribution of the input changes, but the conditional probability of a label
given an input remains the same.

Most widely studied form of data distribution shift

Essentially the input distribution at training time differs from inference time. Many
causes of this..
Label shift (aka prior shift)
P(X, Y) = P(X|Y)*P(Y)
Label shift: P(Y) changes, but P(X|Y) remains the same.
I.e. You can think of this as the case when the output distribution changes but for a
given output, the input distribution stays the same.
“When the input distribution changes, the output distribution also changes, resulting
in both covariate shift and label shift happening at the same time.”
E.g. predicting cancer incidence from age. If everyone takes an effective anti-cancer
drug P(Y|X) reduces for everyone. Imagine age is your only input. Age distribution
remains constant while true incidence of cancer decreases.
Concept drift (aka posterior shift)

P(X, Y) = P(Y|X)*P(X)
Concept drift: P(Y|X) changes, but P(X) remains the same.
I.e. when the input distribution remains the same but the conditional distribution of
the output given an input changes. → “same input, different output”
Predicting house prices. House features are ﬁxed. House prices are dynamic (e.g.
early pandemic house prices were much cheaper)
Many cases these drifts are cyclic or seasonal. E.g. Think of dynamic pricing on
rideshares. Companies may have different models for weekday vs. weekend pricing.
Other types of shifts

These shifts are not exhaustive

Other things can mess up your model’s performance in the real world

E.g. changing feature values, maybe you had age input as years and now it’s input as
months → range of feature values drifted

Maybe your data pipeline has a bug and it starts feeding NaNs to your model
Addressing data distribution shifts

Two main approaches in research/industry today.

Train models on massive datasets

- Hope that they learn all the complex patterns from the data

Retrain the models using labeled data from the target distribution

- Could be from scratch or continue training with the new data (ﬁne-tuning)
Monitoring and observability

Monitor predictions

- What is your model outputting?

Monitor Features

- What is being fed to your model?

Logs, Dashboards

Definitive Guide To Testing LLM Applications
No ratings yet
Definitive Guide To Testing LLM Applications
37 pages
Topic Cheatsheet For GCP's Professional Machine Learning Engineer Beta Exam
No ratings yet
Topic Cheatsheet For GCP's Professional Machine Learning Engineer Beta Exam
2 pages
Designing Machine Learning Systems by Chip Huygen by Rick
No ratings yet
Designing Machine Learning Systems by Chip Huygen by Rick
15 pages
Salesforce Ai
No ratings yet
Salesforce Ai
31 pages
LLMs in Production-MLC - GRC
No ratings yet
LLMs in Production-MLC - GRC
39 pages
Building A ML System
No ratings yet
Building A ML System
42 pages
ML System Design
No ratings yet
ML System Design
11 pages
State of The Art Research Methodology For Machine
No ratings yet
State of The Art Research Methodology For Machine
58 pages
Lecture 2 - What Is ML
No ratings yet
Lecture 2 - What Is ML
17 pages
Data Management Challenges in Production Machine Learning: Neoklis Polyzotis, Sudip Roy, Steven Whang, Martin Zinkevich
No ratings yet
Data Management Challenges in Production Machine Learning: Neoklis Polyzotis, Sudip Roy, Steven Whang, Martin Zinkevich
122 pages
Lecture 1
100% (1)
Lecture 1
51 pages
Week 2 - Select and Train A Model
No ratings yet
Week 2 - Select and Train A Model
29 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
60 pages
Subjects You Need To Know:: Programming Languages of AI
0% (1)
Subjects You Need To Know:: Programming Languages of AI
7 pages
C2 - W1 Mlopssadsa
No ratings yet
C2 - W1 Mlopssadsa
111 pages
2021 Machine Learning Intro
No ratings yet
2021 Machine Learning Intro
43 pages
AI Unit 1
No ratings yet
AI Unit 1
30 pages
AI-Lecture 8 (Machine Learning Overview)
No ratings yet
AI-Lecture 8 (Machine Learning Overview)
42 pages
Unit I
No ratings yet
Unit I
132 pages
ML Cahp 1
No ratings yet
ML Cahp 1
35 pages
2024 Machine Learning Intro
No ratings yet
2024 Machine Learning Intro
50 pages
Chapter 2 Preparing To Model
No ratings yet
Chapter 2 Preparing To Model
49 pages
Lec 01
No ratings yet
Lec 01
28 pages
Aiml Guide
No ratings yet
Aiml Guide
4 pages
Air Quality Prediction Using Machine Learning
No ratings yet
Air Quality Prediction Using Machine Learning
29 pages
Lecture - 2 Classification (Machine Learning Basic and KNN)
No ratings yet
Lecture - 2 Classification (Machine Learning Basic and KNN)
90 pages
Difference Between Machine Learning and Traditional Programming
No ratings yet
Difference Between Machine Learning and Traditional Programming
11 pages
ML Lectures 2022 Part 1
No ratings yet
ML Lectures 2022 Part 1
231 pages
Class1 - Introduction and Foundation-1717413257735
No ratings yet
Class1 - Introduction and Foundation-1717413257735
23 pages
Unit I
No ratings yet
Unit I
23 pages
Machine Learning (ML) - Comprehensive Summary
No ratings yet
Machine Learning (ML) - Comprehensive Summary
7 pages
Fundamentals of ML 1
No ratings yet
Fundamentals of ML 1
38 pages
Activity Log
No ratings yet
Activity Log
23 pages
Week 12 Intro To DS and ML
No ratings yet
Week 12 Intro To DS and ML
67 pages
Machine Learning - Course
No ratings yet
Machine Learning - Course
6 pages
Final - State of The Art Research Methodology For Machine
No ratings yet
Final - State of The Art Research Methodology For Machine
53 pages
Unit 1 ML
No ratings yet
Unit 1 ML
41 pages
ABES Presentation
No ratings yet
ABES Presentation
91 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
25 pages
Chapter 02 Overview - 4
No ratings yet
Chapter 02 Overview - 4
43 pages
ML - Lecture - 1 Introduction To ML
No ratings yet
ML - Lecture - 1 Introduction To ML
29 pages
Unit 1
No ratings yet
Unit 1
38 pages
100 Days of ML
No ratings yet
100 Days of ML
383 pages
Batch Vs Online ML: Wednesday, March 17, 2021 5:30 PM
No ratings yet
Batch Vs Online ML: Wednesday, March 17, 2021 5:30 PM
436 pages
ML Notes
No ratings yet
ML Notes
25 pages
Presentation1.Pptx Tanushka
No ratings yet
Presentation1.Pptx Tanushka
13 pages
Introduction
No ratings yet
Introduction
18 pages
(Fall 2024) Intro To ML
No ratings yet
(Fall 2024) Intro To ML
51 pages
ML Short U1-4
No ratings yet
ML Short U1-4
60 pages
Machine Learning Engineer Interview Preparation Guide
No ratings yet
Machine Learning Engineer Interview Preparation Guide
14 pages
ENG6500 1 IntroductionToMLDL Part1
No ratings yet
ENG6500 1 IntroductionToMLDL Part1
63 pages
Chatgpt Unit - 1
No ratings yet
Chatgpt Unit - 1
5 pages
Fall2024 W4995 Lecture1
No ratings yet
Fall2024 W4995 Lecture1
110 pages
Syl3 ML
No ratings yet
Syl3 ML
5 pages
ML Notion 1
No ratings yet
ML Notion 1
18 pages
Machine Learning Lecture 1
No ratings yet
Machine Learning Lecture 1
10 pages
AI Roadmap
No ratings yet
AI Roadmap
45 pages
ML Unit 1
No ratings yet
ML Unit 1
37 pages
Lesson 4 - Introduction Machine Learning
No ratings yet
Lesson 4 - Introduction Machine Learning
44 pages
The Machine Learning Lifecycle in 2021
No ratings yet
The Machine Learning Lifecycle in 2021
20 pages
ML Revision
No ratings yet
ML Revision
207 pages
Basic Concepts of Machine Learning For Beginners 1732109263
No ratings yet
Basic Concepts of Machine Learning For Beginners 1732109263
102 pages
Machine Learning Fundamentals: Concepts, Models, and Applications
From Everand
Machine Learning Fundamentals: Concepts, Models, and Applications
Amar Sahay
No ratings yet
Social Networks 21
No ratings yet
Social Networks 21
62 pages
2401 03910
No ratings yet
2401 03910
30 pages
Evaluating LLM Models For Production Systems - Methods and Practices - Data Phoenix
No ratings yet
Evaluating LLM Models For Production Systems - Methods and Practices - Data Phoenix
61 pages
DSML-ML09. Unsupervised Learning
No ratings yet
DSML-ML09. Unsupervised Learning
69 pages
AI+Governance+Framework+by+Trail+ +2024.2
No ratings yet
AI+Governance+Framework+by+Trail+ +2024.2
22 pages
Iccgi 2024 1 10 10002
No ratings yet
Iccgi 2024 1 10 10002
11 pages
P1 Automated Recognition Chatv2
No ratings yet
P1 Automated Recognition Chatv2
10 pages
PEF Prediction
No ratings yet
PEF Prediction
10 pages
综述缺陷检测
No ratings yet
综述缺陷检测
28 pages
Machine Learning and Deep Learning Based Intrusion Detection in Cloud Environment A Review
No ratings yet
Machine Learning and Deep Learning Based Intrusion Detection in Cloud Environment A Review
9 pages
LD
No ratings yet
LD
11 pages
Efficient Low-Rank Multimodal Fusion With Modality-Specific Factors
No ratings yet
Efficient Low-Rank Multimodal Fusion With Modality-Specific Factors
10 pages
Literature Survey On Mental Health in Students
No ratings yet
Literature Survey On Mental Health in Students
3 pages
Time Series Forecasting - Sparkling - Buisness Report
No ratings yet
Time Series Forecasting - Sparkling - Buisness Report
70 pages
Zhang 2020
No ratings yet
Zhang 2020
5 pages
Introduction To Machine Learning IIT KGP Week 2
100% (1)
Introduction To Machine Learning IIT KGP Week 2
14 pages
Smart Energy Management System
No ratings yet
Smart Energy Management System
11 pages
Cse Cic Ids Dataset
No ratings yet
Cse Cic Ids Dataset
19 pages
Heart Disease Identification Method Using
No ratings yet
Heart Disease Identification Method Using
72 pages
Convolutional Neural Network Based Energy Consumption Management Model For The Full Life Cycle
No ratings yet
Convolutional Neural Network Based Energy Consumption Management Model For The Full Life Cycle
9 pages
Locally Weighted Regression
No ratings yet
Locally Weighted Regression
17 pages
A Comparative Study Between Full-Parameter and LoRA-based
No ratings yet
A Comparative Study Between Full-Parameter and LoRA-based
8 pages
21ZC63 - Industrial Visit and Technical Seminar
No ratings yet
21ZC63 - Industrial Visit and Technical Seminar
15 pages
Unit-5 - Part 2
No ratings yet
Unit-5 - Part 2
11 pages
Theobald Et Al. - 2017 - Student Perception of Group Dynamics Predicts Indi
No ratings yet
Theobald Et Al. - 2017 - Student Perception of Group Dynamics Predicts Indi
16 pages
NNDL Record Final
No ratings yet
NNDL Record Final
46 pages
Cs-3491-Ai-Ml-Lab RECORD
No ratings yet
Cs-3491-Ai-Ml-Lab RECORD
59 pages
Sridhar 2020
No ratings yet
Sridhar 2020
10 pages
Portfolio 3
No ratings yet
Portfolio 3
10 pages
ML - Underfitting and Overfitting - GeeksforGeeks
No ratings yet
ML - Underfitting and Overfitting - GeeksforGeeks
8 pages
Important QnA Data Acquisition AI Class 10
No ratings yet
Important QnA Data Acquisition AI Class 10
5 pages
Obfuscated Malware Detection Using Dilated Convolutional Network
0% (1)
Obfuscated Malware Detection Using Dilated Convolutional Network
6 pages
DMML
No ratings yet
DMML
65 pages
Toward Understanding Catastrophic Forgetting in Continual Learning
No ratings yet
Toward Understanding Catastrophic Forgetting in Continual Learning
12 pages
AI and ML Lab Manual 2022
No ratings yet
AI and ML Lab Manual 2022
37 pages
Mini Final Document
No ratings yet
Mini Final Document
49 pages

Machine Learning (ML) and ML Engineering: CSE 473 24wi

Uploaded by

Machine Learning (ML) and ML Engineering: CSE 473 24wi

Uploaded by

Machine Learning (ML) and

CSE 473 24wi

Due date changes:

PM 4: Due Feb 26th

RL: Due Feb 19th (out now!)

Uncertainty: Due Feb 23rd

DL/NLP unchanged: Due March 1/8 respectively

We are getting “narrow” in this course

Honing in on data-driven decision making

New programming paradigm

- Data writes the program not the

Remember how Q-learning updated it’s estimates (from estimates i.e.

The model is only a small part of the overall

ML engineers cover everything from data

CSE 446 will cover mostly the ML algos

Credit: Chip Huyen

ML development is an iterative process

Everything in production is a trade-off

Credit: Chip Huyen

Garbage in → Garbage out

- The quality of your data matters A LOT

Not all problems should be solved with ML!

- When you have a hammer everything looks like a nail

Models are not static

Prompt: Design the Twitter recommendation engine

Design the YouTube video recommendation engine

Work together with folks around you!

We’ll come back together to discuss.

CSE 473 24wi

ML (eng) fundamentals continued

ML System Design continued

Adjacent ﬁelds: Databases, Data Viz, Compilers,

ML, DL, ML for Big

1. Experiment in Jupyter Notebooks / Google Colab

ML Engineering is a ﬁeld that changes every single day

Reading research papers, (engineering) blogs, OSS repos, documentation, etc.

- Many different mathematical notations, many different libraries, frameworks,

Design the TikTok video recommendation engine

Work together with folks around you!

We’ll come back together to discuss.

Designing Machine Learning Systems: An Iterative Process for Production-Ready

ByteByteGo: Machine Learning System Design Interview

Company engineering blogs: e.g. Uber Engineering blog

CSE 473 24wi

Uncertainty HW released (due next Friday 2/23)

ML (eng) fundamentals continued

- Data distribution shifts

“ML engineering is more engineering than ML”

Most of the failures of an ML system are due to distributed system failures

Some of the time the model/data is at fault

Big issue in supervised learning (most common models deployed in production)

- ML models trained from labeled datasets

Three major types of shift:

Covariate shift: P(X) changes, but P(Y|X) remains the same.

Most widely studied form of data distribution shift

These shifts are not exhaustive

Two main approaches in research/industry today.

Train models on massive datasets

- What is your model outputting?

- What is being fed to your model?

You might also like