Introduction to MLOps
Bringing DevOps and Automation to Machine Learning
Hei Chow
Solutions Architect
© 2021, 2022, Amazon Web Services, Inc. or its affiliates.
Main Challenges
• Publishing an ML model is not enough.
• Managing the published ML models is as important as developing them.
• “IT leaders responsible for AI are discovering ‘AI pilot paradox’, where launching pilots is deceptively easy but deploying them into production is notoriously challenging.” - Gartner
https://www.idgconnect.com/article/3583467/gartner-accelerating-ai-deployments-paths-of-least-resistance.html
The ML process
• Business Problem
• ML problem framing
• Data collection
• Data integration
• Data visualization and analysis
• Feature engineering
• Model evaluation (NO/YES decision point)
• Model deployment
• Monitoring and debugging
Phase 1: Research/Experiment
Question: “Can we use ML to solve this?”
• “Is it possible to … ?”
• “Can we use this data to solve the following problem?”
• “Surely we must be able to …”
Typical scenarios
• Scientific projects
[Diagram: ML Code, Data Collection, Data Verification, Feature Extraction, Serving Infrastructure, Monitoring]
Phase 2: Operational
Question: “How do we implement this method at scale?”
• How do we pipe the data into the model in a timely fashion?
• How do we collect, store and transform data so models can be retrained consistently?
• How do we build an A/B testing environment, in order to test future model iterations?
Typical scenarios
• After PoC, bringing your ML models to production
• Migration of existing models into ML platform
[Diagram: ML Code, Data Collection, Data Verification, Feature Extraction, Serving Infrastructure, Monitoring, Resource Management, Analysis Tools, Process Management]
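One building block of the A/B testing question above is deciding which model version serves a given request. The sketch below shows one possible approach, weighted and deterministic assignment by hashing a request ID. The function name `assign_variant`, the variant names, and the 90/10 split are assumptions made for illustration and do not come from the slides.

```python
import hashlib

def assign_variant(request_id: str, weights: dict) -> str:
    """Deterministically map a request ID to a model variant by weight."""
    total = sum(weights.values())
    # Hash the ID to a stable number between 0 and 1.
    digest = hashlib.sha256(request_id.encode("utf-8")).digest()
    point = int.from_bytes(digest[:8], "big") / 2**64
    cumulative = 0.0
    for name, weight in weights.items():
        cumulative += weight / total
        if point < cumulative:
            return name
    return name  # fallback for floating-point rounding at the upper edge

# Hypothetical variants: 90% of traffic to the current model, 10% to a candidate.
weights = {"model_a": 0.9, "model_b": 0.1}
counts = {"model_a": 0, "model_b": 0}
for i in range(10_000):
    counts[assign_variant(f"user-{i}", weights)] += 1
print(counts)
```

Because the assignment depends only on the ID, the same user keeps hitting the same variant, which keeps per-variant metrics comparable across requests.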
MLOps – Why?
MLOps – What?
[Diagram: ML + Dev + Ops. Labels: Models, Data, Monitor, Data validation, Data shuffle and split, Transformation and feature engineering, Training model, Model tuning, Model analysis, Model deployment]
MLOps practices
ML lifecycle management
• Model building
• Model evaluation and experimentation
• Productionize model
• Testing and quality
• Deployment
• Monitoring and Observability
[Diagram: Business Stakeholder Feedback, Training pipeline, Deployment pipeline; Data, Model, Code]
Automating ML Workflows using SageMaker
MLOPS
Task orchestration
• Create and manage workflows
• Automate ML steps & pipelines
• Implement CI/CD
• Form a Directed Acyclic Graph (DAG)
ML platform
• ML development, experimentation, collaboration
• Compute/training environment
• Model registry
• Feature store
• Model deployment
• Monitoring in production
• Hyperparameter optimization
• Dataset management
[Diagram: example DAG with nodes A, B, C, D. Options: Open source, AWS native, DIY, SageMaker. Services shown: Amazon EKS, Amazon ECS, Amazon SageMaker]
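To make the "Form a Directed Acyclic Graph (DAG)" point concrete, here is a minimal sketch of how an orchestrator might derive a valid execution order from task dependencies, using Python's standard `graphlib` module. The edges between A, B, C and D are an assumption (the slide only names the nodes), and `run_workflow` is a hypothetical helper, not part of any AWS service.

```python
from graphlib import TopologicalSorter

# Map each task to the set of tasks it depends on.
# Assumed shape: A feeds B and C, and D needs both B and C.
dependencies = {
    "A": set(),
    "B": {"A"},
    "C": {"A"},
    "D": {"B", "C"},
}

def run_workflow(deps):
    """Walk the graph in dependency order and return the order used."""
    order = []
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # raises graphlib.CycleError if the graph has a cycle
    while sorter.is_active():
        # Tasks whose dependencies are all done; these could run in parallel.
        for task in sorted(sorter.get_ready()):
            order.append(task)  # a real orchestrator would execute the task here
            sorter.done(task)
    return order

order = run_workflow(dependencies)
print(order)
```

The acyclic requirement is what guarantees such an order exists: if two tasks depended on each other, `prepare()` would reject the graph instead of looping forever.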
Amazon SageMaker
Most complete, end-to-end ML service
Integrated Workbench
Capabilities designed specifically for ML, data
preparation, experiment management,
and workflows
Managed Infrastructure
Designed for ultra low latency and high
throughput, automatic scaling, and
distributed training
Managed Tooling
Purpose-built from the ground up to
work together including auto ML,
collaboration, debugger, profiler, bias
analyzer, and explainability
SageMaker Studio
Integrated development environment (IDE) for ML
MLOPS
• Open source: Kubeflow, MLflow
• AWS native: Pipelines, AWS Step Functions
• DIY: Amazon EKS, Amazon ECS, Amazon EC2, Amazon ECR
• SageMaker: Amazon SageMaker
Task orchestration
• Open source 3rd party options
• Native AWS options
Pipelines
1. Project
2. Automated Model Build Workflows
3. Model Registry
[Diagram: Source Code Control (</>), Model Deployment Pipeline]
Supported Steps:
• Processing
• Training
• Tuning
• Conditional
• Register Model
• Create Model
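As a rough illustration of how step types like these fit together, the sketch below chains a processing step, a training step, and a conditional step that gates model registration on a metric. It is plain Python and deliberately does not use the SageMaker SDK; `run_pipeline`, the toy "majority label" model, the sample rows, and the thresholds are all assumptions for illustration. Tuning and Create Model are left out to keep it short.

```python
def run_pipeline(rows, accuracy_threshold):
    """Run a toy pipeline and return (executed step names, model registry)."""
    executed = []
    registry = []

    # Processing: drop rows with a missing label.
    executed.append("Processing")
    clean = [(x, y) for x, y in rows if y is not None]

    # Training: the toy "model" always predicts the most common label.
    executed.append("Training")
    labels = [y for _, y in clean]
    majority = max(sorted(set(labels)), key=labels.count)
    # Toy metric measured on the training rows themselves; a real pipeline
    # would evaluate on held-out data.
    accuracy = sum(1 for y in labels if y == majority) / len(labels)

    # Conditional: only register the model if it clears the threshold.
    executed.append("Conditional")
    if accuracy >= accuracy_threshold:
        executed.append("Register Model")
        registry.append({"model": majority, "accuracy": accuracy})
    return executed, registry

# Hypothetical data: (feature, label) pairs, one with a missing label.
rows = [(1.0, 1), (2.0, 1), (3.0, 0), (4.0, None), (5.0, 1)]
print(run_pipeline(rows, accuracy_threshold=0.7))
print(run_pipeline(rows, accuracy_threshold=0.8))
```

The conditional step is what turns a linear script into an automated quality gate: a model that misses the threshold never reaches the registry, so nothing downstream can deploy it.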
https://aws.amazon.com/blogs/machine-learning/create-amazon-sagemaker-projects-using-third-party-source-control-and-jenkins/
3 - Metric Based
Example:
Supported Features:
• Automatic data collection
• Continuous monitoring
• Flexible Monitoring Rules
• Visual data analysis
• CloudWatch integration
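The idea behind continuous monitoring with rules can be sketched as: compute baseline statistics from training data, then check each batch of captured production values against them. The example below is a minimal, self-contained illustration of one such rule (mean shift measured in baseline standard deviations). The helper names `build_baseline` and `check_batch`, the 3.0 threshold, and the sample values are assumptions, not the behaviour of any specific AWS feature.

```python
from statistics import mean, stdev

def build_baseline(values):
    """Summarize a feature from the training data."""
    return {"mean": mean(values), "std": stdev(values)}

def check_batch(baseline, batch, max_z=3.0):
    """Return a list of violation messages for one batch of captured values."""
    violations = []
    shift = abs(mean(batch) - baseline["mean"])
    # Rule: flag the batch if its mean moved more than max_z baseline std devs.
    if baseline["std"] > 0 and shift / baseline["std"] > max_z:
        z = shift / baseline["std"]
        violations.append(f"mean shifted by {z:.1f} baseline std devs")
    return violations

# Hypothetical feature values seen during training.
baseline = build_baseline([10, 11, 9, 10, 10, 11, 9, 10])

print(check_batch(baseline, [10, 10.5, 9.5]))  # similar to training data
print(check_batch(baseline, [15, 16, 14]))     # clearly drifted
```

In a production setup, the list of violations would typically be published as a metric or alarm so that drift triggers investigation or retraining rather than going unnoticed.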
Getting Started