0% found this document useful (0 votes)

192 views9 pages

MLOPS Case Study Questions and Answers

The document outlines a comprehensive MLOps case study focusing on the design and implementation of a machine learning system for car damage detection. It details key performance indicators (KPIs) for model efficiency, operational effectiveness, and reliability, as well as the advantages of an MLOps system over a simple model. Additionally, it describes the architecture, tools, and workflows necessary for building an end-to-end ML system, including data ingestion, model training, deployment, monitoring, and continuous improvement.

Uploaded by

geekychintu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

192 views9 pages

MLOPS Case Study Questions and Answers

Uploaded by

geekychintu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

MLOPS Case Study Submitted by: Ritik Karir

Q1. System design: Based on the above information, describe the KPI that the
business should track.

Answer:
1. Model view KPI (technical efficiency)

• Accuracy: Measures the general purity of the model that detects damage when identifying scratches, bulk
or "no damage" (no damage).

• Accurate, recall and F1 score (per square):

• Prosecutor: indicates how much identified damage (dent/scratch) is classified correctly, which reduces
false positives.

• F1-score: Balances accuracy and remembers to provide a single performance metal.

• Confusion matrix: Visit performance in different damage categories, which help identify miscalculation
patterns.

• ESTIMMS Delay: Measures the time taken by the model to process an image and return the result of
detecting significant damage to real -time applications.

2. Business Effect KPI (operational efficiency)

• Price accuracy index: Evaluates how a resale price is proximity to using model facility with actual sales
prices in the market.

• Reduction in manual inspection costs: Manual car tracks The costs received by reducing the need for
inspection.

• LED-to-cell conversion frequency: Monitor whose fast, detect automatically damage improves the speed
that the entry is converted to successful sales.

• Operational scalability: The system measures the system's ability to handle the increasing amount of
images of the car without a decrease in performance.

3. Model reliability and maintenance KPI (distribution and monitoring)

• Model Operation Detection Rate: Changes in data distribution (eg selection of lighting in new images)
Change that may affect the model performance.

• Model support frequency: Tracks that the model requires retrieval due to a decrease in performance or
new annotate data availability.
Q2. System design: Your company has decided to build an MLOps system. What
advantages would you get from building an MLOps system rather than a simple
model?

Answer:
• Scalability: MLOPS allows the system to handle large versions of car images and data, which support the
growing international operation of Carcadeepo.

• Automation: Whole ML provides automatic - data Patting, model training, perineogenic and monitoring -
to reduce human efforts and human errors.

• Continuous integration and purinogenic (CI/CD): Fast, reliable model updates and perfections, without
disturbing existing services.

• Model monitoring and operating detection: Continuous model in production spores performance, detect
data message (eg due to poor light) and trigger retrieval when needed.

• Breeding and use of use: The model maintains a broad overview of versions, experiments and data sets,
so that easy replication and results can compare.

• BETTER COOPERATION: Data offers steady workflows between researchers, engineers and business teams
through standardized pipelines and shared equipment.

• Cost efficiency: By reducing manual interventions, adaptation of resource use and advisory time to the
market for new models, reduces operating costs.

Q3: System design: You must create an ML system that has the features of a
complete production stack, from experiment tracking to automated model
deployment and monitoring. For this problem, create an ML system design
(diagram)

Answer:
Q4. System design: After creating the architecture,
please specify your reason for choosing the specific
tools you chose for the use case.
Answer:
Explaination why I would choose these technology sources

Explanation of the Diagram:

Data Sources:

• Input: Car images and annotations from multiple sources (user uploads, partner dealerships, legacy
systems).

Data Ingestion & Storage:

• Function: Consolidate and store raw data in a centralized data lake (e.g., AWS S3).

ETL & Data Processing Pipeline:

• Tools: Orchestrated with Airflow or Kubeflow Pipelines.

• Function: Clean, augment, and transform the data to prepare it for training.

Experimentation & Training Layer:

• Tools: TensorFlow/Keras for model building; MLflow for experiment tracking (logging parameters, metrics,
and artifacts).

• Function: Run training experiments, tune models, and compare performance.

Model Registry & Artifact Store:

• Tools: MLflow Model Registry.

• Function: Maintain version control and organize the best-performing models for production use.

Deployment & Inference Layer:

• Tools: Docker for containerization; Kubernetes for orchestration; Flask/FastAPI for RESTful API endpoints.

• Function: Package and deploy the model to serve real-time predictions.

Monitoring, Logging & Alerting:

• Tools: Prometheus and Grafana for metrics, ELK Stack for logging, and custom drift detection solutions.
• Function: Continuously monitor the deployed model's performance, log issues, and trigger alerts if
metrics (such as drift or latency) fall out of acceptable ranges.

CI/CD Pipeline:

• Tools: Jenkins, GitLab CI/CD, or GitHub Actions.

• Function: Automate testing, building, and deployment processes ensuring smooth transitions from
development to production.

Q5. Workflow of the solution:

You must specify the steps that should be taken to build
such a system end to end.
The steps should mention the tools used in each of the
components and how they are connected with one
another to solve the problem.
Answer:
Data Ingestion & Preprocessing

Data Sources & Storage:

What: Collect car images and annotations from various sources (user uploads, partner dealerships, legacy
systems).

Where: Store the raw data in a centralized cloud data lake (e.g., AWS S3, Azure Blob Storage).

ETL & Data Processing:

Tool: Apache Airflow or Kubeflow Pipelines

How:

Schedule and orchestrate ETL jobs that extract raw images, clean them, and apply preprocessing steps.

Use Python libraries (e.g., TensorFlow’s ImageDataGenerator) to perform image augmentation (rotation, scaling,
brightness adjustments) and normalization.

Outcome: Preprocessed images are stored in a designated training repository.

2. Experimentation & Model Training

Model Building:

Tool: TensorFlow/Keras

How:

Design a Convolutional Neural Network (CNN) to classify images into “Dent,” “Scratch,” or “None.”

Experiment with various architectures and hyperparameters.

Experiment Tracking:

Tool: MLflow

How:

Log hyperparameters, training metrics (accuracy, loss), and model artifacts during each experiment.

Compare different experiment runs to select the best-performing model.

Integration:

The preprocessed data from the ETL pipeline feeds directly into the training scripts, ensuring consistent input for
experiments.

3. Model Evaluation & Registration

Evaluation:

What: Assess model performance using metrics such as accuracy, precision, recall, and F1-score.

Model Registry:

Tool: MLflow Model Registry

How:

Register the best-performing model version.

Maintain version control and metadata (e.g., training parameters, experiment logs) to enable rollback if necessary.

4. Automated Deployment & Inference

Containerization:

Tool: Docker

How: Package the trained model along with its inference server (using Flask or FastAPI) into a Docker container.

Orchestration & Deployment:

Tool: Kubernetes

How:

Deploy the containerized model to a Kubernetes cluster.

Use Kubernetes Ingress and Horizontal Pod Autoscaler to manage load balancing and auto-scale the service.

Inference API:

What: Expose a RESTful endpoint that accepts car images and returns the predicted damage classification.

5. Continuous Monitoring, Logging & Alerting

Performance Monitoring:

Tools: Prometheus (for metrics collection) and Grafana (for dashboard visualization)

How:

Monitor key metrics such as inference latency, throughput, and error rates.

Drift Detection:

Approach:

Implement custom drift detection scripts or use libraries (e.g., Evidently AI) to continuously compare current input
data distributions against historical baselines.

Monitor prediction distributions to detect anomalies such as poor lighting conditions.

Logging & Alerting:

Tools: ELK Stack (Elasticsearch, Logstash, Kibana) or Splunk

How:

Aggregate logs from the deployed service for debugging and historical analysis.

Set up alerts (using Prometheus Alertmanager or PagerDuty) to notify stakeholders if performance or drift metrics
exceed predefined thresholds.

6. Automated Retraining & CI/CD Integration

Retraining Triggers:
When:

If drift is detected (e.g., due to lighting issues) or when new annotated data is available.

How:

The drift monitoring or data ingestion pipeline (monitored via Airflow/Kubeflow) automatically triggers a retraining
job.

Retraining Pipeline:

Process:

Load the latest preprocessed data and retrain the model.

Log new experiments via MLflow and compare against the current production model.

If the updated model performs better, register the new version in the model registry.

CI/CD Pipeline:

Tools: Jenkins, GitLab CI/CD, or GitHub Actions

How:

Automatically test, build, and deploy new models as soon as they pass integration and performance tests.

Ensure seamless updates from development to production.

7. Feedback Loop

User Feedback:

How:

Collect user and system feedback (via operational metrics and logs) to identify areas for further improvement.

Continuous Improvement:

Outcome:

The feedback loop feeds back into the data ingestion layer, triggering further retraining and fine-tuning of the
model.

Integration Overview

Data Flow:

Raw images → ETL Pipeline (Airflow/Kubeflow) → Preprocessed Data → Training Pipeline (TensorFlow/Keras,
MLflow)
Experimentation & Versioning:

Training experiments → MLflow logging → Model Registry → Dockerized Deployment

Real-Time Inference:

Deployed RESTful API (Flask/FastAPI) on Kubernetes → Inference & Prediction → Monitoring

(Prometheus/Grafana)

Monitoring & Retraining:

Continuous drift and performance monitoring → Alerting → Automated retraining (triggered via CI/CD)

CI/CD Integration:

Automated build, test, and deployment cycles ensure that new code or models are smoothly transitioned to
production.

Google Cloud Professional Machine Learning Engineer Exam Questions
100% (5)
Google Cloud Professional Machine Learning Engineer Exam Questions
123 pages
CV Template - Scalian Benelux - FY24 - DS
No ratings yet
CV Template - Scalian Benelux - FY24 - DS
3 pages
Competency Assessment Template
50% (2)
Competency Assessment Template
2 pages
Case Management System
No ratings yet
Case Management System
34 pages
2020-09-17 - Lak - GDG - Machine Learning Design Patterns For MLOps PDF
No ratings yet
2020-09-17 - Lak - GDG - Machine Learning Design Patterns For MLOps PDF
43 pages
Architecting To Support Machine Learning
No ratings yet
Architecting To Support Machine Learning
47 pages
Cleantech Documentation
No ratings yet
Cleantech Documentation
15 pages
Leather Analyzer
No ratings yet
Leather Analyzer
2 pages
Real-Time Defect Detection
No ratings yet
Real-Time Defect Detection
2 pages
Assignment 1
No ratings yet
Assignment 1
6 pages
Lavajiit Singh CV
No ratings yet
Lavajiit Singh CV
3 pages
3 Project Plan and Workflow
No ratings yet
3 Project Plan and Workflow
2 pages
Sagar T
No ratings yet
Sagar T
4 pages
Naukri TejaswihiAhirkar (4y 0m)
No ratings yet
Naukri TejaswihiAhirkar (4y 0m)
2 pages
Chethan Datascience CV
No ratings yet
Chethan Datascience CV
3 pages
Mlops: Continuous Delivery and Automation Pipelines in Machine Learning
100% (1)
Mlops: Continuous Delivery and Automation Pipelines in Machine Learning
14 pages
Demo Day Presentation
No ratings yet
Demo Day Presentation
23 pages
2 ML Kasthuri
No ratings yet
2 ML Kasthuri
17 pages
Notesv 1
No ratings yet
Notesv 1
6 pages
New ITRAdd On
No ratings yet
New ITRAdd On
6 pages
Previous AI Projects - 10 Sample Projects
No ratings yet
Previous AI Projects - 10 Sample Projects
14 pages
Data Science, Machine Learning, Python, Basics of SQL.: Professional Summary
No ratings yet
Data Science, Machine Learning, Python, Basics of SQL.: Professional Summary
5 pages
Getting Started With MLOPs 21 Page Tutorial
No ratings yet
Getting Started With MLOPs 21 Page Tutorial
21 pages
Production ML Pipelines With TensorFlow Extended - TFX - Presentation
No ratings yet
Production ML Pipelines With TensorFlow Extended - TFX - Presentation
234 pages
Diksha 9
No ratings yet
Diksha 9
4 pages
Final Model Study and Benchmarking For Computer Vision Solution
No ratings yet
Final Model Study and Benchmarking For Computer Vision Solution
3 pages
ML Process and Map
No ratings yet
ML Process and Map
7 pages
SEM RESPOSTA - 736496689-Google-Cloud-Professional-Machine-Learning-Engineer-Exam-Questions
No ratings yet
SEM RESPOSTA - 736496689-Google-Cloud-Professional-Machine-Learning-Engineer-Exam-Questions
82 pages
Main Content
No ratings yet
Main Content
17 pages
Unit 1
No ratings yet
Unit 1
21 pages
ASDFGHJKL
No ratings yet
ASDFGHJKL
10 pages
CT1-MLOPs S1 2
No ratings yet
CT1-MLOPs S1 2
68 pages
Traffic Flow Prediction Using The METR-LA Traffic
No ratings yet
Traffic Flow Prediction Using The METR-LA Traffic
8 pages
ML Pipelines AI Community
No ratings yet
ML Pipelines AI Community
53 pages
Diksha Iot 8
No ratings yet
Diksha Iot 8
5 pages
Final Reserach Proposal
No ratings yet
Final Reserach Proposal
7 pages
Project Report
No ratings yet
Project Report
13 pages
On The Automation of Machine Learning Pipelines: F E U P
No ratings yet
On The Automation of Machine Learning Pipelines: F E U P
86 pages
ஓம் சக்தி ஓம் சரவணா பவா - RESUME MODELS FINALE
No ratings yet
ஓம் சக்தி ஓம் சரவணா பவா - RESUME MODELS FINALE
48 pages
Auto Recovered
No ratings yet
Auto Recovered
2 pages
MLOps Interview QnA
No ratings yet
MLOps Interview QnA
19 pages
Autonomous Car
100% (1)
Autonomous Car
12 pages
Step-By-Step Guide To Gain MLOps Skills
No ratings yet
Step-By-Step Guide To Gain MLOps Skills
6 pages
AI Engineer Interview Prep Guide
No ratings yet
AI Engineer Interview Prep Guide
16 pages
DS Architecture
No ratings yet
DS Architecture
7 pages
Projects
No ratings yet
Projects
7 pages
C2 - W1 Mlopssadsa
No ratings yet
C2 - W1 Mlopssadsa
111 pages
Synopsis
No ratings yet
Synopsis
51 pages
Tech Lead Screening Questions
No ratings yet
Tech Lead Screening Questions
6 pages
ML Pipeline
No ratings yet
ML Pipeline
6 pages
Bhavana
No ratings yet
Bhavana
4 pages
Untitled Document
No ratings yet
Untitled Document
5 pages
Anshul Yadav: Profile
No ratings yet
Anshul Yadav: Profile
6 pages
Hardest Level Senior Product Manager For AI Products Interview2
No ratings yet
Hardest Level Senior Product Manager For AI Products Interview2
4 pages
DSML Projects
No ratings yet
DSML Projects
10 pages
MLops 12 Draft
No ratings yet
MLops 12 Draft
5 pages
Machine Learning Model Workflow
No ratings yet
Machine Learning Model Workflow
3 pages
ML Report
No ratings yet
ML Report
11 pages
Analysis and Data Mining of Call Detail Records
No ratings yet
Analysis and Data Mining of Call Detail Records
4 pages
File Format
No ratings yet
File Format
2 pages
DA Unit-2
No ratings yet
DA Unit-2
7 pages
Spring Boot With MongoDB
No ratings yet
Spring Boot With MongoDB
16 pages
Types of Information System
No ratings yet
Types of Information System
112 pages
Tutorial For Oracle10g Forms and Reports
No ratings yet
Tutorial For Oracle10g Forms and Reports
76 pages
Unit 2 FDS
No ratings yet
Unit 2 FDS
55 pages
Data Driven Decision Making
No ratings yet
Data Driven Decision Making
164 pages
Courier Management
No ratings yet
Courier Management
5 pages
Dbms Imp
No ratings yet
Dbms Imp
3 pages
Brief Summary of SAP Buffers
No ratings yet
Brief Summary of SAP Buffers
2 pages
Chatbot Using A Knowledge in Database
No ratings yet
Chatbot Using A Knowledge in Database
7 pages
DBMS ASSIGNMENT For PRACTICE
No ratings yet
DBMS ASSIGNMENT For PRACTICE
10 pages
Exercise 7,8,9 Basic Commands
No ratings yet
Exercise 7,8,9 Basic Commands
7 pages
AWS-Certified-Cloud-Practitioner Dumps Amazon AWS Certified Cloud Practitioner
No ratings yet
AWS-Certified-Cloud-Practitioner Dumps Amazon AWS Certified Cloud Practitioner
8 pages
Year 8 - Single Variable Data Analysis
No ratings yet
Year 8 - Single Variable Data Analysis
100 pages
HW 3 Sol
No ratings yet
HW 3 Sol
8 pages
"Aistifsar" Enquiry Chatbot Using Artificial Intelligence Markup Language (AIML)
No ratings yet
"Aistifsar" Enquiry Chatbot Using Artificial Intelligence Markup Language (AIML)
10 pages
Data Engineering
No ratings yet
Data Engineering
8 pages
Analytics For Finance Model Paper
No ratings yet
Analytics For Finance Model Paper
5 pages
Sankirthan Raj Mavella - Full Stack Java Developer
No ratings yet
Sankirthan Raj Mavella - Full Stack Java Developer
6 pages
Class Xii Cs 083 Syllabus 2024 25
No ratings yet
Class Xii Cs 083 Syllabus 2024 25
4 pages
DBMS Module 3 Notes
No ratings yet
DBMS Module 3 Notes
21 pages
PRN212Assignment01 WPF - LINQ
No ratings yet
PRN212Assignment01 WPF - LINQ
5 pages
Dont See This
No ratings yet
Dont See This
10 pages
Sas Data Management
100% (1)
Sas Data Management
908 pages
Upgrade Instructions For Digisaf 100
No ratings yet
Upgrade Instructions For Digisaf 100
3 pages
Docs Huihoo Com Apache Ofbiz 2 1 1 OFBizQuickStart HTML
No ratings yet
Docs Huihoo Com Apache Ofbiz 2 1 1 OFBizQuickStart HTML
1 page

MLOPS Case Study Questions and Answers

Uploaded by

MLOPS Case Study Questions and Answers

Uploaded by

MLOPS Case Study Submitted by: Ritik Karir

• Accurate, recall and F1 score (per square):

• F1-score: Balances accuracy and remembers to provide a single performance metal.

2. Business Effect KPI (operational efficiency)

3. Model reliability and maintenance KPI (distribution and monitoring)

Explanation of the Diagram:

Data Ingestion & Storage:

ETL & Data Processing Pipeline:

• Tools: Orchestrated with Airflow or Kubeflow Pipelines.

Experimentation & Training Layer:

• Function: Run training experiments, tune models, and compare performance.

Model Registry & Artifact Store:

• Tools: MLflow Model Registry.

Deployment & Inference Layer:

• Function: Package and deploy the model to serve real-time predictions.

Monitoring, Logging & Alerting:

• Tools: Jenkins, GitLab CI/CD, or GitHub Actions.

Q5. Workflow of the solution:

Data Sources & Storage:

ETL & Data Processing:

Tool: Apache Airflow or Kubeflow Pipelines

Outcome: Preprocessed images are stored in a designated training repository.

2. Experimentation & Model Training

Experiment with various architectures and hyperparameters.

Compare different experiment runs to select the best-performing model.

3. Model Evaluation & Registration

Tool: MLflow Model Registry

Register the best-performing model version.

4. Automated Deployment & Inference

Orchestration & Deployment:

Deploy the containerized model to a Kubernetes cluster.

5. Continuous Monitoring, Logging & Alerting

Monitor prediction distributions to detect anomalies such as poor lighting conditions.

Logging & Alerting:

Tools: ELK Stack (Elasticsearch, Logstash, Kibana) or Splunk

6. Automated Retraining & CI/CD Integration

Load the latest preprocessed data and retrain the model.

Tools: Jenkins, GitLab CI/CD, or GitHub Actions

Ensure seamless updates from development to production.

Training experiments → MLflow logging → Model Registry → Dockerized Deployment

Deployed RESTful API (Flask/FastAPI) on Kubernetes → Inference & Prediction → Monitoring

Monitoring & Retraining:

You might also like