Problem+Formulation+Exercise+Solutions

Uploaded by

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views3 pages

Problem+Formulation+Exercise+Solutions

Uploaded by

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Problem Formulation: What this pipeline phase entails

and why it’s important

The problem formulation phase of the ML Pipeline is critical, and it’s where everything begins. Typically,
this phase is kicked off with a question of some kind. Examples of these kinds of questions include: Could
cars really drive themselves? What additional product should we offer someone as they checkout? How
much storage will clients need from a data center at a given time?

The problem formulation phase starts by seeing a problem and thinking “what question, if I could
answer it, would provide the most value to my business?” If I knew the next product a customer was
going to buy, is that most valuable? If I knew what was going to be popular over the holidays, is that
most valuable? If I better understood who my customers are, is that most valuable?

However, some problems are not so obvious. When sales drop, new competitors emerge, or there’s a
big change to a company/team/org, it can be easy to say, “I see the problem!” But sometimes the
problem isn’t so clear. Consider self-driving cars. How many people think to themselves, “driving cars is
a huge problem”? Probably not many. In fact, there isn’t a problem in the traditional sense of the word
but there is an opportunity. Creating self-driving cars is a huge opportunity. That doesn’t mean there
isn’t a problem or challenge connected to that opportunity. How do you design a self-driving system?
What data would you look at to inform the decisions you make? Will people purchase self-driving cars?

Part of the problem formulation phase includes seeing where there are opportunities to use machine
learning.

In the following practice examples, you are presented with four different business scenarios. For each
scenario, consider the following questions:

1. Is machine learning appropriate for this problem, and why or why not?
2. What is the ML problem if there is one, and what would a success metric look like?
3. What kind of ML problem is this?
4. Is the data appropriate?

The solutions given in this document are one of the many ways you can formulate a business problem.
The first scenario has been completed for you. Remember that there are two ways to start an ML
problem. The first is by addressing an obvious problem, the second is by seeing an opportunity. Lastly,
be sure to consider whether this is even an ML problem at all. Take a look at scenarios 2 – 4 below and
see if you can answer the questions above.

1) Amazon recently began advertising to its customers when they visit the company website. The
Director in charge of the initiative wants the advertisements to be as tailored to the customer as
possible. You will have access to all the data from the retail webpage, as well as all the customer
data.

1) ML is appropriate because of the scale, variety and speed required. There are potentially
thousands of ads and millions of customers that need to be served customized ads
immediately as they arrive to the site.
2) The problem is ads that are not useful to customers are a wasted opportunity and a
nuisance to customers, yet not serving ads at all is a wasted opportunity. So how does
Amazon serve the most relevant advertisements to its retail customers?
i. Success would be the purchase of a product that was advertised.
3) This is a supervised learning problem because we have a labeled data point, our success
metric, which is the purchase of a product.
4) This data is appropriate because it is both the retail webpage data as well as the customer
data.

2) You’re a Senior Business Analyst at a social media company that focuses on streaming. Streamers
use a combination of hashtags and predefined categories to be discoverable by your platform’s
consumers. You ran an analysis on unique streamer counts by hashtags and categories over the last
month and found that out of tens of thousands of streamers, almost all use only 40 hashtags and 10
categories despite innumerable hashtags and hundreds of categories. You presume the predefined
categories don’t represent all the possibilities very well, and that streamers are simply picking the
closest fit. You figure there are likely many categories and groupings of streamers that are not
accounted for. So you collect a dataset that consists of all streamer profile descriptions (all text), all
the historical chat information for each streamer, and all their videos that have been streamed.
1) ML is appropriate because of the scale and variability.
2) The problem is the content of streamers is not being represented by the existing categories.
Success would be naturally grouping the streamers into categories based on content and
seeing if those align with the hashtags and categories that are being commonly used. If they
do not, then the streamers are not being well represented and you can use these groupings
to create new categories.
3) There isn’t a specific outcome variable. There’s no target or label. So this is an unsupervised
problem.
4) The data is appropriate.

3) You’re a headphone manufacturer who sells directly to big and small electronic stores. As an
attempt to increase competitive pricing, Store 1 and Store 2 decided to put together the pricing
details for all headphone manufacturers and their products (about 350 products) and conduct daily
releases of the data. You will have all the specs from each manufacturer and their product’s pricing.
Your sales have recently been dropping so your first concern is whether there are competing
products that are priced lower than your flagship product.
1) ML is probably not necessary for this. You can just search the dataset to see which
headphones are priced lower than the flagship, then compare their features and build
quality.

4) You’re a Senior Product Manager at a leading ridesharing company. You did some market research,
collected customer feedback, and discovered that both customers and drivers are not happy with an
app feature. This feature allows customers to place a pin exactly where they want to be picked up.
The customers say drivers rarely stop at the pin location. Drivers say customers most often put the
pin in a place they can’t stop. Your company has a relationship with the most used maps app for the
driver’s navigation so you leverage this existing relationship to get direct, backend access to their
data. This includes latitude and longitude, visual photos of each lat/long, traffic delay details, and
regulation data if available (ie- No Parking zones, 3 minute parking zones, fire hydrants, etc.).
1) ML is appropriate because of the scale and automation involved. It’s not feasible to drive
everywhere and write down all the places that are ok for pickup. However, maybe we can
predict whether a location is ok for pickup.
2) The problem is drivers and customers are having poor experiences connecting for pickup,
which is pushing customers away from the platform.
i. Success would be properly identifying appropriate pickup locations so they
can be integrated into the feature.
3) This is a supervised learning problem even though there aren’t any labels, yet. Someone will
have to go through a sample of the data to label where there are ok places to park and not
park, giving the algorithms some target information.
4) The data is appropriate once a sample of the dataset has been labeled. There may be some
other data that could be included too. What about asking UPS for driver stop information?
Where do they stop?

Developing Machine Learning Solutions
No ratings yet
Developing Machine Learning Solutions
25 pages
ML
No ratings yet
ML
331 pages
Learning AI Development With UX
No ratings yet
Learning AI Development With UX
41 pages
ML Projects For Final Year
No ratings yet
ML Projects For Final Year
7 pages
1_Intro to ML System Design
No ratings yet
1_Intro to ML System Design
45 pages
Nqd28MNrTwKndvDDay8Cgg C MMLPGC B Managing ML Projects With GC Student Slides v2.0
No ratings yet
Nqd28MNrTwKndvDDay8Cgg C MMLPGC B Managing ML Projects With GC Student Slides v2.0
118 pages
howtobeagoodmachinelearningpmbygoogleproductmanager-181031104416
No ratings yet
howtobeagoodmachinelearningpmbygoogleproductmanager-181031104416
71 pages
C1 W3
No ratings yet
C1 W3
60 pages
Lecture 2.4-1
No ratings yet
Lecture 2.4-1
119 pages
3-Data Considerations
No ratings yet
3-Data Considerations
46 pages
Framing A Machine Learning Problem
No ratings yet
Framing A Machine Learning Problem
32 pages
Lecture 3_1-ML and Data Systems Fundamentals
No ratings yet
Lecture 3_1-ML and Data Systems Fundamentals
48 pages
Design A Machine Learning System
No ratings yet
Design A Machine Learning System
9 pages
Bag2022_Examining Collaborative Buyer-supplier Relationships and Social Sustainability in the New Normal Era the Moderating Effects
No ratings yet
Bag2022_Examining Collaborative Buyer-supplier Relationships and Social Sustainability in the New Normal Era the Moderating Effects
46 pages
How_Corporations_Can_Develop_M
No ratings yet
How_Corporations_Can_Develop_M
210 pages
Looper E2e ML Platform
No ratings yet
Looper E2e ML Platform
13 pages
Polyzotis Et Al_2018
No ratings yet
Polyzotis Et Al_2018
12 pages
House DZ RC 158 ML Patterns 2023
No ratings yet
House DZ RC 158 ML Patterns 2023
7 pages
Certified Artificial Intelligence Practitioner 1
No ratings yet
Certified Artificial Intelligence Practitioner 1
43 pages
07---data-lifecycle-challenges-in-production-ml
No ratings yet
07---data-lifecycle-challenges-in-production-ml
12 pages
cs329s 2022 02 Slides MLSD
No ratings yet
cs329s 2022 02 Slides MLSD
99 pages
Solving Enterprise ML 5 Challenges
No ratings yet
Solving Enterprise ML 5 Challenges
20 pages
A Data Quality-Driven View of Mlops
No ratings yet
A Data Quality-Driven View of Mlops
12 pages
Towards Machine Learning Guided by Best Practices
No ratings yet
Towards Machine Learning Guided by Best Practices
5 pages
ML Feasability Studies
No ratings yet
ML Feasability Studies
4 pages
20
No ratings yet
20
19 pages
Foundry Databricks 40822 Tech Dossier Final v2 7.26
No ratings yet
Foundry Databricks 40822 Tech Dossier Final v2 7.26
10 pages
How To Manage Machine Learning Products - Towards Data Science
No ratings yet
How To Manage Machine Learning Products - Towards Data Science
8 pages
ML at Scale Ebook
No ratings yet
ML at Scale Ebook
14 pages
Coats Annual Report 2021
No ratings yet
Coats Annual Report 2021
212 pages
Machine Learning For Product Managers
No ratings yet
Machine Learning For Product Managers
7 pages
Machine Learning Lecture 1
No ratings yet
Machine Learning Lecture 1
10 pages
Assignment Title: Trends and Issues in Sciences
No ratings yet
Assignment Title: Trends and Issues in Sciences
4 pages
C2 - W1 Mlopssadsa
No ratings yet
C2 - W1 Mlopssadsa
111 pages
Tantithamthavorn Et Al_2025
No ratings yet
Tantithamthavorn Et Al_2025
7 pages
Method Statement for Terrazzo Flooring
No ratings yet
Method Statement for Terrazzo Flooring
8 pages
FULLTEXT01
No ratings yet
FULLTEXT01
40 pages
Feature Labs - ML 2.0
No ratings yet
Feature Labs - ML 2.0
13 pages
Uncertainty in Modeling
No ratings yet
Uncertainty in Modeling
25 pages
Procurement of Aircraft Spares
No ratings yet
Procurement of Aircraft Spares
5 pages
Problem+Formulation+Exercise
No ratings yet
Problem+Formulation+Exercise
2 pages
OM Unit 5 Service Design
No ratings yet
OM Unit 5 Service Design
34 pages
Chapter 5 Strama Report
No ratings yet
Chapter 5 Strama Report
23 pages
Standardizing ML Ebook
No ratings yet
Standardizing ML Ebook
24 pages
Summary Mill
No ratings yet
Summary Mill
44 pages
From Field Problems To Machine Learning
No ratings yet
From Field Problems To Machine Learning
51 pages
PMP Group Activity Booklet_Official PMI_v0.1 (1)
No ratings yet
PMP Group Activity Booklet_Official PMI_v0.1 (1)
23 pages
ASSIGNMENT BRIEF SOLUTION
No ratings yet
ASSIGNMENT BRIEF SOLUTION
15 pages
Month to Month Sales Forecast Template
No ratings yet
Month to Month Sales Forecast Template
12 pages
International Strategy Wip
No ratings yet
International Strategy Wip
17 pages
Sepm Previous Year Questions
No ratings yet
Sepm Previous Year Questions
3 pages
Fdocuments - in PPT On Nahar 1
No ratings yet
Fdocuments - in PPT On Nahar 1
34 pages
Microsoft PL600 Activity Booklet Day 3
No ratings yet
Microsoft PL600 Activity Booklet Day 3
4 pages
Lab 3 ML
No ratings yet
Lab 3 ML
19 pages
Stop Card 1 Kelompok 5
No ratings yet
Stop Card 1 Kelompok 5
1 page
1571-Article Text-2942-1-10-20180804
No ratings yet
1571-Article Text-2942-1-10-20180804
9 pages
Case Studies ML
No ratings yet
Case Studies ML
21 pages
Labs_ER_v2
No ratings yet
Labs_ER_v2
23 pages
SOP AIR TIMAR Morocco
No ratings yet
SOP AIR TIMAR Morocco
2 pages
Module Preprocesing_MLPipeline
No ratings yet
Module Preprocesing_MLPipeline
7 pages
Machine Learning Process Lifecycle: Talat@amii - Ca Luke@amii - Ca Shazan@amii - Ca Sankalp@amii - Ca
No ratings yet
Machine Learning Process Lifecycle: Talat@amii - Ca Luke@amii - Ca Shazan@amii - Ca Sankalp@amii - Ca
13 pages
Academic Year 25_26
No ratings yet
Academic Year 25_26
3 pages
Progress and Update Term 3 2024_2025
No ratings yet
Progress and Update Term 3 2024_2025
8 pages
PMI Official_Group_PMPv3
No ratings yet
PMI Official_Group_PMPv3
5 pages
terraform_lambda
No ratings yet
terraform_lambda
5 pages
AWS_SPARK
No ratings yet
AWS_SPARK
3 pages
Linux
No ratings yet
Linux
4 pages
Risk Management Plan: Shangrila May Dimol-Bergonio, MD
No ratings yet
Risk Management Plan: Shangrila May Dimol-Bergonio, MD
15 pages
Job Design (Job Analysis & Job Description) (1 Day)
No ratings yet
Job Design (Job Analysis & Job Description) (1 Day)
4 pages
Training Outline_Partyrock
No ratings yet
Training Outline_Partyrock
4 pages
Kenenisa Bekele Resume
No ratings yet
Kenenisa Bekele Resume
1 page
Microssoft PL600 Activity Booklet Day 1
No ratings yet
Microssoft PL600 Activity Booklet Day 1
5 pages
Attach EI
No ratings yet
Attach EI
3 pages
Task 1 - Identify The Customer Profile
No ratings yet
Task 1 - Identify The Customer Profile
4 pages
Module 2 - PRACTICING ENTREPRENEURSHIP
No ratings yet
Module 2 - PRACTICING ENTREPRENEURSHIP
15 pages
OUDREY THOMAS ASSIGNMENT AWS_Oudrey
No ratings yet
OUDREY THOMAS ASSIGNMENT AWS_Oudrey
2 pages
The Effect of Artificial Intelligence On Service Quality and Customer Satisfaction in Jordanian Banking Sector
No ratings yet
The Effect of Artificial Intelligence On Service Quality and Customer Satisfaction in Jordanian Banking Sector
7 pages
QAS Vol.22 No.181 April.2021 p3-6
No ratings yet
QAS Vol.22 No.181 April.2021 p3-6
5 pages
Financial Management Hinglish Notes
No ratings yet
Financial Management Hinglish Notes
4 pages
Assignment Details
No ratings yet
Assignment Details
4 pages
Sagemaker Studio_EMR_Glue_macarious
No ratings yet
Sagemaker Studio_EMR_Glue_macarious
2 pages
mb300andER__sent
No ratings yet
mb300andER__sent
2 pages
mb300andER
No ratings yet
mb300andER
2 pages
MySql Student companion_day 3
No ratings yet
MySql Student companion_day 3
2 pages
Construction Office Manager Job Description For Resume
100% (1)
Construction Office Manager Job Description For Resume
4 pages
[PO] IT Architecture for JKT
No ratings yet
[PO] IT Architecture for JKT
1 page
Assignment Bedawi
No ratings yet
Assignment Bedawi
1 page
Dinellie D_Assignment
No ratings yet
Dinellie D_Assignment
1 page
Aidel Azhar Assignment
No ratings yet
Aidel Azhar Assignment
1 page
Innovation in Public Administration: An Important Tool for Simplifying Public Service Delivery in Bangladesh
No ratings yet
Innovation in Public Administration: An Important Tool for Simplifying Public Service Delivery in Bangladesh
4 pages
Chang_Si_Ju
No ratings yet
Chang_Si_Ju
2 pages
Python Institute
No ratings yet
Python Institute
6 pages
Procedure of Additional Spare Part Request
No ratings yet
Procedure of Additional Spare Part Request
2 pages
Elliana Yasmin_Assignment
No ratings yet
Elliana Yasmin_Assignment
1 page
2.1.9 Surface Scratch Examination Metal-Clad Foil
No ratings yet
2.1.9 Surface Scratch Examination Metal-Clad Foil
1 page
Growth and Trade: Multiple Choice Questions
No ratings yet
Growth and Trade: Multiple Choice Questions
20 pages
Outline Splunk
No ratings yet
Outline Splunk
3 pages
3-Day IT Architecture Business Technology Requirement Architecture Training Course Plan
No ratings yet
3-Day IT Architecture Business Technology Requirement Architecture Training Course Plan
2 pages
Machine Learning Decoded
From Everand
Machine Learning Decoded
Mary Chapman
No ratings yet
Marketing, Sales and Service with AI
From Everand
Marketing, Sales and Service with AI
Steve Kaplan
2/5 (2)
Digital and Marketing Asset Management: The Real Story about DAM Technology and Practices
From Everand
Digital and Marketing Asset Management: The Real Story about DAM Technology and Practices
Theresa Regli
3/5 (2)
Optimizing AI and Machine Learning Solutions: Your ultimate guide to building high-impact ML/AI solutions (English Edition)
From Everand
Optimizing AI and Machine Learning Solutions: Your ultimate guide to building high-impact ML/AI solutions (English Edition)
Mirza Rahim Baig
No ratings yet
Unlevel the Playing Field: The Biggest Mindshift in PPC History
From Everand
Unlevel the Playing Field: The Biggest Mindshift in PPC History
Frederick Vallaeys
5/5 (1)
AI for Marketers Outperform Competitors with Smarter Campaigns and Data
From Everand
AI for Marketers Outperform Competitors with Smarter Campaigns and Data
tarek mohamed
No ratings yet
Optimum Sigma is NOT 6
From Everand
Optimum Sigma is NOT 6
Kermit Taylor
No ratings yet
Out of the Box, or Out of the Question: What Won't Your Incentive Compensation Management System Do?
From Everand
Out of the Box, or Out of the Question: What Won't Your Incentive Compensation Management System Do?
David Kelly
No ratings yet
The Big Car Con
From Everand
The Big Car Con
Chris Rainsford
No ratings yet
Remote Retailing Blueprint
From Everand
Remote Retailing Blueprint
Brian Pasch
No ratings yet
Hyper-Local Marketing Handbook for Automotive Retail
From Everand
Hyper-Local Marketing Handbook for Automotive Retail
Brian Pasch
No ratings yet
How to Build a SaaS Business: A Step-by-Step Guide to Starting and Operating a Software Company
From Everand
How to Build a SaaS Business: A Step-by-Step Guide to Starting and Operating a Software Company
Einar Uvslokk
4/5 (5)
How Will Automation Transform Your Business?
From Everand
How Will Automation Transform Your Business?
Ary S. Jr.
No ratings yet
How To Win Customers Every Day _ Volume 7: Data-Driven Selling: The Complete Guide to Success
From Everand
How To Win Customers Every Day _ Volume 7: Data-Driven Selling: The Complete Guide to Success
MAX EDITORIAL
No ratings yet
Data-Driven Marketing: The 15 Metrics Everyone in Marketing Should Know
From Everand
Data-Driven Marketing: The 15 Metrics Everyone in Marketing Should Know
Mark Jeffery
3.5/5 (19)
MDM for Multichannel Commerce Standard Requirements
From Everand
MDM for Multichannel Commerce Standard Requirements
Gerardus Blokdyk
No ratings yet
Mobile data terminal A Clear and Concise Reference
From Everand
Mobile data terminal A Clear and Concise Reference
Gerardus Blokdyk
No ratings yet
Mobile technology Second Edition
From Everand
Mobile technology Second Edition
Gerardus Blokdyk
No ratings yet
Mobile Unified Communications Third Edition
From Everand
Mobile Unified Communications Third Edition
Gerardus Blokdyk
No ratings yet
Mobile Cloud The Ultimate Step-By-Step Guide
From Everand
Mobile Cloud The Ultimate Step-By-Step Guide
Gerardus Blokdyk
No ratings yet
Marketbusters (Review and Analysis of Mcgrath and Macmillan's Book)
From Everand
Marketbusters (Review and Analysis of Mcgrath and Macmillan's Book)
BusinessNews Publishing
No ratings yet
Summary of Roland Smart's The Agile Marketer
From Everand
Summary of Roland Smart's The Agile Marketer
IRB Media
No ratings yet

Problem+Formulation+Exercise+Solutions

Uploaded by

Problem+Formulation+Exercise+Solutions

Uploaded by

Problem Formulation: What this pipeline phase entails

and why it’s important

You might also like