More About Planning A Machine Learning Project

The document discusses the importance of the Plan stage of the PACE workflow for machine learning projects. In the Plan stage, you define the problem, determine what type of machine learning model is needed to solve it (supervised vs. unsupervised, regression vs. classification), and verify that you have the necessary data and tools. Key aspects to consider include the goal of the project, the type of variables in the data, and whether the data needs preprocessing. The plan provides guidance but is not fixed - it can be reassessed as the project progresses.

Uploaded by

rayed786

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views3 pages

More About Planning A Machine Learning Project

Uploaded by

rayed786

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

More about planning a machine learning

project
The PACE workflow is something that can be used to keep the most experienced data professionals
on track in their projects. In this reading, you will learn more about the Plan stage of PACE and the
things that must be considered and determined to ensure a smooth and successful model
development process.

The Plan Stage

The PACE workflow is something that you can use to keep you on track, no matter the project you’re
working on. Each step is important to get to your final product. However, like many things, the most
important part is setting up the foundations of your project - The Plan Stage.

The Plan Stage is the part of the process where you first start thinking about what the problem
actually is, and what needs to be done to find a solution. You start to consider what tools you have
available to you, and how you’ll need to manipulate the dataset. Sometimes this can be as
straightforward as needing to create some visualizations for the data. Or, it can get as complex as
needing to make a predictive model using the dataset.

The plan that you create during this stage will be carried through the whole process, so it is
important to really make sure you’ve considered all the aspects and constraints of the project.
However, that isn’t to say that the plan you create must stay unchanging, you can absolutely
reassess as you progress. It is there to get you started heading in the right direction.

What should your Plan Include?

This section of the course focuses on machine learning algorithms, so we will use those types of
projects as the example here. However, you need to think about whether you need a model in the
first place! Many analytical tasks do not require the creation of a model, and you could spend time
creating something that is not necessary to what you’re trying to achieve.

Knowing What You Need For a Problem

The first thing to do when forming your plan is to consider the end goal. What exactly are you trying
to model, and what types of results from the model are needed? Something that can be determined
immediately is what type of machine learning model you’ll need. The two types that you’ve seen so
far are Supervised and Unsupervised models.

Supervised models are used to make predictions about unseen events. These types of models use
labeled data, and the model will use these labels and the predictor variables present to learn from
the dataset. And when given new data points, they’re able to make a prediction of the label. So, for
example, if you’re tasked with predicting rainfall amounts, you already know that you will need a
supervised learning model.

Unsupervised models, on the other hand, don’t really make predictions. They are used to discover
the natural structure of the data, finding relationships within unlabeled data. So, for example, if
you’re tasked with discovering relationships between customer habits and segment users, you know
you’ll need an unsupervised model.

Now, let’s go back to the rainfall example. Just from that problem statement alone, we know we
need a supervised learning model. However, not all supervised learning models are the same. The
two main types of supervised learning are Regression and Classification. There are different types of
regression models that you have practiced, with different models able to perform regression or
classification tasks.

Linear regression models are used when the result must be a continuous variable. As you have
learned, continuous variables are numerical values that can have an unlimited number of values
between the highest and lowest points of measurement. So if you need rainfall amounts in inches or
centimeters, you know a linear regression model is needed.

However, what if we don’t need exact rainfall amount predictions, but just whether or not it will rain
that day? This is where a classification model, such as a logistic regression model, would be more
appropriate. Classification models will deliver results as a categorical variable, where there is a finite
set of values that the variable can be. In this example, the model would only ever predict two results:
Will Rain or Won’t Rain.

Figuring out the tools you need

After you’ve determined what type of model you’re going to need, you must consider what you have
at your disposal to complete the project. Most importantly, you need to figure out if you have the data
you’ll need to build the model.

If your dataset only has one or two predictor variables, it probably will not produce a model that will
be useful. Or, if it has very few data points, the model’s performance will similarly suffer. On the
other hand, your dataset might be large and unwieldy, meaning that you’ll either need to clean it up
or cut it down to get it into a format that you can use to train the model. Having these issues means
that you’ll have to put in a little extra work to get it to usable form, or look elsewhere for data that will
be helpful to create the model.
Key Takeaways

 The PACE workflow for machine learning is very useful for planning out and solving data
driven problems.
 The Plan stage of PACE is one of the most important, setting you up for success throughout
the rest of the process
 In the Plan stage, you first consider the problem at hand and what will be needed to solve it
 You also verify that you have the tools and resources you need to solve the problem
 The Plan is not set in stone. It just serves as a foundational starting point for the rest of the
project

Unit-4 Part 2 Modelling and Evaluation
No ratings yet
Unit-4 Part 2 Modelling and Evaluation
35 pages
SS CH2 LM Ai Class X
No ratings yet
SS CH2 LM Ai Class X
92 pages
Topic 5-Types of Machine Learning
No ratings yet
Topic 5-Types of Machine Learning
31 pages
Dav Unit 3
No ratings yet
Dav Unit 3
50 pages
MODEL LIFECYCLE Class 12 Full PDF
100% (2)
MODEL LIFECYCLE Class 12 Full PDF
85 pages
AI Model Life Cycle
No ratings yet
AI Model Life Cycle
13 pages
Chapter 01 Introduction To ML
No ratings yet
Chapter 01 Introduction To ML
178 pages
How To Avoid Machine Learning Pitfalls
No ratings yet
How To Avoid Machine Learning Pitfalls
25 pages
Machine Learning 3
No ratings yet
Machine Learning 3
30 pages
Lones 2024
No ratings yet
Lones 2024
28 pages
Cluster
No ratings yet
Cluster
42 pages
Machine Learning Path
No ratings yet
Machine Learning Path
21 pages
Machine Learning Unit 1
No ratings yet
Machine Learning Unit 1
72 pages
Unit 1 Part 4
No ratings yet
Unit 1 Part 4
8 pages
Introduction To ML Unit-1
No ratings yet
Introduction To ML Unit-1
90 pages
How To Avoid Machine Learning Pitfalls: A Guide For Academic Researchers
No ratings yet
How To Avoid Machine Learning Pitfalls: A Guide For Academic Researchers
17 pages
Module3 DS PPT
No ratings yet
Module3 DS PPT
68 pages
Difference Between Machine Learning and Traditional Programming
No ratings yet
Difference Between Machine Learning and Traditional Programming
11 pages
AI Project Cycle Class 10 Notes
No ratings yet
AI Project Cycle Class 10 Notes
7 pages
Machine Learning Notes
100% (3)
Machine Learning Notes
134 pages
AI Project
No ratings yet
AI Project
14 pages
Chapter 2
No ratings yet
Chapter 2
4 pages
Module 1
No ratings yet
Module 1
50 pages
MCS224 Dec 2024 Solved
No ratings yet
MCS224 Dec 2024 Solved
22 pages
Data Exploration
No ratings yet
Data Exploration
5 pages
Types of ML
No ratings yet
Types of ML
4 pages
AI Project Cycle Class 10
No ratings yet
AI Project Cycle Class 10
11 pages
Project Life Cycle
No ratings yet
Project Life Cycle
14 pages
AI Project Cycle
No ratings yet
AI Project Cycle
4 pages
Air Quality Prediction Using Machine Learning
No ratings yet
Air Quality Prediction Using Machine Learning
29 pages
Lecture 2 Unit 1
No ratings yet
Lecture 2 Unit 1
60 pages
Model Lifecycle (XII)
No ratings yet
Model Lifecycle (XII)
10 pages
Machine Learning Report
No ratings yet
Machine Learning Report
58 pages
AI Session 5 Class 10
No ratings yet
AI Session 5 Class 10
19 pages
Module2 ch2
No ratings yet
Module2 ch2
36 pages
Biology A Levels P5 Help
50% (2)
Biology A Levels P5 Help
11 pages
Xii Std-Artifical Intelligence-Unit 2 Model Lifecycle
No ratings yet
Xii Std-Artifical Intelligence-Unit 2 Model Lifecycle
10 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
4 pages
Unit III - I
No ratings yet
Unit III - I
15 pages
Optimization of Sucrose Loss From Sugar Industry: European Chemical Bulletin December 2016
No ratings yet
Optimization of Sucrose Loss From Sugar Industry: European Chemical Bulletin December 2016
10 pages
An Enlightenment To Machine Learning - Resp
No ratings yet
An Enlightenment To Machine Learning - Resp
22 pages
Chilled Water Balancing For HVAC System Method Statement
0% (1)
Chilled Water Balancing For HVAC System Method Statement
2 pages
Time Series Analysis Interview Questions 1567388017
No ratings yet
Time Series Analysis Interview Questions 1567388017
6 pages
Machine Learning
No ratings yet
Machine Learning
45 pages
Exponential Smoothing
No ratings yet
Exponential Smoothing
5 pages
Tenability W.poh
No ratings yet
Tenability W.poh
6 pages
The Complete Guide To Time Series Analysis and Forecasting
No ratings yet
The Complete Guide To Time Series Analysis and Forecasting
20 pages
Time Series
No ratings yet
Time Series
27 pages
Peel Strength
No ratings yet
Peel Strength
13 pages
Gully Rehabilitation
No ratings yet
Gully Rehabilitation
26 pages
Soilds, Liquids & Gases Revision Booklet
No ratings yet
Soilds, Liquids & Gases Revision Booklet
4 pages
Rapid Landslide Susceptibility Mapping: Dunham
No ratings yet
Rapid Landslide Susceptibility Mapping: Dunham
11 pages
Calc1 Chapter 1
No ratings yet
Calc1 Chapter 1
104 pages
Me6301 QB
100% (1)
Me6301 QB
46 pages
2023 Regional Distribution of Intensity-Duration-Frequency (IDF) OMAN Rain
No ratings yet
2023 Regional Distribution of Intensity-Duration-Frequency (IDF) OMAN Rain
15 pages
A 3 Forecasting Final
No ratings yet
A 3 Forecasting Final
16 pages
Notes On SPSS
No ratings yet
Notes On SPSS
19 pages
Ptest 2 v1mrs ch9
No ratings yet
Ptest 2 v1mrs ch9
7 pages
Sample Space, S: Probability 1
No ratings yet
Sample Space, S: Probability 1
6 pages
Fluid Power Systems: Lecture #3: Physical Properties of Hydraulic Fluid
No ratings yet
Fluid Power Systems: Lecture #3: Physical Properties of Hydraulic Fluid
32 pages
Chapter 1 PDF
No ratings yet
Chapter 1 PDF
32 pages
J E T B:, Instructions To Authors
No ratings yet
J E T B:, Instructions To Authors
5 pages
Algebra Question Paper For Board Exam 3
No ratings yet
Algebra Question Paper For Board Exam 3
2 pages
Gas Demand Forecasting Methodology
No ratings yet
Gas Demand Forecasting Methodology
38 pages
1937 Clayton Leon Farrar - The Influence of Colony Populations On Honey Production 1937
No ratings yet
1937 Clayton Leon Farrar - The Influence of Colony Populations On Honey Production 1937
10 pages
Adaptive Signal Processing Lab
No ratings yet
Adaptive Signal Processing Lab
5 pages
Allen Ap 1999 c56 2029
No ratings yet
Allen Ap 1999 c56 2029
12 pages
Logit Marginal Effects
No ratings yet
Logit Marginal Effects
12 pages
HW6 483 Fall17
No ratings yet
HW6 483 Fall17
1 page
Optimizing AI and Machine Learning Solutions: Your ultimate guide to building high-impact ML/AI solutions (English Edition)
From Everand
Optimizing AI and Machine Learning Solutions: Your ultimate guide to building high-impact ML/AI solutions (English Edition)
Mirza Rahim Baig
No ratings yet
Beyond The Algorithm: Practical Machine Learning Strategies
From Everand
Beyond The Algorithm: Practical Machine Learning Strategies
Jane Onwuchekwa
No ratings yet
Predicting the Unpredictable: Pragmatic Approaches to Estimating Project Schedule or Cost
From Everand
Predicting the Unpredictable: Pragmatic Approaches to Estimating Project Schedule or Cost
Johanna Rothman
4.5/5 (4)
Practical Full Stack Machine Learning: A Guide to Build Reliable, Reusable, and Production-Ready Full Stack ML Solutions
From Everand
Practical Full Stack Machine Learning: A Guide to Build Reliable, Reusable, and Production-Ready Full Stack ML Solutions
Alok Kumar
No ratings yet
Better Embedded System Software
From Everand
Better Embedded System Software
Philip Koopman
No ratings yet
Slick SaaS Development: Process Templates
From Everand
Slick SaaS Development: Process Templates
Kangethe Mbugua
No ratings yet
Love to Excel: A Financial Modeling Masterclass for the Analyst in You
From Everand
Love to Excel: A Financial Modeling Masterclass for the Analyst in You
Jules Nkansah
No ratings yet
MACHINE LEARNING FOR BEGINNERS: A Practical Guide to Understanding and Applying Machine Learning Concepts (2023 Beginner Crash Course)
From Everand
MACHINE LEARNING FOR BEGINNERS: A Practical Guide to Understanding and Applying Machine Learning Concepts (2023 Beginner Crash Course)
Elaine Tate
No ratings yet
Implementing Computer Systems for Small & Medium Businesses
From Everand
Implementing Computer Systems for Small & Medium Businesses
Randy Rolleman
No ratings yet
PYTHON DATA SCIENCE: A Practical Guide to Mastering Python for Data Science and Artificial Intelligence (2023 Beginner Crash Course)
From Everand
PYTHON DATA SCIENCE: A Practical Guide to Mastering Python for Data Science and Artificial Intelligence (2023 Beginner Crash Course)
Calvert Long
No ratings yet
200 Erp Questions: The Most Important Things To Think About When Considering Microsoft Dynamics 365 Business Central
From Everand
200 Erp Questions: The Most Important Things To Think About When Considering Microsoft Dynamics 365 Business Central
Sune Lohse
No ratings yet
Demonstrating Design for Six Sigma
From Everand
Demonstrating Design for Six Sigma
Robert Perrine
3/5 (2)
Mastering Large-Scale Solutions: A Comprehensive Guide to Efficiently Running and Optimizing Big Systems
From Everand
Mastering Large-Scale Solutions: A Comprehensive Guide to Efficiently Running and Optimizing Big Systems
Adil Khan
No ratings yet
Ishikawa Diagram: Anticipate and solve problems within your business
From Everand
Ishikawa Diagram: Anticipate and solve problems within your business
50minutes
5/5 (3)
Microsoft Excel Statistical and Advanced Functions for Decision Making
From Everand
Microsoft Excel Statistical and Advanced Functions for Decision Making
Palani Murugappan
No ratings yet
IT Technical best practices: How to Reduce Agile cycle time with reusable code?
From Everand
IT Technical best practices: How to Reduce Agile cycle time with reusable code?
Shanthi Vemulapalli
No ratings yet
Confident Programmer Problem Solver: Six Steps Programming Students Can Take to Solve Coding Problems
From Everand
Confident Programmer Problem Solver: Six Steps Programming Students Can Take to Solve Coding Problems
Cloudy Heaven Games
No ratings yet
Applied Predictive Modeling: An Overview of Applied Predictive Modeling
From Everand
Applied Predictive Modeling: An Overview of Applied Predictive Modeling
Steven Taylor
No ratings yet
Scrum: What You Need to Know About This Agile Methodology for Project Management
From Everand
Scrum: What You Need to Know About This Agile Methodology for Project Management
Robert McCarthy
No ratings yet
Excel :The Ultimate Comprehensive Step-by-Step Guide to Strategies in Excel Programming (Formulas, Shortcuts and Spreadsheets): 2
From Everand
Excel :The Ultimate Comprehensive Step-by-Step Guide to Strategies in Excel Programming (Formulas, Shortcuts and Spreadsheets): 2
Kevin Clark
No ratings yet
A Joosr Guide to... The McKinsey Way by Ethan Rasiel: Using the Techniques of the World’s Top Strategic Consultants to Help You and Your Business
From Everand
A Joosr Guide to... The McKinsey Way by Ethan Rasiel: Using the Techniques of the World’s Top Strategic Consultants to Help You and Your Business
Joosr
No ratings yet

More About Planning A Machine Learning Project

Uploaded by

More About Planning A Machine Learning Project

Uploaded by

More about planning a machine learning

The Plan Stage

What should your Plan Include?

Knowing What You Need For a Problem

Figuring out the tools you need

You might also like