
Chap 10-Machine Learning

Machine learning (ML) is a subset of artificial intelligence that allows computers to learn from data and improve over time without explicit programming. The machine learning life cycle consists of six steps: planning, data preparation, model engineering, model evaluation, model deployment, and monitoring and maintenance. There are three main types of machine learning: supervised, unsupervised, and reinforcement learning, each with distinct characteristics and applications.

MACHINE LEARNING
WHAT IS MACHINE LEARNING?
 Machine learning (ML) is a branch of artificial intelligence (AI) that enables
computers to “self-learn” from training data and improve over time, without being
explicitly programmed.
 Machine learning algorithms are able to detect patterns in data and learn from
them, in order to make their own predictions.
 In short, machine learning algorithms and models learn through experience.
 In traditional programming, a computer engineer writes a series of directions that
instruct a computer how to transform input data into a desired output.
 Instructions are mostly based on an IF-THEN structure: when certain conditions are
met, the program executes a specific action.
 Machine learning, on the other hand, is an automated process that enables
machines to solve problems with little or no human input, and take actions based on
past observations.
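The contrast between a hand-written IF-THEN rule and a rule learned from data can be sketched as follows. This is only an illustration under stated assumptions: the spam example, the link-count feature, the function names, and the toy data are all hypothetical and not part of the original slides.

```python
# Traditional programming: the developer hard-codes the IF-THEN rule.
def rule_based_is_spam(num_links):
    # Hypothetical threshold picked by hand.
    return num_links > 5


# Machine learning: the threshold is chosen from labelled examples.
def learn_threshold(examples):
    """Return the link-count threshold with the fewest training errors."""
    candidates = sorted({links for links, _ in examples})
    best_threshold, best_errors = candidates[0], len(examples) + 1
    for threshold in candidates:
        errors = sum((links > threshold) != is_spam for links, is_spam in examples)
        if errors < best_errors:
            best_threshold, best_errors = threshold, errors
    return best_threshold


# Toy training data (made up): (number of links in the email, is it spam?)
training_data = [(0, False), (1, False), (2, False), (3, True), (4, True), (7, True)]
learned = learn_threshold(training_data)


def learned_is_spam(num_links):
    return num_links > learned
```

On this toy data the learned rule flags an email with three links, while the hand-written rule does not. The point is only that the rule comes from the examples rather than from the developer.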
Machine Learning | Traditional Programming
Machine learning is a subset of artificial intelligence (AI) that focuses on learning from data to develop an algorithm that can be used to make predictions. | In traditional programming, developers write rule-based code according to the problem statement.
Machine learning uses a data-driven approach. A model is typically trained on historical data and then used to make predictions on new data. | Traditional programming is typically rule-based and deterministic. It has no self-learning features like machine learning and AI.
ML can find patterns and insights in large datasets that might be difficult for humans to discover. | Traditional programming depends entirely on the intelligence of the developers, so its capability is limited.
Machine learning is now used in various AI-based tasks such as chatbots, question answering, and self-driving cars. | Traditional programming is often used to build applications and software systems that have specific functionality.
MACHINE LEARNING LIFE CYCLE
The 6 steps in a standard machine learning life cycle:
1. Planning
2. Data Preparation
3. Model Engineering
4. Model Evaluation
5. Model Deployment
6. Monitoring and Maintenance
PLANNING
 The planning phase involves assessing
the scope, success metric, and feasibility
of the ML application.
 For example: do we require machine
learning? Can we achieve similar
results with simple programming?
 Define clear and measurable success
metrics for the business, for the machine learning
model (accuracy, F1 score, AUC), and for the
economics (key performance indicators).
The feasibility assessment covers:
 Availability of the data: do we have enough data available to train the model?
Can we get a constant supply of new and updated data? Can we use synthetic data
to reduce the cost?
 Applicability: will this solution solve the problem or improve the current process?
Can we even use machine learning to solve this issue?
 Legal constraints: do we have permission from the local government to
implement this solution? Are we following an ethical way of collecting the data?
What will be the impact of this application on society?
 Robustness and scalability: is this application robust enough? Is it scalable?
 Explainability: can we explain how the machine learning model is coming up with
the results? Can we explain the deep neural networks' inner workings?
 Availability of resources: do we have enough computing, storage, network, and
human resources? Do we have qualified professionals?
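The model-level success metrics named in the planning phase (accuracy and F1 score) can be computed directly from true and predicted labels. The sketch below assumes binary labels with 1 as the positive class. The function name and the toy labels are hypothetical, and AUC is left out because it needs predicted scores rather than hard labels.

```python
def classification_metrics(y_true, y_pred):
    """Return accuracy, precision, recall and F1 for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)  # true positives
    fp = sum(t == 0 and p == 1 for t, p in pairs)  # false positives
    fn = sum(t == 1 and p == 0 for t, p in pairs)  # false negatives
    tn = sum(t == 0 and p == 0 for t, p in pairs)  # true negatives
    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Toy labels, made up for illustration.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
metrics = classification_metrics(y_true, y_pred)
```

Agreeing on a target value for such a metric before any modelling starts is what makes the success criterion measurable.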
DATA PREPARATION
The data preparation phase is
further divided into four parts:
1. Data collection and labeling
2. Data cleaning
3. Data processing
4. Data management
Data collection and labeling:
Data collection and labeling require most of the company's resources: money, time,
professionals, subject matter experts, and legal agreements.
Data cleaning:
We clean the data by imputing missing values, analyzing wrongly labeled data,
removing outliers, and reducing noise. We then create a data pipeline to automate
this process and perform data quality verification.
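Two of the cleaning steps above, imputing missing values and removing outliers, might look like the following sketch. It assumes median imputation and the common 1.5 x IQR outlier rule, neither of which the slides prescribe. The function names and the sample values are hypothetical.

```python
import statistics


def impute_missing(values):
    """Replace None entries with the median of the observed values."""
    observed = [v for v in values if v is not None]
    median = statistics.median(observed)
    return [median if v is None else v for v in values]


def remove_outliers(values, k=1.5):
    """Drop values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]


# Toy measurements (made up): one missing value and one obvious outlier.
raw = [10, 12, None, 11, 13, 500, 12]
filled = impute_missing(raw)
cleaned = remove_outliers(filled)
```

In a real pipeline these functions would run automatically on every new batch of data, followed by quality checks.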
Data processing:
The data processing stage involves feature selection, dealing with imbalanced classes,
feature engineering, data augmentation, and normalizing and scaling the data.
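Normalizing and scaling can be illustrated with two common transformations: min-max scaling to the range [0, 1] and standardization to zero mean and unit variance. This is a minimal sketch with hypothetical function names and made-up values. It assumes the feature is not constant, since both transformations would otherwise divide by zero.

```python
import statistics


def min_max_scale(values):
    """Rescale values linearly so the smallest becomes 0 and the largest 1."""
    low, high = min(values), max(values)
    return [(v - low) / (high - low) for v in values]


def standardize(values):
    """Subtract the mean and divide by the (population) standard deviation."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    return [(v - mean) / std for v in values]


# Toy feature (made up), e.g. house sizes in square metres.
sizes = [50.0, 75.0, 100.0, 125.0, 150.0]
scaled = min_max_scale(sizes)
z_scores = standardize(sizes)
```

Whichever scaling is used, its parameters (minimum, maximum, mean, standard deviation) should be computed on the training data and reused unchanged on new data.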
Data management:
Finally, we will figure out data storage solutions, data versioning for reproducibility,
storing metadata, and creating ETL pipelines. This part will ensure a constant data
stream for model training.
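One simple way to approach data versioning for reproducibility is to fingerprint each dataset snapshot with a content hash. The sketch below is an assumption about how this could be done, not a description of any particular tool. The function name and the sample rows are hypothetical.

```python
import hashlib
import json


def dataset_version(rows):
    """Return a short, deterministic fingerprint of JSON-serializable rows."""
    # Sorting keys makes the fingerprint independent of dictionary key order.
    canonical = json.dumps(rows, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:12]


v1 = dataset_version([{"id": 1, "label": "spam"}, {"id": 2, "label": "ham"}])
v1_again = dataset_version([{"label": "spam", "id": 1}, {"label": "ham", "id": 2}])
v2 = dataset_version([{"id": 1, "label": "spam"}, {"id": 2, "label": "spam"}])
```

Storing the fingerprint next to each trained model records exactly which data the model was trained on. Any change to a label or value produces a different version.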
MODEL ENGINEERING
 Building an effective model architecture through extensive research.
 Defining model metrics.
 Training and validating the model on the training and validation datasets.
 Tracking experiments, metadata, features, code changes, and machine learning
pipelines.
 Performing model compression and ensembling.
 Interpreting the results with the help of domain experts.
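Training on one part of the data, validating on another, and recording the outcome of each run can be sketched as below. The "model" here is deliberately trivial (a single threshold). The helper names, the toy data, and the log format are all hypothetical.

```python
import random


def train_validation_split(rows, validation_fraction=0.25, seed=0):
    """Shuffle a copy of the rows and split it into training and validation sets."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_val = int(len(shuffled) * validation_fraction)
    return shuffled[n_val:], shuffled[:n_val]


def accuracy(threshold, rows):
    """Fraction of rows where the rule `x > threshold` matches the label."""
    return sum((x > threshold) == label for x, label in rows) / len(rows)


# Toy labelled data (made up): the label is True exactly when x > 10.
data = [(x, x > 10) for x in range(20)]
train, validation = train_validation_split(data)

# "Training": pick the threshold that does best on the training set only.
best = max(range(20), key=lambda t: accuracy(t, train))

# Track the experiment so that runs can be compared later.
experiment_log = [{
    "threshold": best,
    "train_accuracy": accuracy(best, train),
    "validation_accuracy": accuracy(best, validation),
}]
```

The validation accuracy is the number to watch: it is measured on data the model did not see during training.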
MODEL EVALUATION
 Now that we have finalized the
version of the model, it is time to test
it against various metrics. Why? So that we can
ensure that our model is ready for
production.
 We first evaluate the model on a test
dataset and involve
subject matter experts to identify the
errors in the predictions.
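Collecting the wrong predictions on the test set, so that subject matter experts can review them, might be sketched as follows. The `predict` function is a hypothetical stand-in for the finalized model, and the test examples are made up.

```python
def misclassified(examples, predict):
    """Return the test examples whose prediction differs from the true label."""
    errors = []
    for features, label in examples:
        predicted = predict(features)
        if predicted != label:
            errors.append({"features": features, "expected": label, "predicted": predicted})
    return errors


# Hypothetical stand-in for the finalized model.
def predict(features):
    return "spam" if features["links"] > 2 else "ham"


# Toy test set (made up).
test_set = [
    ({"links": 0}, "ham"),
    ({"links": 5}, "spam"),
    ({"links": 1}, "spam"),  # the stand-in model gets this one wrong
    ({"links": 3}, "ham"),   # and this one
]
errors_for_review = misclassified(test_set, predict)
test_accuracy = 1 - len(errors_for_review) / len(test_set)
```

A single accuracy number hides which cases fail. The list of individual errors is what the experts actually inspect.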
MODEL DEPLOYMENT
 In this phase, we deploy machine learning
models to the current system.
 For example: introducing automatic warehouse
labeling using the shape of the product. We will
be deploying a computer vision model into the
current system, which will use the images from
the camera to print the labels.
 Generally, the models can be deployed on the
cloud, on a local server, in a web browser, as packaged
software, or on an edge device. After that, you can
use an API, web app, plugin, or dashboard to
access the predictions.
MONITORING AND MAINTENANCE
 After deploying the model to production, we need to constantly monitor
and improve the system. We will be monitoring model metrics, hardware
and software performance, and customer satisfaction.
 The monitoring is done completely automatically, and the professionals are
notified about the anomalies, reduced model and system performance, and
bad customer reviews.
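Automated monitoring of a model metric can be sketched as a rolling window with an alert threshold. This is a minimal illustration under stated assumptions: the class name, the window size, and the 0.8 accuracy threshold are hypothetical, and a production system would also watch hardware, software, and customer-facing signals.

```python
from collections import deque


class AccuracyMonitor:
    """Track accuracy over the most recent predictions and flag drops."""

    def __init__(self, window=5, min_accuracy=0.8):
        self.results = deque(maxlen=window)  # keeps only the latest outcomes
        self.min_accuracy = min_accuracy

    def rolling_accuracy(self):
        return sum(self.results) / len(self.results)

    def record(self, was_correct):
        """Record one outcome; return True if an alert should be raised."""
        self.results.append(bool(was_correct))
        window_full = len(self.results) == self.results.maxlen
        return window_full and self.rolling_accuracy() < self.min_accuracy


monitor = AccuracyMonitor(window=5, min_accuracy=0.8)
# Made-up stream of outcomes: the model starts failing near the end.
outcomes = [True, True, True, True, True, False, False, True]
alerts = [monitor.record(outcome) for outcome in outcomes]
```

In this run the alert fires only once the rolling accuracy falls below the threshold, which is the moment the responsible professionals would be notified.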
TYPES OF MACHINE LEARNING
There are three types of machine learning:
1. Supervised Machine Learning
2. Unsupervised Machine Learning
3. Reinforcement Learning
SUPERVISED LEARNING
 In supervised learning, a model
is trained on a
“labelled dataset”.
 Labelled datasets have both input
and output parameters.
 Supervised learning algorithms
learn to map
inputs to the correct outputs.
 Both the training and validation
datasets are labelled.
There are two main categories of supervised learning:
1. Classification
2. Regression
Classification: Classification deals with predicting categorical target
variables, which represent discrete classes or labels. For instance,
classifying emails as spam or not spam, or predicting whether a patient
has a high risk of heart disease. Classification algorithms learn to map
the input features to one of the predefined classes.
Regression: Regression, on the other hand, deals with predicting
continuous target variables, which represent numerical values. For
example, predicting the price of a house based on its size, location, and
amenities, or forecasting the sales of a product. Regression algorithms
learn to map the input features to a continuous numerical value.
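The difference between the two categories shows up in the type of the output: a class label versus a number. The sketch below uses a one-nearest-neighbour classifier and a least-squares line as simple stand-ins. These two algorithms, the function names, and the toy numbers are assumptions chosen for illustration, not methods named in the slides.

```python
def nearest_neighbour_classify(train, x):
    """Classification: return the label of the closest training point (1-NN)."""
    _, label = min(train, key=lambda pair: abs(pair[0] - x))
    return label


def fit_line(xs, ys):
    """Regression: least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    return slope, mean_y - slope * mean_x


# Classification: discrete output from a made-up risk score.
labelled = [(1.0, "low risk"), (2.0, "low risk"), (8.0, "high risk"), (9.0, "high risk")]
predicted_class = nearest_neighbour_classify(labelled, 7.2)

# Regression: continuous output (price) from house size; toy numbers on an exact line.
sizes = [50, 100, 150]
prices = [100, 200, 300]
slope, intercept = fit_line(sizes, prices)
predicted_price = slope * 120 + intercept
```

Both models learn from labelled examples. Only the kind of target variable differs.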
Advantages of Supervised Machine Learning
 Supervised learning models can achieve high accuracy because they are trained on labelled
data.
 The decision-making process of supervised learning models is often interpretable.
 Pre-trained models can often be reused, which saves time and resources compared with
developing new models from scratch.
Disadvantages of Supervised Machine Learning
 It may struggle with unseen or unexpected
patterns that are not present in the training data.
 It can be time-consuming and costly because it relies on labelled data.
 It may generalize poorly to new data.
UNSUPERVISED LEARNING
 Unsupervised
learning is a type of machine learning
technique in which an algorithm
discovers patterns and relationships
using unlabeled data.
 The primary goal of
Unsupervised learning is often
to discover hidden patterns,
similarities, or clusters within
the data, which can then be
used for various purposes, such
as data exploration,
visualization, dimensionality
reduction, and more.
There are two main categories of unsupervised learning:
1. Clustering
2. Association
Clustering
Clustering is the process of grouping data points into clusters based on their similarity.
This technique is useful for identifying patterns and relationships in data without the
need for labeled examples.
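Grouping points by similarity can be illustrated with k-means, one widely used clustering algorithm (the slides do not name a specific one). The sketch below works in one dimension with hypothetical starting centroids and made-up points.

```python
def k_means_1d(points, centroids, iterations=10):
    """Lloyd's algorithm in one dimension, starting from the given centroids."""
    for _ in range(iterations):
        # Assignment step: put each point in the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids, clusters


# Toy data (made up): two visibly separate groups, no labels given.
points = [1.0, 1.5, 2.0, 9.0, 9.5, 10.0]
centroids, clusters = k_means_1d(points, centroids=[0.0, 5.0])
```

No labels are supplied anywhere. The two groups emerge only from the distances between the points.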
Association
Association rule learning is a technique for discovering relationships between items in a
dataset. It identifies rules indicating that the presence of one item implies the presence of
another item with a specific probability.
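The "specific probability" attached to a rule is usually reported as its confidence, alongside the rule's support. A minimal sketch, assuming market-basket style transactions; the function names and the baskets are hypothetical.

```python
def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(items <= t for t in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated P(consequent | antecedent) for the rule antecedent -> consequent."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Toy shopping baskets (made up).
baskets = [
    {"bread", "butter"},
    {"bread", "butter", "milk"},
    {"bread", "milk"},
    {"milk"},
]
rule_support = support(baskets, {"bread", "butter"})
rule_confidence = confidence(baskets, {"bread"}, {"butter"})
```

Here the rule "bread implies butter" holds in two of the three baskets that contain bread, so its confidence is two thirds.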
REINFORCEMENT MACHINE LEARNING
 A reinforcement learning
algorithm is a
learning method that
interacts with its
environment by producing
actions and discovering
errors or rewards. Trial-and-error
search and delayed reward are the most
relevant characteristics of
reinforcement learning.
Positive reinforcement
 Rewards the agent for taking a desired action.
 Encourages the agent to repeat the behavior.
 Examples: Giving a treat to a dog for sitting, providing a point in a game for a correct
answer.

Negative reinforcement
 Removes an undesirable stimulus to encourage a desired behavior.
 Encourages the agent to repeat the behavior that removed the stimulus.
 Examples: Turning off a loud buzzer when a lever is pressed, avoiding a penalty by
completing a task.
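Learning from rewards by trial and error can be sketched with an epsilon-greedy agent on a two-action problem. This is a simplified illustration under stated assumptions: the rewards are fixed numbers (+1 for the desired action, -1 as a penalty for the other), and the function name, step count, and exploration rate are hypothetical.

```python
import random


def run_bandit(rewards, steps=100, epsilon=0.2, seed=0):
    """Epsilon-greedy agent on a bandit where action i always pays rewards[i]."""
    rng = random.Random(seed)
    values = [0.0] * len(rewards)  # current reward estimate per action
    counts = [0] * len(rewards)    # how often each action was taken
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.randrange(len(rewards))  # explore: try an action at random
        else:
            action = max(range(len(rewards)), key=lambda a: values[a])  # exploit
        reward = rewards[action]
        counts[action] += 1
        # Incremental average of the rewards seen for this action.
        values[action] += (reward - values[action]) / counts[action]
    return values, counts


# Action 0 is penalized (-1), action 1 is rewarded (+1).
values, counts = run_bandit(rewards=[-1.0, 1.0])
```

The agent is never told which action is correct. It ends up choosing the rewarded action most of the time because that action's estimated value is higher.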
CASE STUDIES
 Customer churn Prediction
