Lecture 5: Supervised Learning
Prof. Dr. Md. Rakib Hassan
Dept. of Computer Science and Mathematics,
Bangladesh Agricultural University.
Email: rakib@bau.edu.bd
Supervised Learning
❖ A supervised learning algorithm takes a known set of
input data (the training set) and known responses to
the data (output) and trains a model to generate
reasonable predictions for the response to new input
data.
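For illustration (not part of the original slides), here is a minimal sketch of that workflow in Python with scikit-learn; the synthetic data and the choice of model are arbitrary:

    # Supervised learning workflow: train on known (input, response) pairs,
    # then predict responses for new inputs.
    from sklearn.datasets import make_classification
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split

    # Known inputs (X) and known responses (y) form the training set.
    X, y = make_classification(n_samples=200, n_features=4, random_state=0)
    X_train, X_new, y_train, y_new = train_test_split(X, y, random_state=0)

    model = LogisticRegression().fit(X_train, y_train)  # train the model
    print(model.predict(X_new[:5]))   # predictions for new input data
    print(model.score(X_new, y_new))  # accuracy on held-out data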
Supervised Learning Techniques
❖ Classification:
❑ It predicts discrete responses—for example, whether an
email is genuine or spam, or whether a tumor is small,
medium, or large.
❑ Classification models are trained to classify data into
categories.
❑ Applications include medical imaging, speech recognition,
and credit scoring.
❖ Regression:
❑ Predicts continuous responses—for example, changes in
temperature or fluctuations in electricity demand.
❑ Applications include forecasting stock prices, handwriting
recognition, and acoustic signal processing.
Selecting the Right Algorithm
❖ Speed of training
❖ Memory usage
❖ Predictive accuracy on new data
❖ Transparency or interpretability (how easily you can
understand the reasons an algorithm makes its
predictions)
Binary vs. Multiclass Classification
❖ Binary classification problem:
❑ Each training or test item (instance) can belong to only one of
two classes, for example, deciding whether an email is genuine or
spam.
❖ Multiclass classification problem:
❑ Each instance can belong to one of more than two classes, for
example, classifying an image as a dog, a cat, or another animal.
❖ A multiclass classification problem is generally more
challenging because it requires a more complex
model.
Common Classification Algorithms
❖ Logistic Regression
❑ How It Works
o Fits a model that can predict the probability of a binary response
belonging to one class or the other. Because of its simplicity,
logistic regression is commonly used as a starting point for binary
classification problems.
❑ Best Used...
o When data can be clearly separated by a single, linear boundary
o As a baseline for evaluating more complex classification methods
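A minimal scikit-learn sketch (illustrative only; the one-feature toy data is invented for this example):

    # Logistic regression: predicts the probability that an instance
    # belongs to one of two classes.
    import numpy as np
    from sklearn.linear_model import LogisticRegression

    X = np.array([[0.5], [1.0], [1.5], [3.0], [3.5], [4.0]])  # one feature
    y = np.array([0, 0, 0, 1, 1, 1])  # two linearly separable classes

    clf = LogisticRegression().fit(X, y)
    print(clf.predict_proba([[2.0]]))  # P(class 0), P(class 1) at x = 2.0
    print(clf.predict([[2.0]]))        # hard class label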
Common Classification Algorithms
❖ k-Nearest Neighbor (kNN)
❑ How It Works
o kNN categorizes objects based on the classes of their nearest
neighbors in the dataset, assuming that objects near each other
are similar. Distance metrics, such as Euclidean, city block,
cosine, and Chebyshev, are used to find the nearest neighbors.
❑ Best Used...
o When you need a simple algorithm to establish benchmark
learning rules
o When memory usage of the trained model is a lesser concern
o When prediction speed of the trained model is a lesser concern
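An illustrative scikit-learn sketch (the dataset, k, and metric are arbitrary choices):

    # kNN: classify a point by majority vote among its k nearest
    # training points under a chosen distance metric.
    from sklearn.datasets import load_iris
    from sklearn.neighbors import KNeighborsClassifier

    X, y = load_iris(return_X_y=True)
    # metric may be 'euclidean', 'manhattan' (city block), 'chebyshev', ...
    knn = KNeighborsClassifier(n_neighbors=5, metric='euclidean').fit(X, y)
    print(knn.predict(X[:3]))  # predicted classes for three query points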
Common Classification Algorithms
❖ Support Vector Machine (SVM)
❑ How It Works
o Classifies data by finding the linear decision boundary (hyperplane) that
separates all data points of one class from those of the other class.
o The best hyperplane for an SVM is the one with the largest margin
between the two classes, when the data is linearly separable.
o If the data is not linearly separable, a loss function is used to penalize
points on the wrong side of the hyperplane.
o SVMs sometimes use a kernel transformation to map data that is not
linearly separable into higher dimensions, where a linear decision
boundary can be found.
❑ Best Used...
o For data that has exactly two classes (you can also use it for multiclass
classification with a technique called error-correcting output codes)
o For high-dimensional, nonlinearly separable data
o When you need a classifier that’s simple, easy to interpret, and accurate
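A short scikit-learn sketch contrasting a linear SVM with a kernel SVM on data that is not linearly separable (toy data; parameters arbitrary):

    from sklearn.datasets import make_moons
    from sklearn.svm import SVC

    # Two interleaving half-moons: no single line separates the classes.
    X, y = make_moons(n_samples=200, noise=0.2, random_state=0)

    linear_svm = SVC(kernel='linear', C=1.0).fit(X, y)
    kernel_svm = SVC(kernel='rbf', C=1.0).fit(X, y)  # kernel transformation
    print('linear:', linear_svm.score(X, y))  # underfits the curved boundary
    print('rbf:', kernel_svm.score(X, y))     # separates it in kernel space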
Common Classification Algorithms
❖ Neural Network
❑ How It Works
o Inspired by the human brain, a neural network consists of highly
connected networks of neurons that relate the inputs to the
desired outputs.
o The network is trained by iteratively modifying the strengths of
the connections so that given inputs map to the correct response.
❑ Best Used...
o For modeling highly nonlinear systems
o When data is available incrementally and you wish to constantly
update the model
o When there could be unexpected changes in your input data
o When model interpretability is not a key concern
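A minimal sketch using scikit-learn's multilayer perceptron (the layer sizes and data are arbitrary):

    from sklearn.datasets import make_classification
    from sklearn.neural_network import MLPClassifier

    X, y = make_classification(n_samples=300, n_features=10, random_state=1)

    # Two hidden layers of connected neurons; training iteratively adjusts
    # the connection weights so inputs map to the correct response.
    nn = MLPClassifier(hidden_layer_sizes=(16, 8), max_iter=1000,
                       random_state=1).fit(X, y)
    print(nn.score(X, y))           # training accuracy
    nn.partial_fit(X[:10], y[:10])  # incremental update as new data arrives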
Common Classification Algorithms
❖ Naïve Bayes
❑ How It Works
o A naive Bayes classifier assumes that the presence of a particular
feature in a class is unrelated to the presence of any other feature.
o It classifies new data based on the highest probability of its
belonging to a particular class.
❑ Best Used...
o For a small dataset containing many parameters
o When you need a classifier that’s easy to interpret
o When the model will encounter scenarios that weren’t in the
training data, as is the case with many financial and medical
applications
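A minimal scikit-learn sketch using the Gaussian variant (the dataset is an arbitrary choice):

    # Gaussian naive Bayes: features are assumed conditionally independent
    # given the class; prediction picks the most probable class.
    from sklearn.datasets import load_iris
    from sklearn.naive_bayes import GaussianNB

    X, y = load_iris(return_X_y=True)
    nb = GaussianNB().fit(X, y)
    print(nb.predict(X[:2]))        # most probable class for each sample
    print(nb.predict_proba(X[:2]))  # class membership probabilities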
Common Classification Algorithms
❖ Discriminant Analysis
❑ How It Works
o Discriminant analysis classifies data by finding linear combinations
of features.
o Discriminant analysis assumes that different classes generate data
based on Gaussian distributions.
o Training a discriminant analysis model involves finding the
parameters for a Gaussian distribution for each class. The
distribution parameters are used to calculate boundaries, which
can be linear or quadratic functions. These boundaries are used to
determine the class of new data.
❑ Best Used...
o When you need a simple model that is easy to interpret
o When memory usage during training is a concern
o When you need a model that is fast to predict
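An illustrative scikit-learn sketch of both the linear and quadratic variants (dataset arbitrary):

    # Fit one Gaussian per class; boundaries are linear if classes share a
    # covariance matrix (LDA) and quadratic if each has its own (QDA).
    from sklearn.datasets import load_iris
    from sklearn.discriminant_analysis import (
        LinearDiscriminantAnalysis, QuadraticDiscriminantAnalysis)

    X, y = load_iris(return_X_y=True)
    lda = LinearDiscriminantAnalysis().fit(X, y)     # linear boundaries
    qda = QuadraticDiscriminantAnalysis().fit(X, y)  # quadratic boundaries
    print(lda.score(X, y), qda.score(X, y))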
Common Classification Algorithms
❖ Decision Tree
❑ How It Works
o A decision tree lets you predict responses to data by following the
decisions in the tree from the root (beginning) down to a leaf
node.
o A tree consists of branching conditions where the value of a
predictor is compared to a trained weight. The number of
branches and the values of weights are determined in the training
process. Additional modification, or pruning, may be used to
simplify the model.
❑ Best Used...
o When you need an algorithm that is easy to interpret and fast to
fit
o To minimize memory usage
o When high predictive accuracy is not a requirement
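A minimal scikit-learn sketch; limiting max_depth plays the role of pruning here (dataset and depth are arbitrary):

    from sklearn.datasets import load_iris
    from sklearn.tree import DecisionTreeClassifier, export_text

    X, y = load_iris(return_X_y=True)
    # Predictions follow branching comparisons from the root to a leaf.
    tree = DecisionTreeClassifier(max_depth=2, random_state=0).fit(X, y)
    print(export_text(tree))    # the learned branching conditions
    print(tree.predict(X[:3]))  # predicted classes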
Common Classification Algorithms
❖ Bagged and Boosted Decision Trees
❑ How They Work
o In these ensemble methods, several “weaker” decision trees are
combined into a “stronger” ensemble.
o A bagged decision tree consists of trees that are trained
independently on data that is bootstrapped from the input data.
o Boosting involves creating a strong learner by iteratively adding
“weak” learners and adjusting the weight of each weak learner to
focus on misclassified examples.
❑ Best Used...
o When predictors are categorical (discrete) or behave nonlinearly
o When the time taken to train a model is less of a concern
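An illustrative scikit-learn sketch, using a random forest as the bagged ensemble and AdaBoost as the boosted one (both choices and all parameters are arbitrary):

    from sklearn.datasets import make_classification
    from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier

    X, y = make_classification(n_samples=400, n_features=8, random_state=0)

    # Bagging: trees trained independently on bootstrap samples.
    bagged = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
    # Boosting: weak trees added iteratively, reweighting mistakes.
    boosted = AdaBoostClassifier(n_estimators=100, random_state=0).fit(X, y)
    print(bagged.score(X, y), boosted.score(X, y))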
Common Regression Algorithms
❖ Linear Regression
❑ How It Works
o Linear regression is a statistical modeling technique used to
describe a continuous response variable as a linear function of one
or more predictor variables. Because linear regression models are
simple to interpret and easy to train, they are often the first model
to be fitted to a new dataset.
❑ Best Used...
o When you need an algorithm that is easy to interpret and fast to
fit
o As a baseline for evaluating other, more complex, regression
models
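A minimal scikit-learn sketch with invented data:

    import numpy as np
    from sklearn.linear_model import LinearRegression

    X = np.array([[1.0], [2.0], [3.0], [4.0]])  # one predictor variable
    y = np.array([2.1, 3.9, 6.2, 7.8])          # roughly y = 2x

    reg = LinearRegression().fit(X, y)
    print(reg.coef_, reg.intercept_)  # learned slope and intercept
    print(reg.predict([[5.0]]))       # prediction for a new input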
Common Regression Algorithms
❖ Nonlinear Regression
❑ How It Works
o Nonlinear regression is a statistical modeling technique that helps
describe nonlinear relationships in experimental data.
o Nonlinear regression models are generally assumed to be
parametric, where the model is described as a nonlinear equation.
❑ Best Used...
o When data has strong nonlinear trends and cannot be easily
transformed into a linear space
o For fitting custom models to data
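An illustrative sketch with SciPy's curve_fit; the exponential-decay model is an arbitrary choice of nonlinear equation, and the noisy data is synthetic:

    import numpy as np
    from scipy.optimize import curve_fit

    def model(x, a, b):  # the assumed parametric nonlinear equation
        return a * np.exp(-b * x)

    x = np.linspace(0, 4, 50)
    y = model(x, 2.5, 1.3) + 0.05 * np.random.default_rng(0).normal(size=x.size)

    params, _ = curve_fit(model, x, y)  # estimate a and b from the data
    print(params)                       # close to [2.5, 1.3]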
Common Regression Algorithms
❖ Gaussian Process Regression Model
❑ How It Works
o Gaussian process regression (GPR) models are nonparametric
models that are used for predicting the value of a continuous
response variable. They are widely used in the field of spatial
analysis for interpolation in the presence of uncertainty. GPR is
also referred to as Kriging.
❑ Best Used...
o For interpolating spatial data, such as hydrogeological data for the
distribution of ground water
o As a surrogate model to facilitate optimization of complex designs
such as automotive engines
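A minimal scikit-learn sketch; the kernel and observation points are arbitrary:

    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF

    X = np.array([[0.0], [1.0], [3.0], [4.0]])  # sparse observations
    y = np.sin(X).ravel()

    gpr = GaussianProcessRegressor(kernel=RBF()).fit(X, y)
    # Interpolate between observations and report predictive uncertainty.
    mean, std = gpr.predict([[2.0]], return_std=True)
    print(mean, std)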
Common Regression Algorithms
❖ SVM Regression
❑ How It Works
o SVM regression algorithms work like SVM classification
algorithms, but are modified to be able to predict a continuous
response. Instead of finding a hyperplane that separates data,
SVM regression algorithms find a model that deviates from the
measured data by a value no greater than a small amount, with
parameter values that are as small as possible (to minimize
sensitivity to error).
❑ Best Used...
o For high-dimensional data (where there will be many predictor
variables)
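A short scikit-learn sketch (toy data; kernel and epsilon are arbitrary):

    import numpy as np
    from sklearn.svm import SVR

    rng = np.random.default_rng(0)
    X = np.sort(rng.uniform(0, 5, 40)).reshape(-1, 1)
    y = np.sin(X).ravel() + 0.1 * rng.normal(size=40)

    # epsilon sets the maximum tolerated deviation from the measured data.
    svr = SVR(kernel='rbf', epsilon=0.1).fit(X, y)
    print(svr.predict([[2.5]]))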
Common Regression Algorithms
❖ Generalized Linear Model
❑ How It Works
o A generalized linear model is a special case of nonlinear models
that uses linear methods. It involves fitting a linear combination of
the inputs to a nonlinear function (the link function) of the
outputs.
❑ Best Used...
o When the response variables have non-normal distributions, such
as a response variable that is always expected to be positive
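An illustrative sketch using Poisson regression, a GLM with a log link suited to a response that must always be positive (PoissonRegressor assumes scikit-learn 0.23 or later; the count data is invented):

    import numpy as np
    from sklearn.linear_model import PoissonRegressor

    X = np.array([[1.0], [2.0], [3.0], [4.0], [5.0]])
    y = np.array([2, 6, 14, 40, 90])  # counts growing roughly exponentially

    # A linear combination of inputs, linked to the response via exp().
    glm = PoissonRegressor().fit(X, y)
    print(glm.predict([[6.0]]))  # predictions are always positive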
Common Regression Algorithms
❖ Regression Tree
❑ How It Works
o Decision trees for regression are similar to decision trees for
classification, but they are modified to be able to predict
continuous responses.
❑ Best Used...
o When predictors are categorical (discrete) or behave nonlinearly
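A minimal scikit-learn sketch with invented steplike data:

    import numpy as np
    from sklearn.tree import DecisionTreeRegressor

    X = np.array([[1.0], [2.0], [3.0], [10.0], [11.0], [12.0]])
    y = np.array([1.1, 0.9, 1.0, 5.2, 4.8, 5.0])  # steplike response

    # Each leaf stores a continuous value (the mean response of its samples).
    tree = DecisionTreeRegressor(max_depth=2).fit(X, y)
    print(tree.predict([[2.5], [11.5]]))  # roughly 1.0 and 5.0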
Improving Models
❖ Improving a model means
increasing its accuracy and
predictive power and
preventing overfitting
(when the model cannot
distinguish between data
and noise).
❖ Model improvement
involves feature engineering
(feature selection and
transformation) and
hyperparameter tuning.
Feature Selection
❖ Identifying the most relevant features, or variables,
that provide the best predictive power in modeling
your data. This could mean adding variables to the
model or removing variables that do not improve
model performance.
❖ It’s especially useful when you’re dealing with high-
dimensional data or when your dataset contains a
large number of features and a limited number of
observations.
❖ Reducing features also saves storage and
computation time and makes your results easier to
understand.
Feature Selection Techniques
❖ Stepwise regression:
❑ Sequentially adding or removing features until there is no
improvement in prediction accuracy.
❖ Sequential feature selection:
❑ Iteratively adding or removing predictor variables and
evaluating the effect of each change on the performance of
the model.
❖ Regularization:
❑ Using shrinkage estimators to remove redundant features by
reducing their weights (coefficients) to zero.
❖ Neighborhood component analysis (NCA):
❑ Finding the weight each feature has in predicting the output,
so that features with lower weights can be discarded.
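An illustrative scikit-learn sketch of two of these techniques, sequential feature selection and regularization via the lasso (dataset and settings arbitrary; SequentialFeatureSelector assumes scikit-learn 0.24 or later):

    from sklearn.datasets import load_diabetes
    from sklearn.feature_selection import SequentialFeatureSelector
    from sklearn.linear_model import Lasso, LinearRegression

    X, y = load_diabetes(return_X_y=True)

    # Sequential selection: greedily add predictors, keeping those that
    # improve cross-validated performance.
    sfs = SequentialFeatureSelector(LinearRegression(), n_features_to_select=4)
    sfs.fit(X, y)
    print(sfs.get_support())  # mask of the selected features

    # Regularization: the lasso shrinks redundant coefficients to zero.
    lasso = Lasso(alpha=1.0).fit(X, y)
    print(lasso.coef_)  # zero weights mark discarded features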
Feature Transformation
❖ Turning existing features into new features using
techniques such as principal component analysis,
nonnegative matrix factorization, and factor analysis.
❖ Feature transformation is a form of dimensionality
reduction.
❖ As discussed earlier, the three most commonly used
dimensionality reduction techniques are:
❑ Principal component analysis (PCA)
❑ Nonnegative matrix factorization
❑ Factor analysis
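A minimal PCA sketch in scikit-learn (the dataset and component count are arbitrary):

    from sklearn.datasets import load_iris
    from sklearn.decomposition import PCA

    X, _ = load_iris(return_X_y=True)
    pca = PCA(n_components=2)  # replace 4 features with 2 new components
    X_new = pca.fit_transform(X)
    print(X_new.shape)                    # (150, 2)
    print(pca.explained_variance_ratio_)  # variance retained per component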
Hyperparameter Tuning
❖ The process of identifying the set of hyperparameter values that
provides the best model. Hyperparameters control how a machine
learning algorithm fits the model to the data.
❖ Hyperparameter tuning is an iterative process. You begin by
setting values based on a “best guess” of the outcome. The goal is
to find the “best possible” values, those that yield the best
model.
❖ As you adjust values and model performance begins to improve,
you see which settings are effective and which still require
tuning.
Hyperparameter Tuning Methods
❖ Three common hyperparameter tuning methods are:
❑ Bayesian optimization
❑ Grid search
❑ Gradient-based optimization
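An illustrative grid search sketch in scikit-learn (the model and grid are arbitrary):

    from sklearn.datasets import load_iris
    from sklearn.model_selection import GridSearchCV
    from sklearn.svm import SVC

    X, y = load_iris(return_X_y=True)
    # Try every combination in the grid; keep the best cross-validated one.
    grid = {'C': [0.1, 1, 10], 'kernel': ['linear', 'rbf']}
    search = GridSearchCV(SVC(), grid, cv=5).fit(X, y)
    print(search.best_params_)  # best-performing hyperparameter settings
    print(search.best_score_)   # their cross-validated accuracy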