Comparing ML Algorithms - Anjali Garg

A systematic approach for a better understanding of:

- How to compare algorithms effectively
- How to optimize models for specific tasks
[Diagram: the six core components of a machine learning algorithm: Training Data, Loss / Cost Function, Optimization Algorithm, Model Weights and Hyperparameters, Training Process, Output Data]

These six components are crucial for understanding the core differences between
machine learning algorithms.
Different algorithms are designed to handle different types of data.

Data Structure:
- Algorithms such as decision trees, regression models, and k-means work with structured (i.e., tabular) data.
- Neural network models can also work with unstructured data such as images and text.
- Time series or sequential data is handled by models like ARIMA and LSTMs.

Labeled vs. Unlabeled Data:
- Labeled data is used in supervised learning algorithms such as decision tree classifiers, logistic regression, and linear regression.
- Unlabeled data is used in unsupervised algorithms such as K-Means Clustering.
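To make the distinction concrete, here is a minimal sketch (assuming scikit-learn is installed; the toy arrays are invented for illustration) contrasting a supervised fit, which needs labels y, with an unsupervised fit, which does not:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.cluster import KMeans

X = np.array([[1.0, 2.0], [1.5, 1.8], [5.0, 8.0], [6.0, 9.0]])  # features
y = np.array([0, 0, 1, 1])                                      # labels

# Supervised: the model learns a mapping from X to the provided labels y.
clf = LogisticRegression().fit(X, y)
print(clf.predict([[1.2, 1.9]]))  # -> predicted class label

# Unsupervised: K-Means groups the rows of X into clusters without any labels.
km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(km.labels_)  # -> cluster assignment for each row of X
```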
The loss function quantifies the difference between predicted and actual outcomes.
The choice of loss function determines how the model learns from the data, because the
model tries to minimize this loss in order to capture the patterns in the data.

Common Loss Functions:


Classification: the loss function measures how well or poorly the model's predicted labels match the true labels. Example: cross entropy loss for multi-class classification,

L = -Σ_{c=1}^{C} y_ic · log(p_ic)

where C is the number of classes, y_ic is the true label (1 if class c is the correct one, 0 otherwise), and p_ic is the predicted probability for class c.

Regression: the loss function measures the difference between the predicted values and the actual values. Example: MSE measures the average of the squares of the errors,

MSE = (1/n) · Σ_{i=1}^{n} (y_i - p_i)²

where y_i is the actual value and p_i is the predicted value.
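As a hedged illustration (plain NumPy, with example arrays invented for the demonstration), both losses can be computed directly from their definitions above:

```python
import numpy as np

# Regression: Mean Squared Error, MSE = (1/n) * sum((y_i - p_i)^2)
y_true = np.array([3.0, -0.5, 2.0])   # actual values y_i
y_pred = np.array([2.5,  0.0, 2.0])   # predicted values p_i
mse = np.mean((y_true - y_pred) ** 2)

# Classification: cross entropy, L = -sum_c y_ic * log(p_ic), for one sample
y_onehot = np.array([0.0, 1.0, 0.0])  # true label, one-hot over C = 3 classes
p = np.array([0.1, 0.8, 0.1])         # predicted class probabilities
cross_entropy = -np.sum(y_onehot * np.log(p))

print(f"MSE: {mse:.4f}, cross entropy: {cross_entropy:.4f}")
```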
Common Loss Functions by Algorithm:

Algorithm Name | Algorithm Type | Loss Function Name
Linear Regression | Regression | Mean Squared Error (MSE)
Logistic Regression | Classification | Cross Entropy Loss
Support Vector Machine (SVM) | Classification | Hinge Loss
Robust Regression | Regression | Huber Loss
Poisson Regression | Regression | Poisson Loss
AdaBoost | Classification | Exponential Loss
Linear SVM | Classification | Squared Hinge Loss
Lasso Regression | Regression | Mean Squared Error with L1 penalty
Gradient Boosting (for binary classification) | Classification | Log Loss


Optimization algorithms find the best set of parameters (weights, biases, etc.) that
reduce the error as much as possible by minimizing the loss function, improving the model's
accuracy or predictive performance.

Key Types of Optimization Algorithms in ML:

- Gradient Descent: minimizes the cost function by iteratively moving in the direction of steepest descent (the negative gradient) with respect to the parameters.
- Adaptive Learning Rate Algorithms: adjust the learning rate dynamically to make the algorithm more efficient by modifying how fast or slow it learns.
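A minimal sketch of plain gradient descent (the one-parameter toy problem is invented for illustration): the parameter repeatedly steps in the direction of the negative gradient, scaled by the learning rate:

```python
# Minimize f(w) = (w - 3)^2; its gradient is f'(w) = 2 * (w - 3).
w = 0.0             # initial parameter value
learning_rate = 0.1

for step in range(50):
    grad = 2 * (w - 3)          # gradient of the loss at the current w
    w -= learning_rate * grad   # step in the direction of steepest descent

print(w)  # converges toward the minimizer w = 3
```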
Model Parameters: These are internal variables learned from the data during training
(e.g., weights in a neural network, coefficients and bias in linear regression). They directly
affect the model's predictions and are adjusted by optimization algorithms like gradient
descent.

Hyperparameters: These are external configurations set before training (e.g., learning
rate, number of layers, number of neurons in a neural network). They guide the learning
process and affect the model's performance and generalization ability. Tuning them is
key to improving model accuracy.

By tuning hyperparameters based on the validation data, the model is more likely to
generalize well to unseen data (i.e., test data), ensuring better performance in real-world
scenarios.
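A minimal sketch of that idea with scikit-learn (the candidate alpha values and the synthetic data are invented for illustration): each hyperparameter setting is trained on the training split and scored on the validation split, and the best-scoring setting is kept:

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + rng.normal(scale=0.1, size=200)

X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.25, random_state=0)

best_alpha, best_score = None, -np.inf
for alpha in [0.01, 0.1, 1.0, 10.0]:                   # hyperparameter candidates
    model = Ridge(alpha=alpha).fit(X_train, y_train)   # model parameters learned here
    score = model.score(X_val, y_val)                  # R^2 on the validation data
    if score > best_score:
        best_alpha, best_score = alpha, score

print(best_alpha, best_score)
```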
Algorithm | Model Parameters | Hyperparameters
Linear Regression | Coefficients (weights), Intercept (bias) | Regularization strength (L2: Ridge, L1: Lasso), Learning rate
Logistic Regression | Coefficients (weights), Intercept (bias) | Regularization strength, Solver type
Decision Tree | Node splits, Leaf nodes | Max depth, Min samples split, Min samples leaf
Random Forest | Decision tree parameters (per tree) | Number of trees, Max depth, Max features, Min samples split
Support Vector Machine | Support vectors, Coefficients | Kernel type, Regularization parameter (C), Gamma
K-Nearest Neighbors | N/A | Number of neighbors (K), Distance metric
Neural Networks | Weights, Biases | Learning rate, Number of layers, Number of neurons, Activation function, Batch size, Epochs
K-Means Clustering | Cluster centroids | Number of clusters (K), Initialization method, Max iterations
XGBoost | Tree parameters (weights) | Learning rate, Max depth, Number of estimators, Subsample ratio
The training process is the backbone of building an effective machine learning model.
Here's a breakdown of the key steps involved.

1. Data Preprocessing: Prepare the data (e.g., normalization, feature selection, lemmatization for text)
to ensure it is in a suitable format for the model.
2. Model Initialization: Set initial values for the model's weights and hyperparameters.
3. Forward Pass: The model makes predictions using the initial weights on the training data.
4. Compute Loss: The difference between predicted and actual values is measured using a loss
function (e.g., MSE for regression, Cross Entropy for classification).
5. Backpropagation: The error is propagated back through the network to adjust the weights using an
optimization algorithm like Gradient Descent.
6. Parameter Update: The model's parameters (weights, biases) are updated to reduce the loss.
7. Repeat: Steps 3 to 6 are repeated for multiple iterations (epochs) until the model converges.

Different models differ in their data preprocessing steps, initialization process, loss functions,
and optimization algorithms. Choosing effective methods is crucial for the model's performance
and accuracy.
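As a hedged end-to-end sketch of steps 2 through 7 (plain NumPy, with synthetic data invented for illustration), here is linear regression trained with gradient descent on the MSE loss:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))            # 1. preprocessed training data
y = X @ np.array([2.0, -1.0]) + 0.5      #    true relationship (unknown to the model)

w = np.zeros(2)                          # 2. initialize weights
b = 0.0                                  #    and bias
lr = 0.1                                 #    learning rate (a hyperparameter)

for epoch in range(200):                 # 7. repeat for multiple epochs
    y_pred = X @ w + b                   # 3. forward pass
    error = y_pred - y
    loss = np.mean(error ** 2)           # 4. compute MSE loss
    grad_w = 2 * X.T @ error / len(y)    # 5. gradients of the loss w.r.t. parameters
    grad_b = 2 * error.mean()
    w -= lr * grad_w                     # 6. parameter update
    b -= lr * grad_b

print(w, b)  # should approach [2.0, -1.0] and 0.5
```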
Algorithms can also be categorized by their output, depending on the nature and type of the data they produce.
Here are some key ways algorithms differ based on their output:

Output | Algorithm Type | Common Algorithms
Categorical label or class | Classification | Logistic Regression, Decision Trees, SVM, k-NN
Continuous value | Regression | Linear Regression, Ridge Regression, Neural Networks
Group or cluster | Clustering | k-Means, DBSCAN, Hierarchical Clustering
Generated data resembling input | Generative | GANs, VAEs
Reduced data | Dimensionality Reduction | PCA, t-SNE, UMAP
Suggested items | Recommendation | Collaborative Filtering, Content-based Filtering
Optimal solution | Optimization | Gradient Descent, Genetic Algorithms


Aspect | Linear Regression | Decision Tree
Training Data | Uses a labeled and structured dataset, assuming a linear relationship between features and target. | Also requires a labeled and structured dataset, but splits data recursively without assuming linearity.
Loss Function | Minimizes Mean Squared Error (MSE). | Uses MSE (regression) or Gini/Entropy (classification) to determine splits.
Optimization Algorithm | Direct computation (normal equation) or gradient descent to minimize loss. | Greedy algorithm that selects the feature providing the best split.
Model Parameters and Hyperparameters | Parameters: coefficients and intercept. Hyperparameters: learning rate for gradient descent. | Parameters: tree structure. Hyperparameters: depth, split criteria, etc.
Training Process | Fits a linear equation by minimizing the error, i.e., the loss function. | Builds a tree by recursively splitting data on features until stopping criteria are met.
Output Data | Continuous values (regression). | Continuous values (regression) or class labels (classification).
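The contrast can be seen in a few lines of scikit-learn (the synthetic data and the max_depth value are invented for illustration): the linear model fits global coefficients, while the tree recursively partitions the feature space:

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X[:, 0]) + rng.normal(scale=0.1, size=200)   # non-linear target

lin = LinearRegression().fit(X, y)                      # learns coefficients + intercept
tree = DecisionTreeRegressor(max_depth=3).fit(X, y)     # learns a tree structure

print("linear R^2:", lin.score(X, y))    # limited by the linearity assumption
print("tree   R^2:", tree.score(X, y))   # can capture the non-linear shape
```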
Aspect | Logistic Regression | SVM (Support Vector Machine)
Training Data | Requires labeled data for binary or multiclass classification. | Requires labeled data; works well with small or large datasets.
Loss Function | Uses Log Loss (Cross Entropy Loss) to measure prediction error. | Uses Hinge Loss to maximize the margin between classes.
Optimization Algorithm | Optimized via gradient descent or its variants. | Uses Quadratic Programming (QP) to maximize the margin, or gradient-based methods for non-linear cases.
Model Parameters and Hyperparameters | Parameters: weights and bias. Hyperparameters: regularization strength (L1/L2). | Parameters: support vectors and weights. Hyperparameters: C (regularization), kernel type.
Training Process | Adjusts weights to minimize Log Loss using gradient descent. | Finds a hyperplane that maximizes the margin, with kernel options for non-linear cases.
Output Data | Outputs probabilities for class membership (via the sigmoid function). | Outputs class labels based on distance from the separating hyperplane (no probabilities).
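A minimal sketch of that output difference with scikit-learn (the toy data is invented for illustration): logistic regression exposes class probabilities, while a default SVC returns only labels:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC

X = np.array([[0.0, 0.0], [0.2, 0.1], [1.0, 1.0], [1.2, 0.9]])
y = np.array([0, 0, 1, 1])

logreg = LogisticRegression(C=1.0).fit(X, y)   # minimizes log loss
svm = SVC(kernel="linear", C=1.0).fit(X, y)    # maximizes the margin (hinge loss)

print(logreg.predict_proba([[0.6, 0.5]]))  # class membership probabilities
print(svm.predict([[0.6, 0.5]]))           # class label only (no probabilities
                                           # unless probability=True is set)
```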
Thanks for watching!
