
ML ALGORITHMS
REGRESSION

A statistical technique that relates a dependent variable to one or more independent (explanatory) variables. A regression model can show whether changes observed in the dependent variable are associated with changes in one or more of the explanatory variables.
CLASSIFICATION

Statistical classification is the broad supervised learning approach that trains a program to categorize new information based on its relevance to known, labeled data. The algorithms that sort new data into labeled classes, or categories of information, are called classifiers.
Multiple Linear Regression

• Multiple linear regression attempts to model the relationship between two or more independent variables and a dependent variable by fitting a linear equation of the form y = b0 + b1·x1 + … + bn·xn to the observed data. Every value of an independent variable x is associated with a value of the dependent variable y.
Multiple Linear Regression

• Good
- Simple to implement and efficient to train.
- Overfitting can be reduced by regularization.
- Performs well when the relationship between the variables is approximately linear.
• Bad
- Assumes that the observations are independent, which is rare in real life.
- Prone to noise and overfitting.
- Sensitive to outliers.
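A minimal sketch of the idea in Python with scikit-learn (the library choice and the data values below are assumptions for illustration, not from the slides):

    import numpy as np
    from sklearn.linear_model import LinearRegression

    # Synthetic data: two independent variables and one dependent variable
    X = np.array([[1, 2], [2, 1], [3, 4], [4, 3], [5, 6]])
    y = np.array([5.0, 6.1, 11.2, 12.0, 17.1])

    model = LinearRegression()
    model.fit(X, y)                          # fits y = b0 + b1*x1 + b2*x2

    print(model.coef_, model.intercept_)     # fitted coefficients b1, b2 and b0
    print(model.predict([[6, 5]]))           # prediction for a new observation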
Logistic Regression

• Logistic regression is a classification algorithm used to predict a binary outcome. It is often used in cases where the outcome is either yes or no. Logistic regression uses the sigmoid function σ(z) = 1 / (1 + e^(−z)) to map input variables to a probability of the output variable.
Logistic Regression

• Good
- Less prone to overfitting, though it can still overfit on high-dimensional datasets.
- Efficient when the dataset has features that are linearly separable.
- Easy to implement and efficient to train.
• Bad
- Should not be used when the number of observations is smaller than the number of features.
- Assumes linearity, which is rare in practice.
- Can only be used to predict discrete outcomes.
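A minimal sketch with scikit-learn; the hours-studied data below is made up for illustration:

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    # Synthetic data: hours studied vs. pass (1) / fail (0)
    X = np.array([[0.5], [1.0], [1.5], [2.0], [2.5], [3.0], [3.5], [4.0]])
    y = np.array([0, 0, 0, 0, 1, 1, 1, 1])

    clf = LogisticRegression()
    clf.fit(X, y)

    print(clf.predict([[1.8]]))          # predicted class: 0 or 1
    print(clf.predict_proba([[1.8]]))    # sigmoid-mapped class probabilities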
Decision Trees

• A decision tree is a simple and intuitive model used for both regression and classification problems. It is a tree-like structure where each internal node represents a feature or attribute, each branch represents a decision rule, and each leaf represents an outcome. Decision trees are particularly useful when the data has multiple variables and is non-linear.
Decision Trees

• Good
- Can solve non-linear problems.
- Can work on high-dimensional data with excellent accuracy.
- Easy to visualize and explain.
• Bad
- Prone to overfitting, which can be mitigated by random forests.
- A small change in the data can lead to a large change in the structure of the optimal decision tree.
- Calculations can get very complex.
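A minimal sketch with scikit-learn, using the bundled iris dataset; max_depth=2 is an assumed cap that limits overfitting:

    from sklearn.datasets import load_iris
    from sklearn.tree import DecisionTreeClassifier, export_text

    iris = load_iris()

    # Limiting depth is one simple guard against overfitting
    tree = DecisionTreeClassifier(max_depth=2, random_state=0)
    tree.fit(iris.data, iris.target)

    # The learned decision rules print as readable if/else branches
    print(export_text(tree, feature_names=iris.feature_names))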
Random Forests

• Random forests are a powerful and popular ensemble learning technique used for classification, regression, and anomaly detection. They extend decision trees: a large number of decision trees are trained on random subsets of the data, and the final prediction is made by averaging the individual tree predictions (regression) or taking a majority vote (classification).
Random Forests

• Good
- Can perform both regression and classification tasks.
- Handles large datasets efficiently.
- Higher level of accuracy in predictions.
• Bad
- Can still overfit on noisy data.
- The models can be quite large, making pruning necessary.
- Calculations can become complex when there are many class variables.
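A minimal sketch with scikit-learn (the iris data and the choice of 100 trees are assumed example values):

    from sklearn.datasets import load_iris
    from sklearn.ensemble import RandomForestClassifier

    X, y = load_iris(return_X_y=True)

    # Each of the 100 trees sees a bootstrap sample of the data;
    # the forest combines them by majority vote for classification
    rf = RandomForestClassifier(n_estimators=100, random_state=0)
    rf.fit(X, y)

    print(rf.predict(X[:5]))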
K Nearest Neighbour

• The K in the name of this classifier represents the k nearest neighbors, where k is an integer value specified by the user. Hence, as the name suggests, this classifier implements learning based on the k nearest neighbors: a new point is assigned the majority class among its k closest training points. The choice of the value of k depends on the data.
K Nearest Neighbour

• Good
- Can make predictions without a training phase.
- Prediction time complexity is O(n) per query.
- Can be used for both classification and regression.
• Bad
- Does not work well with large datasets.
- Sensitive to noisy data, missing values and outliers.
- Needs feature scaling.
- Requires choosing the right value of k.
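A minimal sketch with scikit-learn; k=5 is an assumed value, and the scaling step reflects the note above that KNN needs feature scaling:

    from sklearn.datasets import load_iris
    from sklearn.neighbors import KNeighborsClassifier
    from sklearn.preprocessing import StandardScaler

    X, y = load_iris(return_X_y=True)

    # Distances drive the vote, so the features are scaled first
    X_scaled = StandardScaler().fit_transform(X)

    knn = KNeighborsClassifier(n_neighbors=5)   # k = 5, tuned per dataset
    knn.fit(X_scaled, y)    # "training" just stores the points

    print(knn.predict(X_scaled[:5]))   # majority vote of the 5 nearest points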
Support Vector Machine

Support vector machines (SVM) are a supervised learning algorithm used for classification and regression analysis. SVM tries to find the hyperplane that separates the data points into different classes with the maximum margin. SVM can also handle non-linearly separable data by transforming the data into a higher-dimensional space (the kernel trick).
Support Vector Machine

• Good
- Effective on high-dimensional data.
- Can work on small datasets.
- Can solve non-linear problems.
• Bad
- Inefficient on large datasets.
- Requires picking the right kernel and hyperparameters.
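A minimal sketch with scikit-learn; the RBF kernel and C=1.0 are assumed choices, illustrating the kernel and hyperparameters that must be picked:

    from sklearn.datasets import load_iris
    from sklearn.svm import SVC

    X, y = load_iris(return_X_y=True)

    # The RBF kernel implicitly maps the data to a higher-dimensional
    # space, letting SVM separate points that are not linearly separable
    clf = SVC(kernel="rbf", C=1.0)
    clf.fit(X, y)

    print(clf.predict(X[:5]))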
Naive Bayes

• Naive Bayes classifiers are a collection of classification algorithms based on Bayes' Theorem, which gives the probability of A happening given that B has occurred: P(A|B) = P(B|A) · P(A) / P(B). The "naive" assumption is that every pair of features being classified is independent of each other.
• We divide the dataset into two parts:
• 1. The feature matrix, which contains all the vectors (rows) of the dataset, where each vector consists of the values of the dependent features.
• 2. The response vector, which contains the value of the class variable (prediction or output) for each row of the feature matrix.
Naive Bayes

• Good
- Training time is short.
- Better suited to categorical inputs.
- Easy to implement.
• Bad
- Assumes that all features are independent, which rarely holds in real life.
- The zero-frequency problem: a category value never seen in training gets zero probability (usually fixed with smoothing).
- Probability estimates can be unreliable in some cases.
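A minimal sketch with scikit-learn, using the Gaussian variant (an assumed choice); X is the feature matrix and y the response vector described above:

    from sklearn.datasets import load_iris
    from sklearn.naive_bayes import GaussianNB

    X, y = load_iris(return_X_y=True)   # feature matrix, response vector

    nb = GaussianNB()
    nb.fit(X, y)    # training is fast: per-class means and variances only

    print(nb.predict(X[:5]))
    print(nb.score(X, y))    # accuracy on the training data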
K Means(UNSUPERVISED LEARNING)

• K-Means Clustering is an unsupervised learning algorithm that groups an unlabeled dataset into different clusters. Here K defines the number of pre-defined clusters that need to be created in the process.
• For example, if K=2 there will be two clusters, for K=3 there will be three clusters, and so on.
• It allows us to cluster the data into different groups and is a convenient way to discover the categories of groups in an unlabeled dataset on its own, without the need for any training labels. It is a centroid-based algorithm, where each cluster is associated with a centroid.
K Means

• Good
- Simple to implement.
- Scales to large datasets.
- Guarantees convergence.
- Easily adapts to new examples.
- Generalizes to clusters of different shapes and sizes.
• Bad
- Sensitive to outliers.
- Choosing the k value manually is tough.
- Dependent on the initial values.
- Scalability decreases as the number of dimensions grows.
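A minimal sketch with scikit-learn on synthetic unlabeled data (the three blobs and K=3 are assumed example values):

    from sklearn.cluster import KMeans
    from sklearn.datasets import make_blobs

    # Synthetic unlabeled data: 200 points around 3 centers
    X, _ = make_blobs(n_samples=200, centers=3, random_state=42)

    km = KMeans(n_clusters=3, n_init=10, random_state=42)   # K = 3
    labels = km.fit_predict(X)

    print(km.cluster_centers_)   # one centroid per cluster
    print(labels[:10])           # cluster assignment of the first points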
TRAIN AND TEST
• Machine learning is about learning some properties of a data set and then testing those properties against another data set.
• A common practice in machine learning is to evaluate an algorithm by splitting a data set into two.
• We call one of those sets the training set, on which we learn some properties; we call the other the testing set, on which we test the learned properties.
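A minimal sketch of the split with scikit-learn (the 75/25 split and the logistic-regression model are assumed example choices):

    from sklearn.datasets import load_iris
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split

    X, y = load_iris(return_X_y=True)

    # Learn on the training set, evaluate on the held-out testing set
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, random_state=0)

    clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    print(clf.score(X_test, y_test))   # accuracy on unseen data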