
UNIT-3

Linear Regression
Definition:
Linear Regression is a supervised machine learning algorithm
used to predict a value (dependent variable) based on the value of
one or more input variables (independent variables). It shows the
linear relationship between the variables. For a single input, this relationship is written as Y = mX + c, where:

• Y = Predicted value (dependent variable)
• X = Input value (independent variable)
• m = Slope of the line (shows how much Y changes with X)
• c = Intercept (value of Y when X = 0)

Example:

Suppose you want to predict a student's marks (Y) based on hours of study (X). Linear regression helps you find a line that best fits the data points and can be used to predict future scores.

Types:

1. Simple Linear Regression – One independent variable
2. Multiple Linear Regression – More than one independent variable

1. Simple Linear Regression

• Uses: One independent variable (X) to predict one dependent variable (Y)
• Example: Predicting salary (Y) based on years of experience (X)
• Goal: Find the best straight line that fits the data points
• Formula: Y = mX + c
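
A minimal sketch of how m and c can be computed for simple linear regression, using NumPy and made-up hours-of-study data (the numbers are only for illustration):

import numpy as np

# Hypothetical data: hours of study (X) and marks obtained (Y)
X = np.array([1, 2, 3, 4, 5], dtype=float)
Y = np.array([35, 50, 55, 70, 80], dtype=float)

# Least-squares estimates of the slope (m) and intercept (c)
m = np.sum((X - X.mean()) * (Y - Y.mean())) / np.sum((X - X.mean()) ** 2)
c = Y.mean() - m * X.mean()

print(f"Best-fit line: Y = {m:.2f}X + {c:.2f}")
print("Predicted marks for 6 hours of study:", m * 6 + c)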

2. Multiple Linear Regression

• Uses: Two or more independent variables (X₁, X₂, ..., Xₙ) to predict one dependent variable (Y)
• Example: Predicting house price (Y) based on size (X₁), number of bedrooms (X₂), and location rating (X₃)
• Goal: Understand how several factors influence the outcome
• Formula: Y = b₀ + b₁X₁ + b₂X₂ + ... + bₙXₙ

In short:

• Simple Linear Regression = One factor affecting the result
• Multiple Linear Regression = Many factors affecting the result
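
As a rough illustration of multiple linear regression, a scikit-learn fit on made-up house data (the feature values and prices below are hypothetical):

import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical houses: [size in sq. ft, bedrooms, location rating]
X = np.array([[1000, 2, 3],
              [1500, 3, 4],
              [2000, 3, 5],
              [2500, 4, 4]])
y = np.array([200000, 280000, 360000, 420000])  # made-up prices

model = LinearRegression().fit(X, y)
print("b0 (intercept):", model.intercept_)
print("b1..bn (coefficients):", model.coef_)
print("Predicted price:", model.predict([[1800, 3, 4]])[0])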

What is Logistic Regression?

Logistic Regression is a supervised machine learning algorithm used for classification problems. Unlike Linear Regression, which predicts continuous values, Logistic Regression is used to predict categorical outcomes – especially binary outcomes (like Yes/No, 0/1, True/False).
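
A minimal sketch of a binary logistic regression fit with scikit-learn, predicting pass/fail from hours studied (the data is made up for illustration):

import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical data: hours studied -> fail (0) or pass (1)
X = np.array([[1], [2], [3], [4], [5], [6]])
y = np.array([0, 0, 0, 1, 1, 1])

clf = LogisticRegression().fit(X, y)
print("Predicted class for 3.5 hours:", clf.predict([[3.5]])[0])
print("Estimated probability of passing:", clf.predict_proba([[3.5]])[0][1])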

Decision Tree

A Decision Tree is a supervised learning algorithm used for both classification and regression tasks. It is called a "tree" because it resembles a flowchart-like structure where each internal node represents a test on an attribute (feature), each branch represents the outcome of the test, and each leaf node represents a class label (in classification) or a value (in regression).

Why It's Called a Decision Tree

The model makes decisions based on conditions in a hierarchical manner, similar to how a human might make choices.

Real-World Applications:

• Medical diagnosis (e.g., disease prediction)
• Customer segmentation
• Credit scoring
• Fraud detection
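
A short sketch of a classification tree in scikit-learn; the Iris dataset is used here only as a stand-in example, and export_text prints the learned test at each internal node:

from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()
X, y = iris.data, iris.target

# Each internal node tests one feature; each leaf assigns a class label
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)
print(export_text(tree, feature_names=iris.feature_names))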

What is a Random Forest?

A Random Forest is an ensemble machine learning algorithm based on Decision Trees. It combines the output of multiple decision trees to make a more accurate and stable prediction.

• For classification, it outputs the majority vote of the trees.
• For regression, it outputs the average of the outputs of the trees.

How Does Random Forest Work?

Let’s understand the process step-by-step:

Step 1: Bootstrapping (Sampling)

• From the training dataset, multiple random subsets (with replacement) are created.
• Each subset is used to train a separate decision tree.

Step 2: Growing Trees with Random Feature Selection

• For each decision tree:
  o A random subset of features is selected at each split (not all features).
  o This adds randomness and decorrelates the trees, reducing overfitting.

Step 3: Aggregating Predictions

• For classification → Majority vote.
• For regression → Mean prediction.
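
A compact sketch of these three steps as scikit-learn carries them out internally (the dataset choice is ours): n_estimators sets the number of bootstrapped trees and max_features the random subset of features considered at each split.

from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# 100 trees, each grown on a bootstrap sample of the training data,
# with a random subset of features evaluated at every split
rf = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=0)
rf.fit(X_train, y_train)

# The final prediction is the majority vote across the 100 trees
print("Test accuracy:", rf.score(X_test, y_test))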

Key Concepts Behind Random Forest

🔸 Bagging (Bootstrap Aggregating)

• Technique to reduce variance and avoid overfitting.
• Multiple models trained on different subsets of data.
• Random Forest = Bagging + Decision Trees.

🔸 Decision Tree

• Base learner in Random Forest.
• Each tree learns a decision boundary, but individual trees may overfit.
• Combining many trees reduces this risk.

🔸 Out-of-Bag (OOB) Score

• Since trees are trained on bootstrap samples, about 1/3 of the data is left out (not seen by the tree).
• These left-out samples are used as a test set to estimate model accuracy without using a separate test set.
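
A sketch of how the OOB estimate can be requested in scikit-learn; oob_score=True evaluates every tree on the samples left out of its bootstrap sample:

from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier

X, y = load_breast_cancer(return_X_y=True)

# Each tree is scored on the roughly 1/3 of samples it never saw during training
rf = RandomForestClassifier(n_estimators=200, oob_score=True, random_state=0).fit(X, y)
print("Out-of-bag accuracy estimate:", rf.oob_score_)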

Advantages and Disadvantages

Pros:

• Handles both classification and regression tasks.
• Resistant to overfitting (due to averaging).
• Can handle missing data.
• Feature importance ranking.

Cons:

• Slower and more complex than individual decision trees.
• Less interpretable than a single decision tree.
• May not perform well on high-dimensional sparse data (like text).

What is a Support Vector Machine (SVM)?


Support Vector Machine (SVM) is a powerful supervised learning algorithm
used in classification, regression, and anomaly detection. Its main
objective is to find the optimal boundary (hyperplane) that separates
different classes in a dataset with the maximum possible margin.

The Main Idea Behind SVM

SVM tries to find the best decision boundary (also called a hyperplane) that
maximally separates the classes.

Key Concepts:

• Hyperplane: A line (in 2D), a plane (in 3D), or a flat decision boundary in higher dimensions that separates data into classes.
• Margin: Distance between the hyperplane and the nearest points (support vectors) from each class.
• Support Vectors: Data points closest to the hyperplane. They define the margin.

Goal of SVM

To maximize the margin between classes = better generalization.

Support Vector Machine (SVM) Terminology


• Hyperplane: A decision boundary separating different classes in
feature space, represented by the equation wx + b = 0 in linear
classification.
• Support Vectors: The closest data points to the hyperplane, crucial for
determining the hyperplane and margin in SVM.
• Margin: The distance between the hyperplane and the support vectors.
SVM aims to maximize this margin for better classification
performance.
• Kernel: A function that maps data to a higher-dimensional space,
enabling SVM to handle non-linearly separable data.
• Hard Margin: A maximum-margin hyperplane that perfectly separates
the data without misclassifications.
• Soft Margin: Allows some misclassifications by introducing slack
variables, balancing margin maximization and misclassification
penalties when data is not perfectly separable.
• C: A regularization term balancing margin maximization and
misclassification penalties. A higher C value enforces a stricter penalty
for misclassifications.
• Hinge Loss: A loss function penalizing misclassified points or margin
violations, combined with regularization in SVM.
• Dual Problem: Involves solving for Lagrange multipliers associated
with support vectors, facilitating the kernel trick and efficient
computation.
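
To make the C and kernel terms concrete, a hypothetical soft-margin fit with scikit-learn on non-linearly separable toy data; a smaller C tolerates more margin violations, while the RBF kernel maps the data so a non-linear boundary can be found:

from sklearn.datasets import make_moons
from sklearn.svm import SVC

# Toy data that no straight line can separate cleanly
X, y = make_moons(n_samples=200, noise=0.2, random_state=0)

soft_linear = SVC(kernel="linear", C=0.1).fit(X, y)  # wider margin, more violations allowed
rbf = SVC(kernel="rbf", C=10.0).fit(X, y)            # kernel trick for a non-linear boundary

print("Linear SVM training accuracy:", soft_linear.score(X, y))
print("RBF SVM training accuracy:", rbf.score(X, y))
print("Support vectors used by the RBF model:", len(rbf.support_vectors_))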

Linear SVM Intuition

Imagine we have two classes that are linearly separable (you can draw a
straight line between them).
SVM finds the hyperplane with the largest margin.

Mathematical Computation: SVM


Consider a binary classification problem with two classes, labeled as +1 and
-1. We have a training dataset consisting of input feature vectors X and their
corresponding class labels Y.

The equation for the linear hyperplane can be written as:


wᵀx + b = 0

Where:

• w is the normal vector to the hyperplane (the direction perpendicular to it).
• b is the offset or bias term, representing the distance of the hyperplane from the origin along the normal vector w.
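
A short sketch showing how w and b of the fitted hyperplane can be read off a linear SVM in scikit-learn (the blob data below is made up for illustration):

import numpy as np
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

# Two linearly separable clusters, relabeled as -1 and +1
X, y = make_blobs(n_samples=100, centers=2, random_state=0)
y = np.where(y == 0, -1, 1)

clf = SVC(kernel="linear", C=1000.0).fit(X, y)

w = clf.coef_[0]        # normal vector to the hyperplane
b = clf.intercept_[0]   # bias / offset term
print(f"Hyperplane: {w[0]:.3f}*x1 + {w[1]:.3f}*x2 + {b:.3f} = 0")
print("Support vectors:")
print(clf.support_vectors_)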

What is Naive Bayes Classifier?

Naive Bayes is a supervised learning algorithm based on Bayes' Theorem with a strong (naive) assumption of independence between features. Despite this "naive" assumption, it often performs exceptionally well in practice, especially in natural language processing (NLP) and classification problems.

Bayes’ Theorem (Foundation of Naive Bayes)
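
Bayes' Theorem gives the probability of a class y given the observed features X:

P(y | X) = P(X | y) · P(y) / P(X)

where P(y | X) is the posterior, P(X | y) is the likelihood, P(y) is the prior probability of the class, and P(X) is the evidence.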

Assumptions Made by Naïve Bayes

The fundamental Naïve Bayes assumption is that each feature makes an independent and equal contribution to the outcome.


Let us take an example to get some better intuition. Consider the car theft problem with attributes Color, Type, and Origin, and the target Stolen, which can be either Yes or No.

The "Naive" Assumption

Naive Bayes assumes that all features are independent given the class. That is:

P(x₁, x₂, ..., xₙ | y) = P(x₁ | y) · P(x₂ | y) · ... · P(xₙ | y)

This simplifies computation significantly!

Example:

Here in our dataset, we need to classify whether the car is stolen, given the features of the car. The columns represent these features and the rows represent individual entries. If we take the first row of the dataset, we can observe that the car is stolen if the Color is Red, the Type is Sports, and the Origin is Domestic. So we want to classify whether a Red Domestic SUV will be stolen or not. Note that there is no example of a Red Domestic SUV in our dataset.


According to this example, Bayes' theorem can be rewritten as:

P(y | X) = P(X | y) · P(y) / P(X)

The variable y is the class variable (Stolen?), which represents whether the car is stolen or not given the conditions. Variable X represents the parameters/features.

X is given as X = (x₁, x₂, ..., xₙ).

Here x₁, x₂, ..., xₙ represent the features, i.e. they can be mapped to Color, Type, and Origin. By substituting for X and expanding using the chain rule, we get:
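
P(y | x₁, ..., xₙ) = [ P(x₁ | y) · P(x₂ | y) · ... · P(xₙ | y) · P(y) ] / [ P(x₁) · P(x₂) · ... · P(xₙ) ]

Since the denominator does not depend on y, the predicted class is the one that maximizes the numerator:

ŷ = argmax_y P(y) · P(x₁ | y) · P(x₂ | y) · ... · P(xₙ | y)

A minimal sketch of this classifier on car-theft style data, with the categories encoded as integers; the rows and labels below are made up to mirror the example, not taken from the original table:

import numpy as np
from sklearn.naive_bayes import CategoricalNB

# Encoding: Color {Red: 0, Yellow: 1}, Type {Sports: 0, SUV: 1}, Origin {Domestic: 0, Imported: 1}
X = np.array([[0, 0, 0],   # Red, Sports, Domestic
              [0, 0, 0],
              [0, 0, 1],   # Red, Sports, Imported
              [1, 0, 1],   # Yellow, Sports, Imported
              [1, 0, 0],   # Yellow, Sports, Domestic
              [1, 1, 1],   # Yellow, SUV, Imported
              [1, 1, 0],   # Yellow, SUV, Domestic
              [1, 1, 0],
              [0, 1, 1],   # Red, SUV, Imported
              [0, 0, 1]])
y = np.array([1, 0, 1, 0, 1, 0, 1, 0, 0, 1])  # Stolen? No = 0, Yes = 1 (made-up labels)

nb = CategoricalNB().fit(X, y)
# Query: Red, SUV, Domestic -- a combination that never appears in the data
print("P(not stolen), P(stolen):", nb.predict_proba([[0, 1, 0]])[0])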
