Mod 3

The document discusses various regression techniques including Ridge and Lasso Regression, highlighting their differences in regularization type, feature selection, and use cases. It also explains Least-Squares Regression for classification, its methodology, advantages, and disadvantages. Additionally, the document covers Support Vector Machines, detailing their functionality, advantages, disadvantages, and applications.


Differentiate between Ridge and Lasso Regression

Characteristic | Ridge Regression | Lasso Regression
---|---|---
Regularization Type | Applies L2 regularization, adding a penalty term proportional to the square of the coefficients. | Applies L1 regularization, adding a penalty term proportional to the absolute value of the coefficients.
Feature Selection | Does not perform feature selection. All predictors are retained, although their coefficients are reduced in size to minimize overfitting. | Performs automatic feature selection. Less important predictors are completely excluded by setting their coefficients to zero.
When to Use | Best suited for situations where all predictors are potentially relevant and the goal is to reduce overfitting rather than eliminate features. | Ideal when you suspect that only a subset of predictors is important, and the model should focus on those while ignoring the irrelevant ones.
Output Model | Produces a model that includes all features, but their coefficients are smaller in magnitude to prevent overfitting. | Produces a simpler model, retaining only the most significant features and ignoring the rest by setting their coefficients to zero.
Impact on Prediction | Reduces the magnitude of coefficients, shrinking them towards zero, but never exactly to zero; all predictors remain in the model. | Shrinks some coefficients to exactly zero, effectively removing their influence from the model. This leads to a simpler model with fewer features.
Computation | Generally faster, as it doesn't involve feature selection. | May be slower due to the feature selection process.
Example Use Case | Use when you have many predictors that all contribute to the outcome (e.g., predicting house prices, where features like size, location, etc., all matter). | Use when you believe only some predictors are truly important (e.g., genetic studies where only a few genes out of thousands are relevant).
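To make the contrast concrete, here is a minimal scikit-learn sketch (the library, toy data, and alpha values are illustrative choices, not from these notes) showing that Lasso zeroes out coefficients while Ridge only shrinks them:

```python
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge, Lasso

# Toy data: only 10 of the 50 features actually matter
X, y = make_regression(n_samples=100, n_features=50,
                       n_informative=10, noise=5.0, random_state=0)

ridge = Ridge(alpha=1.0).fit(X, y)  # L2 penalty: shrinks all coefficients
lasso = Lasso(alpha=1.0).fit(X, y)  # L1 penalty: sets some to exactly zero

print("Ridge zero coefficients:", (ridge.coef_ == 0).sum())  # typically 0
print("Lasso zero coefficients:", (lasso.coef_ == 0).sum())  # typically > 0
```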

Find a linear regression equation for the following two sets of data:

x y
3 12
5 18
7 24
9 30
We fit a line of the form:

y = mx + c

where:

• m is the slope (regression coefficient)

• c is the y-intercept

Here x̄ = 6 and ȳ = 21.

x | y | x − x̄ | y − ȳ | (x − x̄)(y − ȳ) | (x − x̄)²
---|---|---|---|---|---
3 | 12 | −3 | −9 | 27 | 9
5 | 18 | −1 | −3 | 3 | 1
7 | 24 | 1 | 3 | 3 | 1
9 | 30 | 3 | 9 | 27 | 9

Summing the last two columns gives Σ(x − x̄)(y − ȳ) = 60 and Σ(x − x̄)² = 20, so

m = 60 / 20 = 3 and c = ȳ − m·x̄ = 21 − 3(6) = 3

giving the regression line y = 3x + 3.
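A minimal NumPy sketch of the same computation (the library choice is illustrative, not part of the original worked example):

```python
import numpy as np

x = np.array([3, 5, 7, 9])
y = np.array([12, 18, 24, 30])

# Normal-equation formulas for simple linear regression
m = ((x - x.mean()) * (y - y.mean())).sum() / ((x - x.mean()) ** 2).sum()
c = y.mean() - m * x.mean()
print(m, c)  # 3.0 3.0, i.e. y = 3x + 3
```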

Explain Least-Squares Regression for classification.

Least-squares regression is a method that finds a straight line (or a surface in higher
dimensions) that best fits the data by minimizing the total squared difference between the
predicted and actual values.

While it's designed for predicting numbers, it can also be adapted for classification, especially
in binary classification problems.

How it’s used for classification

Even though classification is about predicting categories (like yes/no, spam/not spam), we can
use least-squares regression by:

1. Converting class labels into numbers:
For example, assign 0 to one class and 1 to the other.

2. Training the model:
The algorithm finds a line (or boundary) that tries to fit these numerical labels as if they
were continuous values.

3. Making predictions:
For a new input, the model gives a number—possibly between 0 and 1, or even outside
that range.

4. Classifying using a threshold:
If the predicted number is greater than a certain cutoff (usually 0.5), it is classified as one class (say 1); otherwise, the other (say 0).

Example

Let’s say you want to classify emails as spam or not spam. You label spam as 1, not spam as 0.
The regression model is trained to predict values close to these numbers. When a new email
comes in, if the model gives a score like 0.8, you label it as spam. If it gives 0.2, it's not spam.
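A minimal sketch of this approach using NumPy's least-squares solver (the toy data and the 0.5 cutoff are illustrative assumptions):

```python
import numpy as np

# Toy binary data: one feature, labels encoded as 0/1
X = np.array([[1.0], [2.0], [3.0], [6.0], [7.0], [8.0]])
y = np.array([0, 0, 0, 1, 1, 1])

# Add a bias column and fit the labels by least squares
X_b = np.hstack([X, np.ones((len(X), 1))])
w, *_ = np.linalg.lstsq(X_b, y, rcond=None)

# Continuous scores, then threshold at 0.5 to get class labels
scores = X_b @ w
print((scores > 0.5).astype(int))  # predicted classes
```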

Pros

• Simple and quick to implement.

• Easy to understand.

• Works well when the data is simple and well separated.

Cons

• Not built specifically for classification.

• Can give results outside the expected range (like less than 0 or more than 1).

• Not good at handling more complex or non-linear patterns in data.

• Doesn't perform as well as classification-specific models like logistic regression.

Find the least-squares regression line Y = aX + b. Estimate Y when X = 10.

x y
0 2
1 3
2 5
3 4
4 6
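Solution sketch: here x̄ = 2 and ȳ = 4, so a = Σ(x − x̄)(y − ȳ) / Σ(x − x̄)² = 9/10 = 0.9 and b = ȳ − a·x̄ = 4 − 0.9(2) = 2.2, giving Y = 0.9X + 2.2. At X = 10, the estimate is Y = 0.9(10) + 2.2 = 11.2.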

Write a short note on (a) Multivariate Regression

(b) Regularized Regression.

Write a short note on:

A. Least-Squares Regression for classification

B. Differentiate between Ridge and Lasso Regression

Discuss Support Vector Machines.

Support Vector Machine (SVM) is a powerful supervised machine learning algorithm used for
both classification and regression tasks, though it is more commonly used for classification. The
key idea behind SVM is to find the optimal separating boundary (called a hyperplane) that best
divides the dataset into different classes.

In a two-dimensional space, this boundary is simply a line. In higher dimensions, it becomes a hyperplane. The goal of SVM is to choose this hyperplane in such a way that it maximizes the margin, which is the distance between the hyperplane and the nearest data points of each class. These nearest points are called support vectors, and they are crucial in defining the position and orientation of the hyperplane.

If the data is not linearly separable, SVM uses a method called the kernel trick, which
transforms the input features into a higher-dimensional space where a linear separation is
possible. This makes SVM highly flexible and capable of handling complex, non-linear data as
well.

SVM is especially effective in high-dimensional spaces and is often used when the number of
features exceeds the number of samples.

How It Works

• For classification: SVM identifies the boundary (hyperplane) that best separates
different classes. The closest data points to this boundary are called support vectors.

• For regression: SVM tries to fit the best possible line (or surface) within a certain margin,
allowing some flexibility for prediction.

• For non-linear data: SVM uses the kernel trick (like RBF or polynomial kernels) to transform data into a higher dimension where it becomes linearly separable, as in the sketch below.
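A minimal scikit-learn sketch of an RBF-kernel SVM on non-linearly separable toy data (the library, dataset, and parameter choices are illustrative assumptions):

```python
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Two interleaving half-moons: not separable by a straight line
X, y = make_moons(n_samples=200, noise=0.2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# The RBF kernel implicitly maps the data to a higher-dimensional space
clf = SVC(kernel="rbf", C=1.0, gamma="scale").fit(X_train, y_train)
print("Test accuracy:", clf.score(X_test, y_test))
print("Support vectors found:", len(clf.support_vectors_))
```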

Advantages

• Works well with high-dimensional and small datasets

• Can handle non-linear decision boundaries

• Robust to noise

• Good generalization and avoids overfitting

• Applicable in classification and regression

Disadvantages

• Slow and memory-intensive for large datasets

• Choice of kernel and parameters is critical and can be tricky

• No probabilistic output

• Doesn’t handle missing values well

• Not naturally suited for multi-class classification (requires special techniques)

Applications

• Face recognition

• Text classification

• Bioinformatics (e.g., DNA analysis, protein classification)

• Handwriting recognition

• Speech recognition

• Facial expression detection

• Predictive control systems

Find a linear regression equation for the following two sets of data:

x y
5 40
7 120
12 180
16 210
20 240
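Solution sketch: x̄ = 12 and ȳ = 158, so m = Σ(x − x̄)(y − ȳ) / Σ(x − x̄)² = 1880/154 ≈ 12.21 and c = ȳ − m·x̄ = 158 − 12.21(12) ≈ 11.5, giving y ≈ 12.21x + 11.5.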
Write a short note on (a) Multivariate Regression and (b) Regularized Regression.

Write a short note on:

A. Least-Squares Regression for classification

B. Ridge and Lasso Regression
