UNIT-2 (SVM, KNN, and Density Estimation)
What is SVM?
• Support Vector Machine (SVM) is a supervised machine learning algorithm used
for both classification and regression. Although it can be applied to regression
problems, it is best suited for classification.
• The main objective of the SVM algorithm is to find the optimal hyperplane in an
N-dimensional feature space that separates the data points of the different
classes.
• The hyperplane is chosen so that the margin between the closest points of
different classes is as large as possible.
• The dimension of the hyperplane depends upon the number of features. If the
number of input features is two, then the hyperplane is just a line.
• If the number of input features is three, then the hyperplane becomes a 2-D plane.
It becomes difficult to imagine when the number of features exceeds three.
Hyperplane: There can be multiple lines/decision boundaries that segregate the
classes in n-dimensional space, but we need to find the best decision
boundary that helps to classify the data points. This best boundary is known as
the hyperplane of SVM. The hyperplane equation is w·x + b = 0.
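In the standard formulation (summarized here in LaTeX; this is the usual textbook statement rather than something written out in these notes), the prediction rule and the margin-maximization objective are:

\[
\hat{y} = \operatorname{sign}(w \cdot x + b), \qquad
\min_{w,\,b}\ \tfrac{1}{2}\lVert w \rVert^2
\quad \text{subject to} \quad y_i\,(w \cdot x_i + b) \ge 1 \ \text{for all } i,
\]

and the resulting margin between the two classes is 2 / ||w||.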
Types of SVM:
• Linear SVM: Linear SVM is used for linearly separable data. If a dataset
can be classified into two classes by using a single straight line, then
such data is termed linearly separable data, and the classifier used is
called a Linear SVM classifier.
• Non-linear SVM: Non-linear SVM is used for non-linearly separable
data. If a dataset cannot be classified by using a straight line, then such
data is termed non-linear data, and the classifier used is called a
Non-linear SVM classifier.
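As a quick illustration (a minimal sketch assuming scikit-learn is available; it reuses the data points of the worked example below), the two variants differ only in the kernel argument:

import numpy as np
from sklearn.svm import SVC

# Positive points first, then negative points, as in the example below.
X = np.array([[3, 1], [3, -1], [6, 1], [6, -1],
              [1, 0], [0, 1], [0, -1], [-1, 0]])
y = np.array([1, 1, 1, 1, -1, -1, -1, -1])

linear_svm = SVC(kernel="linear").fit(X, y)    # Linear SVM
nonlinear_svm = SVC(kernel="rbf").fit(X, y)    # Non-linear SVM (RBF kernel)
print(linear_svm.predict([[4, 0]]))            # -> [1], right of the x1 = 2 boundary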
Example:
Let's plot the given points. From the plot, the support vectors are s1 = (1, 0)
from the negative class and s2 = (3, 1), s3 = (3, -1) from the positive class.
Each vector is augmented with a bias input of 1:
s̃1 = (1, 0, 1), s̃2 = (3, 1, 1), s̃3 = (3, -1, 1)
Find the values of a1, a2, and a3 by solving the following equations (the target
is -1 for the negative support vector and +1 for the positive ones):
a1 (s̃1 · s̃1) + a2 (s̃2 · s̃1) + a3 (s̃3 · s̃1) = -1
a1 (s̃1 · s̃2) + a2 (s̃2 · s̃2) + a3 (s̃3 · s̃2) = +1
a1 (s̃1 · s̃3) + a2 (s̃2 · s̃3) + a3 (s̃3 · s̃3) = +1
Substituting the dot products gives:
2a1 + 4a2 + 4a3 = -1
4a1 + 11a2 + 9a3 = +1
4a1 + 9a2 + 11a3 = +1
After solving these equations: a1 = -3.5, a2 = 0.75, and a3 = 0.75.
Now, calculate the weight vector:
w̃ = a1 s̃1 + a2 s̃2 + a3 s̃3 = -3.5 (1, 0, 1) + 0.75 (3, 1, 1) + 0.75 (3, -1, 1) = (1, 0, -2)
So the weight vector is w = (1, 0) with bias b = -2, and the separating
hyperplane is x1 = 2.
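The same system can be checked numerically (a minimal sketch assuming NumPy; the variable names are illustrative):

import numpy as np

# Augmented support vectors (bias input 1 appended), as in the example above.
S = np.array([[1, 0, 1],    # s1 (negative class)
              [3, 1, 1],    # s2 (positive class)
              [3, -1, 1]])  # s3 (positive class)
targets = np.array([-1, 1, 1])  # desired output for each support vector

G = S @ S.T                  # Gram matrix of pairwise dot products
a = np.linalg.solve(G, targets)
print(a)                     # [-3.5   0.75  0.75]

w_tilde = a @ S              # weight vector = sum of a_i * s~_i
print(w_tilde)               # [ 1.  0. -2.]  ->  w = (1, 0), b = -2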
TASK-1
Q 1. Positively labelled data points: (2,1), (2,-1), (5,1), (5,-1); negatively
labelled data points: (1,0), (0,1), (0,-1), (-1,0). Find the support vectors,
the weight vector, and the separating hyperplane.
KNN
• The K-Nearest Neighbors (KNN) algorithm is a supervised machine learning
algorithm that is widely used in pattern recognition, data mining, and
intrusion detection.
• It does not require any assumptions about the underlying data distribution.
• It can also handle both numerical and categorical data, making it a flexible choice for
various types of datasets in classification and regression tasks.
• It is a non-parametric method that makes predictions based on the similarity of data
points in a given dataset. With a suitably large value of K, K-NN is also less
sensitive to outliers than many other algorithms.
• The K-NN algorithm works by finding the K nearest neighbors to a given data point
based on a distance metric, such as Euclidean distance.
• The class or value of the data point is then determined by the majority vote or average
of the K neighbors. This approach allows the algorithm to adapt to different patterns
and make predictions based on the local structure of the data.
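In practice this is available off the shelf (a minimal sketch assuming scikit-learn; the tiny dataset is illustrative):

import numpy as np
from sklearn.neighbors import KNeighborsClassifier

X = np.array([[1, 2], [2, 3], [3, 1], [6, 5], [7, 7], [8, 6]])
y = np.array([0, 0, 0, 1, 1, 1])

knn = KNeighborsClassifier(n_neighbors=3)  # K = 3, Euclidean distance by default
knn.fit(X, y)
print(knn.predict([[2, 2]]))  # -> [0], the majority class among the 3 nearest points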
How to Choose the Value of K in the K-NN Algorithm:
• There is no single best way of choosing the value of K, but here are some
common conventions to keep in mind:
• Choosing a very low value will most likely lead to inaccurate
predictions.
• The most commonly used value of K is 5.
• Use an odd number as the value of K, to avoid tied votes in binary
classification.
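A more principled alternative to these rules of thumb (a sketch assuming scikit-learn and its bundled iris dataset; this procedure is not prescribed by these notes) is to cross-validate candidate values of K:

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
for k in [1, 3, 5, 7, 9, 11]:
    # 5-fold cross-validated accuracy for each candidate K.
    scores = cross_val_score(KNeighborsClassifier(n_neighbors=k), X, y, cv=5)
    print(f"K={k}: mean accuracy {scores.mean():.3f}")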
Example:
Step 1: Find the distance from the new data point to every point in the dataset.
Step 2: Assign ranks to the calculated distances from 1 to n, with rank 1 for the
lowest distance.
Step 3: Assign a label to the new data point on the basis of the value of K.
• If the value of K = 1, the label for the new data point will be Normal.
• In this scenario, with K = 5, the new data point lies in the
Normal class.
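The same three steps written out directly (a minimal from-scratch sketch assuming NumPy; the toy dataset and the name knn_predict stand in for the example's data, which is not reproduced here):

import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_new, k):
    # Step 1: distance from the new data point to every training point.
    dists = np.linalg.norm(X_train - x_new, axis=1)
    # Step 2: rank the distances; argsort puts rank 1 (lowest distance) first.
    nearest = np.argsort(dists)[:k]
    # Step 3: label the new data point by majority vote among the K neighbours.
    return Counter(y_train[nearest]).most_common(1)[0][0]

X_train = np.array([[1.0, 2.0], [2.0, 3.0], [3.0, 1.0], [6.0, 5.0], [7.0, 7.0]])
y_train = np.array(["Normal", "Normal", "Normal", "Abnormal", "Abnormal"])
print(knn_predict(X_train, y_train, np.array([2.0, 2.0]), k=3))  # -> Normal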
Advantages and Disadvantages of K-NN Algorithm:
• Advantages of K-NN Algorithm
• It is simple to implement.
• No training is required before classification.
• Disadvantages of K-NN Algorithm
• It can be computationally expensive when working with a large data set.
• A lot of memory is required when processing large data sets.
• Choosing the right value of K can be tricky.
Density estimation:
• Density estimation is a statistical technique used to estimate the
probability density function (PDF) of a random variable.
• In simple terms, it is a method to estimate how likely it is that a given
observation belongs to a certain distribution.
• Density estimation is particularly useful in fields such as machine
learning, statistics, and data analysis, where understanding the
underlying distribution of data is important.
• There are various methods for density estimation, and here are a few common
ones:
1. Histograms:
1. Divide the data into bins.
2. Count the number of data points in each bin.
3. Normalize the counts by the total number of observations and the bin width to obtain a
probability density.
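A histogram density in code (a minimal sketch assuming NumPy; the sample data is illustrative):

import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(size=1000)

# density=True divides the counts by (number of observations * bin width),
# so the histogram integrates to 1 and approximates a probability density.
density, bin_edges = np.histogram(data, bins=20, density=True)
print(density.sum() * np.diff(bin_edges)[0])  # ~1.0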
2. Kernel Density Estimation (KDE):
1. Place a kernel (smooth, continuous function, such as a Gaussian) at each data point.
2. Sum the contributions from all kernels to obtain the estimated density.
3. The bandwidth of the kernel controls the smoothness of the estimate.
(Figure: 100 points drawn from a bimodal distribution, with the kernel density
estimates shown for three choices of kernel.)
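A minimal KDE sketch (assuming SciPy; the bimodal sample loosely mirrors the figure's setup and is illustrative):

import numpy as np
from scipy.stats import gaussian_kde

rng = np.random.default_rng(0)
data = np.concatenate([rng.normal(-2, 1, 50), rng.normal(3, 1, 50)])

kde = gaussian_kde(data)                         # Gaussian kernel, automatic bandwidth
kde_narrow = gaussian_kde(data, bw_method=0.2)   # smaller bandwidth -> less smoothing
print(kde(np.array([0.0])), kde_narrow(np.array([0.0])))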
3. Parametric Methods:
1. Assume a specific parametric form for the underlying distribution (e.g., normal distribution,
exponential distribution).
2. Estimate the parameters of the distribution from the data.
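Fitting a parametric density (a sketch assuming SciPy; the normal form and sample data are illustrative):

import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
data = rng.normal(loc=5.0, scale=2.0, size=500)

# Assume a normal form, then estimate its parameters from the data (MLE).
mu, sigma = stats.norm.fit(data)
print(mu, sigma)  # close to the true values 5.0 and 2.0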
4. Non-parametric Methods:
1. Do not assume a specific parametric form for the distribution.
2. KDE is an example of a non-parametric method.
• Density estimation is often used for tasks such as anomaly detection, clustering, and
generating synthetic data. It helps in understanding the structure of the data and can be a
crucial step in exploratory data analysis.
• In machine learning, density estimation can be part of various algorithms, such as Gaussian
Mixture Models (GMMs) and certain types of neural networks, where estimating the
underlying distribution is essential for making predictions or generating new data samples.
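For instance (a sketch assuming scikit-learn; the two-component setup and data are illustrative):

import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
X = np.concatenate([rng.normal(-2, 1, (100, 1)), rng.normal(3, 1, (100, 1))])

gmm = GaussianMixture(n_components=2, random_state=0).fit(X)
print(gmm.means_.ravel())    # estimated component means, near -2 and 3
X_new, _ = gmm.sample(5)     # draw synthetic samples from the fitted density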
Parzen Window:
• The Parzen window, also known as the kernel density estimation with
a fixed kernel or the "window" method, is a non-parametric technique
used for estimating the probability density function (PDF) of a random
variable.
• It falls under the category of non-parametric density estimation
methods, where the goal is to estimate the underlying distribution of a
set of data points without assuming a specific parametric form for the
distribution.
Features of the Parzen window:
1. Kernel Function:
1. Choose a kernel function, typically a smooth, symmetric, and positive function (e.g., Gaussian
or Epanechnikov kernel).
2. The kernel function defines the shape of the "window" around each data point.
2. Window (or Bandwidth):
1. Choose a fixed window (bandwidth) size or adaptively select it based on the data.
2. The window determines the region around each data point where the kernel function
contributes to the density estimate.
3. Estimation:
1. For each data point, place a window centered at that point.
2. The contribution of each data point to the overall density estimate is given by the chosen
kernel function within its window.
3. Sum the contributions from all data points to obtain the final density estimate.
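Putting the three steps together (a minimal from-scratch sketch assuming NumPy and a Gaussian kernel; the name parzen_density is illustrative):

import numpy as np

def parzen_density(x, data, h=1.0):
    # Place a Gaussian kernel (window of bandwidth h) at each data point...
    u = (x - data) / h
    kernels = np.exp(-0.5 * u**2) / np.sqrt(2.0 * np.pi)
    # ...and average the contributions, scaled by the bandwidth.
    return kernels.sum() / (len(data) * h)

rng = np.random.default_rng(0)
data = rng.normal(size=200)
print(parzen_density(0.0, data))  # near the true N(0,1) density at 0, about 0.399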
• The choice of the kernel and bandwidth is crucial, and it
affects the smoothness and accuracy of the density estimate.
Common kernels include the Gaussian, Epanechnikov, and
rectangular kernels.
• The Parzen window method is a flexible and intuitive
approach to estimate probability densities, especially in
situations where the underlying distribution is unknown or
complex.
Thank you!