
Unit: 5

Supervised Learning:
Regression and Classification
Topics
• Regression: Introduction, Example of Regression, Common Regression Algorithms: Simple Linear Regression, Multiple Linear Regression
• Classification: Introduction, Classification Model, Classification Learning Steps, Classification Algorithms: kNN, Decision Tree, Random Forest, Support Vector Machine
Regression
Introduction
• In supervised learning, when we try to predict a real-valued variable such as ‘Price’ or ‘Weight’, the problem falls under the category of regression.
• A regression problem tries to forecast results as a continuous output.
• The Dependent Variable (Y) is the value to be predicted. This variable is presumed to be functionally related to the independent variable (X).
• In other words, the dependent variable(s) depend on the independent variable(s). The Independent Variable (X) is called the predictor. The independent variable (X) is used in a regression model to estimate the value of the dependent variable (Y).
• Regression is essentially finding a relationship or association between the dependent variable (Y) and the independent variable(s) (X).
COMMON REGRESSION ALGORITHMS
• Simple linear regression
• Multiple linear regression
• Polynomial regression
• Multivariate adaptive regression splines
• Logistic regression
• Maximum likelihood estimation (least squares)
Simple linear regression
• If the regression involves only one independent variable, it is called
simple regression.
• Thus, if we take ‘Price of a used car’ as the dependent variable and the
‘Year of manufacturing of the car’ as the independent variable, we can
build a simple regression.
• Slope represents how much the line in a graph changes in the vertical direction (Y-axis) over a change in the horizontal direction (X-axis). Slope is also referred to as the rate of change in a graph.
• Maximum and minimum points on a graph are found at points where the slope of the curve is zero. The slope becomes zero while changing either from a positive or from a negative value.
Simple linear regression

• Simple linear regression assumes a linear relationship between the dependent variable and the predictor variable, as shown in the figure: Ŷ = a + bX, where ‘a’ is the intercept and ‘b’ is the slope of the line.
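• A minimal sketch of fitting a simple linear regression with NumPy is shown below. The used-car figures (year of manufacture vs. price) are made-up values for illustration only.

import numpy as np

# Hypothetical data: year of manufacturing (X) and price of a used car (Y, in lakhs)
X = np.array([2010, 2012, 2014, 2016, 2018])
Y = np.array([2.0, 2.8, 3.5, 4.6, 5.4])

# Least-squares estimates of slope (b) and intercept (a) for the line Y_hat = a + b*X
b, a = np.polyfit(X, Y, deg=1)
print(f"intercept a = {a:.2f}, slope b = {b:.2f}")

# Predict the price of a car manufactured in 2015
print("predicted price for 2015:", a + b * 2015)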
Multiple linear regression

• If two or more independent variables are involved, it is called multiple regression.
• If we take ‘Price of a used car’ as the dependent variable and year of manufacturing (Year), brand of the car (Brand), and mileage run (Miles run) as the independent variables, we can build a multiple regression model.
• Consider the example of a multiple linear regression model with two predictor variables, namely X1 and X2. The following expression describes the equation involving the relationship:
Ŷ = a + b1X1 + b2X2
• The model describes a plane in the three-dimensional space of Ŷ, X1, and X2. Parameter ‘a’ is the intercept of this plane.
• Parameters ‘b1’ and ‘b2’ are referred to as partial regression coefficients.
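• A sketch of the same idea with two predictor variables using scikit-learn's LinearRegression; the car data below (X1 = year of manufacturing, X2 = miles run in thousands) is purely illustrative.

import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical predictors: X1 = year of manufacturing, X2 = miles run (in thousands)
X = np.array([[2010, 80], [2012, 60], [2014, 55], [2016, 30], [2018, 20]])
Y = np.array([2.0, 2.9, 3.4, 4.7, 5.5])   # price in lakhs (illustrative)

# Fit Y_hat = a + b1*X1 + b2*X2
model = LinearRegression().fit(X, Y)
print("intercept a:", model.intercept_)
print("partial regression coefficients b1, b2:", model.coef_)

# Predict the price of a 2015 car that has run 40,000 miles
print("prediction:", model.predict([[2015, 40]]))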
Classification
Introduction
• In supervised learning, the labelled training data provides the learning
basis.
• According to the definition of machine learning, this labelled training
data is the experience or prior knowledge or belief.
• Training data is the past information with known value of the class
field or ‘label’.
Supervised learning vs. unsupervised learning
CLASSIFICATION MODEL
• Classification is a type of supervised learning where a target feature,
which is of categorical type, is predicted for test data on the basis of
the information imparted by the training data.
• The target categorical feature is known as the class.
• Some typical classification problems include the following:
• Image classification
• Disease prediction
• Win–loss prediction of games
• Handwriting recognition
CLASSIFICATION LEARNING STEPS
Problem Identification
• Identifying the problem is the first step in the supervised learning
model.
• The problem needs to be a well-formed problem, i.e. a problem with well-defined goals and benefits, which has a long-term impact.
Identification of Required Data
• The required data set that precisely represents the identified problem
needs to be identified/evaluated.
• For example, if the problem is to predict whether a tumour is malignant or benign, then the corresponding patient data sets related to malignant and benign tumours are to be identified.
Data Pre-processing
• This step relates to cleaning and transforming the data set.
• This step ensures that all the unnecessary/irrelevant data elements
are removed.
• Data pre-processing refers to the transformations applied to the
identified data before feeding the same into the algorithm.
• Because the data is gathered from different sources, it is usually
collected in a raw format and is not ready for immediate analysis.
• This step ensures that the data is ready to be fed into the machine
learning algorithm.
Definition of Training Data Set
• Before starting the analysis, the user should decide what kind of data
set is to be used as a training set.
• In the case of signature analysis, for example, the training data set
might be a single handwritten alphabet, an entire handwritten word
(i.e. a group of the alphabets) or an entire line of handwriting (i.e.
sentences or a group of words).
Algorithm Selection
• This involves determining the structure of the learning function and the corresponding learning algorithm. This is the most critical step of the supervised learning model.
• On the basis of various parameters, the best algorithm for a given
problem is chosen.
Training
• The learning algorithm identified in the previous step is run on the
gathered training set for further fine-tuning.
• Some supervised learning algorithms require the user to determine
specific control parameters
Evaluation with the Test Data Set
• The trained model is run on the test data set, and its performance is measured in this step.
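• The learning steps above can be summarised in code. The sketch below uses scikit-learn and its built-in breast-cancer data set as a stand-in for the malignant/benign tumour example; the choice of kNN as the algorithm is only an illustrative assumption.

from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

# Identification of required data: a labelled data set for the identified problem
X, y = load_breast_cancer(return_X_y=True)

# Definition of the training (and test) data set
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Algorithm selection and training
model = KNeighborsClassifier(n_neighbors=5)
model.fit(X_train, y_train)

# Evaluation with the test data set
y_pred = model.predict(X_test)
print("accuracy:", accuracy_score(y_test, y_pred))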
COMMON CLASSIFICATION ALGORITHMS
Classification Algorithms
• k -Nearest Neighbour (kNN)
• Decision tree
• Random forest
• Support Vector Machine (SVM)
k-Nearest Neighbour (kNN)
• The kNN algorithm is a simple but extremely powerful classification
algorithm.
• There are many measures of similarity; the most common approach adopted by kNN to measure the similarity between two data elements is the Euclidean distance.
k-Nearest Neighbour (kNN)
• Consider a very simple data set having two features (say f1 and f2).
• The Euclidean distance between two data elements d1 and d2 can be measured by:
distance(d1, d2) = √((f11 − f12)² + (f21 − f22)²)
• Where
• f11 = value of feature f1 for data element d1
• f12 = value of feature f1 for data element d2
• f21 = value of feature f2 for data element d1
• f22 = value of feature f2 for data element d2
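• A small worked illustration of this distance in Python, with made-up feature values for the two data elements:

import math

d1 = (7.8, 90)   # (f1, f2) values for data element d1 (illustrative)
d2 = (6.2, 75)   # (f1, f2) values for data element d2 (illustrative)

distance = math.sqrt((d1[0] - d2[0]) ** 2 + (d1[1] - d2[1]) ** 2)
print("Euclidean distance:", distance)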
k-Nearest Neighbour (kNN) - Example
k-Nearest Neighbour (kNN) - Example: 2-D representation of the student data set
k-Nearest Neighbour (kNN) - Example: Distance calculation between test and training points
kNN algorithm
Input: Training data set, test data set (or data points), value of ‘k’ (i.e. number of nearest neighbours to be considered)
Steps:
Do for all test data points
    Calculate the distance (usually Euclidean distance) of the test data point from the different training data points.
    Find the closest ‘k’ training data points, i.e. the training data points whose distances from the test data point are the least.
    If k = 1
        Then assign the class label of the training data point to the test data point
    Else
        Whichever class label is predominantly present in the ‘k’ training data points, assign that class label to the test data point
End do
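• A minimal sketch of these steps in plain Python; the training points and class labels below are hypothetical, and majority voting is done with collections.Counter.

import math
from collections import Counter

def euclidean(p, q):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def knn_classify(test_point, training_data, k):
    # training_data: list of (feature_tuple, class_label) pairs
    neighbours = sorted(training_data, key=lambda item: euclidean(test_point, item[0]))[:k]
    labels = [label for _, label in neighbours]
    return Counter(labels).most_common(1)[0][0]   # predominant class label

# Hypothetical student data: (CGPA, aptitude score) -> Job Offered?
train = [((8.5, 90), "Yes"), ((6.0, 45), "No"), ((7.8, 80), "Yes"), ((5.5, 30), "No")]
print(knn_classify((7.0, 70), train, 3))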
Strengths of the kNN algorithm
• Extremely simple algorithm – easy to understand
• Very effective in certain situations, e.g. for recommender system
design
• Very fast or almost no time required for the training phase
Weaknesses of the kNN algorithm
• Does not learn anything in the real sense. Classification is done
completely on the basis of the training data. So, it has a heavy
reliance on the training data. If the training data does not represent
the problem domain comprehensively, the algorithm fails to make an
effective classification.
• Because there is no model trained in the real sense and the classification is done completely on the basis of the training data, the classification process is very slow.
• A large amount of computational space is required to load the
training data for classification.
Decision tree
• Decision tree learning is one of the most widely adopted algorithms
for classification.
• A decision tree is used for multi-dimensional analysis with multiple
classes.
• The goal of decision tree learning is to create a model (based on past data) that predicts the value of the output variable based on the input variables in the feature vector.
Decision tree
• Each internal node tests an attribute (represented
as ‘A’/‘B’ within the boxes).
• Each branch corresponds to an attribute value (T/F)
in the above case. Each leaf node assigns a
classification.
• The first node is called the ‘Root’ node.
• A decision tree consists of three types of nodes:
• Root Node
• Branch Node
• Leaf Node
Decision Tree - Example
Entropy of a decision tree
• Let us say S is the sample set of training examples. Then, Entropy (S), measuring the impurity of S, is defined as:
Entropy(S) = − Σ (i = 1 to c) pᵢ log₂(pᵢ)
• where c is the number of different class labels, and pᵢ refers to the proportion of values falling into the i-th class label.
Entropy of a decision tree
• For example, with respect to the training data, we have two values for the target class ‘Job Offered?’ – Yes and No.
• The value of pᵢ for class value ‘Yes’ is 0.44 (i.e. 8/18) and that for class value ‘No’ is 0.56 (i.e. 10/18). So, we can calculate the entropy as
• Entropy (S) = −0.44 log₂(0.44) − 0.56 log₂(0.56) = 0.99.
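• The same calculation in Python, as a quick check of the figures above:

import math

p_yes, p_no = 8 / 18, 10 / 18
entropy = -p_yes * math.log2(p_yes) - p_no * math.log2(p_no)
print(round(entropy, 2))   # 0.99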
Information gain of a decision tree
• The information gain is calculated on the basis of the decrease in entropy (S) after a data set is split according to a particular attribute (A).
• Constructing a decision tree is all about finding an attribute that
returns the highest information gain.
• If the information gain is 0, it means that there is no reduction in
entropy due to split of the data set according to that particular
feature.
• The maximum amount of information gain which may happen is the
entropy of the data set before the split.
Information gain of a decision tree
• Information gain for a particular feature A is calculated as the difference between the entropy before the split (Sbs) and the entropy after the split (Sas).
• Information Gain (S, A) = Entropy (Sbs) − Entropy (Sas)
• Entropy (Sas) is a weighted summation of the entropies of the partitions, where the proportion of examples falling into each partition is used as the weight.
Example: Entropy and information gain calculation (Level 1)
Entropy and information gain calculation (Level 1)
• For Attribute = ‘CGPA’,
• Total Entropy (Sas) = (6/18)*0.92 + (7/18)*0.99 + (5/18)*0.0 = 0.69
• Information gain = 0.99 – 0.69 = 0.30

• Similarly, calculate the information gain for all the other attributes.
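• A generic sketch for computing entropy and information gain of a candidate attribute from labelled rows is given below; the tiny data set only mimics the ‘Job Offered?’ example and is not the full 18-row table.

import math
from collections import Counter

def entropy(labels):
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, attribute, target):
    # Entropy before the split minus the weighted entropy after the split
    before = entropy([r[target] for r in rows])
    after = 0.0
    for value in set(r[attribute] for r in rows):
        subset = [r[target] for r in rows if r[attribute] == value]
        after += (len(subset) / len(rows)) * entropy(subset)
    return before - after

# Illustrative rows only (not the actual training table)
rows = [
    {"CGPA": "High", "Aptitude": "High", "Job Offered?": "Yes"},
    {"CGPA": "Low", "Aptitude": "Low", "Job Offered?": "No"},
    {"CGPA": "High", "Aptitude": "Low", "Job Offered?": "No"},
    {"CGPA": "Medium", "Aptitude": "High", "Job Offered?": "Yes"},
]
print(information_gain(rows, "Aptitude", "Job Offered?"))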


Example
• It is quite evident that among all the features, ‘Aptitude’ results in the
best information gain when adopted for the split.
• So, at the first level, a split will be applied according to the value of
‘Aptitude’ or in other words, ‘Aptitude’ will be the first node of the
decision tree formed.
• For Aptitude = Low, the entropy is 0, which indicates that the result will always be the same irrespective of the values of the other features. Hence, the branch towards Aptitude = Low will not continue any further.
• As a part of level 2, we will thus have only one branch to navigate in this case – the one for Aptitude = High.
Data for Aptitude = High
Entropy and information gain calculation (Level 2)
Level 3 Calculations
• As a part of level 3, we will thus have only one branch to navigate in
this case – the one for Communication = Bad
Entropy and information gain calculation (Level 3)
• The information gain after split with the feature CGPA is 0.81, which is
the maximum possible information gain (as the entropy before the
split was 0.81).
• Hence, obviously, a split will be applied on the basis of the value of ‘CGPA’. Because the maximum information gain has already been achieved, the tree will not continue any further.
Algorithm for decision tree
Input: Training data set, test data set (or data points)
Steps:
Set Emin to a large value
Do for all attributes Fi
    Calculate the entropy Ei of the attribute Fi
    If Ei < Emin
        Then Emin = Ei and Fmin = Fi
    End if
End do

Split the data set into subsets using the attribute Fmin
Draw a decision tree node containing the attribute Fmin and split the data set into subsets
Repeat the above steps until the full tree is drawn, covering all the attributes of the original table.
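• In practice, a library implementation is usually preferred over hand-rolling this algorithm. Below is a sketch with scikit-learn's DecisionTreeClassifier, using entropy as the split criterion to mirror the information-gain approach above; the built-in iris data set is only a stand-in.

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

# criterion='entropy' chooses splits by information gain; max_depth acts as pre-pruning
tree = DecisionTreeClassifier(criterion="entropy", max_depth=3, random_state=0)
tree.fit(X_train, y_train)
print("test accuracy:", tree.score(X_test, y_test))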
Avoiding overfitting in decision tree
– pruning
• The decision tree algorithm, unless a stopping criterion is applied,
may keep growing indefinitely – splitting for every feature and
dividing into smaller partitions till the point that the data is perfectly
classified. This, as is quite evident, results in an overfitting problem.
• To prevent a decision tree getting overfitted to the training data,
pruning of the decision tree is essential.
• Pruning a decision tree reduces the size of the tree such that the
model is more generalized and can classify unknown and unlabelled
data in a better way.
Avoiding overfitting in decision tree
– pruning
• There are two approaches of pruning:
• Pre-pruning: Stop growing the tree before it reaches perfection.
• Post-pruning: Allow the tree to grow entirely and then post-prune some of
the branches from it.
Strengths of decision tree
• It produces very simple, understandable rules. For smaller trees, not much mathematical or computational knowledge is required to understand this model.
• Works well for most problems.
• It can handle both numerical and categorical variables.
• Can work well both with small and large training data sets.
• Decision trees provide a definite clue of which features are more
useful for classification.
Weaknesses of decision tree
• Decision tree models are often biased towards features having a larger number of possible values, i.e. levels.
• This model gets overfitted or underfitted quite easily.
• Decision trees are prone to errors in classification problems with
many classes and relatively small number of training examples.
• A decision tree can be computationally expensive to train.
• Large trees are complex to understand.
Random forest model
• Random forest is an ensemble classifier, i.e. a combining classifier that
uses and combines many decision tree classifiers.
• Ensembling is usually done using the concept of bagging with
different feature sets.
• The reason for using a large number of trees in a random forest is to train the trees enough such that the contribution from each feature comes in a number of models.
• After the random forest is generated by combining the trees, majority
vote is applied to combine the output of the different trees.
A simplified random forest model
How does random forest work?
1. If there are N variables or features in the input data set, select a subset of ‘m ’
(m < N ) features at random out of the N features. The observations or data
instances should be picked randomly.
2. Use the best split principle on these ‘m’ features to calculate the number of
nodes ‘d’.
3. Keep splitting the nodes to child nodes till the tree is grown to the maximum
possible extent.
4. Select a different subset of the training data ‘with replacement’ to train another
decision tree following steps (1) to (3). Repeat this to build and train ‘n’ decision
trees.
5. Final class assignment is done on the basis of the majority votes from the ‘n ’
trees.
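• A sketch of the same workflow with scikit-learn's RandomForestClassifier; n_estimators corresponds to the number of trees ‘n’ and max_features to the size of the random feature subset ‘m’ described above (the data set is again a built-in stand-in).

from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=1)

# n_estimators = number of trees 'n'; max_features = random feature subset size 'm'
forest = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=1)
forest.fit(X_train, y_train)
print("test accuracy:", forest.score(X_test, y_test))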
Strengths of random forest
• It runs efficiently on large and expansive data sets.
• It has a robust method for estimating missing data and maintains precision when a
large proportion of the data is absent.
• It has powerful techniques for balancing errors in a class population of unbalanced
data sets.
• It gives estimates (or assessments) about which features are the most important ones
in the overall classification.
• It generates an internal unbiased estimate (gauge) of the generalisation error as the
forest generation progresses.
• Generated forests can be saved for future use on other data.
• Lastly, the random forest algorithm can be used to solve both classification and
regression problems.
Weaknesses of random forest
• This model, because it combines a number of decision tree models, is
not as easy to understand as a decision tree model.
• It is computationally much more expensive than a simple model like
decision tree.
Support vector machines
• SVM is a model that can perform linear classification as well as regression.
• SVM is based on the concept of a surface, called a hyperplane, which
draws a boundary between data instances plotted in the multi-
dimensional feature space.
• The output prediction of an SVM is one of two conceivable classes
which are already defined in the training data.
Classification using hyperplanes

Linearly separable data instances


Classification using hyperplanes
• Support Vectors: Support vectors are the data points (representing classes) that lie nearest to the identified hyperplane; they are the critical components of a data set. If the support vectors are removed, the position of the dividing hyperplane will change.
• Hyperplane and Margin: For an N-dimensional feature space, a hyperplane is a flat subspace of dimension (N−1) that separates and classifies a set of data.
Classification using hyperplanes
Mathematically, in a two-dimensional space, a hyperplane can be defined by the equation
c0 + c1X1 + c2X2 = 0,
which is nothing but the equation of a straight line.
Extending this concept to an N-dimensional space, a hyperplane can be defined by the equation
c0 + c1X1 + c2X2 + … + cNXN = 0,
which, in short, can be represented as follows (taking X0 = 1):
Σ (i = 0 to N) ciXi = 0
The distance between the hyperplane and the data points is known as the margin.
Identifying the correct hyperplane in
SVM
• There may be multiple options for hyper-planes dividing the data
instances belonging to the different classes.
• Need to identify which one will result in the best classification.
SVM
• Three hyperplanes: A, B, and C.
• We need to identify the correct hyperplane which better segregates the two classes represented by the triangles and circles.
• Hyperplane ‘A’ has performed this task quite well.
• Three hyperplanes: A, B, and C.
• We need to identify the correct hyperplane which classifies the triangles and circles in the best possible way.
• Here, maximizing the distance between the nearest data points of both the classes and the hyperplane will help us decide the correct hyperplane. This distance is called the margin.
Kernel trick
• SVM has a technique called the kernel trick to deal with non-linearly separable data.
• Kernels are functions which transform a lower-dimensional input space into a higher-dimensional space.
• This converts linearly non-separable data into linearly separable data.
Kernel trick
• Some of the common kernel functions used by different SVM implementations for transforming from a lower dimension ‘i’ to a higher dimension ‘j’ are the linear kernel, the polynomial kernel, the sigmoid kernel, and the Gaussian radial basis function (RBF) kernel.
Kernel trick
• When data instances of the classes are closer to each other, this
method can be used. The effectiveness of SVM depends both on the
• Selection of the kernel function
• Adoption of values for the kernel parameters
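• A sketch of an SVM with an RBF kernel on a non-linearly separable data set (scikit-learn's make_moons), illustrating how the kernel choice and its parameters drive effectiveness; the specific parameter values are illustrative assumptions.

from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Two interleaving half-moons: not separable by a straight line in 2-D
X, y = make_moons(n_samples=300, noise=0.2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

# The RBF kernel implicitly maps the data to a higher-dimensional space
svm = SVC(kernel="rbf", C=1.0, gamma="scale")
svm.fit(X_train, y_train)
print("test accuracy:", svm.score(X_test, y_test))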
Strengths of SVM
• SVM can be used for both classification and regression.
• It is robust, i.e. not much impacted by data with noise or outliers.
• The prediction results using this model are very promising.
Weaknesses of SVM
• SVM is applicable only for binary classification, i.e. when there are
only two classes in the problem domain.
• The SVM model is very complex – almost like a black box when it
deals with a high-dimensional data set. Hence, it is very difficult and
close to impossible to understand the model in such cases.
• It is slow for a large dataset, i.e. a data set with either a large number
of features or a large number of instances.
• It is quite memory-intensive.
Thank YOU
