Unit 2 - NOTES1 - ML
Unit 2 - Supervised Learning
Machine Learning
How Does Supervised Learning Work?
In supervised learning, models are trained on a labelled dataset,
from which the model learns the characteristics of each type of data.
Once the training process is completed, the model is tested on test
data (a portion of the dataset held out from training), and it then
predicts the output.
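The workflow above can be sketched in a few lines of Python. This is a minimal illustration with an invented toy dataset and a trivial majority-class "model"; real systems use proper learning algorithms, but the train/test pattern is the same.

```python
# Sketch of the supervised-learning workflow: train on labelled data,
# then evaluate on held-out test data the model never saw.
# The dataset here is invented purely for illustration.
from collections import Counter
import random

labelled = [({"len": n}, "spam" if n > 5 else "ham") for n in range(20)]
random.seed(0)
random.shuffle(labelled)

split = int(0.8 * len(labelled))          # 80% train, 20% test
train, test = labelled[:split], labelled[split:]

# "Training": a trivial baseline that always predicts the majority class.
majority = Counter(label for _, label in train).most_common(1)[0][0]

# Testing: accuracy measured only on the held-out data.
correct = sum(1 for _, label in test if majority == label)
accuracy = correct / len(test)
print(f"majority class: {majority}, test accuracy: {accuracy:.2f}")
```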
Classification in Machine Learning
Supervised machine learning algorithms can be broadly classified
into Regression and Classification algorithms.
Classification is the process of categorizing a given set of data into
classes.
It can be performed on both structured and unstructured data.
A program learns from the given dataset or observations and then
classifies new observations into a number of classes or groups.
E.g. : Yes or No, 0 or 1, Spam or Not Spam, cat or dog, etc. The
classes are often referred to as targets, labels, or categories.
The process starts with predicting the class of given data points.
Classification predictive modeling is the task of approximating a
mapping function from input variables to discrete output variables.
The main goal is to identify which class/category the new data will
fall into.
The best example of an ML classification algorithm is
Email Spam Detector.
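A toy version of the spam-detector idea can be written as a keyword-count rule. The keyword list and threshold below are invented for illustration; a real detector would learn its weights from labelled emails rather than use a hand-written rule.

```python
# Toy "spam detector": classify a message as spam or not spam by
# counting suspicious keywords. Keywords and threshold are invented.
SPAM_WORDS = {"free", "winner", "prize", "urgent", "offer"}

def classify(message: str) -> str:
    words = message.lower().split()
    hits = sum(1 for w in words if w in SPAM_WORDS)
    return "spam" if hits >= 2 else "not spam"

print(classify("urgent free prize inside"))  # several keywords -> spam
print(classify("meeting moved to 3pm"))      # no keywords -> not spam
```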
Classification in Machine Learning
E.g. : Heart disease detection can be identified as a classification problem.
This is binary classification, since there can be only two classes: has
heart disease or does not have heart disease. The classifier, in this case,
needs training data to understand how the given input variables are related
to the class. Once the classifier is trained accurately, it can be used to
detect whether or not a particular patient has heart disease.
The most common classification problems are speech recognition, face
detection, handwriting recognition, document classification, etc. A
classification problem can be either binary or multi-class.
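The heart-disease example above can be sketched as a binary classifier that learns a single decision threshold from training data. The feature (resting heart rate) and all values below are invented for illustration; real classifiers use many features and stronger models.

```python
# Binary classification sketch: learn, from invented training data, the
# heart-rate threshold that best separates the two classes.
train = [(62, 0), (70, 0), (74, 0), (88, 1), (95, 1), (102, 1)]  # (rate, label)

best_thresh, best_correct = None, -1
for thresh, _ in train:
    correct = sum(1 for rate, label in train
                  if (1 if rate >= thresh else 0) == label)
    if correct > best_correct:
        best_thresh, best_correct = thresh, correct

def predict(rate: float) -> int:
    """1 = has heart disease, 0 = does not (per the learned threshold)."""
    return 1 if rate >= best_thresh else 0

print(best_thresh, predict(68), predict(99))
```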
Types of Classification
The algorithm which implements the classification on a dataset is
known as a classifier.
Binary Classifier: If the classification problem has only two
possible outcomes, it is called a Binary Classifier.
Examples: YES or NO, MALE or FEMALE, SPAM or NOT
SPAM, CAT or DOG, etc.
Multi-class Classifier: If a classification problem has more than
two outcomes, it is called a Multi-class Classifier.
Example: Classification of types of music.
Types Of Learners In Classification
Lazy Learners – Lazy learners simply store the training data and wait until
test data appears. Classification is then done using the most closely related
data in the stored training set. They take longer at prediction time than
eager learners. Eg – k-nearest neighbour, case-based reasoning.
Eager Learners – Eager learners construct a classification model from the
given training data before receiving data for prediction. They must commit
to a single hypothesis that works for the entire instance space. Due to
this, they take a long time to train and less time to predict.
Eg – Decision Tree, Naive Bayes, Artificial Neural Networks.
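The lazy-learner idea can be shown with a tiny 1-nearest-neighbour classifier: "training" is just storing the points, and all the distance computation happens at prediction time. The points are invented for illustration.

```python
# Lazy learning sketch: 1-nearest neighbour. Training = storing the data;
# all work is deferred to prediction time. Points are invented.
train_points = [((1.0, 1.0), "cat"), ((1.2, 0.8), "cat"),
                ((4.0, 4.2), "dog"), ((4.5, 3.9), "dog")]

def predict_1nn(x: float, y: float) -> str:
    # Compute distances to every stored point only when asked to predict.
    def dist_sq(p):
        (px, py), _ = p
        return (px - x) ** 2 + (py - y) ** 2
    return min(train_points, key=dist_sq)[1]

print(predict_1nn(1.1, 0.9))  # nearest stored points are cats
print(predict_1nn(4.2, 4.0))  # nearest stored points are dogs
```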
Regression
Regression is the process of finding a model or function that maps the
data to continuous real values instead of classes or discrete values.
Regression analysis is a statistical method to model the relationship
between a dependent (target) variable and one or more independent
(predictor) variables.
Regression analysis helps us to understand how the value of the dependent
variable is changing corresponding to an independent variable when other
independent variables are held fixed.
It predicts continuous/real values such as temperature, age, salary, price, etc.
E.g. : A company wants to predict its sales when it spends 200 on
advertisement.
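The advertising example can be worked out with ordinary least squares in a few lines. The (spend, sales) pairs below are invented for illustration; the closed-form slope and intercept are the standard simple-linear-regression formulas.

```python
# Simple linear regression (closed form): fit sales as a linear function
# of advertising spend, then predict sales for a spend of 200.
# The data points are invented for illustration.
ads   = [50, 100, 150, 250, 300]
sales = [12, 20, 28, 44, 52]

n = len(ads)
mean_x = sum(ads) / n
mean_y = sum(sales) / n
slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(ads, sales))
         / sum((x - mean_x) ** 2 for x in ads))
intercept = mean_y - slope * mean_x

predicted = intercept + slope * 200
print(f"sales = {intercept:.2f} + {slope:.3f} * spend; at 200: {predicted:.1f}")
```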
Regression
Regression is a supervised learning technique which helps in finding the
correlation between variables and enables us to predict a continuous
output variable based on one or more predictor variables.
It is mainly used for prediction, forecasting, time series modeling, and
determining the causal-effect relationship between variables.
"Regression shows a line or curve that passes through all the datapoints
on target-predictor graph in such a way that the vertical distance between
the datapoints and the regression line is minimum."
Regression Examples
Some examples of regression are:
Prediction of rain using temperature and other factors
Determining Market trends
Prediction of road accidents due to rash driving.
Regression Terminology
Dependent Variable: The main factor in regression analysis that we want to predict
or understand is called the dependent variable. It is also called the target variable.
Independent Variable: The factors that affect the dependent variable, or that are
used to predict its values, are called independent variables, also called
predictors.
Outliers: An outlier is an observation with either a very low or very high value
in comparison to the other observed values.
Multicollinearity: If the independent variables are highly correlated with each
other, this condition is called multicollinearity.
Underfitting and Overfitting: If our algorithm works well on the training dataset
but not on the test dataset, the problem is called overfitting. If our algorithm
does not perform well even on the training dataset, the problem is called
underfitting.
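The multicollinearity check above can be made concrete by computing the Pearson correlation between two predictors; a value near 1 or -1 flags highly correlated features. All the data below are invented for illustration.

```python
# Multicollinearity sketch: Pearson correlation between predictors.
# Height in cm vs height in inches is the same quantity, so r is ~1.0.
def pearson(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

height_cm   = [150, 160, 170, 180, 190]
height_inch = [59, 63, 67, 71, 75]      # same quantity in other units
shoe_size   = [38, 44, 39, 43, 41]      # a less related predictor

print(pearson(height_cm, height_inch))  # ~1.0: multicollinear pair
print(pearson(height_cm, shoe_size))    # much weaker correlation
```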
Linear Regression vs Logistic Regression
In Linear Regression, we predict the value as a continuous number. In
Logistic Regression, we predict the value as 1 or 0.
In Linear Regression, the dependent variable should be numeric and the
response variable is continuous in value. In Logistic Regression, the
dependent variable consists of only two categories; logistic regression
estimates the odds of the outcome of the dependent variable given a set of
quantitative or categorical independent variables.
Linear regression is used to estimate the dependent variable in case of a
change in independent variables; for example, predict the price of houses.
Logistic regression is used to calculate the probability of an event; for
example, classify whether a mail is spam or not.
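The contrast above can be shown directly: a linear model outputs an unbounded continuous value, while logistic regression passes a linear score through the sigmoid to get a probability in (0, 1). The coefficients below are invented for illustration, not fitted to data.

```python
# Linear vs logistic output. Coefficients are invented for illustration.
import math

def linear(x):
    # e.g. predicted house price: any real value
    return 50.0 + 3.0 * x

def logistic(x):
    # e.g. probability that a mail is spam
    z = -4.0 + 0.8 * x             # linear score...
    return 1 / (1 + math.exp(-z))  # ...squashed into (0, 1) by the sigmoid

print(linear(10))    # unbounded continuous value
print(logistic(10))  # a probability between 0 and 1
```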
Polynomial Regression
It is the same as Multiple Linear Regression with a little modification.
It is used for curvilinear data.
It is a regression algorithm that models the relationship between a dependent variable (y) and an
independent variable (x) as an nth-degree polynomial:
y = b0 + b1x + b2x^2 + b3x^3 + ... + bnx^n
It is also called a special case of Multiple Linear Regression in ML, because we add polynomial terms
to the Multiple Linear Regression equation to convert it into Polynomial Regression.
It is a linear model with some modification in order to increase the accuracy.
Support Vector Regression
It identifies a hyperplane with maximum margin such that
the maximum number of data points are within those
margins.
The basic idea behind SVR is to find the best-fit line. In SVR, the
best-fit line is the hyperplane that contains the maximum number of
data points within its margin.
In SVR, this straight line is referred to as hyperplane.
The data points on either side of the hyperplane that are closest
to it are called support vectors, and they are used to plot the
boundary lines.
Unlike other Regression models that try to minimize the error
between the real and predicted value, the SVR tries to fit the
best line within a threshold value.
The threshold value is the distance between the hyperplane and
boundary line.
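The threshold idea above corresponds to SVR's epsilon-insensitive loss: errors inside the epsilon "tube" around the hyperplane cost nothing, and only points outside the tube contribute to the loss. The epsilon value and sample errors below are invented for illustration.

```python
# SVR's epsilon-insensitive loss: zero inside the tube, linear outside.
def epsilon_insensitive_loss(y_true, y_pred, epsilon=1.0):
    return max(0.0, abs(y_true - y_pred) - epsilon)

print(epsilon_insensitive_loss(10.0, 10.4))  # inside the tube
print(epsilon_insensitive_loss(10.0, 12.5))  # outside the tube
```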
Decision Tree Regression
It is a tree-structured model with three types of nodes.
The Root Node is the initial node; it represents the entire
sample and may be split into further nodes.
The Interior Nodes represent the features of a data set and the
branches represent the decision rules.
Finally, the Leaf Nodes represent the outcome. This algorithm
is very useful for solving decision-related problems.
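The node structure above can be sketched as a depth-1 regression tree (a "stump"): the root node tries every split of a single feature, keeps the one that minimises the squared error, and each leaf node predicts the mean target of its side. The (x, y) data are invented for illustration.

```python
# Decision tree regression at depth 1: best single split by squared error,
# leaf prediction = mean of the targets on that side. Data are invented.
data = [(1, 5.0), (2, 6.0), (3, 5.5), (10, 20.0), (11, 21.0), (12, 19.5)]

def sse(ys):
    if not ys:
        return 0.0
    mean = sum(ys) / len(ys)
    return sum((y - mean) ** 2 for y in ys)

best = None  # (total_sse, threshold, left_mean, right_mean)
for thresh, _ in data:
    left  = [y for x, y in data if x < thresh]
    right = [y for x, y in data if x >= thresh]
    if not left or not right:
        continue
    total = sse(left) + sse(right)
    if best is None or total < best[0]:
        best = (total, thresh, sum(left) / len(left), sum(right) / len(right))

_, threshold, left_mean, right_mean = best

def predict(x):
    return left_mean if x < threshold else right_mean

print(threshold, predict(2), predict(11))
```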
Advantages of Supervised learning:
With the help of supervised learning, the model can predict the
output on the basis of prior experiences.
In supervised learning, we can have an exact idea about the
classes of objects.
Supervised learning models help us to solve various real-world
problems such as fraud detection, spam filtering, etc.
Disadvantages of supervised learning:
Supervised learning models are not suitable for handling complex
tasks.
Supervised learning cannot predict the correct output if the test
data is different from the training dataset.
Training requires a lot of computation time.
In supervised learning, we need enough knowledge about the
classes of objects.