
What is Regression?

Regression is a statistical approach used to analyze the relationship between a dependent variable (the target variable) and one or more independent variables (the predictor variables). The objective is to determine the function that best characterizes the relationship between these variables, so that the fitted model can be used to make predictions or draw conclusions.

Linear Regression vs Logistic Regression

| Linear Regression | Logistic Regression |
| --- | --- |
| Used to predict a continuous dependent variable from a given set of independent variables. | Used to predict a categorical dependent variable from a given set of independent variables. |
| Used for solving regression problems. | Used for solving classification problems. |
| We predict the value of a continuous variable. | We predict the value of a categorical variable. |
| We find the best-fit line, by which we can easily predict the output. | We find the S-curve, by which we can classify the samples. |
| Model parameters are estimated by the least squares method. | Model parameters are estimated by maximum likelihood estimation. |
| The output must be a continuous value, such as price or age. | The output must be a categorical value, such as 0 or 1, Yes or No. |
| The relationship between the dependent and independent variables must be linear. | A linear relationship between the dependent and independent variables is not required. |
| There may be collinearity between the independent variables. | There should not be collinearity between the independent variables. |
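To make the contrast concrete, here is a minimal sketch, assuming scikit-learn is installed; the toy data and variable names are invented for illustration:

```python
# Fit both models on made-up data with two independent variables.
import numpy as np
from sklearn.linear_model import LinearRegression, LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))

# Linear regression: continuous target (e.g., a price).
y_cont = 3.0 * X[:, 0] - 2.0 * X[:, 1] + rng.normal(scale=0.1, size=100)
lin = LinearRegression().fit(X, y_cont)
print(lin.predict(X[:3]))        # continuous outputs

# Logistic regression: categorical target (0 or 1).
y_cat = (X[:, 0] + X[:, 1] > 0).astype(int)
log = LogisticRegression().fit(X, y_cat)
print(log.predict(X[:3]))        # class labels
```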

Applications of PCA in Machine Learning

• PCA is used to visualize multidimensional data.

• It is used to reduce the number of dimensions in healthcare data.

• PCA can help compress an image.

• It can be used in finance to analyze stock data and forecast returns.

• PCA helps find patterns in high-dimensional datasets (a minimal sketch follows this list).
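As a minimal sketch of dimensionality reduction with scikit-learn (the dataset and component count are illustrative choices, not from the text):

```python
# Reduce 64-dimensional digit images to 2 components for visualization.
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

X, y = load_digits(return_X_y=True)      # X has shape (1797, 64)
pca = PCA(n_components=2)
X_2d = pca.fit_transform(X)              # shape (1797, 2)
print(pca.explained_variance_ratio_)     # variance captured per component
```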


Here are some examples of unsupervised learning:
• Anomaly detection
Unsupervised learning can identify data points that are unusual in a dataset. For
example, cybersecurity programs can use unsupervised learning to detect
deviations in network traffic patterns that might indicate a hacker.
• Customer segmentation
Unsupervised learning can help businesses understand their customers' common traits and purchasing habits, which can help them personalize their advertising strategies (a clustering sketch for this use case follows the list).
• Recommendation engines
Unsupervised learning can help businesses discover data trends that can be used
to develop effective cross-selling strategies. For example, e-commerce or news
websites can use unsupervised learning to analyze customer behavior and
recommend products to similar users.
• Natural language processing (NLP)
Unsupervised learning can be used for various NLP applications, such as
categorizing articles in news sections, text translation, and speech recognition.
• Time series analysis
Unsupervised learning can be used to find patterns in time series data and make
predictions about future events. This is important for things like weather
forecasting, sales prediction, and stock market predictions.
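As a minimal customer-segmentation sketch using k-means clustering (the feature names and data are invented for illustration):

```python
# Cluster customers by two invented features: annual spend and visit frequency.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(42)
spend = np.concatenate([rng.normal(200, 30, 50), rng.normal(900, 80, 50)])
visits = np.concatenate([rng.normal(2, 0.5, 50), rng.normal(10, 2, 50)])
X = StandardScaler().fit_transform(np.column_stack([spend, visits]))

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(kmeans.labels_[:5])        # segment assignment per customer
```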

What is a Kernel Function?
A kernel function is a function that implicitly maps data into a higher-dimensional space, allowing linear methods to be applied to non-linear problems. Kernel functions are a key component of many machine learning algorithms, including support vector machines (SVMs).
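A minimal sketch, assuming scikit-learn: an RBF-kernel SVM separates concentric circles that no linear classifier can split.

```python
# Non-linearly separable data: two concentric circles.
from sklearn.datasets import make_circles
from sklearn.svm import SVC

X, y = make_circles(n_samples=300, factor=0.3, noise=0.05, random_state=0)

linear_svm = SVC(kernel="linear").fit(X, y)
rbf_svm = SVC(kernel="rbf").fit(X, y)    # kernel trick: implicit mapping

print(linear_svm.score(X, y))            # near chance (~0.5)
print(rbf_svm.score(X, y))               # near 1.0
```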
What is a Confusion Matrix?
A confusion matrix is a matrix that summarizes the performance of a machine
learning model on a set of test data. It is a means of displaying the number of
accurate and inaccurate instances based on the model’s predictions. It is often
used to measure the performance of classification models, which aim to predict
a categorical label for each input instance.
The matrix breaks down the model's predictions on the test data into four counts:
• True Positive (TP): The model correctly predicted a positive outcome
(the actual outcome was positive).
• True Negative (TN): The model correctly predicted a negative
outcome (the actual outcome was negative).
• False Positive (FP): The model incorrectly predicted a positive
outcome (the actual outcome was negative). Also known as a Type I
error.
• False Negative (FN): The model incorrectly predicted a negative
outcome (the actual outcome was positive). Also known as a Type II
error.
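A minimal sketch with scikit-learn; the label arrays are made up for illustration:

```python
from sklearn.metrics import confusion_matrix

y_true = [1, 0, 1, 1, 0, 0, 1, 0]    # actual labels
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]    # model predictions

# For binary labels, rows are actual classes and columns are predictions,
# so ravel() returns the four counts in the order TN, FP, FN, TP.
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(tn, fp, fn, tp)                # 3 1 1 3
```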
Metrics based on Confusion Matrix Data
1. Accuracy
Accuracy is used to measure the performance of the model. It is the ratio of Total correct
instances to the total instances.
Accuracy = (TP + TN) / (TP + TN + FP + FN)

2. Precision
Precision is a measure of how accurate a model’s positive predictions are. It is
defined as the ratio of true positive predictions to the total number of positive
predictions made by the model.
Precision = TP / (TP + FP)

3. Recall
Recall measures the effectiveness of a classification model in identifying all
relevant instances from a dataset. It is the ratio of the number of true positive
(TP) instances to the sum of true positive and false negative (FN) instances.
Recall = TP / (TP + FN)
4. F1-Score
F1-score is used to evaluate the overall performance of a classification model.
It is the harmonic mean of precision and recall:
F1-Score = 2 * (Precision * Recall) / (Precision + Recall)

5. Specificity
Specificity is another important metric in the evaluation of classification
models, particularly in binary classification. It measures the ability of a model
to correctly identify negative instances. Specificity is also known as the True
Negative Rate. The formula is:
Specificity = TN / (TN + FP)
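Continuing the sketch above, the five metrics computed from those counts (plain Python, restated here so the snippet runs on its own):

```python
# Confusion-matrix counts from the earlier illustrative example.
tn, fp, fn, tp = 3, 1, 1, 3

accuracy = (tp + tn) / (tp + tn + fp + fn)           # 0.75
precision = tp / (tp + fp)                           # 0.75
recall = tp / (tp + fn)                              # 0.75
f1 = 2 * precision * recall / (precision + recall)   # 0.75
specificity = tn / (tn + fp)                         # 0.75
print(accuracy, precision, recall, f1, specificity)
```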

What are Ensemble Methods?

Ensemble methods are techniques that aim to improve the accuracy of results by combining multiple models instead of using a single model. Combining models in this way can increase the accuracy of the results significantly.

Types of Ensemble Methods:

Bagging, or bootstrap aggregation, is a machine learning technique that uses multiple models to improve the accuracy and stability of predictive models.

Here's how bagging works:


1. Random sampling
Randomly select data points with replacement from the training set to create multiple subsets, called bootstrap samples.
2. Train models
Train multiple base models, such as decision trees or neural networks, on each bootstrap sample.
3. Aggregate predictions
For regression tasks, average the predictions from all models. For classification tasks, use majority voting to select the class with the highest number of votes. A minimal sketch is shown below.
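A minimal bagging sketch, assuming a recent scikit-learn (1.2+, where the base-model argument is named `estimator`); the base estimator and parameters are illustrative choices:

```python
# Bagging: many trees trained on bootstrap samples, predictions aggregated.
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)
bag = BaggingClassifier(
    estimator=DecisionTreeClassifier(),   # base model
    n_estimators=50,                      # number of bootstrap samples
    bootstrap=True,                       # sample with replacement
    random_state=0,
).fit(X, y)
print(bag.score(X, y))
```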
Boosting is a machine learning technique that improves the accuracy of
predictive models by combining multiple weak learners into a single strong
learner.

Here's how boosting works:


• Train weak learners: Train multiple models sequentially, with each model
focusing on correcting the mistakes of the previous model.
• Iterative process: Repeat the training process until a stopping criterion is met (for example, a fixed number of weak learners or no further improvement in accuracy).
• Combine weak learners: Merge the weak rules into a single strong rule with each iteration (see the sketch below).
Stacking

Stacking, another ensemble method, is often referred to as stacked generalization. This technique works by training a meta-learner to combine the predictions of several other learning algorithms. Stacking has been successfully applied in regression, density estimation, distance learning, and classification. It can also be used to measure the error rate involved during bagging.
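A minimal stacking sketch with scikit-learn; the choice of base learners and final estimator is an illustrative assumption:

```python
# Stacking: base learners' predictions become features for a final model.
from sklearn.datasets import make_classification
from sklearn.ensemble import StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)
stack = StackingClassifier(
    estimators=[("tree", DecisionTreeClassifier()), ("svm", SVC())],
    final_estimator=LogisticRegression(),   # the meta-learner
).fit(X, y)
print(stack.score(X, y))
```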

Advantages of Kernel PCA

Some of the advantages of kernel PCA are:

• Higher-dimensional transformation – by mapping data into a higher-dimensional space, kernel PCA can create a more expressive representation, potentially leading to better separation of classes or clusters
• Nonlinear transformation – it has the ability to capture complex and nonlinear relationships
• Flexibility – by capturing nonlinear patterns, it's more flexible and adaptable to various data types. Thus, kernel PCA is used in many domains, including image recognition and speech processing
Advantages of Standard PCA

Some of the advantages of standard PCA are:

• Computational efficiency – standard PCA is computationally more efficient than kernel PCA, especially for high-dimensional datasets
• Interpretability – it's easier to understand and interpret the transformed data
• Linearity – excels in capturing linear patterns (a sketch contrasting the two follows)
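A minimal sketch contrasting the two, assuming scikit-learn (the dataset and the RBF gamma value are illustrative assumptions):

```python
# PCA vs Kernel PCA on concentric circles, which are not linearly separable.
from sklearn.datasets import make_circles
from sklearn.decomposition import PCA, KernelPCA

X, y = make_circles(n_samples=300, factor=0.3, noise=0.05, random_state=0)

X_pca = PCA(n_components=2).fit_transform(X)       # linear projection
X_kpca = KernelPCA(n_components=2, kernel="rbf",
                   gamma=10.0).fit_transform(X)    # nonlinear projection

# With an RBF kernel, the leading components tend to separate the two rings,
# which plain PCA (a rotation of the original space) cannot do.
print(X_kpca[y == 0, 0].mean(), X_kpca[y == 1, 0].mean())
```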
K-Means vs Kernel K-Means

The same linear-versus-nonlinear trade-off appears in clustering. Here's the comparison in tabular form:

| Feature | K-Means | Kernel K-Means |
| --- | --- | --- |
| Clustering Method | Divides data into clusters based on linear separation using centroids. | Maps data to a higher-dimensional space and clusters in that space using a kernel function. |
| Distance Measure | Euclidean distance in the original space. | Kernel-induced distance in the transformed feature space. |
| Data Assumption | Assumes clusters are linearly separable and convex. | Handles non-linearly separable and complex-shaped clusters. |
| Flexibility | Limited to simple, linearly separable clusters. | Works well with overlapping or non-linear clusters. |
| Kernel Usage | Not used. | Requires a kernel function (e.g., RBF, polynomial). |
| Efficiency | Computationally efficient and fast. | Computationally expensive due to kernel calculations. |
| Parameter Tuning | No kernel hyperparameters; straightforward. | Requires selecting and tuning the kernel and its parameters. |
| Convergence | Usually converges faster. | Slower convergence due to kernel computations. |
| Applications | Simple datasets with linear structure. | Complex datasets with non-linear structures or overlapping clusters. |
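Since scikit-learn has no built-in kernel k-means, here is a minimal NumPy sketch of the algorithm; the random initialization scheme and the RBF gamma value are illustrative assumptions:

```python
import numpy as np
from sklearn.datasets import make_circles
from sklearn.metrics.pairwise import rbf_kernel

def kernel_kmeans(K, k, n_iter=50, seed=0):
    """Cluster points given a precomputed kernel matrix K (n x n)."""
    rng = np.random.default_rng(seed)
    n = K.shape[0]
    labels = rng.integers(0, k, size=n)
    for _ in range(n_iter):
        dists = np.zeros((n, k))
        for c in range(k):
            mask = labels == c
            if not mask.any():               # re-seed an empty cluster
                labels[rng.integers(0, n)] = c
                mask = labels == c
            size = mask.sum()
            # Squared distance to the centroid of cluster c in feature space:
            # K_ii - (2/|c|) * sum_j K_ij + (1/|c|^2) * sum_{j,l} K_jl
            dists[:, c] = (np.diag(K)
                           - 2.0 * K[:, mask].sum(axis=1) / size
                           + K[np.ix_(mask, mask)].sum() / size ** 2)
        new_labels = dists.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels

# Two concentric rings: plain k-means fails, kernel k-means can separate them.
X, y = make_circles(n_samples=300, factor=0.3, noise=0.05, random_state=0)
labels = kernel_kmeans(rbf_kernel(X, gamma=10.0), k=2)
print(labels[:10])
```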
