Bias and Variance in Machine Learning

Machine learning models contain two types of errors - bias and variance. Bias results from a model's inability to capture the true patterns in the data and variance measures how much a model's predictions change with different training data. High bias means a model is underfitting while high variance means a model is overfitting. The bias-variance tradeoff aims to balance these errors to produce an optimal model that generalizes well without under or overfitting.

Uploaded by

SHIKHA SHARMA


Bias and Variance in Machine Learning

Machine learning is a branch of Artificial Intelligence that allows machines to analyse data
and make predictions. If a machine learning model is not accurate, it makes prediction errors,
and the reducible part of these errors is usually described in terms of bias and variance. Some
error is always present, because there is always a slight difference between the model's
predictions and the actual values. The main aim of ML/data science analysts is to reduce these
errors in order to get more accurate results. In this topic, we discuss bias and variance, the
bias-variance trade-off, underfitting and overfitting. But before starting, let's first
understand what errors in machine learning are.

Errors in Machine Learning?

In machine learning, an error is a measure of how far an algorithm's predictions are from the
actual values on a previously unseen dataset. On the basis of these errors, the model that
performs best on the particular dataset is selected. There are mainly two types of errors in
machine learning:
o Reducible errors: These errors can be reduced to improve the model accuracy. Such errors can further be
classified into bias and variance.

o Irreducible errors: These errors will always be present in the model regardless of which algorithm has
been used. They are caused by unknown variables (noise) whose influence cannot be removed.
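
The split into reducible and irreducible error can be written out for squared-error loss. The following is the standard decomposition, assuming the data follow y = f(x) + ε with zero-mean noise of variance σ², and taking the expectation over training sets and noise. The symbols f, \hat f, ε and σ are notation introduced here for illustration; they do not appear in the text above.

```latex
\mathbb{E}\big[(y - \hat f(x))^2\big]
  = \underbrace{\big(\mathbb{E}[\hat f(x)] - f(x)\big)^2}_{\text{bias}^2}
  + \underbrace{\mathbb{E}\big[\big(\hat f(x) - \mathbb{E}[\hat f(x)]\big)^2\big]}_{\text{variance}}
  + \underbrace{\sigma^2}_{\text{irreducible error}}
```

The first two terms depend on the learning algorithm and can be reduced; the last term depends only on the noise in the data.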

What is Bias?
In general, a machine learning model analyses the data, finds patterns in it and makes predictions.
While training, the model learns these patterns in the dataset and applies them to test data for
prediction. While making predictions, a difference occurs between the values predicted by the
model and the actual (expected) values. The systematic part of this difference, the part that
remains on average however the training data is sampled, is known as bias error or error due to
bias. It can be defined as the inability of a machine learning algorithm such as Linear
Regression to capture the true relationship between the data points. Each algorithm begins with
some amount of bias, because bias arises from the assumptions in the model that make the target
function simpler to learn. A model has either:

o Low Bias: A low bias model will make fewer assumptions about the form of the target function.
o High Bias: A model with a high bias makes more assumptions, and the model becomes unable to capture
the important features of our dataset. A high bias model also cannot perform well on new data.

Generally, a linear algorithm has a high bias; its strong assumptions are also what make it fast
to learn. The simpler the algorithm, the higher the bias it is likely to introduce, whereas a
nonlinear algorithm often has low bias.
Some examples of machine learning algorithms with low bias are Decision Trees, k-Nearest
Neighbours and Support Vector Machines. Examples of algorithms with high bias are Linear
Regression, Linear Discriminant Analysis and Logistic Regression.

Ways to reduce High Bias:

High bias mainly occurs when the model is too simple. Below are some ways to reduce high bias:

o Increase the number of input features, as the model is underfitted.
o Decrease the regularization term.
o Use a more complex model, for example by including some polynomial features.
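
The first remedy in the list above can be illustrated with a minimal sketch. It assumes synthetic data generated from a hypothetical linear relationship (y = 3x plus noise, chosen only for this example) and compares a model with no input features, which always predicts the mean, against a model that uses the single feature x. The function names are illustrative and do not come from any library.

```python
import random

def fit_constant(xs, ys):
    """Zero-feature model: always predict the mean of the training targets."""
    mean_y = sum(ys) / len(ys)
    return lambda x: mean_y

def fit_line(xs, ys):
    """One-feature model: ordinary least squares fit of y = a*x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = sxy / sxx
    b = mean_y - a * mean_x
    return lambda x: a * x + b

def mse(model, xs, ys):
    """Mean squared error of a fitted model on the given points."""
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Synthetic data: the slope 3.0 and noise level 0.1 are assumptions for the demo.
rng = random.Random(0)
xs = [i / 49 for i in range(50)]
ys = [3.0 * x + rng.gauss(0.0, 0.1) for x in xs]

constant_error = mse(fit_constant(xs, ys), xs, ys)
line_error = mse(fit_line(xs, ys), xs, ys)
print(f"constant model training MSE: {constant_error:.4f}")
print(f"linear model training MSE:   {line_error:.4f}")
```

The constant model cannot represent the trend at all, so its training error stays high however much data it sees; giving the model the feature x removes most of that error.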

What is a Variance Error?

Variance specifies how much the prediction would change if different training data were used.
In simple words, variance tells us how much a random variable differs from its expected value;
here the random variable is the model's prediction, which depends on the training sample.
Ideally, a model should not vary too much from one training dataset to another, which means
the algorithm should be good at capturing the hidden mapping between input and output
variables. A model has either low variance or high variance.

Low variance means there is a small variation in the prediction of the target function with
changes in the training data set. At the same time, High variance shows a large variation in the
prediction of the target function with changes in the training dataset.
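
The definition above can be turned into a small simulation: draw many training sets, fit a model on each, and measure how much the prediction at one fixed input changes. The sketch below assumes a hypothetical ground-truth function and noise level, and compares a mean predictor (rigid) with a 1-nearest-neighbour predictor (flexible). All names are illustrative.

```python
import random
import statistics

def true_function(x):
    # Hypothetical ground truth used only for this simulation.
    return 2.0 * x

def sample_training_set(rng, n=30, noise=1.0):
    """Draw n inputs uniformly from [0, 1) with noisy targets."""
    xs = [rng.random() for _ in range(n)]
    ys = [true_function(x) + rng.gauss(0.0, noise) for x in xs]
    return xs, ys

def predict_mean(xs, ys, x0):
    """Rigid model: ignore x0 and predict the mean training target."""
    return sum(ys) / len(ys)

def predict_1nn(xs, ys, x0):
    """Flexible model: copy the target of the closest training input."""
    nearest = min(range(len(xs)), key=lambda i: abs(xs[i] - x0))
    return ys[nearest]

def prediction_variance(predict, x0=0.5, repeats=500, seed=0):
    """Variance of the prediction at x0 across independently drawn training sets."""
    rng = random.Random(seed)
    preds = []
    for _ in range(repeats):
        xs, ys = sample_training_set(rng)
        preds.append(predict(xs, ys, x0))
    return statistics.pvariance(preds)

var_mean = prediction_variance(predict_mean)
var_1nn = prediction_variance(predict_1nn)
print(f"variance of mean predictor: {var_mean:.3f}")
print(f"variance of 1-NN predictor: {var_1nn:.3f}")
```

The 1-nearest-neighbour prediction inherits the full noise of a single training point, so it changes far more from one training set to the next than the averaged prediction does.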

A model that shows high variance fits the training dataset very closely and performs well on
it, but does not generalize well to unseen data. As a result, such a model gives good results
on the training dataset but shows high error rates on the test dataset.

Since a high variance model learns too much from the training dataset, including its noise, it
overfits. A model with high variance has the following problems:

o It leads to overfitting.

o It increases model complexity.

Usually, nonlinear algorithms, which have a lot of flexibility to fit the data, have high variance.
Some examples of machine learning algorithms with low variance are Linear Regression,
Logistic Regression and Linear Discriminant Analysis. Examples of algorithms with
high variance are Decision Trees, Support Vector Machines and k-Nearest Neighbours.

Ways to Reduce High Variance:

o Reduce the input features or the number of parameters, as the model is overfitted.
o Do not use an overly complex model.
o Increase the training data.
o Increase the regularization term.
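
The effect of the regularization term can be sketched with a one-parameter ridge regression. The example assumes a hypothetical model y = w*x with true slope 2.0, fixed inputs, and fresh noise on every run; the penalty strength `lam` and all function names are illustrative choices, not taken from the text.

```python
import random
import statistics

def fit_slope(xs, ys, lam):
    """Ridge estimate for y = w*x (no intercept): w = sum(x*y) / (sum(x*x) + lam)."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + lam)

def slope_statistics(lam, true_w=2.0, noise=1.0, repeats=300, seed=0):
    """Mean and variance of the fitted slope across noisy training sets."""
    rng = random.Random(seed)
    xs = [i / 9 for i in range(10)]  # fixed inputs; only the noise changes
    slopes = []
    for _ in range(repeats):
        ys = [true_w * x + rng.gauss(0.0, noise) for x in xs]
        slopes.append(fit_slope(xs, ys, lam))
    return statistics.mean(slopes), statistics.pvariance(slopes)

mean_ols, var_ols = slope_statistics(lam=0.0)    # no regularization
mean_ridge, var_ridge = slope_statistics(lam=5.0)  # strong regularization
print(f"lam=0.0  mean slope={mean_ols:.3f}  variance={var_ols:.3f}")
print(f"lam=5.0  mean slope={mean_ridge:.3f}  variance={var_ridge:.3f}")
```

With the penalty switched on, the fitted slope varies less between training sets, but its average moves away from the true value of 2.0: variance goes down at the cost of some bias.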

Different Combinations of Bias-Variance

There are four possible combinations of bias and variance, which are represented by the
diagram below:
1. Low-Bias, Low-Variance: The combination of low bias and low variance describes an ideal
machine learning model. However, it is rarely achievable in practice.
2. Low-Bias, High-Variance: With low bias and high variance, model predictions are
inconsistent but accurate on average. This case occurs when the model learns a large
number of parameters, and it leads to overfitting.
3. High-Bias, Low-Variance: With high bias and low variance, predictions are consistent
but inaccurate on average. This case occurs when a model does not learn well from the
training dataset or uses too few parameters. It leads to underfitting.
4. High-Bias, High-Variance: With high bias and high variance, predictions are inconsistent
and also inaccurate on average.

How to identify High variance or High Bias?

High variance can be identified if the model has:

o Low training error and high test error.

High bias can be identified if the model has:

o High training error, with a test error close to the training error.
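
These two checks can be written as a small rule-of-thumb helper. This is only a sketch: the function name and the thresholds `high_error` and `gap` are hypothetical values chosen for illustration, and sensible values depend on the task and the error metric.

```python
def diagnose(train_error, test_error, high_error=0.2, gap=0.1):
    """Rule-of-thumb diagnosis from training and test error.

    The thresholds are illustrative assumptions, not values from the text.
    """
    high_train = train_error > high_error
    large_gap = (test_error - train_error) > gap
    if high_train and large_gap:
        return "high bias and high variance"
    if high_train:
        return "high bias (underfitting)"
    if large_gap:
        return "high variance (overfitting)"
    return "no obvious bias or variance problem"

print(diagnose(0.02, 0.30))  # low training error, much higher test error
print(diagnose(0.35, 0.37))  # high training error, similar test error
```

The first call matches the high variance pattern and the second matches the high bias pattern described above.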

Bias-Variance Trade-Off
While building a machine learning model, it is important to take care of bias and variance in
order to avoid overfitting and underfitting. If the model is very simple, with few parameters,
it may have low variance and high bias. Whereas, if the model has a large number of parameters,
it tends to have high variance and low bias. So a balance must be struck between bias and
variance errors, and this balance is known as the bias-variance trade-off.
For accurate predictions, a model needs both low variance and low bias. In practice the two
usually cannot be minimised at the same time, because bias and variance are related to each
other:

o Decreasing the variance typically increases the bias.

o Decreasing the bias typically increases the variance.

The bias-variance trade-off is a central issue in supervised learning. Ideally, we need a model
that accurately captures the regularities in the training data and simultaneously generalizes
well to unseen data. Unfortunately, it is usually not possible to do both perfectly: a high
variance algorithm may perform well on the training data but overfit its noise, whereas a high
bias algorithm generates an overly simple model that may not even capture important
regularities in the data. So, we need to find a sweet spot between bias and variance to make
an optimal model.

Hence, the bias-variance trade-off is about finding the sweet spot that balances bias and
variance errors.
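
The search for a sweet spot can be sketched by sweeping a complexity setting and estimating squared bias and variance at each value. The example below assumes a hypothetical sine-shaped target, fixed inputs, and a k-nearest-neighbour regressor in which a small k means a flexible model and a large k a rigid one. The test point, noise level and values of k are illustrative choices.

```python
import math
import random
import statistics

def true_function(x):
    # Hypothetical ground truth for the simulation.
    return math.sin(2.0 * math.pi * x)

def knn_predict(xs, ys, x0, k):
    """Average the targets of the k training points closest to x0."""
    nearest = sorted(range(len(xs)), key=lambda i: abs(xs[i] - x0))[:k]
    return sum(ys[i] for i in nearest) / k

def bias_variance_at(x0, k, n=40, noise=0.5, repeats=300, seed=0):
    """Estimate squared bias and variance of the k-NN prediction at x0."""
    rng = random.Random(seed)
    xs = [i / (n - 1) for i in range(n)]  # fixed inputs, fresh noise per run
    preds = []
    for _ in range(repeats):
        ys = [true_function(x) + rng.gauss(0.0, noise) for x in xs]
        preds.append(knn_predict(xs, ys, x0, k))
    bias_sq = (statistics.mean(preds) - true_function(x0)) ** 2
    variance = statistics.pvariance(preds)
    return bias_sq, variance

results = {k: bias_variance_at(x0=0.25, k=k) for k in (1, 5, 15, 40)}
for k, (bias_sq, variance) in results.items():
    total = bias_sq + variance
    print(f"k={k:2d}  bias^2={bias_sq:.3f}  variance={variance:.3f}  sum={total:.3f}")

# The sweet spot is the setting with the smallest sum of the two terms.
best_k = min(results, key=lambda k: sum(results[k]))
print("best k:", best_k)
```

At k=1 the variance term dominates, at k=40 (every point averaged) the bias term dominates, and the smallest combined error falls at an intermediate k.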
