Pa 2 Unit
1. Bias
Definition: Bias is the error that occurs when a model is too
simple and cannot capture the underlying pattern in the data.
Bias arises when the model makes assumptions about the data
that are too strong, leading to underfitting.
Characteristics:
o Predictions are far off from the actual values.
o The model has a high systematic error.
Example: Using a straight line (linear model) to fit data that
follows a curve. The model is too simple and doesn’t capture
the curve.
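The example above can be sketched in a few lines of Python (illustrative, not from the notes): a least-squares straight line is fitted to data that follows a parabola, and the residuals show the large, systematic error that defines high bias.

```python
# High-bias sketch: fit y = a*x + b to data that actually follows y = x^2.

def fit_line(xs, ys):
    """Closed-form least-squares fit of a straight line y = a*x + b."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / \
        sum((x - mx) ** 2 for x in xs)
    b = my - a * mx
    return a, b

xs = [-2, -1, 0, 1, 2]
ys = [x ** 2 for x in xs]          # the data follows a curve, not a line

a, b = fit_line(xs, ys)
residuals = [y - (a * x + b) for x, y in zip(xs, ys)]
print(a, b)        # best line is flat: a = 0, b = 2 (the mean of y)
print(residuals)   # [2, -1, -2, -1, 2]: large, systematic errors
```

No matter how much data we add from the same parabola, this model class can never drive the error to zero: that leftover systematic error is the bias.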
2. Variance
Definition: Variance is the error that occurs when a model is
too complex and learns not only the patterns but also the noise
in the training data.
Variance arises when the model is too flexible and overfits the
training data, performing poorly on unseen data.
Characteristics:
o Predictions vary a lot when the model is trained on
different datasets.
o The model generalizes poorly to new data.
Example: Using a very wiggly curve (overfitting) to fit a small
dataset with noise. The model learns the noise, not the true
trend.
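A small sketch of the "wiggly curve" idea (illustrative, not from the notes): an interpolating polynomial, the most flexible model possible for five points, reproduces every noisy training point exactly, so its shape changes whenever the noise changes.

```python
import random

def lagrange_predict(xs, ys, x):
    """Evaluate the degree-(n-1) polynomial that passes through all points."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)  # Lagrange basis factor
        total += term
    return total

random.seed(0)
xs = [0, 1, 2, 3, 4]
true_ys = [x for x in xs]                       # true trend: y = x
# two noisy training sets drawn from the same underlying trend
ys1 = [y + random.gauss(0, 0.5) for y in true_ys]
ys2 = [y + random.gauss(0, 0.5) for y in true_ys]

# the interpolant fits every training point exactly, noise included,
# so predictions at a new x depend heavily on which noisy sample we saw
p1 = lagrange_predict(xs, ys1, 2.5)
p2 = lagrange_predict(xs, ys2, 2.5)
print(p1, p2)
```

Because each model memorizes its own noise, the two predictions at x = 2.5 disagree even though both training sets came from the same trend; that sensitivity to the particular sample is the variance.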
3. Model Complexity
Definition: Model complexity refers to how sophisticated or
flexible a model is.
Simple models (low complexity), such as linear regression, have little flexibility and make strong assumptions about the data.
Complex models (high complexity), such as high-degree polynomials or deep neural networks, are very flexible and can fit almost any data.
Impact:
o Simple models: High bias, low variance (underfitting).
o Complex models: Low bias, high variance (overfitting).
4. Bias-Variance Trade-off
Definition: The bias-variance trade-off describes the balance
between bias and variance to achieve the best model
performance.
The goal is to minimize the total error, which is the sum of:
o Bias²: Error from underfitting.
o Variance: Error from overfitting.
o Irreducible Error: Noise in the data that cannot be
eliminated.
Concept:
o Increasing model complexity reduces bias (model fits the
data better).
o However, increasing complexity also increases variance
(model learns noise).
o A good model strikes a balance where the combined error from bias and variance is as low as possible.
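The decomposition Total Error = Bias² + Variance + Irreducible Error can be checked by simulation. A minimal sketch (the estimators and numbers are illustrative choices, not from the notes): estimating a true mean of 2.0, the plain sample mean is nearly unbiased but has higher variance, while a "shrunken" mean (half the sample mean) trades extra bias for lower variance. With a noise-free target, MSE = Bias² + Variance exactly.

```python
import random
random.seed(1)

def simulate(estimator, true_mean=2.0, sigma=1.0, n=10, trials=5000):
    """Monte Carlo estimate of bias^2, variance, and MSE for an estimator."""
    estimates = []
    for _ in range(trials):
        sample = [random.gauss(true_mean, sigma) for _ in range(n)]
        estimates.append(estimator(sample))
    avg = sum(estimates) / trials
    bias_sq = (avg - true_mean) ** 2
    var = sum((e - avg) ** 2 for e in estimates) / trials
    mse = sum((e - true_mean) ** 2 for e in estimates) / trials
    return bias_sq, var, mse

sample_mean = lambda s: sum(s) / len(s)
shrunk_mean = lambda s: 0.5 * sum(s) / len(s)   # biased, but lower variance

for est in (sample_mean, shrunk_mean):
    b2, v, mse = simulate(est)
    # the decomposition holds: mse == b2 + v (no irreducible noise here,
    # because we compare against the exact true mean)
    print(b2, v, mse)
```

The sample mean gives bias² near 0; the shrunken mean gives bias² near 1 but roughly a quarter of the variance, which is exactly the trade the notes describe.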
Example of Bias-Variance Trade-off
High Bias Example:
o A simple linear model trying to fit a nonlinear dataset.
o It underfits the data, resulting in high error (bias).
High Variance Example:
o A very complex model (e.g., high-degree polynomial) that
fits every data point, including noise.
o It overfits the data, performing poorly on new data.
Balanced Example:
o A moderately complex model that captures the main
patterns without overfitting.
1. Bayesian Approach
Definition: The Bayesian approach is a method in statistics that
combines prior knowledge (what we believe before seeing
data) with evidence from the data to make predictions or
decisions.
How it works:
1. Start with a prior belief (what you already know or
assume).
2. Collect data (new evidence).
3. Update the belief using Bayes’ Theorem to get a posterior
belief (more accurate knowledge).
Example:
o Before checking the weather, you believe there’s a 50%
chance of rain (prior belief).
o After seeing dark clouds (new evidence), you update your
belief to a higher chance of rain (posterior belief).
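The rain example can be worked through with Bayes' Theorem directly. The two likelihoods below, P(clouds | rain) = 0.8 and P(clouds | no rain) = 0.2, are made-up numbers for illustration; only the 50% prior comes from the notes.

```python
def bayes_update(prior, likelihood, likelihood_alt):
    """Posterior = P(evidence|H) * P(H) / P(evidence), via Bayes' Theorem."""
    evidence = likelihood * prior + likelihood_alt * (1 - prior)
    return likelihood * prior / evidence

prior_rain = 0.5                      # prior belief: 50% chance of rain
posterior = bayes_update(prior_rain,
                         likelihood=0.8,      # assumed P(clouds | rain)
                         likelihood_alt=0.2)  # assumed P(clouds | no rain)
print(posterior)  # 0.8: seeing dark clouds raises the belief to 80%
```

Seeing evidence that is four times more likely under rain than under no rain moves the belief from 50% up to 80%.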
2. Cross-Validation
Definition: Cross-validation is a method used to check how well
a model will perform on new, unseen data.
How it works:
1. Split your data into k parts (called "folds").
2. Train the model on k-1 parts and test it on the remaining
part.
3. Repeat this process k times so each part is tested once.
4. Average the results to estimate the model's performance.
Why it’s useful: It helps avoid overfitting and gives a better idea
of how the model performs on different data.
Example: If you split data into 5 parts, you train on 4 parts and
test on the 5th, repeating this 5 times.
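The four steps above can be sketched from scratch (the toy dataset and the mean-predictor "model" are illustrative stand-ins):

```python
def k_fold_splits(data, k):
    """Yield (train, test) pairs so that every point is tested exactly once."""
    folds = [data[i::k] for i in range(k)]   # deal points into k folds
    for i in range(k):
        test = folds[i]
        train = [x for j, fold in enumerate(folds) if j != i for x in fold]
        yield train, test

# toy task: "model" just predicts the mean target of its training fold
data = [(x, 2 * x) for x in range(10)]
errors = []
for train, test in k_fold_splits(data, k=5):
    mean_y = sum(y for _, y in train) / len(train)       # train on k-1 folds
    errors.extend(abs(y - mean_y) for _, y in test)      # test on the rest
cv_error = sum(errors) / len(errors)                     # average the results
print(cv_error)
```

With k = 5 and 10 points, each round trains on 8 points and tests on 2, and after 5 rounds every point has been in the test set exactly once, matching the 5-fold example in the notes.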
3. Bootstrap Methods
Definition: Bootstrap is a method that creates many small
datasets by randomly selecting data points (with replacement)
from the original dataset. It is used to estimate the accuracy of
a model or a statistic.
How it works:
1. Randomly select data points from your dataset with
replacement (some points can appear more than once).
2. Create many "new datasets" (bootstrap samples).
3. Train models or calculate statistics on each sample.
4. Combine the results to estimate performance or
variability.
Why it’s useful: It helps understand the uncertainty or
variability in predictions.
Example: If you have 100 data points, you create 100 new
datasets by sampling from the original data (some data points
are repeated).
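The bootstrap steps can be sketched as follows (illustrative: the statistic here is the sample mean, and the data is a made-up Gaussian sample):

```python
import random
random.seed(2)

def bootstrap_means(data, n_samples=1000):
    """Resample with replacement and record each bootstrap sample's mean."""
    means = []
    for _ in range(n_samples):
        # same size as the original, with replacement: points can repeat
        sample = [random.choice(data) for _ in data]
        means.append(sum(sample) / len(sample))
    return means

data = [random.gauss(10, 2) for _ in range(100)]   # 100 original data points
means = bootstrap_means(data)

# the spread of the bootstrap means estimates the statistic's variability
center = sum(means) / len(means)
spread = (sum((m - center) ** 2 for m in means) / len(means)) ** 0.5
print(center, spread)
```

The spread of the bootstrap means approximates the standard error of the mean (about sigma / sqrt(n), roughly 0.2 here), giving the uncertainty estimate without any extra data collection.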