Overfitting and Underfitting
Day 1
Topic:
Understanding Overfitting, Underfitting, and the Bias-Variance Trade-off
Objective:
Grasp the foundational concepts of overfitting and underfitting in machine
learning, along with how the bias-variance trade-off influences model
accuracy and generalization.
Activity/Assignment/Experiment/Practical:
Train a simple linear regression model on a dataset.
Gradually increase model complexity by adding more polynomial
terms (e.g., quadratic, cubic) and observe changes in performance.
Plot training and validation errors to visualize when the model starts
overfitting or underfitting.
Analyze how model complexity affects bias (systematic error) and
variance (sensitivity to fluctuations in the training data); a code
sketch of this experiment follows below.
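A minimal Python/scikit-learn sketch of the experiment, using a synthetic dataset with a noisy cubic ground truth (the data and all names here are illustrative, not the dataset from the activity):

import numpy as np
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

# Synthetic data with a noisy cubic relationship (illustrative assumption).
rng = np.random.RandomState(0)
X = np.sort(rng.uniform(-3, 3, 80)).reshape(-1, 1)
y = 0.5 * X.ravel() ** 3 - X.ravel() + rng.normal(0, 2, size=80)

X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.3, random_state=0)

degrees = range(1, 10)
train_err, val_err = [], []
for d in degrees:
    model = make_pipeline(PolynomialFeatures(degree=d), LinearRegression())
    model.fit(X_train, y_train)
    train_err.append(mean_squared_error(y_train, model.predict(X_train)))
    val_err.append(mean_squared_error(y_val, model.predict(X_val)))

# Low degrees underfit (both errors high: high bias); high degrees overfit
# (training error keeps dropping while validation error climbs: high variance).
plt.plot(degrees, train_err, marker="o", label="training MSE")
plt.plot(degrees, val_err, marker="o", label="validation MSE")
plt.xlabel("polynomial degree")
plt.ylabel("MSE")
plt.legend()
plt.show()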
Learning Outcomes:
Developed an understanding of overfitting (when a model performs
well on training data but poorly on unseen data) and underfitting
(when a model is too simple to capture the data's patterns).
Learned that the bias-variance trade-off is about finding the right
balance: too much bias can lead to underfitting, and too much
variance can lead to overfitting.
Challenges Faced:
Grasping the impact of bias and variance on model performance and
understanding when a model is underfitting or overfitting.
Skills Developed/Improved:
Basic understanding of evaluating model performance on training vs.
validation data.
Insights into model complexity and its role in bias and variance.
Day 2
Topic:
Introduction to Lasso (L1) and Ridge (L2) Regularization
Objective:
Learn how Lasso and Ridge regression help control overfitting by
penalizing large coefficients, which discourages overly complex models
and can suppress unnecessary features.
Activity/Assignment/Experiment/Practical:
Build a linear regression model without regularization and observe
its performance.
Implement Lasso (L1) and Ridge (L2) regression on the same dataset
and compare their performance.
Experiment with different alpha values to understand how they
control the strength of regularization:
o Lasso (L1): Adds a penalty proportional to the sum of the
absolute values of the coefficients, yielding sparse models in
which some feature weights are driven exactly to zero (built-in
feature selection).
o Ridge (L2): Adds a penalty proportional to the sum of the
squared coefficients, shrinking the influence of features without
eliminating them entirely. A code sketch of the comparison
follows below.
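A minimal sketch of the comparison, using scikit-learn's bundled diabetes dataset as a stand-in for whichever dataset the activity used; the alpha grid is illustrative:

import numpy as np
from sklearn.datasets import load_diabetes
from sklearn.linear_model import Lasso, LinearRegression, Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

X, y = load_diabetes(return_X_y=True)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

# Unregularized baseline for comparison.
baseline = LinearRegression().fit(X_train, y_train)
print("baseline val MSE:", mean_squared_error(y_val, baseline.predict(X_val)))

# Sweep alpha: larger values mean stronger regularization.
for alpha in [0.01, 0.1, 1.0, 10.0]:
    for name, model in [("Lasso", Lasso(alpha=alpha, max_iter=10000)),
                        ("Ridge", Ridge(alpha=alpha))]:
        model.fit(X_train, y_train)
        mse = mean_squared_error(y_val, model.predict(X_val))
        # Lasso drives some coefficients exactly to zero; Ridge only shrinks them.
        n_zero = int(np.sum(model.coef_ == 0))
        print(f"{name} alpha={alpha}: val MSE={mse:.1f}, zeroed coefficients={n_zero}")

Printing the count of zeroed coefficients makes Lasso's feature-selection effect visible next to Ridge's pure shrinkage.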
Learning Outcomes:
Understood how regularization penalizes large coefficients, helping
to mitigate overfitting by reducing model complexity.
Gained insight into when to use Lasso (for sparse models or feature
selection) and Ridge (for handling multicollinearity among features).
Challenges Faced:
Finding the optimal alpha values to achieve a balance between
regularization and model accuracy.
Skills Developed/Improved:
Practical understanding of Lasso and Ridge implementations.
Improved capability to control model complexity through
regularization.
Day 3
Topic:
Applying Regularization to Improve Model Generalization
Objective:
Gain hands-on experience in building a more robust regression model by
using Lasso and Ridge regularization to prevent overfitting.
Activity/Assignment/Experiment/Practical:
1. Baseline Model Creation:
o Start with a linear regression model without regularization and
evaluate its accuracy using metrics such as Mean Squared Error
(MSE) and R-squared.
2. Apply Regularization:
o Implement Lasso and Ridge regression models on the same
dataset, tuning hyperparameters (like alpha) for each
regularization technique.
o Observe changes in model accuracy and coefficients to see the
impact of regularization.
3. Performance Comparison:
o Compare the regularized models against the baseline,
observing improvements in validation performance and
reduced overfitting.
o Experiment with Grid Search or Random Search to optimize
hyperparameters for the best validation accuracy; a grid-search
sketch follows below.
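A sketch of the tuning step with scikit-learn's GridSearchCV; the diabetes dataset and the alpha grid are again placeholder assumptions:

from sklearn.datasets import load_diabetes
from sklearn.linear_model import Lasso, Ridge
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import GridSearchCV, train_test_split

X, y = load_diabetes(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

param_grid = {"alpha": [0.001, 0.01, 0.1, 1.0, 10.0, 100.0]}
for name, estimator in [("Lasso", Lasso(max_iter=10000)), ("Ridge", Ridge())]:
    # 5-fold cross-validated grid search over the regularization strength.
    search = GridSearchCV(estimator, param_grid, cv=5,
                          scoring="neg_mean_squared_error")
    search.fit(X_train, y_train)
    pred = search.predict(X_test)
    print(f"{name}: best alpha={search.best_params_['alpha']}, "
          f"test MSE={mean_squared_error(y_test, pred):.1f}, "
          f"R-squared={r2_score(y_test, pred):.3f}")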
Learning Outcomes:
Learned how Lasso and Ridge regularization can improve model
accuracy by controlling overfitting.
Gained insight into the process of hyperparameter tuning to optimize
regularization strength for each model.
Challenges Faced:
Balancing the regularization strength (alpha) to achieve good
generalization without sacrificing too much accuracy.
Skills Developed/Improved:
Hyperparameter tuning skills.
Hands-on experience with regularization for model optimization and
improved generalization.
Day 4
Topic:
Exploring Feature Engineering and Feature Selection Techniques
Objective:
Understand the importance of creating and selecting meaningful features to
enhance model performance.
Activity/Assignment/Experiment/Practical:
1. Feature Engineering:
o Experiment with creating interaction terms (e.g., product of
two features) and polynomial features to capture more complex
relationships.
o Engineer domain-specific features where applicable, such as
temporal features (e.g., month, season for time-series data) and
transformations (e.g., log, square root); a short sketch follows
below.
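A short pandas sketch of these transformations; the housing-style columns and values are hypothetical:

import numpy as np
import pandas as pd

# Hypothetical dataset; all column names and values are illustrative.
df = pd.DataFrame({
    "area": [50.0, 80.0, 120.0, 200.0],
    "rooms": [1, 2, 3, 5],
    "sale_date": pd.to_datetime(["2021-01-15", "2021-04-02",
                                 "2021-07-19", "2021-11-30"]),
})

# Interaction term: the product of two features.
df["area_x_rooms"] = df["area"] * df["rooms"]

# Polynomial term to capture a non-linear relationship.
df["area_squared"] = df["area"] ** 2

# Transformations that tame skewed distributions.
df["log_area"] = np.log1p(df["area"])
df["sqrt_area"] = np.sqrt(df["area"])

# Temporal features extracted from a date column.
df["month"] = df["sale_date"].dt.month
df["quarter"] = df["sale_date"].dt.quarter

print(df.head())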
Learning Outcomes:
Gained an understanding of how feature engineering improves a
model’s ability to capture relevant patterns in data.
Learned techniques to select impactful features and eliminate
redundant or noisy ones.
Challenges Faced:
Deciding which features to keep or discard, especially with
high-dimensional datasets.
Skills Developed/Improved:
Enhanced skills in feature engineering and selection.
Improved understanding of feature contributions to model
performance.
Day 5
Topic:
Importance of Data Transformation for Model Optimization
Objective:
Learn how data transformation techniques like normalization,
standardization, and encoding help improve model performance and
training consistency.
Activity/Assignment/Experiment/Practical:
1. Normalization and Standardization:
o Apply normalization (scaling values between 0 and 1) and
standardization (scaling to have a mean of 0 and standard
deviation of 1) to numerical data.
o Compare model training speed and accuracy with and without
scaling; a sketch of both scaling steps follows below.
2. Encoding Categorical Variables:
o Apply suitable encoding techniques to categorical features and
compare their compatibility with different model types.
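A minimal scikit-learn sketch of both scaling steps plus one encoding example; the toy arrays are illustrative, and the sparse_output argument of OneHotEncoder assumes scikit-learn 1.2 or newer:

import numpy as np
from sklearn.preprocessing import MinMaxScaler, OneHotEncoder, StandardScaler

X = np.array([[1.0, 200.0],
              [2.0, 300.0],
              [3.0, 400.0],
              [4.0, 1000.0]])

# Normalization: rescale each feature to the [0, 1] range.
print(MinMaxScaler().fit_transform(X))

# Standardization: rescale each feature to mean 0, standard deviation 1.
print(StandardScaler().fit_transform(X))

# One-hot encoding for a nominal categorical feature.
cities = np.array([["London"], ["Paris"], ["London"], ["Tokyo"]])
print(OneHotEncoder(sparse_output=False).fit_transform(cities))

In a real pipeline the scaler and encoder are fit on the training split only and then applied to the validation and test data, so no information leaks from the held-out sets.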
Learning Outcomes:
Understood the impact of scaling on model convergence and
performance.
Learned various encoding techniques and the importance of choosing
the right encoding based on variable type and model compatibility.
Challenges Faced:
Selecting the right transformation techniques for different data types and
managing categorical variables with numerous categories.
Skills Developed/Improved:
Practical knowledge of data preprocessing techniques.
Improved understanding of the role of data scaling and encoding in
optimizing model training.
Day 6
Topic:
Refining Model Input with Advanced Feature Selection Methods
Objective:
Learn advanced feature selection methods to optimize model performance
by focusing on impactful features.
Activity/Assignment/Experiment/Practical:
1. Variance Thresholding:
o Apply a variance threshold to remove low-variance features that
contribute minimal information (see the sketch below).
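A minimal sketch of variance thresholding with scikit-learn; the toy matrix and the 0.05 threshold are illustrative:

import numpy as np
from sklearn.feature_selection import VarianceThreshold

# The third column is nearly constant, so it carries almost no information.
X = np.array([[1.0, 10.0, 0.0],
              [2.0, 20.0, 0.0],
              [3.0, 30.0, 0.1],
              [4.0, 40.0, 0.0]])

selector = VarianceThreshold(threshold=0.05)
X_reduced = selector.fit_transform(X)
print(selector.variances_)     # per-feature variances
print(selector.get_support())  # boolean mask of the retained columns
print(X_reduced.shape)         # (4, 2): the low-variance column was dropped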
Learning Outcomes:
Learned how advanced feature selection can improve model
efficiency and accuracy by focusing only on the most relevant
features.
Gained insights into selecting features that best capture the
relationships in the data.
Challenges Faced:
Balancing feature selection to avoid underfitting while ensuring the model
remains interpretable.
Skills Developed/Improved:
Proficiency with advanced feature selection methods.
Enhanced ability to create streamlined models that maintain high
performance with fewer, more impactful features.