Regularization
The problem of overfitting
Machine Learning
Example: Linear regression (housing prices)
[Figure: three Price vs. Size plots, showing an underfit model, a good fit, and an overfit model]
Overfitting: if we have too many features, the learned hypothesis may fit the training set very well ($J(\theta) \approx 0$), but fail to generalize to new examples (e.g., fail to predict prices on new examples).
Example: Logistic regression
[Figure: three decision boundaries in the $(x_1, x_2)$ plane, from a simple underfit boundary to a highly contorted overfit one]

($g$ = sigmoid function)
Addressing overfitting:

Features for predicting housing prices:
- $x_1$ = size of house
- $x_2$ = no. of bedrooms
- $x_3$ = no. of floors
- $x_4$ = age of house
- $x_5$ = average income in neighborhood
- ...
- $x_{100}$ = kitchen size

With a large number of features like these and relatively few training examples, overfitting can occur even when each feature seems relevant.
Addressing overfitting:

Options:
1. Reduce the number of features.
   - Manually select which features to keep.
   - Model selection algorithm (later in course).
2. Regularization.
   - Keep all the features, but reduce the magnitude/values of the parameters $\theta_j$.
   - Works well when we have a lot of features, each of which contributes a bit to predicting $y$.
Regularization
Cost function
Machine Learning
Intuition

[Figure: two Price vs. Size-of-house plots, a quadratic fit and a wiggly higher-order polynomial fit]

$$h_\theta(x) = \theta_0 + \theta_1 x + \theta_2 x^2 + \theta_3 x^3 + \theta_4 x^4$$

Suppose we penalize $\theta_3$ and $\theta_4$ and make them really small, e.g. by minimizing

$$\frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2 + 1000\,\theta_3^2 + 1000\,\theta_4^2.$$

The only way to make this objective small is to drive $\theta_3 \approx 0$ and $\theta_4 \approx 0$, so the fitted curve is essentially quadratic.
Regularization.

Small values for parameters $\theta_1, \theta_2, \ldots, \theta_n$:
- "Simpler" hypothesis
- Less prone to overfitting

Housing:
- Features: $x_1, x_2, \ldots, x_{100}$
- Parameters: $\theta_0, \theta_1, \ldots, \theta_{100}$

Regularized cost function:

$$J(\theta) = \frac{1}{2m}\left[\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2 + \lambda\sum_{j=1}^{n}\theta_j^2\right]$$

Here $\lambda$ is the regularization parameter; by convention the penalty sum starts at $j = 1$, so the bias term $\theta_0$ is not penalized.
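As a concrete reference, here is a minimal Octave sketch of this regularized cost, assuming X is the m-by-(n+1) design matrix with a leading column of ones, y the targets, theta the parameter vector, and lambda the regularization parameter (the variable names are illustrative, not from the slides):

function J = regularizedCost(X, y, theta, lambda)
  m = length(y);
  h = X * theta;                              % linear hypothesis h_theta(x)
  penalty = lambda * sum(theta(2:end) .^ 2);  % skip theta(1), i.e. theta_0
  J = (sum((h - y) .^ 2) + penalty) / (2 * m);
end

Indexing from theta(2) mirrors the convention above: Octave is 1-indexed, so theta(1) holds $\theta_0$.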
Regularization.

[Figure: Price vs. Size of house, comparing an overfit curve with the smoother curve obtained after regularization]
In regularized linear regression, we choose $\theta$ to minimize

$$J(\theta) = \frac{1}{2m}\left[\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2 + \lambda\sum_{j=1}^{n}\theta_j^2\right]$$

What if $\lambda$ is set to an extremely large value (perhaps too large for our problem, say $\lambda = 10^{10}$)?
- Algorithm works fine; setting $\lambda$ to be very large can't hurt it.
- Algorithm fails to eliminate overfitting.
- Algorithm results in underfitting (fails to fit even the training data well).
- Gradient descent will fail to converge.
If $\lambda$ is set to an extremely large value, the penalty term dominates, driving $\theta_1, \theta_2, \ldots, \theta_n$ toward zero and leaving $h_\theta(x) \approx \theta_0$: the algorithm underfits, fitting a nearly flat line that fails to capture even the training data.

[Figure: Price vs. Size of house with a nearly horizontal fit, $h_\theta(x) \approx \theta_0$]
Regularization
Regularized linear regression
Machine Learning
Regularized linear regression

Gradient descent:

Repeat {
$$\theta_0 := \theta_0 - \alpha\,\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_0^{(i)}$$
$$\theta_j := \theta_j - \alpha\left[\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)} + \frac{\lambda}{m}\theta_j\right] \qquad (j = 1, 2, \ldots, n)$$
}

Equivalently,

$$\theta_j := \theta_j\left(1 - \alpha\frac{\lambda}{m}\right) - \alpha\,\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)},$$

where $1 - \alpha\frac{\lambda}{m}$ is slightly less than 1, so each update shrinks $\theta_j$ a little before taking the usual gradient step.
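A minimal Octave sketch of this loop, under the same assumed names as before (X with a leading column of ones, y, theta, learning rate alpha, lambda, and an iteration count, all illustrative):

function theta = gradientDescentReg(X, y, theta, alpha, lambda, num_iters)
  m = length(y);
  for iter = 1:num_iters
    h = X * theta;                                          % current predictions
    grad = (X' * (h - y)) / m;                              % unregularized gradient
    grad(2:end) = grad(2:end) + (lambda / m) * theta(2:end); % penalize theta_1..theta_n only
    theta = theta - alpha * grad;                           % simultaneous update of all theta_j
  end
end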
Normal equation:

$$\theta = \left(X^\top X + \lambda\begin{bmatrix}0 & & & \\ & 1 & & \\ & & \ddots & \\ & & & 1\end{bmatrix}\right)^{-1} X^\top y$$

where the matrix next to $\lambda$ is $(n+1)\times(n+1)$, with a 0 in the top-left entry so that $\theta_0$ is not penalized.
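In Octave this solve might look as follows, with X, y, and lambda assumed as before; the backslash operator is used instead of an explicit inverse for numerical stability:

function theta = normalEqnReg(X, y, lambda)
  n = size(X, 2) - 1;          % number of features (excluding the intercept column)
  M = eye(n + 1);
  M(1, 1) = 0;                 % zero in the top-left so theta_0 is not penalized
  theta = (X' * X + lambda * M) \ (X' * y);
end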
Non-invertibility (optional/advanced).

Suppose $m \le n$ (#examples $\le$ #features). Then in

$$\theta = \left(X^\top X\right)^{-1} X^\top y,$$

the matrix $X^\top X$ is non-invertible / singular.

If $\lambda > 0$, the regularized matrix $X^\top X + \lambda M$ (with $M$ as above) is guaranteed to be invertible, so regularization also takes care of the non-invertibility issue.
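A tiny illustrative check of this fact (the numbers are made up): with m = 2 examples and n = 3 features, X'X is singular, but adding the lambda term restores full rank:

X = [1 2 3 4;
     1 5 6 7];                % m = 2 rows, n + 1 = 4 columns (leading ones)
lambda = 1;
M = eye(4);  M(1, 1) = 0;
rank(X' * X)                  % prints 2: rank-deficient, so X'X is singular
rank(X' * X + lambda * M)     % prints 4: full rank, so the regularized matrix is invertible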
Regularization
Regularized logistic regression
Machine Learning
Regularized logistic regression.

[Figure: a highly contorted, overfit decision boundary in the $(x_1, x_2)$ plane]

Cost function:

$$J(\theta) = -\frac{1}{m}\sum_{i=1}^{m}\left[y^{(i)}\log h_\theta(x^{(i)}) + \left(1 - y^{(i)}\right)\log\left(1 - h_\theta(x^{(i)})\right)\right] + \frac{\lambda}{2m}\sum_{j=1}^{n}\theta_j^2$$

The added term $\frac{\lambda}{2m}\sum_{j=1}^{n}\theta_j^2$ penalizes large parameters and discourages an overly complex decision boundary.
Gradient descent:

Repeat {
$$\theta_0 := \theta_0 - \alpha\,\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_0^{(i)}$$
$$\theta_j := \theta_j - \alpha\left[\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)} + \frac{\lambda}{m}\theta_j\right] \qquad (j = 1, 2, \ldots, n)$$
}

This looks identical to the update for regularized linear regression, but here $h_\theta(x) = \frac{1}{1 + e^{-\theta^\top x}}$, so it is a different algorithm.
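Putting the cost and the gradient together, a minimal Octave sketch for regularized logistic regression might look like this (X with a leading column of ones, y in {0, 1}; the names are illustrative, not from the slides):

function [J, grad] = costFunctionReg(theta, X, y, lambda)
  m = length(y);
  h = 1 ./ (1 + exp(-X * theta));                          % sigmoid hypothesis
  J = (-y' * log(h) - (1 - y)' * log(1 - h)) / m ...
      + (lambda / (2 * m)) * sum(theta(2:end) .^ 2);
  grad = (X' * (h - y)) / m;                               % unregularized gradient
  grad(2:end) = grad(2:end) + (lambda / m) * theta(2:end); % skip theta_0
end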
Advanced optimization

function [jVal, gradient] = costFunction(theta)
  jVal = [code to compute J(theta)];
  gradient(1) = [code to compute ∂J(θ)/∂θ_0];
  gradient(2) = [code to compute ∂J(θ)/∂θ_1];
  gradient(3) = [code to compute ∂J(θ)/∂θ_2];
  ...
  gradient(n+1) = [code to compute ∂J(θ)/∂θ_n];
end
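To make the placeholder concrete: the costFunctionReg sketch above already returns [jVal, gradient] in exactly this shape, so it can be handed directly to Octave's fminunc. The options below (gradient flag, iteration cap, zero initialization) are illustrative choices, not prescribed by the slides:

options = optimset('GradObj', 'on', 'MaxIter', 400);   % tell fminunc we supply the gradient
initialTheta = zeros(size(X, 2), 1);                   % one parameter per column of X
[optTheta, functionVal, exitFlag] = ...
    fminunc(@(t) costFunctionReg(t, X, y, lambda), initialTheta, options);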