
Reading Material for AML

1. Bias in a Machine Learning Model


Definition: Bias refers to the error introduced when a model makes overly
simplistic assumptions about the data. It prevents the model from capturing
the true relationships within the data.
Impact: High bias often leads to underfitting, where the model performs
poorly on both training and test data.
Example: Using a straight line to fit a complex curve.
Key Point: Bias measures how far off the model’s predictions are from the
actual data.

2. High Variance in a Model


Definition: Variance measures the sensitivity of a model to changes in the
training data. High variance often leads to overfitting, where the model
performs well on training data but poorly on new, unseen data.
Impact: Models with high variance are too complex and memorize training
data instead of generalizing.
Key Point: High variance indicates overfitting.

3. High Bias and Low Variance


Scenario: When a model is too simple (high bias) and has low variance, it
fails to learn the underlying patterns in the data.
Outcome: This situation is referred to as underfitting.
Key Point: Balance between bias and variance is necessary for good model
performance.

4. Bias-Variance Tradeoff
Goal: Achieve a balance between bias and variance to minimize total error.
Key Concept: A good model has neither too much bias (underfitting) nor
too much variance (overfitting).
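A standard way to state the tradeoff is the error decomposition: Expected Test Error = Bias² + Variance + Irreducible Error. Reducing one term typically increases the other, so the goal is to minimize their sum.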

5. Reducing Variance
Methods:
• Simplifying the model (e.g., using fewer features or parameters).
• Collecting more training data.
• Using techniques like regularization.
Key Point: Increasing the dataset size can help reduce variance.

6. Simple Linear Regression Formula


Formula: y = mx + c
• y: Target variable.
• x: Predictor variable.
• m: Slope of the line.
• c: Intercept.
Purpose: Predicts the relationship between one independent variable and a
dependent variable.
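A minimal scikit-learn sketch of estimating m and c, using made-up data points:

import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical data roughly following y = 2x + 1
X = np.array([[1], [2], [3], [4], [5]])
y = np.array([3.1, 4.9, 7.2, 9.0, 11.1])

model = LinearRegression().fit(X, y)
print("m (slope):", model.coef_[0])
print("c (intercept):", model.intercept_)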

7. R-Squared Value in Linear Regression


Definition: The R-squared value represents the proportion of variance in
the dependent variable that is explained by the independent variables.
Range: Between 0 and 1. A higher R-squared value indicates a better fit.
Key Point: Measures how well the model fits the data.
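In formula form, R² = 1 − (SS_res / SS_tot), where SS_res is the sum of squared residuals and SS_tot is the total sum of squares. A minimal sketch with made-up values:

from sklearn.metrics import r2_score

# Hypothetical actual vs. predicted values
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.8, 5.4, 6.9, 9.2]
print(r2_score(y_true, y_pred))  # closer to 1 means a better fit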

8. Assumptions in Linear Regression


Linear regression requires:
• A linear relationship between variables.
• Independence of residuals.
• Homoscedasticity (constant variance of residuals).
Not Required: Predictors don’t need to follow a normal distribution.

9. Multicollinearity in Linear Regression
Definition: Multicollinearity occurs when independent variables are highly
correlated with each other.
Impact: It can make the coefficient estimates unstable and unreliable.
Key Point: Address multicollinearity using techniques like ridge regression.
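One common diagnostic is the variance inflation factor (VIF). A minimal sketch with statsmodels, using made-up features that are deliberately collinear:

import numpy as np
from statsmodels.stats.outliers_influence import variance_inflation_factor

# Hypothetical features: x2 is almost a copy of x1, so VIFs explode
rng = np.random.default_rng(0)
x1 = rng.normal(size=100)
x2 = x1 + rng.normal(scale=0.01, size=100)
X = np.column_stack([x1, x2])

# Values far above 10 are usually read as a sign of multicollinearity
for i in range(X.shape[1]):
    print(f"VIF for feature {i}:", variance_inflation_factor(X, i))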

10. Evaluating Regression Accuracy


Metric: Mean Squared Error (MSE).

• Measures the average squared difference between actual and predicted values.
• Lower MSE indicates better model performance.
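In formula form, MSE = (1/n) Σ (yᵢ − ŷᵢ)². A minimal sketch with made-up values:

from sklearn.metrics import mean_squared_error

# Hypothetical actual vs. predicted values
y_true = [3.0, 5.0, 7.0]
y_pred = [2.8, 5.4, 6.9]
print(mean_squared_error(y_true, y_pred))  # lower is better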

11. Ridge Regression


Penalty Term: Adds λ Σ w² to the loss function, where w are the model’s coefficients.
Use Case: Addresses multicollinearity by shrinking coefficients.
Regularization Type: L2 regularization.
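A minimal scikit-learn sketch on made-up collinear features; the alpha parameter plays the role of λ:

import numpy as np
from sklearn.linear_model import Ridge

# Hypothetical data: two nearly identical (collinear) features
rng = np.random.default_rng(1)
x1 = rng.normal(size=50)
X = np.column_stack([x1, x1 + rng.normal(scale=0.05, size=50)])
y = 3 * x1 + rng.normal(scale=0.1, size=50)

ridge = Ridge(alpha=1.0).fit(X, y)  # larger alpha shrinks coefficients more
print(ridge.coef_)  # shrunken but nonzero coefficients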

12. Lasso Regression


Penalty Term: Adds λ Σ |w| to the loss function.
Key Feature: Performs feature selection by shrinking some coefficients to
zero.
Use Case: Preferred when feature selection is required.
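A minimal sketch with made-up data where only the first of five features matters:

import numpy as np
from sklearn.linear_model import Lasso

# Hypothetical data: y depends only on the first feature
rng = np.random.default_rng(2)
X = rng.normal(size=(100, 5))
y = 4 * X[:, 0] + rng.normal(scale=0.1, size=100)

lasso = Lasso(alpha=0.1).fit(X, y)
print(lasso.coef_)  # coefficients of irrelevant features are driven to zero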

13. Elastic Net


Definition: Combines L1 (lasso) and L2 (ridge) regularization, balancing
feature selection and regularization.
Advantage: Effective when features are highly correlated.
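A minimal scikit-learn sketch on made-up correlated features; l1_ratio sets the mix between the L1 and L2 penalties, and alpha sets the overall strength:

import numpy as np
from sklearn.linear_model import ElasticNet

# Hypothetical data with two correlated features and one noise feature
rng = np.random.default_rng(3)
x1 = rng.normal(size=100)
X = np.column_stack([x1, x1 + rng.normal(scale=0.05, size=100),
                     rng.normal(size=100)])
y = 2 * x1 + rng.normal(scale=0.1, size=100)

# l1_ratio=0.5 weights the L1 and L2 penalties equally
enet = ElasticNet(alpha=0.1, l1_ratio=0.5).fit(X, y)
print(enet.coef_)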

14. Principal Component Analysis (PCA)
Purpose: Reduces the number of features (dimensionality reduction) while
retaining the most variance in the data.
Steps:
• Scale the features (essential).

• Calculate the principal components.

• Retain components based on explained variance.
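A minimal sketch following those three steps on the Iris dataset:

from sklearn.datasets import load_iris
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA

X = load_iris().data
X_scaled = StandardScaler().fit_transform(X)  # step 1: scale the features

pca = PCA(n_components=2)                     # step 2: compute components
X_reduced = pca.fit_transform(X_scaled)
print(pca.explained_variance_ratio_)          # step 3: variance retained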

15. Naive Bayes


Based On: Bayes’ theorem.
Assumption: Features are conditionally independent.
Variants:
• Gaussian Naive Bayes: Handles continuous data assumed to follow a normal distribution.

• Multinomial Naive Bayes: Used for text classification.
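A minimal Gaussian Naive Bayes sketch on the Iris dataset:

from sklearn.datasets import load_iris
from sklearn.naive_bayes import GaussianNB

X, y = load_iris(return_X_y=True)
# Assumes each feature is conditionally independent and normally distributed
clf = GaussianNB().fit(X, y)
print(clf.predict(X[:3]))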

16. Decision Tree


Leaf Node: A terminal node where no further splits occur; represents a
prediction.
Gini Impurity: Measures how mixed the classes at a node are; lower values indicate purer nodes.
Pruning: Reduces tree depth to prevent overfitting and reduce variance.
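Gini impurity for a node is 1 − Σ pᵢ², where pᵢ is the fraction of samples in class i. A minimal sketch using max_depth as a simple pruning control:

from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
# criterion="gini" splits by impurity; max_depth caps depth to limit variance
tree = DecisionTreeClassifier(criterion="gini", max_depth=3).fit(X, y)
print(tree.get_depth())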

17. Support Vector Machines (SVM)


Parameters:
• C: Controls the tradeoff between bias and variance.

• Gamma: Determines the influence of individual data points.


Kernels:
• Linear Kernel: Suitable for linearly separable data.

• RBF Kernel: Effective for non-linear data by mapping to higher dimensions.
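A minimal sketch showing where C, gamma, and the kernel are set:

from sklearn.datasets import load_iris
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
# Small C tolerates margin violations (higher bias); large gamma lets each
# point influence only its close neighborhood (higher variance)
svm = SVC(kernel="rbf", C=1.0, gamma="scale").fit(X, y)
print(svm.score(X, y))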

18. Regularization
Purpose: Reduces overfitting by adding a penalty term to the model’s loss
function.
Types:

• L1 (Lasso): Shrinks some coefficients to zero (feature selection).

• L2 (Ridge): Shrinks coefficients without setting them to zero.

You might also like