Machine Learning Lab Assignment: Instructions

Uploaded by

Kanik Sharma
Machine Learning Lab Assignment

Author: Lab Author
Date: 2024-11-25

Instructions

This lab assignment consists of multiple tasks aimed at applying and understanding
concepts from the provided slides. Complete the tasks using Python and libraries such
as NumPy, pandas, matplotlib, and scikit-learn. Include plots, metrics, and
explanations for your findings. Submit your code and a report summarizing your
results.

Task 1: Debugging Regularized Linear Regression

Objective: Explore the effects of regularization in linear regression and debug common
issues.
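
For reference when choosing values of λ, the objective that scikit-learn documents for its Ridge estimator is shown below; its `alpha` parameter plays the role of λ. The slides may scale the penalty differently (for example by a factor involving the number of examples), which is an assumption worth checking because it changes which numeric values of λ are comparable.

```latex
\min_{w} \; \lVert y - Xw \rVert_2^2 + \alpha \, \lVert w \rVert_2^2
```

When the estimator fits an intercept, the intercept is not included in the penalty term; only the weights w_j are shrunk.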

Steps:

1. Dataset Preparation:

◦ Create a synthetic dataset with features X and target y using a polynomial
function (e.g., y = 3x^2 + 2x + 1 + ε, where ε is Gaussian noise).

◦ Split the dataset into 70% training and 30% test sets.

2. Regularized Linear Regression:

◦ Implement or use Ridge Regression (from scikit-learn) to train the model.

◦ Experiment with different values of the regularization parameter λ (e.g., λ
= 0, 0.1, 1, 10, 100).

◦ Plot the training error and test error as a function of λ.

3. Analysis:

◦ Identify underfitting and overfitting regions from the plot.

◦ Discuss how λ affects the weights w_j and the model’s ability to generalize.
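
One possible starting point for the steps above is sketched here. The sample size, x-range, noise level, polynomial degree, and random seed are all assumptions made for illustration; the assignment does not fix them. The sketch expands x into polynomial features so that regularization has several weights to act on, and it prints the errors rather than plotting them.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler

# Assumed settings (not given in the assignment): 200 samples, x in [-3, 3],
# noise standard deviation 2.0, degree-6 features, seed 0.
rng = np.random.default_rng(0)
x = rng.uniform(-3.0, 3.0, size=(200, 1))
y = 3 * x[:, 0] ** 2 + 2 * x[:, 0] + 1 + rng.normal(0.0, 2.0, size=200)

# 70% training / 30% test split.
x_train, x_test, y_train, y_test = train_test_split(
    x, y, test_size=0.3, random_state=0
)

lambdas = [0, 0.1, 1, 10, 100]
train_errors, test_errors = [], []
for lam in lambdas:
    # scikit-learn calls the regularization strength `alpha`.
    model = make_pipeline(
        PolynomialFeatures(degree=6, include_bias=False),
        StandardScaler(),
        Ridge(alpha=lam),
    )
    model.fit(x_train, y_train)
    train_errors.append(mean_squared_error(y_train, model.predict(x_train)))
    test_errors.append(mean_squared_error(y_test, model.predict(x_test)))

for lam, tr, te in zip(lambdas, train_errors, test_errors):
    print(f"lambda={lam:>5}: train MSE={tr:.3f}, test MSE={te:.3f}")
```

The two error lists can be passed to matplotlib to produce the required plot; a logarithmic x-axis is usually easier to read, with λ = 0 handled separately since it has no logarithm.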

Task 2: Training, Validation, and Test Set Splits

Objective: Understand the importance of data splits in model evaluation.

Steps:

1. Dataset Splitting:

◦ Use the dataset provided in the slides:

▪ Size (sq ft): [2104, 1600, 2400, 1416, 3000, 1985, 1534, 1427, 1380, 1494]

▪ Price (k$): [400, 330, 369, 232, 540, 300, 315, 199, 212, 243]

◦ Split the dataset into:

▪ 60% training set

▪ 20% validation set

▪ 20% test set

◦ Display the resulting splits.

2. Linear Regression Model:

◦ Train a linear regression model on the training set.

◦ Compute the Mean Squared Error (MSE) on the training, validation, and test
sets.

◦ Compare the errors across the three subsets.

3. Analysis:

◦ Discuss why the validation set is critical for model selection.

◦ Explain the significance of keeping the test set separate from training and
validation.
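
The split and evaluation steps above could be sketched as follows. The random seed and the use of two successive calls to `train_test_split` are choices made for this illustration, not requirements of the assignment. With only ten examples, each held-out set contains two points, so the reported errors will vary considerably with the seed.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

size = np.array([2104, 1600, 2400, 1416, 3000, 1985, 1534, 1427, 1380, 1494],
                dtype=float)
price = np.array([400, 330, 369, 232, 540, 300, 315, 199, 212, 243],
                 dtype=float)
X = size.reshape(-1, 1)  # scikit-learn expects a 2-D feature array

# Hold out 2 of 10 examples (20%) for testing, then 2 more (20%) for validation.
# The seed is an arbitrary assumption.
X_rest, X_test, y_rest, y_test = train_test_split(
    X, price, test_size=2, random_state=0
)
X_train, X_val, y_train, y_val = train_test_split(
    X_rest, y_rest, test_size=2, random_state=0
)

splits = {
    "train": (X_train, y_train),
    "validation": (X_val, y_val),
    "test": (X_test, y_test),
}

# Display the resulting splits.
for name, (Xs, ys) in splits.items():
    print(f"{name}: size={Xs.ravel().tolist()} price={ys.tolist()}")

# Fit on the training set only, then score every subset.
model = LinearRegression().fit(X_train, y_train)
mse = {
    name: mean_squared_error(ys, model.predict(Xs))
    for name, (Xs, ys) in splits.items()
}
for name, value in mse.items():
    print(f"{name} MSE: {value:.1f}")
```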

Task 3: Model Selection with Polynomial Regression

Objective: Select the best polynomial model for a given dataset using validation error.

Steps:

1. Dataset Preparation:

◦ Use the same dataset from Task 2 or generate a synthetic dataset with
non-linear patterns.

2. Polynomial Regression:

◦ Train polynomial regression models of degree d = 1, 2, ..., 10.

◦ Compute the training error and validation error for each degree d.

3. Visualization:

◦ Plot the training error and validation error as a function of d.

◦ Identify the degree that minimizes the validation error.

4. Analysis:

◦ Discuss the concepts of underfitting and overfitting based on the plot.

◦ Justify the importance of validation error in choosing the polynomial
degree.
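
A minimal sketch of the degree sweep described above, using the synthetic-dataset option. The target function, sample size, noise level, split ratio, and seed are assumptions chosen for illustration. Features are standardized after expansion to keep the high-degree fits numerically stable.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler

# Assumed non-linear relationship and noise level (not from the assignment).
rng = np.random.default_rng(1)
x = rng.uniform(-2.0, 2.0, size=(120, 1))
y = np.sin(2 * x[:, 0]) + 0.5 * x[:, 0] + rng.normal(0.0, 0.2, size=120)

x_train, x_val, y_train, y_val = train_test_split(
    x, y, test_size=0.25, random_state=1
)

degrees = list(range(1, 11))
train_errors, val_errors = [], []
for d in degrees:
    model = make_pipeline(
        PolynomialFeatures(degree=d, include_bias=False),
        StandardScaler(),
        LinearRegression(),
    )
    model.fit(x_train, y_train)
    train_errors.append(mean_squared_error(y_train, model.predict(x_train)))
    val_errors.append(mean_squared_error(y_val, model.predict(x_val)))

# Model selection uses the validation error, never the training error.
best_degree = degrees[int(np.argmin(val_errors))]

for d, tr, va in zip(degrees, train_errors, val_errors):
    print(f"d={d:>2}: train MSE={tr:.4f}, validation MSE={va:.4f}")
print("degree with lowest validation error:", best_degree)
```

Because each higher-degree model contains the lower-degree ones, the training error can only stay flat or fall as d grows, which is why it cannot be used to pick d.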

Task 4: Effect of Regularization on Bias-Variance Tradeoff

Objective: Examine the impact of regularization on bias and variance.

Steps:

1. Synthetic Dataset:

◦ Generate a dataset with 100 examples and a true relationship y = 2x + 3 + ε,
where ε is Gaussian noise.

2. Ridge Regression:

◦ Train Ridge Regression models with λ = 0, 0.01, 0.1, 1, 10, 100.

◦ Compute the training error, validation error, and weights w_j for each λ.

3. Analysis:

◦ Plot the errors as a function of λ.

◦ Plot the magnitude of weights (|w_j|) as a function of λ.

◦ Discuss how increasing λ affects the bias-variance tradeoff.
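
The sweep above might be set up as in the following sketch. The x-range, noise standard deviation, 60/40 train/validation split, and seed are assumptions; the assignment specifies only the sample size and the true relationship. With a single feature there is one weight, so |w_j| reduces to a single value per λ.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# 100 examples from y = 2x + 3 + noise; range, noise level, seed are assumed.
rng = np.random.default_rng(2)
x = rng.uniform(0.0, 10.0, size=(100, 1))
y = 2 * x[:, 0] + 3 + rng.normal(0.0, 1.0, size=100)

x_train, x_val, y_train, y_val = train_test_split(
    x, y, test_size=0.4, random_state=2
)

lambdas = [0, 0.01, 0.1, 1, 10, 100]
rows = []
for lam in lambdas:
    model = Ridge(alpha=lam).fit(x_train, y_train)
    rows.append(
        (
            lam,
            mean_squared_error(y_train, model.predict(x_train)),
            mean_squared_error(y_val, model.predict(x_val)),
            abs(model.coef_[0]),  # magnitude of the single weight
        )
    )

for lam, tr, va, w in rows:
    print(f"lambda={lam:>6}: train MSE={tr:.4f}, val MSE={va:.4f}, |w|={w:.4f}")
```

Since the penalty only shrinks the weight toward zero, |w| decreases as λ grows while the training error rises; how the validation error responds depends on the noise and is the point of the analysis.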

Task 5: Neural Network Model Selection

Objective: Choose the best neural network architecture based on cross-validation error.

Steps:

1. Dataset Preparation:

◦ Create a synthetic dataset with multiple input features and a non-linear
target relationship.

2. Neural Network Architectures:

◦ Define three neural network architectures:

▪ Architecture 1: 25 input units → 15 hidden units → 1 output unit

▪ Architecture 2: 20 input units → 12, 12 hidden units → 1 output unit

▪ Architecture 3: 32 input units → 16, 8, 4 hidden units → 1 output unit

3. Training and Validation:

◦ Train each architecture on the training set and evaluate on the validation
set.

◦ Compute the training and validation errors for each architecture.

4. Analysis:

◦ Select the architecture with the lowest validation error.

◦ Discuss why it is important to choose the architecture based on validation
error and not training error.
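
The comparison above could be sketched with scikit-learn's `MLPRegressor`, as below. Several things here are assumptions: the dataset (5 features, 600 examples, an invented target function), the split ratio, the iteration limit, and the seeds. `MLPRegressor` takes its input width from the data, so the sketch uses only the hidden-layer sizes from the three architectures and does not model the listed input-unit counts; a framework such as Keras or PyTorch would be needed to add an explicit first layer of a given width.

```python
import numpy as np
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Assumed synthetic data: 5 input features, non-linear target, small noise.
rng = np.random.default_rng(3)
X = rng.normal(size=(600, 5))
y = (
    np.sin(X[:, 0])
    + X[:, 1] * X[:, 2]
    + 0.5 * X[:, 3] ** 2
    + rng.normal(0.0, 0.1, size=600)
)

X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, random_state=3
)

# Hidden-layer sizes taken from the task; input-unit counts are not modelled.
architectures = {
    "Architecture 1": (15,),
    "Architecture 2": (12, 12),
    "Architecture 3": (16, 8, 4),
}

errors = {}
for name, hidden in architectures.items():
    net = make_pipeline(
        StandardScaler(),
        MLPRegressor(hidden_layer_sizes=hidden, max_iter=2000, random_state=0),
    )
    net.fit(X_train, y_train)
    errors[name] = (
        mean_squared_error(y_train, net.predict(X_train)),
        mean_squared_error(y_val, net.predict(X_val)),
    )

# Choose by validation error (second entry of each tuple).
best = min(errors, key=lambda n: errors[n][1])
for name, (tr, va) in errors.items():
    print(f"{name}: train MSE={tr:.4f}, validation MSE={va:.4f}")
print("selected:", best)
```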

Task 6: Real-World Debugging

Objective: Debug and improve a poorly performing machine learning model.

Steps:

1. Scenario:

◦ You are given a model with the following errors:

▪ Training Error: 10

▪ Validation Error: 40

▪ Test Error: 42

◦ The large gap between training and validation error indicates overfitting.

2. Debugging:

◦ Suggest three corrective actions to address overfitting (e.g., regularization,
adding more data, reducing model complexity).

3. Implementation:

◦ Implement one of these actions (e.g., increase λ) and re-train the model.

◦ Compute the new training, validation, and test errors.

4. Analysis:

◦ Compare the errors before and after applying the corrective action.

◦ Discuss the effectiveness of your approach.
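
Since the scenario gives only error values and no model or data, the sketch below builds a hypothetical stand-in: a degree-10 polynomial Ridge model fitted to a small noisy dataset with a very small λ, which is then re-trained with a larger λ. The data, degree, split, and both λ values are assumptions, and the resulting numbers will not match the 10 / 40 / 42 figures in the scenario. The helper `fit_and_score` is a name introduced here for illustration.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler

# Hypothetical small, noisy dataset that a flexible model can overfit.
rng = np.random.default_rng(4)
x = rng.uniform(-1.0, 1.0, size=(60, 1))
y = np.sin(3 * x[:, 0]) + rng.normal(0.0, 0.3, size=60)

# 50% train, 25% validation, 25% test (an assumed split).
x_train, x_rest, y_train, y_rest = train_test_split(
    x, y, test_size=0.5, random_state=4
)
x_val, x_test, y_val, y_test = train_test_split(
    x_rest, y_rest, test_size=0.5, random_state=4
)


def fit_and_score(lam):
    """Train a degree-10 polynomial Ridge model and return MSE per subset."""
    model = make_pipeline(
        PolynomialFeatures(degree=10, include_bias=False),
        StandardScaler(),
        Ridge(alpha=lam),
    )
    model.fit(x_train, y_train)
    subsets = {
        "train": (x_train, y_train),
        "validation": (x_val, y_val),
        "test": (x_test, y_test),
    }
    return {
        name: mean_squared_error(ys, model.predict(xs))
        for name, (xs, ys) in subsets.items()
    }


before = fit_and_score(1e-4)  # weak regularization
after = fit_and_score(1.0)    # corrective action: increase lambda

for name in ("train", "validation", "test"):
    print(f"{name}: before={before[name]:.4f}, after={after[name]:.4f}")
```

Increasing λ is expected to raise the training error; whether it lowers the validation and test errors is what the before/after comparison is meant to establish, and the test set should be consulted only once the choice of λ has been made on the validation set.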

Submission Instructions

1. Submit your Python code in a Jupyter Notebook or Python script.

2. Include a PDF report summarizing:

◦ Key results (tables, plots, metrics, etc.).

◦ Explanations and analyses for each task.

3. Ensure all plots are labeled and interpretations are included.

4. Submit your work by the deadline.
